A genomic data mining pipeline for 15 species of the genus Olea

Constantinos Salis; Eleni Papakonstantinou; Katerina Pierouli; Athanasios Mitsis; Lia Basdeki; Vasileios Megalooikonomou; Dimitrios Vlachakis; Marianna Hagidimitriou

doi:10.14806/ej.24.0.922

Authors

Constantinos Salis Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens
Eleni Papakonstantinou Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens
Katerina Pierouli Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens
Athanasios Mitsis Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens
Lia Basdeki Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens
Vasileios Megalooikonomou Computer Engineering and Informatics Department, School of Engineering, University of Patras
Dimitrios Vlachakis Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens; Lab of Molecular Endocrinology, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens; Faculty of Natural and Mathematical Sciences, King's College London (UK)
Marianna Hagidimitriou Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens

DOI:

https://doi.org/10.14806/ej.24.0.922

Keywords:

data mining, semantics, Olea, pipeline, clustering

Abstract

In the big data era, conventional bioinformatics seems to fail in managing the full extent of the available genomic information. The current study is focused on olive tree species and the collection and analysis of genetic and genomic data, which are fragmented in various depositories. Extra virgin olive oil is classified as a medical food, due to nutraceutical benefits and its protective properties against cancer, cardiovascular diseases, age-related diseases, neurodegenerative disorders, and many other diseases. Extensive studies have reported the benefits of olive oil on human health. However, available data at the nucleotide sequence level are highly unstructured. Towards this aim, we describe an in-silico approach that combines methods from data mining and machine learning pipelines to ontology classification and semantic annotation. Fusing and analysing all available olive tree data is a step of uttermost importance in classifying and characterising the various cultivars, towards a comprehensive approach under the context of food safety and public health.

A genomic data mining pipeline for 15 species of the genus Olea

Authors

DOI:

Keywords:

Abstract

Downloads

Additional Files

Published

Issue

Section

License

Language

Developed By

Information