High-throughput functional annotation and data mining with the Blast2GO suite.

Title: High-throughput functional annotation and data mining with the Blast2GO suite.
Publication Type: Journal Article
Year of Publication: 2008
Authors: Götz, S; García-Gómez, J Miguel; Terol, J; Williams, TD; Nagaraj, SH; Nueda, M José; Robles, M; Talon, M; Dopazo, J; Conesa, A
Journal: Nucleic Acids Res
Volume: 36
Issue: 10
Pagination: 3420-35
Date Published: 2008 Jun
ISSN: 1362-4962
Keywords: Animals; Computational Biology; Computer Graphics; Databases, Genetic; Expressed Sequence Tags; Genes; Genomics; Sequence Analysis, DNA; Sequence Analysis, Protein; Software; Vocabulary, Controlled
Abstract

Functional genomics technologies have been widely adopted in the biological research of both model and non-model species. An efficient functional annotation of DNA or protein sequences is a major requirement for the successful application of these approaches, as functional information on gene products is often the key to the interpretation of experimental results. Therefore, there is an increasing need for bioinformatics resources that can cope with large amounts of sequence data, produce valuable annotation results and are easily accessible to laboratories where functional genomics projects are being undertaken. We present the Blast2GO suite as an integrated and biologist-oriented solution for the high-throughput and automatic functional annotation of DNA or protein sequences based on the Gene Ontology vocabulary. The most outstanding Blast2GO features are: (i) the combination of various annotation strategies and tools controlling the type and intensity of annotation, (ii) the numerous graphical features such as the interactive GO-graph visualization for gene-set function profiling or descriptive charts, (iii) the general sequence management features and (iv) high-throughput capabilities. We used the Blast2GO framework to carry out a detailed analysis of annotation behaviour through homology transfer and its impact on functional genomics research. Our aim is to offer biologists useful information to take into account when addressing the task of functionally characterizing their sequence data.

DOI: 10.1093/nar/gkn176
Alternate Journal: Nucleic Acids Res
PubMed ID: 18445632
PubMed Central ID: PMC2425479