TY - CHAP T1 - Comparative genomics-based prediction of protein function T2 - Methods in Molecular Biology Y1 - 2008 A1 - Gabaldón, T. JF - Methods in Molecular Biology PB - M. Starkey and R. Elaswarapu, Humana press VL - 439 UR - http://www.springerprotocols.com/Abstract/doi/10.1007/978-1-59745-188-8_26 ER - TY - CHAP T1 - The core of a minimal gene set: insights from natural reduced genomes T2 - Protocells: Bridging nonliving and living matter Y1 - 2008 A1 - Gabaldón, T. A1 - Gil, R. A1 - Peretó, J. A1 - Latorre, A. A1 - Moya, A. JF - Protocells: Bridging nonliving and living matter PB - The MIT Press CY - USA ER - TY - JOUR T1 - PhylomeDB: a database for genome-wide collections of gene phylogenies JF - Nucleic Acids Res Y1 - 2008 A1 - Huerta-Cepas, J. A1 - Bueno, A. A1 - Dopazo, J. A1 - Gabaldón, T. KW - Ancient Humans *Phylogeny Proteins/classification/genetics Saccharomyces cerevisiae/classification/genetics Sequence Alignment KW - Base Sequence Escherichia coli/classification/genetics Genes *Genomics History AB - The complete collection of evolutionary histories of all genes in a genome, also known as phylome, constitutes a valuable source of information. The reconstruction of phylomes has been previously prevented by large demands of time and computer power, but is now feasible thanks to recent developments in computers and algorithms. To provide a publicly available repository of complete phylomes that allows researchers to access and store large-scale phylogenomic analyses, we have developed PhylomeDB. PhylomeDB is a database of complete phylomes derived for different genomes within a specific taxonomic range. All phylomes in the database are built using a high-quality phylogenetic pipeline that includes evolutionary model testing and alignment trimming phases. For each genome, PhylomeDB provides the alignments, phylogentic trees and tree-based orthology predictions for every single encoded protein. The current version of PhylomeDB includes the phylomes of Human, the yeast Saccharomyces cerevisiae and the bacterium Escherichia coli, comprising a total of 32 289 seed sequences with their corresponding alignments and 172 324 phylogenetic trees. PhylomeDB can be publicly accessed at http://phylomedb.bioinfo.cipf.es. VL - 36 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17962297 N1 - Huerta-Cepas, Jaime Bueno, Anibal Dopazo, Joaquin Gabaldon, Toni Historical Article Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2008 Jan;36(Database issue):D491-6. Epub 2007 Oct 25. ER - TY - JOUR T1 - From endosymbiont to host-controlled organelle: the hijacking of mitochondrial protein synthesis and metabolism JF - PLoS Comput Biol Y1 - 2007 A1 - Gabaldón, T. A1 - M. A. Huynen KW - Computer Simulation DNA Mutational Analysis/methods Evolution *Evolution KW - Genetic Organelles/physiology Protein Biosynthesis/*genetics Symbiosis/*genetics KW - Molecular Fungal Proteins/*physiology Genetic Variation/genetics Humans Mitochondria/*physiology Mitochondrial Proteins/*physiology *Models AB - Mitochondria are eukaryotic organelles that originated from the endosymbiosis of an alpha-proteobacterium. To gain insight into the evolution of the mitochondrial proteome as it proceeded through the transition from a free-living cell to a specialized organelle, we compared a reconstructed ancestral proteome of the mitochondrion with the proteomes of alpha-proteobacteria as well as with the mitochondrial proteomes in yeast and man. Overall, there has been a large turnover of the mitochondrial proteome during the evolution of mitochondria. Early in the evolution of the mitochondrion, proteins involved in cell envelope synthesis have virtually disappeared, whereas proteins involved in replication, transcription, cell division, transport, regulation, and signal transduction have been replaced by eukaryotic proteins. More than half of what remains from the mitochondrial ancestor in modern mitochondria corresponds to translation, including post-translational modifications, and to metabolic pathways that are directly, or indirectly, involved in energy conversion. Altogether, the results indicate that the eukaryotic host has hijacked the proto-mitochondrion, taking control of its protein synthesis and metabolism. VL - 3 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17983265 N1 - Gabaldon, Toni Huynen, Martijn A Research Support, Non-U.S. Gov’t United States PLoS computational biology PLoS Comput Biol. 2007 Nov;3(11):e219. Epub 2007 Sep 26. ER - TY - JOUR T1 - The human phylome JF - Genome Biol Y1 - 2007 A1 - Huerta-Cepas, J. A1 - H. Dopazo A1 - Dopazo, J. A1 - Gabaldón, T. KW - Animals *Evolution Evolution KW - DNA KW - Molecular Gene Duplication *Genome Humans *Phylogeny Proteins/genetics Sequence Analysis AB - BACKGROUND: Phylogenomics analyses serve to establish evolutionary relationships among organisms and their genes. A phylome, the complete collection of all gene phylogenies in a genome, constitutes a valuable source of information, but its use in large genomes still constitutes a technical challenge. The use of phylomes also requires the development of new methods that help us to interpret them. RESULTS: We reconstruct here the human phylome, which includes the evolutionary relationships of all human proteins and their homologs among 39 fully sequenced eukaryotes. Phylogenetic techniques used include alignment trimming, branch length optimization, evolutionary model testing and maximum likelihood and Bayesian methods. Although differences with alternative topologies are minor, most of the trees support the Coelomata and Unikont hypotheses as well as the grouping of primates with laurasatheria to the exclusion of rodents. We assess the extent of gene duplication events and their relationship with the functional roles of the protein families involved. We find support for at least one, and probably two, rounds of whole genome duplications before vertebrate radiation. Using a novel algorithm that is independent from a species phylogeny, we derive orthology and paralogy relationships of human proteins among eukaryotic genomes. CONCLUSION: Topological variations among phylogenies for different genes are to be expected, highlighting the danger of gene-sampling effects in phylogenomic analyses. Several links can be established between the functions of gene families duplicated at certain phylogenetic splits and major evolutionary transitions in those lineages. The pipeline implemented here can be easily adapted for use in other organisms. VL - 8 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17567924 N1 - Huerta-Cepas, Jaime Dopazo, Hernan Dopazo, Joaquin Gabaldon, Toni Research Support, Non-U.S. Gov’t England Genome biology Genome Biol. 2007;8(6):R109. ER - TY - JOUR T1 - PeroxisomeDB: a database for the peroxisomal proteome, functional genomics and disease JF - Nucleic Acids Res Y1 - 2007 A1 - Schluter, A. A1 - Fourcade, S. A1 - Domenech-Estevez, E. A1 - Gabaldón, T. A1 - Huerta-Cepas, J. A1 - Berthommier, G. A1 - Ripp, R. A1 - Wanders, R. J. A1 - Poch, O. A1 - Pujol, A. KW - Animals *Databases KW - Protein Genomics Humans Internet Mice Peroxisomal Disorders/*genetics Peroxisomes/*metabolism Protein Sorting Signals Proteome/chemistry/*genetics/*physiology Rats Saccharomyces cerevisiae Proteins/genetics/physiology Software User-Computer Interface AB - Peroxisomes are essential organelles of eukaryotic origin, ubiquitously distributed in cells and organisms, playing key roles in lipid and antioxidant metabolism. Loss or malfunction of peroxisomes causes more than 20 fatal inherited conditions. We have created a peroxisomal database (http://www.peroxisomeDB.org) that includes the complete peroxisomal proteome of Homo sapiens and Saccharomyces cerevisiae, by gathering, updating and integrating the available genetic and functional information on peroxisomal genes. PeroxisomeDB is structured in interrelated sections ’Genes’, ’Functions’, ’Metabolic pathways’ and ’Diseases’, that include hyperlinks to selected features of NCBI, ENSEMBL and UCSC databases. We have designed graphical depictions of the main peroxisomal metabolic routes and have included updated flow charts for diagnosis. Precomputed BLAST, PSI-BLAST, multiple sequence alignment (MUSCLE) and phylogenetic trees are provided to assist in direct multispecies comparison to study evolutionary conserved functions and pathways. Highlights of the PeroxisomeDB include new tools developed for facilitating (i) identification of novel peroxisomal proteins, by means of identifying proteins carrying peroxisome targeting signal (PTS) motifs, (ii) detection of peroxisomes in silico, particularly useful for screening the deluge of newly sequenced genomes. PeroxisomeDB should contribute to the systematic characterization of the peroxisomal proteome and facilitate system biology approaches on the organelle. VL - 35 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17135190 N1 - Schluter, Agatha Fourcade, Stephane Domenech-Estevez, Enric Gabaldon, Toni Huerta-Cepas, Jaime Berthommier, Guillaume Ripp, Raymond Wanders, Ronald J A Poch, Olivier Pujol, Aurora Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jan;35(Database issue):D815-22. Epub 2006 Nov 28. ER - TY - JOUR T1 - Phylemon: a suite of web tools for molecular evolution, phylogenetics and phylogenomics JF - Nucleic Acids Res Y1 - 2007 A1 - Tarraga, J. A1 - Medina, Ignacio A1 - Arbiza, L. A1 - Huerta-Cepas, J. A1 - Gabaldón, T. A1 - Dopazo, J. A1 - H. Dopazo KW - Animals Computational Biology/*methods Databases KW - DNA Sequence Analysis KW - Genetic Evolution KW - Molecular Genetic Techniques Humans *Internet Models KW - Protein Software User-Computer Interface KW - Statistical *Phylogeny Programming Languages Sequence Alignment Sequence Analysis AB - Phylemon is an online platform for phylogenetic and evolutionary analyses of molecular sequence data. It has been developed as a web server that integrates a suite of different tools selected among the most popular stand-alone programs in phylogenetic and evolutionary analysis. It has been conceived as a natural response to the increasing demand of data analysis of many experimental scientists wishing to add a molecular evolution and phylogenetics insight into their research. Tools included in Phylemon cover a wide yet selected range of programs: from the most basic for multiple sequence alignment to elaborate statistical methods of phylogenetic reconstruction including methods for evolutionary rates analyses and molecular adaptation. Phylemon has several features that differentiates it from other resources: (i) It offers an integrated environment that enables the direct concatenation of evolutionary analyses, the storage of results and handles required data format conversions, (ii) Once an outfile is produced, Phylemon suggests the next possible analyses, thus guiding the user and facilitating the integration of multi-step analyses, and (iii) users can define and save complete pipelines for specific phylogenetic analysis to be automatically used on many genes in subsequent sessions or multiple genes in a single session (phylogenomics). The Phylemon web server is available at http://phylemon.bioinfo.cipf.es. VL - 35 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17452346 N1 - Tarraga, Joaquin Medina, Ignacio Arbiza, Leonardo Huerta-Cepas, Jaime Gabaldon, Toni Dopazo, Joaquin Dopazo, Hernan Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jul;35(Web Server issue):W38-42. Epub 2007 Apr 22. ER - TY - CHAP T1 - Reconstruction of ancestral proteomes T2 - Ancestral Sequence Reconstruction Y1 - 2007 A1 - Gabaldón, T. A1 - M. A. Huynen JF - Ancestral Sequence Reconstruction PB - D. Liberles CY - Oxford UR - http://www.us.oup.com/us/catalog/general/subject/LifeSciences/EvolutionaryBiology/?view=usa&ci=9780199299188 ER - TY - JOUR T1 - Structural analyses of a hypothetical minimal metabolism JF - Philos Trans R Soc Lond B Biol Sci Y1 - 2007 A1 - Gabaldón, T. A1 - Peretó, J. A1 - Montero, F. A1 - Gil, R. A1 - Latorre, A. A1 - Moya, A. KW - *Cell Physiological Phenomena Cells/*metabolism Cluster Analysis *Computer Simulation *Metabolic Networks and Pathways *Models KW - Biological Models KW - Statistical AB - By integrating data from comparative genomics and large-scale deletion studies, we previously proposed a minimal gene set comprising 206 protein-coding genes. To evaluate the consistency of the metabolism encoded by such a minimal genome, we have carried out a series of computational analyses. Firstly, the topology of the minimal metabolism was compared with that of the reconstructed networks from natural bacterial genomes. Secondly, the robustness of the metabolic network was evaluated by simulated mutagenesis and, finally, the stoichiometric consistency was assessed by automatically deriving the steady-state solutions from the reaction set. The results indicated that the proposed minimal metabolism presents stoichiometric consistency and that it is organized as a complex power-law network with topological parameters falling within the expected range for a natural metabolism of its size. The robustness analyses revealed that most random mutations do not alter the topology of the network significantly, but do cause significant damage by preventing the synthesis of several compounds or compromising the stoichiometric consistency of the metabolism. The implications that these results have on the origins of metabolic complexity and the theoretical design of an artificial minimal cell are discussed. VL - 362 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17510022 N1 - Gabaldon, Toni Pereto, Juli Montero, Francisco Gil, Rosario Latorre, Amparo Moya, Andres Research Support, Non-U.S. Gov’t England Philosophical transactions of the Royal Society of London. Series B, Biological sciences Philos Trans R Soc Lond B Biol Sci. 2007 Oct 29;362(1486):1751-62. ER - TY - JOUR T1 - Computational approaches for the prediction of protein function in the mitochondrion JF - Am J Physiol Cell Physiol Y1 - 2006 A1 - Gabaldón, T. KW - *Computational Biology *Computer Simulation Humans Mitochondria/*metabolism Mitochondrial Proteins/genetics/*metabolism Mutation AB - Understanding a complex biological system, such as the mitochondrion, requires the identification of the complete repertoire of proteins targeted to the organelle, the characterization of these, and finally, the elucidation of the functional and physical interactions that occur within the mitochondrion. In the last decade, significant developments have contributed to increase our understanding of the mitochondrion, and among these, computational research has played a significant role. Not only general bioinformatics tools have been applied in the context of the mitochondrion, but also some computational techniques have been specifically developed to address problems that arose from within the mitochondrial research field. In this review the contribution of bioinformatics to mitochondrial biology is addressed through a survey of current computational methods that can be applied to predict which proteins will be localized to the mitochondrion and to unravel their functional interactions. VL - 291 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16870830 N1 - Gabaldon, Toni Research Support, Non-U.S. Gov’t Review United States American journal of physiology. Cell physiology Am J Physiol Cell Physiol. 2006 Dec;291(6):C1121-8. Epub 2006 Jul 26. ER - TY - JOUR T1 - Origin and evolution of the peroxisomal proteome JF - Biol Direct Y1 - 2006 A1 - Gabaldón, T. A1 - B. Snel A1 - van Zimmeren, F. A1 - Hemrika, W. A1 - Tabak, H. A1 - M. A. Huynen AB - BACKGROUND: Peroxisomes are ubiquitous eukaryotic organelles involved in various oxidative reactions. Their enzymatic content varies between species, but the presence of common protein import and organelle biogenesis systems support a single evolutionary origin. The precise scenario for this origin remains however to be established. The ability of peroxisomes to divide and import proteins post-translationally, just like mitochondria and chloroplasts, supports an endosymbiotic origin. However, this view has been challenged by recent discoveries that mutant, peroxisome-less cells restore peroxisomes upon introduction of the wild-type gene, and that peroxisomes are formed from the Endoplasmic Reticulum. The lack of a peroxisomal genome precludes the use of classical analyses, as those performed with mitochondria or chloroplasts, to settle the debate. We therefore conducted large-scale phylogenetic analyses of the yeast and rat peroxisomal proteomes. RESULTS : Our results show that most peroxisomal proteins (39-58%) are of eukaryotic origin, comprising all proteins involved in organelle biogenesis or maintenance. A significant fraction (13-18%), consisting mainly of enzymes, has an alpha-proteobacterial origin and appears to be the result of the recruitment of proteins originally targeted to mitochondria. Consistent with the findings that peroxisomes are formed in the Endoplasmic Reticulum, we find that the most universally conserved Peroxisome biogenesis and maintenance proteins are homologous to proteins from the Endoplasmic Reticulum Assisted Decay pathway. CONCLUSION: Altogether our results indicate that the peroxisome does not have an endosymbiotic origin and that its proteins were recruited from pools existing within the primitive eukaryote. Moreover the reconstruction of primitive peroxisomal proteomes suggests that ontogenetically as well as phylogenetically, peroxisomes stem from the Endoplasmic Reticulum. REVIEWERS: This article was reviewed by Arcady Mushegian, Gaspar Jekely and John Logsdon. OPEN PEER REVIEW: Reviewed by Arcady Mushegian, Gaspar Jekely and John Logsdon. For the full reviews, please go to the Reviewers’ comments section. VL - 1 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16556314 N1 - Gabaldon, Toni Snel, Berend van Zimmeren, Frank Hemrika, Wieger Tabak, Henk Huynen, Martijn A England Biology direct Biol Direct. 2006 Mar 23;1:8. ER - TY - JOUR T1 - An anaerobic mitochondrion that produces hydrogen JF - Nature Y1 - 2005 A1 - Boxma, B. A1 - de Graaf, R. M. A1 - van der Staay, G. W. A1 - van Alen, T. A. A1 - Ricard, G. A1 - Gabaldón, T. A1 - van Hoek, A. H. A1 - Moon-van der Staay, S. Y. A1 - Koopman, W. J. A1 - van Hellemond, J. J. A1 - Tielens, A. G. A1 - Friedrich, T. A1 - Veenhuis, M. A1 - M. A. Huynen A1 - Hackstein, J. H. KW - *Anaerobiosis Animals Ciliophora/*cytology/genetics/*metabolism/ultrastructure Cockroaches/parasitology DNA KW - Mitochondrial/genetics Electron Transport Electron Transport Complex I/antagonists & inhibitors/metabolism Genome Glucose/metabolism Hydrogen/*metabolism Mitochondria/enzymology/genetics/*metabolism/ultrastructure Molecular Sequence Data Open Reading Fra AB - Hydrogenosomes are organelles that produce ATP and hydrogen, and are found in various unrelated eukaryotes, such as anaerobic flagellates, chytridiomycete fungi and ciliates. Although all of these organelles generate hydrogen, the hydrogenosomes from these organisms are structurally and metabolically quite different, just like mitochondria where large differences also exist. These differences have led to a continuing debate about the evolutionary origin of hydrogenosomes. Here we show that the hydrogenosomes of the anaerobic ciliate Nyctotherus ovalis, which thrives in the hindgut of cockroaches, have retained a rudimentary genome encoding components of a mitochondrial electron transport chain. Phylogenetic analyses reveal that those proteins cluster with their homologues from aerobic ciliates. In addition, several nucleus-encoded components of the mitochondrial proteome, such as pyruvate dehydrogenase and complex II, were identified. The N. ovalis hydrogenosome is sensitive to inhibitors of mitochondrial complex I and produces succinate as a major metabolic end product–biochemical traits typical of anaerobic mitochondria. The production of hydrogen, together with the presence of a genome encoding respiratory chain components, and biochemical features characteristic of anaerobic mitochondria, identify the N. ovalis organelle as a missing link between mitochondria and hydrogenosomes. VL - 434 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15744302 N1 - Boxma, Brigitte de Graaf, Rob M van der Staay, Georg W M van Alen, Theo A Ricard, Guenola Gabaldon, Toni van Hoek, Angela H A M Moon-van der Staay, Seung Yeo Koopman, Werner J H van Hellemond, Jaap J Tielens, Aloysius G M Friedrich, Thorsten Veenhuis, Marten Huynen, Martijn A Hackstein, Johannes H P Research Support, Non-U.S. Gov’t England Nature Nature. 2005 Mar 3;434(7029):74-9. ER - TY - JOUR T1 - Combining data from genomes, Y2H and 3D structure indicates that BolA is a reductase interacting with a glutaredoxin JF - FEBS Lett Y1 - 2005 A1 - M. A. Huynen A1 - Spronk, C. A. A1 - Gabaldón, T. A1 - B. Snel KW - *Genome Glutaredoxins Models KW - Molecular Oxidoreductases/chemistry/*metabolism Phylogeny Protein Conformation AB - Genomes, functional genomics data and 3D structure reflect different aspects of protein function. Here, we combine these data to predict that BolA, a widely distributed protein family with unknown function, is a reductase that interacts with a glutaredoxin. Comparisons at the 3D structure level as well as at the sequence profile level indicate homology between BolA and OsmC, an enzyme that reduces organic peroxides. Complementary to this, comparative analyses of genomes and genomics data provide strong evidence of an interaction between BolA and the mono-thiol glutaredoxin family. The interaction between BolA and a mono-thiol glutaredoxin is of particular interest because BolA does not, in contrast to its homolog OsmC, have evolutionarily conserved cysteines to provide it with reducing equivalents. We propose that BolA uses the mono-thiol glutaredoxin as the source for these. VL - 579 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15670813 N1 - Huynen, Martijn A Spronk, Chris A E M Gabaldon, Toni Snel, Berend Research Support, Non-U.S. Gov’t Netherlands FEBS letters FEBS Lett. 2005 Jan 31;579(3):591-6. ER - TY - JOUR T1 - Evolution of proteins and proteomes: a phylogenetics approach JF - Evol Bioinform Online Y1 - 2005 A1 - Gabaldón, T. AB - The study of evolutionary relationships among protein sequences was one of the first applications of bioinformatics. Since then, and accompanying the wealth of biological data produced by genome sequencing and other high-throughput techniques, the use of bioinformatics in general and phylogenetics in particular has been gaining ground in the study of protein and proteome evolution. Nowadays, the use of phylogenetics is instrumental not only to infer the evolutionary relationships among species and their genome sequences, but also to reconstruct ancestral states of proteins and proteomes and hence trace the paths followed by evolution. Here I survey recent progress in the elucidation of mechanisms of protein and proteome evolution in which phylogenetics has played a determinant role. VL - 1 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=19325853 N1 - Gabaldon, Toni New Zealand Evolutionary bioinformatics online Evol Bioinform Online. 2005;1:51-61. ER - TY - JOUR T1 - Lineage-specific gene loss following mitochondrial endosymbiosis and its potential for function prediction in eukaryotes JF - Bioinformatics Y1 - 2005 A1 - Gabaldón, T. A1 - M. A. Huynen KW - Animals Chromosome Mapping/*methods DNA KW - Mitochondrial/*genetics *Evolution KW - Molecular *Gene Deletion Genetic Variation/genetics Humans Linkage Disequilibrium/*genetics Mitochondrial Proteins/*genetics Sequence Homology KW - Nucleic Acid Species Specificity Symbiosis/*genetics AB - MOTIVATION: The endosymbiotic origin of mitochondria has resulted in a massive horizontal transfer of genetic material from an alpha-proteobacterium to the early eukaryotes. Using large-scale phylogenetic analysis we have previously identified 630 orthologous groups of proteins derived from this event. Here we show that this proto-mitochondrial protein set has undergone extensive lineage-specific gene loss in the eukaryotes, with an average of three losses per orthologous group in a phylogeny of nine species. This gene loss has resulted in a high variability of the alphaproteobacterial-derived gene content of present-day eukaryotic genomes that might reflect functional adaptation to different environments. Proteins functioning in the same biochemical pathway tend to have a similar history of gene loss events, and we use this property to predict functional interactions among proteins in our set. VL - 21 Suppl 2 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16204094 N1 - Gabaldon, Toni Huynen, Martijn A Research Support, Non-U.S. Gov’t England Bioinformatics (Oxford, England) Bioinformatics. 2005 Sep 1;21 Suppl 2:ii144-50. ER - TY - JOUR T1 - Tracing the evolution of a large protein complex in the eukaryotes, NADH:ubiquinone oxidoreductase (Complex I) JF - J Mol Biol Y1 - 2005 A1 - Gabaldón, T. A1 - Rainey, D. A1 - M. A. Huynen KW - Amino Acid Sequence Animals Computational Biology Electron Transport Complex I/*chemistry/*genetics/metabolism Eukaryotic Cells/*enzymology *Evolution KW - Molecular Humans Molecular Sequence Data Photosynthesis Phylogeny Plastids/enzymology Protein Binding Protein Subunits/chemistry/genetics/metabolism Sequence Alignment Structural Homology KW - Protein AB - The increasing availability of sequenced genomes enables the reconstruction of the evolutionary history of large protein complexes. Here, we trace the evolution of NADH:ubiquinone oxidoreductase (Complex I), which has increased in size, by so-called supernumary subunits, from 14 subunits in the bacteria to 30 in the plants and algae, 37 in the fungi and 46 in the mammals. Using a combination of pair-wise and profile-based sequence comparisons at the levels of proteins and the DNA of the sequenced eukaryotic genomes, combined with phylogenetic analyses to establish orthology relationships, we were able to (1) trace the origin of six of the supernumerary subunits to the alpha-proteobacterial ancestor of the mitochondria, (2) detect previously unidentified homology relations between subunits from fungi and mammals, (3) detect previously unidentified subunits in the genomes of several species and (4) document several cases of gene duplications among supernumerary subunits in the eukaryotes. One of these, a duplication of N7BM (B17.2), is particularly interesting as it has been lost from genomes that have also lost Complex I proteins, making it a candidate for a Complex I interacting protein. A parsimonious reconstruction of eukaryotic Complex I evolution shows an initial increase in size that predates the separation of plants, fungi and metazoa, followed by a gradual adding and incidental losses of subunits in the various evolutionary lineages. This evolutionary scenario is in contrast to that for Complex I in the prokaryotes, for which the combination of several separate, and previously independently functioning modules into a single complex has been proposed. VL - 348 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15843018 N1 - Gabaldon, Toni Rainey, Daphne Huynen, Martijn A Research Support, Non-U.S. Gov’t England Journal of molecular biology J Mol Biol. 2005 May 13;348(4):857-70. ER - TY - JOUR T1 - Variation and evolution of biomolecular systems: searching for functional relevance JF - FEBS Lett Y1 - 2005 A1 - M. A. Huynen A1 - Gabaldón, T. A1 - B. Snel KW - *Evolution KW - Molecular Genetic Variation Multiprotein Complexes/*genetics Phylogeny Protein Binding/genetics AB - The availability of genome sequences and functional genomics data from multiple species enables us to compare the composition of biomolecular systems like biochemical pathways and protein complexes between species. Here, we review small- and large-scale, "genomics-based" approaches to biomolecular systems variation. In general, caution is required when comparing the results of bioinformatics analyses of genomes or of functional genomics data between species. Limitations to the sensitivity of sequence analysis tools and the noisy nature of genomics data tend to lead to systematic overestimates of the amount of variation. Nevertheless, the results from detailed manual analyses, and of large-scale analyses that filter out systematic biases, point to a large amount of variation in the composition of biomolecular systems. Such observations challenge our understanding of the function of the systems and their individual components and can potentially facilitate the identification and functional characterization of sub-systems within a system. Mapping the inter-species variation of complex biomolecular systems on a phylogenetic species tree allows one to reconstruct their evolution. VL - 579 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15763561 N1 - Huynen, Martijn A Gabaldon, Toni Snel, Berend Review Netherlands FEBS letters FEBS Lett. 2005 Mar 21;579(8):1839-45. ER - TY - JOUR T1 - Perceptions about postdocs JF - EMBO Rep Y1 - 2004 A1 - Vella, F. A1 - Mietchen, D. A1 - Gabaldón, T. KW - Europe *Fellowships and Scholarships *Research Personnel VL - 5 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15577920 N1 - Vella, Francis Mietchen, Daniel Gabaldon, Toni Eurodoc Council Comment Letter England EMBO reports EMBO Rep. 2004 Dec;5(12):1104. ER - TY - JOUR T1 - Prediction of protein function and pathways in the genome era JF - Cell Mol Life Sci Y1 - 2004 A1 - Gabaldón, T. A1 - M. A. Huynen KW - ATP-Binding Cassette Transporters/genetics/metabolism Amino Acid Sequence Animals Artificial Gene Fusion Base Sequence Chaperonins/genetics/metabolism Chromosomes/genetics/metabolism Evolution KW - Molecular *Genome Genomics Humans Molecular Sequence Data Phylogeny *Proteins/classification/genetics/metabolism RNA KW - Ribosomal/metabolism Sequence Alignment AB - The growing number of completely sequenced genomes adds new dimensions to the use of sequence analysis to predict protein function. Compared with the classical knowledge transfer from one protein to a similar sequence (homology-based function prediction), knowledge about the corresponding genes in other genomes (orthology-based function prediction) provides more specific information about the protein’s function, while the analysis of the sequence in its genomic context (context-based function prediction) provides information about its functional context. Whereas homology-based methods predict the molecular function of a protein, genomic context methods predict the biological process in which it plays a role. These complementary approaches can be combined to elucidate complete functional networks and biochemical pathways from the genome sequence of an organism. Here we review recent advances in the field of genomic-context based methods of protein function prediction. Techniques are highlighted with examples, including an analysis that combines information from genomic-context with homology to predict a role of the RNase L inhibitor in the maturation of ribosomal RNA. VL - 61 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15095013 N1 - Gabaldon, T Huynen, M A Review Switzerland Cellular and molecular life sciences : CMLS Cell Mol Life Sci. 2004 Apr;61(7-8):930-44. ER - TY - JOUR T1 - Shaping the mitochondrial proteome JF - Biochim Biophys Acta Y1 - 2004 A1 - Gabaldón, T. A1 - M. A. Huynen KW - Animals Biological Transport Energy Metabolism Eukaryotic Cells/physiology *Evolution Humans Mitochondria/*physiology Phylogeny Proteome/*physiology AB - Mitochondria are eukaryotic organelles that originated from a single bacterial endosymbiosis some 2 billion years ago. The transition from the ancestral endosymbiont to the modern mitochondrion has been accompanied by major changes in its protein content, the so-called proteome. These changes included complete loss of some bacterial pathways, amelioration of others and gain of completely new complexes of eukaryotic origin such as the ATP/ADP translocase and most of the mitochondrial protein import machinery. This renewal of proteins has been so extensive that only 14-16% of modern mitochondrial proteome has an origin that can be traced back to the bacterial endosymbiont. The rest consists of proteins of diverse origin that were eventually recruited to function in the organelle. This shaping of the proteome content reflects the transformation of mitochondria into a highly specialized organelle that, besides ATP production, comprises a variety of functions within the eukaryotic metabolism. Here we review recent advances in the fields of comparative genomics and proteomics that are throwing light on the origin and evolution of the mitochondrial proteome. VL - 1659 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15576054 N1 - Gabaldon, Toni Huynen, Martijn A Research Support, Non-U.S. Gov’t Review Netherlands Biochimica et biophysica acta Biochim Biophys Acta. 2004 Dec 6;1659(2-3):212-20. ER - TY - JOUR T1 - Reconstruction of the proto-mitochondrial metabolism JF - Science Y1 - 2003 A1 - Gabaldón, T. A1 - M. A. Huynen KW - Aerobiosis Algorithms Alphaproteobacteria/chemistry/genetics/*metabolism Amino Acids/metabolism Animals Bacterial Proteins/chemistry/*metabolism Genome Genome KW - Bacterial Glycerol/metabolism Humans Lipid Metabolism Mitochondria/chemistry/genetics/*metabolism Phylogeny *Proteome Symbiosis Yeasts/metabolism VL - 301 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=12893934 N1 - Gabaldon, Toni Huynen, Martijn A Comparative Study Research Support, Non-U.S. Gov’t United States Science (New York, N.Y.) Science. 2003 Aug 1;301(5633):609. ER -