<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">E. Capriotti</style></author><author><style face="normal" font="default" size="100%">Arbiza, L.</style></author><author><style face="normal" font="default" size="100%">Casadio, R.</style></author><author><style face="normal" font="default" size="100%">Dopazo, J.</style></author><author><style face="normal" font="default" size="100%">H. Dopazo</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans</style></title><secondary-title><style face="normal" font="default" size="100%">Hum Mutat</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms Codon/genetics Computational Biology/*methods *DNA Mutational Analysis Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">Human Humans Iduronic Acid/analogs &amp; derivatives/metabolism *Point Mutation Polymorphism</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular *Genetic Predisposition to Disease Genetic Variation Genome</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein *Evolution</style></keyword><keyword><style  face="normal" font="default" size="100%">Single Nucleotide Proteins/chemistry/*genetics Tumor Suppressor Protein p53/genetics</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2008</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=17935148</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">1</style></number><volume><style face="normal" font="default" size="100%">29</style></volume><pages><style face="normal" font="default" size="100%">198-204</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Predicting the functional impact of protein variation is one of the most challenging problems in bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data, allowing the application of rigorous statistical approaches for predicting whether a given single point mutation has an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Here we introduce a new method (SeqProfCod) to predict the likelihood that a given protein variant is associated with human disease or not. Our method relies on a support vector machine (SVM) classifier trained using three sources of information: protein sequence, multiple protein sequence alignments, and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 8,987 single point mutations from 1,434 human proteins from SWISS-PROT. It achieves 82% overall accuracy and a correlation coefficient of 0.59, indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Moreover, this study demonstrates the synergic effect of combining two sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and the evolutionary estimation of the selective pressures at the codon level. The results of large-scale application of SeqProfCod over all annotated point mutations in SWISS-PROT (available for download at http://sgu.bioinfo.cipf.es/services/Omidios/; last accessed: 24 August 2007), could be used to support clinical studies.</style></abstract><notes><style face="normal" font="default" size="100%">Capriotti, Emidio Arbiza, Leonardo Casadio, Rita Dopazo, Joaquin Dopazo, Hernan Marti-Renom, Marc A Evaluation Studies Research Support, Non-U.S. Gov’t United States Human mutation Hum Mutat. 2008 Jan;29(1):198-204.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Espadaler, J.</style></author><author><style face="normal" font="default" size="100%">Aragues, R.</style></author><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Querol, E.</style></author><author><style face="normal" font="default" size="100%">Aviles, F. X.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author><author><style face="normal" font="default" size="100%">Oliva, B.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Detecting remotely related proteins by their interactions and sequence similarity</style></title><secondary-title><style face="normal" font="default" size="100%">Proc Natl Acad Sci U S A</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Amino Acid</style></keyword><keyword><style  face="normal" font="default" size="100%">Computational Biology Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Protein Conformation Protein Folding Proteins/*genetics/*metabolism Proteomics/*methods *Sequence Homology</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein *Evolution</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2005</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=15883372</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">20</style></number><volume><style face="normal" font="default" size="100%">102</style></volume><pages><style face="normal" font="default" size="100%">7151-6</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">The function of an uncharacterized protein is usually inferred either from its homology to, or its interactions with, characterized proteins. Here, we use both sequence similarity and protein interactions to identify relationships between remotely related protein sequences. We rely on the fact that homologous sequences share similar interactions, and, therefore, the set of interacting partners of the partners of a given protein is enriched by its homologs. The approach was bench-marked by assigning the fold and functional family to test sequences of known structure. Specifically, we relied on 1,434 proteins with known folds, as defined in the Structural Classification of Proteins (SCOP) database, and with known interacting partners, as defined in the Database of Interacting Proteins (DIP). For this subset, the specificity of fold assignment was increased from 54% for position-specific iterative BLAST to 75% for our approach, with a concomitant increase in sensitivity for a few percentage points. Similarly, the specificity of family assignment at the e-value threshold of 10(-8) was increased from 70% to 87%. The proposed method would be a useful tool for large-scale automated discovery of remote relationships between protein sequences, given its unique reliance on sequence similarity and protein-protein interactions.</style></abstract><notes><style face="normal" font="default" size="100%">Espadaler, Jordi Aragues, Ramon Eswar, Narayanan Marti-Renom, Marc A Querol, Enrique Aviles, Francesc X Sali, Andrej Oliva, Baldomero R01 GM54762/GM/NIGMS NIH HHS/United States Comparative Study Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, P.H.S. United States Proceedings of the National Academy of Sciences of the United States of America Proc Natl Acad Sci U S A. 2005 May 17;102(20):7151-6. Epub 2005 May 9.</style></notes></record></records></xml>