<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Capriotti, Emidio</style></author><author><style face="normal" font="default" size="100%">Arbiza, Leonardo</style></author><author><style face="normal" font="default" size="100%">Casadio, Rita</style></author><author><style face="normal" font="default" size="100%">Dopazo, Joaquin</style></author><author><style face="normal" font="default" size="100%">Dopazo, Hernán</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans.</style></title><secondary-title><style face="normal" font="default" size="100%">Hum Mutat</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Hum Mutat</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms</style></keyword><keyword><style  face="normal" font="default" size="100%">Codon</style></keyword><keyword><style  face="normal" font="default" size="100%">Computational Biology</style></keyword><keyword><style  face="normal" font="default" size="100%">Databases, Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">DNA Mutational Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Evolution, Molecular</style></keyword><keyword><style  face="normal" font="default" size="100%">Genetic Predisposition to Disease</style></keyword><keyword><style  face="normal" font="default" size="100%">Genetic Variation</style></keyword><keyword><style  face="normal" font="default" size="100%">Genome, Human</style></keyword><keyword><style  face="normal" font="default" size="100%">Humans</style></keyword><keyword><style  face="normal" font="default" size="100%">Iduronic Acid</style></keyword><keyword><style  face="normal" font="default" size="100%">Point Mutation</style></keyword><keyword><style  face="normal" font="default" size="100%">Polymorphism, Single Nucleotide</style></keyword><keyword><style  face="normal" font="default" size="100%">Proteins</style></keyword><keyword><style  face="normal" font="default" size="100%">Tumor Suppressor Protein p53</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2008</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2008 Jan</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">29</style></volume><pages><style face="normal" font="default" size="100%">198-204</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;Predicting the functional impact of protein variation is one of the most challenging problems in bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data, allowing the application of rigorous statistical approaches for predicting whether a given single point mutation has an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Here we introduce a new method (SeqProfCod) to predict the likelihood that a given protein variant is associated with human disease or not. Our method relies on a support vector machine (SVM) classifier trained using three sources of information: protein sequence, multiple protein sequence alignments, and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 8,987 single point mutations from 1,434 human proteins from SWISS-PROT. It achieves 82% overall accuracy and a correlation coefficient of 0.59, indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Moreover, this study demonstrates the synergic effect of combining two sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and the evolutionary estimation of the selective pressures at the codon level. The results of large-scale application of SeqProfCod over all annotated point mutations in SWISS-PROT (available for download at http://sgu.bioinfo.cipf.es/services/Omidios/; last accessed: 24 August 2007), could be used to support clinical studies.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">1</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/17935148?dopt=Abstract</style></custom1></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author><author><style face="normal" font="default" size="100%">Pieper, Ursula</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M S</style></author><author><style face="normal" font="default" size="100%">Rossi, Andrea</style></author><author><style face="normal" font="default" size="100%">Eswar, Narayanan</style></author><author><style face="normal" font="default" size="100%">Davis, Fred P</style></author><author><style face="normal" font="default" size="100%">Al-Shahrour, Fátima</style></author><author><style face="normal" font="default" size="100%">Dopazo, Joaquin</style></author><author><style face="normal" font="default" size="100%">Sali, Andrej</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">DBAli tools: mining the protein structure space.</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms</style></keyword><keyword><style  face="normal" font="default" size="100%">Amino Acid Sequence</style></keyword><keyword><style  face="normal" font="default" size="100%">Computational Biology</style></keyword><keyword><style  face="normal" font="default" size="100%">Data Interpretation, Statistical</style></keyword><keyword><style  face="normal" font="default" size="100%">Databases, Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">Internet</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Sequence Data</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Conformation</style></keyword><keyword><style  face="normal" font="default" size="100%">Proteins</style></keyword><keyword><style  face="normal" font="default" size="100%">Pseudomonas aeruginosa</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Alignment</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Analysis, Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Homology, Amino Acid</style></keyword><keyword><style  face="normal" font="default" size="100%">Software</style></keyword><keyword><style  face="normal" font="default" size="100%">Structure-Activity Relationship</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2007</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2007 Jul</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">35</style></volume><pages><style face="normal" font="default" size="100%">W393-7</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools include (i) the DBAlit program that allows users to input the 3D coordinates of a protein structure for comparison by MAMMOTH against all chains in the PDB; (ii) the AnnoLite and AnnoLyze programs that annotate a target structure based on its stored relationships to other structures; (iii) the ModClus program that clusters structures by sequence and structure similarities; (iv) the ModDom program that identifies domains as recurrent structural fragments and (v) an implementation of the COMPARER method in the SALIGN command in MODELLER that creates a multiple structure alignment for a set of related protein structures. Thus, the DBAli tools, which are freely accessible via the World Wide Web at http://salilab.org/DBAli/, allow users to mine the protein structure space by establishing relationships between protein structures and their functions.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">Web Server issue</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/17478513?dopt=Abstract</style></custom1></record></records></xml>