TY - JOUR T1 - Drug-target identification in COVID-19 disease mechanisms using computational systems biology approaches. JF - Front Immunol Y1 - 2024 A1 - Niarakis, Anna A1 - Ostaszewski, Marek A1 - Mazein, Alexander A1 - Kuperstein, Inna A1 - Kutmon, Martina A1 - Gillespie, Marc E A1 - Funahashi, Akira A1 - Acencio, Marcio Luis A1 - Hemedan, Ahmed A1 - Aichem, Michael A1 - Klein, Karsten A1 - Czauderna, Tobias A1 - Burtscher, Felicia A1 - Yamada, Takahiro G A1 - Hiki, Yusuke A1 - Hiroi, Noriko F A1 - Hu, Finterly A1 - Pham, Nhung A1 - Ehrhart, Friederike A1 - Willighagen, Egon L A1 - Valdeolivas, Alberto A1 - Dugourd, Aurélien A1 - Messina, Francesco A1 - Esteban-Medina, Marina A1 - Peña-Chilet, Maria A1 - Rian, Kinza A1 - Soliman, Sylvain A1 - Aghamiri, Sara Sadat A1 - Puniya, Bhanwar Lal A1 - Naldi, Aurélien A1 - Helikar, Tomáš A1 - Singh, Vidisha A1 - Fernández, Marco Fariñas A1 - Bermudez, Viviam A1 - Tsirvouli, Eirini A1 - Montagud, Arnau A1 - Noël, Vincent A1 - Ponce-de-Leon, Miguel A1 - Maier, Dieter A1 - Bauch, Angela A1 - Gyori, Benjamin M A1 - Bachman, John A A1 - Luna, Augustin A1 - Piñero, Janet A1 - Furlong, Laura I A1 - Balaur, Irina A1 - Rougny, Adrien A1 - Jarosz, Yohan A1 - Overall, Rupert W A1 - Phair, Robert A1 - Perfetto, Livia A1 - Matthews, Lisa A1 - Rex, Devasahayam Arokia Balaya A1 - Orlic-Milacic, Marija A1 - Gomez, Luis Cristobal Monraz A1 - De Meulder, Bertrand A1 - Ravel, Jean Marie A1 - Jassal, Bijay A1 - Satagopam, Venkata A1 - Wu, Guanming A1 - Golebiewski, Martin A1 - Gawron, Piotr A1 - Calzone, Laurence A1 - Beckmann, Jacques S A1 - Evelo, Chris T A1 - D'Eustachio, Peter A1 - Schreiber, Falk A1 - Saez-Rodriguez, Julio A1 - Dopazo, Joaquin A1 - Kuiper, Martin A1 - Valencia, Alfonso A1 - Wolkenhauer, Olaf A1 - Kitano, Hiroaki A1 - Barillot, Emmanuel A1 - Auffray, Charles A1 - Balling, Rudi A1 - Schneider, Reinhard KW - Computer Simulation KW - COVID-19 KW - drug repositioning KW - Humans KW - SARS-CoV-2 KW - Systems biology AB -

INTRODUCTION: The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing.

METHODS: Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors.

RESULTS: Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19.

DISCUSSION: The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies.

VL - 14 ER - TY - BOOK T1 - Cell-Level Pathway Scoring Comparison with a Biologically Constrained Variational Autoencoder T2 - Lecture Notes in Computer Science. Computational Methods in Systems Biology Y1 - 2023 A1 - Gundogdu, Pelin A1 - Payá-Milans, Miriam A1 - Alamo-Alvarez, Inmaculada A1 - Nepomuceno-Chamorro, Isabel A. A1 - Dopazo, Joaquin A1 - Loucera, Carlos JF - Lecture Notes in Computer Science. Computational Methods in Systems Biology PB - Springer Nature Switzerland CY - Cham VL - 14137 SN - 978-3-031-42696-4 UR - https://link.springer.com/chapter/10.1007/978-3-031-42697-1_5 ER - TY - JOUR T1 - A Comprehensive Analysis of 21 Actionable Pharmacogenes in the Spanish Population: From Genetic Characterisation to Clinical Impact. JF - Pharmaceutics Y1 - 2023 A1 - Núñez-Torres, Rocío A1 - Pita, Guillermo A1 - Peña-Chilet, Maria A1 - López-López, Daniel A1 - Zamora, Jorge A1 - Roldán, Gema A1 - Herráez, Belén A1 - Alvarez, Nuria A1 - Alonso, María Rosario A1 - Dopazo, Joaquin A1 - González-Neira, Anna AB -

The implementation of pharmacogenetics (PGx) is a main milestones of precision medicine nowadays in order to achieve safer and more effective therapies. Nevertheless, the implementation of PGx diagnostics is extremely slow and unequal worldwide, in part due to a lack of ethnic PGx information. We analysed genetic data from 3006 Spanish individuals obtained by different high-throughput (HT) techniques. Allele frequencies were determined in our population for the main 21 actionable PGx genes associated with therapeutical changes. We found that 98% of the Spanish population harbours at least one allele associated with a therapeutical change and, thus, there would be a need for a therapeutical change in a mean of 3.31 of the 64 associated drugs. We also identified 326 putative deleterious variants that were not previously related with PGx in 18 out of the 21 main PGx genes evaluated and a total of 7122 putative deleterious variants for the 1045 PGx genes described. Additionally, we performed a comparison of the main HT diagnostic techniques, revealing that after whole genome sequencing, genotyping with the PGx HT array is the most suitable solution for PGx diagnostics. Finally, all this information was integrated in the Collaborative Spanish Variant Server to be available to and updated by the scientific community.

VL - 15 IS - 4 ER - TY - JOUR T1 - A crowdsourcing database for the copy-number variation of the Spanish population. JF - Hum Genomics Y1 - 2023 A1 - López-López, Daniel A1 - Roldán, Gema A1 - Fernandez-Rueda, Jose L A1 - Bostelmann, Gerrit A1 - Carmona, Rosario A1 - Aquino, Virginia A1 - Perez-Florido, Javier A1 - Ortuno, Francisco A1 - Pita, Guillermo A1 - Núñez-Torres, Rocío A1 - González-Neira, Anna A1 - Peña-Chilet, Maria A1 - Dopazo, Joaquin AB -

BACKGROUND: Despite being a very common type of genetic variation, the distribution of copy-number variations (CNVs) in the population is still poorly understood. The knowledge of the genetic variability, especially at the level of the local population, is a critical factor for distinguishing pathogenic from non-pathogenic variation in the discovery of new disease variants.

RESULTS: Here, we present the SPAnish Copy Number Alterations Collaborative Server (SPACNACS), which currently contains copy number variation profiles obtained from more than 400 genomes and exomes of unrelated Spanish individuals. By means of a collaborative crowdsourcing effort whole genome and whole exome sequencing data, produced by local genomic projects and for other purposes, is continuously collected. Once checked both, the Spanish ancestry and the lack of kinship with other individuals in the SPACNACS, the CNVs are inferred for these sequences and they are used to populate the database. A web interface allows querying the database with different filters that include ICD10 upper categories. This allows discarding samples from the disease under study and obtaining pseudo-control CNV profiles from the local population. We also show here additional studies on the local impact of CNVs in some phenotypes and on pharmacogenomic variants. SPACNACS can be accessed at: http://csvs.clinbioinfosspa.es/spacnacs/ .

CONCLUSION: SPACNACS facilitates disease gene discovery by providing detailed information of the local variability of the population and exemplifies how to reuse genomic data produced for other purposes to build a local reference database.

VL - 17 IS - 1 ER - TY - JOUR T1 - Detection of High Level of Co-Infection and the Emergence of Novel SARS CoV-2 Delta-Omicron and Omicron-Omicron Recombinants in the Epidemiological Surveillance of Andalusia. JF - Int J Mol Sci Y1 - 2023 A1 - Perez-Florido, Javier A1 - Casimiro-Soriguer, Carlos S A1 - Ortuno, Francisco A1 - Fernandez-Rueda, Jose L A1 - Aguado, Andrea A1 - Lara, María A1 - Riazzo, Cristina A1 - Rodriguez-Iglesias, Manuel A A1 - Camacho-Martinez, Pedro A1 - Merino-Diaz, Laura A1 - Pupo-Ledo, Inmaculada A1 - de Salazar, Adolfo A1 - Viñuela, Laura A1 - Fuentes, Ana A1 - Chueca, Natalia A1 - García, Federico A1 - Dopazo, Joaquin A1 - Lepe, Jose A AB -

Recombination is an evolutionary strategy to quickly acquire new viral properties inherited from the parental lineages. The systematic survey of the SARS-CoV-2 genome sequences of the Andalusian genomic surveillance strategy has allowed the detection of an unexpectedly high number of co-infections, which constitute the ideal scenario for the emergence of new recombinants. Whole genome sequence of SARS-CoV-2 has been carried out as part of the genomic surveillance programme. Sample sources included the main hospitals in the Andalusia region. In addition to the increase of co-infections and known recombinants, three novel SARS-CoV-2 delta-omicron and omicron-omicron recombinant variants with two break points have been detected. Our observations document an epidemiological scenario in which co-infection and recombination are detected more frequently. Finally, we describe a family case in which co-infection is followed by the detection of a recombinant made from the two co-infecting variants. This increased number of recombinants raises the risk of emergence of recombinant variants with increased transmissibility and pathogenicity.

VL - 24 IS - 3 ER - TY - JOUR T1 - Evidence of the association between increased use of direct oral anticoagulants and a reduction in the rate of atrial fibrillation-related stroke and major bleeding at the population level (2012-2019). JF - Med Clin (Barc) Y1 - 2023 A1 - Loucera, Carlos A1 - Carmona, Rosario A1 - Bostelmann, Gerrit A1 - Muñoyerro-Muñiz, Dolores A1 - Villegas, Román A1 - Gonzalez-Manzanares, Rafael A1 - Dopazo, Joaquin A1 - Anguita, Manuel AB -

BACKGROUND: The introduction of direct-acting oral anticoagulants (DOACs) has shown to decrease atrial fibrillation (AF)-related stroke and bleeding rates in clinical studies, but there is no certain evidence about their effects at the population level. Our aim was to assess changes in AF-related stroke and major bleeding rates between 2012 and 2019 in Andalusia (Spain), and the association between DOACs use and events rates at the population level.

METHODS: All patients with an AF diagnosis from 2012 to 2019 were identified using the Andalusian Health Population Base, that provides clinical information on all Andalusian people. Annual ischemic and hemorrhagic stroke, major bleeding rates, and used antithrombotic treatments were determined. Marginal hazard ratios (HR) were calculated for each treatment.

RESULTS: A total of 95,085 patients with an AF diagnosis were identified. Mean age was 76.1±10.2 years (49.7% women). An increase in the use of DOACs was observed throughout the study period in both males and females (p<0.001). The annual rate of ischemic stroke decreased by one third, while that of hemorrhagic stroke and major bleeding decreased 2-3-fold from 2012 to 2019. Marginal HR was lower than 0.50 for DOACs compared to VKA for all ischemic or hemorrhagic events.

CONCLUSIONS: In this contemporary population-based study using clinical and administrative databases in Andalusia, a significant reduction in the incidence of AF-related ischemic and hemorrhagic stroke and major bleeding was observed between 2012 and 2019. The increased use of DOACs seems to be associated with this reduction.

ER - TY - JOUR T1 - SigPrimedNet: A Signaling-Informed Neural Network for scRNA-seq Annotation of Known and Unknown Cell Types. JF - Biology (Basel) Y1 - 2023 A1 - Gundogdu, Pelin A1 - Alamo, Inmaculada A1 - Nepomuceno-Chamorro, Isabel A A1 - Dopazo, Joaquin A1 - Loucera, Carlos AB -

Single-cell RNA sequencing is increasing our understanding of the behavior of complex tissues or organs, by providing unprecedented details on the complex cell type landscape at the level of individual cells. Cell type definition and functional annotation are key steps to understanding the molecular processes behind the underlying cellular communication machinery. However, the exponential growth of scRNA-seq data has made the task of manually annotating cells unfeasible, due not only to an unparalleled resolution of the technology but to an ever-increasing heterogeneity of the data. Many supervised and unsupervised methods have been proposed to automatically annotate cells. Supervised approaches for cell-type annotation outperform unsupervised methods except when new (unknown) cell types are present. Here, we introduce SigPrimedNet an artificial neural network approach that leverages (i) efficient training by means of a sparsity-inducing signaling circuits-informed layer, (ii) feature representation learning through supervised training, and (iii) unknown cell-type identification by fitting an anomaly detection method on the learned representation. We show that SigPrimedNet can efficiently annotate known cell types while keeping a low false-positive rate for unseen cells across a set of publicly available datasets. In addition, the learned representation acts as a proxy for signaling circuit activity measurements, which provide useful estimations of the cell functionalities.

VL - 12 IS - 4 ER - TY - JOUR T1 - Visualization of automatically combined disease maps and pathway diagrams for rare diseases. JF - Front Bioinform Y1 - 2023 A1 - Gawron, Piotr A1 - Hoksza, David A1 - Piñero, Janet A1 - Peña-Chilet, Maria A1 - Esteban-Medina, Marina A1 - Fernandez-Rueda, Jose Luis A1 - Colonna, Vincenza A1 - Smula, Ewa A1 - Heirendt, Laurent A1 - Ancien, François A1 - Grouès, Valentin A1 - Satagopam, Venkata P A1 - Schneider, Reinhard A1 - Dopazo, Joaquin A1 - Furlong, Laura I A1 - Ostaszewski, Marek AB -

Investigation of molecular mechanisms of human disorders, especially rare diseases, require exploration of various knowledge repositories for building precise hypotheses and complex data interpretation. Recently, increasingly more resources offer diagrammatic representation of such mechanisms, including disease-dedicated schematics in pathway databases and disease maps. However, collection of knowledge across them is challenging, especially for research projects with limited manpower. In this article we present an automated workflow for construction of maps of molecular mechanisms for rare diseases. The workflow requires a standardized definition of a disease using Orphanet or HPO identifiers to collect relevant genes and variants, and to assemble a functional, visual repository of related mechanisms, including data overlays. The diagrams composing the final map are unified to a common systems biology format from CellDesigner SBML, GPML and SBML+layout+render. The constructed resource contains disease-relevant genes and variants as data overlays for immediate visual exploration, including embedded genetic variant browser and protein structure viewer. We demonstrate the functionality of our workflow on two examples of rare diseases: Kawasaki disease and retinitis pigmentosa. Two maps are constructed based on their corresponding identifiers. Moreover, for the retinitis pigmentosa use-case, we include a list of differentially expressed genes to demonstrate how to tailor the workflow using omics datasets. In summary, our work allows for an ad-hoc construction of molecular diagrams combined from different sources, preserving their layout and graphical style, but integrating them into a single resource. This allows to reduce time consuming tasks of prototyping of a molecular disease map, enabling visual exploration, hypothesis building, data visualization and further refinement. The code of the workflow is open and accessible at https://gitlab.lcsb.uni.lu/minerva/automap/.

VL - 3 ER - TY - JOUR T1 - CIBERER: Spanish National Network for Research on Rare Diseases: a highly productive collaborative initiative. JF - Clin Genet Y1 - 2022 A1 - Luque, Juan A1 - Mendes, Ingrid A1 - Gómez, Beatriz A1 - Morte, Beatriz A1 - de Heredia, Miguel López A1 - Herreras, Enrique A1 - Corrochano, Virginia A1 - Bueren, Juan A1 - Gallano, Pia A1 - Artuch, Rafael A1 - Fillat, Cristina A1 - Pérez-Jurado, Luis A A1 - Montoliu, Lluis A1 - Carracedo, Ángel A1 - Millán, José M A1 - Webb, Susan M A1 - Palau, Francesc A1 - Lapunzina, Pablo AB -

CIBER (Center for Biomedical Network Research; Centro de Investigación Biomédica En Red) is a public national consortium created in 2006 under the umbrella of the Spanish National Institute of Health Carlos III (ISCIII). This innovative research structure comprises 11 different specific areas dedicated to the main public health priorities in the National Health System. CIBERER, the thematic area of CIBER focused on Rare Diseases currently consists of 75 research groups belonging to universities, research centers and hospitals of the entire country. CIBERER's mission is to be a center prioritizing and favoring collaboration and cooperation between biomedical and clinical research groups, with special emphasis on the aspects of genetic, molecular, biochemical and cellular research of rare diseases. This research is the basis for providing new tools for the diagnosis and therapy of low-prevalence diseases, in line with the International Rare Diseases Research Consortium (IRDiRC) objectives, thus favoring translational research between the scientific environment of the laboratory and the clinical setting of health centers. In this paper, we intend to review CIBERER's 15-year journey and summarize the main results obtained in terms of internationalization, scientific production, contributions towards the discovery of new therapies and novel genes associated to diseases, cooperation with patients' associations and many other topics related to rare disease research. This article is protected by copyright. All rights reserved.

ER - TY - JOUR T1 - Endoglin and MMP14 Contribute to Ewing Sarcoma Spreading by Modulation of Cell-Matrix Interactions. JF - Int J Mol Sci Y1 - 2022 A1 - Puerto-Camacho, Pilar A1 - Diaz-Martin, Juan A1 - Olmedo-Pelayo, Joaquín A1 - Bolado-Carrancio, Alfonso A1 - Salguero-Aranda, Carmen A1 - Jordán-Pérez, Carmen A1 - Esteban-Medina, Marina A1 - Alamo-Alvarez, Inmaculada A1 - Delgado-Bellido, Daniel A1 - Lobo-Selma, Laura A1 - Dopazo, Joaquin A1 - Sastre, Ana A1 - Alonso, Javier A1 - Grünewald, Thomas G P A1 - Bernabeu, Carmelo A1 - Byron, Adam A1 - Brunton, Valerie G A1 - Amaral, Ana Teresa A1 - de Alava, Enrique KW - Bone Neoplasms KW - Endoglin KW - Humans KW - Matrix Metalloproteinase 14 KW - Proteomics KW - Receptors, Growth Factor KW - Sarcoma, Ewing KW - Signal Transduction AB -

Endoglin (ENG) is a mesenchymal stem cell (MSC) marker typically expressed by active endothelium. This transmembrane glycoprotein is shed by matrix metalloproteinase 14 (MMP14). Our previous work demonstrated potent preclinical activity of first-in-class anti-ENG antibody-drug conjugates as a nascent strategy to eradicate Ewing sarcoma (ES), a devastating rare bone/soft tissue cancer with a putative MSC origin. We also defined a correlation between ENG and MMP14 expression in ES. Herein, we show that ENG expression is significantly associated with a dismal prognosis in a large cohort of ES patients. Moreover, both ENG/MMP14 are frequently expressed in primary ES tumors and metastasis. To deepen in their functional relevance in ES, we conducted transcriptomic and proteomic profiling of in vitro ES models that unveiled a key role of ENG and MMP14 in cell mechano-transduction. Migration and adhesion assays confirmed that loss of ENG disrupts actin filament assembly and filopodia formation, with a concomitant effect on cell spreading. Furthermore, we observed that ENG regulates cell-matrix interaction through activation of focal adhesion signaling and protein kinase C expression. In turn, loss of MMP14 contributed to a more adhesive phenotype of ES cells by modulating the transcriptional extracellular matrix dynamics. Overall, these results suggest that ENG and MMP14 exert a significant role in mediating correct spreading machinery of ES cells, impacting the aggressiveness of the disease.

VL - 23 IS - 15 ER - TY - JOUR T1 - Incidence and Prevalence of Children's Diffuse Lung Disease in Spain. JF - Arch Bronconeumol Y1 - 2022 A1 - Torrent-Vernetta, Alba A1 - Gaboli, Mirella A1 - Castillo-Corullón, Silvia A1 - Mondéjar-López, Pedro A1 - Sanz Santiago, Verónica A1 - Costa-Colomer, Jordi A1 - Osona, Borja A1 - Torres-Borrego, Javier A1 - de la Serna-Blázquez, Olga A1 - Bellón Alonso, Sara A1 - Caro Aguilera, Pilar A1 - Gimeno-Díaz de Atauri, Álvaro A1 - Valenzuela Soria, Alfredo A1 - Ayats, Roser A1 - Martin de Vicente, Carlos A1 - Velasco González, Valle A1 - Moure González, José Domingo A1 - Canino Calderín, Elisa María A1 - Pastor-Vivero, María Dolores A1 - Villar Álvarez, María Ángeles A1 - Rovira-Amigo, Sandra A1 - Iglesias Serrano, Ignacio A1 - Díez Izquierdo, Ana A1 - de Mir Messa, Inés A1 - Gartner, Silvia A1 - Navarro, Alexandra A1 - Baz-Redón, Noelia A1 - Carmona, Rosario A1 - Camats-Tarruella, Núria A1 - Fernández-Cancio, Mónica A1 - Rapp, Christina A1 - Dopazo, Joaquin A1 - Griese, Matthias A1 - Moreno-Galdó, Antonio AB -

BACKGROUND: Children's diffuse lung disease, also known as children's Interstitial Lung Diseases (chILD), are a heterogeneous group of rare diseases with relevant morbidity and mortality, which diagnosis and classification are very complex. Epidemiological data are scarce. The aim of this study was to analyse incidence and prevalence of chILD in Spain.

METHODS: Multicentre observational prospective study in patients from 0 to 18 years of age with chILD to analyse its incidence and prevalence in Spain, based on data reported in 2018 and 2019.

RESULTS: A total of 381 cases with chILD were notified from 51 paediatric pulmonology units all over Spain, covering the 91.7% of the paediatric population. The average incidence of chILD was 8.18 (CI 95% 6.28-10.48) new cases/million of children per year. The average prevalence of chILD was 46.53 (CI 95% 41.81-51.62) cases/million of children. The age group with the highest prevalence were children under 1 year of age. Different types of disorders were seen in children 2-18 years of age compared with children 0-2 years of age. Most frequent cases were: primary pulmonary interstitial glycogenosis in neonates (17/65), neuroendocrine cell hyperplasia of infancy in infants from 1 to 12 months (44/144), idiopathic pulmonary haemosiderosis in children from 1 to 5 years old (13/74), hypersensitivity pneumonitis in children from 5 to 10 years old (9/51), and scleroderma in older than 10 years old (8/47).

CONCLUSIONS: We found a higher incidence and prevalence of chILD than previously described probably due to greater understanding and increased clinician awareness of these rare diseases.

VL - 58 IS - 1 ER - TY - JOUR T1 - Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data. JF - BioData Min Y1 - 2022 A1 - Gundogdu, Pelin A1 - Loucera, Carlos A1 - Alamo-Alvarez, Inmaculada A1 - Dopazo, Joaquin A1 - Nepomuceno, Isabel AB -

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) data provide valuable insights into cellular heterogeneity which is significantly improving the current knowledge on biology and human disease. One of the main applications of scRNA-seq data analysis is the identification of new cell types and cell states. Deep neural networks (DNNs) are among the best methods to address this problem. However, this performance comes with the trade-off for a lack of interpretability in the results. In this work we propose an intelligible pathway-driven neural network to correctly solve cell-type related problems at single-cell resolution while providing a biologically meaningful representation of the data.

RESULTS: In this study, we explored the deep neural networks constrained by several types of prior biological information, e.g. signaling pathway information, as a way to reduce the dimensionality of the scRNA-seq data. We have tested the proposed biologically-based architectures on thousands of cells of human and mouse origin across a collection of public datasets in order to check the performance of the model. Specifically, we tested the architecture across different validation scenarios that try to mimic how unknown cell types are clustered by the DNN and how it correctly annotates cell types by querying a database in a retrieval problem. Moreover, our approach demonstrated to be comparable to other less interpretable DNN approaches constrained by using protein-protein interactions gene regulation data. Finally, we show how the latent structure learned by the network could be used to visualize and to interpret the composition of human single cell datasets.

CONCLUSIONS: Here we demonstrate how the integration of pathways, which convey fundamental information on functional relationships between genes, with DNNs, that provide an excellent classification framework, results in an excellent alternative to learn a biologically meaningful representation of scRNA-seq data. In addition, the introduction of prior biological knowledge in the DNN reduces the size of the network architecture. Comparative results demonstrate a superior performance of this approach with respect to other similar approaches. As an additional advantage, the use of pathways within the DNN structure enables easy interpretability of the results by connecting features to cell functionalities by means of the pathway nodes, as demonstrated with an example with human melanoma tumor cells.

VL - 15 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/34980200?dopt=Abstract ER - TY - JOUR T1 - Novel genes and sex differences in COVID-19 severity. JF - Hum Mol Genet Y1 - 2022 A1 - Cruz, Raquel A1 - Almeida, Silvia Diz-de A1 - Heredia, Miguel López A1 - Quintela, Inés A1 - Ceballos, Francisco C A1 - Pita, Guillermo A1 - Lorenzo-Salazar, José M A1 - González-Montelongo, Rafaela A1 - Gago-Domínguez, Manuela A1 - Porras, Marta Sevilla A1 - Castaño, Jair Antonio Tenorio A1 - Nevado, Julián A1 - Aguado, Jose María A1 - Aguilar, Carlos A1 - Aguilera-Albesa, Sergio A1 - Almadana, Virginia A1 - Almoguera, Berta A1 - Alvarez, Nuria A1 - Andreu-Bernabeu, Álvaro A1 - Arana-Arri, Eunate A1 - Arango, Celso A1 - Arranz, María J A1 - Artiga, Maria-Jesus A1 - Baptista-Rosas, Raúl C A1 - Barreda-Sánchez, María A1 - Belhassen-Garcia, Moncef A1 - Bezerra, Joao F A1 - Bezerra, Marcos A C A1 - Boix-Palop, Lucía A1 - Brión, Maria A1 - Brugada, Ramón A1 - Bustos, Matilde A1 - Calderón, Enrique J A1 - Carbonell, Cristina A1 - Castano, Luis A1 - Castelao, Jose E A1 - Conde-Vicente, Rosa A1 - Cordero-Lorenzana, M Lourdes A1 - Cortes-Sanchez, Jose L A1 - Corton, Marta A1 - Darnaude, M Teresa A1 - De Martino-Rodríguez, Alba A1 - Campo-Pérez, Victor A1 - Bustamante, Aranzazu Diaz A1 - Domínguez-Garrido, Elena A1 - Luchessi, André D A1 - Eirós, Rocío A1 - Sanabria, Gladys Mercedes Estigarribia A1 - Fariñas, María Carmen A1 - Fernández-Robelo, Uxía A1 - Fernández-Rodríguez, Amanda A1 - Fernández-Villa, Tania A1 - Gil-Fournier, Belén A1 - Gómez-Arrue, Javier A1 - Álvarez, Beatriz González A1 - Quirós, Fernan Gonzalez Bernaldo A1 - González-Peñas, Javier A1 - Gutiérrez-Bautista, Juan F A1 - Herrero, María José A1 - Herrero-Gonzalez, Antonio A1 - Jimenez-Sousa, María A A1 - Lattig, María Claudia A1 - Borja, Anabel Liger A1 - Lopez-Rodriguez, Rosario A1 - Mancebo, Esther A1 - Martín-López, Caridad A1 - Martín, Vicente A1 - Martinez-Nieto, Oscar A1 - Martinez-Lopez, Iciar A1 - Martinez-Resendez, Michel F A1 - Martinez-Perez, Ángel A1 - Mazzeu, Juliana A A1 - Macías, Eleuterio Merayo A1 - Minguez, Pablo A1 - Cuerda, Victor Moreno A1 - Silbiger, Vivian N A1 - Oliveira, Silviene F A1 - Ortega-Paino, Eva A1 - Parellada, Mara A1 - Paz-Artal, Estela A1 - Santos, Ney P C A1 - Pérez-Matute, Patricia A1 - Perez, Patricia A1 - Pérez-Tomás, M Elena A1 - Perucho, Teresa A1 - Pinsach-Abuin, Mel Lina A1 - Pompa-Mera, Ericka N A1 - Porras-Hurtado, Gloria L A1 - Pujol, Aurora A1 - León, Soraya Ramiro A1 - Resino, Salvador A1 - Fernandes, Marianne R A1 - Rodríguez-Ruiz, Emilio A1 - Rodriguez-Artalejo, Fernando A1 - Rodriguez-Garcia, José A A1 - Ruiz-Cabello, Francisco A1 - Ruiz-Hornillos, Javier A1 - Ryan, Pablo A1 - Soria, José Manuel A1 - Souto, Juan Carlos A1 - Tamayo, Eduardo A1 - Tamayo-Velasco, Alvaro A1 - Taracido-Fernandez, Juan Carlos A1 - Teper, Alejandro A1 - Torres-Tobar, Lilian A1 - Urioste, Miguel A1 - Valencia-Ramos, Juan A1 - Yáñez, Zuleima A1 - Zarate, Ruth A1 - Nakanishi, Tomoko A1 - Pigazzini, Sara A1 - Degenhardt, Frauke A1 - Butler-Laporte, Guillaume A1 - Maya-Miles, Douglas A1 - Bujanda, Luis A1 - Bouysran, Youssef A1 - Palom, Adriana A1 - Ellinghaus, David A1 - Martínez-Bueno, Manuel A1 - Rolker, Selina A1 - Amitrano, Sara A1 - Roade, Luisa A1 - Fava, Francesca A1 - Spinner, Christoph D A1 - Prati, Daniele A1 - Bernardo, David A1 - García, Federico A1 - Darcis, Gilles A1 - Fernández-Cadenas, Israel A1 - Holter, Jan Cato A1 - Banales, Jesus M A1 - Frithiof, Robert A1 - Duga, Stefano A1 - Asselta, Rosanna A1 - Pereira, Alexandre C A1 - Romero-Gómez, Manuel A1 - Nafría-Jiménez, Beatriz A1 - Hov, Johannes R A1 - Migeotte, Isabelle A1 - Renieri, Alessandra A1 - Planas, Anna M A1 - Ludwig, Kerstin U A1 - Buti, Maria A1 - Rahmouni, Souad A1 - Alarcón-Riquelme, Marta E A1 - Schulte, Eva C A1 - Franke, Andre A1 - Karlsen, Tom H A1 - Valenti, Luca A1 - Zeberg, Hugo A1 - Richards, Brent A1 - Ganna, Andrea A1 - Boada, Mercè A1 - Rojas, Itziar A1 - Ruiz, Agustín A1 - Sánchez, Pascual A1 - Real, Luis Miguel A1 - Guillén-Navarro, Encarna A1 - Ayuso, Carmen A1 - González-Neira, Anna A1 - Riancho, José A A1 - Rojas-Martinez, Augusto A1 - Flores, Carlos A1 - Lapunzina, Pablo A1 - Carracedo, Ángel AB -

Here we describe the results of a genome-wide study conducted in 11 939 COVID-19 positive cases with an extensive clinical information that were recruited from 34 hospitals across Spain (SCOURGE consortium). In sex-disaggregated genome-wide association studies for COVID-19 hospitalization, genome-wide significance (p < 5x10-8) was crossed for variants in 3p21.31 and 21q22.11 loci only among males (p = 1.3x10-22 and p = 8.1x10-12, respectively), and for variants in 9q21.32 near TLE1 only among females (p = 4.4x10-8). In a second phase, results were combined with an independent Spanish cohort (1598 COVID-19 cases and 1068 population controls), revealing in the overall analysis two novel risk loci in 9p13.3 and 19q13.12, with fine-mapping prioritized variants functionally associated with AQP3 (p = 2.7x10-8) and ARHGAP33 (p = 1.3x10-8), respectively. The meta-analysis of both phases with four European studies stratified by sex from the Host Genetics Initiative confirmed the association of the 3p21.31 and 21q22.11 loci predominantly in males and replicated a recently reported variant in 11p13 (ELF5, p = 4.1x10-8). Six of the COVID-19 HGI discovered loci were replicated and an HGI-based genetic risk score predicted the severity strata in SCOURGE. We also found more SNP-heritability and larger heritability differences by age (<60 or ≥ 60 years) among males than among females. Parallel genome-wide screening of inbreeding depression in SCOURGE also showed an effect of homozygosity in COVID-19 hospitalization and severity and this effect was stronger among older males. In summary, new candidate genes for COVID-19 severity and evidence supporting genetic disparities among sexes are provided.

ER - TY - JOUR T1 - Protein and functional isoform levels and genetic variants of the BAFF and APRIL pathway components in systemic lupus erythematosus. JF - Sci Rep Y1 - 2022 A1 - Ortiz-Aljaro, Pilar A1 - Montes-Cano, Marco Antonio A1 - García-Lozano, José-Raúl A1 - Aquino, Virginia A1 - Carmona, Rosario A1 - Perez-Florido, Javier A1 - García-Hernández, Francisco José A1 - Dopazo, Joaquin A1 - González-Escribano, María Francisca AB -

Systemic lupus erythematosus (SLE) is the prototype of an autoimmune disease. Belimumab, a monoclonal antibody targets BAFF, is the only biologic approved for SLE and active lupus nephritis. BAFF is a cytokine with a key-regulatory role in the B cell homeostasis, which acts by binding to three receptors: BAFF-R, TACI and BCMA. TACI and BCMA also bind APRIL. Many studies reported elevated soluble BAFF and APRIL levels in the sera of SLE patients, but other questions about the role of this system in the disease remain open. The study aimed to investigate the utility of the cytokine levels in serum and urine as biomarkers, the role of non-functional isoforms, and the association of gene variants with the disease. This case-control study includes a cohort (women, 18-60 years old) of 100 patients (48% with nephritis) and 100 healthy controls. We used ELISA assays to measure the cytokine concentrations in serum (sBAFF and sAPRIL) and urine (uBAFF and uAPRIL); TaqMan Gene Expression Assays to quantify the relative mRNA expression of ΔBAFF, βAPRIL, and εAPRIL, and next-generation sequencing to genotype the cytokine (TNFSF13 and TNFSF13B) and receptor (TNFRSF13B, TNFRSF17 and TNFRSF13C) genes. The statistical tests used were: Kruskal-Wallis (qualitative variables), the Spearman Rho coefficient (correlations), the Chi-square and SKAT (association of common and rare genetic variants, respectively). As expected, sBAFF and sAPRIL levels were higher in patients than in controls (p ≤ 0.001) but found differences between patient subgroups. sBAFF and sAPRIL significantly correlated only in patients with nephritis (r = 0.67, p ≤ 0.001) and βAPRIL levels were lower in patients with nephritis (p = 0.04), and ΔBAFF levels were lower in patients with dsDNA antibodies (p = 0.04). Rare variants of TNFSF13 and TNFRSF13B and TNFSF13 p.Gly67Arg and TNFRSF13B p.Val220Ala were associated with SLE. Our study supports differences among SLE patient subgroups with diverse clinical features in the BAFF/APRIL pathway. In addition, it suggests the involvement of genetic variants in the susceptibility to the disease.

VL - 12 IS - 1 ER - TY - JOUR T1 - A comprehensive database for integrated analysis of omics data in autoimmune diseases. JF - BMC Bioinformatics Y1 - 2021 A1 - Martorell-Marugán, Jordi A1 - López-Domínguez, Raúl A1 - García-Moreno, Adrián A1 - Toro-Domínguez, Daniel A1 - Villatoro-García, Juan Antonio A1 - Barturen, Guillermo A1 - Martín-Gómez, Adoración A1 - Troule, Kevin A1 - Gómez-López, Gonzalo A1 - Al-Shahrour, Fátima A1 - González-Rumayor, Víctor A1 - Peña-Chilet, Maria A1 - Dopazo, Joaquin A1 - Saez-Rodriguez, Julio A1 - Alarcón-Riquelme, Marta E A1 - Carmona-Sáez, Pedro KW - Autoimmune Diseases KW - Computational Biology KW - Databases, Factual KW - Humans AB -

BACKGROUND: Autoimmune diseases are heterogeneous pathologies with difficult diagnosis and few therapeutic options. In the last decade, several omics studies have provided significant insights into the molecular mechanisms of these diseases. Nevertheless, data from different cohorts and pathologies are stored independently in public repositories and a unified resource is imperative to assist researchers in this field.

RESULTS: Here, we present Autoimmune Diseases Explorer ( https://adex.genyo.es ), a database that integrates 82 curated transcriptomics and methylation studies covering 5609 samples for some of the most common autoimmune diseases. The database provides, in an easy-to-use environment, advanced data analysis and statistical methods for exploring omics datasets, including meta-analysis, differential expression or pathway analysis.

CONCLUSIONS: This is the first omics database focused on autoimmune diseases. This resource incorporates homogeneously processed data to facilitate integrative analyses among studies.

VL - 22 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/34167460?dopt=Abstract ER - TY - JOUR T1 - COVID19 Disease Map, a computational knowledge repository of virus-host interaction mechanisms. JF - Mol Syst Biol Y1 - 2021 A1 - Ostaszewski, Marek A1 - Niarakis, Anna A1 - Mazein, Alexander A1 - Kuperstein, Inna A1 - Phair, Robert A1 - Orta-Resendiz, Aurelio A1 - Singh, Vidisha A1 - Aghamiri, Sara Sadat A1 - Acencio, Marcio Luis A1 - Glaab, Enrico A1 - Ruepp, Andreas A1 - Fobo, Gisela A1 - Montrone, Corinna A1 - Brauner, Barbara A1 - Frishman, Goar A1 - Monraz Gómez, Luis Cristóbal A1 - Somers, Julia A1 - Hoch, Matti A1 - Kumar Gupta, Shailendra A1 - Scheel, Julia A1 - Borlinghaus, Hanna A1 - Czauderna, Tobias A1 - Schreiber, Falk A1 - Montagud, Arnau A1 - Ponce de Leon, Miguel A1 - Funahashi, Akira A1 - Hiki, Yusuke A1 - Hiroi, Noriko A1 - Yamada, Takahiro G A1 - Dräger, Andreas A1 - Renz, Alina A1 - Naveez, Muhammad A1 - Bocskei, Zsolt A1 - Messina, Francesco A1 - Börnigen, Daniela A1 - Fergusson, Liam A1 - Conti, Marta A1 - Rameil, Marius A1 - Nakonecnij, Vanessa A1 - Vanhoefer, Jakob A1 - Schmiester, Leonard A1 - Wang, Muying A1 - Ackerman, Emily E A1 - Shoemaker, Jason E A1 - Zucker, Jeremy A1 - Oxford, Kristie A1 - Teuton, Jeremy A1 - Kocakaya, Ebru A1 - Summak, Gökçe Yağmur A1 - Hanspers, Kristina A1 - Kutmon, Martina A1 - Coort, Susan A1 - Eijssen, Lars A1 - Ehrhart, Friederike A1 - Rex, Devasahayam Arokia Balaya A1 - Slenter, Denise A1 - Martens, Marvin A1 - Pham, Nhung A1 - Haw, Robin A1 - Jassal, Bijay A1 - Matthews, Lisa A1 - Orlic-Milacic, Marija A1 - Senff Ribeiro, Andrea A1 - Rothfels, Karen A1 - Shamovsky, Veronica A1 - Stephan, Ralf A1 - Sevilla, Cristoffer A1 - Varusai, Thawfeek A1 - Ravel, Jean-Marie A1 - Fraser, Rupsha A1 - Ortseifen, Vera A1 - Marchesi, Silvia A1 - Gawron, Piotr A1 - Smula, Ewa A1 - Heirendt, Laurent A1 - Satagopam, Venkata A1 - Wu, Guanming A1 - Riutta, Anders A1 - Golebiewski, Martin A1 - Owen, Stuart A1 - Goble, Carole A1 - Hu, Xiaoming A1 - Overall, Rupert W A1 - Maier, Dieter A1 - Bauch, Angela A1 - Gyori, Benjamin M A1 - Bachman, John A A1 - Vega, Carlos A1 - Grouès, Valentin A1 - Vazquez, Miguel A1 - Porras, Pablo A1 - Licata, Luana A1 - Iannuccelli, Marta A1 - Sacco, Francesca A1 - Nesterova, Anastasia A1 - Yuryev, Anton A1 - de Waard, Anita A1 - Turei, Denes A1 - Luna, Augustin A1 - Babur, Ozgun A1 - Soliman, Sylvain A1 - Valdeolivas, Alberto A1 - Esteban-Medina, Marina A1 - Peña-Chilet, Maria A1 - Rian, Kinza A1 - Helikar, Tomáš A1 - Puniya, Bhanwar Lal A1 - Modos, Dezso A1 - Treveil, Agatha A1 - Olbei, Marton A1 - De Meulder, Bertrand A1 - Ballereau, Stephane A1 - Dugourd, Aurélien A1 - Naldi, Aurélien A1 - Noël, Vincent A1 - Calzone, Laurence A1 - Sander, Chris A1 - Demir, Emek A1 - Korcsmaros, Tamas A1 - Freeman, Tom C A1 - Augé, Franck A1 - Beckmann, Jacques S A1 - Hasenauer, Jan A1 - Wolkenhauer, Olaf A1 - Wilighagen, Egon L A1 - Pico, Alexander R A1 - Evelo, Chris T A1 - Gillespie, Marc E A1 - Stein, Lincoln D A1 - Hermjakob, Henning A1 - D'Eustachio, Peter A1 - Saez-Rodriguez, Julio A1 - Dopazo, Joaquin A1 - Valencia, Alfonso A1 - Kitano, Hiroaki A1 - Barillot, Emmanuel A1 - Auffray, Charles A1 - Balling, Rudi A1 - Schneider, Reinhard KW - Antiviral Agents KW - Computational Biology KW - Computer Graphics KW - COVID-19 KW - Cytokines KW - Data Mining KW - Databases, Factual KW - Gene Expression Regulation KW - Host Microbial Interactions KW - Humans KW - Immunity, Cellular KW - Immunity, Humoral KW - Immunity, Innate KW - Lymphocytes KW - Metabolic Networks and Pathways KW - Myeloid Cells KW - Protein Interaction Mapping KW - SARS-CoV-2 KW - Signal Transduction KW - Software KW - Transcription Factors KW - Viral Proteins AB -

We need to effectively combine the knowledge from surging literature with complex datasets to propose mechanistic models of SARS-CoV-2 infection, improving data interpretation and predicting key targets of intervention. Here, we describe a large-scale community effort to build an open access, interoperable and computable repository of COVID-19 molecular mechanisms. The COVID-19 Disease Map (C19DMap) is a graphical, interactive representation of disease-relevant molecular mechanisms linking many knowledge sources. Notably, it is a computational resource for graph-based analyses and disease modelling. To this end, we established a framework of tools, platforms and guidelines necessary for a multifaceted community of biocurators, domain experts, bioinformaticians and computational biologists. The diagrams of the C19DMap, curated from the literature, are integrated with relevant interaction and text mining databases. We demonstrate the application of network analysis and modelling approaches by concrete examples to highlight new testable hypotheses. This framework helps to find signatures of SARS-CoV-2 predisposition, treatment response or prioritisation of drug candidates. Such an approach may help deal with new waves of COVID-19 or similar pandemics in the long-term perspective.

VL - 17 IS - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/34664389?dopt=Abstract ER - TY - JOUR T1 - CSVS, a crowdsourcing database of the Spanish population genetic variability. JF - Nucleic Acids Res Y1 - 2021 A1 - Peña-Chilet, Maria A1 - Roldán, Gema A1 - Perez-Florido, Javier A1 - Ortuno, Francisco M A1 - Carmona, Rosario A1 - Aquino, Virginia A1 - López-López, Daniel A1 - Loucera, Carlos A1 - Fernandez-Rueda, Jose L A1 - Gallego, Asunción A1 - Garcia-Garcia, Francisco A1 - González-Neira, Anna A1 - Pita, Guillermo A1 - Núñez-Torres, Rocío A1 - Santoyo-López, Javier A1 - Ayuso, Carmen A1 - Minguez, Pablo A1 - Avila-Fernandez, Almudena A1 - Corton, Marta A1 - Moreno-Pelayo, Miguel Ángel A1 - Morin, Matías A1 - Gallego-Martinez, Alvaro A1 - Lopez-Escamez, Jose A A1 - Borrego, Salud A1 - Antiňolo, Guillermo A1 - Amigo, Jorge A1 - Salgado-Garrido, Josefa A1 - Pasalodos-Sanchez, Sara A1 - Morte, Beatriz A1 - Carracedo, Ángel A1 - Alonso, Ángel A1 - Dopazo, Joaquin KW - Alleles KW - Chromosome Mapping KW - Crowdsourcing KW - Databases, Genetic KW - Exome KW - Gene Frequency KW - Genetic Variation KW - Genetics, Population KW - Genome, Human KW - Genomics KW - Humans KW - Internet KW - Precision Medicine KW - Software KW - Spain AB -

The knowledge of the genetic variability of the local population is of utmost importance in personalized medicine and has been revealed as a critical factor for the discovery of new disease variants. Here, we present the Collaborative Spanish Variability Server (CSVS), which currently contains more than 2000 genomes and exomes of unrelated Spanish individuals. This database has been generated in a collaborative crowdsourcing effort collecting sequencing data produced by local genomic projects and for other purposes. Sequences have been grouped by ICD10 upper categories. A web interface allows querying the database removing one or more ICD10 categories. In this way, aggregated counts of allele frequencies of the pseudo-control Spanish population can be obtained for diseases belonging to the category removed. Interestingly, in addition to pseudo-control studies, some population studies can be made, as, for example, prevalence of pharmacogenomic variants, etc. In addition, this genomic data has been used to define the first Spanish Genome Reference Panel (SGRP1.0) for imputation. This is the first local repository of variability entirely produced by a crowdsourcing effort and constitutes an example for future initiatives to characterize local variability worldwide. CSVS is also part of the GA4GH Beacon network. CSVS can be accessed at: http://csvs.babelomics.org/.

VL - 49 IS - D1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/32990755?dopt=Abstract ER - TY - JOUR T1 - De novo small deletion affecting transcription start site of short isoform of AUTS2 gene in a patient with syndromic neurodevelopmental defects. JF - Am J Med Genet A Y1 - 2021 A1 - Martinez-Delgado, Beatriz A1 - Lopez-Martin, Estrella A1 - Lara-Herguedas, Julián A1 - Monzon, Sara A1 - Cuesta, Isabel A1 - Juliá, Miguel A1 - Aquino, Virginia A1 - Rodriguez-Martin, Carlos A1 - Damian, Alejandra A1 - Gonzalo, Irene A1 - Gomez-Mariano, Gema A1 - Baladron, Beatriz A1 - Cazorla, Rosario A1 - Iglesias, Gema A1 - Roman, Enriqueta A1 - Ros, Purificacion A1 - Tutor, Pablo A1 - Mellor, Susana A1 - Jimenez, Carlos A1 - Cabrejas, Maria Jose A1 - Gonzalez-Vioque, Emiliano A1 - Alonso, Javier A1 - Bermejo-Sánchez, Eva A1 - Posada, Manuel KW - Child, Preschool KW - Cytoskeletal Proteins KW - Dwarfism KW - Exons KW - Gene Expression Regulation KW - Genetic Association Studies KW - Humans KW - Male KW - Neurodevelopmental Disorders KW - Protein Isoforms KW - RNA, Messenger KW - Sequence Deletion KW - Syndrome KW - Transcription Factors KW - Transcription Initiation Site KW - Transcription, Genetic AB -

Disruption of the autism susceptibility candidate 2 (AUTS2) gene through genomic rearrangements, copy number variations (CNVs), and intragenic deletions and mutations, has been recurrently involved in syndromic forms of developmental delay and intellectual disability, known as AUTS2 syndrome. The AUTS2 gene plays an important role in regulation of neuronal migration, and when altered, associates with a variable phenotype from severely to mildly affected patients. The more severe phenotypes significantly correlate with the presence of defects affecting the C-terminus part of the gene. This article reports a new patient with a syndromic neurodevelopmental disorder, who presents a deletion of 30 nucleotides in the exon 9 of the AUTS2 gene. Importantly, this deletion includes the transcription start site for the AUTS2 short transcript isoform, which has an important role in brain development. Gene expression analysis of AUTS2 full-length and short isoforms revealed that the deletion found in this patient causes a remarkable reduction in the expression level, not only of the short isoform, but also of the full AUTS2 transcripts. This report adds more evidence for the role of mutated AUTS2 short transcripts in the development of a severe phenotype in the AUTS2 syndrome.

VL - 185 IS - 3 ER - TY - JOUR T1 - A DNA damage repair gene-associated signature predicts responses of patients with advanced soft-tissue sarcoma to treatment with trabectedin. JF - Mol Oncol Y1 - 2021 A1 - Moura, David S A1 - Peña-Chilet, Maria A1 - Cordero Varela, Juan Antonio A1 - Alvarez-Alegret, Ramiro A1 - Agra-Pujol, Carolina A1 - Izquierdo, Francisco A1 - Ramos, Rafael A1 - Ortega-Medina, Luis A1 - Martin-Davila, Francisco A1 - Castilla-Ramirez, Carolina A1 - Hernandez-Leon, Carmen Nieves A1 - Romagosa, Cleofe A1 - Vaz Salgado, Maria Angeles A1 - Lavernia, Javier A1 - Bagué, Silvia A1 - Mayodormo-Aranda, Empar A1 - Vicioso, Luis A1 - Hernández Barceló, Jose Emilio A1 - Rubio-Casadevall, Jordi A1 - de Juan, Ana A1 - Fiaño-Valverde, Maria Concepcion A1 - Hindi, Nadia A1 - Lopez-Alvarez, Maria A1 - Lacerenza, Serena A1 - Dopazo, Joaquin A1 - Gutierrez, Antonio A1 - Alvarez, Rosa A1 - Valverde, Claudia A1 - Martinez-Trufero, Javier A1 - Martin-Broto, Javier AB -

Predictive biomarkers of trabectedin represent an unmet need in advanced soft-tissue sarcomas (STS). DNA damage repair (DDR) genes, involved in homologous recombination or nucleotide excision repair, had been previously described as biomarkers of trabectedin resistance or sensitivity, respectively. The majority of these studies only focused on specific factors (ERCC1, ERCC5, and BRCA1) and did not evaluate several other DDR-related genes that could have a relevant role for trabectedin efficacy. In this retrospective translational study, 118 genes involved in DDR were evaluated to determine, by transcriptomics, a predictive gene signature of trabectedin efficacy. A six-gene predictive signature of trabectedin efficacy was built in a series of 139 tumor samples from patients with advanced STS. Patients in the high-risk gene signature group showed a significantly worse progression-free survival compared with patients in the low-risk group (2.1 vs 6.0 months, respectively). Differential gene expression analysis defined new potential predictive biomarkers of trabectedin sensitivity (PARP3 and CCNH) or resistance (DNAJB11 and PARP1). Our study identified a new gene signature that significantly predicts patients with higher probability to respond to treatment with trabectedin. Targeting some genes of this signature emerges as a potential strategy to enhance trabectedin efficacy.

VL - 15 IS - 12 U1 - https://www.ncbi.nlm.nih.gov/pubmed/33983674?dopt=Abstract ER - TY - JOUR T1 - Genome-scale mechanistic modeling of signaling pathways made easy: A bioconductor/cytoscape/web server framework for the analysis of omic data JF - Computational and Structural Biotechnology Journal Y1 - 2021 A1 - Rian, Kinza A1 - Hidalgo, Marta R. A1 - Cubuk, Cankut A1 - Falco, Matias M. A1 - Loucera, Carlos A1 - Esteban-Medina, Marina A1 - Alamo-Alvarez, Inmaculada A1 - Peña-Chilet, Maria A1 - Dopazo, Joaquin VL - 19 UR - https://linkinghub.elsevier.com/retrieve/pii/S2001037021002038 JO - Computational and Structural Biotechnology Journal ER - TY - JOUR T1 - Genome-wide analysis of DNA methylation in Hirschsprung enteric precursor cells: unraveling the epigenetic landscape of enteric nervous system developmentAbstractBackgroundResultsConclusionsGraphic abstract JF - Clinical Epigenetics Y1 - 2021 A1 - Villalba-Benito, Leticia A1 - López-López, Daniel A1 - Torroglosa, Ana A1 - Casimiro-Soriguer, Carlos S. A1 - Luzón-Toro, Berta A1 - Fernández, Raquel María A1 - Moya-Jiménez, María José A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin A1 - Borrego, Salud VL - 13 UR - http://link.springer.com/article/10.1186/s13148-021-01040-6/fulltext.html IS - 1 JO - Clin Epigenet ER - TY - JOUR T1 - The NCI Genomic Data Commons JF - Nature Genetics Y1 - 2021 A1 - Heath, Allison P. A1 - Ferretti, Vincent A1 - Agrawal, Stuti A1 - An, Maksim A1 - Angelakos, James C. A1 - Arya, Renuka A1 - Bajari, Rosita A1 - Baqar, Bilal A1 - Barnowski, Justin H. B. A1 - Burt, Jeffrey A1 - Catton, Ann A1 - Chan, Brandon F. A1 - Chu, Fay A1 - Cullion, Kim A1 - Davidsen, Tanja A1 - Do, Phuong-My A1 - Dompierre, Christian A1 - Ferguson, Martin L. A1 - Fitzsimons, Michael S. A1 - Ford, Michael A1 - Fukuma, Miyuki A1 - Gaheen, Sharon A1 - Ganji, Gajanan L. A1 - Garcia, Tzintzuni I. A1 - George, Sameera S. A1 - Gerhard, Daniela S. A1 - Gerthoffert, Francois A1 - Gomez, Fauzi A1 - Han, Kang A1 - Hernandez, Kyle M. A1 - Issac, Biju A1 - Jackson, Richard A1 - Jensen, Mark A. A1 - Joshi, Sid A1 - Kadam, Ajinkya A1 - Khurana, Aishmit A1 - Kim, Kyle M. J. A1 - Kraft, Victoria E. A1 - Li, Shenglai A1 - Lichtenberg, Tara M. A1 - Lodato, Janice A1 - Lolla, Laxmi A1 - Martinov, Plamen A1 - Mazzone, Jeffrey A. A1 - Miller, Daniel P. A1 - Miller, Ian A1 - Miller, Joshua S. A1 - Miyauchi, Koji A1 - Murphy, Mark W. A1 - Nullet, Thomas A1 - Ogwara, Rowland O. A1 - Ortuño, Francisco M. A1 - Pedrosa, Jesús A1 - Pham, Phuong L. A1 - Popov, Maxim Y. A1 - Porter, James J. A1 - Powell, Raymond A1 - Rademacher, Karl A1 - Reid, Colin P. A1 - Rich, Samantha A1 - Rogel, Bessie A1 - Sahni, Himanso A1 - Savage, Jeremiah H. A1 - Schmitt, Kyle A. A1 - Simmons, Trevar J. A1 - Sislow, Joseph A1 - Spring, Jonathan A1 - Stein, Lincoln A1 - Sullivan, Sean A1 - Tang, Yajing A1 - Thiagarajan, Mathangi A1 - Troyer, Heather D. A1 - Wang, Chang A1 - Wang, Zhining A1 - West, Bedford L. A1 - Wilmer, Alex A1 - Wilson, Shane A1 - Wu, Kaman A1 - Wysocki, William P. A1 - Xiang, Linda A1 - Yamada, Joseph T. A1 - Yang, Liming A1 - Yu, Christine A1 - Yung, Christina K. A1 - Zenklusen, Jean Claude A1 - Zhang, Junjun A1 - Zhang, Zhenyu A1 - Zhao, Yuanheng A1 - Zubair, Ariz A1 - Staudt, Louis M. A1 - Grossman, Robert L. UR - http://www.nature.com/articles/s41588-021-00791-5 JO - Nat Genet ER - TY - JOUR T1 - Reporting guidelines for human microbiome research: the STORMS checklist. JF - Nat Med Y1 - 2021 A1 - Mirzayi, Chloe A1 - Renson, Audrey A1 - Zohra, Fatima A1 - Elsafoury, Shaimaa A1 - Geistlinger, Ludwig A1 - Kasselman, Lora J A1 - Eckenrode, Kelly A1 - van de Wijgert, Janneke A1 - Loughman, Amy A1 - Marques, Francine Z A1 - MacIntyre, David A A1 - Arumugam, Manimozhiyan A1 - Azhar, Rimsha A1 - Beghini, Francesco A1 - Bergstrom, Kirk A1 - Bhatt, Ami A1 - Bisanz, Jordan E A1 - Braun, Jonathan A1 - Bravo, Hector Corrada A1 - Buck, Gregory A A1 - Bushman, Frederic A1 - Casero, David A1 - Clarke, Gerard A1 - Collado, Maria Carmen A1 - Cotter, Paul D A1 - Cryan, John F A1 - Demmer, Ryan T A1 - Devkota, Suzanne A1 - Elinav, Eran A1 - Escobar, Juan S A1 - Fettweis, Jennifer A1 - Finn, Robert D A1 - Fodor, Anthony A A1 - Forslund, Sofia A1 - Franke, Andre A1 - Furlanello, Cesare A1 - Gilbert, Jack A1 - Grice, Elizabeth A1 - Haibe-Kains, Benjamin A1 - Handley, Scott A1 - Herd, Pamela A1 - Holmes, Susan A1 - Jacobs, Jonathan P A1 - Karstens, Lisa A1 - Knight, Rob A1 - Knights, Dan A1 - Koren, Omry A1 - Kwon, Douglas S A1 - Langille, Morgan A1 - Lindsay, Brianna A1 - McGovern, Dermot A1 - McHardy, Alice C A1 - McWeeney, Shannon A1 - Mueller, Noel T A1 - Nezi, Luigi A1 - Olm, Matthew A1 - Palm, Noah A1 - Pasolli, Edoardo A1 - Raes, Jeroen A1 - Redinbo, Matthew R A1 - Rühlemann, Malte A1 - Balfour Sartor, R A1 - Schloss, Patrick D A1 - Schriml, Lynn A1 - Segal, Eran A1 - Shardell, Michelle A1 - Sharpton, Thomas A1 - Smirnova, Ekaterina A1 - Sokol, Harry A1 - Sonnenburg, Justin L A1 - Srinivasan, Sujatha A1 - Thingholm, Louise B A1 - Turnbaugh, Peter J A1 - Upadhyay, Vaibhav A1 - Walls, Ramona L A1 - Wilmes, Paul A1 - Yamada, Takuji A1 - Zeller, Georg A1 - Zhang, Mingyu A1 - Zhao, Ni A1 - Zhao, Liping A1 - Bao, Wenjun A1 - Culhane, Aedin A1 - Devanarayan, Viswanath A1 - Dopazo, Joaquin A1 - Fan, Xiaohui A1 - Fischer, Matthias A1 - Jones, Wendell A1 - Kusko, Rebecca A1 - Mason, Christopher E A1 - Mercer, Tim R A1 - Sansone, Susanna-Assunta A1 - Scherer, Andreas A1 - Shi, Leming A1 - Thakkar, Shraddha A1 - Tong, Weida A1 - Wolfinger, Russ A1 - Hunter, Christopher A1 - Segata, Nicola A1 - Huttenhower, Curtis A1 - Dowd, Jennifer B A1 - Jones, Heidi E A1 - Waldron, Levi KW - Computational Biology KW - Dysbiosis KW - Humans KW - Microbiota KW - Observational Studies as Topic KW - Research Design KW - Translational Science, Biomedical AB -

The particularly interdisciplinary nature of human microbiome research makes the organization and reporting of results spanning epidemiology, biology, bioinformatics, translational medicine and statistics a challenge. Commonly used reporting guidelines for observational or genetic epidemiology studies lack key features specific to microbiome studies. Therefore, a multidisciplinary group of microbiome epidemiology researchers adapted guidelines for observational and genetic studies to culture-independent human microbiome studies, and also developed new reporting elements for laboratory, bioinformatics and statistical analyses tailored to microbiome studies. The resulting tool, called 'Strengthening The Organization and Reporting of Microbiome Studies' (STORMS), is composed of a 17-item checklist organized into six sections that correspond to the typical sections of a scientific publication, presented as an editable table for inclusion in supplementary materials. The STORMS checklist provides guidance for concise and complete reporting of microbiome studies that will facilitate manuscript preparation, peer review, and reader comprehension of publications and comparative analysis of published results.

VL - 27 IS - 11 U1 - https://www.ncbi.nlm.nih.gov/pubmed/34789871?dopt=Abstract ER - TY - JOUR T1 - Schuurs–Hoeijmakers Syndrome (PACS1 Neurodevelopmental Disorder): Seven Novel Patients and a Review JF - Genes Y1 - 2021 A1 - Tenorio-Castaño, Jair A1 - Morte, Beatriz A1 - Nevado, Julián A1 - Martínez-Glez, Víctor A1 - Santos-Simarro, Fernando A1 - García-Miñaur, Sixto A1 - Palomares-Bralo, María A1 - Pacio-Míguez, Marta A1 - Gómez, Beatriz A1 - Arias, Pedro A1 - Alcochea, Alba A1 - Carrión, Juan A1 - Arias, Patricia A1 - Almoguera, Berta A1 - López-Grondona, Fermina A1 - Lorda-Sanchez, Isabel A1 - Galán-Gómez, Enrique A1 - Valenzuela, Irene A1 - Méndez Perez, María A1 - Cuscó, Ivón A1 - Barros, Francisco A1 - Pié, Juan A1 - Ramos, Sergio A1 - Ramos, Feliciano A1 - Kuechler, Alma A1 - Tizzano, Eduardo A1 - Ayuso, Carmen A1 - Kaiser, Frank A1 - Pérez-Jurado, Luis A1 - Carracedo, Ángel A1 - Lapunzina, Pablo VL - 12 UR - https://www.mdpi.com/2073-4425/12/5/738https://www.mdpi.com/2073-4425/12/5/738/pdf IS - 5 JO - Genes ER - TY - JOUR T1 - Uniform genomic data analysis in the NCI Genomic Data CommonsAbstract JF - Nature Communications Y1 - 2021 A1 - Zhang, Zhenyu A1 - Hernandez, Kyle A1 - Savage, Jeremiah A1 - Li, Shenglai A1 - Miller, Dan A1 - Agrawal, Stuti A1 - Ortuno, Francisco A1 - Staudt, Louis M. A1 - Heath, Allison A1 - Grossman, Robert L. VL - 12 UR - http://www.nature.com/articles/s41467-021-21254-9 IS - 1 JO - Nat Commun ER - TY - JOUR T1 - Association of a single nucleotide polymorphism in the ubxn6 gene with long-term non-progression phenotype in HIV-positive individuals. JF - Clin Microbiol Infect Y1 - 2020 A1 - Díez-Fuertes, F A1 - De La Torre-Tarazona, H E A1 - Calonge, E A1 - Pernas, M A1 - Bermejo, M A1 - García-Pérez, J A1 - Álvarez, A A1 - Capa, L A1 - García-García, F A1 - Saumoy, M A1 - Riera, M A1 - Boland-Auge, A A1 - López-Galíndez, C A1 - Lathrop, M A1 - Dopazo, J A1 - Sakuntabhai, A A1 - Alcamí, J KW - Adaptor Proteins, Vesicular Transport KW - Autophagy-Related Proteins KW - Caveolin 1 KW - Cohort Studies KW - Dendritic Cells KW - Disease Progression KW - Gene Frequency KW - Gene Knockdown Techniques KW - Genetic Association Studies KW - HeLa Cells KW - HIV Infections KW - HIV Long-Term Survivors KW - HIV-1 KW - Humans KW - Macrophages KW - Oligonucleotide Array Sequence Analysis KW - Phenotype KW - Polymorphism, Single Nucleotide KW - whole exome sequencing AB -

OBJECTIVES: The long-term non-progressors (LTNPs) are a heterogeneous group of HIV-positive individuals characterized by their ability to maintain high CD4 T-cell counts and partially control viral replication for years in the absence of antiretroviral therapy. The present study aims to identify host single nucleotide polymorphisms (SNPs) associated with non-progression in a cohort of 352 individuals.

METHODS: DNA microarrays and exome sequencing were used for genotyping about 240 000 functional polymorphisms throughout more than 20 000 human genes. The allele frequencies of 85 LTNPs were compared with a control population. SNPs associated with LTNPs were confirmed in a population of typical progressors. Functional analyses in the affected gene were carried out through knockdown experiments in HeLa-P4, macrophages and dendritic cells.

RESULTS: Several SNPs located within the major histocompatibility complex region previously related to LTNPs were confirmed in this new cohort. The SNP rs1127888 (UBXN6) surpassed the statistical significance of these markers after Bonferroni correction (q = 2.11 × 10). An uncommon allelic frequency of rs1127888 among LTNPs was confirmed by comparison with typical progressors and other publicly available populations. UBXN6 knockdown experiments caused an increase in CAV1 expression and its accumulation in the plasma membrane. In vitro infection of different cell types with HIV-1 replication-competent recombinant viruses caused a reduction of the viral replication capacity compared with their corresponding wild-type cells expressing UBXN6.

CONCLUSIONS: A higher prevalence of Ala31Thr in UBXN6 was found among LTNPs within its N-terminal region, which is crucial for UBXN6/VCP protein complex formation. UBXN6 knockdown affected CAV1 turnover and HIV-1 replication capacity.

VL - 26 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/31158522?dopt=Abstract ER - TY - JOUR T1 - COVID-19 Disease Map, building a computational repository of SARS-CoV-2 virus-host interaction mechanisms. JF - Sci Data Y1 - 2020 A1 - Ostaszewski, Marek A1 - Mazein, Alexander A1 - Gillespie, Marc E A1 - Kuperstein, Inna A1 - Niarakis, Anna A1 - Hermjakob, Henning A1 - Pico, Alexander R A1 - Willighagen, Egon L A1 - Evelo, Chris T A1 - Hasenauer, Jan A1 - Schreiber, Falk A1 - Dräger, Andreas A1 - Demir, Emek A1 - Wolkenhauer, Olaf A1 - Furlong, Laura I A1 - Barillot, Emmanuel A1 - Dopazo, Joaquin A1 - Orta-Resendiz, Aurelio A1 - Messina, Francesco A1 - Valencia, Alfonso A1 - Funahashi, Akira A1 - Kitano, Hiroaki A1 - Auffray, Charles A1 - Balling, Rudi A1 - Schneider, Reinhard KW - Betacoronavirus KW - Computational Biology KW - Coronavirus Infections KW - COVID-19 KW - Databases, Factual KW - Host Microbial Interactions KW - Host-Pathogen Interactions KW - Humans KW - International Cooperation KW - Models, Biological KW - Pandemics KW - Pneumonia, Viral KW - SARS-CoV-2 VL - 7 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/32371892?dopt=Abstract ER - TY - JOUR T1 - The ELIXIR Human Copy Number Variations Community: building bioinformatics infrastructure for research. JF - F1000Res Y1 - 2020 A1 - Salgado, David A1 - Armean, Irina M A1 - Baudis, Michael A1 - Beltran, Sergi A1 - Capella-Gutíerrez, Salvador A1 - Carvalho-Silva, Denise A1 - Dominguez Del Angel, Victoria A1 - Dopazo, Joaquin A1 - Furlong, Laura I A1 - Gao, Bo A1 - Garcia, Leyla A1 - Gerloff, Dietlind A1 - Gut, Ivo A1 - Gyenesei, Attila A1 - Habermann, Nina A1 - Hancock, John M A1 - Hanauer, Marc A1 - Hovig, Eivind A1 - Johansson, Lennart F A1 - Keane, Thomas A1 - Korbel, Jan A1 - Lauer, Katharina B A1 - Laurie, Steve A1 - Leskošek, Brane A1 - Lloyd, David A1 - Marqués-Bonet, Tomás A1 - Mei, Hailiang A1 - Monostory, Katalin A1 - Piñero, Janet A1 - Poterlowicz, Krzysztof A1 - Rath, Ana A1 - Samarakoon, Pubudu A1 - Sanz, Ferran A1 - Saunders, Gary A1 - Sie, Daoud A1 - Swertz, Morris A A1 - Tsukanov, Kirill A1 - Valencia, Alfonso A1 - Vidak, Marko A1 - Yenyxe González, Cristina A1 - Ylstra, Bauke A1 - Béroud, Christophe KW - Computational Biology KW - DNA Copy Number Variations KW - High-Throughput Nucleotide Sequencing KW - Humans AB -

Copy number variations (CNVs) are major causative contributors both in the genesis of genetic diseases and human neoplasias. While "High-Throughput" sequencing technologies are increasingly becoming the primary choice for genomic screening analysis, their ability to efficiently detect CNVs is still heterogeneous and remains to be developed. The aim of this white paper is to provide a guiding framework for the future contributions of ELIXIR's recently established with implications beyond human disease diagnostics and population genomics. This white paper is the direct result of a strategy meeting that took place in September 2018 in Hinxton (UK) and involved representatives of 11 ELIXIR Nodes. The meeting led to the definition of priority objectives and tasks, to address a wide range of CNV-related challenges ranging from detection and interpretation to sharing and training. Here, we provide suggestions on how to align these tasks within the ELIXIR Platforms strategy, and on how to frame the activities of this new ELIXIR Community in the international context.

VL - 9 U1 - https://www.ncbi.nlm.nih.gov/pubmed/34367618?dopt=Abstract ER - TY - JOUR T1 - Immune Cell Associations with Cancer Risk. JF - iScience Y1 - 2020 A1 - Palomero, Luis A1 - Galván-Femenía, Ivan A1 - de Cid, Rafael A1 - Espín, Roderic A1 - Barnes, Daniel R A1 - Blommaert, Eline A1 - Gil-Gil, Miguel A1 - Falo, Catalina A1 - Stradella, Agostina A1 - Ouchi, Dan A1 - Roso-Llorach, Albert A1 - Violan, Concepció A1 - Peña-Chilet, Maria A1 - Dopazo, Joaquin A1 - Extremera, Ana Isabel A1 - García-Valero, Mar A1 - Herranz, Carmen A1 - Mateo, Francesca A1 - Mereu, Elisabetta A1 - Beesley, Jonathan A1 - Chenevix-Trench, Georgia A1 - Roux, Cecilia A1 - Mak, Tak A1 - Brunet, Joan A1 - Hakem, Razq A1 - Gorrini, Chiara A1 - Antoniou, Antonis C A1 - Lázaro, Conxi A1 - Pujana, Miquel Angel AB -

Proper immune system function hinders cancer development, but little is known about whether genetic variants linked to cancer risk alter immune cells. Here, we report 57 cancer risk loci associated with differences in immune and/or stromal cell contents in the corresponding tissue. Predicted target genes show expression and regulatory associations with immune features. Polygenic risk scores also reveal associations with immune and/or stromal cell contents, and breast cancer scores show consistent results in normal and tumor tissue. SH2B3 links peripheral alterations of several immune cell types to the risk of this malignancy. Pleiotropic SH2B3 variants are associated with breast cancer risk in BRCA1/2 mutation carriers. A retrospective case-cohort study indicates a positive association between blood counts of basophils, leukocytes, and monocytes and age at breast cancer diagnosis. These findings broaden our knowledge of the role of the immune system in cancer and highlight promising prevention strategies for individuals at high risk.

VL - 23 IS - 7 U1 - https://www.ncbi.nlm.nih.gov/pubmed/32622267?dopt=Abstract ER - TY - JOUR T1 - Optimised molecular genetic diagnostics of Fanconi anaemia by whole exome sequencing and functional studies. JF - J Med Genet Y1 - 2020 A1 - Bogliolo, Massimo A1 - Pujol, Roser A1 - Aza-Carmona, Miriam A1 - Muñoz-Subirana, Núria A1 - Rodriguez-Santiago, Benjamin A1 - Casado, José Antonio A1 - Rio, Paula A1 - Bauser, Christopher A1 - Reina-Castillón, Judith A1 - Lopez-Sanchez, Marcos A1 - Gonzalez-Quereda, Lidia A1 - Gallano, Pia A1 - Catalá, Albert A1 - Ruiz-Llobet, Ana A1 - Badell, Isabel A1 - Diaz-Heredia, Cristina A1 - Hladun, Raquel A1 - Senent, Leonort A1 - Argiles, Bienvenida A1 - Bergua Burgues, Juan Miguel A1 - Bañez, Fatima A1 - Arrizabalaga, Beatriz A1 - López Almaraz, Ricardo A1 - Lopez, Monica A1 - Figuera, Ángela A1 - Molinés, Antonio A1 - Pérez de Soto, Inmaculada A1 - Hernando, Inés A1 - Muñoz, Juan Antonio A1 - Del Rosario Marin, Maria A1 - Balmaña, Judith A1 - Stjepanovic, Neda A1 - Carrasco, Estela A1 - Cuesta, Isabel A1 - Cosuelo, José Miguel A1 - Regueiro, Alexandra A1 - Moraleda Jimenez, José A1 - Galera-Miñarro, Ana Maria A1 - Rosiñol, Laura A1 - Carrió, Anna A1 - Beléndez-Bieler, Cristina A1 - Escudero Soto, Antonio A1 - Cela, Elena A1 - de la Mata, Gregorio A1 - Fernández-Delgado, Rafael A1 - Garcia-Pardos, Maria Carmen A1 - Sáez-Villaverde, Raquel A1 - Barragaño, Marta A1 - Portugal, Raquel A1 - Lendinez, Francisco A1 - Hernadez, Ines A1 - Vagace, José Manue A1 - Tapia, Maria A1 - Nieto, José A1 - Garcia, Marta A1 - Gonzalez, Macarena A1 - Vicho, Cristina A1 - Galvez, Eva A1 - Valiente, Alberto A1 - Antelo, Maria Luisa A1 - Ancliff, Phil A1 - García, Francisco A1 - Dopazo, Joaquin A1 - Sevilla, Julian A1 - Paprotka, Tobias A1 - Pérez-Jurado, Luis Alberto A1 - Bueren, Juan A1 - Surralles, Jordi KW - Cell Line KW - DNA Copy Number Variations KW - DNA Repair KW - DNA-Binding Proteins KW - Fanconi Anemia KW - Fanconi Anemia Complementation Group A Protein KW - Female KW - Gene Knockout Techniques KW - Genetic Predisposition to Disease KW - Humans KW - Male KW - Mutation, Missense KW - Polymorphism, Single Nucleotide KW - whole exome sequencing AB -

PURPOSE: Patients with Fanconi anaemia (FA), a rare DNA repair genetic disease, exhibit chromosome fragility, bone marrow failure, malformations and cancer susceptibility. FA molecular diagnosis is challenging since FA is caused by point mutations and large deletions in 22 genes following three heritability patterns. To optimise FA patients' characterisation, we developed a simplified but effective methodology based on whole exome sequencing (WES) and functional studies.

METHODS: 68 patients with FA were analysed by commercial WES services. Copy number variations were evaluated by sequencing data analysis with RStudio. To test missense variants, wt FANCA cDNA was cloned and variants were introduced by site-directed mutagenesis. Vectors were then tested for their ability to complement DNA repair defects of a FANCA-KO human cell line generated by TALEN technologies.

RESULTS: We identified 93.3% of mutated alleles including large deletions. We determined the pathogenicity of three FANCA missense variants and demonstrated that two variants reported in mutations databases as 'affecting functions' are SNPs. Deep analysis of sequencing data revealed patients' true mutations, highlighting the importance of functional analysis. In one patient, no pathogenic variant could be identified in any of the 22 known FA genes, and in seven patients, only one deleterious variant could be identified (three patients each with FANCA and FANCD2 and one patient with FANCE mutations) CONCLUSION: WES and proper bioinformatics analysis are sufficient to effectively characterise patients with FA regardless of the rarity of their complementation group, type of mutations, mosaic condition and DNA source.

VL - 57 IS - 4 U1 - https://www.ncbi.nlm.nih.gov/pubmed/31586946?dopt=Abstract ER - TY - JOUR T1 - Platform to study intracellular polystyrene nanoplastic pollution and clinical outcomes. JF - Stem Cells Y1 - 2020 A1 - Bojic, Sanja A1 - Falco, Matias M A1 - Stojkovic, Petra A1 - Ljujic, Biljana A1 - Gazdic Jankovic, Marina A1 - Armstrong, Lyle A1 - Markovic, Nebojsa A1 - Dopazo, Joaquin A1 - Lako, Majlinda A1 - Bauer, Roman A1 - Stojkovic, Miodrag KW - Environmental Pollution KW - Humans KW - Induced Pluripotent Stem Cells KW - Intracellular Space KW - Nanoparticles KW - Plastics KW - Polystyrenes KW - Transcriptome KW - Treatment Outcome AB -

Increased pollution by plastics has become a serious global environmental problem, but the concerns for human health have been raised after reported presence of microplastics (MPs) and nanoplastics (NPs) in food and beverages. Unfortunately, few studies have investigate the potentially harmful effects of MPs/NPs on early human development and human health. Therefore, we used a new platform to study possible effects of polystyrene NPs (PSNPs) on the transcription profile of preimplantation human embryos and human induced pluripotent stem cells (hiPSCs). Two pluripotency genes, LEFTY1 and LEFTY2, which encode secreted ligands of the transforming growth factor-beta, were downregulated, while CA4 and OCLM, which are related to eye development, were upregulated in both samples. The gene set enrichment analysis showed that the development of atrioventricular heart valves and the dysfunction of cellular components, including extracellular matrix, were significantly affected after exposure of hiPSCs to PSNPs. Finally, using the HiPathia method, which uncovers disease mechanisms and predicts clinical outcomes, we determined the APOC3 circuit, which is responsible for increased risk for ischemic cardiovascular disease. These results clearly demonstrate that better understanding of NPs bioactivities and its implications for human health is of extreme importance. Thus, the presented platform opens further aspects to study interactions between different environmental and intracellular pollutions with the aim to decipher the mechanism and origin of human diseases.

VL - 38 IS - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/32614127?dopt=Abstract ER - TY - JOUR T1 - SMN1 copy-number and sequence variant analysis from next-generation sequencing data. JF - Hum Mutat Y1 - 2020 A1 - López-López, Daniel A1 - Loucera, Carlos A1 - Carmona, Rosario A1 - Aquino, Virginia A1 - Salgado, Josefa A1 - Pasalodos, Sara A1 - Miranda, María A1 - Alonso, Ángel A1 - Dopazo, Joaquin KW - Base Sequence KW - DNA Copy Number Variations KW - High-Throughput Nucleotide Sequencing KW - Humans KW - Reproducibility of Results KW - Software KW - Survival of Motor Neuron 1 Protein AB -

Spinal muscular atrophy (SMA) is a severe neuromuscular autosomal recessive disorder affecting 1/10,000 live births. Most SMA patients present homozygous deletion of SMN1, while the vast majority of SMA carriers present only a single SMN1 copy. The sequence similarity between SMN1 and SMN2, and the complexity of the SMN locus makes the estimation of the SMN1 copy-number by next-generation sequencing (NGS) very difficult. Here, we present SMAca, the first python tool to detect SMA carriers and estimate the absolute SMN1 copy-number using NGS data. Moreover, SMAca takes advantage of the knowledge of certain variants specific to SMN1 duplication to also identify silent carriers. This tool has been validated with a cohort of 326 samples from the Navarra 1000 Genomes Project (NAGEN1000). SMAca was developed with a focus on execution speed and easy installation. This combination makes it especially suitable to be integrated into production NGS pipelines. Source code and documentation are available at https://www.github.com/babelomics/SMAca.

VL - 41 IS - 12 U1 - https://www.ncbi.nlm.nih.gov/pubmed/33058415?dopt=Abstract ER - TY - JOUR T1 - Transcriptomic Analysis of a Diabetic Skin-Humanized Mouse Model Dissects Molecular Pathways Underlying the Delayed Wound Healing Response. JF - Genes (Basel) Y1 - 2020 A1 - León, Carlos A1 - Garcia-Garcia, Francisco A1 - Llames, Sara A1 - García-Pérez, Eva A1 - Carretero, Marta A1 - Arriba, María Del Carmen A1 - Dopazo, Joaquin A1 - Del Rio, Marcela A1 - Escamez, Maria José A1 - Martínez-Santamaría, Lucía KW - Animals KW - Diabetes Mellitus, Experimental KW - Gene Expression Profiling KW - Gene Expression Regulation KW - Gene ontology KW - Humans KW - Metabolic Networks and Pathways KW - Mice KW - Mice, Nude KW - Microarray Analysis KW - Molecular Sequence Annotation KW - Principal Component Analysis KW - Signal Transduction KW - Skin KW - Skin Transplantation KW - Skin Ulcer KW - Streptozocin KW - Tissue Engineering KW - Transcriptome KW - Transplantation, Heterologous KW - Wound Healing AB -

Defective healing leading to cutaneous ulcer formation is one of the most feared complications of diabetes due to its consequences on patients' quality of life and on the healthcare system. A more in-depth analysis of the underlying molecular pathophysiology is required to develop effective healing-promoting therapies for those patients. Major architectural and functional differences with human epidermis limit extrapolation of results coming from rodents and other small mammal-healing models. Therefore, the search for reliable humanized models has become mandatory. Previously, we developed a diabetes-induced delayed humanized wound healing model that faithfully recapitulated the major histological features of such skin repair-deficient condition. Herein, we present the results of a transcriptomic and functional enrichment analysis followed by a mechanistic analysis performed in such humanized wound healing model. The deregulation of genes implicated in functions such as angiogenesis, apoptosis, and inflammatory signaling processes were evidenced, confirming published data in diabetic patients that in fact might also underlie some of the histological features previously reported in the delayed skin-humanized healing model. Altogether, these molecular findings support the utility of such preclinical model as a valuable tool to gain insight into the molecular basis of the delayed diabetic healing with potential impact in the translational medicine field.

VL - 12 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/33396192?dopt=Abstract ER - TY - JOUR T1 - Transparency and reproducibility in artificial intelligence. JF - Nature Y1 - 2020 A1 - Haibe-Kains, Benjamin A1 - Adam, George Alexandru A1 - Hosny, Ahmed A1 - Khodakarami, Farnoosh A1 - Waldron, Levi A1 - Wang, Bo A1 - McIntosh, Chris A1 - Goldenberg, Anna A1 - Kundaje, Anshul A1 - Greene, Casey S A1 - Broderick, Tamara A1 - Hoffman, Michael M A1 - Leek, Jeffrey T A1 - Korthauer, Keegan A1 - Huber, Wolfgang A1 - Brazma, Alvis A1 - Pineau, Joelle A1 - Tibshirani, Robert A1 - Hastie, Trevor A1 - Ioannidis, John P A A1 - Quackenbush, John A1 - Aerts, Hugo J W L KW - Algorithms KW - Artificial Intelligence KW - Reproducibility of Results VL - 586 IS - 7829 U1 - https://www.ncbi.nlm.nih.gov/pubmed/33057217?dopt=Abstract ER - TY - JOUR T1 - Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen. JF - Nat Commun Y1 - 2019 A1 - Menden, Michael P A1 - Wang, Dennis A1 - Mason, Mike J A1 - Szalai, Bence A1 - Bulusu, Krishna C A1 - Guan, Yuanfang A1 - Yu, Thomas A1 - Kang, Jaewoo A1 - Jeon, Minji A1 - Wolfinger, Russ A1 - Nguyen, Tin A1 - Zaslavskiy, Mikhail A1 - Jang, In Sock A1 - Ghazoui, Zara A1 - Ahsen, Mehmet Eren A1 - Vogel, Robert A1 - Neto, Elias Chaibub A1 - Norman, Thea A1 - Tang, Eric K Y A1 - Garnett, Mathew J A1 - Veroli, Giovanni Y Di A1 - Fawell, Stephen A1 - Stolovitzky, Gustavo A1 - Guinney, Justin A1 - Dry, Jonathan R A1 - Saez-Rodriguez, Julio KW - ADAM17 Protein KW - Antineoplastic Combined Chemotherapy Protocols KW - Benchmarking KW - Biomarkers, Tumor KW - Cell Line, Tumor KW - Computational Biology KW - Datasets as Topic KW - Drug Antagonism KW - Drug Resistance, Neoplasm KW - Drug Synergism KW - Genomics KW - Humans KW - Molecular Targeted Therapy KW - mutation KW - Neoplasms KW - pharmacogenetics KW - Phosphatidylinositol 3-Kinases KW - Phosphoinositide-3 Kinase Inhibitors KW - Treatment Outcome AB -

The effectiveness of most cancer targeted therapies is short-lived. Tumors often develop resistance that might be overcome with drug combinations. However, the number of possible combinations is vast, necessitating data-driven approaches to find optimal patient-specific treatments. Here we report AstraZeneca's large drug combination dataset, consisting of 11,576 experiments from 910 combinations across 85 molecularly characterized cancer cell lines, and results of a DREAM Challenge to evaluate computational strategies for predicting synergistic drug pairs and biomarkers. 160 teams participated to provide a comprehensive methodological development and benchmarking. Winning methods incorporate prior knowledge of drug-target interactions. Synergy is predicted with an accuracy matching biological replicates for >60% of combinations. However, 20% of drug combinations are poorly predicted by all methods. Genomic rationale for synergy predictions are identified, including ADAM17 inhibitor antagonism when combined with PIK3CB/D inhibition contrasting to synergy when combined with other PI3K-pathway inhibitors in PIK3CA mutant cells.

VL - 10 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/31209238?dopt=Abstract ER - TY - JOUR T1 - A comparison of mechanistic signaling pathway activity analysis methods. JF - Brief Bioinform Y1 - 2019 A1 - Amadoz, Alicia A1 - Hidalgo, Marta R A1 - Cubuk, Cankut A1 - Carbonell-Caballero, José A1 - Dopazo, Joaquin KW - Algorithms KW - Humans KW - Postmortem Changes KW - Signal Transduction KW - Systems biology KW - Transcriptome AB -

Understanding the aspects of cell functionality that account for disease mechanisms or drug modes of action is a main challenge for precision medicine. Classical gene-based approaches ignore the modular nature of most human traits, whereas conventional pathway enrichment approaches produce only illustrative results of limited practical utility. Recently, a family of new methods has emerged that change the focus from the whole pathways to the definition of elementary subpathways within them that have any mechanistic significance and to the study of their activities. Thus, mechanistic pathway activity (MPA) methods constitute a new paradigm that allows recoding poorly informative genomic measurements into cell activity quantitative values and relate them to phenotypes. Here we provide a review on the MPA methods available and explain their contribution to systems medicine approaches for addressing challenges in the diagnostic and treatment of complex diseases.

VL - 20 IS - 5 U1 - https://www.ncbi.nlm.nih.gov/pubmed/29868818?dopt=Abstract ER - TY - JOUR T1 - Differential metabolic activity and discovery of therapeutic targets using summarized metabolic pathway models. JF - NPJ Syst Biol Appl Y1 - 2019 A1 - Cubuk, Cankut A1 - Hidalgo, Marta R A1 - Amadoz, Alicia A1 - Rian, Kinza A1 - Salavert, Francisco A1 - Pujana, Miguel A A1 - Mateo, Francesca A1 - Herranz, Carmen A1 - Carbonell-Caballero, José A1 - Dopazo, Joaquin KW - Computational Biology KW - Computer Simulation KW - Drug discovery KW - Gene Regulatory Networks KW - Humans KW - Internet KW - Metabolic Networks and Pathways KW - Models, Biological KW - Neoplasms KW - Phenotype KW - Software KW - Transcriptome AB -

In spite of the increasing availability of genomic and transcriptomic data, there is still a gap between the detection of perturbations in gene expression and the understanding of their contribution to the molecular mechanisms that ultimately account for the phenotype studied. Alterations in the metabolism are behind the initiation and progression of many diseases, including cancer. The wealth of available knowledge on metabolic processes can therefore be used to derive mechanistic models that link gene expression perturbations to changes in metabolic activity that provide relevant clues on molecular mechanisms of disease and drug modes of action (MoA). In particular, pathway modules, which recapitulate the main aspects of metabolism, are especially suitable for this type of modeling. We present Metabolizer, a web-based application that offers an intuitive, easy-to-use interactive interface to analyze differences in pathway metabolic module activities that can also be used for class prediction and in silico prediction of knock-out (KO) effects. Moreover, Metabolizer can automatically predict the optimal KO intervention for restoring a diseased phenotype. We provide different types of validations of some of the predictions made by Metabolizer. Metabolizer is a web tool that allows understanding molecular mechanisms of disease or the MoA of drugs within the context of the metabolism by using gene expression measurements. In addition, this tool automatically suggests potential therapeutic targets for individualized therapeutic interventions.

VL - 5 U1 - https://www.ncbi.nlm.nih.gov/pubmed/30854222?dopt=Abstract ER - TY - JOUR T1 - Precision medicine needs pioneering clinical bioinformaticians. JF - Brief Bioinform Y1 - 2019 A1 - Gómez-López, Gonzalo A1 - Dopazo, Joaquin A1 - Cigudosa, Juan C A1 - Valencia, Alfonso A1 - Al-Shahrour, Fátima KW - Cohort Studies KW - Computational Biology KW - Humans KW - Precision Medicine AB -

Success in precision medicine depends on accessing high-quality genetic and molecular data from large, well-annotated patient cohorts that couple biological samples to comprehensive clinical data, which in conjunction can lead to effective therapies. From such a scenario emerges the need for a new professional profile, an expert bioinformatician with training in clinical areas who can make sense of multi-omics data to improve therapeutic interventions in patients, and the design of optimized basket trials. In this review, we first describe the main policies and international initiatives that focus on precision medicine. Secondly, we review the currently ongoing clinical trials in precision medicine, introducing the concept of 'precision bioinformatics', and we describe current pioneering bioinformatics efforts aimed at implementing tools and computational infrastructures for precision medicine in health institutions around the world. Thirdly, we discuss the challenges related to the clinical training of bioinformaticians, and the urgent need for computational specialists capable of assimilating medical terminologies and protocols to address real clinical questions. We also propose some skills required to carry out common tasks in clinical bioinformatics and some tips for emergent groups. Finally, we explore the future perspectives and the challenges faced by precision medicine bioinformatics.

VL - 20 IS - 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/29077790?dopt=Abstract ER - TY - JOUR T1 - A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection JF - Nature Communications Y1 - 2018 A1 - Fourati, Slim A1 - Talla, Aarthi A1 - Mahmoudian, Mehrad A1 - Burkhart, Joshua G. A1 - Klén, Riku A1 - Henao, Ricardo A1 - Yu, Thomas A1 - Aydın, Zafer A1 - Yeung, Ka Yee A1 - Ahsen, Mehmet Eren A1 - Almugbel, Reem A1 - Jahandideh, Samad A1 - Liang, Xiao A1 - Nordling, Torbjörn E. M. A1 - Shiga, Motoki A1 - Stanescu, Ana A1 - Vogel, Robert A1 - Pandey, Gaurav A1 - Chiu, Christopher A1 - McClain, Micah T. A1 - Woods, Christopher W. A1 - Ginsburg, Geoffrey S. A1 - Elo, Laura L. A1 - Tsalik, Ephraim L. A1 - Mangravite, Lara M. A1 - Sieberts, Solveig K. VL - 9 UR - http://www.nature.com/articles/s41467-018-06735-8http://www.nature.com/articles/s41467-018-06735-8.pdfhttp://www.nature.com/articles/s41467-018-06735-8.pdfhttp://www.nature.com/articles/s41467-018-06735-8 IS - 1 JO - Nat Commun ER - TY - JOUR T1 - The effects of death and post-mortem cold ischemia on human tissue transcriptomes. JF - Nat Commun Y1 - 2018 A1 - Ferreira, Pedro G A1 - Muñoz-Aguirre, Manuel A1 - Reverter, Ferran A1 - Sá Godinho, Caio P A1 - Sousa, Abel A1 - Amadoz, Alicia A1 - Sodaei, Reza A1 - Hidalgo, Marta R A1 - Pervouchine, Dmitri A1 - Carbonell-Caballero, José A1 - Nurtdinov, Ramil A1 - Breschi, Alessandra A1 - Amador, Raziel A1 - Oliveira, Patrícia A1 - Cubuk, Cankut A1 - Curado, João A1 - Aguet, François A1 - Oliveira, Carla A1 - Dopazo, Joaquin A1 - Sammeth, Michael A1 - Ardlie, Kristin G A1 - Guigó, Roderic KW - Blood KW - Cold Ischemia KW - Death KW - Female KW - gene expression KW - Humans KW - Models, Biological KW - Postmortem Changes KW - RNA, Messenger KW - Stochastic Processes KW - Transcriptome AB -

Post-mortem tissues samples are a key resource for investigating patterns of gene expression. However, the processes triggered by death and the post-mortem interval (PMI) can significantly alter physiologically normal RNA levels. We investigate the impact of PMI on gene expression using data from multiple tissues of post-mortem donors obtained from the GTEx project. We find that many genes change expression over relatively short PMIs in a tissue-specific manner, but this potentially confounding effect in a biological analysis can be minimized by taking into account appropriate covariates. By comparing ante- and post-mortem blood samples, we identify the cascade of transcriptional events triggered by death of the organism. These events do not appear to simply reflect stochastic variation resulting from mRNA degradation, but active and ongoing regulation of transcription. Finally, we develop a model to predict the time since death from the analysis of the transcriptome of a few readily accessible tissues.

VL - 9 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/29440659?dopt=Abstract ER - TY - JOUR T1 - Gene Expression Integration into Pathway Modules Reveals a Pan-Cancer Metabolic Landscape. JF - Cancer Res Y1 - 2018 A1 - Cubuk, Cankut A1 - Hidalgo, Marta R A1 - Amadoz, Alicia A1 - Pujana, Miguel A A1 - Mateo, Francesca A1 - Herranz, Carmen A1 - Carbonell-Caballero, José A1 - Dopazo, Joaquin KW - Cell Line, Tumor KW - Cluster Analysis KW - Disease Progression KW - Gene Expression Profiling KW - Gene Expression Regulation, Neoplastic KW - Gene Regulatory Networks KW - Humans KW - Kaplan-Meier Estimate KW - Metabolome KW - mutation KW - Neoplasms KW - Oncogenes KW - Phenotype KW - Prognosis KW - RNA, Small Interfering KW - Sequence Analysis, RNA KW - Transcriptome KW - Treatment Outcome AB -

Metabolic reprogramming plays an important role in cancer development and progression and is a well-established hallmark of cancer. Despite its inherent complexity, cellular metabolism can be decomposed into functional modules that represent fundamental metabolic processes. Here, we performed a pan-cancer study involving 9,428 samples from 25 cancer types to reveal metabolic modules whose individual or coordinated activity predict cancer type and outcome, in turn highlighting novel therapeutic opportunities. Integration of gene expression levels into metabolic modules suggests that the activity of specific modules differs between cancers and the corresponding tissues of origin. Some modules may cooperate, as indicated by the positive correlation of their activity across a range of tumors. The activity of many metabolic modules was significantly associated with prognosis at a stronger magnitude than any of their constituent genes. Thus, modules may be classified as tumor suppressors and oncomodules according to their potential impact on cancer progression. Using this modeling framework, we also propose novel potential therapeutic targets that constitute alternative ways of treating cancer by inhibiting their reprogrammed metabolism. Collectively, this study provides an extensive resource of predicted cancer metabolic profiles and dependencies. Combining gene expression with metabolic modules identifies molecular mechanisms of cancer undetected on an individual gene level and allows discovery of new potential therapeutic targets. .

VL - 78 IS - 21 U1 - https://www.ncbi.nlm.nih.gov/pubmed/30135189?dopt=Abstract ER - TY - JOUR T1 - Genomics of the origin and evolution of Citrus. JF - Nature Y1 - 2018 A1 - Wu, Guohong Albert A1 - Terol, Javier A1 - Ibañez, Victoria A1 - López-García, Antonio A1 - Pérez-Román, Estela A1 - Borredá, Carles A1 - Domingo, Concha A1 - Tadeo, Francisco R A1 - Carbonell-Caballero, José A1 - Alonso, Roberto A1 - Curk, Franck A1 - Du, Dongliang A1 - Ollitrault, Patrick A1 - Roose, Mikeal L A1 - Dopazo, Joaquin A1 - Gmitter, Frederick G A1 - Rokhsar, Daniel S A1 - Talon, Manuel KW - Asia, Southeastern KW - Biodiversity KW - citrus KW - Crop Production KW - Evolution, Molecular KW - Genetic Speciation KW - Genome, Plant KW - Genomics KW - Haplotypes KW - Heterozygote KW - History, Ancient KW - Human Migration KW - Hybridization, Genetic KW - Phylogeny AB -

The genus Citrus, comprising some of the most widely cultivated fruit crops worldwide, includes an uncertain number of species. Here we describe ten natural citrus species, using genomic, phylogenetic and biogeographic analyses of 60 accessions representing diverse citrus germ plasms, and propose that citrus diversified during the late Miocene epoch through a rapid southeast Asian radiation that correlates with a marked weakening of the monsoons. A second radiation enabled by migration across the Wallace line gave rise to the Australian limes in the early Pliocene epoch. Further identification and analyses of hybrids and admixed genomes provides insights into the genealogy of major commercial cultivars of citrus. Among mandarins and sweet orange, we find an extensive network of relatedness that illuminates the domestication of these groups. Widespread pummelo admixture among these mandarins and its correlation with fruit size and acidity suggests a plausible role of pummelo introgression in the selection of palatable mandarins. This work provides a new evolutionary framework for the genus Citrus.

VL - 554 IS - 7692 U1 - https://www.ncbi.nlm.nih.gov/pubmed/29414943?dopt=Abstract ER - TY - JOUR T1 - Models of cell signaling uncover molecular mechanisms of high-risk neuroblastoma and predict disease outcome. JF - Biol Direct Y1 - 2018 A1 - Hidalgo, Marta R A1 - Amadoz, Alicia A1 - Cubuk, Cankut A1 - Carbonell-Caballero, José A1 - Dopazo, Joaquin KW - Computational Biology KW - Gene Expression Regulation, Neoplastic KW - Humans KW - JNK Mitogen-Activated Protein Kinases KW - Models, Theoretical KW - Neuroblastoma KW - Signal Transduction AB -

BACKGROUND: Despite the progress in neuroblastoma therapies the mortality of high-risk patients is still high (40-50%) and the molecular basis of the disease remains poorly known. Recently, a mathematical model was used to demonstrate that the network regulating stress signaling by the c-Jun N-terminal kinase pathway played a crucial role in survival of patients with neuroblastoma irrespective of their MYCN amplification status. This demonstrates the enormous potential of computational models of biological modules for the discovery of underlying molecular mechanisms of diseases.

RESULTS: Since signaling is known to be highly relevant in cancer, we have used a computational model of the whole cell signaling network to understand the molecular determinants of bad prognostic in neuroblastoma. Our model produced a comprehensive view of the molecular mechanisms of neuroblastoma tumorigenesis and progression.

CONCLUSION: We have also shown how the activity of signaling circuits can be considered a reliable model-based prognostic biomarker.

REVIEWERS: This article was reviewed by Tim Beissbarth, Wenzhong Xiao and Joanna Polanska. For the full reviews, please go to the Reviewers' comments section.

VL - 13 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/30134948?dopt=Abstract ER - TY - JOUR T1 - The modular network structure of the mutational landscape of Acute Myeloid Leukemia. JF - PLoS One Y1 - 2018 A1 - Ibáñez, Mariam A1 - Carbonell-Caballero, José A1 - Such, Esperanza A1 - García-Alonso, Luz A1 - Liquori, Alessandro A1 - López-Pavía, María A1 - LLop, Marta A1 - Alonso, Carmen A1 - Barragán, Eva A1 - Gómez-Seguí, Inés A1 - Neef, Alexander A1 - Hervás, David A1 - Montesinos, Pau A1 - Sanz, Guillermo A1 - Sanz, Miguel Angel A1 - Dopazo, Joaquin A1 - Cervera, José KW - Adult KW - Aged KW - Cytodiagnosis KW - Female KW - Gene Regulatory Networks KW - Genetic Association Studies KW - Genetic Heterogeneity KW - Humans KW - Karyotype KW - Leukemia, Myeloid, Acute KW - Male KW - Middle Aged KW - mutation KW - Neoplasm Proteins KW - Nucleophosmin KW - Prognosis KW - whole exome sequencing AB -

Acute myeloid leukemia (AML) is associated with the sequential accumulation of acquired genetic alterations. Although at diagnosis cytogenetic alterations are frequent in AML, roughly 50% of patients present an apparently normal karyotype (NK), leading to a highly heterogeneous prognosis. Due to this significant heterogeneity, it has been suggested that different molecular mechanisms may trigger the disease with diverse prognostic implications. We performed whole-exome sequencing (WES) of tumor-normal matched samples of de novo AML-NK patients lacking mutations in NPM1, CEBPA or FLT3-ITD to identify new gene mutations with potential prognostic and therapeutic relevance to patients with AML. Novel candidate-genes, together with others previously described, were targeted resequenced in an independent cohort of 100 de novo AML patients classified in the cytogenetic intermediate-risk (IR) category. A mean of 4.89 mutations per sample were detected in 73 genes, 35 of which were mutated in more than one patient. After a network enrichment analysis, we defined a single in silico model and established a set of seed-genes that may trigger leukemogenesis in patients with normal karyotype. The high heterogeneity of gene mutations observed in AML patients suggested that a specific alteration could not be as essential as the interaction of deregulated pathways.

VL - 13 IS - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/30303964?dopt=Abstract ER - TY - JOUR T1 - GGPS1 Mutation and Atypical Femoral Fractures with Bisphosphonates. JF - N Engl J Med Y1 - 2017 A1 - Roca-Ayats, Neus A1 - Balcells, Susana A1 - Garcia-Giralt, Natàlia A1 - Falcó-Mascaró, Maite A1 - Martínez-Gil, Núria A1 - Abril, Josep F A1 - Urreizti, Roser A1 - Dopazo, Joaquin A1 - Quesada-Gómez, José M A1 - Nogués, Xavier A1 - Mellibovsky, Leonardo A1 - Prieto-Alhambra, Daniel A1 - Dunford, James E A1 - Javaid, Muhammad K A1 - Russell, R Graham A1 - Grinberg, Daniel A1 - Díez-Pérez, Adolfo KW - Aged KW - Amino Acid Sequence KW - Bone Density Conservation Agents KW - Dimethylallyltranstransferase KW - Diphosphonates KW - Exome KW - Farnesyltranstransferase KW - Female KW - Femoral Fractures KW - Geranyltranstransferase KW - Humans KW - Middle Aged KW - mutation VL - 376 UR - http://www.nejm.org/doi/full/10.1056/NEJMc1612804 IS - 18 U1 - https://www.ncbi.nlm.nih.gov/pubmed/28467865?dopt=Abstract ER - TY - JOUR T1 - High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes. JF - Oncotarget Y1 - 2017 A1 - Hidalgo, Marta R A1 - Cubuk, Cankut A1 - Amadoz, Alicia A1 - Salavert, Francisco A1 - Carbonell-Caballero, José A1 - Dopazo, Joaquin KW - Computational Biology KW - gene expression KW - Gene Regulatory Networks KW - Humans KW - mutation KW - Neoplasms KW - Precision Medicine KW - Sequence Analysis, RNA KW - Signal Transduction AB -

Understanding the aspects of the cell functionality that account for disease or drug action mechanisms is a main challenge for precision medicine. Here we propose a new method that models cell signaling using biological knowledge on signal transduction. The method recodes individual gene expression values (and/or gene mutations) into accurate measurements of changes in the activity of signaling circuits, which ultimately constitute high-throughput estimations of cell functionalities caused by gene activity within the pathway. Moreover, such estimations can be obtained either at cohort-level, in case/control comparisons, or personalized for individual patients. The accuracy of the method is demonstrated in an extensive analysis involving 5640 patients from 12 different cancer types. Circuit activity measurements not only have a high diagnostic value but also can be related to relevant disease outcomes such as survival, and can be used to assess therapeutic interventions.

VL - 8 IS - 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/28042959?dopt=Abstract ER - TY - JOUR T1 - Reference genome assessment from a population scale perspective: an accurate profile of variability and noise. JF - Bioinformatics Y1 - 2017 A1 - Carbonell-Caballero, José A1 - Amadoz, Alicia A1 - Alonso, Roberto A1 - Hidalgo, Marta R A1 - Cubuk, Cankut A1 - Conesa, David A1 - López-Quílez, Antonio A1 - Dopazo, Joaquin KW - Animals KW - Genetic Variation KW - Genome KW - Genomics KW - Genotype KW - Humans KW - Models, Statistical KW - Quality Control KW - Reproducibility of Results KW - Software AB -

Motivation: Current plant and animal genomic studies are often based on newly assembled genomes that have not been properly consolidated. In this scenario, misassembled regions can easily lead to false-positive findings. Despite quality control scores are included within genotyping protocols, they are usually employed to evaluate individual sample quality rather than reference sequence reliability. We propose a statistical model that combines quality control scores across samples in order to detect incongruent patterns at every genomic region. Our model is inherently robust since common artifact signals are expected to be shared between independent samples over misassembled regions of the genome.

Results: The reliability of our protocol has been extensively tested through different experiments and organisms with accurate results, improving state-of-the-art methods. Our analysis demonstrates synergistic relations between quality control scores and allelic variability estimators, that improve the detection of misassembled regions, and is able to find strong artifact signals even within the human reference assembly. Furthermore, we demonstrated how our model can be trained to properly rank the confidence of a set of candidate variants obtained from new independent samples.

Availability and implementation: This tool is freely available at http://gitlab.com/carbonell/ces.

Contact: jcarbonell.cipf@gmail.com or joaquin.dopazo@juntadeandalucia.es.

Supplementary information: Supplementary data are available at Bioinformatics online.

VL - 33 UR - https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/btx482 IS - 22 U1 - https://www.ncbi.nlm.nih.gov/pubmed/28961772?dopt=Abstract ER - TY - JOUR T1 - VISMapper: ultra-fast exhaustive cartography of viral insertion sites for gene therapy. JF - BMC Bioinformatics Y1 - 2017 A1 - Juanes, José M A1 - Gallego, Asunción A1 - Tárraga, Joaquín A1 - Chaves, Felipe J A1 - Marin-Garcia, Pablo A1 - Medina, Ignacio A1 - Arnau, Vicente A1 - Dopazo, Joaquin KW - Base Sequence KW - Genetic Therapy KW - Genetic Vectors KW - High-Throughput Nucleotide Sequencing KW - Humans KW - Internet KW - User-Computer Interface KW - Virus Integration AB -

BACKGROUND: The possibility of integrating viral vectors to become a persistent part of the host genome makes them a crucial element of clinical gene therapy. However, viral integration has associated risks, such as the unintentional activation of oncogenes that can result in cancer. Therefore, the analysis of integration sites of retroviral vectors is a crucial step in developing safer vectors for therapeutic use.

RESULTS: Here we present VISMapper, a vector integration site analysis web server, to analyze next-generation sequencing data for retroviral vector integration sites. VISMapper can be found at: http://vismapper.babelomics.org .

CONCLUSIONS: Because it uses novel mapping algorithms VISMapper is remarkably faster than previous available programs. It also provides a useful graphical interface to analyze the integration sites found in the genomic context.

VL - 18 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/28931371?dopt=Abstract ER - TY - JOUR T1 - Whole exome sequencing coupled with unbiased functional analysis reveals new Hirschsprung disease genes. JF - Genome biology Y1 - 2017 A1 - Gui, Hongsheng A1 - Schriemer, Duco A1 - Cheng, William W A1 - Chauhan, Rajendra K A1 - Antiňolo, Guillermo A1 - Berrios, Courtney A1 - Bleda, Marta A1 - Brooks, Alice S A1 - Brouwer, Rutger W W A1 - Burns, Alan J A1 - Cherny, Stacey S A1 - Dopazo, Joaquin A1 - Eggen, Bart J L A1 - Griseri, Paola A1 - Jalloh, Binta A1 - Le, Thuy-Linh A1 - Lui, Vincent C H A1 - Luzón-Toro, Berta A1 - Matera, Ivana A1 - Ngan, Elly S W A1 - Pelet, Anna A1 - Ruiz-Ferrer, Macarena A1 - Sham, Pak C A1 - Shepherd, Iain T A1 - So, Man-Ting A1 - Sribudiani, Yunia A1 - Tang, Clara S M A1 - van den Hout, Mirjam C G N A1 - van der Linde, Herma C A1 - van Ham, Tjakko J A1 - van IJcken, Wilfred F J A1 - Verheij, Joke B G M A1 - Amiel, Jeanne A1 - Borrego, Salud A1 - Ceccherini, Isabella A1 - Chakravarti, Aravinda A1 - Lyonnet, Stanislas A1 - Tam, Paul K H A1 - Garcia-Barceló, Maria-Mercè A1 - Hofstra, Robert Mw KW - Hirschprung KW - Rare Disease KW - WES AB - BACKGROUND: Hirschsprung disease (HSCR), which is congenital obstruction of the bowel, results from a failure of enteric nervous system (ENS) progenitors to migrate, proliferate, differentiate, or survive within the distal intestine. Previous studies that have searched for genes underlying HSCR have focused on ENS-related pathways and genes not fitting the current knowledge have thus often been ignored. We identify and validate novel HSCR genes using whole exome sequencing (WES), burden tests, in silico prediction, unbiased in vivo analyses of the mutated genes in zebrafish, and expression analyses in zebrafish, mouse, and human. RESULTS: We performed de novo mutation (DNM) screening on 24 HSCR trios. We identify 28 DNMs in 21 different genes. Eight of the DNMs we identified occur in RET, the main HSCR gene, and the remaining 20 DNMs reside in genes not reported in the ENS. Knockdown of all 12 genes with missense or loss-of-function DNMs showed that the orthologs of four genes (DENND3, NCLN, NUP98, and TBATA) are indispensable for ENS development in zebrafish, and these results were confirmed by CRISPR knockout. These genes are also expressed in human and mouse gut and/or ENS progenitors. Importantly, the encoded proteins are linked to neuronal processes shared by the central nervous system and the ENS. CONCLUSIONS: Our data open new fields of investigation into HSCR pathology and provide novel insights into the development of the ENS. Moreover, the study demonstrates that functional analyses of genes carrying DNMs are warranted to delineate the full genetic architecture of rare complex diseases. VL - 18 UR - http://genomebiology.biomedcentral.com/articles/10.1186/s13059-017-1174-6 ER - TY - JOUR T1 - Whole exome sequencing coupled with unbiased functional analysis reveals new Hirschsprung disease genes JF - Genome Biology Y1 - 2017 A1 - Gui, Hongsheng A1 - Schriemer, Duco A1 - Cheng, William W. A1 - Chauhan, Rajendra K. A1 - Antiňolo, Guillermo A1 - Berrios, Courtney A1 - Bleda, Marta A1 - Brooks, Alice S. A1 - Brouwer, Rutger W. W. A1 - Burns, Alan J. A1 - Cherny, Stacey S. A1 - Dopazo, Joaquin A1 - Eggen, Bart J. L. A1 - Griseri, Paola A1 - Jalloh, Binta A1 - Le, Thuy-Linh A1 - Lui, Vincent C. H. A1 - Luzón-Toro, Berta A1 - Matera, Ivana A1 - Ngan, Elly S. W. A1 - Pelet, Anna A1 - Ruiz-Ferrer, Macarena A1 - Sham, Pak C. A1 - Shepherd, Iain T. A1 - So, Man-Ting A1 - Sribudiani, Yunia A1 - Tang, Clara S. M. A1 - van den Hout, Mirjam C. G. N. A1 - van der Linde, Herma C. A1 - van Ham, Tjakko J. A1 - van IJcken, Wilfred F. J. A1 - Verheij, Joke B. G. M. A1 - Amiel, Jeanne A1 - Borrego, Salud A1 - Ceccherini, Isabella A1 - Chakravarti, Aravinda A1 - Lyonnet, Stanislas A1 - Tam, Paul K. H. A1 - Garcia-Barceló, Maria-Mercè A1 - Hofstra, Robert M. W. VL - 18 UR - http://genomebiology.biomedcentral.com/articles/10.1186/s13059-017-1174-6http://link.springer.com/content/pdf/10.1186/s13059-017-1174-6.pdf IS - 1 JO - Genome Biol ER - TY - JOUR T1 - 267 Spanish exomes reveal population-specific differences in disease-related genetic variation. JF - Molecular biology and evolution Y1 - 2016 A1 - Joaquín Dopazo A1 - Amadoz, Alicia A1 - Bleda, Marta A1 - García-Alonso, Luz A1 - Alemán, Alejandro A1 - Garcia-Garcia, Francisco A1 - Rodriguez, Juan A A1 - Daub, Josephine T A1 - Muntané, Gerard A1 - Antonio Rueda A1 - Vela-Boza, Alicia A1 - López-Domingo, Francisco J A1 - Florido, Javier P A1 - Arce, Pablo A1 - Ruiz-Ferrer, Macarena A1 - Méndez-Vidal, Cristina A1 - Arnold, Todd E A1 - Spleiss, Olivia A1 - Alvarez-Tejado, Miguel A1 - Navarro, Arcadi A1 - Bhattacharya, Shomi S A1 - Borrego, Salud A1 - Santoyo-López, Javier A1 - Antiňolo, Guillermo KW - disease KW - NGS KW - polymorphisms KW - Population genomics KW - prioritization KW - SNP AB - Recent results from large-scale genomic projects suggest that allele frequencies, which are highly relevant for medical purposes, differ considerably across different populations. The need for a detailed catalogue of local variability motivated the whole exome sequencing of 267 unrelated individuals, representative of the healthy Spanish population. Like in other studies, a considerable number of rare variants were found (almost one third of the described variants). There were also relevant differences in allelic frequencies in polymorphic variants, including about 10,000 polymorphisms private to the Spanish population. The allelic frequencies of variants conferring susceptibility to complex diseases (including cancer, schizophrenia, Alzheimer disease, type 2 diabetes and other pathologies) were overall similar to those of other populations. However, the trend is the opposite for variants linked to Mendelian and rare diseases (including several retinal degenerative dystrophies and cardiomyopathies) that show marked frequency differences between populations. Interestingly, a correspondence between differences in allelic frequencies and disease prevalence was found, highlighting the relevance of frequency differences in disease risk. These differences are also observed in variants that disrupt known drug binding sites, suggesting an important role for local variability in population-specific drug resistances or adverse effects. We have made the Spanish population variant server web page that contains population frequency information for the complete list of 170,888 variant positions we found publicly available (http://spv.babelomics.org/), We show that it if fundamental to determine population-specific variant frequencies in order to distinguish real disease associations from population-specific polymorphisms. UR - https://mbe.oxfordjournals.org/content/early/2016/02/17/molbev.msw005.full ER - TY - JOUR T1 - Actionable pathways: interactive discovery of therapeutic targets using signaling pathway models. JF - Nucleic acids research Y1 - 2016 A1 - Salavert, Francisco A1 - Hidago, Marta R A1 - Amadoz, Alicia A1 - Cubuk, Cankut A1 - Medina, Ignacio A1 - Crespo, Daniel A1 - Carbonell-Caballero, José A1 - Joaquín Dopazo KW - actionable genes KW - Disease mechanism KW - drug action mechanism KW - Drug discovery KW - pathway analysis KW - personalized medicine KW - signalling KW - therapeutic targets AB - The discovery of actionable targets is crucial for targeted therapies and is also a constituent part of the drug discovery process. The success of an intervention over a target depends critically on its contribution, within the complex network of gene interactions, to the cellular processes responsible for disease progression or therapeutic response. Here we present PathAct, a web server that predicts the effect that interventions over genes (inhibitions or activations that simulate knock-outs, drug treatments or over-expressions) can have over signal transmission within signaling pathways and, ultimately, over the cell functionalities triggered by them. PathAct implements an advanced graphical interface that provides a unique interactive working environment in which the suitability of potentially actionable genes, that could eventually become drug targets for personalized or individualized therapies, can be easily tested. The PathAct tool can be found at: http://pathact.babelomics.org. UR - http://nar.oxfordjournals.org/content/early/2016/05/02/nar.gkw369.full ER - TY - JOUR T1 - Assessment of Targeted Next-Generation Sequencing as a Tool for the Diagnosis of Charcot-Marie-Tooth Disease and Hereditary Motor Neuropathy. JF - The Journal of molecular diagnostics : JMD Y1 - 2016 A1 - Lupo, Vincenzo A1 - Garcia-Garcia, Francisco A1 - Sancho, Paula A1 - Tello, Cristina A1 - García-Romero, Mar A1 - Villarreal, Liliana A1 - Alberti, Antonia A1 - Sivera, Rafael A1 - Joaquín Dopazo A1 - Pascual-Pascual, Samuel I A1 - Márquez-Infante, Celedonio A1 - Casasnovas, Carlos A1 - Sevilla, Teresa A1 - Espinós, Carmen KW - Charcot-Marie-Tooth KW - CMT KW - Diagnostic KW - NGS KW - Panels KW - rare diseases KW - Targeted resequencing AB - Charcot-Marie-Tooth disease is characterized by broad genetic heterogeneity with >50 known disease-associated genes. Mutations in some of these genes can cause a pure motor form of hereditary motor neuropathy, the genetics of which are poorly characterized. We designed a panel comprising 56 genes associated with Charcot-Marie-Tooth disease/hereditary motor neuropathy. We validated this diagnostic tool by first testing 11 patients with pathological mutations. A cohort of 33 affected subjects was selected for this study. The DNAJB2 c.352+1G>A mutation was detected in two cases; novel changes and/or variants with low frequency (<1%) were found in 12 cases. There were no candidate variants in 18 cases, and amplification failed for one sample. The DNAJB2 c.352+1G>A mutation was also detected in three additional families. On haplotype analysis, all of the patients from these five families shared the same haplotype; therefore, the DNAJB2 c.352+1G>A mutation may be a founder event. Our gene panel allowed us to perform a very rapid and cost-effective screening of genes involved in Charcot-Marie-Tooth disease/hereditary motor neuropathy. Our diagnostic strategy was robust in terms of both coverage and read depth for all of the genes and patient samples. These findings demonstrate the difficulty in achieving a definitive molecular diagnosis because of the complexity of interpreting new variants and the genetic heterogeneity that is associated with these neuropathies. UR - http://www.sciencedirect.com/science/article/pii/S1525157815002615 ER - TY - JOUR T1 - HPG pore: an efficient and scalable framework for nanopore sequencing data JF - BMC Bioinformatics Y1 - 2016 A1 - Tárraga, Joaquín A1 - Gallego, Asunción A1 - Arnau, Vicente A1 - Medina, Ignacio A1 - Dopazo, Joaquin VL - 17 UR - http://www.biomedcentral.com/1471-2105/17/107http://link.springer.com/content/pdf/10.1186/s12859-016-0966-0 IS - 1 JO - BMC Bioinformatics ER - TY - JOUR T1 - HPG pore: an efficient and scalable framework for nanopore sequencing data. JF - BMC bioinformatics Y1 - 2016 A1 - Tárraga, Joaquín A1 - Gallego, Asunción A1 - Arnau, Vicente A1 - Medina, Ignacio A1 - Dopazo, Joaquin KW - hadoop KW - HPC KW - nanopore KW - NGS AB - BACKGROUND: The use of nanopore technologies is expected to spread in the future because they are portable and can sequence long fragments of DNA molecules without prior amplification. The first nanopore sequencer available, the MinION™ from Oxford Nanopore Technologies, is a USB-connected, portable device that allows real-time DNA analysis. In addition, other new instruments are expected to be released soon, which promise to outperform the current short-read technologies in terms of throughput. Despite the flood of data expected from this technology, the data analysis solutions currently available are only designed to manage small projects and are not scalable. RESULTS: Here we present HPG Pore, a toolkit for exploring and analysing nanopore sequencing data. HPG Pore can run on both individual computers and in the Hadoop distributed computing framework, which allows easy scale-up to manage the large amounts of data expected to result from extensive use of nanopore technologies in the future. CONCLUSIONS: HPG Pore allows for virtually unlimited sequencing data scalability, thus guaranteeing its continued management in near future scenarios. HPG Pore is available in GitHub at http://github.com/opencb/hpg-pore . VL - 17 UR - http://www.biomedcentral.com/1471-2105/17/107 ER - TY - JOUR T1 - Human DNA methylomes of neurodegenerative diseases show common epigenomic patterns. JF - Transl Psychiatry Y1 - 2016 A1 - Sanchez-Mut, J V A1 - Heyn, H A1 - Vidal, E A1 - Moran, S A1 - Sayols, S A1 - Delgado-Morales, R A1 - Schultz, M D A1 - Ansoleaga, B A1 - Garcia-Esparcia, P A1 - Pons-Espinal, M A1 - de Lagran, M M A1 - Dopazo, J A1 - Rabano, A A1 - Avila, J A1 - Dierssen, M A1 - Lott, I A1 - Ferrer, I A1 - Ecker, J R A1 - Esteller, M KW - Adult KW - Aged KW - Aged, 80 and over KW - DNA Methylation KW - Epigenomics KW - Female KW - Humans KW - Male KW - Middle Aged KW - neurodegenerative diseases KW - Prefrontal Cortex KW - Tissue Array Analysis AB -

Different neurodegenerative disorders often show similar lesions, such as the presence of amyloid plaques, TAU-neurotangles and synuclein inclusions. The genetically inherited forms are rare, so we wondered whether shared epigenetic aberrations, such as those affecting DNA methylation, might also exist. The studied samples were gray matter samples from the prefrontal cortex of control and neurodegenerative disease-associated cases. We performed the DNA methylation analyses of Alzheimer's disease, dementia with Lewy bodies, Parkinson's disease and Alzheimer-like neurodegenerative profile associated with Down's syndrome samples. The DNA methylation landscapes obtained show that neurodegenerative diseases share similar aberrant CpG methylation shifts targeting a defined gene set. Our findings suggest that neurodegenerative disorders might have similar pathogenetic mechanisms that subsequently evolve into different clinical entities. The identified aberrant DNA methylation changes can be used as biomarkers of the disorders and as potential new targets for the development of new therapies.

VL - 6 U1 - https://www.ncbi.nlm.nih.gov/pubmed/26784972?dopt=Abstract ER - TY - JOUR T1 - Identification of the Photoreceptor Transcriptional Co-Repressor SAMD11 as Novel Cause of Autosomal Recessive Retinitis Pigmentosa. JF - Sci Rep Y1 - 2016 A1 - Corton, M A1 - Avila-Fernández, A A1 - Campello, L A1 - Sánchez, M A1 - Benavides, B A1 - López-Molina, M I A1 - Fernández-Sánchez, L A1 - Sánchez-Alcudia, R A1 - da Silva, L R J A1 - Reyes, N A1 - Martín-Garrido, E A1 - Zurita, O A1 - Fernández-San José, P A1 - Pérez-Carro, R A1 - García-García, F A1 - Dopazo, J A1 - García-Sandoval, B A1 - Cuenca, N A1 - Ayuso, C KW - Aged KW - Animals KW - Co-Repressor Proteins KW - Codon, Nonsense KW - Cohort Studies KW - Comparative Genomic Hybridization KW - Consanguinity KW - DNA Mutational Analysis KW - Exome KW - Eye Proteins KW - Female KW - Gene Expression Regulation KW - Genes, Recessive KW - Homeodomain Proteins KW - Homozygote KW - Humans KW - Male KW - Mice KW - Middle Aged KW - Polymorphism, Single Nucleotide KW - Protein Interaction Mapping KW - Retina KW - Retinal Dystrophies KW - Retinal Rod Photoreceptor Cells KW - Retinitis pigmentosa KW - Spain KW - Trans-Activators KW - Transcription Factors AB -

Retinitis pigmentosa (RP), the most frequent form of inherited retinal dystrophy is characterized by progressive photoreceptor degeneration. Many genes have been implicated in RP development, but several others remain to be identified. Using a combination of homozygosity mapping, whole-exome and targeted next-generation sequencing, we found a novel homozygous nonsense mutation in SAMD11 in five individuals diagnosed with adult-onset RP from two unrelated consanguineous Spanish families. SAMD11 is ortholog to the mouse major retinal SAM domain (mr-s) protein that is implicated in CRX-mediated transcriptional regulation in the retina. Accordingly, protein-protein network analysis revealed a significant interaction of SAMD11 with CRX. Immunoblotting analysis confirmed strong expression of SAMD11 in human retina. Immunolocalization studies revealed SAMD11 was detected in the three nuclear layers of the human retina and interestingly differential expression between cone and rod photoreceptors was observed. Our study strongly implicates SAMD11 as novel cause of RP playing an important role in the pathogenesis of human degeneration of photoreceptors.

VL - 6 U1 - https://www.ncbi.nlm.nih.gov/pubmed/27734943?dopt=Abstract ER - TY - JOUR T1 - Improving the management of Inherited Retinal Dystrophies by targeted sequencing of a population-specific gene panel. JF - Sci Rep Y1 - 2016 A1 - Bravo-Gil, Nereida A1 - Méndez-Vidal, Cristina A1 - Romero-Pérez, Laura A1 - González-del Pozo, María A1 - Rodríguez-de la Rúa, Enrique A1 - Dopazo, Joaquin A1 - Borrego, Salud A1 - Antiňolo, Guillermo KW - Alleles KW - Computer Simulation KW - DNA Copy Number Variations KW - DNA Mutational Analysis KW - Eye Proteins KW - Gene Library KW - Genetic Association Studies KW - Genetic Heterogeneity KW - Genetic Therapy KW - High-Throughput Nucleotide Sequencing KW - Humans KW - mutation KW - Phenotype KW - Retinal Dystrophies AB -

Next-generation sequencing (NGS) has overcome important limitations to the molecular diagnosis of Inherited Retinal Dystrophies (IRD) such as the high clinical and genetic heterogeneity and the overlapping phenotypes. The purpose of this study was the identification of the genetic defect in 32 Spanish families with different forms of IRD. With that aim, we implemented a custom NGS panel comprising 64 IRD-associated genes in our population, and three disease-associated intronic regions. A total of 37 pathogenic mutations (14 novels) were found in 73% of IRD patients ranging from 50% for autosomal dominant cases, 75% for syndromic cases, 83% for autosomal recessive cases, and 100% for X-linked cases. Additionally, unexpected phenotype-genotype correlations were found in 6 probands, which led to the refinement of their clinical diagnoses. Furthermore, intra- and interfamilial phenotypic variability was observed in two cases. Moreover, two cases unsuccessfully analysed by exome sequencing were resolved by applying this panel. Our results demonstrate that this hypothesis-free approach based on frequently mutated, population-specific loci is highly cost-efficient for the routine diagnosis of this heterogeneous condition and allows the unbiased analysis of a miscellaneous cohort. The molecular information found here has aid clinical diagnosis and has improved genetic counselling and patient management.

VL - 6 U1 - https://www.ncbi.nlm.nih.gov/pubmed/27032803?dopt=Abstract ER - TY - JOUR T1 - Screening of CD96 and ASXL1 in 11 patients with Opitz C or Bohring-Opitz syndromes. JF - Am J Med Genet A Y1 - 2016 A1 - Urreizti, Roser A1 - Roca-Ayats, Neus A1 - Trepat, Judith A1 - Garcia-Garcia, Francisco A1 - Alemán, Alejandro A1 - Orteschi, Daniela A1 - Marangi, Giuseppe A1 - Neri, Giovanni A1 - Opitz, John M A1 - Dopazo, Joaquin A1 - Cormand, Bru A1 - Vilageliu, Lluïsa A1 - Balcells, Susana A1 - Grinberg, Daniel KW - Adolescent KW - Antigens, CD KW - Child KW - Child, Preschool KW - Craniosynostoses KW - Exome KW - Female KW - High-Throughput Nucleotide Sequencing KW - Humans KW - Infant KW - Intellectual Disability KW - Male KW - mutation KW - Pedigree KW - Phenotype KW - Prognosis KW - Repressor Proteins AB -

Opitz C trigonocephaly (or Opitz C syndrome, OTCS) and Bohring-Opitz syndrome (BOS or C-like syndrome) are two rare genetic disorders with phenotypic overlap. The genetic causes of these diseases are not understood. However, two genes have been associated with OTCS or BOS with dominantly inherited de novo mutations. Whereas CD96 has been related to OTCS (one case) and to BOS (one case), ASXL1 has been related to BOS only (several cases). In this study we analyze CD96 and ASXL1 in a group of 11 affected individuals, including 2 sibs, 10 of them were diagnosed with OTCS, and one had a BOS phenotype. Exome sequences were available on six patients with OTCS and three parent pairs. Thus, we could analyze the CD96 and ASXL1 sequences in these patients bioinformatically. Sanger sequencing of all exons of CD96 and ASXL1 was carried out in the remaining patients. Detailed scrutiny of the sequences and assessment of variants allowed us to exclude putative pathogenic and private mutations in all but one of the patients. In this patient (with BOS) we identified a de novo mutation in ASXL1 (c.2100dupT). By nature and location within the gene, this mutation resembles those previously described in other BOS patients and we conclude that it may be responsible for the condition. Our results indicate that in 10 of 11, the disease (OTCS or BOS) cannot be explained by small changes in CD96 or ASXL1. However, the cohort is too small to make generalizations about the genetic etiology of these diseases.

VL - 170A IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/26768331?dopt=Abstract ER - TY - JOUR T1 - Web-based network analysis and visualization using CellMaps. JF - Bioinformatics Y1 - 2016 A1 - Salavert, Francisco A1 - García-Alonso, Luz A1 - Sánchez, Rubén A1 - Alonso, Roberto A1 - Bleda, Marta A1 - Medina, Ignacio A1 - Dopazo, Joaquin KW - Biochemical Phenomena KW - Internet KW - Software AB -

UNLABELLED: : CellMaps is an HTML5 open-source web tool that allows displaying, editing, exploring and analyzing biological networks as well as integrating metadata into them. Computations and analyses are remotely executed in high-end servers, and all the functionalities are available through RESTful web services. CellMaps can easily be integrated in any web page by using an available JavaScript API.

AVAILABILITY AND IMPLEMENTATION: The application is available at: http://cellmaps.babelomics.org/ and the code can be found in: https://github.com/opencb/cell-maps The client is implemented in JavaScript and the server in C and Java.

CONTACT: jdopazo@cipf.es

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

VL - 32 IS - 19 U1 - https://www.ncbi.nlm.nih.gov/pubmed/27296979?dopt=Abstract ER - TY - JOUR T1 - Whole exome sequencing of Rett syndrome-like patients reveals the mutational diversity of the clinical phenotype. JF - Hum Genet Y1 - 2016 A1 - Lucariello, Mario A1 - Vidal, Enrique A1 - Vidal, Silvia A1 - Saez, Mauricio A1 - Roa, Laura A1 - Huertas, Dori A1 - Pineda, Mercè A1 - Dalfó, Esther A1 - Dopazo, Joaquin A1 - Jurado, Paola A1 - Armstrong, Judith A1 - Esteller, Manel KW - Adolescent KW - Adult KW - Animals KW - Caenorhabditis elegans KW - Carrier Proteins KW - Cell Cycle Proteins KW - Child KW - Child, Preschool KW - DNA Mutational Analysis KW - Exome KW - Female KW - Forkhead Transcription Factors KW - Genetic Variation KW - High-Throughput Nucleotide Sequencing KW - Humans KW - Methyl-CpG-Binding Protein 2 KW - mutation KW - Nerve Tissue Proteins KW - Protein Serine-Threonine Kinases KW - Receptors, Nicotinic KW - Rett Syndrome AB -

Classical Rett syndrome (RTT) is a neurodevelopmental disorder where most of cases carry MECP2 mutations. Atypical RTT variants involve mutations in CDKL5 and FOXG1. However, a subset of RTT patients remains that do not carry any mutation in the described genes. Whole exome sequencing was carried out in a cohort of 21 female probands with clinical features overlapping with those of RTT, but without mutations in the customarily studied genes. Candidates were functionally validated by assessing the appearance of a neurological phenotype in Caenorhabditis elegans upon disruption of the corresponding ortholog gene. We detected pathogenic variants that accounted for the RTT-like phenotype in 14 (66.6 %) patients. Five patients were carriers of mutations in genes already known to be associated with other syndromic neurodevelopmental disorders. We determined that the other patients harbored mutations in genes that have not previously been linked to RTT or other neurodevelopmental syndromes, such as the ankyrin repeat containing protein ANKRD31 or the neuronal acetylcholine receptor subunit alpha-5 (CHRNA5). Furthermore, worm assays demonstrated that mutations in the studied candidate genes caused locomotion defects. Our findings indicate that mutations in a variety of genes contribute to the development of RTT-like phenotypes.

VL - 135 IS - 12 U1 - https://www.ncbi.nlm.nih.gov/pubmed/27541642?dopt=Abstract ER - TY - JOUR T1 - Assessing the impact of mutations found in next generation sequencing data over human signaling pathways. JF - Nucleic acids research Y1 - 2015 A1 - Hernansaiz-Ballesteros, Rosa D A1 - Salavert, Francisco A1 - Sebastián-Leon, Patricia A1 - Alemán, Alejandro A1 - Medina, Ignacio A1 - Joaquín Dopazo KW - NGS KW - pathways KW - signalling KW - Systems biology AB - Modern sequencing technologies produce increasingly detailed data on genomic variation. However, conventional methods for relating either individual variants or mutated genes to phenotypes present known limitations given the complex, multigenic nature of many diseases or traits. Here we present PATHiVar, a web-based tool that integrates genomic variation data with gene expression tissue information. PATHiVar constitutes a new generation of genomic data analysis methods that allow studying variants found in next generation sequencing experiment in the context of signaling pathways. Simple Boolean models of pathways provide detailed descriptions of the impact of mutations in cell functionality so as, recurrences in functionality failures can easily be related to diseases, even if they are produced by mutations in different genes. Patterns of changes in signal transmission circuits, often unpredictable from individual genes mutated, correspond to patterns of affected functionalities that can be related to complex traits such as disease progression, drug response, etc. PATHiVar is available at: http://pathivar.babelomics.org. VL - 43 UR - http://nar.oxfordjournals.org/content/43/W1/W270 ER - TY - JOUR T1 - Babelomics 5.0: functional interpretation for new generations of genomic data. JF - Nucleic acids research Y1 - 2015 A1 - Alonso, Roberto A1 - Salavert, Francisco A1 - Garcia-Garcia, Francisco A1 - Carbonell-Caballero, José A1 - Bleda, Marta A1 - García-Alonso, Luz A1 - Sanchis-Juan, Alba A1 - Perez-Gil, Daniel A1 - Marin-Garcia, Pablo A1 - Sánchez, Rubén A1 - Cubuk, Cankut A1 - Hidalgo, Marta R A1 - Amadoz, Alicia A1 - Hernansaiz-Ballesteros, Rosa D A1 - Alemán, Alejandro A1 - Tárraga, Joaquín A1 - Montaner, David A1 - Medina, Ignacio A1 - Dopazo, Joaquin KW - babelomics KW - data integration KW - gene set analysis KW - interactome KW - network analysis KW - NGS KW - RNA-seq KW - Systems biology KW - transcriptomics AB - Babelomics has been running for more than one decade offering a user-friendly interface for the functional analysis of gene expression and genomic data. Here we present its fifth release, which includes support for Next Generation Sequencing data including gene expression (RNA-seq), exome or genome resequencing. Babelomics has simplified its interface, being now more intuitive. Improved visualization options, such as a genome viewer as well as an interactive network viewer, have been implemented. New technical enhancements at both, client and server sides, makes the user experience faster and more dynamic. Babelomics offers user-friendly access to a full range of methods that cover: (i) primary data analysis, (ii) a variety of tests for different experimental designs and (iii) different enrichment and network analysis algorithms for the interpretation of the results of such tests in the proper functional context. In addition to the public server, local copies of Babelomics can be downloaded and installed. Babelomics is freely available at: http://www.babelomics.org. VL - 43 UR - http://nar.oxfordjournals.org/content/43/W1/W117 ER - TY - JOUR T1 - BRCA1 Alternative splicing landscape in breast tissue samples. JF - BMC cancer Y1 - 2015 A1 - Romero, Atocha A1 - Garcia-Garcia, Francisco A1 - López-Perolio, Irene A1 - Ruiz de Garibay, Gorka A1 - García-Sáenz, José A A1 - Garre, Pilar A1 - Ayllón, Patricia A1 - Benito, Esperanza A1 - Joaquín Dopazo A1 - Díaz-Rubio, Eduardo A1 - Caldés, Trinidad A1 - de la Hoya, Miguel AB - BACKGROUND: BRCA1 is a key protein in cell network, involved in DNA repair pathways and cell cycle. Recently, the ENIGMA consortium has reported a high number of alternative splicing (AS) events at this locus in blood-derived samples. However, BRCA1 splicing pattern in breast tissue samples is unknown. Here, we provide an accurate description of BRCA1 splicing events distribution in breast tissue samples. METHODS: BRCA1 splicing events were scanned in 70 breast tumor samples, 4 breast samples from healthy individuals and in 72 blood-derived samples by capillary electrophoresis (capillary EP). Molecular subtype was identified in all tumor samples. Splicing events were considered predominant if their relative expression level was at least the 10% of the full-length reference signal. RESULTS: 54 BRCA1 AS events were identified, 27 of them were annotated as predominant in at least one sample. Δ5q, Δ13, Δ9, Δ5 and ▼1aA were significantly more frequently annotated as predominant in breast tumor samples than in blood-derived samples. Predominant splicing events were, on average, more frequent in tumor samples than in normal breast tissue samples (P = 0.010). Similarly, likely inactivating splicing events (PTC-NMDs, Non-Coding, Δ5 and Δ18) were more frequently annotated as predominant in tumor than in normal breast samples (P = 0.020), whereas there were no significant differences for other splicing events (No-Fs) frequency distribution between tumor and normal breast samples (P = 0.689). CONCLUSIONS: Our results complement recent findings by the ENIGMA consortium, demonstrating that BRCA1 AS, despite its tremendous complexity, is similar in breast and blood samples, with no evidences for tissue specific AS events. Further on, we conclude that somatic inactivation of BRCA1 through spliciogenic mutations is, at best, a rare mechanism in breast carcinogenesis, albeit our data detects an excess of likely inactivating AS events in breast tumor samples. VL - 15 UR - http://www.biomedcentral.com/1471-2407/15/219 ER - TY - JOUR T1 - Comparative gene expression study of the vestibular organ of the Igf1 deficient mouse using whole-transcript arrays. JF - Hearing research Y1 - 2015 A1 - Rodríguez-de la Rosa, Lourdes A1 - Sánchez-Calderón, Hortensia A1 - Contreras, Julio A1 - Murillo-Cuesta, Silvia A1 - Falagan, Sandra A1 - Avendaño, Carlos A1 - Joaquín Dopazo A1 - Varela-Nieto, Isabel A1 - Milo, Marta AB - The auditory and vestibular organs form the inner ear and have a common developmental origin. Insulin like growth factor 1 (IGF-1) has a central role in the development of the cochlea and maintenance of hearing. Its deficiency causes sensorineural hearing loss in man and mice. During chicken early development, IGF-1 modulates neurogenesis of the cochleovestibular ganglion but no further studies have been conducted to explore the potential role of IGF-1 in the vestibular system. In this study we have compared the whole transcriptome of the vestibular organ from wild type and Igf1(-/-) mice at different developmental and postnatal times. RNA was prepared from E18.5, P15 and P90 vestibular organs of Igf1(-/-) and Igf1(+/+) mice and the transcriptome analysed in triplicates using Affymetrix® Mouse Gene 1.1 ST Array Plates. These plates are whole-transcript arrays that include probes to measure both messenger (mRNA) and long intergenic non-coding RNA transcripts (lincRNA), with a coverage of over 28 thousand coding transcripts and over 7 thousands non-coding transcripts. Given the complexity of the data we used two different methods VSN-RMA and mmBGX to analyse and compare the data. This is to better evaluate the number of false positives and to quantify uncertainty of low signals. We identified a number of differentially expressed genes that we described using functional analysis and validated using RT-qPCR. The morphology of the vestibular organ did not show differences between genotypes and no evident alterations were observed in the vestibular sensory areas of the null mice. However, well-defined cellular alterations were found in the vestibular neurons with respect their number and size. Although these mice did not show a dramatic vestibular phenotype, we conducted a functional analysis on differentially expressed genes between genotypes and across time. This was with the aim to identify new pathways that are involved in the development of the vestibular organ as well as pathways that maybe affected by the lack of IGF-1 and be associated to the morphological changes of the vestibular neurons that we observed in the Igf1(-/-) mice. UR - http://www.sciencedirect.com/science/article/pii/S0378595515001835 ER - TY - JOUR T1 - Deregulation of key signaling pathways involved in oocyte maturation in FMR1 premutation carriers with Fragile X-associated primary ovarian insufficiency. JF - Gene Y1 - 2015 A1 - Alvarez-Mora, M I A1 - Rodriguez-Revenga, L A1 - Madrigal, I A1 - García-García, F A1 - Duran, M A1 - Dopazo, J A1 - Estivill, X A1 - Milà, M KW - Adult KW - Aged KW - Female KW - Fragile X Mental Retardation Protein KW - Fragile X Syndrome KW - Gene Expression Profiling KW - Gene Expression Regulation, Developmental KW - Gene ontology KW - Genome-Wide Association Study KW - Heterozygote KW - Humans KW - Middle Aged KW - Models, Genetic KW - mutation KW - Oligonucleotide Array Sequence Analysis KW - Oocytes KW - Primary Ovarian Insufficiency KW - Signal Transduction AB -

FMR1 premutation female carriers are at risk for Fragile X-associated primary ovarian insufficiency (FXPOI). Insights from knock-in mouse model have recently demonstrated that FXPOI is due to an increased rate of follicle depletion or an impaired development of the growing follicles. Molecular mechanisms responsible for this reduced viability are still unknown. In an attempt to provide new data on the mechanisms that lead to FXPOI, we report the first investigation involving transcription profiling of total blood from FMR1 premutation female carriers with and without FXPOI. A total of 16 unrelated female individuals (6 FMR1 premutated females with FXPOI; 6 FMR1 premutated females without FXPOI; and 4 no-FXPOI females) were studied by whole human genome oligonucleotide microarray (Agilent Technologies). Fold change analysis did not show any genes with significant differential gene expression. However, functional profiling by gene set analysis showed large number of statistically significant deregulated GO annotations as well as numerous KEGG pathways in FXPOI females. These results suggest that the impairment of fertility in these females might be due to a generalized deregulation of key signaling pathways involved in oocyte maturation. In particular, the vasoendotelial growth factor signaling, the inositol phosphate metabolism, the cell cycle, and the MAPK signaling pathways were found to be down-regulated in FXPOI females. Furthermore, a high statistical enrichment of biological processes involved in cell death and survival were found deregulated among FXPOI females. Our results provide new strategic approaches to further investigate the molecular mechanisms and potential therapeutic targets for FXPOI not focused in a single gene but rather in the set of genes involved in these pathways.

VL - 571 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/26095811?dopt=Abstract ER - TY - JOUR T1 - Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease. JF - Scientific reports Y1 - 2015 A1 - Luzón-Toro, Berta A1 - Gui, Hongsheng A1 - Ruiz-Ferrer, Macarena A1 - Sze-Man Tang, Clara A1 - Fernández, Raquel M A1 - Sham, Pak-Chung A1 - Torroglosa, Ana A1 - Kwong-Hang Tam, Paul A1 - Espino-Paisán, Laura A1 - Cherny, Stacey S A1 - Bleda, Marta A1 - Enguix-Riego, María Del Valle A1 - Joaquín Dopazo A1 - Antiňolo, Guillermo A1 - Garcia-Barceló, Maria-Mercè A1 - Borrego, Salud KW - babelomics KW - Hirschprung KW - NGS KW - prioritization AB - Hirschsprung disease (HSCR; OMIM 142623) is a developmental disorder characterized by aganglionosis along variable lengths of the distal gastrointestinal tract, which results in intestinal obstruction. Interactions among known HSCR genes and/or unknown disease susceptibility loci lead to variable severity of phenotype. Neither linkage nor genome-wide association studies have efficiently contributed to completely dissect the genetic pathways underlying this complex genetic disorder. We have performed whole exome sequencing of 16 HSCR patients from 8 unrelated families with SOLID platform. Variants shared by affected relatives were validated by Sanger sequencing. We searched for genes recurrently mutated across families. Only variations in the FAT3 gene were significantly enriched in five families. Within-family analysis identified compound heterozygotes for AHNAK and several genes (N = 23) with heterozygous variants that co-segregated with the phenotype. Network and pathway analyses facilitated the discovery of polygenic inheritance involving FAT3, HSCR known genes and their gene partners. Altogether, our approach has facilitated the detection of more than one damaging variant in biologically plausible genes that could jointly contribute to the phenotype. Our data may contribute to the understanding of the complex interactions that occur during enteric nervous system development and the etiopathology of familial HSCR. VL - 5 UR - http://www.nature.com/articles/srep16473 ER - TY - JOUR T1 - Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease JF - Scientific Reports Y1 - 2015 A1 - Luzón-Toro, Berta A1 - Gui, Hongsheng A1 - Ruiz-Ferrer, Macarena A1 - Sze-Man Tang, Clara A1 - Fernández, Raquel M. A1 - Sham, Pak-Chung A1 - Torroglosa, Ana A1 - Kwong-Hang Tam, Paul A1 - Espino-Paisán, Laura A1 - Cherny, Stacey S. A1 - Bleda, Marta A1 - Enguix-Riego, María Del Valle A1 - Dopazo, Joaquin A1 - Antiňolo, Guillermo A1 - Garcia-Barceló, Maria-Mercè A1 - Borrego, Salud VL - 5 UR - http://www.nature.com/articles/srep16473http://www.nature.com/articles/srep16473.pdfhttp://www.nature.com/articles/srep16473.pdfhttp://www.nature.com/articles/srep16473 IS - 1 JO - Sci Rep ER - TY - JOUR T1 - Identification of epistatic interactions through genome-wide association studies in sporadic medullary and juvenile papillary thyroid carcinomas. JF - BMC medical genomics Y1 - 2015 A1 - Luzón-Toro, Berta A1 - Bleda, Marta A1 - Navarro, Elena A1 - García-Alonso, Luz A1 - Ruiz-Ferrer, Macarena A1 - Medina, Ignacio A1 - Martín-Sánchez, Marta A1 - Gonzalez, Cristina Y A1 - Fernández, Raquel M A1 - Torroglosa, Ana A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin A1 - Borrego, Salud KW - epistasis KW - GWAS KW - Thyroid cancer AB - BACKGROUND: The molecular mechanisms leading to sporadic medullary thyroid carcinoma (sMTC) and juvenile papillary thyroid carcinoma (PTC), two rare tumours of the thyroid gland, remain poorly understood. Genetic studies on thyroid carcinomas have been conducted, although just a few loci have been systematically associated. Given the difficulties to obtain single-loci associations, this work expands its scope to the study of epistatic interactions that could help to understand the genetic architecture of complex diseases and explain new heritable components of genetic risk. METHODS: We carried out the first screening for epistasis by Multifactor-Dimensionality Reduction (MDR) in genome-wide association study (GWAS) on sMTC and juvenile PTC, to identify the potential simultaneous involvement of pairs of variants in the disease. RESULTS: We have identified two significant epistatic gene interactions in sMTC (CHFR-AC016582.2 and C8orf37-RNU1-55P) and three in juvenile PTC (RP11-648k4.2-DIO1, RP11-648k4.2-DMGDH and RP11-648k4.2-LOXL1). Interestingly, each interacting gene pair included a non-coding RNA, providing thus support to the relevance that these elements are increasingly gaining to explain carcinoma development and progression. CONCLUSIONS: Overall, this study contributes to the understanding of the genetic basis of thyroid carcinoma susceptibility in two different case scenarios such as sMTC and juvenile PTC. VL - 8 UR - http://bmcmedgenomics.biomedcentral.com/articles/10.1186/s12920-015-0160-7 ER - TY - JOUR T1 - Identification of epistatic interactions through genome-wide association studies in sporadic medullary and juvenile papillary thyroid carcinomas JF - BMC Medical Genomics Y1 - 2015 A1 - Luzón-Toro, Berta A1 - Bleda, Marta A1 - Navarro, Elena A1 - García-Alonso, Luz A1 - Ruiz-Ferrer, Macarena A1 - Medina, Ignacio A1 - Martín-Sánchez, Marta A1 - Gonzalez, Cristina Y. A1 - Fernández, Raquel M. A1 - Torroglosa, Ana A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin A1 - Borrego, Salud AB - The molecular mechanisms leading to sporadic medullary thyroid carcinoma (sMTC) and juvenile papillary thyroid carcinoma (PTC), two rare tumours of the thyroid gland, remain poorly understood. Genetic studies on thyroid carcinomas have been conducted, although just a few loci have been systematically associated. Given the difficulties to obtain single-loci associations, this work expands its scope to the study of epistatic interactions that could help to understand the genetic architecture of complex diseases and explain new heritable components of genetic risk. VL - 8 UR - https://doi.org/10.1186/s12920-015-0160-7 ER - TY - JOUR T1 - Involvement of a citrus meiotic recombination TTC-repeat motif in the formation of gross deletions generated by ionizing radiation and MULE activation JF - BMC Genomics Y1 - 2015 A1 - Terol, Javier A1 - Ibañez, Victoria A1 - Carbonell, José A1 - Alonso, Roberto A1 - Estornell, Leandro H. A1 - Licciardello, Concetta A1 - Gut, Ivo G. A1 - Dopazo, Joaquin A1 - Talon, Manuel AB - Transposable-element mediated chromosomal rearrangements require the involvement of two transposons and two double-strand breaks (DSB) located in close proximity. In radiobiology, DSB proximity is also a major factor contributing to rearrangements. However, the whole issue of DSB proximity remains virtually unexplored. VL - 16 UR - https://doi.org/10.1186/s12864-015-1280-3 ER - TY - JOUR T1 - Involvement of a citrus meiotic recombination TTC-repeat motif in the formation of gross deletions generated by ionizing radiation and MULE activation. JF - BMC genomics Y1 - 2015 A1 - Terol, Javier A1 - Ibañez, Victoria A1 - Carbonell, José A1 - Alonso, Roberto A1 - Estornell, Leandro H A1 - Licciardello, Concetta A1 - Gut, Ivo G A1 - Joaquín Dopazo A1 - Talon, Manuel AB - BACKGROUND: Transposable-element mediated chromosomal rearrangements require the involvement of two transposons and two double-strand breaks (DSB) located in close proximity. In radiobiology, DSB proximity is also a major factor contributing to rearrangements. However, the whole issue of DSB proximity remains virtually unexplored. RESULTS: Based on DNA sequencing analysis we show that the genomes of 2 derived mutations, Arrufatina (sport) and Nero (irradiation), share a similar 2 Mb deletion of chromosome 3. A 7 kb Mutator-like element found in Clemenules was present in Arrufatina in inverted orientation flanking the 5’ end of the deletion. The Arrufatina Mule displayed "dissimilar" 9-bp target site duplications separated by 2 Mb. Fine-scale single nucleotide variant analyses of the deleted fragments identified a TTC-repeat sequence motif located in the center of the deletion responsible of a meiotic crossover detected in the citrus reference genome. CONCLUSIONS: Taken together, this information is compatible with the proposal that in both mutants, the TTC-repeat motif formed a triplex DNA structure generating a loop that brought in close proximity the originally distinct reactive ends. In Arrufatina, the loop brought the Mule ends nearby the 2 distinct insertion target sites and the inverted insertion of the transposable element between these target sites provoked the release of the in-between fragment. This proposal requires the involvement of a unique transposon and sheds light on the unresolved question of how two distinct sites become located in close proximity. These observations confer a crucial role to the TTC-repeats in fundamental plant processes as meiotic recombination and chromosomal rearrangements. VL - 16 UR - http://www.biomedcentral.com/1471-2164/16/69 ER - TY - JOUR T1 - A phylogenetic analysis of 34 chloroplast genomes elucidates the relationships between wild and domestic species within the genus Citrus. JF - Molecular biology and evolution Y1 - 2015 A1 - Carbonell-Caballero, José A1 - Alonso, Roberto A1 - Ibañez, Victoria A1 - Terol, Javier A1 - Talon, Manuel A1 - Dopazo, Joaquin KW - chloroplast KW - citrus KW - Phylogeny KW - WGS AB - Citrus genus includes some of the most important cultivated fruit trees worldwide. Despite being extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of its domestication still remain an open question. Here we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model was used to estimate divergence times between the major citrus groups. Additionally, a complete map of the variability across the genome of different citrus species was produced, including single nucleotide variants, heteroplasmic positions, indels and large structural variants. The distribution of all these variants provided further independent support to the phylogeny obtained. An unexpected finding was the high level of heteroplasmy found in several of the analysed genomes. The use of the complete chloroplast DNA not only paves the way for a better understanding of the phylogenetic relationships within the Citrus genus, but also provides original insights into other elusive evolutionary processes such as chloroplast inheritance, heteroplasmy and gene selection. VL - 32 UR - http://mbe.oxfordjournals.org/content/early/2015/04/27/molbev.msv082.full ER - TY - JOUR T1 - Prediction of human population responses to toxic compounds by a collaborative competition. JF - Nature biotechnology Y1 - 2015 A1 - Eduati, Federica A1 - Mangravite, Lara M A1 - Wang, Tao A1 - Tang, Hao A1 - Bare, J Christopher A1 - Huang, Ruili A1 - Norman, Thea A1 - Kellen, Mike A1 - Menden, Michael P A1 - Yang, Jichen A1 - Zhan, Xiaowei A1 - Zhong, Rui A1 - Xiao, Guanghua A1 - Xia, Menghang A1 - Abdo, Nour A1 - Kosyk, Oksana AB - The ability to computationally predict the effects of toxic compounds on humans could help address the deficiencies of current chemical safety testing. Here, we report the results from a community-based DREAM challenge to predict toxicities of environmental compounds with potential adverse health effects for human populations. We measured the cytotoxicity of 156 compounds in 884 lymphoblastoid cell lines for which genotype and transcriptional data are available as part of the Tox21 1000 Genomes Project. The challenge participants developed algorithms to predict interindividual variability of toxic response from genomic profiles and population-level cytotoxicity data from structural attributes of the compounds. 179 submitted predictions were evaluated against an experimental data set to which participants were blinded. Individual cytotoxicity predictions were better than random, with modest correlations (Pearson’s r < 0.28), consistent with complex trait genomic prediction. In contrast, predictions of population-level response to different compounds were higher (r < 0.66). The results highlight the possibility of predicting health risks associated with unknown compounds, although risk estimation accuracy remains suboptimal. UR - http://www.nature.com/nbt/journal/vaop/ncurrent/full/nbt.3299.html ER - TY - JOUR T1 - Re-evaluation casts doubt on the pathogenicity of homozygous USH2A p.C759F. JF - Am J Med Genet A Y1 - 2015 A1 - Pozo, María González-Del A1 - Bravo-Gil, Nereida A1 - Méndez-Vidal, Cristina A1 - Montero-de-Espinosa, Ignacio A1 - Millán, José M A1 - Dopazo, Joaquin A1 - Borrego, Salud A1 - Antiňolo, Guillermo KW - Base Sequence KW - Cyclic Nucleotide Phosphodiesterases, Type 6 KW - Extracellular Matrix Proteins KW - Gene Library KW - Humans KW - Molecular Sequence Data KW - Mutation, Missense KW - Pedigree KW - Retinitis pigmentosa KW - Sequence Analysis, DNA KW - Spain AB -

Mutations in USH2A are a common cause of Retinitis Pigmentosa (RP). Among the most frequently reported USH2A variants, c.2276G>T (p.C759F) has been found in both affected and healthy individuals. The pathogenicity of this variant remains controversial since it was detected in homozygosity in two healthy siblings of a Spanish family (S23), eleven years ago. The fact that these individuals remain asymptomatic today, prompted us to study the presence of other pathogenic variants in this family using targeted resequencing of 26 retinal genes in one of the affected individuals. This approach allowed us to identify one novel pathogenic homozygous mutation in exon 13 of PDE6B (c.1678C>T; p.R560C). This variant cosegregated with the disease and was absent in 200 control individuals. Remarkably, the identified variant in PDE6B corresponds to the mutation responsible of the retinal degeneration in the naturally occurring rd10 mutant mice. To our knowledge, this is the first report of the identification of the rd10 mice mutation in a RP family. These findings, together with a review of the literature, support the hypothesis that homozygous p.C759F mutations are not pathogenic and led us to exclude the implication of p.C759F in the RP of family S23. Our results indicate the need of re-evaluating all families genetically diagnosed with this mutation.

VL - 167 IS - 7 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25823529?dopt=Abstract ER - TY - JOUR T1 - Therapeutic targets for olive pollen allergy defined by gene markers modulated by Ole e 1-derived peptides. JF - Molecular immunology Y1 - 2015 A1 - Calzada, David A1 - Aguerri, Miriam A1 - Baos, Selene A1 - Montaner, David A1 - Mata, Manuel A1 - Joaquín Dopazo A1 - Quiralte, Joaquín A1 - Florido, Fernando A1 - Lahoz, Carlos A1 - Cárdaba, Blanca AB - Two regions of Ole e 1, the major olive-pollen allergen, have been characterized as T-cell epitopes, one as immunodominant region (aa91-130) and the other, as mainly recognized by non-allergic subjects (aa10-31). This report tries to characterize the specific relevance of these epitopes in the allergic response to olive pollen by analyzing the secreted cytokines and the gene expression profiles induced after specific stimulation of peripheral blood mononuclear cells (PBMCs). PBMCs from olive pollen-allergic and non-allergic control subjects were stimulated with olive-pollen extract and Ole e 1 dodecapeptides containing relevant T-cell epitopes. Levels of cytokines were measured in cellular supernatants and gene expression was determined by microarrays, on the RNAs extracted from PBMCs. One hundred eighty-nine differential genes (fold change >2 or <-2, P<0.05) were validated by qRT-PCR in a large population. It was not possible to define a pattern of response according the overall cytokine results but interesting differences were observed, mainly in the regulatory cytokines. Principal component (PCA) gene-expression analysis defined clusters that correlated with the experimental conditions in the group of allergic subjects. Gene expression and functional analyses revealed differential genes and pathways among the experimental conditions. A set of 51 genes (many essential to T-cell tolerance and homeostasis) correlated with the response to aa10-31 of Ole e 1. In conclusion, two peptides derived from Ole e 1 could regulate the immune response in allergic patients, by gene-expression modification of several regulation-related genes. These results open new research ways to the regulation of allergy by Oleaceae family members. VL - 64 UR - http://www.sciencedirect.com/science/article/pii/S0161589014003356 ER - TY - JOUR T1 - Using activation status of signaling pathways as mechanism-based biomarkers to predict drug sensitivity. JF - Sci Rep Y1 - 2015 A1 - Amadoz, Alicia A1 - Sebastián-Leon, Patricia A1 - Vidal, Enrique A1 - Salavert, Francisco A1 - Dopazo, Joaquin KW - Algorithms KW - Antineoplastic Agents KW - biomarkers KW - Cell Line, Tumor KW - Cell Survival KW - gene expression KW - Humans KW - Lethal Dose 50 KW - Neoplasms KW - Phosphorylation KW - Proteins KW - Signal Transduction AB -

Many complex traits, as drug response, are associated with changes in biological pathways rather than being caused by single gene alterations. Here, a predictive framework is presented in which gene expression data are recoded into activity statuses of signal transduction circuits (sub-pathways within signaling pathways that connect receptor proteins to final effector proteins that trigger cell actions). Such activity values are used as features by a prediction algorithm which can efficiently predict a continuous variable such as the IC50 value. The main advantage of this prediction method is that the features selected by the predictor, the signaling circuits, are themselves rich-informative, mechanism-based biomarkers which provide insight into or drug molecular mechanisms of action (MoA).

VL - 5 U1 - https://www.ncbi.nlm.nih.gov/pubmed/26678097?dopt=Abstract ER - TY - JOUR T1 - Whole Exome Sequencing Reveals ZNF408 as a New Gene Associated With Autosomal Recessive Retinitis Pigmentosa with Vitreal Alterations. JF - Human molecular genetics Y1 - 2015 A1 - Avila-Fernandez, Almudena A1 - Perez-Carro, Raquel A1 - Corton, Marta A1 - Lopez-Molina, Maria Isabel A1 - Campello, Laura A1 - Garanto, Alex A1 - Fernadez-Sanchez, Laura A1 - Duijkers, Lonneke A1 - Lopez-Martinez, Miguel Angel A1 - Riveiro-Alvarez, Rosa A1 - da Silva, Luciana Rodrigues Jacy A1 - Sanchez-Alcudia, Rocío A1 - Martin-Garrido, Esther A1 - Reyes, Noelia A1 - Garcia-Garcia, Francisco A1 - Dopazo, Joaquin A1 - Garcia-Sandoval, Blanca A1 - Collin, Rob W A1 - Cuenca, Nicolas A1 - Ayuso, Carmen AB - Retinitis Pigmentosa (RP) is a group of progressive inherited retinal dystrophies that cause visual impairment as a result of photoreceptor cell death. RP is heterogeneous, both clinically and genetically making difficult to establish precise genotype-phenotype correlations. In a Spanish family with autosomal recessive RP (arRP), homozygosity mapping and whole exome sequencing led to the identification of a homozygous mutation (c.358_359delGT; p.Ala122Leufs*2) in the ZNF408 gene. A screening performed in 217 additional unrelated families revealed another homozygous mutation (c.1621C>T; p.Arg541Cys) in an isolated RP case. ZNF408 encodes a transcription factor that harbors ten predicted C2H2-type fingers thought to be implicated in DNA binding. To elucidate the ZNF408 role in the retina and the pathogenesis of these mutations we have performed different functional studies. By immunohistochemical analysis in healthy human retina, we identified that ZNF408 is expressed in both cone and rod photoreceptors, in a specific type of amacrine and ganglion cells, and in retinal blood vessels. ZNF408 revealed a cytoplasmic localization and a nuclear distribution in areas corresponding with the euchromatin fraction. Immunolocalization studies showed a partial mislocalization of the p.Arg541Cys mutant protein retaining part of the WT protein in the cytoplasm. Our study demonstrates that ZNF408, previously associated with Familial Exudative Vitreoretinopathy (FEVR), is a new gene causing arRP with vitreous condensations supporting the evidence that this protein plays additional functions into the human retina. VL - 24 UR - http://hmg.oxfordjournals.org/content/early/2015/04/16/hmg.ddv140.abstract ER - TY - JOUR T1 - Whole-exome sequencing reveals ZNF408 as a new gene associated with autosomal recessive retinitis pigmentosa with vitreal alterations. JF - Hum Mol Genet Y1 - 2015 A1 - Avila-Fernandez, Almudena A1 - Perez-Carro, Raquel A1 - Corton, Marta A1 - Lopez-Molina, Maria Isabel A1 - Campello, Laura A1 - Garanto, Alejandro A1 - Fernandez-Sanchez, Laura A1 - Duijkers, Lonneke A1 - Lopez-Martinez, Miguel Angel A1 - Riveiro-Alvarez, Rosa A1 - da Silva, Luciana Rodrigues Jacy A1 - Sanchez-Alcudia, Rocío A1 - Martin-Garrido, Esther A1 - Reyes, Noelia A1 - Garcia-Garcia, Francisco A1 - Dopazo, Joaquin A1 - Garcia-Sandoval, Blanca A1 - Collin, Rob W J A1 - Cuenca, Nicolas A1 - Ayuso, Carmen KW - Amino Acid Sequence KW - Animals KW - Chlorocebus aethiops KW - Chromosome Mapping KW - COS Cells KW - DNA-Binding Proteins KW - Exome KW - Genome-Wide Association Study KW - High-Throughput Nucleotide Sequencing KW - Homozygote KW - Humans KW - Molecular Sequence Data KW - Mutant Proteins KW - Pedigree KW - Retina KW - Retinal Cone Photoreceptor Cells KW - Retinal Rod Photoreceptor Cells KW - Retinitis pigmentosa KW - Transcription Factors AB -

Retinitis pigmentosa (RP) is a group of progressive inherited retinal dystrophies that cause visual impairment as a result of photoreceptor cell death. RP is heterogeneous, both clinically and genetically making difficult to establish precise genotype-phenotype correlations. In a Spanish family with autosomal recessive RP (arRP), homozygosity mapping and whole-exome sequencing led to the identification of a homozygous mutation (c.358_359delGT; p.Ala122Leufs*2) in the ZNF408 gene. A screening performed in 217 additional unrelated families revealed another homozygous mutation (c.1621C>T; p.Arg541Cys) in an isolated RP case. ZNF408 encodes a transcription factor that harbors 10 predicted C2H2-type fingers thought to be implicated in DNA binding. To elucidate the ZNF408 role in the retina and the pathogenesis of these mutations we have performed different functional studies. By immunohistochemical analysis in healthy human retina, we identified that ZNF408 is expressed in both cone and rod photoreceptors, in a specific type of amacrine and ganglion cells, and in retinal blood vessels. ZNF408 revealed a cytoplasmic localization and a nuclear distribution in areas corresponding with the euchromatin fraction. Immunolocalization studies showed a partial mislocalization of the p.Arg541Cys mutant protein retaining part of the WT protein in the cytoplasm. Our study demonstrates that ZNF408, previously associated with Familial Exudative Vitreoretinopathy (FEVR), is a new gene causing arRP with vitreous condensations supporting the evidence that this protein plays additional functions into the human retina.

VL - 24 IS - 14 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25882705?dopt=Abstract ER - TY - JOUR T1 - Acceleration of short and long DNA read mapping without loss of accuracy using suffix array. JF - Bioinformatics (Oxford, England) Y1 - 2014 A1 - Tárraga, Joaquín A1 - Arnau, Vicente A1 - Martinez, Hector A1 - Moreno, Raul A1 - Cazorla, Diego A1 - Salavert-Torres, José A1 - Blanquer-Espert, Ignacio A1 - Joaquín Dopazo A1 - Medina, Ignacio KW - NGS KW - short read mapping. HPC. suffix arrays AB - HPG Aligner applies suffix arrays for DNA read mapping. This implementation produces a highly sensitive and extremely fast mapping of DNA reads that scales up almost linearly with read length. The approach presented here is faster (over 20x for long reads) and more sensitive (over 98% in a wide range of read lengths) than the current, state-of-the-art mappers. HPG Aligner is not only an optimal alternative for current sequencers but also the only solution available to cope with longer reads and growing throughputs produced by forthcoming sequencing technologies. VL - 30 UR - http://bioinformatics.oxfordjournals.org/content/early/2014/08/19/bioinformatics.btu553.long ER - TY - JOUR T1 - The Activation of the Sox2 RR2 Pluripotency Transcriptional Reporter in Human Breast Cancer Cell Lines is Dynamic and Labels Cells with Higher Tumorigenic Potential. JF - Front Oncol Y1 - 2014 A1 - Iglesias, Juan Manuel A1 - Leis, Olatz A1 - Pérez Ruiz, Estíbaliz A1 - Gumuzio Barrie, Juan A1 - Garcia-Garcia, Francisco A1 - Aduriz, Ariane A1 - Beloqui, Izaskun A1 - Hernandez-Garcia, Susana A1 - Lopez-Mato, Maria Paz A1 - Dopazo, Joaquin A1 - Pandiella, Atanasio A1 - Menendez, Javier A A1 - Martin, Angel Garcia AB -

The striking similarity displayed at the mechanistic level between tumorigenesis and the generation of induced pluripotent stem cells and the fact that genes and pathways relevant for embryonic development are reactivated during tumor progression highlights the link between pluripotency and cancer. Based on these observations, we tested whether it is possible to use a pluripotency-associated transcriptional reporter, whose activation is driven by the SRR2 enhancer from the Sox2 gene promoter (named S4+ reporter), to isolate cancer stem cells (CSCs) from breast cancer cell lines. The S4+ pluripotency transcriptional reporter allows the isolation of cells with enhanced tumorigenic potential and its activation was switched on and off in the cell lines studied, reflecting a plastic cellular process. Microarray analysis comparing the populations in which the reporter construct is active versus inactive showed that positive cells expressed higher mRNA levels of cytokines (IL-8, IL-6, TNF) and genes (such as ATF3, SNAI2, and KLF6) previously related with the CSC phenotype in breast cancer.

VL - 4 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25414831?dopt=Abstract ER - TY - JOUR T1 - Deciphering intrafamilial phenotypic variability by exome sequencing in a Bardet-Biedl family. JF - Mol Genet Genomic Med Y1 - 2014 A1 - González-del Pozo, María A1 - Méndez-Vidal, Cristina A1 - Santoyo-López, Javier A1 - Vela-Boza, Alicia A1 - Bravo-Gil, Nereida A1 - Rueda, Antonio A1 - García-Alonso, Luz A1 - Vázquez-Marouschek, Carmen A1 - Dopazo, Joaquin A1 - Borrego, Salud A1 - Antiňolo, Guillermo AB -

Bardet-Biedl syndrome (BBS) is a model ciliopathy characterized by a wide range of clinical variability. The heterogeneity of this condition is reflected in the number of underlying gene defects and the epistatic interactions between the proteins encoded. BBS is generally inherited in an autosomal recessive trait. However, in some families, mutations across different loci interact to modulate the expressivity of the phenotype. In order to investigate the magnitude of epistasis in one BBS family with remarkable intrafamilial phenotypic variability, we designed an exome sequencing-based approach using SOLID 5500xl platform. This strategy allowed the reliable detection of the primary causal mutations in our family consisting of two novel compound heterozygous mutations in McKusick-Kaufman syndrome (MKKS) gene (p.D90G and p.V396F). Additionally, exome sequencing enabled the detection of one novel heterozygous NPHP4 variant which is predicted to activate a cryptic acceptor splice site and is only present in the most severely affected patient. Here, we provide an exome sequencing analysis of a BBS family and show the potential utility of this tool, in combination with network analysis, to detect disease-causing mutations and second-site modifiers. Our data demonstrate how next-generation sequencing (NGS) can facilitate the dissection of epistatic phenomena, and shed light on the genetic basis of phenotypic variability.

VL - 2 IS - 2 U1 - https://www.ncbi.nlm.nih.gov/pubmed/24689075?dopt=Abstract ER - TY - JOUR T1 - Deciphering intrafamilial phenotypic variability by exome sequencing in a Bardet–Biedl family JF - Molecular Genetics & Genomic Medicine Y1 - 2014 A1 - González-del Pozo, María A1 - Méndez-Vidal, Cristina A1 - Santoyo-López, Javier A1 - Vela-Boza, Alicia A1 - Nereida Bravo-Gil A1 - Antonio Rueda A1 - García-Alonso, Luz A1 - Vázquez-Marouschek, Carmen A1 - Joaquín Dopazo A1 - Borrego, Salud A1 - Antiňolo, Guillermo AB - Bardet–Biedl syndrome (BBS) is a model ciliopathy characterized by a wide range of clinical variability. The heterogeneity of this condition is reflected in the number of underlying gene defects and the epistatic interactions between the proteins encoded. BBS is generally inherited in an autosomal recessive trait. However, in some families, mutations across different loci interact to modulate the expressivity of the phenotype. In order to investigate the magnitude of epistasis in one BBS family with remarkable intrafamilial phenotypic variability, we designed an exome sequencing–based approach using SOLID 5500xl platform. This strategy allowed the reliable detection of the primary causal mutations in our family consisting of two novel compound heterozygous mutations in McKusick–Kaufman syndrome (MKKS) gene (p.D90G and p.V396F). Additionally, exome sequencing enabled the detection of one novel heterozygous NPHP4 variant which is predicted to activate a cryptic acceptor splice site and is only present in the most severely affected patient. Here, we provide an exome sequencing analysis of a BBS family and show the potential utility of this tool, in combination with network analysis, to detect disease-causing mutations and second-site modifiers. Our data demonstrate how next-generation sequencing (NGS) can facilitate the dissection of epistatic phenomena, and shed light on the genetic basis of phenotypic variability. VL - 2 UR - http://onlinelibrary.wiley.com/doi/10.1002/mgg3.50/full ER - TY - JOUR T1 - Exome sequencing reveals novel and recurrent mutations with clinical significance in inherited retinal dystrophies. JF - PLoS One Y1 - 2014 A1 - González-del Pozo, María A1 - Méndez-Vidal, Cristina A1 - Bravo-Gil, Nereida A1 - Vela-Boza, Alicia A1 - Dopazo, Joaquin A1 - Borrego, Salud A1 - Antiňolo, Guillermo KW - Adolescent KW - Adult KW - Amino Acid Sequence KW - Base Sequence KW - Child KW - Chromosome Segregation KW - DNA Mutational Analysis KW - Exome KW - Family KW - Female KW - Humans KW - Inheritance Patterns KW - Male KW - Middle Aged KW - Molecular Sequence Data KW - mutation KW - Pedigree KW - Retinal Dystrophies KW - Rhodopsin AB -

This study aimed to identify the underlying molecular genetic cause in four Spanish families clinically diagnosed of Retinitis Pigmentosa (RP), comprising one autosomal dominant RP (adRP), two autosomal recessive RP (arRP) and one with two possible modes of inheritance: arRP or X-Linked RP (XLRP). We performed whole exome sequencing (WES) using NimbleGen SeqCap EZ Exome V3 sample preparation kit and SOLID 5500xl platform. All variants passing filter criteria were validated by Sanger sequencing to confirm familial segregation and the absence in local control population. This strategy allowed the detection of: (i) one novel heterozygous splice-site deletion in RHO, c.937-2_944del, (ii) one rare homozygous mutation in C2orf71, c.1795T>C; p.Cys599Arg, not previously associated with the disease, (iii) two heterozygous null mutations in ABCA4, c.2041C>T; p.R681* and c.6088C>T; p.R2030*, and (iv) one mutation, c.2405-2406delAG; p.Glu802Glyfs*31 in the ORF15 of RPGR. The molecular findings for RHO and C2orf71 confirmed the initial diagnosis of adRP and arRP, respectively, while patients with the two ABCA4 mutations, both previously associated with Stargardt disease, presented symptoms of RP with early macular involvement. Finally, the X-Linked inheritance was confirmed for the family with the RPGR mutation. This latter finding allowed the inclusion of carrier sisters in our preimplantational genetic diagnosis program.

VL - 9 IS - 12 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25544989?dopt=Abstract ER - TY - JOUR T1 - A New Overgrowth Syndrome is Due to Mutations in RNF125. JF - Human mutation Y1 - 2014 A1 - Tenorio, Jair A1 - Mansilla, Alicia A1 - Valencia, María A1 - Martínez-Glez, Víctor A1 - Romanelli, Valeria A1 - Arias, Pedro A1 - Castrejón, Nerea A1 - Poletta, Fernando A1 - Guillén-Navarro, Encarna A1 - Gordo, Gema A1 - Mansilla, Elena A1 - García-Santiago, Fé A1 - González-Casado, Isabel A1 - Vallespín, Elena A1 - Palomares, María A1 - Mori, María A A1 - Santos-Simarro, Fernando A1 - García-Miñaur, Sixto A1 - Fernández, Luis A1 - Mena, Rocío A1 - Benito-Sanz, Sara A1 - Del Pozo, Angela A1 - Silla, Juan Carlos A1 - Ibañez, Kristina A1 - López-Granados, Eduardo A1 - Martín-Trujillo, Alex A1 - Montaner, David A1 - Heath, Karen E A1 - Campos-Barros, Angel A1 - Joaquín Dopazo A1 - Nevado, Julián A1 - Monk, David A1 - Ruiz-Pérez, Víctor L A1 - Lapunzina, Pablo KW - NGS KW - prioritization KW - Rare Disease AB - Overgrowth syndromes (OGS) are a group of disorders in which all parameters of growth and physical development are above the mean for age and sex. We evaluated a series of 270 families from the Spanish Overgrowth Syndrome Registry with no known overgrowth syndrome. We identified one de novo deletion and three missense mutations in RNF125 in six patients from 4 families with overgrowth, macrocephaly, intellectual disability, mild hydrocephaly, hypoglycaemia and inflammatory diseases resembling Sjögren syndrome. RNF125 encodes an E3 ubiquitin ligase and is a novel gene of OGS. Our studies of the RNF125 pathway point to upregulation of RIG-I-IPS1-MDA5 and/or disruption of the PI3K-AKT and interferon signaling pathways as the putative final effectors. This article is protected by copyright. All rights reserved. VL - 35 UR - http://onlinelibrary.wiley.com/doi/10.1002/humu.22689/abstract ER - TY - JOUR T1 - A novel locus for a hereditary recurrent neuropathy on chromosome 21q21. JF - Neuromuscular disorders : NMD Y1 - 2014 A1 - Calpena, E A1 - Martínez-Rubio, D A1 - Arpa, J A1 - García-Peñas, J J A1 - Montaner, D. A1 - Dopazo, J. A1 - Palau, F A1 - Espinós, C AB - Hereditary recurrent neuropathies are uncommon. Disorders with a known molecular basis falling within this group include hereditary neuropathy with liability to pressure palsies (HNPP) due to the deletion of the PMP22 gene or to mutations in this same gene, and hereditary neuralgic amyotrophy (HNA) caused by mutations in the SEPT9 gene. We report a three-generation family presenting a hereditary recurrent neuropathy without pathological changes in either PMP22 or SEPT9 genes. We performed a genome-wide mapping, which yielded a locus of 12.4Mb on chromosome 21q21. The constructed haplotype fully segregated with the disease and we found significant evidence of linkage. After mutational screening of genes located within this locus, encoding for proteins and microRNAs, as well as analysis of large deletions/insertions, we identified 71 benign polymorphisms. Our findings suggest a novel genetic locus for a recurrent hereditary neuropathy of which the molecular defect remains elusive. Our results further underscore the clinical and genetic heterogeneity of this group of neuropathies. VL - 24 UR - http://www.sciencedirect.com/science/article/pii/S0960896614001060# ER - TY - JOUR T1 - Novel RP1 mutations and a recurrent BBS1 variant explain the co-existence of two distinct retinal phenotypes in the same pedigree. JF - BMC Genet Y1 - 2014 A1 - Méndez-Vidal, Cristina A1 - Bravo-Gil, Nereida A1 - González-del Pozo, María A1 - Vela-Boza, Alicia A1 - Dopazo, Joaquin A1 - Borrego, Salud A1 - Antiňolo, Guillermo KW - Bardet-Biedl Syndrome KW - Base Sequence KW - Case-Control Studies KW - DNA Mutational Analysis KW - Eye Proteins KW - Genes, Recessive KW - Genetic Association Studies KW - Humans KW - Microsatellite Repeats KW - Microtubule-Associated Proteins KW - Mutation, Missense KW - Pedigree KW - Phenotype KW - Retina KW - Retinitis pigmentosa AB -

BACKGROUND: Molecular diagnosis of Inherited Retinal Dystrophies (IRD) has long been challenging due to the extensive clinical and genetic heterogeneity present in this group of disorders. Here, we describe the clinical application of an integrated next-generation sequencing approach to determine the underlying genetic defects in a Spanish family with a provisional clinical diagnosis of autosomal recessive Retinitis Pigmentosa (arRP).

RESULTS: Exome sequencing of the index patient resulted in the identification of the homozygous BBS1 p.M390R mutation. Sanger sequencing of additional members of the family showed lack of co-segregation of the p.M390R variant in some individuals. Clinical reanalysis indicated co-ocurrence of two different phenotypes in the same family: Bardet-Biedl syndrome in the individual harboring the BBS1 mutation and non-syndromic arRP in extended family members. To identify possible causative mutations underlying arRP, we conducted disease-targeted gene sequencing using a panel of 26 IRD genes. The in-house custom panel was validated using 18 DNA samples known to harbor mutations in relevant genes. All variants were redetected, indicating a high mutation detection rate. This approach allowed the identification of two novel heterozygous null mutations in RP1 (c.4582_4585delATCA; p.I1528Vfs*10 and c.5962dupA; p.I1988Nfs*3) which co-segregated with the disease in arRP patients. Additionally, a mutational screening in 96 patients of our cohort with genetically unresolved IRD revealed the presence of the c.5962dupA mutation in one unrelated family.

CONCLUSIONS: The combination of molecular findings for RP1 and BBS1 genes through exome and gene panel sequencing enabled us to explain the co-existence of two different retinal phenotypes in a family. The identification of two novel variants in RP1 suggests that the use of panels containing the prevalent genes of a particular population, together with an optimized data analysis pipeline, is an efficient and cost-effective approach that can be reliably implemented into the routine diagnostic process of diverse inherited retinal disorders. Moreover, the identification of these novel variants in two unrelated families supports the relatively high prevalence of RP1 mutations in Spanish population and the role of private mutations for commonly mutated genes, while extending the mutational spectrum of RP1.

VL - 15 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25494902?dopt=Abstract ER - TY - JOUR T1 - Permanent cardiac sarcomere changes in a rabbit model of intrauterine growth restriction. JF - PLoS One Y1 - 2014 A1 - Torre, Iratxe A1 - González-Tendero, Anna A1 - García-Cañadilla, Patricia A1 - Crispi, Fátima A1 - Garcia-Garcia, Francisco A1 - Bijnens, Bart A1 - Iruretagoyena, Igor A1 - Dopazo, Joaquin A1 - Amat-Roldán, Ivan A1 - Gratacós, Eduard KW - Animals KW - biomarkers KW - Blood Pressure KW - Body Weight KW - Disease Models, Animal KW - Echocardiography KW - Female KW - Fetal Growth Retardation KW - Fetal Heart KW - Fetus KW - Gene Expression Profiling KW - Organ Size KW - Placenta KW - Pregnancy KW - Rabbits KW - Sarcomeres AB -

BACKGROUND: Intrauterine growth restriction (IUGR) induces fetal cardiac remodelling and dysfunction, which persists postnatally and may explain the link between low birth weight and increased cardiovascular mortality in adulthood. However, the cellular and molecular bases for these changes are still not well understood. We tested the hypothesis that IUGR is associated with structural and functional gene expression changes in the fetal sarcomere cytoarchitecture, which remain present in adulthood.

METHODS AND RESULTS: IUGR was induced in New Zealand pregnant rabbits by selective ligation of the utero-placental vessels. Fetal echocardiography demonstrated more globular hearts and signs of cardiac dysfunction in IUGR. Second harmonic generation microscopy (SHGM) showed shorter sarcomere length and shorter A-band and thick-thin filament interaction lengths, that were already present in utero and persisted at 70 postnatal days (adulthood). Sarcomeric M-band (GO: 0031430) functional term was over-represented in IUGR fetal hearts.

CONCLUSION: The results suggest that IUGR induces cardiac dysfunction and permanent changes on the sarcomere.

VL - 9 IS - 11 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25402351?dopt=Abstract ER - TY - JOUR T1 - Programmed cell death activated by Rose Bengal in Arabidopsis thaliana cell suspension cultures requires functional chloroplasts. JF - Journal of experimental botany Y1 - 2014 A1 - Gutiérrez, Jorge A1 - González-Pérez, Sergio A1 - Garcia-Garcia, Francisco A1 - Daly, Cara T A1 - Lorenzo, Oscar A1 - Revuelta, José L A1 - McCabe, Paul F A1 - Arellano, Juan B AB - Light-grown Arabidopsis thaliana cell suspension culture (ACSC) were subjected to mild photooxidative damage with Rose Bengal (RB) with the aim of gaining a better understanding of singlet oxygen-mediated defence responses in plants. Additionally, ACSC were treated with H2O2 at concentrations that induced comparable levels of protein oxidation damage. Under low to medium light conditions, both RB and H2O2 treatments activated transcriptional defence responses and inhibited photosynthetic activity, but they differed in that programmed cell death (PCD) was only observed in cells treated with RB. When dark-grown ACSC were subjected to RB in the light, PCD was suppressed, indicating that the singlet oxygen-mediated signalling pathway in ACSC requires functional chloroplasts. Analysis of up-regulated transcripts in light-grown ACSC, treated with RB in the light, showed that both singlet oxygen-responsive transcripts and transcripts with a key role in hormone-activated PCD (i.e. ethylene and jasmonic acid) were present. A co-regulation analysis proved that ACSC treated with RB exhibited higher correlation with the conditional fluorescence (flu) mutant than with other singlet oxygen-producing mutants or wild-type plants subjected to high light. However, there was no evidence for the up-regulation of EDS1, suggesting that activation of PCD was not associated with the EXECUTER- and EDS1-dependent signalling pathway described in the flu mutant. Indigo Carmine and Methylene Violet, two photosensitizers unable to enter chloroplasts, did not activate transcriptional defence responses in ACSC; however, whether this was due to their location or to their inherently low singlet oxygen quantum efficiencies was not determined. UR - http://jxb.oxfordjournals.org/content/early/2014/04/09/jxb.eru151.long ER - TY - JOUR T1 - The role of the interactome in the maintenance of deleterious variability in human populations. JF - Mol Syst Biol Y1 - 2014 A1 - García-Alonso, Luz A1 - Jiménez-Almazán, Jorge A1 - Carbonell-Caballero, José A1 - Vela-Boza, Alicia A1 - Santoyo-López, Javier A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin KW - Alleles KW - Exome KW - Gene Library KW - Genetic Variation KW - Genetics, Population KW - Genome, Human KW - Genomics KW - Humans KW - Models, Genetic KW - mutation KW - Phenotype KW - Protein Conformation KW - Protein Interaction Maps KW - Sequence Analysis, DNA KW - Whites AB -

Recent genomic projects have revealed the existence of an unexpectedly large amount of deleterious variability in the human genome. Several hypotheses have been proposed to explain such an apparently high mutational load. However, the mechanisms by which deleterious mutations in some genes cause a pathological effect but are apparently innocuous in other genes remain largely unknown. This study searched for deleterious variants in the 1,000 genomes populations, as well as in a newly sequenced population of 252 healthy Spanish individuals. In addition, variants causative of monogenic diseases and somatic variants from 41 chronic lymphocytic leukaemia patients were analysed. The deleterious variants found were analysed in the context of the interactome to understand the role of network topology in the maintenance of the observed mutational load. Our results suggest that one of the mechanisms whereby the effect of these deleterious variants on the phenotype is suppressed could be related to the configuration of the protein interaction network. Most of the deleterious variants observed in healthy individuals are concentrated in peripheral regions of the interactome, in combinations that preserve their connectivity, and have a marginal effect on interactome integrity. On the contrary, likely pathogenic cancer somatic deleterious variants tend to occur in internal regions of the interactome, often with associated structural consequences. Finally, variants causative of monogenic diseases seem to occupy an intermediate position. Our observations suggest that the real pathological potential of a variant might be more a systems property rather than an intrinsic property of individual proteins.

VL - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/25261458?dopt=Abstract ER - TY - JOUR T1 - Two novel mutations in the BCKDK (branched-chain keto-acid dehydrogenase kinase) gene are responsible for a neurobehavioral deficit in two pediatric unrelated patients. JF - Hum Mutat Y1 - 2014 A1 - García-Cazorla, Angels A1 - Oyarzabal, Alfonso A1 - Fort, Joana A1 - Robles, Concepción A1 - Castejón, Esperanza A1 - Ruiz-Sala, Pedro A1 - Bodoy, Susanna A1 - Merinero, Begoña A1 - Lopez-Sala, Anna A1 - Dopazo, Joaquin A1 - Nunes, Virginia A1 - Ugarte, Magdalena A1 - Artuch, Rafael A1 - Palacín, Manuel A1 - Rodríguez-Pombo, Pilar A1 - Alcaide, Patricia A1 - Navarrete, Rosa A1 - Sanz, Paloma A1 - Font-Llitjós, Mariona A1 - Vilaseca, Ma Antonia A1 - Ormaizabal, Aida A1 - Pristoupilova, Anna A1 - Agulló, Sergi Beltran KW - Amino Acids, Branched-Chain KW - Developmental Disabilities KW - Fibroblasts KW - Humans KW - Male KW - Mutation, Missense KW - Nervous System Diseases KW - Pediatrics KW - Protein Kinases AB -

Inactivating mutations in the BCKDK gene, which codes for the kinase responsible for the negative regulation of the branched-chain α-keto acid dehydrogenase complex (BCKD), have recently been associated with a form of autism in three families. In this work, two novel exonic BCKDK mutations, c.520C>G/p.R174G and c.1166T>C/p.L389P, were identified at the homozygous state in two unrelated children with persistently reduced body fluid levels of branched-chain amino acids (BCAAs), developmental delay, microcephaly, and neurobehavioral abnormalities. Functional analysis of the mutations confirmed the missense character of the c.1166T>C change and showed a splicing defect r.[520c>g;521_543del]/p.R174Gfs1*, for c.520C>G due to the presence of a new donor splice site. Mutation p.L389P showed total loss of kinase activity. Moreover, patient-derived fibroblasts showed undetectable (p.R174Gfs1*) or barely detectable (p.L389P) levels of BCKDK protein and its phosphorylated substrate (phospho-E1α), resulting in increased BCKD activity and the very rapid BCAA catabolism manifested by the patients' clinical phenotype. Based on these results, a protein-rich diet plus oral BCAA supplementation was implemented in the patient homozygous for p.R174Gfs1*. This treatment normalized plasma BCAA levels and improved growth, developmental and behavioral variables. Our results demonstrate that BCKDK mutations can result in neurobehavioral deficits in humans and support the rationale for dietary intervention.

VL - 35 IS - 4 U1 - https://www.ncbi.nlm.nih.gov/pubmed/24449431?dopt=Abstract ER - TY - JOUR T1 - Two Novel Mutations in the BCKDK Gene (Branched-Chain Keto-Acid Dehydrogenase Kinase) are Responsible of a Neurobehavioral Deficit in two Pediatric Unrelated Patients. JF - Human mutation Y1 - 2014 A1 - García-Cazorla, Angels A1 - Oyarzabal, Alfonso A1 - Fort, Joana A1 - Robles, Concepción A1 - Castejón, Esperanza A1 - Ruiz-Sala, Pedro A1 - Bodoy, Susanna A1 - Merinero, Begoña A1 - Lopez-Sala, Anna A1 - Joaquín Dopazo A1 - Nunes, Virginia A1 - Ugarte, Magdalena A1 - Artuch, Rafael A1 - Palacín, Manuel A1 - Rodríguez-Pombo, Pilar AB - Inactivating mutations in the BCKDK gene, which codes for the kinase responsible for the negative regulation of the branched-chain keto-acid dehydrogenase complex (BCKD), have recently been associated with a form of autism in three families. In this work, two novel exonic BCKDK mutations, c.520C>G/p.R174G and c.1166T>C/p.L389P, were identified at the homozygous state in two unrelated children with persistently reduced body fluid levels of branched-chain amino acids (BCAAs), developmental delay, microcephaly and neurobehavioral abnormalities. Functional analysis of the mutations confirmed the missense character of the c.1166T>C change and showed a splicing defect r.[520c>g;521_543del]/p.R174Gfs1*, for c.520C>G due to the presence of a new donor splice site. Mutation p.L389P showed total loss of kinase activity. Moreover, patient-derived fibroblasts showed undetectable (p.R174Gfs1*) or barely detectable (p.L389P) levels of BCKDK protein and its phosphorylated substrate (phospho-E1α), resulting in increased BCKD activity and the very rapid BCAA catabolism manifested by the patients’ clinical phenotype. Based on these results, a protein-rich diet plus oral BCAA supplementation was implemented in the patient homozygous for p.R174Gfs1*. This treatment normalized plasma BCAA levels and improved growth, developmental and behavioral variables. Our results demonstrate that BCKDK mutations can result in neurobehavioral deficits in humans and support the rationale for dietary intervention. This article is protected by copyright. All rights reserved. VL - 35 UR - http://onlinelibrary.wiley.com/doi/10.1002/humu.22513/abstract ER - TY - JOUR T1 - Understanding disease mechanisms with models of signaling pathway activities. JF - BMC systems biology Y1 - 2014 A1 - Sebastián-Leon, Patricia A1 - Vidal, Enrique A1 - Minguez, Pablo A1 - Ana Conesa A1 - Sonia Tarazona A1 - Amadoz, Alicia A1 - Armero, Carmen A1 - Salavert, Francisco A1 - Vidal-Puig, Antonio A1 - Montaner, David A1 - Joaquín Dopazo KW - Disease mechanism KW - pathway KW - signalling KW - Systems biology AB - BackgroundUnderstanding the aspects of the cell functionality that account for disease or drug action mechanisms is one of the main challenges in the analysis of genomic data and is on the basis of the future implementation of precision medicine.ResultsHere we propose a simple probabilistic model in which signaling pathways are separated into elementary sub-pathways or signal transmission circuits (which ultimately trigger cell functions) and then transforms gene expression measurements into probabilities of activation of such signal transmission circuits. Using this model, differential activation of such circuits between biological conditions can be estimated. Thus, circuit activation statuses can be interpreted as biomarkers that discriminate among the compared conditions. This type of mechanism-based biomarkers accounts for cell functional activities and can easily be associated to disease or drug action mechanisms. The accuracy of the proposed model is demonstrated with simulations and real datasets.ConclusionsThe proposed model provides detailed information that enables the interpretation disease mechanisms as a consequence of the complex combinations of altered gene expression values. Moreover, it offers a framework for suggesting possible ways of therapeutic intervention in a pathologically perturbed system. VL - 8 UR - http://www.biomedcentral.com/1752-0509/8/121/abstract ER - TY - JOUR T1 - Understanding disease mechanisms with models of signaling pathway activities JF - BMC systems biology Y1 - 2014 A1 - Sebastián-Leon, Patricia A1 - Vidal, Enrique A1 - Minguez, Pablo A1 - Conesa, Ana A1 - Tarazona, Sonia A1 - Amadoz, Alicia A1 - Armero, Carmen A1 - Salavert Torres, Francisco A1 - Vidal-Puig, Antonio A1 - Montaner, David A1 - Dopazo, Joaquin VL - 8 ER - TY - JOUR T1 - A web tool for the design and management of panels of genes for targeted enrichment and massive sequencing for clinical applications. JF - Nucleic acids research Y1 - 2014 A1 - Alemán, Alejandro A1 - Garcia-Garcia, Francisco A1 - Medina, Ignacio A1 - Joaquín Dopazo KW - Diagnostic KW - Targeted enrichment sequencing KW - WES AB - Disease targeted sequencing is gaining importance as a powerful and cost-effective application of high throughput sequencing technologies to the diagnosis. However, the lack of proper tools to process the data hinders its extensive adoption. Here we present TEAM, an intuitive and easy-to-use web tool that fills the gap between the predicted mutations and the final diagnostic in targeted enrichment sequencing analysis. The tool searches for known diagnostic mutations, corresponding to a disease panel, among the predicted patient’s variants. Diagnostic variants for the disease are taken from four databases of disease-related variants (HGMD-public, HUMSAVAR, ClinVar and COSMIC.) If no primary diagnostic variant is found, then a list of secondary findings that can help to establish a diagnostic is produced. TEAM also provides with an interface for the definition of and customization of panels, by means of which, genes and mutations can be added or discarded to adjust panel definitions. TEAM is freely available at: http://team.babelomics.org. VL - 42 UR - http://nar.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=24861626 ER - TY - JOUR T1 - A web-based interactive framework to assist in the prioritization of disease candidate genes in whole-exome sequencing studies. JF - Nucleic acids research Y1 - 2014 A1 - Alemán, Alejandro A1 - Garcia-Garcia, Francisco A1 - Salavert, Francisco A1 - Medina, Ignacio A1 - Joaquín Dopazo KW - NGS. prioritization AB - Whole-exome sequencing has become a fundamental tool for the discovery of disease-related genes of familial diseases and the identification of somatic driver variants in cancer. However, finding the causal mutation among the enormous background of individual variability in a small number of samples is still a big challenge. Here we describe a web-based tool, BiERapp, which efficiently helps in the identification of causative variants in family and sporadic genetic diseases. The program reads lists of predicted variants (nucleotide substitutions and indels) in affected individuals or tumor samples and controls. In family studies, different modes of inheritance can easily be defined to filter out variants that do not segregate with the disease along the family. Moreover, BiERapp integrates additional information such as allelic frequencies in the general population and the most popular damaging scores to further narrow down the number of putative variants in successive filtering steps. BiERapp provides an interactive and user-friendly interface that implements the filtering strategy used in the context of a large-scale genomic project carried out by the Spanish Network for Research in Rare Diseases (CIBERER) in which more than 800 exomes have been analyzed. BiERapp is freely available at: http://bierapp.babelomics.org/ VL - 42 UR - http://nar.oxfordjournals.org/content/42/W1/W88 ER - TY - JOUR T1 - Assessing Differential Expression Measurements by Highly Parallel Pyrosequencing and DNA Microarrays: A Comparative Study. JF - Omics : a journal of integrative biology Y1 - 2013 A1 - Ariño, Joaquín A1 - Casamayor, Antonio A1 - Pérez, Julián Perez A1 - Pedrola, Laia A1 - Alvarez-Tejado, Miguel A1 - Marbà, Martina A1 - Santoyo, Javier A1 - Joaquín Dopazo AB -

Abstract To explore the feasibility of pyrosequencing for quantitative differential gene expression analysis we have performed a comparative study of the results of the sequencing experiments to those obtained by a conventional DNA microarray platform. A conclusion from our analysis is that, over a threshold of 35 normalized reads per gene, the measurements of gene expression display a good correlation with the references. The observed concordance between pyrosequencing and DNA microarray platforms beyond the threshold was of 0.8, measured as a Pearson’s correlation coefficient. In differential gene expression the initial aim is the quantification the differences among transcripts when comparing experimental conditions. Thus, even in a scenario of low coverage the concordance in the measurements is quite acceptable. On the other hand, the comparatively longer read size obtained by pyrosequencing allows detecting unconventional splicing forms.

UR - http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3545353/ ER - TY - JOUR T1 - Capturing the biological impact of CDKN2A and MC1R genes as an early predisposing event in melanoma and non melanoma skin cancer. JF - Oncotarget Y1 - 2013 A1 - Puig-Butille, Joan Anton A1 - Escamez, Maria José A1 - Garcia-Garcia, Francisco A1 - Tell-Marti, Gemma A1 - Fabra, Angels A1 - Martínez-Santamaría, Lucía A1 - Badenas, Celia A1 - Aguilera, Paula A1 - Pevida, Marta A1 - Joaquín Dopazo A1 - Del Rio, Marcela A1 - Puig, Susana AB - Germline mutations in CDKN2A and/or red hair color variants in MC1R genes are associated with an increased susceptibility to develop cutaneous melanoma or non melanoma skin cancer. We studied the impact of the CDKN2A germinal mutation p.G101W and MC1R variants on gene expression and transcription profiles associated with skin cancer. To this end we set-up primary skin cell co-cultures from siblings of melanoma prone-families that were later analyzed using the expression array approach. As a result, we found that 1535 transcripts were deregulated in CDKN2A mutated cells, with over-expression of immunity-related genes (HLA-DPB1, CLEC2B, IFI44, IFI44L, IFI27, IFIT1, IFIT2, SP110 and IFNK) and down-regulation of genes playing a role in the Notch signaling pathway. 3570 transcripts were deregulated in MC1R variant carriers. In particular, genes related to oxidative stress and DNA damage pathways were up-regulated as well as genes associated with neurodegenerative diseases such as Parkinson’s, Alzheimer and Huntington. Finally, we observed that the expression signatures indentified in phenotypically normal cells carrying CDKN2A mutations or MC1R variants are maintained in skin cancer tumors (melanoma and squamous cell carcinoma). These results indicate that transcriptome deregulation represents an early event critical for skin cancer development. UR - http://www.impactjournals.com/oncotarget/index.php?journal=oncotarget&page=article&op=view&path%5B%5D=1444&path%5B%5D=1824 ER - TY - JOUR T1 - Differential gene-expression analysis defines a molecular pattern related to olive pollen allergy. JF - J Biol Regul Homeost Agents Y1 - 2013 A1 - Aguerri, M A1 - Calzada, D A1 - Montaner, D A1 - Mata, M A1 - Florido, F A1 - Quiralte, J A1 - Dopazo, J A1 - Lahoz, C A1 - Cardaba, B KW - Adult KW - Female KW - Gene Expression Profiling KW - Humans KW - Male KW - Middle Aged KW - Olea KW - Principal Component Analysis KW - Rhinitis, Allergic, Seasonal AB -

Analysis of gene-expression profiles by microarrays is useful for characterization of candidate genes, key regulatory networks, and to define phenotypes or molecular signatures which improve the diagnosis and/or classification of the allergic processes. We have used this approach in the study of olive pollen response in order to find differential molecular markers among responders and non-responders to this allergenic source. Five clinical groups, non-allergic, asymptomatic, allergic but not to olive pollen, untreated-olive-pollen allergic patients and olive-pollen allergic patients (under specific-immunotherapy), were assessed during and outside pollen seasons. Whole-genome gene expression analysis was performed in RNAs extracted from PBMCs. After assessment of data quality and principal components analysis (PCA), differential gene-expression, by multiple testing and, functional analyses by KEGG, for pathways and Gene-Ontology for biological processes were performed. Relevance was defined by fold change and corrected P values (less than 0.05). The most differential genes were validated by qRT-PCR in a larger set of individuals. Interestingly, gene-expression profiling obtained by PCA clearly showed five clusters of samples that correlated with the five clinical groups. Furthermore, differential gene expression and functional analyses revealed differential genes and pathways in the five clinical groups. The 93 most significant genes found were validated, and one set of 35 genes was able to discriminate profiles of olive pollen response. Our results, in addition to providing new information on allergic response, define a possible molecular signature for olive pollen allergy which could be useful for the diagnosis and treatment of this and other sensitizations.

VL - 27 IS - 2 U1 - https://www.ncbi.nlm.nih.gov/pubmed/23830385?dopt=Abstract ER - TY - JOUR T1 - Genome Maps, a new generation genome browser. JF - Nucleic acids research Y1 - 2013 A1 - Medina, Ignacio A1 - Salavert, Francisco A1 - Sánchez, Rubén A1 - De Maria, Alejandro A1 - Alonso, Roberto A1 - Escobar, Pablo A1 - Bleda, Marta A1 - Joaquín Dopazo KW - BAM KW - genome viewer KW - HTML5 KW - javascript KW - Next Generation Sequencing KW - NGS KW - SVG KW - VCF AB - Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. VL - 41 UR - http://nar.oxfordjournals.org/content/41/W1/W41 ER - TY - JOUR T1 - Novel genes detected by transcriptional profiling from whole-blood cells in patients with early onset of acute coronary syndrome. JF - Clin Chim Acta Y1 - 2013 A1 - Silbiger, Vivian N A1 - Luchessi, André D A1 - Hirata, Rosário D C A1 - Lima-Neto, Lídio G A1 - Cavichioli, Débora A1 - Carracedo, Ángel A1 - Brión, Maria A1 - Dopazo, Joaquin A1 - Garcia-Garcia, Francisco A1 - Dos Santos, Elizabete S A1 - Ramos, Rui F A1 - Sampaio, Marcelo F A1 - Armaganijan, Dikran A1 - Sousa, Amanda G M R A1 - Hirata, Mario H KW - Acute Coronary Syndrome KW - Acute-Phase Proteins KW - Adult KW - biomarkers KW - Blood Cells KW - Early Diagnosis KW - gene expression KW - Gene Expression Profiling KW - Humans KW - Male KW - Middle Aged KW - Oligonucleotide Array Sequence Analysis KW - RNA, Messenger KW - Transcriptome AB -

BACKGROUND: Genome-wide expression analysis using microarrays has been used as a research strategy to discovery new biomarkers and candidate genes for a number of diseases. We aim to find new biomarkers for the prediction of acute coronary syndrome (ACS) with a differentially expressed mRNA profiling approach using whole genomic expression analysis in a peripheral blood cell model from patients with early ACS.

METHODS AND RESULTS: This study was carried out in two phases. On phase 1 a restricted clinical criteria (ACS-Ph1, n=9 and CG-Ph1, n=6) was used in order to select potential mRNA biomarkers candidates. A subsequent phase 2 study was performed using selected phase 1 markers analyzed by RT-qPCR using a larger and independent casuistic (ACS-Ph2, n=74 and CG-Ph2, n=41). A total of 549 genes were found to be differentially expressed in the first 48 h after the ACS-Ph1. Technical and biological validation further confirmed that ALOX15, AREG, BCL2A1, BCL2L1, CA1, COX7B, ECHDC3, IL18R1, IRS2, KCNE1, MMP9, MYL4 and TREML4, are differentially expressed in both phases of this study.

CONCLUSIONS: Transcriptomic analysis by microarray technology demonstrated differential expression during a 48 h time course suggesting a potential use of some of these genes as biomarkers for very early stages of ACS, as well as for monitoring early cardiac ischemic recovery.

VL - 421 U1 - https://www.ncbi.nlm.nih.gov/pubmed/23535507?dopt=Abstract ER - TY - JOUR T1 - Novel genes detected by transcriptional profiling from whole-blood cells in patients with early onset of acute coronary syndrome: Transcriptional profiling of acute coronary syndrome. JF - Clinica chimica acta; international journal of clinical chemistry Y1 - 2013 A1 - Silbiger, Vivian N A1 - Luchessi, André D A1 - Hirata, Rosário D C A1 - Lima-Neto, Lídio G A1 - Cavichioli, Débora A1 - Carracedo, Ángel A1 - Brión, Maria A1 - Joaquín Dopazo A1 - Garcia-Garcia, Francisco A1 - Dos Santos, Elizabete S A1 - Ramos, Rui F A1 - Sampaio, Marcelo F A1 - Armaganijan, Dikran A1 - Sousa, Amanda G M R A1 - Hirata, Mario H AB - {BACKGROUND: Genome-wide expression analysis using microarrays has been used as a research strategy to discovery new biomarkers and candidate genes for a number of diseases. We aim to find new biomarkers for the prediction of acute coronary syndrome (ACS) with a differentially expressed mRNA profiling approach using whole genomic expression analysis in a peripheral blood cell model from patients with early ACS. METHODS AND RESULTS: This study was carried out in two phases. On phase 1 a restricted clinical criteria (ACS-Ph1 ER - TY - JOUR T1 - Pathways systematically associated to Hirschsprung’s disease. JF - Orphanet journal of rare diseases Y1 - 2013 A1 - Fernández, Raquel M A1 - Bleda, Marta A1 - Luzón-Toro, Berta A1 - García-Alonso, Luz A1 - Arnold, Stacey A1 - Sribudiani, Yunia A1 - Besmond, Claude A1 - Lantieri, Francesca A1 - Doan, Betty A1 - Ceccherini, Isabella A1 - Lyonnet, Stanislas A1 - Hofstra, Robert Mw A1 - Chakravarti, Aravinda A1 - Antiňolo, Guillermo A1 - Joaquín Dopazo A1 - Borrego, Salud KW - GWAS KW - Hirschprung KW - network analysis KW - Pathway Based Analysis AB - Despite it has been reported that several loci are involved in Hirschsprung’s disease, the molecular basis of the disease remains yet essentially unknown. The study of collective properties of modules of functionally-related genes provides an efficient and sensitive statistical framework that can overcome sample size limitations in the study of rare diseases. Here, we present the extension of a previous study of a Spanish series of HSCR trios to an international cohort of 162 HSCR trios to validate the generality of the underlying functional basis of the Hirschsprung’s disease mechanisms previously found. The Pathway-Based Analysis (PBA) confirms a strong association of gene ontology (GO) modules related to signal transduction and its regulation, enteric nervous system (ENS) formation and other processes related to the disease. In addition, network analysis recovers sub-networks significantly associated to the disease, which contain genes related to the same functionalities, thus providing an independent validation of these findings. The functional profiles of association obtained for patients populations from different countries were compared to each other. While gene associations were different at each series, the main functional associations were identical in all the five populations. These observations would also explain the reported low reproducibility of associations of individual disease genes across populations. VL - 8 UR - http://www.ojrd.com/content/8/1/187/abstract ER - TY - JOUR T1 - Pathways systematically associated to Hirschsprung's disease. JF - Orphanet J Rare Dis Y1 - 2013 A1 - Fernández, Raquel M A1 - Bleda, Marta A1 - Luzón-Toro, Berta A1 - García-Alonso, Luz A1 - Arnold, Stacey A1 - Sribudiani, Yunia A1 - Besmond, Claude A1 - Lantieri, Francesca A1 - Doan, Betty A1 - Ceccherini, Isabella A1 - Lyonnet, Stanislas A1 - Hofstra, Robert Mw A1 - Chakravarti, Aravinda A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin A1 - Borrego, Salud KW - Female KW - Genetic Predisposition to Disease KW - Genotype KW - Hirschsprung Disease KW - Humans KW - Male KW - Polymorphism, Single Nucleotide AB -

Despite it has been reported that several loci are involved in Hirschsprung's disease, the molecular basis of the disease remains yet essentially unknown. The study of collective properties of modules of functionally-related genes provides an efficient and sensitive statistical framework that can overcome sample size limitations in the study of rare diseases. Here, we present the extension of a previous study of a Spanish series of HSCR trios to an international cohort of 162 HSCR trios to validate the generality of the underlying functional basis of the Hirschsprung's disease mechanisms previously found. The Pathway-Based Analysis (PBA) confirms a strong association of gene ontology (GO) modules related to signal transduction and its regulation, enteric nervous system (ENS) formation and other processes related to the disease. In addition, network analysis recovers sub-networks significantly associated to the disease, which contain genes related to the same functionalities, thus providing an independent validation of these findings. The functional profiles of association obtained for patients populations from different countries were compared to each other. While gene associations were different at each series, the main functional associations were identical in all the five populations. These observations would also explain the reported low reproducibility of associations of individual disease genes across populations.

VL - 8 U1 - https://www.ncbi.nlm.nih.gov/pubmed/24289864?dopt=Abstract ER - TY - JOUR T1 - Whole-exome sequencing identifies novel compound heterozygous mutations in USH2A in Spanish patients with autosomal recessive retinitis pigmentosa. JF - Molecular vision Y1 - 2013 A1 - Méndez-Vidal, Cristina A1 - González-del Pozo, María A1 - Vela-Boza, Alicia A1 - Santoyo-López, Javier A1 - López-Domingo, Francisco J A1 - Vázquez-Marouschek, Carmen A1 - Dopazo, Joaquin A1 - Borrego, Salud A1 - Antiňolo, Guillermo AB - PURPOSE: Retinitis pigmentosa (RP) is an inherited retinal dystrophy characterized by extreme genetic and clinical heterogeneity. Thus, the diagnosis is not always easily performed due to phenotypic and genetic overlap. Current clinical practices have focused on the systematic evaluation of a set of known genes for each phenotype, but this approach may fail in patients with inaccurate diagnosis or infrequent genetic cause. In the present study, we investigated the genetic cause of autosomal recessive RP (arRP) in a Spanish family in which the causal mutation has not yet been identified with primer extension technology and resequencing. METHODS: We designed a whole-exome sequencing (WES)-based approach using NimbleGen SeqCap EZ Exome V3 sample preparation kit and the SOLiD 5500×l next-generation sequencing platform. We sequenced the exomes of both unaffected parents and two affected siblings. Exome analysis resulted in the identification of 43,204 variants in the index patient. All variants passing filter criteria were validated with Sanger sequencing to confirm familial segregation and absence in the control population. In silico prediction tools were used to determine mutational impact on protein function and the structure of the identified variants. RESULTS: Novel Usher syndrome type 2A (USH2A) compound heterozygous mutations, c.4325T>C (p.F1442S) and c.15188T>G (p.L5063R), located in exons 20 and 70, respectively, were identified as probable causative mutations for RP in this family. Family segregation of the variants showed the presence of both mutations in all affected members and in two siblings who were apparently asymptomatic at the time of family ascertainment. Clinical reassessment confirmed the diagnosis of RP in these patients. CONCLUSIONS: Using WES, we identified two heterozygous novel mutations in USH2A as the most likely disease-causing variants in a Spanish family diagnosed with arRP in which the cause of the disease had not yet been identified with commonly used techniques. Our data reinforce the clinical role of WES in the molecular diagnosis of highly heterogeneous genetic diseases where conventional genetic approaches have previously failed in achieving a proper diagnosis. VL - 19 UR - http://www.molvis.org/molvis/v19/2187/ ER - TY - JOUR T1 - Development, Characterization and Experimental Validation of a Cultivated Sunflower (Helianthus annuus L.) Gene Expression Oligonucleotide Microarray. JF - PloS one Y1 - 2012 A1 - Fernandez, Paula A1 - Soria, Marcelo A1 - Blesa, David A1 - Dirienzo, Julio A1 - Moschen, Sebastián A1 - Rivarola, Máximo A1 - Clavijo, Bernardo Jose A1 - Gonzalez, Sergio A1 - Peluffo, Lucila A1 - Príncipi, Dario A1 - Dosio, Guillermo A1 - Aguirrezabal, Luis A1 - Garcia-Garcia, Francisco A1 - Ana Conesa A1 - Hopp, Esteban A1 - Joaquín Dopazo A1 - Heinz, Ruth Amelia A1 - Paniego, Norma AB - Oligonucleotide-based microarrays with accurate gene coverage represent a key strategy for transcriptional studies in orphan species such as sunflower, H. annuus L., which lacks full genome sequences. The goal of this study was the development and functional annotation of a comprehensive sunflower unigene collection and the design and validation of a custom sunflower oligonucleotide-based microarray. A large scale EST (>130,000 ESTs) curation, assembly and sequence annotation was performed using Blast2GO (www.blast2go.de). The EST assembly comprises 41,013 putative transcripts (12,924 contigs and 28,089 singletons). The resulting Sunflower Unigen Resource (SUR version 1.0) was used to design an oligonucleotide-based Agilent microarray for cultivated sunflower. This microarray includes a total of 42,326 features: 1,417 Agilent controls, 74 control probes for sunflower replicated 10 times (740 controls) and 40,169 different non-control probes. Microarray performance was validated using a model experiment examining the induction of senescence by water deficit. Pre-processing and differential expression analysis of Agilent microarrays was performed using the Bioconductor limma package. The analyses based on p-values calculated by eBayes (p<0.01) allowed the detection of 558 differentially expressed genes between water stress and control conditions; from these, ten genes were further validated by qPCR. Over-represented ontologies were identified using FatiScan in the Babelomics suite. This work generated a curated and trustable sunflower unigene collection, and a custom, validated sunflower oligonucleotide-based microarray using Agilent technology. Both the curated unigene collection and the validated oligonucleotide microarray provide key resources for sunflower genome analysis, transcriptional studies, and molecular breeding for crop improvement. VL - 7 UR - http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0045899 ER - TY - JOUR T1 - Discovering the hidden sub-network component in a ranked list of genes or proteins derived from genomic experiments. JF - Nucleic Acids Res Y1 - 2012 A1 - García-Alonso, Luz A1 - Alonso, Roberto A1 - Vidal, Enrique A1 - Amadoz, Alicia A1 - De Maria, Alejandro A1 - Minguez, Pablo A1 - Medina, Ignacio A1 - Dopazo, Joaquin KW - Bipolar Disorder KW - Fanconi Anemia KW - Gene Regulatory Networks KW - Genes, Neoplasm KW - Genome-Wide Association Study KW - Genomics KW - Humans KW - Protein Interaction Mapping AB -

Genomic experiments (e.g. differential gene expression, single-nucleotide polymorphism association) typically produce ranked list of genes. We present a simple but powerful approach which uses protein-protein interaction data to detect sub-networks within such ranked lists of genes or proteins. We performed an exhaustive study of network parameters that allowed us concluding that the average number of components and the average number of nodes per component are the parameters that best discriminate between real and random networks. A novel aspect that increases the efficiency of this strategy in finding sub-networks is that, in addition to direct connections, also connections mediated by intermediate nodes are considered to build up the sub-networks. The possibility of using of such intermediate nodes makes this approach more robust to noise. It also overcomes some limitations intrinsic to experimental designs based on differential expression, in which some nodes are invariant across conditions. The proposed approach can also be used for candidate disease-gene prioritization. Here, we demonstrate the usefulness of the approach by means of several case examples that include a differential expression analysis in Fanconi Anemia, a genome-wide association study of bipolar disorder and a genome-scale study of essentiality in cancer genes. An efficient and easy-to-use web interface (available at http://www.babelomics.org) based on HTML5 technologies is also provided to run the algorithm and represent the network.

VL - 40 IS - 20 U1 - https://www.ncbi.nlm.nih.gov/pubmed/22844098?dopt=Abstract ER - TY - JOUR T1 - Evolutionary Genomics of Genes Involved in Olfactory Behavior in the Drosophila melanogaster Species Group. JF - Evolutionary bioinformatics online Y1 - 2012 A1 - Lavagnino, Nicolás A1 - Serra, François A1 - Arbiza, Leonardo A1 - Dopazo, Hernán A1 - Hasson, Esteban AB - Previous comparative genomic studies of genes involved in olfactory behavior in Drosophila focused only on particular gene families such as odorant receptor and/or odorant binding proteins. However, olfactory behavior has a complex genetic architecture that is orchestrated by many interacting genes. In this paper, we present a comparative genomic study of olfactory behavior in Drosophila including an extended set of genes known to affect olfactory behavior. We took advantage of the recent burst of whole genome sequences and the development of powerful statistical tools to analyze genomic data and test evolutionary and functional hypotheses of olfactory genes in the six species of the Drosophila melanogaster species group for which whole genome sequences are available. Our study reveals widespread purifying selection and limited incidence of positive selection on olfactory genes. We show that the pace of evolution of olfactory genes is mostly independent of the life cycle stage, and of the number of life cycle stages, in which they participate in olfaction. However, we detected a relationship between evolutionary rates and the position that the gene products occupy in the olfactory system, genes occupying central positions tend to be more constrained than peripheral genes. Finally, we demonstrate that specialization to one host does not seem to be associated with bursts of adaptive evolution in olfactory genes in D. sechellia and D. erecta, the two specialists species analyzed, but rather different lineages have idiosyncratic evolutionary histories in which both historical and ecological factors have been involved. VL - 8 UR - http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3273929/?tool=pubmed ER - TY - JOUR T1 - Four new loci associations discovered by pathway-based and network analyses of the genome-wide variability profile of Hirschsprung's disease. JF - Orphanet J Rare Dis Y1 - 2012 A1 - Fernández, Raquel Ma A1 - Bleda, Marta A1 - Núñez-Torres, Rocío A1 - Medina, Ignacio A1 - Luzón-Toro, Berta A1 - García-Alonso, Luz A1 - Torroglosa, Ana A1 - Marbà, Martina A1 - Enguix-Riego, Ma Valle A1 - Montaner, David A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin A1 - Borrego, Salud KW - Female KW - Genetic Predisposition to Disease KW - Genome-Wide Association Study KW - Genotype KW - Hirschsprung Disease KW - Humans KW - Male AB -

Finding gene associations in rare diseases is frequently hampered by the reduced numbers of patients accessible. Conventional gene-based association tests rely on the availability of large cohorts, which constitutes a serious limitation for its application in this scenario. To overcome this problem we have used here a combined strategy in which a pathway-based analysis (PBA) has been initially conducted to prioritize candidate genes in a Spanish cohort of 53 trios of short-segment Hirschsprung's disease. Candidate genes have been further validated in an independent population of 106 trios. The study revealed a strong association of 11 gene ontology (GO) modules related to signal transduction and its regulation, enteric nervous system (ENS) formation and other HSCR-related processes. Among the preselected candidates, a total of 4 loci, RASGEF1A, IQGAP2, DLC1 and CHRNA7, related to signal transduction and migration processes, were found to be significantly associated to HSCR. Network analysis also confirms their involvement in the network of already known disease genes. This approach, based on the study of functionally-related gene sets, requires of lower sample sizes and opens new opportunities for the study of rare diseases.

VL - 7 U1 - https://www.ncbi.nlm.nih.gov/pubmed/23270508?dopt=Abstract ER - TY - JOUR T1 - Four new loci associations discovered by pathway-based and network analyses of the genome-wide variability profile of Hirschsprung’s disease. JF - Orphanet journal of rare diseases Y1 - 2012 A1 - Fernández, Raquel Ma A1 - Bleda, Marta A1 - Núñez-Torres, Rocío A1 - Medina, Ignacio A1 - Luzón-Toro, Berta A1 - García-Alonso, Luz A1 - Torroglosa, Ana A1 - Marbà, Martina A1 - Enguix-Riego, Ma Valle A1 - Montaner, David A1 - Antiňolo, Guillermo A1 - Joaquín Dopazo A1 - Borrego, Salud AB - ABSTRACT: Finding gene associations in rare diseases is frequently hampered by the reduced numbers of patients accessible. Conventional gene-based association tests rely on the availability of large cohorts, which constitutes a serious limitation for its application in this scenario. To overcome this problem we have used here a combined strategy in which a pathway-based analysis (PBA) has been initially conducted to prioritize candidate genes in a Spanish cohort of 53 trios of short-segment Hirschsprung’s disease. Candidate genes have been further validated in an independent population of 106 trios. The study revealed a strong association of 11 gene ontology (GO) modules related to signal transduction and its regulation, enteric nervous system (ENS) formation and other HSCR-related processes. Among the preselected candidates, a total of 4 loci, RASGEF1A, IQGAP2, DLC1 and CHRNA7, related to signal transduction and migration processes, were found to be significantly associated to HSCR. Network analysis also confirms their involvement in the network of already known disease genes. This approach, based on the study of functionally-related gene sets, requires of lower sample sizes and opens new opportunities for the study of rare diseases. VL - 7 UR - http://www.ojrd.com/content/7/1/103/abstract ER - TY - JOUR T1 - Inferring the regulatory network behind a gene expression experiment. JF - Nucleic Acids Res Y1 - 2012 A1 - Bleda, Marta A1 - Medina, Ignacio A1 - Alonso, Roberto A1 - De Maria, Alejandro A1 - Salavert, Francisco A1 - Dopazo, Joaquin KW - Binding Sites KW - Databases, Genetic KW - Fanconi Anemia KW - Gene Regulatory Networks KW - Internet KW - MicroRNAs KW - Software KW - Transcription Factors KW - Transcriptome AB -

Transcription factors (TFs) and miRNAs are the most important dynamic regulators in the control of gene expression in multicellular organisms. These regulatory elements play crucial roles in development, cell cycling and cell signaling, and they have also been associated with many diseases. The Regulatory Network Analysis Tool (RENATO) web server makes the exploration of regulatory networks easy, enabling a better understanding of functional modularity and network integrity under specific perturbations. RENATO is suitable for the analysis of the result of expression profiling experiments. The program analyses lists of genes and search for the regulators compatible with its activation or deactivation. Tests of single enrichment or gene set enrichment allow the selection of the subset of TFs or miRNAs significantly involved in the regulation of the query genes. RENATO also offers an interactive advanced graphical interface that allows exploring the regulatory network found.RENATO is available at: http://renato.bioinfo.cipf.es/.

VL - 40 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/22693210?dopt=Abstract ER - TY - JOUR T1 - A map of human microRNA variation uncovers unexpectedly high levels of variability. JF - Genome medicine Y1 - 2012 A1 - Carbonell, José A1 - Alloza, Eva A1 - Arce, Pablo A1 - Borrego, Salud A1 - Santoyo, Javier A1 - Ruiz-Ferrer, Macarena A1 - Medina, Ignacio A1 - Jiménez-Almazán, Jorge A1 - Méndez-Vidal, Cristina A1 - González-del Pozo, María A1 - Vela, Alicia A1 - Bhattacharya, Shomi S A1 - Antiňolo, Guillermo A1 - Dopazo, Joaquin KW - NGS AB - ABSTRACT: BACKGROUND: MicroRNAs (miRNAs) are key components of the gene regulatory network in many species. During the past few years, these regulatory elements have been shown to be involved in an increasing number and range of diseases. Consequently, the compilation of a comprehensive map of natural variability in healthy population seems an obvious requirement for future research on miRNA-related pathologies. METHODS: Data on 14 populations from the 1000 Genomes Project were analysed, along with new data extracted from 60 exomes of healthy individuals from a southern Spain population, sequenced in the context of the Medical Genome Project, to derive an accurate map of miRNA variability. RESULTS: Despite the common belief that miRNAs are highly conserved elements, analysis of the sequences of the 1,152 individuals indicated that the observed level of variability is double what was expected. A total of 527 variants were found. Among these, 45 variants affected the recognition region of the corresponding miRNA and were found in 43 different miRNAs, 26 of which are known to be involved in 57 diseases. Different parts of the mature structure of the miRNA were affected to different degrees by variants, which suggests the existence of a selective pressure related to the relative functional impact of the change. Moreover, 41 variants showed a significant deviation from the Hardy-Weinberg equilibrium, which supports the existence of a selective process against some alleles. The average number of variants per individual in miRNAs was 28. CONCLUSIONS: Despite an expectation that miRNAs would be highly conserved genomic elements, our study reports a level of variability comparable to that observed for coding genes. VL - 4 UR - http://genomemedicine.com/content/4/8/62/abstract ER - TY - JOUR T1 - Microarray analysis of Etrog citron (Citrus medica L.) reveals changes in chloroplast, cell wall, peroxidase and symporter activities in response to viroid infection. JF - Molecular plant pathology Y1 - 2012 A1 - Rizza, Serena A1 - Ana Conesa A1 - Juarez, José A1 - Catara, Antonino A1 - Navarro, Luis A1 - Duran-Vila, Nuria A1 - Ancillo, Gema AB - Viroids are small (246-401 nucleotides), single-stranded, circular RNA molecules that infect several crop plants and can cause diseases of economic importance. Citrus are the hosts in which the largest number of viroids have been identified. Citrus exocortis viroid (CEVd), the causal agent of citrus exocortis disease, induces considerable losses in citrus crops. Changes in the gene expression profile during the early (pre-symptomatic) and late (post-symptomatic) stages of Etrog citron infected with CEVd were investigated using a citrus cDNA microarray. MaSigPro analysis was performed and, on the basis of gene expression profiles as a function of the time after infection, the differentially expressed genes were classified into five clusters. FatiScan analysis revealed significant enrichment of functional categories for each cluster, indicating that viroid infection triggers important changes in chloroplast, cell wall, peroxidase and symporter activities. ER - TY - JOUR T1 - The protease MT1-MMP drives a combinatorial proteolytic program in activated endothelial cells. JF - FASEB J Y1 - 2012 A1 - Koziol, Agnieszka A1 - Gonzalo, Pilar A1 - Mota, Alba A1 - Pollán, Angela A1 - Lorenzo, Cristina A1 - Colomé, Nuria A1 - Montaner, David A1 - Dopazo, Joaquin A1 - Arribas, Joaquín A1 - Canals, Francesc A1 - Arroyo, Alicia G KW - Animals KW - Blotting, Western KW - Combinatorial Chemistry Techniques KW - Computational Biology KW - Endothelial Cells KW - Gene Expression Regulation, Enzymologic KW - Inflammation KW - Matrix Metalloproteinase 14 KW - Mice KW - Protein Array Analysis KW - Reverse Transcriptase Polymerase Chain Reaction KW - RNA Interference KW - RNA, Small Interfering KW - Transcriptome KW - Tumor Necrosis Factor-alpha AB -

The mechanism by which proteolytic events translate into biological responses is not well understood. To explore the link of pericellular proteolysis to events relevant to capillary sprouting within the inflammatory context, we aimed at the identification of the collection of substrates of the protease MT1-MMP in endothelial tip cells induced by inflammatory stimuli. We applied quantitative proteomics to endothelial cells (ECs) derived from wild-type and MT1-MMP-null mice to identify the substrate repertoire of this protease in TNF-α-activated ECs. Bioinformatics analysis revealed a combinatorial MT1-MMP proteolytic program, in which combined rather than single substrate processing would determine biological decisions by activated ECs, including chemotaxis, cell motility and adhesion, and vasculature development. MT1-MMP-deficient ECs inefficiently processed several of these substrates (TSP1, CYR61, NID1, and SEM3C), validating the model. This novel concept of MT1-MMP-driven combinatorial proteolysis in angiogenesis might be extendable to proteolytic actions in other cellular contexts.

VL - 26 IS - 11 U1 - https://www.ncbi.nlm.nih.gov/pubmed/22859368?dopt=Abstract ER - TY - JOUR T1 - Select your SNPs (SYSNPs): a web tool for automatic and massive selection of SNPs. JF - International journal of data mining and bioinformatics Y1 - 2012 A1 - Lorente-Galdos, Belén A1 - Medina, Ignacio A1 - Morcillo-Suarez, Carlos A1 - Heredia, Txema A1 - Carreño-Torres, Angel A1 - Sangrós, Ricardo A1 - Alegre, Josep A1 - Pita, Guillermo A1 - Vellalta, Gemma A1 - Malats, Nuria A1 - Pisano, David G A1 - Joaquín Dopazo A1 - Navarro, Arcadi AB - Association studies are the choice approach in the discovery of the genomic basis of complex traits. To carry out such analysis, researchers frequently need to (1) select optimally informative sets of Single Nucleotide Polymorphisms (SNPs) in candidate regions and (2) annotate the results of associations found by means of genome-wide SNP arrays. These are complex tasks, since many criteria have to be considered, including the SNPs’ functional properties, technological information and haplotype frequencies in given populations. SYSNPs implements algorithms that allow for efficient and simultaneous consideration of all the relevant criteria to obtain sets of SNPs that properly cover arbitrarily large lists of genes or genomic regions. Complementarily, SYSNPs allows for comprehensive functional annotation of SNPs linked to any given marker SNP. SYSNPs dramatically reduces the effort needed for SNP selection from days of searching various databases to a few minutes using a simple browser. VL - 6 UR - http://inderscience.metapress.com/content/f76740x8071u513n/ ER - TY - JOUR T1 - Transdifferentiation of MALME-3M and MCF-7 Cells toward Adipocyte-like Cells is Dependent on Clathrin-mediated Endocytosis. JF - SpringerPlus Y1 - 2012 A1 - Carcel-Trullols, Jaime A1 - Aguilar-Gallardo, Cristóbal A1 - García-Alcalde, Fernando A1 - Pardo-Cea, Miguel Angel A1 - Dopazo, Joaquin A1 - Ana Conesa A1 - Simon, Carlos AB - ABSTRACT: Enforced cell transdifferentiation of human cancer cells is a promising alternative to conventional chemotherapy. We previously identified albumin-associated lipid- and, more specifically, saturated fatty acid-induced transdifferentiation programs in human cancer cells (HCCLs). In this study, we further characterized the adipocyte-like cells, resulting from the transdifferentiation of human cancer cell lines MCF-7 and MALME-3M, and proposed a common mechanistic approach for these transdifferentiating programs. We showed the loss of pigmentation in MALME-3M cells treated with albumin-associated lipids, based on electron microscopic analysis, and the overexpression of perilipin 2 (PLIN2) by western blotting in MALME-3M and MCF-7 cells treated with unsaturated fatty acids. Comparing the gene expression profiles of naive melanoma MALME-3M cells and albumin-associated lipid-treated cells, based on RNA sequencing, we confirmed the transcriptional upregulation of some key adipogenic gene markers and also an alternative splicing of the adipogenic master regulator PPARG, that is probably related to the reported up regulated expression of the protein. Most importantly, these results also showed the upregulation of genes responsible for Clathrin (CLTC) and other adaptor-related proteins. An increase in CLTC expression in the transdifferentiated cells was confirmed by western blotting. Inactivation of CLTC by chlorpromazine (CHP), an inhibitor of CTLC mediated endocytosis (CME), and gene silencing by siRNAs, partially reversed the accumulation of neutral lipids observed in the transdifferentiated cells. These findings give a deeper insight into the phenotypic changes observed in HCCL to adipocyte-like transdifferentiation and point towards CME as a key pathway in distinct transdifferentiation programs. DISCLOSURES: Simon C and Aguilar-Gallardo C are co-inventors of the International Patent Application No. PCT/EP2011/004941 entitled "Methods for tumor treatment and adipogenesis differentiation". VL - 1 UR - http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3725915/ ER - TY - JOUR T1 - VARIANT: Command Line, Web service and Web interface for fast and accurate functional characterization of variants found by Next-Generation Sequencing. JF - Nucleic Acids Res Y1 - 2012 A1 - Medina, Ignacio A1 - De Maria, Alejandro A1 - Bleda, Marta A1 - Salavert, Francisco A1 - Alonso, Roberto A1 - Gonzalez, Cristina Y A1 - Dopazo, Joaquin KW - Databases, Nucleic Acid KW - Genetic Variation KW - High-Throughput Nucleotide Sequencing KW - Internet KW - Molecular Sequence Annotation KW - mutation KW - Polymorphism, Single Nucleotide KW - Software KW - User-Computer Interface AB -

The massive use of Next-Generation Sequencing (NGS) technologies is uncovering an unexpected amount of variability. The functional characterization of such variability, particularly in the most common form of variation found, the Single Nucleotide Variants (SNVs), has become a priority that needs to be addressed in a systematic way. VARIANT (VARIant ANalyis Tool) reports information on the variants found that include consequence type and annotations taken from different databases and repositories (SNPs and variants from dbSNP and 1000 genomes, and disease-related variants from the Genome-Wide Association Study (GWAS) catalog, Online Mendelian Inheritance in Man (OMIM), Catalog of Somatic Mutations in Cancer (COSMIC) mutations, etc). VARIANT also produces a rich variety of annotations that include information on the regulatory (transcription factor or miRNA-binding sites, etc.) or structural roles, or on the selective pressures on the sites affected by the variation. This information allows extending the conventional reports beyond the coding regions and expands the knowledge on the contribution of non-coding or synonymous variants to the phenotype studied. Contrarily to other tools, VARIANT uses a remote database and operates through efficient RESTful Web Services that optimize search and transaction operations. In this way, local problems of installation, update or disk size limitations are overcome without the need of sacrifice speed (thousands of variants are processed per minute). VARIANT is available at: http://variant.bioinfo.cipf.es.

VL - 40 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/22693211?dopt=Abstract ER - TY - JOUR T1 - Analysis of normal-tumour tissue interaction in tumours: prediction of prostate cancer features from the molecular profile of adjacent normal cells. JF - PloS one Y1 - 2011 A1 - Trevino, Victor A1 - Tadesse, Mahlet G A1 - Vannucci, Marina A1 - Fatima Al-Shahrour A1 - Antczak, Philipp A1 - Durant, Sarah A1 - Bikfalvi, Andreas A1 - Dopazo, Joaquin A1 - Campbell, Moray J A1 - Falciani, Francesco AB -

Statistical modelling, in combination with genome-wide expression profiling techniques, has demonstrated that the molecular state of the tumour is sufficient to infer its pathological state. These studies have been extremely important in diagnostics and have contributed to improving our understanding of tumour biology. However, their importance in in-depth understanding of cancer patho-physiology may be limited since they do not explicitly take into consideration the fundamental role of the tissue microenvironment in specifying tumour physiology. Because of the importance of normal cells in shaping the tissue microenvironment we formulate the hypothesis that molecular components of the profile of normal epithelial cells adjacent the tumour are predictive of tumour physiology. We addressed this hypothesis by developing statistical models that link gene expression profiles representing the molecular state of adjacent normal epithelial cells to tumour features in prostate cancer. Furthermore, network analysis showed that predictive genes are linked to the activity of important secreted factors, which have the potential to influence tumor biology, such as IL1, IGF1, PDGF BB, AGT, and TGFβ.

VL - 6 ER - TY - JOUR T1 - B2G-FAR, a species centered GO annotation repository. JF - Bioinformatics (Oxford, England) Y1 - 2011 A1 - Götz, Stefan A1 - Arnold, Roland A1 - Sebastián-Leon, Patricia A1 - Martín-Rodríguez, Samuel A1 - Tischler, Patrick A1 - Jehl, Marc-André A1 - Joaquín Dopazo A1 - Rattei, Thomas A1 - Ana Conesa AB -

MOTIVATION: Functional genomics research has expanded enormously in the last decade thanks to the cost-reduction in high-throughput technologies and the development of computational tools that generate, standardize and share information on gene and protein function such as the Gene Ontology (GO). Nevertheless many biologists, especially working with non-model organisms, still suffer from non-existing or low coverage functional annotation, or simply struggle retrieving, summarizing and querying these data. RESULTS: The Blast2GO Functional Annotation Repository (B2G-FAR) is a bioinformatics resource envisaged to provide functional information for otherwise uncharacterized sequence-data and offers data-mining tools to analyze a larger repertoire of species than currently available. This new annotation resource has been created by applying the Blast2GO functional annotation engine in a strongly high-throughput manner to the entire space of public available sequences. The resulting repository contains GO term predictions for over 13.2 million non-redundant protein sequences based on BLAST search alignments from the SIMAP database. We generated GO annotation for approximately 150.000 different taxa making available the 2000 species with the highest coverage through B2G-FAR. A second section within B2G-FAR holds functional annotations for 17 non-model organism Affymetrix GeneChips. Conclusions: B2G-FAR provides easy access to exhaustive functional annotation for 2000 species offering a good balance between quality and quantity, thereby supporting functional genomics research especially in the case of non-model organisms. AVAILABILITY: The annotation resource is available at http://b2gfar.bioinfo.cipf.es. CONTACT: aconesa@cipf.es, sgoetz@cipf.es.

VL - 27 ER - TY - JOUR T1 - Does singlet oxygen activate cell death in Arabidopsis cell suspension cultures? Analysis of the early transcriptional defence responses to high light stress. JF - Plant signaling & behavior Y1 - 2011 A1 - Gutiérrez, Jorge A1 - González-Pérez, Sergio A1 - Garcia-Garcia, Francisco A1 - Lorenzo, Oscar A1 - Arellano, Juan B AB -

Can Arabidopsis cell suspension cultures (ACSC) provide a useful working model to investigate genetically-controlled defence responses with signalling cascades starting in chloroplasts? In order to provide a convincing answer, we analysed the early transcriptional profile of Arabidopsis cells at high light (HL). The results showed that ACSC respond to HL in a manner that resembles the singlet oxygen ( ( 1) O 2)-mediated defence responses described for the conditional fluorescent (flu) mutant of Arabidopsis thaliana. The flu mutant is characterized by the accumulation of free protochlorophyllide (Pchlide) in plastids when put into darkness and the subsequent production of ( 1) O 2 when the light is on. In ACSC, ( 1) O 2 is produced in chloroplasts at HL when excess excitation energy flows into photosystem II (PSII). Other reactive oxygen species are also produced in ACSC at HL, but to a lesser extent. When the HL stress ceases, ACSC recovers the initial rate of oxygen evolution and cell growth continues. We can conclude that chloroplasts of ACSC are both photosynthetically active and capable of initiating ( 1) O 2-mediated signalling cascades that activate a broad range of genetically-controlled defence responses. The up-regulation of transcripts associated with the biosynthesis and signalling pathways of OPDA (12-oxophytodienoic acid) and ethylene (ET) suggests that the activated defence responses at HL are governed by these two hormones. In contrast to the flu mutant, the ( 1) O 2-mediated defence responses were independent of the up-regulation of EDS1 (enhanced disease susceptibility) required for the accumulation of salicylic acid (SA) and genetically-controlled cell death. 

VL - 6 ER - TY - JOUR T1 - Early transcriptional defence responses in Arabidopsis cell suspension culture under high light conditions. JF - Plant physiology Y1 - 2011 A1 - González-Pérez, Sergio A1 - Gutiérrez, Jorge A1 - Garcia-Garcia, Francisco A1 - Osuna, Daniel A1 - Joaquín Dopazo A1 - Lorenzo, Oscar A1 - Revuelta, José L A1 - Arellano, Juan B AB -

The early transcriptional defence responses and ROS production in Arabidopsis cell suspension culture (ACSC), containing functional chloroplasts, were examined at high light (HL). The transcriptional analysis revealed that most of the ROS markers identified among the 449 transcripts with significant differential expression were transcripts specifically up-regulated by singlet oxygen (1O2). On the contrary, minimal correlation was established with transcripts specifically up-regulated by superoxide radical (O2•) or hydrogen peroxide (H2O2). The transcriptional analysis was supported by fluorescence microscopy experiments. The incubation of ACSC with the 1O2 sensor green reagent and 2’,7’-dichlorofluorescein diacetate showed that the 30-min-HL-treated cultures emitted fluorescence that corresponded with the production of 1O2, but not of H2O2. Furthermore, the in vivo photodamage of the D1 protein of photosystem II (PSII) indicated that the photogeneration of 1O2 took place within the PSII reaction centre. Functional enrichment analyses identified transcripts that are key components of the ROS signalling transduction pathway in plants as well as others encoding transcription factors that regulate both ROS scavenging and water deficit stress. A meta-analysis examining the transcriptional profiles of mutants and hormone treatments in Arabidopsis showed a high correlation between ACSC at HL and the flu mutant family of Arabidopsis, a producer of 1O2 in plastids. Intriguingly, a high correlation was also observed with aba1 and max4, two mutants with defects in the biosynthesis pathways of two key (apo)carotenoid-derived plant hormones (i.e. ABA and strigolactones, respectively). ACSC has proven to be a valuable system for studying early transcriptional responses to HL stress.

VL - 156 UR - http://www.plantphysiol.org/content/early/2011/04/29/pp.111.177766.short?keytype=ref&ijkey=ph5B6J2khjnqwzN ER - TY - JOUR T1 - Early transcriptional defense responses in Arabidopsis cell suspension culture under high-light conditions. JF - Plant Physiol Y1 - 2011 A1 - González-Pérez, Sergio A1 - Gutiérrez, Jorge A1 - Garcia-Garcia, Francisco A1 - Osuna, Daniel A1 - Dopazo, Joaquin A1 - Lorenzo, Oscar A1 - Revuelta, José L A1 - Arellano, Juan B KW - Arabidopsis KW - Blotting, Western KW - Cell Culture Techniques KW - Cells, Cultured KW - Chloroplasts KW - Cluster Analysis KW - Gene Expression Profiling KW - Gene Expression Regulation, Plant KW - Hydrogen Peroxide KW - Light KW - mutation KW - Oligonucleotide Array Sequence Analysis KW - Photosystem II Protein Complex KW - Plant Growth Regulators KW - Reproducibility of Results KW - Reverse Transcriptase Polymerase Chain Reaction KW - RNA, Messenger KW - Signal Transduction KW - Stress, Physiological KW - Transcription, Genetic AB -

The early transcriptional defense responses and reactive oxygen species (ROS) production in Arabidopsis (Arabidopsis thaliana) cell suspension culture (ACSC), containing functional chloroplasts, were examined at high light (HL). The transcriptional analysis revealed that most of the ROS markers identified among the 449 transcripts with significant differential expression were transcripts specifically up-regulated by singlet oxygen ((1)O(2)). On the contrary, minimal correlation was established with transcripts specifically up-regulated by superoxide radical or hydrogen peroxide. The transcriptional analysis was supported by fluorescence microscopy experiments. The incubation of ACSC with the (1)O(2) sensor green reagent and 2',7'-dichlorofluorescein diacetate showed that the 30-min-HL-treated cultures emitted fluorescence that corresponded with the production of (1)O(2) but not of hydrogen peroxide. Furthermore, the in vivo photodamage of the D1 protein of photosystem II indicated that the photogeneration of (1)O(2) took place within the photosystem II reaction center. Functional enrichment analyses identified transcripts that are key components of the ROS signaling transduction pathway in plants as well as others encoding transcription factors that regulate both ROS scavenging and water deficit stress. A meta-analysis examining the transcriptional profiles of mutants and hormone treatments in Arabidopsis showed a high correlation between ACSC at HL and the fluorescent mutant family of Arabidopsis, a producer of (1)O(2) in plastids. Intriguingly, a high correlation was also observed with ABA deficient1 and more axillary growth4, two mutants with defects in the biosynthesis pathways of two key (apo)carotenoid-derived plant hormones (i.e. abscisic acid and strigolactones, respectively). ACSC has proven to be a valuable system for studying early transcriptional responses to HL stress.

VL - 156 IS - 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/21531897?dopt=Abstract ER - TY - JOUR T1 - Genome-wide heterogeneity of nucleotide substitution model fit. JF - Genome biology and evolution Y1 - 2011 A1 - Arbiza, Leonardo A1 - Patricio, Mateus A1 - Dopazo, Hernán A1 - Posada, David AB -

At a genomic scale, the patterns that have shaped molecular evolution are believed to be largely heterogeneous. Consequently, comparative analyses should use appropriate probabilistic substitution models that capture the main features under which different genomic regions have evolved. While efforts have concentrated in the development and understanding of model selection techniques, no descriptions of overall relative substitution model fit at the genome level have been reported. Here, we provide a characterization of best-fit substitution models across three genomic data sets including coding regions from mammals, vertebrates, and Drosophila (24,000 alignments). According to the Akaike Information Criterion (AIC), 82 of 88 models considered were selected as best-fit models at least in one occasion, although with very different frequencies. Most parameter estimates also varied broadly among genes. Patterns found for vertebrates and Drosophila were quite similar and often more complex than those found in mammals. Phylogenetic trees derived from models in the 95% confidence interval set showed much less variance and were significantly closer to the tree estimated under the best-fit model than trees derived from models outside this interval. Although alternative criteria selected simpler models than the AIC, they suggested similar patterns. All together our results show that at a genomic scale, different gene alignments for the same set of taxa are best explained by a large variety of different substitution models and that model choice has implications on different parameter estimates including the inferred phylogenetic trees. After taking into account the differences related to sample size, our results suggest a noticeable diversity in the underlying evolutionary process. All together, we conclude that the use of model selection techniques is important to obtain consistent phylogenetic estimates from real data at a genomic scale.

VL - 3 ER - TY - JOUR T1 - A large scale survey reveals that chromosomal copy-number alterations significantly affect gene modules involved in cancer initiation and progression JF - BMC Medical Genomics Y1 - 2011 A1 - Alloza, E. A1 - Fatima Al-Shahrour A1 - Cigudosa, J. C. A1 - Dopazo, J. AB -

Background

Recent observations point towards the existence of a large number of neighborhoods composed of functionally-related gene modules that lie together in the genome. This local component in the distribution of the functionality across chromosomes is probably affecting the own chromosomal architecture by limiting the possibilities in which genes can be arranged and distributed across the genome. As a direct consequence of this fact it is therefore presumable that diseases such as cancer, harboring DNA copy number alterations (CNAs), will have a symptomatology strongly dependent on modules of functionally-related genes rather than on a unique "important" gene.

Methods

We carried out a systematic analysis of more than 140,000 observations of CNAs in cancers and searched by enrichments in gene functional modules associated to high frequencies of loss or gains.

Results

The analysis of CNAs in cancers clearly demonstrates the existence of a significant pattern of loss of gene modules functionally related to cancer initiation and progression along with the amplification of modules of genes related to unspecific defense against xenobiotics (probably chemotherapeutical agents). With the extension of this analysis to an Array-CGH dataset (glioblastomas) from The Cancer Genome Atlas we demonstrate the validity of this approach to investigate the functional impact of CNAs.

Conclusions

The presented results indicate promising clinical and therapeutic implications. Our findings also directly point out to the necessity of adopting a function-centric, rather a gene-centric, view in the understanding of phenotypes or diseases harboring CNAs.

VL - 4 UR - http://www.biomedcentral.com/1755-8794/4/37 ER - TY - JOUR T1 - Large-scale transcriptional profiling and functional assays reveal important roles for Rho-GTPase signalling and SCL during haematopoietic differentiation of human embryonic stem cells. JF - Hum Mol Genet Y1 - 2011 A1 - Yung, Sun A1 - Ledran, Maria A1 - Moreno-Gimeno, Inmaculada A1 - Conesa, Ana A1 - Montaner, David A1 - Dopazo, Joaquin A1 - Dimmick, Ian A1 - Slater, Nicholas J A1 - Marenah, Lamin A1 - Real, Pedro J A1 - Paraskevopoulou, Iliana A1 - Bisbal, Viviana A1 - Burks, Deborah A1 - Santibanez-Koref, Mauro A1 - Moreno, Ruben A1 - Mountford, Joanne A1 - Menendez, Pablo A1 - Armstrong, Lyle A1 - Lako, Majlinda KW - Acute Disease KW - Anemia, Hemolytic KW - Animals KW - Basic Helix-Loop-Helix Transcription Factors KW - Cell Differentiation KW - Cell Line KW - Cell Lineage KW - Cluster Analysis KW - Embryonic Stem Cells KW - Erythroid Cells KW - Flow Cytometry KW - Gene Expression Profiling KW - Hematopoietic Stem Cells KW - Humans KW - Mice KW - Myeloid Cells KW - Paracrine Communication KW - Proto-Oncogene Proteins KW - Reverse Transcriptase Polymerase Chain Reaction KW - rho GTP-Binding Proteins KW - Signal Transduction KW - Stem Cell Transplantation KW - T-Cell Acute Lymphocytic Leukemia Protein 1 KW - Transcriptome AB -

Understanding the transcriptional cues that direct differentiation of human embryonic stem cells (hESCs) and human-induced pluripotent stem cells to defined and functional cell types is essential for future clinical applications. In this study, we have compared transcriptional profiles of haematopoietic progenitors derived from hESCs at various developmental stages of a feeder- and serum-free differentiation method and show that the largest transcriptional changes occur during the first 4 days of differentiation. Data mining on the basis of molecular function revealed Rho-GTPase signalling as a key regulator of differentiation. Inhibition of this pathway resulted in a significant reduction in the numbers of emerging haematopoietic progenitors throughout the differentiation window, thereby uncovering a previously unappreciated role for Rho-GTPase signalling during human haematopoietic development. Our analysis indicated that SCL was the 11th most upregulated transcript during the first 4 days of the hESC differentiation process. Overexpression of SCL in hESCs promoted differentiation to meso-endodermal lineages, the emergence of haematopoietic and erythro-megakaryocytic progenitors and accelerated erythroid differentiation. Importantly, intrasplenic transplantation of SCL-overexpressing hESC-derived haematopoietic cells enhanced recovery from induced acute anaemia without significant cell engraftment, suggesting a paracrine-mediated effect.

VL - 20 IS - 24 U1 - https://www.ncbi.nlm.nih.gov/pubmed/21937587?dopt=Abstract ER - TY - JOUR T1 - Mutation screening of multiple genes in Spanish patients with autosomal recessive retinitis pigmentosa by targeted resequencing. JF - PLoS One Y1 - 2011 A1 - González-del Pozo, María A1 - Borrego, Salud A1 - Barragán, Isabel A1 - Pieras, Juan I A1 - Santoyo, Javier A1 - Matamala, Nerea A1 - Naranjo, Belén A1 - Dopazo, Joaquin A1 - Antiňolo, Guillermo KW - Alleles KW - DNA Mutational Analysis KW - Exons KW - Genetic Variation KW - Genome KW - Hispanic or Latino KW - Humans KW - Introns KW - Language KW - mutation KW - Mutation, Missense KW - Oligonucleotide Array Sequence Analysis KW - Polymerase Chain Reaction KW - Reproducibility of Results KW - Retinitis pigmentosa KW - United States AB -

Retinitis Pigmentosa (RP) is a heterogeneous group of inherited retinal dystrophies characterised ultimately by the loss of photoreceptor cells. RP is the leading cause of visual loss in individuals younger than 60 years, with a prevalence of about 1 in 4000. The molecular genetic diagnosis of autosomal recessive RP (arRP) is challenging due to the large genetic and clinical heterogeneity. Traditional methods for sequencing arRP genes are often laborious and not easily available and a screening technique that enables the rapid detection of the genetic cause would be very helpful in the clinical practice. The goal of this study was to develop and apply microarray-based resequencing technology capable of detecting both known and novel mutations on a single high-throughput platform. Hence, the coding regions and exon/intron boundaries of 16 arRP genes were resequenced using microarrays in 102 Spanish patients with clinical diagnosis of arRP. All the detected variations were confirmed by direct sequencing and potential pathogenicity was assessed by functional predictions and frequency in controls. For validation purposes 4 positive controls for variants consisting of previously identified changes were hybridized on the array. As a result of the screening, we detected 44 variants, of which 15 are very likely pathogenic detected in 14 arRP families (14%). Finally, the design of this array can easily be transformed in an equivalent diagnostic system based on targeted enrichment followed by next generation sequencing.

VL - 6 IS - 12 U1 - https://www.ncbi.nlm.nih.gov/pubmed/22164218?dopt=Abstract ER - TY - JOUR T1 - Natural selection on functional modules, a genome-wide analysis. JF - PLoS Comput Biol Y1 - 2011 A1 - Serra, François A1 - Arbiza, Leonardo A1 - Dopazo, Joaquin A1 - Dopazo, Hernán KW - Animals KW - Databases, Genetic KW - Drosophila KW - Genome, Insect KW - Genome-Wide Association Study KW - Genomics KW - Mammals KW - Phylogeny KW - Selection, Genetic KW - Sequence Analysis, DNA AB -

Classically, the functional consequences of natural selection over genomes have been analyzed as the compound effects of individual genes. The current paradigm for large-scale analysis of adaptation is based on the observed significant deviations of rates of individual genes from neutral evolutionary expectation. This approach, which assumed independence among genes, has not been able to identify biological functions significantly enriched in positively selected genes in individual species. Alternatively, pooling related species has enhanced the search for signatures of selection. However, grouping signatures does not allow testing for adaptive differences between species. Here we introduce the Gene-Set Selection Analysis (GSSA), a new genome-wide approach to test for evidences of natural selection on functional modules. GSSA is able to detect lineage specific evolutionary rate changes in a notable number of functional modules. For example, in nine mammal and Drosophilae genomes GSSA identifies hundreds of functional modules with significant associations to high and low rates of evolution. Many of the detected functional modules with high evolutionary rates have been previously identified as biological functions under positive selection. Notably, GSSA identifies conserved functional modules with many positively selected genes, which questions whether they are exclusively selected for fitting genomes to environmental changes. Our results agree with previous studies suggesting that adaptation requires positive selection, but not every mutation under positive selection contributes to the adaptive dynamical process of the evolution of species.

VL - 7 IS - 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/21390268?dopt=Abstract ER - TY - JOUR T1 - Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing. JF - BMC genomics Y1 - 2011 A1 - Durban, Jordi A1 - Juárez, Paula A1 - Angulo, Yamileth A1 - Lomonte, Bruno A1 - Flores-Diaz, Marietta A1 - Alape-Girón, Alberto A1 - Sasa, Mahmood A1 - Sanz, Libia A1 - Gutiérrez, José M A1 - Joaquín Dopazo A1 - Ana Conesa A1 - Calvete, Juan J AB -

A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects.

VL - 12 ER - TY - JOUR T1 - Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus. JF - Genome research Y1 - 2010 A1 - Javierre, Biola M A1 - Fernandez, Agustin F A1 - Richter, Julia A1 - Fatima Al-Shahrour A1 - Martin-Subero, J Ignacio A1 - Rodriguez-Ubreva, Javier A1 - Berdasco, Maria A1 - Fraga, Mario F A1 - O’Hanlon, Terrance P A1 - Rider, Lisa G A1 - Jacinto, Filipe V A1 - Lopez-Longo, F Javier A1 - Dopazo, Joaquin A1 - Forn, Marta A1 - Peinado, Miguel A A1 - Carreño, Luis A1 - Sawalha, Amr H A1 - Harley, John B A1 - Siebert, Reiner A1 - Esteller, Manel A1 - Miller, Frederick W A1 - Ballestar, Esteban AB -

Monozygotic (MZ) twins are partially concordant for most complex diseases, including autoimmune disorders. Whereas phenotypic concordance can be used to study heritability, discordance suggests the role of non-genetic factors. In autoimmune diseases, environmentally driven epigenetic changes are thought to contribute to their etiology. Here we report the first high-throughput and candidate sequence analyses of DNA methylation to investigate discordance for autoimmune disease in twins. We used a cohort of MZ twins discordant for three diseases whose clinical signs often overlap: systemic lupus erythematosus (SLE), rheumatoid arthritis, and dermatomyositis. Only MZ twins discordant for SLE featured widespread changes in the DNA methylation status of a significant number of genes. Gene ontology analysis revealed enrichment in categories associated with immune function. Individual analysis confirmed the existence of DNA methylation and expression changes in genes relevant to SLE pathogenesis. These changes occurred in parallel with a global decrease in the 5-methylcytosine content that was concomitantly accompanied with changes in DNA methylation and expression levels of ribosomal RNA genes, although no changes in repetitive sequences were found. Our findings not only identify potentially relevant DNA methylation markers for the clinical characterization of SLE patients but also support the notion that epigenetic changes may be critical in the clinical manifestations of autoimmune disease.

VL - 20 ER - TY - JOUR T1 - DNA methylation epigenotypes in breast cancer molecular subtypes. JF - Breast Cancer Res Y1 - 2010 A1 - Bediaga, Naiara G A1 - Acha-Sagredo, Amelia A1 - Guerra, Isabel A1 - Viguri, Amparo A1 - Albaina, Carmen A1 - Ruiz Diaz, Irune A1 - Rezola, Ricardo A1 - Alberdi, Maria Jesus A1 - Dopazo, Joaquin A1 - Montaner, David A1 - Renobales, Mertxe A1 - Fernandez, Agustin F A1 - Field, John K A1 - Fraga, Mario F A1 - Liloglou, Triantafillos A1 - de Pancorbo, Marian M KW - Aged KW - Breast Neoplasms KW - CpG Islands KW - DNA Methylation KW - Epigenesis, Genetic KW - Female KW - Gene Expression Profiling KW - Genes, p53 KW - Genotype KW - Humans KW - Ki-67 Antigen KW - Middle Aged KW - mutation KW - Neoplasm Grading KW - Oligonucleotide Array Sequence Analysis KW - Receptor, ErbB-2 KW - Tumor Suppressor Protein p53 AB -

INTRODUCTION: Identification of gene expression based breast cancer subtypes is considered as a critical means of prognostication. Genetic mutations along with epigenetic alterations contribute to gene expression changes occurring in breast cancer. So far, these epigenetic contributions to sporadic breast cancer subtypes have not been well characterized, and there is only a limited understanding of the epigenetic mechanisms affected in those particular breast cancer subtypes. The present study was undertaken to dissect the breast cancer methylome and deliver specific epigenotypes associated with particular breast cancer subtypes.

METHODS: Using a microarray approach we analyzed DNA methylation in regulatory regions of 806 cancer related genes in 28 breast cancer paired samples. We subsequently performed substantial technical and biological validation by Pyrosequencing, investigating the top qualifying 19 CpG regions in independent cohorts encompassing 47 basal-like, 44 ERBB2+ overexpressing, 48 luminal A and 48 luminal B paired breast cancer/adjacent tissues. Using all-subset selection method, we identified the most subtype predictive methylation profiles in multivariable logistic regression analysis.

RESULTS: The approach efficiently recognized 15 individual CpG loci differentially methylated in breast cancer tumor subtypes. We further identify novel subtype specific epigenotypes which clearly demonstrate the differences in the methylation profiles of basal-like and human epidermal growth factor 2 (HER2)-overexpressing tumors.

CONCLUSIONS: Our results provide evidence that well defined DNA methylation profiles enables breast cancer subtype prediction and support the utilization of this biomarker for prognostication and therapeutic stratification of patients with breast cancer.

VL - 12 IS - 5 U1 - https://www.ncbi.nlm.nih.gov/pubmed/20920229?dopt=Abstract ER - TY - JOUR T1 - Fine-scale evolution: genomic, phenotypic and ecological differentiation in two coexisting Salinibacter ruber strains. JF - The ISME journal Y1 - 2010 A1 - Peña, Arantxa A1 - Teeling, Hanno A1 - Huerta-Cepas, Jaime A1 - Santos, Fernando A1 - Yarza, Pablo A1 - Brito-Echeverría, Jocelyn A1 - Lucio, Marianna A1 - Schmitt-Kopplin, Philippe A1 - Meseguer, Inmaculada A1 - Schenowitz, Chantal A1 - Dossat, Carole A1 - Barbe, Valerie A1 - Joaquín Dopazo A1 - Rosselló-Mora, Ramon A1 - Schüler, Margarete A1 - Glöckner, Frank Oliver A1 - Amann, Rudolf A1 - Gabaldón, Toni A1 - Antón, Josefa AB -

Genomic and metagenomic data indicate a high degree of genomic variation within microbial populations, although the ecological and evolutive meaning of this microdiversity remains unknown. Microevolution analyses, including genomic and experimental approaches, are so far very scarce for non-pathogenic bacteria. In this study, we compare the genomes, metabolomes and selected ecological traits of the strains M8 and M31 of the hyperhalophilic bacterium Salinibacter ruber that contain ribosomal RNA (rRNA) gene and intergenic regions that are identical in sequence and were simultaneously isolated from a Mediterranean solar saltern. Comparative analyses indicate that S. ruber genomes present a mosaic structure with conserved and hypervariable regions (HVRs). The HVRs or genomic islands, are enriched in transposases, genes related to surface properties, strain-specific genes and highly divergent orthologous. However, the many indels outside the HVRs indicate that genome plasticity extends beyond them. Overall, 10% of the genes encoded in the M8 genome are absent from M31 and could stem from recent acquisitions. S. ruber genomes also harbor 34 genes located outside HVRs that are transcribed during standard growth and probably derive from lateral gene transfers with Archaea preceding the M8/M31 divergence. Metabolomic analyses, phage susceptibility and competition experiments indicate that these genomic differences cannot be considered neutral from an ecological perspective. The results point to the avoidance of competition by micro-niche adaptation and response to viral predation as putative major forces that drive microevolution within these Salinibacter strains. In addition, this work highlights the extent of bacterial functional diversity and environmental adaptation, beyond the resolution of the 16S rRNA and internal transcribed spacers regions.The ISME Journal advance online publication, 18 February 2010; doi:10.1038/ismej.2010.6.

ER - TY - JOUR T1 - FM19G11, a new hypoxia-inducible factor (HIF) modulator, affects stem cell differentiation status. JF - The Journal of biological chemistry Y1 - 2010 A1 - Moreno-Manzano, Victoria A1 - Rodríguez-Jiménez, Francisco J A1 - Aceña-Bonilla, Jose L A1 - Fustero-Lardíes, Santos A1 - Erceg, Slaven A1 - Dopazo, Joaquin A1 - Montaner, David A1 - Stojkovic, Miodrag A1 - Sánchez-Puelles, Jose M AB -

The biology of the alpha subunits of hypoxia-inducible factors (HIFalpha) has expanded from their role in angiogenesis to their current position in the self-renewal and differentiation of stem cells. The results reported in this article show the discovery of FM19G11, a novel chemical entity that inhibits HIFalpha proteins that repress target genes of the two alpha subunits, in various tumor cell lines as well as in adult and embryonic stem cell models from rodents and humans, respectively. FM19G11 inhibits at nanomolar range the transcriptional and protein expression of Oct4, Sox2, Nanog, and Tgf-alpha undifferentiating factors, in adult rat and human embryonic stem cells, FM19G11 activity occurs in ependymal progenitor stem cells from rats (epSPC), a cell model reported for spinal cord regeneration, which allows the progression of oligodendrocyte cell differentiation in a hypoxic environment, has created interest in its characterization for pharmacological research. Experiments using small interfering RNA showed a significant depletion in Sox2 protein only in the case of HIF2alpha silencing, but not in HIF1alpha-mediated ablation. Moreover, chromatin immunoprecipitation data, together with the significant presence of functional hypoxia response element consensus sequences in the promoter region of Sox2, strongly validated that this factor behaves as a target gene of HIF2alpha in epSPCs. FM19G11 causes a reduction of overall histone acetylation with significant repression of p300, a histone acetyltransferase required as a co-factor for HIF-transcription activation. Arrays carried out in the presence and absence of the inhibitor showed the predominant involvement of epigenetic-associated events mediated by the drug.

VL - 285 ER - TY - JOUR T1 - Functional genomics of 5- to 8-cell stage human embryos by blastomere single-cell cDNA analysis. JF - PLoS One Y1 - 2010 A1 - Galan, Amparo A1 - Montaner, David A1 - Póo, M Eugenia A1 - Valbuena, Diana A1 - Ruiz, Veronica A1 - Aguilar, Cristóbal A1 - Dopazo, Joaquin A1 - Simon, Carlos KW - Blastomeres KW - DNA, Complementary KW - Gene Expression Profiling KW - Genomics KW - Humans KW - Oligonucleotide Array Sequence Analysis AB -

Blastomere fate and embryonic genome activation (EGA) during human embryonic development are unsolved areas of high scientific and clinical interest. Forty-nine blastomeres from 5- to 8-cell human embryos have been investigated following an efficient single-cell cDNA amplification protocol to provide a template for high-density microarray analysis. The previously described markers, characteristic of Inner Cell Mass (ICM) (n = 120), stemness (n = 190) and Trophectoderm (TE) (n = 45), were analyzed, and a housekeeping pattern of 46 genes was established. All the human blastomeres from the 5- to 8-cell stage embryo displayed a common gene expression pattern corresponding to ICM markers (e.g., DDX3, FOXD3, LEFTY1, MYC, NANOG, POU5F1), stemness (e.g., POU5F1, DNMT3B, GABRB3, SOX2, ZFP42, TERT), and TE markers (e.g., GATA6, EOMES, CDX2, LHCGR). The EGA profile was also investigated between the 5-6- and 8-cell stage embryos, and compared to the blastocyst stage. Known genes (n = 92) such as depleted maternal transcripts (e.g., CCNA1, CCNB1, DPPA2) and embryo-specific activation (e.g., POU5F1, CDH1, DPPA4), as well as novel genes, were confirmed. In summary, the global single-cell cDNA amplification microarray analysis of the 5- to 8-cell stage human embryos reveals that blastomere fate is not committed to ICM or TE. Finally, new EGA features in human embryogenesis are presented.

VL - 5 IS - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/21049019?dopt=Abstract ER - TY - JOUR T1 - Hypoxia promotes efficient differentiation of human embryonic stem cells to functional endothelium. JF - Stem Cells Y1 - 2010 A1 - Prado-Lopez, Sonia A1 - Conesa, Ana A1 - Armiñán, Ana A1 - Martínez-Losa, Magdalena A1 - Escobedo-Lucea, Carmen A1 - Gandia, Carolina A1 - Tarazona, Sonia A1 - Melguizo, Dario A1 - Blesa, David A1 - Montaner, David A1 - Sanz-González, Silvia A1 - Sepúlveda, Pilar A1 - Götz, Stefan A1 - O'Connor, José Enrique A1 - Moreno, Ruben A1 - Dopazo, Joaquin A1 - Burks, Deborah J A1 - Stojkovic, Miodrag KW - Angiopoietin-1 KW - Animals KW - biomarkers KW - Cell Culture Techniques KW - Cell Differentiation KW - Cell Hypoxia KW - Cell Transplantation KW - Cells, Cultured KW - Down-Regulation KW - Embryonic Stem Cells KW - Endothelial Cells KW - Gene Expression Profiling KW - Gene Expression Regulation KW - Humans KW - Male KW - Myocardial Infarction KW - Neovascularization, Physiologic KW - Oxygen KW - Pluripotent Stem Cells KW - Rats KW - Rats, Nude KW - Vascular Endothelial Growth Factor A AB -

Early development of mammalian embryos occurs in an environment of relative hypoxia. Nevertheless, human embryonic stem cells (hESC), which are derived from the inner cell mass of blastocyst, are routinely cultured under the same atmospheric conditions (21% O(2)) as somatic cells. We hypothesized that O(2) levels modulate gene expression and differentiation potential of hESC, and thus, we performed gene profiling of hESC maintained under normoxic or hypoxic (1% or 5% O(2)) conditions. Our analysis revealed that hypoxia downregulates expression of pluripotency markers in hESC but increases significantly the expression of genes associated with angio- and vasculogenesis including vascular endothelial growth factor and angiopoitein-like proteins. Consequently, we were able to efficiently differentiate hESC to functional endothelial cells (EC) by varying O(2) levels; after 24 hours at 5% O(2), more than 50% of cells were CD34+. Transplantation of resulting endothelial-like cells improved both systolic function and fractional shortening in a rodent model of myocardial infarction. Moreover, analysis of the infarcted zone revealed that transplanted EC reduced the area of fibrous scar tissue by 50%. Thus, use of hypoxic conditions to specify the endothelial lineage suggests a novel strategy for cellular therapies aimed at repair of damaged vasculature in pathologies such as cerebral ischemia and myocardial infarction.

VL - 28 IS - 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/20049902?dopt=Abstract ER - TY - JOUR T1 - The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models. JF - Nature biotechnology Y1 - 2010 A1 - Shi, Leming A1 - Campbell, Gregory A1 - Jones, Wendell D A1 - Campagne, Fabien A1 - Wen, Zhining A1 - Walker, Stephen J A1 - Su, Zhenqiang A1 - Chu, Tzu-Ming A1 - Goodsaid, Federico M A1 - Pusztai, Lajos A1 - Shaughnessy, John D A1 - Oberthuer, André A1 - Thomas, Russell S A1 - Paules, Richard S A1 - Fielden, Mark A1 - Barlogie, Bart A1 - Chen, Weijie A1 - Du, Pan A1 - Fischer, Matthias A1 - Furlanello, Cesare A1 - Gallas, Brandon D A1 - Ge, Xijin A1 - Megherbi, Dalila B A1 - Symmans, W Fraser A1 - Wang, May D A1 - Zhang, John A1 - Bitter, Hans A1 - Brors, Benedikt A1 - Bushel, Pierre R A1 - Bylesjo, Max A1 - Chen, Minjun A1 - Cheng, Jie A1 - Cheng, Jing A1 - Chou, Jeff A1 - Davison, Timothy S A1 - Delorenzi, Mauro A1 - Deng, Youping A1 - Devanarayan, Viswanath A1 - Dix, David J A1 - Dopazo, Joaquin A1 - Dorff, Kevin C A1 - Elloumi, Fathi A1 - Fan, Jianqing A1 - Fan, Shicai A1 - Fan, Xiaohui A1 - Fang, Hong A1 - Gonzaludo, Nina A1 - Hess, Kenneth R A1 - Hong, Huixiao A1 - Huan, Jun A1 - Irizarry, Rafael A A1 - Judson, Richard A1 - Juraeva, Dilafruz A1 - Lababidi, Samir A1 - Lambert, Christophe G A1 - Li, Li A1 - Li, Yanen A1 - Li, Zhen A1 - Lin, Simon M A1 - Liu, Guozhen A1 - Lobenhofer, Edward K A1 - Luo, Jun A1 - Luo, Wen A1 - McCall, Matthew N A1 - Nikolsky, Yuri A1 - Pennello, Gene A A1 - Perkins, Roger G A1 - Philip, Reena A1 - Popovici, Vlad A1 - Price, Nathan D A1 - Qian, Feng A1 - Scherer, Andreas A1 - Shi, Tieliu A1 - Shi, Weiwei A1 - Sung, Jaeyun A1 - Thierry-Mieg, Danielle A1 - Thierry-Mieg, Jean A1 - Thodima, Venkata A1 - Trygg, Johan A1 - Vishnuvajjala, Lakshmi A1 - Wang, Sue Jane A1 - Wu, Jianping A1 - Wu, Yichao A1 - Xie, Qian A1 - Yousef, Waleed A A1 - Zhang, Liang A1 - Zhang, Xuegong A1 - Zhong, Sheng A1 - Zhou, Yiming A1 - Zhu, Sheng A1 - Arasappan, Dhivya A1 - Bao, Wenjun A1 - Lucas, Anne Bergstrom A1 - Berthold, Frank A1 - Brennan, Richard J A1 - Buness, Andreas A1 - Catalano, Jennifer G A1 - Chang, Chang A1 - Chen, Rong A1 - Cheng, Yiyu A1 - Cui, Jian A1 - Czika, Wendy A1 - Demichelis, Francesca A1 - Deng, Xutao A1 - Dosymbekov, Damir A1 - Eils, Roland A1 - Feng, Yang A1 - Fostel, Jennifer A1 - Fulmer-Smentek, Stephanie A1 - Fuscoe, James C A1 - Gatto, Laurent A1 - Ge, Weigong A1 - Goldstein, Darlene R A1 - Guo, Li A1 - Halbert, Donald N A1 - Han, Jing A1 - Harris, Stephen C A1 - Hatzis, Christos A1 - Herman, Damir A1 - Huang, Jianping A1 - Jensen, Roderick V A1 - Jiang, Rui A1 - Johnson, Charles D A1 - Jurman, Giuseppe A1 - Kahlert, Yvonne A1 - Khuder, Sadik A A1 - Kohl, Matthias A1 - Li, Jianying A1 - Li, Li A1 - Li, Menglong A1 - Li, Quan-Zhen A1 - Li, Shao A1 - Li, Zhiguang A1 - Liu, Jie A1 - Liu, Ying A1 - Liu, Zhichao A1 - Meng, Lu A1 - Madera, Manuel A1 - Martinez-Murillo, Francisco A1 - Medina, Ignacio A1 - Meehan, Joseph A1 - Miclaus, Kelci A1 - Moffitt, Richard A A1 - Montaner, David A1 - Mukherjee, Piali A1 - Mulligan, George J A1 - Neville, Padraic A1 - Nikolskaya, Tatiana A1 - Ning, Baitang A1 - Page, Grier P A1 - Parker, Joel A1 - Parry, R Mitchell A1 - Peng, Xuejun A1 - Peterson, Ron L A1 - Phan, John H A1 - Quanz, Brian A1 - Ren, Yi A1 - Riccadonna, Samantha A1 - Roter, Alan H A1 - Samuelson, Frank W A1 - Schumacher, Martin M A1 - Shambaugh, Joseph D A1 - Shi, Qiang A1 - Shippy, Richard A1 - Si, Shengzhu A1 - Smalter, Aaron A1 - Sotiriou, Christos A1 - Soukup, Mat A1 - Staedtler, Frank A1 - Steiner, Guido A1 - Stokes, Todd H A1 - Sun, Qinglan A1 - Tan, Pei-Yi A1 - Tang, Rong A1 - Tezak, Zivana A1 - Thorn, Brett A1 - Tsyganova, Marina A1 - Turpaz, Yaron A1 - Vega, Silvia C A1 - Visintainer, Roberto A1 - von Frese, Juergen A1 - Wang, Charles A1 - Wang, Eric A1 - Wang, Junwei A1 - Wang, Wei A1 - Westermann, Frank A1 - Willey, James C A1 - Woods, Matthew A1 - Wu, Shujian A1 - Xiao, Nianqing A1 - Xu, Joshua A1 - Xu, Lei A1 - Yang, Lun A1 - Zeng, Xiao A1 - Zhang, Jialu A1 - Zhang, Li A1 - Zhang, Min A1 - Zhao, Chen A1 - Puri, Raj K A1 - Scherf, Uwe A1 - Tong, Weida A1 - Wolfinger, Russell D AB -

Gene expression data from microarrays are being applied to predict preclinical and clinical endpoints, but the reliability of these predictions has not been established. In the MAQC-II project, 36 independent teams analyzed six microarray data sets to generate predictive models for classifying a sample with respect to one of 13 endpoints indicative of lung or liver toxicity in rodents, or of breast cancer, multiple myeloma or neuroblastoma in humans. In total, >30,000 models were built using many combinations of analytical methods. The teams generated predictive models without knowing the biological meaning of some of the endpoints and, to mimic clinical reality, tested the models on data that had not been used for training. We found that model performance depended largely on the endpoint and team proficiency and that different approaches generated models of similar performance. The conclusions and recommendations from MAQC-II should be useful for regulatory agencies, study committees and independent investigators that evaluate methods for global gene expression analysis.

VL - 28 UR - http://www.nature.com/nbt/journal/v28/n8/full/nbt.1665.html ER - TY - JOUR T1 - Mutation spectrum of EYS in Spanish patients with autosomal recessive retinitis pigmentosa. JF - Hum Mutat Y1 - 2010 A1 - Barragán, Isabel A1 - Borrego, Salud A1 - Pieras, Juan Ignacio A1 - González-del Pozo, María A1 - Santoyo, Javier A1 - Ayuso, Carmen A1 - Baiget, Montserrat A1 - Millán, José M A1 - Mena, Marcela A1 - Abd El-Aziz, Mai M A1 - Audo, Isabelle A1 - Zeitz, Christina A1 - Littink, Karin W A1 - Dopazo, Joaquin A1 - Bhattacharya, Shomi S A1 - Antiňolo, Guillermo KW - Amino Acid Sequence KW - Animals KW - Case-Control Studies KW - DNA Mutational Analysis KW - Drosophila Proteins KW - Evolution, Molecular KW - Eye Proteins KW - Female KW - Genes, Recessive KW - Genetic Variation KW - Humans KW - Male KW - Molecular Sequence Data KW - mutation KW - Pedigree KW - Polymorphism, Single Nucleotide KW - Protein Structure, Tertiary KW - Retinitis pigmentosa KW - Spain KW - Structural Homology, Protein AB -

Retinitis pigmentosa (RP) is a heterogeneous group of inherited retinal dystrophies characterised ultimately by the loss of photoreceptor cells. We have recently identified a new gene(EYS) encoding an ortholog of Drosophila space maker (spam) as a commonly mutated gene in autosomal recessive RP. In the present study, we report the identification of 73 sequence variations in EYS, of which 28 are novel. Of these, 42.9% (12/28) are very likely pathogenic, 17.9% (5/28)are possibly pathogenic, whereas 39.3% (11/28) are SNPs. In addition, we have detected 3 pathogenic changes previously reported in other populations. We are also presenting the characterisation of EYS homologues in different species, and a detailed analysis of the EYS domains, with the identification of an interesting novel feature: a putative coiled-coil domain.Majority of the mutations in the arRP patients have been found within the domain structures of EYS. The minimum observed prevalence of distinct EYS mutations in our group of patients is of 15.9% (15/94), confirming a major involvement of EYS in the pathogenesis of arRP in the Spanish population. Along with the detection of three recurrent mutations in Caucasian population, our hypothesis of EYS being the first prevalent gene in arRP has been reinforced in the present study.

VL - 31 IS - 11 U1 - https://www.ncbi.nlm.nih.gov/pubmed/21069908?dopt=Abstract ER - TY - JOUR T1 - Selection upon Genome Architecture: Conservation of Functional Neighborhoods with Changing Genes JF - PLoS Comput. Biol. Y1 - 2010 A1 - Al-Shahrour, Fátima A1 - Minguez, Pablo A1 - Marqués-Bonet, Tomás A1 - Gazave, Elodie A1 - Navarro, Arcadi A1 - Dopazo, Joaquin VL - 6 UR - http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000953 ER - TY - JOUR T1 - SIMAP–a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters. JF - Nucleic acids research Y1 - 2010 A1 - Rattei, Thomas A1 - Tischler, Patrick A1 - Götz, Stefan A1 - Jehl, Marc-André A1 - Hoser, Jonathan A1 - Arnold, Roland A1 - Ana Conesa A1 - Mewes, Hans-Werner AB -

The prediction of protein function as well as the reconstruction of evolutionary genesis employing sequence comparison at large is still the most powerful tool in sequence analysis. Due to the exponential growth of the number of known protein sequences and the subsequent quadratic growth of the similarity matrix, the computation of the Similarity Matrix of Proteins (SIMAP) becomes a computational intensive task. The SIMAP database provides a comprehensive and up-to-date pre-calculation of the protein sequence similarity matrix, sequence-based features and sequence clusters. As of September 2009, SIMAP covers 48 million proteins and more than 23 million non-redundant sequences. Novel features of SIMAP include the expansion of the sequence space by including databases such as ENSEMBL as well as the integration of metagenomes based on their consistent processing and annotation. Furthermore, protein function predictions by Blast2GO are pre-calculated for all sequences in SIMAP and the data access and query functions have been improved. SIMAP assists biologists to query the up-to-date sequence space systematically and facilitates large-scale downstream projects in computational biology. Access to SIMAP is freely provided through the web portal for individuals (http://mips.gsf.de/simap/) and for programmatic access through DAS (http://webclu.bio.wzw.tum.de/das/) and Web-Service (http://mips.gsf.de/webservices/services/SimapService2.0?wsdl).

VL - 38 ER - TY - JOUR T1 - Exploring the antimicrobial action of a carbon monoxide-releasing compound through whole-genome transcription profiling of Escherichia coli. JF - Microbiology (Reading) Y1 - 2009 A1 - Nobre, Lígia S A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin A1 - Saraiva, Lígia M KW - Biofilms KW - Carbon Monoxide KW - Escherichia coli KW - Escherichia coli Proteins KW - Gene Expression Profiling KW - Gene Expression Regulation, Bacterial KW - Genes, Bacterial KW - Genes, Regulator KW - Genetic Complementation Test KW - Methionine KW - Microbial Viability KW - mutation KW - Oligonucleotide Array Sequence Analysis KW - Organometallic Compounds KW - Phenotype KW - RNA, Bacterial AB -

We recently reported that carbon monoxide (CO) has bactericidal activity. To understand its mode of action we analysed the gene expression changes occurring when Escherichia coli, grown aerobically and anaerobically, is treated with the CO-releasing molecule CORM-2 (tricarbonyldichlororuthenium(II) dimer). Microarray analysis shows that the E. coli CORM-2 response is multifaceted, with a high number of differentially regulated genes spread through several functional categories, namely genes involved in inorganic ion transport and metabolism, regulators, and genes implicated in post-translational modification, such as chaperones. CORM-2 has a higher impact in E. coli cells grown anaerobically, as judged by the repression of genes belonging to eight functional classes which are not seen in the response of aerobically CORM-2-treated cells. The biological relevance of the variations caused by CORM-2 was substantiated by studying the CORM-2 sensitivity of selected E. coli mutants. The results show that the deletion of redox-sensing regulators SoxS and OxyR increased the sensitivity to CORM-2 and suggest that while SoxS plays an important role in protection against CORM-2 under both growth conditions, OxyR seems to participate only in the aerobic CORM-2 response. Under anaerobic conditions, we found that the heat-shock proteins IbpA and IbpB contribute to CORM-2 defence since the deletion of these genes increases the sensitivity of the strain. The induction of several met genes and the hypersensitivity to CORM-2 of the DeltametR, DeltametI and DeltametN mutant strains suggest that CO has effects on the methionine metabolism of E. coli. CORM-2 also affects the transcription of several E. coli biofilm-related genes and increases biofilm formation in E. coli. In particular, the absence of tqsA or bhsA increases the resistance of E. coli to CORM-2, and deletion of tsqA leads to a strain that has lost its capacity to form biofilm upon treatment with CORM-2. In spite of the relatively stable nature of the CO molecule, our results show that CO is able to trigger a significant alteration in the transcriptome of E. coli which necessarily has effects in several key metabolic pathways.

VL - 155 IS - Pt 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/19246752?dopt=Abstract ER - TY - JOUR T1 - Exploring the antimicrobial action of a carbon monoxide-releasing compound through whole-genome transcription profiling of Escherichia coli JF - Microbiology Y1 - 2009 A1 - Nobre, L. S. A1 - Fatima Al-Shahrour A1 - Dopazo, J. A1 - Saraiva, L. M. KW - Bacterial Genes KW - Bacterial/genetics KW - Biofilms Carbon Monoxide/*metabolism Escherichia coli/*genetics/metabolism Escherichia coli Proteins/genetics/metabolism *Gene Expression Profiling Gene Expression Regulation KW - Regulator Genetic Complementation Test Methionine/metabolism Microbial Viability Mutation Oligonucleotide Array Sequence Analysis Organometallic Compounds/*pharmacology Phenotype RNA AB -

We recently reported that carbon monoxide (CO) has bactericidal activity. To understand its mode of action we analysed the gene expression changes occurring when Escherichia coli, grown aerobically and anaerobically, is treated with the CO-releasing molecule CORM-2 (tricarbonyldichlororuthenium(II) dimer). Microarray analysis shows that the E. coli CORM-2 response is multifaceted, with a high number of differentially regulated genes spread through several functional categories, namely genes involved in inorganic ion transport and metabolism, regulators, and genes implicated in post-translational modification, such as chaperones. CORM-2 has a higher impact in E. coli cells grown anaerobically, as judged by the repression of genes belonging to eight functional classes which are not seen in the response of aerobically CORM-2-treated cells. The biological relevance of the variations caused by CORM-2 was substantiated by studying the CORM-2 sensitivity of selected E. coli mutants. The results show that the deletion of redox-sensing regulators SoxS and OxyR increased the sensitivity to CORM-2 and suggest that while SoxS plays an important role in protection against CORM-2 under both growth conditions, OxyR seems to participate only in the aerobic CORM-2 response. Under anaerobic conditions, we found that the heat-shock proteins IbpA and IbpB contribute to CORM-2 defence since the deletion of these genes increases the sensitivity of the strain. The induction of several met genes and the hypersensitivity to CORM-2 of the DeltametR, DeltametI and DeltametN mutant strains suggest that CO has effects on the methionine metabolism of E. coli. CORM-2 also affects the transcription of several E. coli biofilm-related genes and increases biofilm formation in E. coli. In particular, the absence of tqsA or bhsA increases the resistance of E. coli to CORM-2, and deletion of tsqA leads to a strain that has lost its capacity to form biofilm upon treatment with CORM-2. In spite of the relatively stable nature of the CO molecule, our results show that CO is able to trigger a significant alteration in the transcriptome of E. coli which necessarily has effects in several key metabolic pathways.

VL - 155 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=19246752 N1 -

Nobre, Ligia S Al-Shahrour, Fatima Dopazo, Joaquin Saraiva, Ligia M Research Support, Non-U.S. Gov’t England Microbiology (Reading, England) Microbiology. 2009 Mar;155(Pt 3):813-24.

ER - TY - JOUR T1 - Functional signatures identified in B-cell non-Hodgkin lymphoma profiles. JF - Leuk Lymphoma Y1 - 2009 A1 - Aggarwal, Mohit A1 - Sánchez-Beato, Margarita A1 - Gómez-López, Gonzalo A1 - Al-Shahrour, Fátima A1 - Martínez, Nerea A1 - Rodríguez, Antonia A1 - Ruiz-Ballesteros, Elena A1 - Camacho, Francisca I A1 - Pérez-Rosado, Alberto A1 - de la Cueva, Paloma A1 - Artiga, María J A1 - Pisano, David G A1 - Kimby, Eva A1 - Dopazo, Joaquin A1 - Villuendas, Raquel A1 - Piris, Miguel A KW - Adult KW - Cluster Analysis KW - Gene Expression Profiling KW - Gene Expression Regulation, Leukemic KW - Genetic Heterogeneity KW - Humans KW - Lymphoma, B-Cell KW - Neoplasm Proteins KW - Oligonucleotide Array Sequence Analysis KW - RNA, Messenger KW - RNA, Neoplasm KW - Transcription, Genetic AB -

Gene-expression profiling in B-cell lymphomas has provided crucial data on specific lymphoma types, which can contribute to the identification of essential lymphoma survival genes and pathways. In this study, the gene-expression profiling data of all major B-cell lymphoma types were analyzed by unsupervised clustering. The transcriptome classification so obtained, was explored using gene set enrichment analysis generating a heatmap for B-cell lymphoma that identifies common lymphoma survival mechanisms and potential therapeutic targets, recognizing sets of coregulated genes and functional pathways expressed in different lymphoma types. Some of the most relevant signatures (stroma, cell cycle, B-cell receptor (BCR)) are shared by multiple lymphoma types or subclasses. A specific attention was paid to the analysis of BCR and coregulated pathways, defining molecular heterogeneity within multiple B-cell lymphoma types.

VL - 50 IS - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/19863341?dopt=Abstract ER - TY - JOUR T1 - Gene set internal coherence in the context of functional profiling. JF - BMC Genomics Y1 - 2009 A1 - Montaner, David A1 - Minguez, Pablo A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin KW - Algorithms KW - Breast Neoplasms KW - Carcinoma, Intraductal, Noninfiltrating KW - Computational Biology KW - Databases, Nucleic Acid KW - Female KW - Gene Expression Profiling KW - Genomics KW - Humans KW - Oligonucleotide Array Sequence Analysis KW - Papillomavirus Infections KW - Reproducibility of Results AB -

BACKGROUND: Functional profiling methods have been extensively used in the context of high-throughput experiments and, in particular, in microarray data analysis. Such methods use available biological information to define different types of functional gene modules (e.g. gene ontology -GO-, KEGG pathways, etc.) whose representation in a pre-defined list of genes is further studied. In the most popular type of microarray experimental designs (e.g. up- or down-regulated genes, clusters of co-expressing genes, etc.) or in other genomic experiments (e.g. Chip-on-chip, epigenomics, etc.) these lists are composed by genes with a high degree of co-expression. Therefore, an implicit assumption in the application of functional profiling methods within this context is that the genes corresponding to the modules tested are effectively defining sets of co-expressing genes. Nevertheless not all the functional modules are biologically coherent entities in terms of co-expression, which will eventually hinder its detection with conventional methods of functional enrichment.

RESULTS: Using a large collection of microarray data we have carried out a detailed survey of internal correlation in GO terms and KEGG pathways, providing a coherence index to be used for measuring functional module co-regulation. An unexpected low level of internal correlation was found among the modules studied. Only around 30% of the modules defined by GO terms and 57% of the modules defined by KEGG pathways display an internal correlation higher than the expected by chance.This information on the internal correlation of the genes within the functional modules can be used in the context of a logistic regression model in a simple way to improve their detection in gene expression experiments.

CONCLUSION: For the first time, an exhaustive study on the internal co-expression of the most popular functional categories has been carried out. Interestingly, the real level of coexpression within many of them is lower than expected (or even inexistent), which will preclude its detection by means of most conventional functional profiling methods. If the gene-to-function correlation information is used in functional profiling methods, the results obtained improve the ones obtained by conventional enrichment methods.

VL - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/19397819?dopt=Abstract ER - TY - JOUR T1 - Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies JF - Nucl. Acids Res. Y1 - 2009 A1 - Medina, Ignacio A1 - Montaner, David A1 - Bonifaci, Núria A1 - Pujana, Miguel Angel A1 - Carbonell, José A1 - Tárraga, Joaquín A1 - Fatima Al-Shahrour A1 - Dopazo, Joaquin KW - babelomics KW - gene set KW - GESBAP KW - pathway-based analysis KW - SNP AB -

Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high-resolution available today to carry out genotyping studies, the success of its application in real studies has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of the Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in the power testing for this type of studies. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/

VL - 37 UR - http://nar.oxfordjournals.org/cgi/content/abstract/37/suppl_2/W340 ER - TY - JOUR T1 - Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies. JF - Nucleic Acids Res Y1 - 2009 A1 - Medina, Ignacio A1 - Montaner, David A1 - Bonifaci, Núria A1 - Pujana, Miguel Angel A1 - Carbonell, José A1 - Tárraga, Joaquín A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin KW - Biological Phenomena KW - Breast Neoplasms KW - Female KW - Genes KW - Genetic Variation KW - Genome-Wide Association Study KW - Humans KW - Polymorphism, Single Nucleotide KW - Software KW - User-Computer Interface AB -

Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high-resolution available today to carry out genotyping studies, the success of its application in real studies has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of the Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in the power testing for this type of studies. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/.

VL - 37 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/19502494?dopt=Abstract ER - TY - CHAP T1 - Genómica Comparativa y Selección Natural. Aplicaciones en el Genoma Humano. Capítulo 1.6 T2 - Evolución y Adaptacón. 150 años después del Origen de las Especies Y1 - 2009 A1 - Serra, François A1 - Arbiza, L. A1 - H. Dopazo ED - H. Dopazo ED - Navarro, A. AB -

La búsqueda de los eventos adaptativos a nivel molecular que ha diferenciado el genoma humano del de nuestro pariente vivo más cercano, el chimpancé, ha sido una de las áreas de mayor investigación en genómica comparativa. Paralelamente, la predicción funcional de variantes genéticas en nuestra especie ha sido un área de intenso desarrollo en bioinformática. En este trabajo discutiremos resultados previos y otros más recientes que dan cuenta de estos desarrollos. Veremos que en todos los casos la estimación de las presiones selectivas a nivel de los genes individuales o de los residuos de las proteínas son el denominador común para discutir ambos aspectos. Finalmente mostraremos cómo el análisis de estas presiones selectivas por grupos funcionales de genes resulta una alternativa viable y con suficiente poder estadístico para el análisis de la adaptación y de las restricciones evolutivas a nivel genómico. 

JF - Evolución y Adaptacón. 150 años después del Origen de las Especies PB - Obrapropia. CY - Valencia ER - TY - JOUR T1 - ModLink+: Improving fold recognition by using protein-protein interactions JF - Bioinformatics Y1 - 2009 A1 - Fornes, O. A1 - Aragues, R. A1 - Espadaler, J. A1 - M. A. Marti-Renom A1 - Sali, A. A1 - Oliva, B. KW - protein folding AB -

MOTIVATION: Several strategies have been developed to predict the fold of a target protein sequence, most of which are based on aligning the target sequence to other sequences of known structure. Previously, we demonstrated that the consideration of protein-protein interactions significantly increases the accuracy of fold assignment compared to PSI-BLAST sequence comparisons. A drawback of our method was the low number of proteins to which a fold could be assigned. Here, we present an improved version of the method that addresses this limitation. We also compare our method to other state-of-the-art fold assignment methodologies. RESULTS: Our approach (ModLink+) has been tested on 3,716 proteins with domain folds classified in the Structural Classification Of Proteins (SCOP) as well as known interacting partners in the Database of Interacting Proteins (DIP). For this test set, the ratio of success (PPV) on fold assignment increases from 75% for PSI-BLAST, 83% for HHSearch and 81% for PRC to more than 90% for ModLink+ at the e-value cutoff of 10(-3). Under this e-value, ModLink+ can assign a fold to 30-45% of the proteins in the test set, while our previous method could cover less than 25%. When applied to 6,384 proteins with unknown fold in the yeast proteome, ModLink+ combined with PSI-BLAST assigns a fold for domains in 3,738 proteins, while PSI-BLAST alone only covers 2,122 proteins, HHSearch 2,969 and PRC 2,826 proteins, using a threshold e-value that would represent a PPV higher than 82% for each method in the test set. AVAILABILITY: The ModLink+ server is freely accessible in the World Wide Web at http://sbi.imim.es/modlink/. CONTACT: boliva@imim.es.

UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=19357100 N1 -

Journal article Bioinformatics (Oxford, England) Bioinformatics. 2009 Apr 8.

ER - TY - JOUR T1 - Pere Alberch: Originator of EvoDevo JF - Biological Theory Y1 - 2009 A1 - Reiss, JO A1 - Burke, A C A1 - Archer, C A1 - De Renzi, M A1 - H. Dopazo A1 - Etxeberria, A A1 - Gale, E A A1 - Hinchliffe, J R A1 - Nuño de la Rosa, L A1 - Rose, C S A1 - Rasskin-Gutman, D A1 - Müller, G VL - 3 ER - TY - CONF T1 - Peripheral blood cells transcriptome to study new biomarkers for myocardial infarction follow up Y1 - 2009 A1 - Silbiger, Vivian A1 - Luchessi, André A1 - Hirata, Rosario A1 - Carracedo, Ángel A1 - Brión, Maria A1 - Lima Neto, Lidio A1 - P. Pastorelli, C A1 - Dopazo, Joaquin A1 - Montaner, David A1 - Garcia, F A1 - P. Sampaio, M A1 - P. Pereira, M A1 - S. Santos, E A1 - Armaganijan, Dikran A1 - Hirata, Mario ER - TY - JOUR T1 - Sexual selection drives weak positive selection in protamine genes and high promoter divergence, enhancing sperm competitiveness JF - Proc Biol Sci Y1 - 2009 A1 - Martin-Coello, J. A1 - H. Dopazo A1 - Arbiza, L. A1 - Ausio, J. A1 - Roldan, E. R. A1 - Gomendio, M. KW - Adaptation KW - positive selection KW - sperm competition AB -

Phenotypic adaptations may be the result of changes in gene structure or gene regulation, but little is known about the evolution of gene expression. In addition, it is unclear whether the same selective forces may operate at both levels simultaneously. Reproductive proteins evolve rapidly, but the underlying selective forces promoting such rapid changes are still a matter of debate. In particular, the role of sexual selection in driving positive selection among reproductive proteins remains controversial, whereas its potential influence on changes in promoter regions has not been explored. Protamines are responsible for maintaining DNA in a compacted form in chromosomes in sperm and the available evidence suggests that they evolve rapidly. Because protamines condense DNA within the sperm nucleus, they influence sperm head shape. Here, we examine the influence of sperm competition upon protamine 1 and protamine 2 genes and their promoters, by comparing closely related species of Mus that differ in relative testes size, a reliable indicator of levels of sperm competition. We find evidence of positive selection in the protamine 2 gene in the species with the highest inferred levels of sperm competition. In addition, sperm competition levels across all species are strongly associated with high divergence in protamine 2 promoters that, in turn, are associated with sperm swimming speed. We suggest that changes in protamine 2 promoters are likely to enhance sperm swimming speed by making sperm heads more hydrodynamic. Such phenotypic changes are adaptive because sperm swimming speed may be a major determinant of fertilization success under sperm competition. Thus, when species have diverged recently, few changes in gene-coding sequences are found, while high divergence in promoters seems to be associated with the intensity of sexual selection.

UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=19364735 N1 -

Journal article Proceedings. Biological sciences / The Royal Society Proc Biol Sci. 2009 Apr 8.

ER - TY - JOUR T1 - SNOW, a web-based tool for the statistical analysis of protein-protein interaction networks. JF - Nucleic Acids Res Y1 - 2009 A1 - Minguez, Pablo A1 - Götz, Stefan A1 - Montaner, David A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin KW - Computer Graphics KW - Data Interpretation, Statistical KW - Databases, Protein KW - Humans KW - Internet KW - Protein Interaction Mapping KW - Software AB -

Understanding the structure and the dynamics of the complex intercellular network of interactions that contributes to the structure and function of a living cell is one of the main challenges of today's biology. SNOW inputs a collection of protein (or gene) identifiers and, by using the interactome as scaffold, draws the connections among them, calculates several relevant network parameters and, as a novelty among the rest of tools of its class, it estimates their statistical significance. The parameters calculated for each node are: connectivity, betweenness and clustering coefficient. It also calculates the number of components, number of bicomponents and articulation points. An interactive network viewer is also available to explore the resulting network. SNOW is available at http://snow.bioinfo.cipf.es.

VL - 37 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/19454602?dopt=Abstract ER - TY - JOUR T1 - SNOW, a web-based tool for the statistical analysis of protein-protein interaction networks JF - Nucl. Acids Res. Y1 - 2009 A1 - Minguez, Pablo A1 - Gotz, S. A1 - Montaner, David A1 - Fatima Al-Shahrour A1 - Dopazo, Joaquin KW - interactome KW - network KW - snow AB -

Understanding the structure and the dynamics of the complex intercellular network of interactions that contributes to the structure and function of a living cell is one of the main challenges of today’s biology. SNOW inputs a collection of protein (or gene) identifiers and, by using the interactome as scaffold, draws the connections among them, calculates several relevant network parameters and, as a novelty among the rest of tools of its class, it estimates their statistical significance. The parameters calculated for each node are: connectivity, betweenness and clustering coefficient. It also calculates the number of components, number of bicomponents and articulation points. An interactive network viewer is also available to explore the resulting network. SNOW is available at http://snow.bioinfo.cipf.es.

VL - 37 UR - http://nar.oxfordjournals.org/content/early/2009/05/19/nar.gkp402.full ER - TY - JOUR T1 - Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments JF - Nucleic Acids Res Y1 - 2008 A1 - Fatima Al-Shahrour A1 - Carbonell, J. A1 - Minguez, P. A1 - Goetz, S. A1 - A. Conesa A1 - Tarraga, J. A1 - Medina, Ignacio A1 - Alloza, E. A1 - Montaner, D. A1 - Dopazo, J. KW - babelomics KW - funtional profiling AB -

We present a new version of Babelomics, a complete suite of web tools for the functional profiling of genome scale experiments, with new and improved methods as well as more types of functional definitions. Babelomics includes different flavours of conventional functional enrichment methods as well as more advanced gene set analysis methods that makes it a unique tool among the similar resources available. In addition to the well-known functional definitions (GO, KEGG), Babelomics includes new ones such as Biocarta pathways or text mining-derived functional terms. Regulatory modules implemented include transcriptional control (Transfac, CisRed) and other levels of regulation such as miRNA-mediated interference. Moreover, Babelomics allows for sub-selection of terms in order to test more focused hypothesis. Also gene annotation correspondence tables can be imported, which allows testing with user-defined functional modules. Finally, a tool for the ’de novo’ functional annotation of sequences has been included in the system. This allows using yet unannotated organisms in the program. Babelomics has been extensively re-engineered and now it includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. Babelomics is available at http://www.babelomics.org.

VL - 36 UR - http://nar.oxfordjournals.org/content/36/suppl_2/W341.long N1 -

Al-Shahrour, Fatima Carbonell, Jose Minguez, Pablo Goetz, Stefan Conesa, Ana Tarraga, Joaquin Medina, Ignacio Alloza, Eva Montaner, David Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W341-6. Epub 2008 May 31.

ER - TY - JOUR T1 - Expression and microarrays. JF - Methods Mol Biol Y1 - 2008 A1 - Dopazo, Joaquin A1 - Al-Shahrour, Fátima KW - Animals KW - Computational Biology KW - gene expression KW - Gene Expression Profiling KW - Humans KW - Oligonucleotide Array Sequence Analysis AB -

High throughput methodologies have increased by several orders of magnitude the amount of experimental microarray data available. Nevertheless, translating these data into useful biological knowledge remains a challenge. There is a risk of perceiving these methodologies as mere factories that produce never-ending quantities of data if a proper biological interpretation is not provided. Methods of interpreting these data are continuously evolving. Typically, a simple two-step approach has been used, in which genes of interest are first selected based on thresholds for the experimental values, and then enrichment in biologically relevant terms in the annotations of these genes is analyzed in a second step. For various reasons, such methods are quite poor in terms of performance and new procedures inspired by systems biology that directly address sets of functionally related genes are currently under development.

VL - 453 U1 - https://www.ncbi.nlm.nih.gov/pubmed/18712307?dopt=Abstract ER - TY - JOUR T1 - GEPAS, a web-based tool for microarray data analysis and interpretation. JF - Nucleic Acids Res Y1 - 2008 A1 - Tárraga, Joaquín A1 - Medina, Ignacio A1 - Carbonell, José A1 - Huerta-Cepas, Jaime A1 - Minguez, Pablo A1 - Alloza, Eva A1 - Al-Shahrour, Fátima A1 - Vegas-Azcárate, Susana A1 - Goetz, Stefan A1 - Escobar, Pablo A1 - Garcia-Garcia, Francisco A1 - Conesa, Ana A1 - Montaner, David A1 - Dopazo, Joaquin KW - Computer Graphics KW - Dose-Response Relationship, Drug KW - Gene Expression Profiling KW - Internet KW - Kinetics KW - Oligonucleotide Array Sequence Analysis KW - Software AB -

Gene Expression Profile Analysis Suite (GEPAS) is one of the most complete and extensively used web-based packages for microarray data analysis. During its more than 5 years of activity it has continuously been updated to keep pace with the state-of-the-art in the changing microarray data analysis arena. GEPAS offers diverse analysis options that include well established as well as novel algorithms for normalization, gene selection, class prediction, clustering and functional profiling of the experiment. New options for time-course (or dose-response) experiments, microarray-based class prediction, new clustering methods and new tests for differential expression have been included. The new pipeliner module allows automating the execution of sequential analysis steps by means of a simple but powerful graphic interface. An extensive re-engineering of GEPAS has been carried out which includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. GEPAS is nowadays the most quoted web tool in its field and it is extensively used by researchers of many countries and its records indicate an average usage rate of 500 experiments per day. GEPAS, is available at http://www.gepas.org.

VL - 36 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/18508806?dopt=Abstract ER - TY - JOUR T1 - GEPAS, a web-based tool for microarray data analysis and interpretation JF - Nucleic Acids Res Y1 - 2008 A1 - Tarraga, J. A1 - Medina, Ignacio A1 - Carbonell, J. A1 - Huerta-Cepas, J. A1 - Minguez, P. A1 - Alloza, E. A1 - Fatima Al-Shahrour A1 - Vegas-Azcarate, S. A1 - Goetz, S. A1 - Escobar, P. A1 - Garcia-Garcia, F. A1 - A. Conesa A1 - Montaner, D. A1 - Dopazo, J. KW - gepas KW - microarray data analysis AB -

Gene Expression Profile Analysis Suite (GEPAS) is one of the most complete and extensively used web-based packages for microarray data analysis. During its more than 5 years of activity it has continuously been updated to keep pace with the state-of-the-art in the changing microarray data analysis arena. GEPAS offers diverse analysis options that include well established as well as novel algorithms for normalization, gene selection, class prediction, clustering and functional profiling of the experiment. New options for time-course (or dose-response) experiments, microarray-based class prediction, new clustering methods and new tests for differential expression have been included. The new pipeliner module allows automating the execution of sequential analysis steps by means of a simple but powerful graphic interface. An extensive re-engineering of GEPAS has been carried out which includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. GEPAS is nowadays the most quoted web tool in its field and it is extensively used by researchers of many countries and its records indicate an average usage rate of 500 experiments per day. GEPAS, is available at http://www.gepas.org.

VL - 36 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=18508806 N1 -

Tarraga, Joaquin Medina, Ignacio Carbonell, Jose Huerta-Cepas, Jaime Minguez, Pablo Alloza, Eva Al-Shahrour, Fatima Vegas-Azcarate, Susana Goetz, Stefan Escobar, Pablo Garcia-Garcia, Francisco Conesa, Ana Montaner, David Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W308-14. Epub 2008 May 28.

ER - TY - JOUR T1 - Interoperability with Moby 1.0--it's better than sharing your toothbrush! JF - Brief Bioinform Y1 - 2008 A1 - Wilkinson, Mark D A1 - Senger, Martin A1 - Kawas, Edward A1 - Bruskiewich, Richard A1 - Gouzy, Jerome A1 - Noirot, Celine A1 - Bardou, Philippe A1 - Ng, Ambrose A1 - Haase, Dirk A1 - Saiz, Enrique de Andres A1 - Wang, Dennis A1 - Gibbons, Frank A1 - Gordon, Paul M K A1 - Sensen, Christoph W A1 - Carrasco, Jose Manuel Rodriguez A1 - Fernández, José M A1 - Shen, Lixin A1 - Links, Matthew A1 - Ng, Michael A1 - Opushneva, Nina A1 - Neerincx, Pieter B T A1 - Leunissen, Jack A M A1 - Ernst, Rebecca A1 - Twigger, Simon A1 - Usadel, Bjorn A1 - Good, Benjamin A1 - Wong, Yan A1 - Stein, Lincoln A1 - Crosby, William A1 - Karlsson, Johan A1 - Royo, Romina A1 - Párraga, Iván A1 - Ramírez, Sergio A1 - Gelpi, Josep Lluis A1 - Trelles, Oswaldo A1 - Pisano, David G A1 - Jimenez, Natalia A1 - Kerhornou, Arnaud A1 - Rosset, Roman A1 - Zamacola, Leire A1 - Tárraga, Joaquín A1 - Huerta-Cepas, Jaime A1 - Carazo, Jose María A1 - Dopazo, Joaquin A1 - Guigó, Roderic A1 - Navarro, Arcadi A1 - Orozco, Modesto A1 - Valencia, Alfonso A1 - Claros, M Gonzalo A1 - Pérez, Antonio J A1 - Aldana, Jose A1 - Rojano, M Mar A1 - Fernandez-Santa Cruz, Raul A1 - Navas, Ismael A1 - Schiltz, Gary A1 - Farmer, Andrew A1 - Gessler, Damian A1 - Schoof, Heiko A1 - Groscurth, Andreas KW - Computational Biology KW - Database Management Systems KW - Databases, Factual KW - Information Storage and Retrieval KW - Internet KW - Programming Languages KW - Systems Integration AB -

The BioMoby project was initiated in 2001 from within the model organism database community. It aimed to standardize methodologies to facilitate information exchange and access to analytical resources, using a consensus driven approach. Six years later, the BioMoby development community is pleased to announce the release of the 1.0 version of the interoperability framework, registry Application Programming Interface and supporting Perl and Java code-bases. Together, these provide interoperable access to over 1400 bioinformatics resources worldwide through the BioMoby platform, and this number continues to grow. Here we highlight and discuss the features of BioMoby that make it distinct from other Semantic Web Service and interoperability initiatives, and that have been instrumental to its deployment and use by a wide community of bioinformatics service providers. The standard, client software, and supporting code libraries are all freely available at http://www.biomoby.org/.

VL - 9 IS - 3 U1 - https://www.ncbi.nlm.nih.gov/pubmed/18238804?dopt=Abstract ER - TY - JOUR T1 - Interoperability with Moby 1.0–it’s better than sharing your toothbrush! JF - Brief Bioinform Y1 - 2008 A1 - Wilkinson, M. D. A1 - Senger, M. A1 - Kawas, E. A1 - Bruskiewich, R. A1 - Gouzy, J. A1 - Noirot, C. A1 - Bardou, P. A1 - Ng, A. A1 - Haase, D. A1 - Saiz Ede, A. A1 - Wang, D. A1 - Gibbons, F. A1 - Gordon, P. M. A1 - Sensen, C. W. A1 - Carrasco, J. M. A1 - Fernandez, J. M. A1 - Shen, L. A1 - Links, M. A1 - Ng, M. A1 - Opushneva, N. A1 - Neerincx, P. B. A1 - Leunissen, J. A. A1 - Ernst, R. A1 - Twigger, S. A1 - Usadel, B. A1 - Good, B. A1 - Wong, Y. A1 - Stein, L. A1 - Crosby, W. A1 - Karlsson, J. A1 - Royo, R. A1 - Parraga, I. A1 - Ramirez, S. A1 - Gelpi, J. L. A1 - Trelles, O. A1 - Pisano, D. G. A1 - Jimenez, N. A1 - Kerhornou, A. A1 - Rosset, R. A1 - Zamacola, L. A1 - Tarraga, J. A1 - Huerta-Cepas, J. A1 - Carazo, J. M. A1 - Dopazo, J. A1 - R. Guigo A1 - Navarro, A. A1 - Orozco, M. A1 - Valencia, A. A1 - Claros, M. G. A1 - Perez, A. J. A1 - Aldana, J. A1 - Rojano, M. M. A1 - Fernandez-Santa Cruz, R. A1 - Navas, I. A1 - Schiltz, G. A1 - Farmer, A. A1 - Gessler, D. A1 - Schoof, H. A1 - Groscurth, A. KW - Computational Biology/*methods *Database Management Systems *Databases KW - Factual Information Storage and Retrieval/*methods *Internet *Programming Languages Systems Integration AB -

The BioMoby project was initiated in 2001 from within the model organism database community. It aimed to standardize methodologies to facilitate information exchange and access to analytical resources, using a consensus driven approach. Six years later, the BioMoby development community is pleased to announce the release of the 1.0 version of the interoperability framework, registry Application Programming Interface and supporting Perl and Java code-bases. Together, these provide interoperable access to over 1400 bioinformatics resources worldwide through the BioMoby platform, and this number continues to grow. Here we highlight and discuss the features of BioMoby that make it distinct from other Semantic Web Service and interoperability initiatives, and that have been instrumental to its deployment and use by a wide community of bioinformatics service providers. The standard, client software, and supporting code libraries are all freely available at http://www.biomoby.org/.

VL - 9 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=18238804 N1 -

BioMoby Consortium Wilkinson, Mark D Senger, Martin Kawas, Edward Bruskiewich, Richard Gouzy, Jerome Noirot, Celine Bardou, Philippe Ng, Ambrose Haase, Dirk Saiz, Enrique de Andres Wang, Dennis Gibbons, Frank Gordon, Paul M K Sensen, Christoph W Carrasco, Jose Manuel Rodriguez Fernandez, Jose M Shen, Lixin Links, Matthew Ng, Michael Opushneva, Nina Neerincx, Pieter B T Leunissen, Jack A M Ernst, Rebecca Twigger, Simon Usadel, Bjorn Good, Benjamin Wong, Yan Stein, Lincoln Crosby, William Karlsson, Johan Royo, Romina Parraga, Ivan Ramirez, Sergio Gelpi, Josep Lluis Trelles, Oswaldo Pisano, David G Jimenez, Natalia Kerhornou, Arnaud Rosset, Roman Zamacola, Leire Tarraga, Joaquin Huerta-Cepas, Jaime Carazo, Jose Maria Dopazo, Joaquin Guigo, Roderic Navarro, Arcadi Orozco, Modesto Valencia, Alfonso Claros, M Gonzalo Perez, Antonio J Aldana, Jose Rojano, M Mar Fernandez-Santa Cruz, Raul Navas, Ismael Schiltz, Gary Farmer, Andrew Gessler, Damian Schoof, Heiko Groscurth, Andreas Research Support, Non-U.S. Gov’t Review England Briefings in bioinformatics Brief Bioinform. 2008 May;9(3):220-31. Epub 2008 Jan 31.

ER - TY - JOUR T1 - Prediction of enzyme function by combining sequence similarity and protein interactions JF - BMC Bioinformatics Y1 - 2008 A1 - Espadaler, J. A1 - Eswar, N. A1 - Querol, E. A1 - Aviles, F. X. A1 - Sali, A. A1 - M. A. Marti-Renom A1 - Oliva, B. KW - Amino Acid *Software Structure-Activity Relationship Substrate Specificity/genetics KW - Amino Acid Sequence/physiology Databases KW - Automated Predictive Value of Tests Protein Interaction Mapping Proteins/analysis/metabolism Sequence Alignment Sequence Analysis KW - Protein *Sequence Homology KW - Protein Enzymes/analysis/*metabolism Fuzzy Logic Pattern Recognition AB - BACKGROUND: A number of studies have used protein interaction data alone for protein function prediction. Here, we introduce a computational approach for annotation of enzymes, based on the observation that similar protein sequences are more likely to perform the same function if they share similar interacting partners. RESULTS: The method has been tested against the PSI-BLAST program using a set of 3,890 protein sequences from which interaction data was available. For protein sequences that align with at least 40% sequence identity to a known enzyme, the specificity of our method in predicting the first three EC digits increased from 80% to 90% at 80% coverage when compared to PSI-BLAST. CONCLUSION: Our method can also be used in proteins for which homologous sequences with known interacting partners can be detected. Thus, our method could increase 10% the specificity of genome-wide enzyme predictions based on sequence matching by PSI-BLAST alone. VL - 9 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=18505562 N1 - Espadaler, Jordi Eswar, Narayanan Querol, Enrique Aviles, Francesc X Sali, Andrej Marti-Renom, Marc A Oliva, Baldomero GM54762/GM/NIGMS NIH HHS/United States GM71790/GM/NIGMS NIH HHS/United States GM74929/GM/NIGMS NIH HHS/United States GM74945/GM/NIGMS NIH HHS/United States Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t England BMC bioinformatics BMC Bioinformatics. 2008 May 27;9:249. ER - TY - JOUR T1 - Time course profiling of the retinal transcriptome after optic nerve transection and optic nerve crush JF - Mol Vis Y1 - 2008 A1 - Agudo, M. A1 - Perez-Marin, M. C. A1 - Lonngren, U. A1 - Sobrado, P. A1 - A. Conesa A1 - Canovas, I. A1 - Salinas-Navarro, M. A1 - Miralles-Imperial, J. A1 - Hallbook, F. A1 - Vidal-Sanz, M. KW - Animals Cell Death Cluster Analysis Female *Gene Expression Profiling Gene Expression Regulation *Nerve Crush Optic Nerve/*metabolism/*pathology Optic Nerve Injuries/*genetics Rats Rats KW - Sprague-Dawley Reproducibility of Results Retina/*metabolism/*pathology Time Factors AB - PURPOSE: A time-course analysis of gene regulation in the adult rat retina after intraorbital nerve crush (IONC) and intraorbital nerve transection (IONT). METHODS: RNA was extracted from adult rat retinas undergoing either IONT or IONC at increasing times post-lesion. Affymetrix RAE230.2 arrays were hybridized and analyzed. Statistically regulated genes were annotated and functionally clustered. Arrays were validated by means of quantative reverse transcription polymerase chain reaction (qRT-PCR) on ten regulated genes at two times post-lesion. Western blotting and immunohistofluorescence for four pro-apoptotic proteins were performed on naive and injured retinas. Finally, custom signaling maps for IONT- and IONC-induced death response were generated (MetaCore, Genego Inc.). RESULTS: Here we show that over time, 3,219 sequences were regulated after IONT and 1,996 after IONC. Out of the total of regulated sequences, 1,078 were commonly regulated by both injuries. Interestingly, while IONT mainly triggers a gene upregulation-sustained over time, IONC causes a transitory downregulation. Functional clustering identified the regulation of high interest biologic processes, most importantly cell death wherein apoptosis was the most significant cluster. Ten death-related genes upregulated by both injuries were used for array validation by means of qRT-PCR. In addition, western blotting and immunohistofluorescence of total and active Caspase 3 (Casp3), tumor necrosis factor receptor type 1 associated death domain (TRADD), tumor necrosis factor receptor superfamily member 1a (TNFR1a), and c-fos were performed to confirm their protein regulation and expression pattern in naive and injured retinas. These analyses demonstrated that for these genes, protein regulation followed transcriptional regulation and that these pro-apoptotic proteins were expressed by retinal ganglion cells (RGCs). MetaCore-based death-signaling maps show that several apoptotic cascades were regulated in the retina following optic nerve injury and highlight the similarities and differences between IONT and IONC in cell death profiling. CONCLUSIONS: This comprehensive time course retinal transcriptome study comparing IONT and IONC lesions provides a unique valuable tool to understand the molecular mechanisms underlying optic nerve injury and to design neuroprotective protocols. VL - 14 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=18552980 N1 - Agudo, Marta Perez-Marin, Maria Cruz Lonngren, Ulrika Sobrado, Paloma Conesa, Ana Canovas, Isabel Salinas-Navarro, Manuel Miralles-Imperial, Jaime Hallbook, Finn Vidal-Sanz, Manuel Research Support, Non-U.S. Gov’t United States Molecular vision Mol Vis. 2008 Jun 3;14:1050-63. ER - TY - JOUR T1 - Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans JF - Hum Mutat Y1 - 2008 A1 - E. Capriotti A1 - Arbiza, L. A1 - Casadio, R. A1 - Dopazo, J. A1 - H. Dopazo A1 - M. A. Marti-Renom KW - Algorithms Codon/genetics Computational Biology/*methods *DNA Mutational Analysis Databases KW - Human Humans Iduronic Acid/analogs & derivatives/metabolism *Point Mutation Polymorphism KW - Molecular *Genetic Predisposition to Disease Genetic Variation Genome KW - Protein *Evolution KW - Single Nucleotide Proteins/chemistry/*genetics Tumor Suppressor Protein p53/genetics AB - Predicting the functional impact of protein variation is one of the most challenging problems in bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data, allowing the application of rigorous statistical approaches for predicting whether a given single point mutation has an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Here we introduce a new method (SeqProfCod) to predict the likelihood that a given protein variant is associated with human disease or not. Our method relies on a support vector machine (SVM) classifier trained using three sources of information: protein sequence, multiple protein sequence alignments, and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 8,987 single point mutations from 1,434 human proteins from SWISS-PROT. It achieves 82% overall accuracy and a correlation coefficient of 0.59, indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Moreover, this study demonstrates the synergic effect of combining two sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and the evolutionary estimation of the selective pressures at the codon level. The results of large-scale application of SeqProfCod over all annotated point mutations in SWISS-PROT (available for download at http://sgu.bioinfo.cipf.es/services/Omidios/; last accessed: 24 August 2007), could be used to support clinical studies. VL - 29 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17935148 N1 - Capriotti, Emidio Arbiza, Leonardo Casadio, Rita Dopazo, Joaquin Dopazo, Hernan Marti-Renom, Marc A Evaluation Studies Research Support, Non-U.S. Gov’t United States Human mutation Hum Mutat. 2008 Jan;29(1):198-204. ER - TY - JOUR T1 - Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans. JF - Hum Mutat Y1 - 2008 A1 - Capriotti, Emidio A1 - Arbiza, Leonardo A1 - Casadio, Rita A1 - Dopazo, Joaquin A1 - Dopazo, Hernán A1 - Marti-Renom, Marc A KW - Algorithms KW - Codon KW - Computational Biology KW - Databases, Protein KW - DNA Mutational Analysis KW - Evolution, Molecular KW - Genetic Predisposition to Disease KW - Genetic Variation KW - Genome, Human KW - Humans KW - Iduronic Acid KW - Point Mutation KW - Polymorphism, Single Nucleotide KW - Proteins KW - Tumor Suppressor Protein p53 AB -

Predicting the functional impact of protein variation is one of the most challenging problems in bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data, allowing the application of rigorous statistical approaches for predicting whether a given single point mutation has an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Here we introduce a new method (SeqProfCod) to predict the likelihood that a given protein variant is associated with human disease or not. Our method relies on a support vector machine (SVM) classifier trained using three sources of information: protein sequence, multiple protein sequence alignments, and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 8,987 single point mutations from 1,434 human proteins from SWISS-PROT. It achieves 82% overall accuracy and a correlation coefficient of 0.59, indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Moreover, this study demonstrates the synergic effect of combining two sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and the evolutionary estimation of the selective pressures at the codon level. The results of large-scale application of SeqProfCod over all annotated point mutations in SWISS-PROT (available for download at http://sgu.bioinfo.cipf.es/services/Omidios/; last accessed: 24 August 2007), could be used to support clinical studies.

VL - 29 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17935148?dopt=Abstract ER - TY - JOUR T1 - Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance JF - BMC Genomics Y1 - 2007 A1 - Terol, J. A1 - A. Conesa A1 - Colmenero, J. M. A1 - Cercos, M. A1 - Tadeo, F. A1 - Agusti, J. A1 - Alos, E. A1 - Andres, F. A1 - Soler, G. A1 - Brumos, J. A1 - Iglesias, D. J. A1 - Gotz, S. A1 - Legaz, F. A1 - Argout, X. A1 - Courtois, B. A1 - Ollitrault, P. A1 - Dossat, C. A1 - Wincker, P. A1 - Morillon, R. A1 - Talon, M. KW - Acclimatization/*genetics Amino Acid Motifs Citrus/*genetics Cluster Analysis Expressed Sequence Tags Fruit/genetics Gene Duplication *Gene Expression Regulation KW - Plant Gene Library Genes KW - Plant Genomics Molecular Sequence Data Multigene Family Phylogeny *Salts/adverse effects AB - BACKGROUND: Improvement of Citrus, the most economically important fruit crop in the world, is extremely slow and inherently costly because of the long-term nature of tree breeding and an unusual combination of reproductive characteristics. Aside from disease resistance, major commercial traits in Citrus are improved fruit quality, higher yield and tolerance to environmental stresses, especially salinity. RESULTS: A normalized full length and 9 standard cDNA libraries were generated, representing particular treatments and tissues from selected varieties (Citrus clementina and C. sinensis) and rootstocks (C. reshni, and C. sinenis x Poncirus trifoliata) differing in fruit quality, resistance to abscission, and tolerance to salinity. The goal of this work was to provide a large expressed sequence tag (EST) collection enriched with transcripts related to these well appreciated agronomical traits. Towards this end, more than 54000 ESTs derived from these libraries were analyzed and annotated. Assembly of 52626 useful sequences generated 15664 putative transcription units distributed in 7120 contigs, and 8544 singletons. BLAST annotation produced significant hits for more than 80% of the hypothetical transcription units and suggested that 647 of these might be Citrus specific unigenes. The unigene set, composed of 13000 putative different transcripts, including more than 5000 novel Citrus genes, was assigned with putative functions based on similarity, GO annotations and protein domains CONCLUSION: Comparative genomics with Arabidopsis revealed the presence of putative conserved orthologs and single copy genes in Citrus and also the occurrence of both gene duplication events and increased number of genes for specific pathways. In addition, phylogenetic analysis performed on the ammonium transporter family and glycosyl transferase family 20 suggested the existence of Citrus paralogs. Analysis of the Citrus gene space showed that the most important metabolic pathways known to affect fruit quality were represented in the unigene set. Overall, the similarity analyses indicated that the sequences of the genes belonging to these varieties and rootstocks were essentially identical, suggesting that the differential behaviour of these species cannot be attributed to major sequence divergences. This Citrus EST assembly contributes both crucial information to discover genes of agronomical interest and tools for genetic and genomic analyses, such as the development of new markers and microarrays. VL - 8 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17254327 N1 - Terol, Javier Conesa, Ana Colmenero, Jose M Cercos, Manuel Tadeo, Francisco Agusti, Javier Alos, Enriqueta Andres, Fernando Soler, Guillermo Brumos, Javier Iglesias, Domingo J Gotz, Stefan Legaz, Francisco Argout, Xavier Courtois, Brigitte Ollitrault, Patrick Dossat, Carole Wincker, Patrick Morillon, Raphael Talon, Manuel Comparative Study Research Support, Non-U.S. Gov’t England BMC genomics BMC Genomics. 2007 Jan 25;8:31. ER - TY - JOUR T1 - The AnnoLite and AnnoLyze programs for comparative annotation of protein structures JF - BMC Bioinformatics Y1 - 2007 A1 - M. A. Marti-Renom A1 - Rossi, A. A1 - Fatima Al-Shahrour A1 - Davis, F. P. A1 - Pieper, U. A1 - Dopazo, J. A1 - Sali, A. KW - *Algorithms Amino Acid Sequence Confidence Intervals Data Interpretation KW - Amino Acid *Software Structure-Activity Relationship KW - Protein Information Storage and Retrieval/methods Molecular Sequence Data Proteins/*chemistry/classification/*metabolism Sensitivity and Specificity Sequence Alignment/*methods Sequence Analysis KW - Protein/*methods Sequence Homology KW - Statistical *Databases AB - BACKGROUND: Advances in structural biology, including structural genomics, have resulted in a rapid increase in the number of experimentally determined protein structures. However, about half of the structures deposited by the structural genomics consortia have little or no information about their biological function. Therefore, there is a need for tools for automatically and comprehensively annotating the function of protein structures. We aim to provide such tools by applying comparative protein structure annotation that relies on detectable relationships between protein structures to transfer functional annotations. Here we introduce two programs, AnnoLite and AnnoLyze, which use the structural alignments deposited in the DBAli database. DESCRIPTION: AnnoLite predicts the SCOP, CATH, EC, InterPro, PfamA, and GO terms with an average sensitivity of 90% and average precision of 80%. AnnoLyze predicts ligand binding site and domain interaction patches with an average sensitivity of 70% and average precision of 30%, correctly localizing binding sites for small molecules in 95% of its predictions. CONCLUSION: The AnnoLite and AnnoLyze programs for comparative annotation of protein structures can reliably and automatically annotate new protein structures. The programs are fully accessible via the Internet as part of the DBAli suite of tools at http://salilab.org/DBAli/. VL - 8 Suppl 4 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17570147 N1 - Marti-Renom, Marc A Rossi, Andrea Al-Shahrour, Fatima Davis, Fred P Pieper, Ursula Dopazo, Joaquin Sali, Andrej Research Support, Non-U.S. Gov’t England BMC bioinformatics BMC Bioinformatics. 2007 May 22;8 Suppl 4:S4. ER - TY - JOUR T1 - Association study of 69 genes in the ret pathway identifies low-penetrance loci in sporadic medullary thyroid carcinoma JF - Cancer Res Y1 - 2007 A1 - Ruiz-Llorente, S. A1 - Montero-Conde, C. A1 - Milne, R. L. A1 - Moya, C. M. A1 - Cebrian, A. A1 - Leton, R. A1 - Cascon, A. A1 - Mercadillo, F. A1 - Landa, I. A1 - Borrego, S. A1 - Perez de Nanclares, G. A1 - Alvarez-Escola, C. A1 - Diaz-Perez, J. A. A1 - Carracedo, A. A1 - Urioste, M. A1 - Gonzalez-Neira, A. A1 - Benitez, J. A1 - Santisteban, P. A1 - Dopazo, J. A1 - Ponder, B. A. A1 - M. Robledo KW - 80 and over Carcinoma KW - Adolescent Adult Aged Aged KW - Genetic KW - Genetic Proto-Oncogene Proteins c-ret/*genetics/metabolism Signal Transduction Thyroid Neoplasms/*genetics/metabolism Transcription KW - Medullary/*genetics/metabolism Case-Control Studies Cyclin-Dependent Kinase Inhibitor p15/biosynthesis/genetics Female Genetic Predisposition to Disease Germ-Line Mutation Haplotypes Humans Male Middle Aged Penetrance Polymorphism KW - Single Nucleotide Promoter Regions AB - To date, few association studies have been done to better understand the genetic basis for the development of sporadic medullary thyroid carcinoma (sMTC). To identify additional low-penetrance genes, we have done a two-stage case-control study in two European populations using high-throughput genotyping. We selected 417 single nucleotide polymorphisms (SNP) belonging to 69 genes either related to RET signaling pathway/functions or involved in key processes for cancer development. TagSNPs and functional variants were included where possible. These SNPs were initially studied in the largest known series of sMTC cases (n = 266) and controls (n = 422), all of Spanish origin. In stage II, an independent British series of 155 sMTC patients and 531 controls was included to validate the previous results. Associations were assessed by an exhaustive analysis of individual SNPs but also considering gene- and linkage disequilibrium-based haplotypes. This strategy allowed us to identify seven low-penetrance genes, six of them (STAT1, AURKA, BCL2, CDKN2B, CDK6, and COMT) consistently associated with sMTC risk in the two case-control series and a seventh (HRAS) with individual SNPs and haplotypes associated with sMTC in the Spanish data set. The potential role of CDKN2B was confirmed by a functional assay showing a role of a SNP (rs7044859) in the promoter region in altering the binding of the transcription factor HNF1. These results highlight the utility of association studies using homogeneous series of cases for better understanding complex diseases. VL - 67 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17909067 N1 - Ruiz-Llorente, Sergio Montero-Conde, Cristina Milne, Roger L Moya, Christian M Cebrian, Arancha Leton, Rocio Cascon, Alberto Mercadillo, Fatima Landa, Inigo Borrego, Salud Perez de Nanclares, Guiomar Alvarez-Escola, Cristina Diaz-Perez, Jose Angel Carracedo, Angel Urioste, Miguel Gonzalez-Neira, Anna Benitez, Javier Santisteban, Pilar Dopazo, Joaquin Ponder, Bruce A Robledo, Mercedes Medullary Thyroid Carcinoma Clinical Group Research Support, Non-U.S. Gov’t United States Cancer research Cancer Res. 2007 Oct 1;67(19):9561-7. ER - TY - JOUR T1 - Characterization of protein hubs by inferring interacting motifs from protein interactions JF - PLoS Comput Biol Y1 - 2007 A1 - Aragues, R. A1 - Sali, A. A1 - Bonet, J. A1 - M. A. Marti-Renom A1 - Oliva, B. KW - Amino Acid Motifs Amino Acid Sequence Binding Sites Computer Simulation *Models KW - Chemical *Models KW - Molecular Molecular Sequence Data Protein Binding Protein Interaction Mapping/*methods Proteins/*chemistry Sequence Analysis KW - Protein/*methods AB - The characterization of protein interactions is essential for understanding biological systems. While genome-scale methods are available for identifying interacting proteins, they do not pinpoint the interacting motifs (e.g., a domain, sequence segments, a binding site, or a set of residues). Here, we develop and apply a method for delineating the interacting motifs of hub proteins (i.e., highly connected proteins). The method relies on the observation that proteins with common interaction partners tend to interact with these partners through a common interacting motif. The sole input for the method are binary protein interactions; neither sequence nor structure information is needed. The approach is evaluated by comparing the inferred interacting motifs with domain families defined for 368 proteins in the Structural Classification of Proteins (SCOP). The positive predictive value of the method for detecting proteins with common SCOP families is 75% at sensitivity of 10%. Most of the inferred interacting motifs were significantly associated with sequence patterns, which could be responsible for the common interactions. We find that yeast hubs with multiple interacting motifs are more likely to be essential than hubs with one or two interacting motifs, thus rationalizing the previously observed correlation between essentiality and the number of interacting partners of a protein. We also find that yeast hubs with multiple interacting motifs evolve slower than the average protein, contrary to the hubs with one or two interacting motifs. The proposed method will help us discover unknown interacting motifs and provide biological insights about protein hubs and their roles in interaction networks. VL - 3 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17941705 N1 - Aragues, Ramon Sali, Andrej Bonet, Jaume Marti-Renom, Marc A Oliva, Baldo PN2 EY016525,/EY/NEI NIH HHS/United States U54 RR022220/RR/NCRR NIH HHS/United States Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t United States PLoS computational biology PLoS Comput Biol. 2007 Sep;3(9):1761-71. Epub 2007 Jul 30. ER - TY - JOUR T1 - DBAli tools: mining the protein structure space JF - Nucleic Acids Res Y1 - 2007 A1 - M. A. Marti-Renom A1 - Pieper, U. A1 - Madhusudhan, M. S. A1 - Rossi, A. A1 - Eswar, N. A1 - Davis, F. P. A1 - Fatima Al-Shahrour A1 - Dopazo, J. A1 - Sali, A. KW - *Algorithms Amino Acid Sequence Computational Biology/*methods Data Interpretation KW - Amino Acid *Software Structure-Activity Relationship KW - Protein Internet Molecular Sequence Data Protein Conformation Proteins/*chemistry/classification/*metabolism Pseudomonas aeruginosa/*metabolism Sequence Alignment/*methods Sequence Analysis KW - Protein/*methods Sequence Homology KW - Statistical *Databases AB - The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools include (i) the DBAlit program that allows users to input the 3D coordinates of a protein structure for comparison by MAMMOTH against all chains in the PDB; (ii) the AnnoLite and AnnoLyze programs that annotate a target structure based on its stored relationships to other structures; (iii) the ModClus program that clusters structures by sequence and structure similarities; (iv) the ModDom program that identifies domains as recurrent structural fragments and (v) an implementation of the COMPARER method in the SALIGN command in MODELLER that creates a multiple structure alignment for a set of related protein structures. Thus, the DBAli tools, which are freely accessible via the World Wide Web at http://salilab.org/DBAli/, allow users to mine the protein structure space by establishing relationships between protein structures and their functions. VL - 35 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17478513 N1 - Marti-Renom, Marc A Pieper, Ursula Madhusudhan, M S Rossi, Andrea Eswar, Narayanan Davis, Fred P Al-Shahrour, Fatima Dopazo, Joaquin Sali, Andrej GM 62529/GM/NIGMS NIH HHS/United States GM074929/GM/NIGMS NIH HHS/United States GM54762/GM/NIGMS NIH HHS/United States GM71790/GM/NIGMS NIH HHS/United States Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jul;35(Web Server issue):W393-7. Epub 2007 May 3. ER - TY - JOUR T1 - DBAli tools: mining the protein structure space. JF - Nucleic Acids Res Y1 - 2007 A1 - Marti-Renom, Marc A A1 - Pieper, Ursula A1 - Madhusudhan, M S A1 - Rossi, Andrea A1 - Eswar, Narayanan A1 - Davis, Fred P A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin A1 - Sali, Andrej KW - Algorithms KW - Amino Acid Sequence KW - Computational Biology KW - Data Interpretation, Statistical KW - Databases, Protein KW - Internet KW - Molecular Sequence Data KW - Protein Conformation KW - Proteins KW - Pseudomonas aeruginosa KW - Sequence Alignment KW - Sequence Analysis, Protein KW - Sequence Homology, Amino Acid KW - Software KW - Structure-Activity Relationship AB -

The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools include (i) the DBAlit program that allows users to input the 3D coordinates of a protein structure for comparison by MAMMOTH against all chains in the PDB; (ii) the AnnoLite and AnnoLyze programs that annotate a target structure based on its stored relationships to other structures; (iii) the ModClus program that clusters structures by sequence and structure similarities; (iv) the ModDom program that identifies domains as recurrent structural fragments and (v) an implementation of the COMPARER method in the SALIGN command in MODELLER that creates a multiple structure alignment for a set of related protein structures. Thus, the DBAli tools, which are freely accessible via the World Wide Web at http://salilab.org/DBAli/, allow users to mine the protein structure space by establishing relationships between protein structures and their functions.

VL - 35 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/17478513?dopt=Abstract ER - TY - JOUR T1 - Evidence for systems-level molecular mechanisms of tumorigenesis JF - BMC Genomics Y1 - 2007 A1 - Hernandez, P. A1 - Huerta-Cepas, J. A1 - Montaner, D. A1 - Fatima Al-Shahrour A1 - Valls, J. A1 - Gomez, L. A1 - Capella, G. A1 - Dopazo, J. A1 - Pujana, M. A. KW - *Cell Transformation KW - Biological Models KW - Genetic Models KW - Messenger/metabolism Signal Transduction Systems Biology KW - Neoplastic *Gene Expression Profiling *Gene Expression Regulation KW - Neoplastic Humans Male Models KW - Statistical Neoplasm Proteins/*physiology Neoplasms/etiology/*genetics Prostatic Neoplasms/genetics Protein Interaction Mapping RNA AB - BACKGROUND: Cancer arises from the consecutive acquisition of genetic alterations. Increasing evidence suggests that as a consequence of these alterations, molecular interactions are reprogrammed in the context of highly connected and regulated cellular networks. Coordinated reprogramming would allow the cell to acquire the capabilities for malignant growth. RESULTS: Here, we determine the coordinated function of cancer gene products (i.e., proteins encoded by differentially expressed genes in tumors relative to healthy tissue counterparts, hereafter referred to as "CGPs") defined as their topological properties and organization in the interactome network. We show that CGPs are central to information exchange and propagation and that they are specifically organized to promote tumorigenesis. Centrality is identified by both local (degree) and global (betweenness and closeness) measures, and systematically appears in down-regulated CGPs. Up-regulated CGPs do not consistently exhibit centrality, but both types of cancer products determine the overall integrity of the network structure. In addition to centrality, down-regulated CGPs show topological association that correlates with common biological processes and pathways involved in tumorigenesis. CONCLUSION: Given the current limited coverage of the human interactome, this study proposes that tumorigenesis takes place in a specific and organized way at the molecular systems-level and suggests a model that comprises the precise down-regulation of groups of topologically-associated proteins involved in particular functions, orchestrated with the up-regulation of specific proteins. VL - 8 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17584915 N1 - Hernandez, Pilar Huerta-Cepas, Jaime Montaner, David Al-Shahrour, Fatima Valls, Joan Gomez, Laia Capella, Gabriel Dopazo, Joaquin Pujana, Miguel Angel Research Support, Non-U.S. Gov’t England BMC genomics BMC Genomics. 2007 Jun 20;8:185. ER - TY - JOUR T1 - Evidence for systems-level molecular mechanisms of tumorigenesis. JF - BMC Genomics Y1 - 2007 A1 - Hernández, Pilar A1 - Huerta-Cepas, Jaime A1 - Montaner, David A1 - Al-Shahrour, Fátima A1 - Valls, Joan A1 - Gómez, Laia A1 - Capellà, Gabriel A1 - Dopazo, Joaquin A1 - Pujana, Miguel Angel KW - Cell Transformation, Neoplastic KW - Gene Expression Profiling KW - Gene Expression Regulation, Neoplastic KW - Humans KW - Male KW - Models, Biological KW - Models, Genetic KW - Models, Statistical KW - Neoplasm Proteins KW - Neoplasms KW - Prostatic Neoplasms KW - Protein Interaction Mapping KW - RNA, Messenger KW - Signal Transduction KW - Systems biology AB -

BACKGROUND: Cancer arises from the consecutive acquisition of genetic alterations. Increasing evidence suggests that as a consequence of these alterations, molecular interactions are reprogrammed in the context of highly connected and regulated cellular networks. Coordinated reprogramming would allow the cell to acquire the capabilities for malignant growth.

RESULTS: Here, we determine the coordinated function of cancer gene products (i.e., proteins encoded by differentially expressed genes in tumors relative to healthy tissue counterparts, hereafter referred to as "CGPs") defined as their topological properties and organization in the interactome network. We show that CGPs are central to information exchange and propagation and that they are specifically organized to promote tumorigenesis. Centrality is identified by both local (degree) and global (betweenness and closeness) measures, and systematically appears in down-regulated CGPs. Up-regulated CGPs do not consistently exhibit centrality, but both types of cancer products determine the overall integrity of the network structure. In addition to centrality, down-regulated CGPs show topological association that correlates with common biological processes and pathways involved in tumorigenesis.

CONCLUSION: Given the current limited coverage of the human interactome, this study proposes that tumorigenesis takes place in a specific and organized way at the molecular systems-level and suggests a model that comprises the precise down-regulation of groups of topologically-associated proteins involved in particular functions, orchestrated with the up-regulation of specific proteins.

VL - 8 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17584915?dopt=Abstract ER - TY - JOUR T1 - FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments. JF - Nucleic Acids Res Y1 - 2007 A1 - Al-Shahrour, Fátima A1 - Minguez, Pablo A1 - Tárraga, Joaquín A1 - Medina, Ignacio A1 - Alloza, Eva A1 - Montaner, David A1 - Dopazo, Joaquin KW - Amino Acid Motifs KW - Animals KW - Binding Sites KW - Computational Biology KW - Gene Expression Profiling KW - Genes KW - Genomics KW - Humans KW - Internet KW - Oligonucleotide Array Sequence Analysis KW - Programming Languages KW - Software KW - Systems Integration KW - Transcription Factors AB -

The ultimate goal of any genome-scale experiment is to provide a functional interpretation of the data, relating the available information with the hypotheses that originated the experiment. Thus, functional profiling methods have become essential in diverse scenarios such as microarray experiments, proteomics, etc. We present the FatiGO+, a web-based tool for the functional profiling of genome-scale experiments, specially oriented to the interpretation of microarray experiments. In addition to different functional annotations (gene ontology, KEGG pathways, Interpro motifs, Swissprot keywords and text-mining based bioentities related to diseases and chemical compounds) FatiGO+ includes, as a novelty, regulatory and structural information. The regulatory information used includes predictions of targets for distinct regulatory elements (obtained from the Transfac and CisRed databases). Additionally FatiGO+ uses predictions of target motifs of miRNA to infer which of these can be activated or deactivated in the sample of genes studied. Finally, properties of gene products related to their relative location and connections in the interactome have also been used. Also, enrichment of any of these functional terms can be directly analysed on chromosomal coordinates. FatiGO+ can be found at: http://www.fatigoplus.org and within the Babelomics environment http://www.babelomics.org.

VL - 35 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/17478504?dopt=Abstract ER - TY - JOUR T1 - FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments JF - Nucleic Acids Res Y1 - 2007 A1 - Fatima Al-Shahrour A1 - Minguez, P. A1 - Tarraga, J. A1 - Medina, Ignacio A1 - Alloza, E. A1 - Montaner, D. A1 - Dopazo, J. KW - babelomics KW - functional enrichment analysys AB -

The ultimate goal of any genome-scale experiment is to provide a functional interpretation of the data, relating the available information with the hypotheses that originated the experiment. Thus, functional profiling methods have become essential in diverse scenarios such as microarray experiments, proteomics, etc. We present the FatiGO+, a web-based tool for the functional profiling of genome-scale experiments, specially oriented to the interpretation of microarray experiments. In addition to different functional annotations (gene ontology, KEGG pathways, Interpro motifs, Swissprot keywords and text-mining based bioentities related to diseases and chemical compounds) FatiGO+ includes, as a novelty, regulatory and structural information. The regulatory information used includes predictions of targets for distinct regulatory elements (obtained from the Transfac and CisRed databases). Additionally FatiGO+ uses predictions of target motifs of miRNA to infer which of these can be activated or deactivated in the sample of genes studied. Finally, properties of gene products related to their relative location and connections in the interactome have also been used. Also, enrichment of any of these functional terms can be directly analysed on chromosomal coordinates. FatiGO+ can be found at: http://www.fatigoplus.org and within the Babelomics environment http://www.babelomics.org.

VL - 35 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17478504 N1 -

Al-Shahrour, Fatima Minguez, Pablo Tarraga, Joaquin Medina, Ignacio Alloza, Eva Montaner, David Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jul;35(Web Server issue):W91-6. Epub 2007 May 3.

ER - TY - JOUR T1 - From genes to functional classes in the study of biological systems JF - BMC Bioinformatics Y1 - 2007 A1 - Fatima Al-Shahrour A1 - Arbiza, L. A1 - H. Dopazo A1 - Huerta-Cepas, J. A1 - Minguez, P. A1 - Montaner, D. A1 - Dopazo, J. KW - Algorithms Chromosome Mapping/*methods Computer Simulation Gene Expression Profiling/methods *Models KW - babelomics KW - Biological Multigene Family/*physiology Signal Transduction/*physiology *Software Systems Biology/*methods *User-Computer Interface AB -

BACKGROUND: With the popularization of high-throughput techniques, the need for procedures that help in the biological interpretation of results has increased enormously. Recently, new procedures inspired in systems biology criteria have started to be developed. RESULTS: Here we present FatiScan, a web-based program which implements a threshold-independent test for the functional interpretation of large-scale experiments that does not depend on the pre-selection of genes based on the multiple application of independent tests to each gene. The test implemented aims to directly test the behaviour of blocks of functionally related genes, instead of focusing on single genes. In addition, the test does not depend on the type of the data used for obtaining significance values, and consequently different types of biologically informative terms (gene ontology, pathways, functional motifs, transcription factor binding sites or regulatory sites from CisRed) can be applied to different classes of genome-scale studies. We exemplify its application in microarray gene expression, evolution and interactomics. CONCLUSION: Methods for gene set enrichment which, in addition, are independent from the original data and experimental design constitute a promising alternative for the functional profiling of genome-scale experiments. A web server that performs the test described and other similar ones can be found at: http://www.babelomics.org.

VL - 8 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17407596 N1 -

Al-Shahrour, Fatima Arbiza, Leonardo Dopazo, Hernan Huerta-Cepas, Jaime Minguez, Pablo Montaner, David Dopazo, Joaquin Research Support, Non-U.S. Gov’t England BMC bioinformatics BMC Bioinformatics. 2007 Apr 3;8:114.

ER - TY - JOUR T1 - From genes to functional classes in the study of biological systems. JF - BMC Bioinformatics Y1 - 2007 A1 - Al-Shahrour, Fátima A1 - Arbiza, Leonardo A1 - Dopazo, Hernán A1 - Huerta-Cepas, Jaime A1 - Minguez, Pablo A1 - Montaner, David A1 - Dopazo, Joaquin KW - Algorithms KW - Chromosome Mapping KW - Computer Simulation KW - Gene Expression Profiling KW - Models, Biological KW - Multigene Family KW - Signal Transduction KW - Software KW - Systems biology KW - User-Computer Interface AB -

BACKGROUND: With the popularization of high-throughput techniques, the need for procedures that help in the biological interpretation of results has increased enormously. Recently, new procedures inspired in systems biology criteria have started to be developed.

RESULTS: Here we present FatiScan, a web-based program which implements a threshold-independent test for the functional interpretation of large-scale experiments that does not depend on the pre-selection of genes based on the multiple application of independent tests to each gene. The test implemented aims to directly test the behaviour of blocks of functionally related genes, instead of focusing on single genes. In addition, the test does not depend on the type of the data used for obtaining significance values, and consequently different types of biologically informative terms (gene ontology, pathways, functional motifs, transcription factor binding sites or regulatory sites from CisRed) can be applied to different classes of genome-scale studies. We exemplify its application in microarray gene expression, evolution and interactomics.

CONCLUSION: Methods for gene set enrichment which, in addition, are independent from the original data and experimental design constitute a promising alternative for the functional profiling of genome-scale experiments. A web server that performs the test described and other similar ones can be found at: http://www.babelomics.org.

VL - 8 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17407596?dopt=Abstract ER - TY - CHAP T1 - Functional annotation of microarray experiments T2 - Microarray Technology Through Applications Y1 - 2007 A1 - Dopazo, J. A1 - Fatima Al-Shahrour JF - Microarray Technology Through Applications PB - Taylor & Francis, F. Falciani CY - New York, USA ER - TY - JOUR T1 - Functional profiling and gene expression analysis of chromosomal copy number alterations. JF - Bioinformation Y1 - 2007 A1 - Conde, Lucia A1 - Montaner, David A1 - Burguet-Castell, Jordi A1 - Tárraga, Joaquín A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin AB -

Contrarily to the traditional view in which only one or a few key genes were supposed to be the causative factors of diseases, we discuss the importance of considering groups of functionally related genes in the study of pathologies characterised by chromosomal copy number alterations. Recent observations have reported the existence of regions in higher eukaryotic chromosomes (including humans) containing genes of related function that show a high degree of coregulation. Copy number alterations will consequently affect to clusters of functionally related genes, which will be the final causative agents of the diseased phenotype, in many cases. Therefore, we propose that the functional profiling of the regions affected by copy number alterations must be an important aspect to take into account in the understanding of this type of pathologies. To illustrate this, we present an integrated study of DNA copy number variations, gene expression along with the functional profiling of chromosomal regions in a case of multiple myeloma.

VL - 1 IS - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17597935?dopt=Abstract ER - TY - JOUR T1 - Functional profiling and gene expression analysis of chromosomal copy number alterations JF - Bioinformation Y1 - 2007 A1 - L. Conde A1 - Montaner, D. A1 - Burguet-Castell, J. A1 - Tarraga, J. A1 - Fatima Al-Shahrour A1 - Dopazo, J. KW - babelomics AB -

Contrarily to the traditional view in which only one or a few key genes were supposed to be the causative factors of diseases, we discuss the importance of considering groups of functionally related genes in the study of pathologies characterised by chromosomal copy number alterations. Recent observations have reported the existence of regions in higher eukaryotic chromosomes (including humans) containing genes of related function that show a high degree of coregulation. Copy number alterations will consequently affect to clusters of functionally related genes, which will be the final causative agents of the diseased phenotype, in many cases. Therefore, we propose that the functional profiling of the regions affected by copy number alterations must be an important aspect to take into account in the understanding of this type of pathologies. To illustrate this, we present an integrated study of DNA copy number variations, gene expression along with the functional profiling of chromosomal regions in a case of multiple myeloma.

VL - 1 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17597935 N1 -

Conde, Lucia Montaner, David Burguet-Castell, Jordi Tarraga, Joaquin Al-Shahrour, Fatima Dopazo, Joaquin Singapore Bioinformation Bioinformation. 2007 Apr 10;1(10):432-5.

ER - TY - JOUR T1 - Functional profiling of microarray experiments using text-mining derived bioentities. JF - Bioinformatics Y1 - 2007 A1 - Minguez, Pablo A1 - Al-Shahrour, Fátima A1 - Montaner, David A1 - Dopazo, Joaquin KW - Artificial Intelligence KW - Databases, Protein KW - Gene Expression Profiling KW - Information Storage and Retrieval KW - Natural Language Processing KW - Proteins KW - Research Design KW - Systems Integration AB -

MOTIVATION: The increasing use of microarray technologies brought about a parallel demand in methods for the functional interpretation of the results. Beyond the conventional functional annotations for genes, such as gene ontology, pathways, etc. other sources of information are still to be exploited. Text-mining methods allow extracting informative terms (bioentities) with different functional, chemical, clinical, etc. meanings, that can be associated to genes. We show how to use these associations within an appropriate statistical framework and how to apply them through easy-to-use, web-based environments to the functional interpretation of microarray experiments. Functional enrichment and gene set enrichment tests using bioentities are presented.

VL - 23 IS - 22 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17855415?dopt=Abstract ER - TY - JOUR T1 - Functional profiling of microarray experiments using text-mining derived bioentities JF - Bioinformatics Y1 - 2007 A1 - Minguez, P. A1 - Fatima Al-Shahrour A1 - Montaner, D. A1 - Dopazo, J. KW - Artificial Intelligence *Databases KW - babelomics KW - Protein Gene Expression Profiling/*methods Information Storage and Retrieval/*methods *Natural Language Processing Proteins/*classification/*metabolism Research/*methods Systems Integration AB -

MOTIVATION: The increasing use of microarray technologies brought about a parallel demand in methods for the functional interpretation of the results. Beyond the conventional functional annotations for genes, such as gene ontology, pathways, etc. other sources of information are still to be exploited. Text-mining methods allow extracting informative terms (bioentities) with different functional, chemical, clinical, etc. meanings, that can be associated to genes. We show how to use these associations within an appropriate statistical framework and how to apply them through easy-to-use, web-based environments to the functional interpretation of microarray experiments. Functional enrichment and gene set enrichment tests using bioentities are presented.

VL - 23 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17855415 N1 -

Minguez, Pablo Al-Shahrour, Fatima Montaner, David Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Bioinformatics (Oxford, England) Bioinformatics. 2007 Nov 15;23(22):3098-9. Epub 2007 Sep 13.

ER - TY - JOUR T1 - ISACGH: a web-based environment for the analysis of Array CGH and gene expression which includes functional profiling JF - Nucleic Acids Res Y1 - 2007 A1 - L. Conde A1 - Montaner, D. A1 - Burguet-Castell, J. A1 - Tarraga, J. A1 - Medina, Ignacio A1 - Fatima Al-Shahrour A1 - Dopazo, J. KW - Animals Cluster Analysis Computational Biology/*methods Computer Graphics Gene Expression Profiling/*methods Humans Internet Models KW - Genetic *Nucleic Acid Hybridization Oligonucleotide Array Sequence Analysis/*methods Programming Languages *Software Systems Integration User-Computer Interface AB - We present the ISACGH, a web-based system that allows for the combination of genomic data with gene expression values and provides different options for functional profiling of the regions found. Several visualization options offer a convenient representation of the results. Different efficient methods for accurate estimation of genomic copy number from array-CGH hybridization data have been included in the program. Moreover, the connection to the gene expression analysis package GEPAS allows the use of different facilities for data pre-processing and analysis. A DAS server allows exporting the results to the Ensembl viewer where contextual genomic information can be obtained. The program is freely available at: http://isacgh.bioinfo.cipf.es or within http://www.gepas.org. VL - 35 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17468499 N1 - Conde, Lucia Montaner, David Burguet-Castell, Jordi Tarraga, Joaquin Medina, Ignacio Al-Shahrour, Fatima Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jul;35(Web Server issue):W81-5. Epub 2007 Apr 27. ER - TY - JOUR T1 - ISACGH: a web-based environment for the analysis of Array CGH and gene expression which includes functional profiling. JF - Nucleic Acids Res Y1 - 2007 A1 - Conde, Lucia A1 - Montaner, David A1 - Burguet-Castell, Jordi A1 - Tárraga, Joaquín A1 - Medina, Ignacio A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin KW - Animals KW - Cluster Analysis KW - Computational Biology KW - Computer Graphics KW - Gene Expression Profiling KW - Humans KW - Internet KW - Models, Genetic KW - Nucleic Acid Hybridization KW - Oligonucleotide Array Sequence Analysis KW - Programming Languages KW - Software KW - Systems Integration KW - User-Computer Interface AB -

We present the ISACGH, a web-based system that allows for the combination of genomic data with gene expression values and provides different options for functional profiling of the regions found. Several visualization options offer a convenient representation of the results. Different efficient methods for accurate estimation of genomic copy number from array-CGH hybridization data have been included in the program. Moreover, the connection to the gene expression analysis package GEPAS allows the use of different facilities for data pre-processing and analysis. A DAS server allows exporting the results to the Ensembl viewer where contextual genomic information can be obtained. The program is freely available at: http://isacgh.bioinfo.cipf.es or within http://www.gepas.org.

VL - 35 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/17468499?dopt=Abstract ER - TY - CHAP T1 - New Trends in the Analysis of Functional Genomic Data T2 - Progress in Industrial Mathematics at ECMI 2006 Y1 - 2007 A1 - Montaner, D. A1 - Fatima Al-Shahrour A1 - Dopazo, J. JF - Progress in Industrial Mathematics at ECMI 2006 PB - Springer CY - Berlin VL - 12 UR - http://www.springerlink.com/content/m62p07r8111004vr/ ER - TY - JOUR T1 - Phylemon: a suite of web tools for molecular evolution, phylogenetics and phylogenomics JF - Nucleic Acids Res Y1 - 2007 A1 - Tarraga, J. A1 - Medina, Ignacio A1 - Arbiza, L. A1 - Huerta-Cepas, J. A1 - Gabaldón, T. A1 - Dopazo, J. A1 - H. Dopazo KW - Animals Computational Biology/*methods Databases KW - DNA Sequence Analysis KW - Genetic Evolution KW - Molecular Genetic Techniques Humans *Internet Models KW - Protein Software User-Computer Interface KW - Statistical *Phylogeny Programming Languages Sequence Alignment Sequence Analysis AB - Phylemon is an online platform for phylogenetic and evolutionary analyses of molecular sequence data. It has been developed as a web server that integrates a suite of different tools selected among the most popular stand-alone programs in phylogenetic and evolutionary analysis. It has been conceived as a natural response to the increasing demand of data analysis of many experimental scientists wishing to add a molecular evolution and phylogenetics insight into their research. Tools included in Phylemon cover a wide yet selected range of programs: from the most basic for multiple sequence alignment to elaborate statistical methods of phylogenetic reconstruction including methods for evolutionary rates analyses and molecular adaptation. Phylemon has several features that differentiates it from other resources: (i) It offers an integrated environment that enables the direct concatenation of evolutionary analyses, the storage of results and handles required data format conversions, (ii) Once an outfile is produced, Phylemon suggests the next possible analyses, thus guiding the user and facilitating the integration of multi-step analyses, and (iii) users can define and save complete pipelines for specific phylogenetic analysis to be automatically used on many genes in subsequent sessions or multiple genes in a single session (phylogenomics). The Phylemon web server is available at http://phylemon.bioinfo.cipf.es. VL - 35 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17452346 N1 - Tarraga, Joaquin Medina, Ignacio Arbiza, Leonardo Huerta-Cepas, Jaime Gabaldon, Toni Dopazo, Joaquin Dopazo, Hernan Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jul;35(Web Server issue):W38-42. Epub 2007 Apr 22. ER - TY - JOUR T1 - Transcriptional response of Citrus aurantifolia to infection by Citrus tristeza virus JF - Virology Y1 - 2007 A1 - Gandia, M. A1 - A. Conesa A1 - Ancillo, G. A1 - J. Gadea A1 - J. Forment A1 - Pallas, V. A1 - Flores, R. A1 - Duran-Vila, N. A1 - Moreno, P. A1 - Guerri, J. KW - Citrus/*genetics/physiology/virology Closterovirus/genetics/*physiology Genes KW - Genetic KW - Plant Oligonucleotide Array Sequence Analysis Reverse Transcriptase Polymerase Chain Reaction *Transcription AB - Changes in gene expression of Mexican lime plants in response to infection with a severe (T305) or a mild (T385) isolate of Citrus tristeza virus (CTV) were analyzed using a cDNA microarray containing 12,672 probes to 6875 different citrus genes. Statistically significant (P<0.01) expression changes of 334 genes were detected in response to infection with isolate T305, whereas infection with T385 induced no significant change. Induced genes included 145 without significant similarity with known sequences and 189 that were classified in seven functional categories. Genes related with response to stress and defense were the main category and included 28% of the genes induced. Selected transcription changes detected by microarray analysis were confirmed by quantitative real-time RT-PCR. Changes detected in the transcriptome upon infecting lime with T305 may be associated either with symptom expression, with a strain-specific defense mechanism, or with a general response to stress. VL - 367 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17617431 N1 - Gandia, Monica Conesa, Ana Ancillo, Gema Gadea, Jose Forment, Javier Pallas, Vicente Flores, Ricardo Duran-Vila, Nuria Moreno, Pedro Guerri, Jose Research Support, Non-U.S. Gov’t United States Virology Virology. 2007 Oct 25;367(2):298-306. Epub 2007 Jul 9. ER - TY - JOUR T1 - BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments JF - Nucleic Acids Res Y1 - 2006 A1 - Fatima Al-Shahrour A1 - Minguez, P. A1 - Tarraga, J. A1 - Montaner, D. A1 - Alloza, E. A1 - Vaquerizas, J. M. A1 - L. Conde A1 - Blaschke, C. A1 - Vera, J. A1 - Dopazo, J. KW - babelomics KW - functional profiling AB -

We present a new version of Babelomics, a complete suite of web tools for functional analysis of genome-scale experiments, with new and improved tools. New functionally relevant terms have been included such as CisRed motifs or bioentities obtained by text-mining procedures. An improved indexing has considerably speeded up several of the modules. An improved version of the FatiScan method for studying the coordinate behaviour of groups of functionally related genes is presented, along with a similar tool, the Gene Set Enrichment Analysis. Babelomics is now more oriented to test systems biology inspired hypotheses. Babelomics can be found at http://www.babelomics.org.

VL - 34 UR - http://nar.oxfordjournals.org/content/34/suppl_2/W472.long N1 -

Al-Shahrour, Fatima Minguez, Pablo Tarraga, Joaquin Montaner, David Alloza, Eva Vaquerizas, Juan M Conde, Lucia Blaschke, Christian Vera, Javier Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W472-6.

ER - TY - JOUR T1 - Blast2GO goes grid: developing a grid-enabled prototype for functional genomics analysis JF - Stud Health Technol Inform Y1 - 2006 A1 - Aparicio, G. A1 - Gotz, S. A1 - A. Conesa A1 - Segrelles, D. A1 - Blanquer, I. A1 - Garcia, J. M. A1 - Hernandez, V. A1 - Robles, M. A1 - Talon, M. KW - babelomics AB -

The vast amount in complexity of data generated in Genomic Research implies that new dedicated and powerful computational tools need to be developed to meet their analysis requirements. Blast2GO (B2G) is a bioinformatics tool for Gene Ontology-based DNA or protein sequence annotation and function-based data mining. The application has been developed with the aim of affering an easy-to-use tool for functional genomics research. Typical B2G users are middle size genomics labs carrying out sequencing, ETS and microarray projects, handling datasets up to several thousand sequences. In the current version of B2G. The power and analytical potential of both annotation and function data-mining is somehow restricted to the computational power behind each particular installation. In order to be able to offer the possibility of an enhanced computational capacity within this bioinformatics application, a Grid component is being developed. A prototype has been conceived for the particular problem of speeding up the Blast searches to obtain fast results for large datasets. Many efforts have been done in the literature concerning the speeding up of Blast searches, but few of them deal with the use of large heterogeneous production Grid Infrastructures. These are the infrastructures that could reach the largest number of resources and the best load balancing for data access. The Grid Service under development will analyse requests based on the number of sequences, splitting them accordingly to the available resources. Lower-level computation will be performed through MPIBLAST. The software architecture is based on the WSRF standard.

VL - 120 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16823138 N1 -

Aparicio, G Gotz, S Conesa, A Segrelles, D Blanquer, I Garcia, J M Hernandez, V Robles, M Talon, M Netherlands Studies in health technology and informatics Stud Health Technol Inform. 2006;120:194-204.

ER - TY - JOUR T1 - Discovery and hypothesis generation through bioinformatics JF - Genome Biol Y1 - 2006 A1 - Dopazo, J. A1 - Aloy, P. KW - *Computational Biology Genome KW - Genetic Phylogeny KW - Human *Genomics Humans *Models AB - A report on the 4th European Conference on Computational Biology and the 6th Spanish Annual Meeting on Bioinformatics, Madrid, Spain, 28 September-1 October 2005. VL - 7 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16522224 N1 - Dopazo, Joaquin Aloy, Patrick Congresses England Genome biology Genome Biol. 2006;7(2):307. Epub 2006 Feb 27. ER - TY - JOUR T1 - A function-centric approach to the biological interpretation of microarray time-series JF - Genome Inform Y1 - 2006 A1 - Minguez, P. A1 - Fatima Al-Shahrour A1 - Dopazo, J. KW - babelomics AB -

The interpretation of microarray experiments is commonly addressed by means a two-step approach in which the relevant genes are firstly selected uniquely on the basis of their experimental values (ignoring their coordinate behaviors) and in a second step their functional properties are studied to hypothesize about the biological roles they are fulfilling in the cell. Recently, different methods (e.g. GSEA or FatiScan) have been proposed to study the coordinate behavior of blocks of functionally-related genes. These methods study the distribution of functional information across lists of genes ranked according their different experimental values in a static situation, such as the comparison between two classes (e.g. healthy controls versus diseased cases). Nevertheless there is no an equivalent way of studying a dynamic situation from a functional point of view. We present a method for the functional analysis of microarrays series in which the experiments display autocorrelation between successive points (e.g. time series, dose-response experiments, etc.) The method allows to recover the dynamics of the molecular roles fulfilled by the genes along the series which provides a novel approach to functional interpretation of such experiments. The method finds blocks of functionally-related genes which are significantly and coordinately over-expressed at different points of the series. This method draws inspiration from systems biology given that the analysis does not focus on individual properties of genes but on collective behaving blocks of functionally-related genes. The FatiScan algorithm used in the method proposed is available at: http://fatiscan.bioinfo.cipf.es, or within the Babelomics suite: http://www.babelomics.org. Additional material is available at: http://bioinfo.cipf.es/data/plasmodium.

VL - 17 N1 -

Minguez, Pablo Al-Shahrour, Fatima Dopazo, Joaquin Research Support, Non-U.S. Gov’t Japan Genome informatics. International Conference on Genome Informatics Genome Inform. 2006;17(2):57-66.

ER - TY - JOUR T1 - Identification of overexpressed genes in frequently gained/amplified chromosome regions in multiple myeloma JF - Haematologica Y1 - 2006 A1 - Largo, C. A1 - Alvarez, S. A1 - Saez, B. A1 - Blesa, D. A1 - Martin-Subero, J. I. A1 - Gonzalez-Garcia, I. A1 - Brieva, J. A. A1 - Dopazo, J. A1 - Siebert, R. A1 - Calasanz, M. J. A1 - Cigudosa, J. C. KW - B-Cell KW - Caspases Cell Line KW - Human *Gene Amplification Gene Dosage Gene Expression Profiling *Gene Expression Regulation KW - Marginal Zone/genetics Multiple Myeloma/*genetics Neoplasm Proteins/genetics Proto-Oncogene Proteins c-bcl-2/genetics KW - Neoplasm Humans Immunoglobulin Heavy Chains/genetics Lymphoma KW - Neoplastic Gene Rearrangement *Genes KW - Tumor *Chromosomes AB - BACKGROUND AND OBJECTIVES: Multiple myeloma (MM) is a malignancy characterized by clonal expansion of plasma cells. In 50% of the cases, the neoplastic transformation begins with a chromosomal translocation that juxtaposes the IGH gene locus to an oncogene. Gene copy number changes are also frequent in MM but less characterized than in other neoplasias. We aimed to characterize genes that are amplified and overexpressed in human myeloma cell lines (HMCL) to provide putative molecular targets for MM therapy. DESIGN AND METHODS: Nine HMCL were characterized by fluorescent in situ hybridization, comparative genomic hybridization (CGH) and cDNA microarrays for gene expression profiling and copy number changes. RESULTS: After defining the IGH-translocations present in the cell lines, we conducted expression-profiling analysis. Supervised analysis identified 166 genes with significantly different expression among the cell lines harboring MMSET/FGFR3 (4p16), MAF (16q) and CCND1 (11q13) rearrangements. Array-CGH was then performed. Five chromosomes recurrently affected by gains/amplifications in primary samples and cell lines were analyzed in detail. Sixty amplified and overexpressed genes were found and 25 (42%) of them were only overexpressed when amplified; moreover, six showed a significant association between overexpression and gain/amplification. We also found co-amplification and overexpression for genes located within the same amplicons, such as MALT1 and BCL2. INTERPRETATION AND CONCLUSIONS: Parallel analysis of gene copy numbers and expression levels by cDNA microarray in MM allowed efficient identification of genes whose expression levels are elevated because of increased copy number. This is the first time that MALT1 and BCL2 have been shown to be overexpressed and amplified in MM. VL - 91 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16461302 N1 - Largo, Cristina Alvarez, Sara Saez, Borja Blesa, David Martin-Subero, Jose I Gonzalez-Garcia, Ines Brieva, Jose A Dopazo, Joaquin Siebert, Reiner Calasanz, Maria J Cigudosa, Juan C Research Support, Non-U.S. Gov’t Italy Haematologica Haematologica. 2006 Feb;91(2):184-91. ER - TY - JOUR T1 - Next station in microarray data analysis: GEPAS JF - Nucleic Acids Res Y1 - 2006 A1 - Montaner, D. A1 - Tarraga, J. A1 - Huerta-Cepas, J. A1 - Burguet, J. A1 - Vaquerizas, J. M. A1 - L. Conde A1 - Minguez, P. A1 - Vera, J. A1 - Mukherjee, S. A1 - Valls, J. A1 - Pujana, M. A. A1 - Alloza, E. A1 - Herrero, J. A1 - Fatima Al-Shahrour A1 - Dopazo, J. KW - gepas KW - microarray data analysis AB -

The Gene Expression Profile Analysis Suite (GEPAS) has been running for more than four years. During this time it has evolved to keep pace with the new interests and trends in the still changing world of microarray data analysis. GEPAS has been designed to provide an intuitive although powerful web-based interface that offers diverse analysis options from the early step of preprocessing (normalization of Affymetrix and two-colour microarray experiments and other preprocessing options), to the final step of the functional annotation of the experiment (using Gene Ontology, pathways, PubMed abstracts etc.), and include different possibilities for clustering, gene selection, class prediction and array-comparative genomic hybridization management. GEPAS is extensively used by researchers of many countries and its records indicate an average usage rate of 400 experiments per day. The web-based pipeline for microarray gene expression data, GEPAS, is available at http://www.gepas.org.

VL - 34 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16845056 N1 -

Montaner, David Tarraga, Joaquin Huerta-Cepas, Jaime Burguet, Jordi Vaquerizas, Juan M Conde, Lucia Minguez, Pablo Vera, Javier Mukherjee, Sach Valls, Joan Pujana, Miguel A G Alloza, Eva Herrero, Javier Al-Shahrour, Fatima Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W486-91.

ER - TY - JOUR T1 - Ontology-driven approaches to analyzing data in functional genomics JF - Methods Mol Biol Y1 - 2006 A1 - F. Azuaje A1 - Fatima Al-Shahrour A1 - Dopazo, J. KW - babelomics KW - Cluster Analysis KW - Cluster Analysis Computational Biology/*methods *Data Interpretation KW - Computational Biology KW - Statistical Gene Expression Profiling KW - Statistical Gene Expression Profiling *Genomics Humans AB -

Ontologies are fundamental knowledge representations that provide not only standards for annotating and indexing biological information, but also the basis for implementing functional classification and interpretation models. This chapter discusses the application of gene ontology (GO) for predictive tasks in functional genomics. It focuses on the problem of analyzing functional patterns associated with gene products. This chapter is divided into two main parts. The first part overviews GO and its applications for the development of functional classification models. The second part presents two methods for the characterization of genomic information using GO. It discusses methods for measuring functional similarity of gene products, and a tool for supporting gene expression clustering analysis and validation.

VL - 316 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16671401 N1 -

Azuaje, Francisco Al-Shahrour, Fatima Dopazo, Joaquin Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t Review United States Methods in molecular biology (Clifton, N.J.) Methods Mol Biol. 2006;316:67-86.

ER - TY - JOUR T1 - Positive selection, relaxation, and acceleration in the evolution of the human and chimp genome JF - PLoS Comput Biol Y1 - 2006 A1 - Arbiza, L. A1 - Dopazo, J. A1 - H. Dopazo KW - Adaptation KW - Biological/genetics Animals *Evolution KW - Molecular Genome/*genetics Humans Pan troglodytes/*genetics *Selection (Genetics) AB - For years evolutionary biologists have been interested in searching for the genetic bases underlying humanness. Recent efforts at a large or a complete genomic scale have been conducted to search for positively selected genes in human and in chimp. However, recently developed methods allowing for a more sensitive and controlled approach in the detection of positive selection can be employed. Here, using 13,198 genes, we have deduced the sets of genes involved in rate acceleration, positive selection, and relaxation of selective constraints in human, in chimp, and in their ancestral lineage since the divergence from murids. Significant deviations from the strict molecular clock were observed in 469 human and in 651 chimp genes. The more stringent branch-site test of positive selection detected 108 human and 577 chimp positively selected genes. An important proportion of the positively selected genes did not show a significant acceleration in rates, and similarly, many of the accelerated genes did not show significant signals of positive selection. Functional differentiation of genes under rate acceleration, positive selection, and relaxation was not statistically significant between human and chimp with the exception of terms related to G-protein coupled receptors and sensory perception. Both of these were over-represented under relaxation in human in relation to chimp. Comparing differences between derived and ancestral lineages, a more conspicuous change in trends seems to have favored positive selection in the human lineage. Since most of the positively selected genes are different under the same functional categories between these species, we suggest that the individual roles of the alternative positively selected genes may be an important factor underlying biological differences between these species. VL - 2 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16683019 N1 - Arbiza, Leonardo Dopazo, Joaquin Dopazo, Hernan Research Support, Non-U.S. Gov’t United States PLoS computational biology PLoS Comput Biol. 2006 Apr;2(4):e38. Epub 2006 Apr 28. ER - TY - JOUR T1 - PupaSuite: finding functional single nucleotide polymorphisms for large-scale genotyping purposes JF - Nucleic Acids Res Y1 - 2006 A1 - L. Conde A1 - Vaquerizas, J. M. A1 - H. Dopazo A1 - Arbiza, L. A1 - Reumers, J. A1 - Rousseau, F. A1 - Schymkowitz, J. A1 - Dopazo, J. KW - Algorithms Computer Graphics Databases KW - Molecular Genotype Haplotypes Internet Linkage Disequilibrium *Polymorphism KW - Nucleic Acid Evolution KW - Single Nucleotide *Software User-Computer Interface AB -

We have developed a web tool, PupaSuite, for the selection of single nucleotide polymorphisms (SNPs) with potential phenotypic effect, specifically oriented to help in the design of large-scale genotyping projects. PupaSuite uses a collection of data on SNPs from heterogeneous sources and a large number of pre-calculated predictions to offer a flexible and intuitive interface for selecting an optimal set of SNPs. It improves the functionality of PupaSNP and PupasView programs and implements new facilities such as the analysis of user’s data to derive haplotypes with functional information. A new estimator of putative effect of polymorphisms has been included that uses evolutionary information. Also SNPeffect database predictions have been included. The PupaSuite web interface is accessible through http://pupasuite.bioinfo.cipf.es and through http://www.pupasnp.org.

VL - 34 UR - http://nar.oxfordjournals.org/cgi/content/full/34/suppl_2/W621 N1 -

Conde, Lucia Vaquerizas, Juan M Dopazo, Hernan Arbiza, Leonardo Reumers, Joke Rousseau, Frederic Schymkowitz, Joost Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W621-5.

ER - TY - JOUR T1 - Selective pressures at a codon-level predict deleterious mutations in human disease genes JF - J Mol Biol Y1 - 2006 A1 - Arbiza, L. A1 - Duchi, S. A1 - Montaner, D. A1 - Burguet, J. A1 - Pantoja-Uceda, D. A1 - Pineda-Lucena, A. A1 - Dopazo, J. A1 - H. Dopazo KW - Amino Acid Sequence Amino Acid Substitution Codon/*genetics Databases KW - Genetic Evolution KW - Genetic Models KW - Human Humans Models KW - Inborn/*genetics Genome KW - Molecular Genes KW - Molecular Molecular Sequence Data *Mutation Neoplasms/genetics Proteins/genetics *Selection (Genetics) Tumor Suppressor Protein p53/chemistry/genetics KW - p53 Genetic Diseases AB - Deleterious mutations affecting biological function of proteins are constantly being rejected by purifying selection from the gene pool. The non-synonymous/synonymous substitution rate ratio (omega) is a measure of selective pressure on amino acid replacement mutations for protein-coding genes. Different methods have been developed in order to predict non-synonymous changes affecting gene function. However, none has considered the estimation of selective constraints acting on protein residues. Here, we have used codon-based maximum likelihood models in order to estimate the selective pressures on the individual amino acid residues of a well-known model protein: p53. We demonstrate that the number of residues under strong purifying selection in p53 is much higher than those that are strictly conserved during the evolution of the species. In agreement with theoretical expectations, residues that have been noted to be of structural relevance, or in direct association with DNA, were among those showing the highest signals of purifying selection. Conversely, those changing according to a neutral, or nearly neutral mode of evolution, were observed to be irrelevant for protein function. Finally, using more than 40 human disease genes, we demonstrate that residues evolving under strong selective pressures (omega<0.1) are significantly associated (p<0.01) with human disease. We hypothesize that non-synonymous change on amino acids showing omega<0.1 will most likely affect protein function. The application of this evolutionary prediction at a genomic scale will provide an a priori hypothesis of the phenotypic effect of non-synonymous coding single nucleotide polymorphisms (SNPs) in the human genome. VL - 358 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16584746 N1 - Arbiza, Leonardo Duchi, Serena Montaner, David Burguet, Jordi Pantoja-Uceda, David Pineda-Lucena, Antonio Dopazo, Joaquin Dopazo, Hernan Research Support, Non-U.S. Gov’t England Journal of molecular biology J Mol Biol. 2006 May 19;358(5):1390-404. Epub 2006 Mar 15. ER - TY - JOUR T1 - BABELOMICS: a suite of web tools for functional annotation and analysis of groups of genes in high-throughput experiments JF - Nucleic Acids Res Y1 - 2005 A1 - Fatima Al-Shahrour A1 - Minguez, P. A1 - Vaquerizas, J. M. A1 - L. Conde A1 - Dopazo, J. KW - babelomics KW - functional profiling AB -

We present Babelomics, a complete suite of web tools for the functional analysis of groups of genes in high-throughput experiments, which includes the use of information on Gene Ontology terms, interpro motifs, KEGG pathways, Swiss-Prot keywords, analysis of predicted transcription factor binding sites, chromosomal positions and presence in tissues with determined histological characteristics, through five integrated modules: FatiGO (fast assignment and transference of information), FatiWise, transcription factor association test, GenomeGO and tissues mining tool, respectively. Additionally, another module, FatiScan, provides a new procedure that integrates biological information in combination with experimental results in order to find groups of genes with modest but coordinate significant differential behaviour. FatiScan is highly sensitive and is capable of finding significant asymmetries in the distribution of genes of common function across a list of ordered genes even if these asymmetries were not extreme. The strong multiple-testing nature of the contrasts made by the tools is taken into account. All the tools are integrated in the gene expression analysis package GEPAS. Babelomics is the natural evolution of our tool FatiGO (which analysed almost 22,000 experiments during the last year) to include more sources on information and new modes of using it. Babelomics can be found at http://www.babelomics.org.

VL - 33 UR - http://nar.oxfordjournals.org/content/33/suppl_2/W460.long N1 -

Al-Shahrour, Fatima Minguez, Pablo Vaquerizas, Juan M Conde, Lucia Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W460-4.

ER - TY - CHAP T1 - Data analysis and visualisation in genomics and proteomics Y1 - 2005 A1 - F. Azuaje A1 - Dopazo, J. KW - babelomics PB - Wiley, F. Azuaje and J. Dopazo ER - TY - CHAP T1 - Data and Predictive Model Integration: an Overview of Key Concepts, Problems and Solutions T2 - Data analysis and visualisation in genomics and proteomics Y1 - 2005 A1 - F. Azuaje A1 - Dopazo, J. A1 - Wang, H JF - Data analysis and visualisation in genomics and proteomics PB - Wiley, F. Azuaje and J. Dopazo ER - TY - JOUR T1 - Detecting remotely related proteins by their interactions and sequence similarity JF - Proc Natl Acad Sci U S A Y1 - 2005 A1 - Espadaler, J. A1 - Aragues, R. A1 - Eswar, N. A1 - M. A. Marti-Renom A1 - Querol, E. A1 - Aviles, F. X. A1 - Sali, A. A1 - Oliva, B. KW - Amino Acid KW - Computational Biology Databases KW - Molecular Protein Conformation Protein Folding Proteins/*genetics/*metabolism Proteomics/*methods *Sequence Homology KW - Protein *Evolution AB - The function of an uncharacterized protein is usually inferred either from its homology to, or its interactions with, characterized proteins. Here, we use both sequence similarity and protein interactions to identify relationships between remotely related protein sequences. We rely on the fact that homologous sequences share similar interactions, and, therefore, the set of interacting partners of the partners of a given protein is enriched by its homologs. The approach was bench-marked by assigning the fold and functional family to test sequences of known structure. Specifically, we relied on 1,434 proteins with known folds, as defined in the Structural Classification of Proteins (SCOP) database, and with known interacting partners, as defined in the Database of Interacting Proteins (DIP). For this subset, the specificity of fold assignment was increased from 54% for position-specific iterative BLAST to 75% for our approach, with a concomitant increase in sensitivity for a few percentage points. Similarly, the specificity of family assignment at the e-value threshold of 10(-8) was increased from 70% to 87%. The proposed method would be a useful tool for large-scale automated discovery of remote relationships between protein sequences, given its unique reliance on sequence similarity and protein-protein interactions. VL - 102 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15883372 N1 - Espadaler, Jordi Aragues, Ramon Eswar, Narayanan Marti-Renom, Marc A Querol, Enrique Aviles, Francesc X Sali, Andrej Oliva, Baldomero R01 GM54762/GM/NIGMS NIH HHS/United States Comparative Study Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, P.H.S. United States Proceedings of the National Academy of Sciences of the United States of America Proc Natl Acad Sci U S A. 2005 May 17;102(20):7151-6. Epub 2005 May 9. ER - TY - JOUR T1 - Development of a citrus genome-wide EST collection and cDNA microarray as resources for genomic studies JF - Plant Mol Biol Y1 - 2005 A1 - J. Forment A1 - J. Gadea A1 - Huerta, L. A1 - Abizanda, L. A1 - Agusti, J. A1 - Alamar, S. A1 - Alos, E. A1 - Andres, F. A1 - Arribas, R. A1 - Beltran, J. P. A1 - Berbel, A. A1 - Blazquez, M. A. A1 - Brumos, J. A1 - Canas, L. A. A1 - Cercos, M. A1 - Colmenero-Flores, J. M. A1 - A. Conesa A1 - Estables, B. A1 - Gandia, M. A1 - Garcia-Martinez, J. L. A1 - Gimeno, J. A1 - Gisbert, A. A1 - Gomez, G. A1 - Gonzalez-Candelas, L. A1 - Granell, A. A1 - Guerri, J. A1 - Lafuente, M. T. A1 - Madueno, F. A1 - Marcos, J. F. A1 - Marques, M. C. A1 - Martinez, F. A1 - Martinez-Godoy, M. A. A1 - Miralles, S. A1 - Moreno, P. A1 - Navarro, L. A1 - Pallas, V. A1 - Perez-Amador, M. A. A1 - Perez-Valle, J. A1 - Pons, C. A1 - Rodrigo, I. A1 - Rodriguez, P. L. A1 - Royo, C. A1 - Serrano, R. A1 - Soler, G. A1 - Tadeo, F. A1 - Talon, M. A1 - Terol, J. A1 - Trenor, M. A1 - Vaello, L. A1 - Vicente, O. A1 - Vidal, Ch A1 - Zacarias, L. A1 - Conejero, V. KW - Citrus/*genetics DNA KW - Complementary/chemistry/genetics *Expressed Sequence Tags Gene Expression Profiling Gene Library *Genome KW - DNA KW - Plant Genomics/*methods Molecular Sequence Data Oligonucleotide Array Sequence Analysis/*methods RNA KW - Plant/genetics/metabolism Reproducibility of Results Sequence Analysis AB - A functional genomics project has been initiated to approach the molecular characterization of the main biological and agronomical traits of citrus. As a key part of this project, a citrus EST collection has been generated from 25 cDNA libraries covering different tissues, developmental stages and stress conditions. The collection includes a total of 22,635 high-quality ESTs, grouped in 11,836 putative unigenes, which represent at least one third of the estimated number of genes in the citrus genome. Functional annotation of unigenes which have Arabidopsis orthologues (68% of all unigenes) revealed gene representation in every major functional category, suggesting that a genome-wide EST collection was obtained. A Citrus clementina Hort. ex Tan. cv. Clemenules genomic library, that will contribute to further characterization of relevant genes, has also been constructed. To initiate the analysis of citrus transcriptome, we have developed a cDNA microarray containing 12,672 probes corresponding to 6875 putative unigenes of the collection. Technical characterization of the microarray showed high intra- and inter-array reproducibility, as well as a good range of sensitivity. We have also validated gene expression data achieved with this microarray through an independent technique such as RNA gel blot analysis. VL - 57 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15830128 N1 - Forment, J Gadea, J Huerta, L Abizanda, L Agusti, J Alamar, S Alos, E Andres, F Arribas, R Beltran, J P Berbel, A Blazquez, M A Brumos, J Canas, L A Cercos, M Colmenero-Flores, J M Conesa, A Estables, B Gandia, M Garcia-Martinez, J L Gimeno, J Gisbert, A Gomez, G Gonzalez-Candelas, L Granell, A Guerri, J Lafuente, M T Madueno, F Marcos, J F Marques, M C Martinez, F Martinez-Godoy, M A Miralles, S Moreno, P Navarro, L Pallas, V Perez-Amador, M A Perez-Valle, J Pons, C Rodrigo, I Rodriguez, P L Royo, C Serrano, R Soler, G Tadeo, F Talon, M Terol, J Trenor, M Vaello, L Vicente, O Vidal, Ch Zacarias, L Conejero, V Comparative Study Research Support, U.S. Gov’t, Non-P.H.S. Netherlands Plant molecular biology Plant Mol Biol. 2005 Feb;57(3):375-91. ER - TY - JOUR T1 - Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information JF - Bioinformatics Y1 - 2005 A1 - Fatima Al-Shahrour A1 - Diaz-Uriarte, R. A1 - Dopazo, J. KW - babelomics KW - Biological Neoplasm Proteins/genetics/*metabolism Phenotype Software Structure-Activity Relationship Systems Integration Tumor Markers KW - Biological/genetics/*metabolism KW - Breast Neoplasms/genetics/*metabolism Computer Simulation *Database Management Systems *Databases KW - Protein Documentation/methods Gene Expression Profiling/*methods Humans *Models AB -

MOTIVATION: The analysis of genome-scale data from different high throughput techniques can be used to obtain lists of genes ordered according to their different behaviours under distinct experimental conditions corresponding to different phenotypes (e.g. differential gene expression between diseased samples and controls, different response to a drug, etc.). The order in which the genes appear in the list is a consequence of the biological roles that the genes play within the cell, which account, at molecular scale, for the macroscopic differences observed between the phenotypes studied. Typically, two steps are followed for understanding the biological processes that differentiate phenotypes at molecular level: first, genes with significant differential expression are selected on the basis of their experimental values and subsequently, the functional properties of these genes are analysed. Instead, we present a simple procedure which combines experimental measurements with available biological information in a way that genes are simultaneously tested in groups related by common functional properties. The method proposed constitutes a very sensitive tool for selecting genes with significant differential behaviour in the experimental conditions tested. RESULTS: We propose the use of a method to scan ordered lists of genes. The method allows the understanding of the biological processes operating at molecular level behind the macroscopic experiment from which the list was generated. This procedure can be useful in situations where it is not possible to obtain statistically significant differences based on the experimental measurements (e.g. low prevalence diseases, etc.). Two examples demonstrate its application in two microarray experiments and the type of information that can be extracted.

VL - 21 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15840702 N1 -

Al-Shahrour, Fatima Diaz-Uriarte, Ramon Dopazo, Joaquin Evaluation Studies Research Support, Non-U.S. Gov’t England Bioinformatics (Oxford, England) Bioinformatics. 2005 Jul 1;21(13):2988-93. Epub 2005 Apr 19.

ER - TY - JOUR T1 - GEPAS, an experiment-oriented pipeline for the analysis of microarray gene expression data JF - Nucleic Acids Res Y1 - 2005 A1 - Vaquerizas, J. M. A1 - L. Conde A1 - Yankilevich, P. A1 - Cabezon, A. A1 - Minguez, P. A1 - Diaz-Uriarte, R. A1 - Fatima Al-Shahrour A1 - Herrero, J. A1 - Dopazo, J. KW - gepas KW - microarray data analysis AB -

The Gene Expression Profile Analysis Suite, GEPAS, has been running for more than three years. With >76,000 experiments analysed during the last year and a daily average of almost 300 analyses, GEPAS can be considered a well-established and widely used platform for gene expression microarray data analysis. GEPAS is oriented to the analysis of whole series of experiments. Its design and development have been driven by the demands of the biomedical community, probably the most active collective in the field of microarray users. Although clustering methods have obviously been implemented in GEPAS, our interest has focused more on methods for finding genes differentially expressed among distinct classes of experiments or correlated to diverse clinical outcomes, as well as on building predictors. There is also a great interest in CGH-arrays which fostered the development of the corresponding tool in GEPAS: InSilicoCGH. Much effort has been invested in GEPAS for developing and implementing efficient methods for functional annotation of experiments in the proper statistical framework. Thus, the popular FatiGO has expanded to a suite of programs for functional annotation of experiments, including information on transcription factor binding sites, chromosomal location and tissues. The web-based pipeline for microarray gene expression data, GEPAS, is available at http://www.gepas.org.

VL - 33 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15980548 N1 -

Vaquerizas, Juan M Conde, Lucia Yankilevich, Patricio Cabezon, Amaya Minguez, Pablo Diaz-Uriarte, Ramon Al-Shahrour, Fatima Herrero, Javier Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W616-20.

ER - TY - CHAP T1 - Integrative Data Analysis and Visualization: Introduction to Critical Problems, Goals and Challenges T2 - Data analysis and visualisation in genomics and proteomics Y1 - 2005 A1 - F. Azuaje A1 - Dopazo, J. JF - Data analysis and visualisation in genomics and proteomics PB - Wiley, F. Azuaje and J. Dopazo ER - TY - CHAP T1 - Ontologies and functional genomics T2 - Data analysis and visualisation in genomics and proteomics Y1 - 2005 A1 - Fatima Al-Shahrour A1 - Dopazo, J. JF - Data analysis and visualisation in genomics and proteomics PB - Wiley, F. Azuaje and J. Dopazo ER - TY - JOUR T1 - Phenotypic characterization of BRCA1 and BRCA2 tumors based in a tissue microarray study with 37 immunohistochemical markers JF - Breast Cancer Res Treat Y1 - 2005 A1 - Palacios, J. A1 - Honrado, E. A1 - Osorio, A. A1 - Cazorla, A. A1 - Sarrio, D. A1 - Barroso, A. A1 - Rodriguez, S. A1 - Cigudosa, J. C. A1 - Diez, O. A1 - Alonso, C. A1 - Lerma, E. A1 - Dopazo, J. A1 - Rivas, C. A1 - Benitez, J. KW - Adult Apoptosis Breast Neoplasms/*genetics/*pathology Cell Cycle Proteins Cluster Analysis Female *Genes KW - Biological/genetics/metabolism KW - BRCA1 *Genes KW - BRCA2 Humans Immunohistochemistry In Situ Hybridization KW - Fluorescence Phenotype Spain *Tissue Array Analysis *Tumor Markers AB - Familial breast cancers that are associated with BRCA1 or BRCA2 germline mutations differ in both their morphological and immunohistochemical characteristics. To further characterize the molecular difference between genotypes, the authors evaluated the expression of 37 immunohistochemical markers in a tissue microarray (TMA) containing cores from 20 BRCA1, 14 BRCA2, and 59 sporadic age-matched breast carcinomas. Markers analyzed included, amog others, common markers in breast cancer, such as hormone receptors, p53 and HER2, along with 15 molecules involved in cell cycle regulation, such as cyclins, cyclin dependent kinases (CDK) and CDK inhibitors (CDKI), apoptosis markers, such as BCL2 and active caspase 3, and two basal/myoepithelial markers (CK 5/6 and P-cadherin). In addition, we analyzed the amplification of CCND1, CCNE, HER2 and MYC by FISH.Unsupervised cluster data analysis of both hereditary and sporadic cases using the complete set of immunohistochemical markers demonstrated that most BRCA1-associated carcinomas grouped in a branch of ER-, HER2-negative tumors that expressed basal cell markers and/or p53 and had higher expression of activated caspase 3. The cell cycle proteins associated with these tumors were E2F6, cyclins A, B1 and E, SKP2 and Topo IIalpha. In contrast, most BRCA2-associated carcinomas grouped in a branch composed by ER/PR/BCL2-positive tumors with a higher expression of the cell cycle proteins cyclin D1, cyclin D3, p27, p16, p21, CDK4, CDK2 and CDK1. In conclusion, our study in hereditary breast cancer tumors analyzing 37 immunohistochemical markers, define the molecular differences between BRCA1 and BRCA2 tumors with respect to hormonal receptors, cell cycle, apoptosis and basal cell markers. VL - 90 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15770521 N1 - Palacios, Jose Honrado, Emiliano Osorio, Ana Cazorla, Alicia Sarrio, David Barroso, Alicia Rodriguez, Sandra Cigudosa, Juan C Diez, Orland Alonso, Carmen Lerma, Enrique Dopazo, Joaquin Rivas, Carmen Benitez, Javier Research Support, Non-U.S. Gov’t Netherlands Breast cancer research and treatment Breast Cancer Res Treat. 2005 Mar;90(1):5-14. ER - TY - JOUR T1 - A predictor based on the somatic genomic changes of the BRCA1/BRCA2 breast cancer tumors identifies the non-BRCA1/BRCA2 tumors with BRCA1 promoter hypermethylation JF - Clin Cancer Res Y1 - 2005 A1 - Alvarez, S. A1 - Diaz-Uriarte, R. A1 - Osorio, A. A1 - Barroso, A. A1 - Melchor, L. A1 - Paz, M. F. A1 - Honrado, E. A1 - Rodriguez, R. A1 - Urioste, M. A1 - Valle, L. A1 - Diez, O. A1 - Cigudosa, J. C. A1 - Dopazo, J. A1 - Esteller, M. A1 - Benitez, J. KW - BRCA1 Protein/*genetics BRCA2 Protein/*genetics Breast Neoplasms/*genetics/pathology Chromosomes KW - Genetic/*genetics KW - Human KW - Human Humans Male Mutation Nucleic Acid Hybridization/methods Promoter Regions KW - Pair 12/genetics Chromosomes KW - Pair 15/genetics Chromosomes KW - Pair 18/genetics Chromosomes KW - Pair 2/genetics Chromosomes KW - Pair 8/genetics *DNA Methylation Female Genome AB - The genetic changes underlying in the development and progression of familial breast cancer are poorly understood. To identify a somatic genetic signature of tumor progression for each familial group, BRCA1, BRCA2, and non-BRCA1/BRCA2 (BRCAX) tumors, by high-resolution comparative genomic hybridization, we have analyzed 77 tumors previously characterized for BRCA1 and BRCA2 germ line mutations. Based on a combination of the somatic genetic changes observed at the six most different chromosomal regions and the status of the estrogen receptor, we developed using random forests a molecular classifier, which assigns to a given tumor a probability to belong either to the BRCA1 or to the BRCA2 class. Because 76.5% (26 of 34) of the BRCAX cases were classified with our predictor to the BRCA1 class with a probability of >50%, we analyzed the BRCA1 promoter region for aberrant methylation in all the BRCAX cases. We found that 15 of the 34 BRCAX analyzed tumors had hypermethylation of the BRCA1 gene. When we considered the predictor, we observed that all the cases with this epigenetic event were assigned to the BRCA1 class with a probability of >50%. Interestingly, 84.6% of the cases (11 of 13) assigned to the BRCA1 class with a probability >80% had an aberrant methylation of the BRCA1 promoter. This fact suggests that somatic BRCA1 inactivation could modify the profile of tumor progression in most of the BRCAX cases. VL - 11 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15709182 N1 - Alvarez, Sara Diaz-Uriarte, Ramon Osorio, Ana Barroso, Alicia Melchor, Lorenzo Paz, Maria Fe Honrado, Emiliano Rodriguez, Raquel Urioste, Miguel Valle, Laura Diez, Orland Cigudosa, Juan Cruz Dopazo, Joaquin Esteller, Manel Benitez, Javier Comparative Study Research Support, Non-U.S. Gov’t United States Clinical cancer research : an official journal of the American Association for Cancer Research Clin Cancer Res. 2005 Feb 1;11(3):1146-53. ER - TY - CHAP T1 - Salinibacter ruber: genomics and biogeography T2 - Adaptation to life in high salt concentrations in Archaea, Bacteria and Eukarya Y1 - 2005 A1 - Antón, J A1 - Peña, A A1 - Valens, M A1 - Santos, F A1 - Glöckner, F.O A1 - Bauer, M A1 - Dopazo, J. A1 - Herrero, J. A1 - Rosselló-Mora, R A1 - Amann, R JF - Adaptation to life in high salt concentrations in Archaea, Bacteria and Eukarya PB - Nina Gunde-Cimerman, Ana Plemenitas, and Aharon Oren. Kluwer Academic Publishers CY - Dordrecht, Netherlands VL - 9 ER - TY - JOUR T1 - Expression profiling of T-cell lymphomas differentiates peripheral and lymphoblastic lymphomas and defines survival related genes. JF - Clinical cancer research : an official journal of the American Association for Cancer Research Y1 - 2004 A1 - Martinez-Delgado, Beatriz A1 - Meléndez, Barbara A1 - Cuadros, Marta A1 - Alvarez, Javier A1 - Castrillo, Jose Maria A1 - Ruiz De La Parte, Ana A1 - Mollejo, Manuela A1 - Bellas, Carmen A1 - Diaz, Ramon A1 - Lombardía, Luis A1 - Fatima Al-Shahrour A1 - Domínguez, Orlando A1 - Cascon, Alberto A1 - Robledo, Mercedes A1 - Rivas, Carmen A1 - Benitez, Javier AB -

PURPOSE: T-Cell lymphomas constitute heterogeneous and aggressive tumors in which pathogenic alterations remain largely unknown. Expression profiling has demonstrated to be a useful tool for molecular classification of tumors. EXPERIMENTAL DESIGN: Using DNA microarrays (CNIO-OncoChip) containing 6386 cancer-related genes, we established the expression profiling of T-cell lymphomas and compared them to normal lymphocytes and lymph nodes. RESULTS: We found significant differences between the peripheral and lymphoblastic T-cell lymphomas, which include a deregulation of nuclear factor-kappaB signaling pathway. We also identify differentially expressed genes between peripheral T-cell lymphoma tumors and normal T lymphocytes or reactive lymph nodes, which could represent candidate tumor markers of these lymphomas. Additionally, a close relationship between genes associated to survival and those that differentiate among the stages of disease and responses to therapy was found. CONCLUSIONS: Our results reflect the value of gene expression profiling to gain insight about the molecular alterations involved in the pathogenesis of T-cell lymphomas.

VL - 10 UR - http://clincancerres.aacrjournals.org/content/10/15/4971.long ER - TY - JOUR T1 - FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes JF - Bioinformatics Y1 - 2004 A1 - Fatima Al-Shahrour A1 - Diaz-Uriarte, R. A1 - Dopazo, J. KW - *Algorithms Artificial Intelligence Databases KW - babelomics KW - DNA/*methods *Software KW - Genetic Gene Expression Profiling/*methods *Hypermedia Information Storage and Retrieval/*methods *Internet *Phylogeny Sequence Alignment/methods Sequence Analysis AB -

We present a simple but powerful procedure to extract Gene Ontology (GO) terms that are significantly over- or under-represented in sets of genes within the context of a genome-scale experiment (DNA microarray, proteomics, etc.). Said procedure has been implemented as a web application, FatiGO, allowing for easy and interactive querying. FatiGO, which takes the multiple-testing nature of statistical contrast into account, currently includes GO associations for diverse organisms (human, mouse, fly, worm and yeast) and the TrEMBL/Swissprot GOAnnotations@EBI correspondences from the European Bioinformatics Institute.

VL - 20 UR - http://bioinformatics.oxfordjournals.org/content/20/4/578.abstract N1 -

Al-Shahrour, Fatima Diaz-Uriarte, Ramon Dopazo, Joaquin England Bioinformatics (Oxford, England) Bioinformatics. 2004 Mar 1;20(4):578-80. Epub 2004 Jan 22.

ER - TY - CHAP T1 - Gene expression Correlation and Gene Ontology-Based Similarity: An Assessment of Quantitative Relationship T2 - IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology Y1 - 2004 A1 - Wang, H A1 - F. Azuaje A1 - Bodenreider, O A1 - Dopazo, J. JF - IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology ER - TY - JOUR T1 - New challenges in gene expression data analysis and the extended GEPAS JF - Nucleic Acids Res Y1 - 2004 A1 - Herrero, J. A1 - Vaquerizas, J. M. A1 - Fatima Al-Shahrour A1 - L. Conde A1 - A. Mateos A1 - Diaz-Uriarte, J. S. A1 - Dopazo, J. KW - gepas KW - microarray data analysis AB -

Since the first papers published in the late nineties, including, for the first time, a comprehensive analysis of microarray data, the number of questions that have been addressed through this technique have both increased and diversified. Initially, interest focussed on genes coexpressing across sets of experimental conditions, implying, essentially, the use of clustering techniques. Recently, however, interest has focussed more on finding genes differentially expressed among distinct classes of experiments, or correlated to diverse clinical outcomes, as well as in building predictors. In addition to this, the availability of accurate genomic data and the recent implementation of CGH arrays has made mapping expression and genomic data on the chromosomes possible. There is also a clear demand for methods that allow the automatic transfer of biological information to the results of microarray experiments. Different initiatives, such as the Gene Ontology (GO) consortium, pathways databases, protein functional motifs, etc., provide curated annotations for genes. Whereas many resources on the web focus mainly on clustering methods, GEPAS has evolved to cope with the aforementioned new challenges that have recently arisen in the field of microarray data analysis. The web-based pipeline for microarray gene expression data, GEPAS, is available at http://gepas.bioinfo.cnio.es.

VL - 32 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15215434 N1 -

Herrero, Javier Vaquerizas, Juan M Al-Shahrour, Fatima Conde, Lucia Mateos, Alvaro Diaz-Uriarte, Javier Santoyo Ramon Dopazo, Joaquin England Nucleic acids research Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W485-91.

ER - TY - JOUR T1 - PupaSNP Finder: a web tool for finding SNPs with putative effect at transcriptional level JF - Nucleic Acids Res Y1 - 2004 A1 - L. Conde A1 - Vaquerizas, J. M. A1 - J. Santoyo A1 - Fatima Al-Shahrour A1 - Ruiz-Llorente, S. A1 - M. Robledo A1 - Dopazo, J. KW - Amino Acid Substitution Binding Sites Humans Internet Phenotype *Polymorphism KW - Genetic KW - Single Nucleotide RNA Splicing *Software Transcription Factors/metabolism *Transcription AB - We have developed a web tool, PupaSNP Finder (PupaSNP for short), for high-throughput searching for single nucleotide polymorphisms (SNPs) with potential phenotypic effect. PupaSNP takes as its input lists of genes (or generates them from chromosomal coordinates) and retrieves SNPs that could affect the conserved regions that the cellular machinery uses for the correct processing of genes (intron/exon boundaries or exonic splicing enhancers), predicted transcription factor binding sites (TFBS) and changes in amino acids in the proteins. The program uses the mapping of SNPs in the genome provided by Ensembl. Additionally, user-defined SNPs (not yet mapped in the genome) can be easily provided to the program. Also, additional functional information from Gene Ontology, OMIM and homologies in other model organisms is provided. In contrast to other programs already available, which focus only on SNPs with possible effect in the protein, PupaSNP includes SNPs with possible transcriptional effect. PupaSNP will be of significant help in studies of multifactorial disorders, where the use of functional SNPs will increase the sensitivity of identification of the genes responsible for the disease. The PupaSNP web interface is accessible through http://pupasnp.bioinfo.cnio.es. VL - 32 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15215388 N1 - Conde, Lucia Vaquerizas, Juan M Santoyo, Javier Al-Shahrour, Fatima Ruiz-Llorente, Sergio Robledo, Mercedes Dopazo, Joaquin England Nucleic acids research Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W242-8. ER - TY - JOUR T1 - GEPAS: A web-based resource for microarray gene expression data analysis JF - Nucleic Acids Res Y1 - 2003 A1 - Herrero, J. A1 - Fatima Al-Shahrour A1 - Diaz-Uriarte, R. A1 - A. Mateos A1 - Vaquerizas, J. M. A1 - J. Santoyo A1 - Dopazo, J. KW - gepas KW - microarray data analysis AB -

We present a web-based pipeline for microarray gene expression profile analysis, GEPAS, which stands for Gene Expression Profile Analysis Suite (http://gepas.bioinfo.cnio.es). GEPAS is composed of different interconnected modules which include tools for data pre-processing, two-conditions comparison, unsupervised and supervised clustering (which include some of the most popular methods as well as home made algorithms) and several tests for differential gene expression among different classes, continuous variables or survival analysis. A multiple purpose tool for data mining, based on Gene Ontology, is also linked to the tools, which constitutes a very convenient way of analysing clustering results. On-line tutorials are available from our main web server (http://bioinfo.cnio.es).

VL - 31 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=12824345 N1 -

Herrero, Javier Al-Shahrour, Fatima Diaz-Uriarte, Ramon Mateos, Alvaro Vaquerizas, Juan M Santoyo, Javier Dopazo, Joaquin Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2003 Jul 1;31(13):3461-7.

ER - TY - CHAP T1 - Use of GO Terms to Understand the Biological Significance of Microarray Differential Gene Expression Data T2 - Microarray data analysis III Y1 - 2003 A1 - Díaz-Uriarte, R A1 - Fatima Al-Shahrour A1 - Dopazo, J. JF - Microarray data analysis III PB - Kluwer Academic, K. F. Johnson and S. M. Lin ER - TY - CHAP T1 - Using Gene Ontology on genome-scale studies to find significant associations of biologically relevant terms to group of genes T2 - Neural Networks for Signal Processing XIII Y1 - 2003 A1 - Fatima Al-Shahrour A1 - Herrero, J. A1 - A. Mateos A1 - J. Santoyo A1 - Díaz-Uriarte, R A1 - Dopazo, J. KW - babelomics JF - Neural Networks for Signal Processing XIII PB - IEEE Press CY - New York, USA ER - TY - JOUR T1 - Calnexin overexpression increases manganese peroxidase production in Aspergillus niger JF - Appl Environ Microbiol Y1 - 2002 A1 - A. Conesa A1 - Jeenes, D. A1 - Archer, D. B. A1 - van den Hondel, C. A. A1 - Punt, P. J. KW - Aspergillus niger/*enzymology/genetics Calcium-Binding Proteins/*metabolism Calnexin Culture Media *Fungal Proteins HSP70 Heat-Shock Proteins/metabolism Heme/metabolism Peroxidases/*biosynthesis/genetics Phanerochaete/enzymology/genetics Transformation KW - Genetic AB - Heme-containing peroxidases from white rot basidiomycetes, in contrast to most proteins of fungal origin, are poorly produced in industrial filamentous fungal strains. Factors limiting peroxidase production are believed to operate at the posttranslational level. In particular, insufficient availability of the prosthetic group which is required for peroxidase biosynthesis has been proposed to be an important bottleneck. In this work, we analyzed the role of two components of the secretion pathway, the chaperones calnexin and binding protein (BiP), in the production of a fungal peroxidase. Expression of the Phanerochaete chrysosporium manganese peroxidase (MnP) in Aspergillus niger resulted in an increase in the expression level of the clxA and bipA genes. In a heme-supplemented medium, where MnP was shown to be overproduced to higher levels, induction of clxA and bipA was also higher. Overexpression of these two chaperones in an MnP-producing strain was analyzed for its effect on MnP production. Whereas bipA overexpression seriously reduced MnP production, overexpression of calnexin resulted in a four- to fivefold increase in the extracellular MnP levels. However, when additional heme was provided in the culture medium, calnexin overexpression had no synergistic effect on MnP production. The possible function of these two chaperones in MnP maturation and production is discussed. VL - 68 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11823227 N1 - Conesa, Ana Jeenes, David Archer, David B van den Hondel, Cees A M J J Punt, Peter J United States Applied and environmental microbiology Appl Environ Microbiol. 2002 Feb;68(2):846-51. ER - TY - JOUR T1 - Filamentous fungi as cell factories for heterologous protein production JF - Trends Biotechnol Y1 - 2002 A1 - Punt, P. J. A1 - van Biezen, N. A1 - A. Conesa A1 - Albers, A. A1 - Mangnus, J. A1 - van den Hondel, C. KW - Fermentation/genetics/physiology Fungi/*genetics/*metabolism Humans Interleukin-6/analysis/*biosynthesis/genetics Peroxidases/analysis/*biosynthesis/genetics Protein Conformation Recombinant Proteins/analysis/*biosynthesis/genetics AB - Filamentous fungi have been used as sources of metabolites and enzymes for centuries. For about two decades, molecular genetic tools have enabled us to use these organisms to express extra copies of both endogenous and exogenous genes. This review of current practice reveals that molecular tools have enabled several new developments. But it has been process development that has driven the final breakthrough to achieving commercially relevant quantities of protein. Recent research into gene expression in filamentous fungi has explored their wealth of genetic diversity with a view to exploiting them as expression hosts and as a source of new genes. Inevitably, the progress in the ’genomics’ technology will further develop high-throughput technologies for these organisms. VL - 20 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11943375 N1 - Punt, Peter J van Biezen, Nick Conesa, Ana Albers, Alwin Mangnus, Jeroen van den Hondel, Cees Review England Trends in biotechnology Trends Biotechnol. 2002 May;20(5):200-6. ER - TY - JOUR T1 - Identification of genes involved in resistance to interferon-alpha in cutaneous T-cell lymphoma JF - Am J Pathol Y1 - 2002 A1 - Tracey, L. A1 - Villuendas, R. A1 - Ortiz, P. A1 - Dopazo, A. A1 - Spiteri, I. A1 - Lombardia, L. A1 - Rodriguez-Peralto, J. L. A1 - Fernandez-Herrera, J. A1 - Hernandez, A. A1 - Fraga, J. A1 - Dominguez, O. A1 - Herrero, J. A1 - Alonso, M. A. A1 - Dopazo, J. A1 - Piris, M. A. KW - Antineoplastic Agents/*pharmacology/therapeutic use Carrier Proteins/biosynthesis/genetics DNA-Binding Proteins/biosynthesis/genetics Drug Resistance KW - Biological Oligonucleotide Array Sequence Analysis RNA KW - Cultured KW - Cutaneous/diagnosis/drug therapy/*genetics/metabolism *Membrane Glycoproteins Models KW - Interleukin-1 Reproducibility of Results STAT1 Transcription Factor STAT3 Transcription Factor Trans-Activators/biosynthesis/genetics Tumor Cells KW - Neoplasm Gene Expression Profiling *Gene Expression Regulation KW - Neoplasm/biosynthesis *Receptors KW - Neoplastic Humans Interferon-alpha/*pharmacology/therapeutic use Kinetics Lymphoma KW - T-Cell AB - Interferon-alpha therapy has been shown to be active in the treatment of mycosis fungoides although the individual response to this therapy is unpredictable and dependent on essentially unknown factors. In an effort to better understand the molecular mechanisms of interferon-alpha resistance we have developed an interferon-alpha resistant variant from a sensitive cutaneous T-cell lymphoma cell line. We have performed expression analysis to detect genes differentially expressed between both variants using a cDNA microarray including 6386 cancer-implicated genes. The experiments showed that resistance to interferon-alpha is consistently associated with changes in the expression of a set of 39 genes, involved in signal transduction, apoptosis, transcription regulation, and cell growth. Additional studies performed confirm that STAT1 and STAT3 expression and interferon-alpha induction and activation are not altered between both variants. The gene MAL, highly overexpressed by resistant cells, was also found to be expressed by tumoral cells in a series of cutaneous T-cell lymphoma patients treated with interferon-alpha and/or photochemotherapy. MAL expression was associated with longer time to complete remission. Time-course experiments of the sensitive and resistant cells showed a differential expression of a subset of genes involved in interferon-response (1 to 4 hours), cell growth and apoptosis (24 to 48 hours.), and signal transduction. VL - 161 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=12414529 N1 - Tracey, Lorraine Villuendas, Raquel Ortiz, Pablo Dopazo, Ana Spiteri, Inmaculada Lombardia, Luis Rodriguez-Peralto, Jose L Fernandez-Herrera, Jesus Hernandez, Almudena Fraga, Javier Dominguez, Orlando Herrero, Javier Alonso, Miguel A Dopazo, Joaquin Piris, Miguel A Research Support, Non-U.S. Gov’t United States The American journal of pathology Am J Pathol. 2002 Nov;161(5):1825-37. ER - TY - JOUR T1 - Classification of protein disulphide-bridge topologies JF - J Comput Aided Mol Des Y1 - 2001 A1 - Mas, J. M. A1 - Aloy, P. A1 - M. A. Marti-Renom A1 - Oliva, B. A1 - de Llorens, R. A1 - Aviles, F. X. A1 - Querol, E. KW - Algorithms Computer Simulation Databases as Topic Disulfides/*chemistry Models KW - Molecular Protein Structure KW - Secondary Protein Structure KW - Tertiary Proteins/*chemistry/*classification Software AB - The preferential occurrence of certain disulphide-bridge topologies in proteins has prompted us to design a method and a program, KNOT-MATCH, for their classification. The program has been applied to a database of proteins with less than 65% homology and more than two disulphide bridges. We have investigated whether there are topological preferences that can be used to group proteins and if these can be applied to gain insight into the structural or functional relationships among them. The classification has been performed by Density Search and Hierarchical Clustering Techniques, yielding thirteen main protein classes from the superimposition and clustering process. It is noteworthy that besides the disulphide bridges, regular secondary structures and loops frequently become correctly aligned. Although the lack of significant sequence similarity among some clustered proteins precludes the easy establishment of evolutionary relationships, the program permits us to find out important structural or functional residues upon the superimposition of two protein structures apparently unrelated. The derived classification can be very useful for finding relationships among proteins which would escape detection by current sequence or topology-based analytical algorithms. VL - 15 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11394740 N1 - Mas, J M Aloy, P Marti-Renom, M A Oliva, B de Llorens, R Aviles, F X Querol, E Comparative Study Research Support, Non-U.S. Gov’t Netherlands Journal of computer-aided molecular design J Comput Aided Mol Des. 2001 May;15(5):477-87. ER - TY - JOUR T1 - Methods and approaches in the analysis of gene expression data JF - J Immunol Methods Y1 - 2001 A1 - Dopazo, J. A1 - Zanders, E. A1 - Dragoni, I. A1 - Amphlett, G. A1 - Falciani, F. AB -

The application of high-density DNA array technology to monitor gene transcription has been responsible for a real paradigm shift in biology. The majority of research groups now have the ability to measure the expression of a significant proportion of the human genome in a single experiment, resulting in an unprecedented volume of data being made available to the scientific community. As a consequence of this, the storage, analysis and interpretation of this information present a major challenge. In the field of immunology the analysis of gene expression profiles has opened new areas of investigation. The study of cellular responses has revealed that cells respond to an activation signal with waves of co-ordinated gene expression profiles and that the components of these responses are the key to understanding the specific mechanisms which lead to phenotypic differentiation. The discovery of ’cell type specific’ gene expression signatures have also helped the interpretation of the mechanisms leading to disease progression. Here we review the principles behind the most commonly used data analysis methods and discuss the approaches that have been employed in immunological research.

VL - 250 UR - http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11251224 N1 -

Dopazo, J Zanders, E Dragoni, I Amphlett, G Falciani, F Comparative Study Review Netherlands Journal of immunological methods J Immunol Methods. 2001 Apr;250(1-2):93-112.

ER -