TY - JOUR T1 - Novel genes and sex differences in COVID-19 severity. JF - Hum Mol Genet Y1 - 2022 A1 - Cruz, Raquel A1 - Almeida, Silvia Diz-de A1 - Heredia, Miguel López A1 - Quintela, Inés A1 - Ceballos, Francisco C A1 - Pita, Guillermo A1 - Lorenzo-Salazar, José M A1 - González-Montelongo, Rafaela A1 - Gago-Domínguez, Manuela A1 - Porras, Marta Sevilla A1 - Castaño, Jair Antonio Tenorio A1 - Nevado, Julián A1 - Aguado, Jose María A1 - Aguilar, Carlos A1 - Aguilera-Albesa, Sergio A1 - Almadana, Virginia A1 - Almoguera, Berta A1 - Alvarez, Nuria A1 - Andreu-Bernabeu, Álvaro A1 - Arana-Arri, Eunate A1 - Arango, Celso A1 - Arranz, María J A1 - Artiga, Maria-Jesus A1 - Baptista-Rosas, Raúl C A1 - Barreda-Sánchez, María A1 - Belhassen-Garcia, Moncef A1 - Bezerra, Joao F A1 - Bezerra, Marcos A C A1 - Boix-Palop, Lucía A1 - Brión, Maria A1 - Brugada, Ramón A1 - Bustos, Matilde A1 - Calderón, Enrique J A1 - Carbonell, Cristina A1 - Castano, Luis A1 - Castelao, Jose E A1 - Conde-Vicente, Rosa A1 - Cordero-Lorenzana, M Lourdes A1 - Cortes-Sanchez, Jose L A1 - Corton, Marta A1 - Darnaude, M Teresa A1 - De Martino-Rodríguez, Alba A1 - Campo-Pérez, Victor A1 - Bustamante, Aranzazu Diaz A1 - Domínguez-Garrido, Elena A1 - Luchessi, André D A1 - Eirós, Rocío A1 - Sanabria, Gladys Mercedes Estigarribia A1 - Fariñas, María Carmen A1 - Fernández-Robelo, Uxía A1 - Fernández-Rodríguez, Amanda A1 - Fernández-Villa, Tania A1 - Gil-Fournier, Belén A1 - Gómez-Arrue, Javier A1 - Álvarez, Beatriz González A1 - Quirós, Fernan Gonzalez Bernaldo A1 - González-Peñas, Javier A1 - Gutiérrez-Bautista, Juan F A1 - Herrero, María José A1 - Herrero-Gonzalez, Antonio A1 - Jimenez-Sousa, María A A1 - Lattig, María Claudia A1 - Borja, Anabel Liger A1 - Lopez-Rodriguez, Rosario A1 - Mancebo, Esther A1 - Martín-López, Caridad A1 - Martín, Vicente A1 - Martinez-Nieto, Oscar A1 - Martinez-Lopez, Iciar A1 - Martinez-Resendez, Michel F A1 - Martinez-Perez, Ángel A1 - Mazzeu, Juliana A A1 - Macías, Eleuterio 
Merayo A1 - Minguez, Pablo A1 - Cuerda, Victor Moreno A1 - Silbiger, Vivian N A1 - Oliveira, Silviene F A1 - Ortega-Paino, Eva A1 - Parellada, Mara A1 - Paz-Artal, Estela A1 - Santos, Ney P C A1 - Pérez-Matute, Patricia A1 - Perez, Patricia A1 - Pérez-Tomás, M Elena A1 - Perucho, Teresa A1 - Pinsach-Abuin, Mel Lina A1 - Pompa-Mera, Ericka N A1 - Porras-Hurtado, Gloria L A1 - Pujol, Aurora A1 - León, Soraya Ramiro A1 - Resino, Salvador A1 - Fernandes, Marianne R A1 - Rodríguez-Ruiz, Emilio A1 - Rodriguez-Artalejo, Fernando A1 - Rodriguez-Garcia, José A A1 - Ruiz-Cabello, Francisco A1 - Ruiz-Hornillos, Javier A1 - Ryan, Pablo A1 - Soria, José Manuel A1 - Souto, Juan Carlos A1 - Tamayo, Eduardo A1 - Tamayo-Velasco, Alvaro A1 - Taracido-Fernandez, Juan Carlos A1 - Teper, Alejandro A1 - Torres-Tobar, Lilian A1 - Urioste, Miguel A1 - Valencia-Ramos, Juan A1 - Yáñez, Zuleima A1 - Zarate, Ruth A1 - Nakanishi, Tomoko A1 - Pigazzini, Sara A1 - Degenhardt, Frauke A1 - Butler-Laporte, Guillaume A1 - Maya-Miles, Douglas A1 - Bujanda, Luis A1 - Bouysran, Youssef A1 - Palom, Adriana A1 - Ellinghaus, David A1 - Martínez-Bueno, Manuel A1 - Rolker, Selina A1 - Amitrano, Sara A1 - Roade, Luisa A1 - Fava, Francesca A1 - Spinner, Christoph D A1 - Prati, Daniele A1 - Bernardo, David A1 - García, Federico A1 - Darcis, Gilles A1 - Fernández-Cadenas, Israel A1 - Holter, Jan Cato A1 - Banales, Jesus M A1 - Frithiof, Robert A1 - Duga, Stefano A1 - Asselta, Rosanna A1 - Pereira, Alexandre C A1 - Romero-Gómez, Manuel A1 - Nafría-Jiménez, Beatriz A1 - Hov, Johannes R A1 - Migeotte, Isabelle A1 - Renieri, Alessandra A1 - Planas, Anna M A1 - Ludwig, Kerstin U A1 - Buti, Maria A1 - Rahmouni, Souad A1 - Alarcón-Riquelme, Marta E A1 - Schulte, Eva C A1 - Franke, Andre A1 - Karlsen, Tom H A1 - Valenti, Luca A1 - Zeberg, Hugo A1 - Richards, Brent A1 - Ganna, Andrea A1 - Boada, Mercè A1 - Rojas, Itziar A1 - Ruiz, Agustín A1 - Sánchez, Pascual A1 - Real, Luis Miguel A1 - Guillén-Navarro, Encarna A1 - 
Ayuso, Carmen A1 - González-Neira, Anna A1 - Riancho, José A A1 - Rojas-Martinez, Augusto A1 - Flores, Carlos A1 - Lapunzina, Pablo A1 - Carracedo, Ángel AB -

Here we describe the results of a genome-wide study conducted in 11 939 COVID-19 positive cases with extensive clinical information who were recruited from 34 hospitals across Spain (SCOURGE consortium). In sex-disaggregated genome-wide association studies for COVID-19 hospitalization, genome-wide significance (p < 5×10⁻⁸) was crossed for variants in 3p21.31 and 21q22.11 loci only among males (p = 1.3×10⁻²² and p = 8.1×10⁻¹², respectively), and for variants in 9q21.32 near TLE1 only among females (p = 4.4×10⁻⁸). In a second phase, results were combined with an independent Spanish cohort (1598 COVID-19 cases and 1068 population controls), revealing in the overall analysis two novel risk loci in 9p13.3 and 19q13.12, with fine-mapping prioritized variants functionally associated with AQP3 (p = 2.7×10⁻⁸) and ARHGAP33 (p = 1.3×10⁻⁸), respectively. The meta-analysis of both phases with four European studies stratified by sex from the Host Genetics Initiative confirmed the association of the 3p21.31 and 21q22.11 loci predominantly in males and replicated a recently reported variant in 11p13 (ELF5, p = 4.1×10⁻⁸). Six of the COVID-19 HGI discovered loci were replicated and an HGI-based genetic risk score predicted the severity strata in SCOURGE. We also found more SNP-heritability and larger heritability differences by age (<60 or ≥ 60 years) among males than among females. Parallel genome-wide screening of inbreeding depression in SCOURGE also showed an effect of homozygosity in COVID-19 hospitalization and severity and this effect was stronger among older males. In summary, new candidate genes for COVID-19 severity and evidence supporting genetic disparities among sexes are provided.

ER - TY - JOUR T1 - CSVS, a crowdsourcing database of the Spanish population genetic variability. JF - Nucleic Acids Res Y1 - 2021 A1 - Peña-Chilet, Maria A1 - Roldán, Gema A1 - Perez-Florido, Javier A1 - Ortuno, Francisco M A1 - Carmona, Rosario A1 - Aquino, Virginia A1 - López-López, Daniel A1 - Loucera, Carlos A1 - Fernandez-Rueda, Jose L A1 - Gallego, Asunción A1 - Garcia-Garcia, Francisco A1 - González-Neira, Anna A1 - Pita, Guillermo A1 - Núñez-Torres, Rocío A1 - Santoyo-López, Javier A1 - Ayuso, Carmen A1 - Minguez, Pablo A1 - Avila-Fernandez, Almudena A1 - Corton, Marta A1 - Moreno-Pelayo, Miguel Ángel A1 - Morin, Matías A1 - Gallego-Martinez, Alvaro A1 - Lopez-Escamez, Jose A A1 - Borrego, Salud A1 - Antiñolo, Guillermo A1 - Amigo, Jorge A1 - Salgado-Garrido, Josefa A1 - Pasalodos-Sanchez, Sara A1 - Morte, Beatriz A1 - Carracedo, Ángel A1 - Alonso, Ángel A1 - Dopazo, Joaquin KW - Alleles KW - Chromosome Mapping KW - Crowdsourcing KW - Databases, Genetic KW - Exome KW - Gene Frequency KW - Genetic Variation KW - Genetics, Population KW - Genome, Human KW - Genomics KW - Humans KW - Internet KW - Precision Medicine KW - Software KW - Spain AB -

Knowledge of the genetic variability of the local population is of utmost importance in personalized medicine and has been revealed as a critical factor for the discovery of new disease variants. Here, we present the Collaborative Spanish Variability Server (CSVS), which currently contains more than 2000 genomes and exomes of unrelated Spanish individuals. This database has been generated in a collaborative crowdsourcing effort collecting sequencing data produced by local genomic projects and for other purposes. Sequences have been grouped by ICD10 upper categories. A web interface allows querying the database while removing one or more ICD10 categories. In this way, aggregated counts of allele frequencies of the pseudo-control Spanish population can be obtained for diseases belonging to the category removed. Interestingly, in addition to pseudo-control studies, some population studies can be made, for example on the prevalence of pharmacogenomic variants. In addition, these genomic data have been used to define the first Spanish Genome Reference Panel (SGRP1.0) for imputation. This is the first local repository of variability entirely produced by a crowdsourcing effort and constitutes an example for future initiatives to characterize local variability worldwide. CSVS is also part of the GA4GH Beacon network. CSVS can be accessed at: http://csvs.babelomics.org/.

VL - 49 IS - D1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/32990755?dopt=Abstract ER - TY - JOUR T1 - PTMcode v2: a resource for functional associations of post-translational modifications within and between proteins. JF - Nucleic Acids Res Y1 - 2015 A1 - Minguez, Pablo A1 - Letunic, Ivica A1 - Parca, Luca A1 - García-Alonso, Luz A1 - Dopazo, Joaquin A1 - Huerta-Cepas, Jaime A1 - Bork, Peer KW - Databases, Protein KW - Internet KW - Protein Interaction Mapping KW - Protein Processing, Post-Translational AB -

The post-translational regulation of proteins is mainly driven by two molecular events: their modification by several types of moieties and their interaction with other proteins. These two processes are interdependent and together are responsible for the function of the protein in a particular cell state. Several databases focus on the prediction and compilation of protein-protein interactions (PPIs) and no fewer on the collection and analysis of protein post-translational modifications (PTMs); however, there are no resources that concentrate on describing the regulatory role of PTMs in PPIs. We developed several methods based on residue co-evolution and proximity to predict the functional associations of pairs of PTMs, which we apply to modifications in the same protein and between two interacting proteins. In order to make data available for understudied organisms, PTMcode v2 (http://ptmcode.embl.de) includes a new strategy to propagate PTMs from validated modified sites through orthologous proteins. The second release of PTMcode covers 19 eukaryotic species from which we collected more than 300,000 experimentally verified PTMs (>1,300,000 propagated) of 69 types extracting the post-translational regulation of >100,000 proteins and >100,000 interactions. In total, we report 8 million associations of PTMs regulating single proteins and over 9.4 million interplays tuning PPIs.

VL - 43 IS - Database issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/25361965?dopt=Abstract ER - TY - JOUR T1 - Understanding disease mechanisms with models of signaling pathway activities. JF - BMC systems biology Y1 - 2014 A1 - Sebastián-Leon, Patricia A1 - Vidal, Enrique A1 - Minguez, Pablo A1 - Conesa, Ana A1 - Tarazona, Sonia A1 - Amadoz, Alicia A1 - Armero, Carmen A1 - Salavert, Francisco A1 - Vidal-Puig, Antonio A1 - Montaner, David A1 - Dopazo, Joaquín KW - Disease mechanism KW - pathway KW - signalling KW - Systems biology AB - Background: Understanding the aspects of the cell functionality that account for disease or drug action mechanisms is one of the main challenges in the analysis of genomic data and is the basis of the future implementation of precision medicine. Results: Here we propose a simple probabilistic model in which signaling pathways are separated into elementary sub-pathways or signal transmission circuits (which ultimately trigger cell functions) and gene expression measurements are then transformed into probabilities of activation of such signal transmission circuits. Using this model, differential activation of such circuits between biological conditions can be estimated. Thus, circuit activation statuses can be interpreted as biomarkers that discriminate among the compared conditions. This type of mechanism-based biomarker accounts for cell functional activities and can easily be associated with disease or drug action mechanisms. 
The accuracy of the proposed model is demonstrated with simulations and real datasets. Conclusions: The proposed model provides detailed information that enables the interpretation of disease mechanisms as a consequence of the complex combinations of altered gene expression values. Moreover, it offers a framework for suggesting possible ways of therapeutic intervention in a pathologically perturbed system. VL - 8 UR - http://www.biomedcentral.com/1752-0509/8/121/abstract ER - TY - JOUR T1 - Discovering the hidden sub-network component in a ranked list of genes or proteins derived from genomic experiments. JF - Nucleic Acids Res Y1 - 2012 A1 - García-Alonso, Luz A1 - Alonso, Roberto A1 - Vidal, Enrique A1 - Amadoz, Alicia A1 - De Maria, Alejandro A1 - Minguez, Pablo A1 - Medina, Ignacio A1 - Dopazo, Joaquin KW - Bipolar Disorder KW - Fanconi Anemia KW - Gene Regulatory Networks KW - Genes, Neoplasm KW - Genome-Wide Association Study KW - Genomics KW - Humans KW - Protein Interaction Mapping AB -

Genomic experiments (e.g. differential gene expression, single-nucleotide polymorphism association) typically produce ranked lists of genes. We present a simple but powerful approach which uses protein-protein interaction data to detect sub-networks within such ranked lists of genes or proteins. We performed an exhaustive study of network parameters that allowed us to conclude that the average number of components and the average number of nodes per component are the parameters that best discriminate between real and random networks. A novel aspect that increases the efficiency of this strategy in finding sub-networks is that, in addition to direct connections, connections mediated by intermediate nodes are also considered to build up the sub-networks. The possibility of using such intermediate nodes makes this approach more robust to noise. It also overcomes some limitations intrinsic to experimental designs based on differential expression, in which some nodes are invariant across conditions. The proposed approach can also be used for candidate disease-gene prioritization. Here, we demonstrate the usefulness of the approach by means of several case examples that include a differential expression analysis in Fanconi Anemia, a genome-wide association study of bipolar disorder and a genome-scale study of essentiality in cancer genes. An efficient and easy-to-use web interface (available at http://www.babelomics.org) based on HTML5 technologies is also provided to run the algorithm and represent the network.

VL - 40 IS - 20 U1 - https://www.ncbi.nlm.nih.gov/pubmed/22844098?dopt=Abstract ER - TY - JOUR T1 - Assessing the biological significance of gene expression signatures and co-expression modules by studying their network properties. JF - PloS one Y1 - 2011 A1 - Minguez, Pablo A1 - Dopazo, Joaquin AB -

Microarray experiments have been extensively used to define signatures, which are sets of genes that can be considered markers of experimental conditions (typically diseases). Paradoxically, in spite of the apparent functional role that might be attributed to such gene sets, signatures do not seem to be reproducible across experiments. Given the close relationship between function and protein interaction, network properties can be used to study to what extent signatures are composed of genes whose resulting proteins show a considerable level of interaction (and consequently a putative common functional role). We have analysed 618 signatures and 507 modules of co-expression in cancer looking for significant values of four main protein-protein interaction (PPI) network parameters: connection degree, cluster coefficient, betweenness and number of components. A total of 3904 gene ontology (GO) modules, 146 KEGG pathways, and 263 Biocarta pathways have been used as functional modules of reference. Co-expression modules found in microarray experiments display a high level of connectivity, similar to the one shown by conventional modules based on functional definitions (GO, KEGG and Biocarta). A general observation for all the classes studied is that the networks formed by the modules improve their topological parameters when an external protein is allowed to be introduced within the paths (up to 70% of GO modules show network parameters beyond the random expectation). This fact suggests that functional definitions are incomplete and some genes might still be missing. Conversely, signatures are clearly not capturing the altered functions in the corresponding studies. This is probably because the way in which the genes have been selected in the signatures is too conservative. 
These results suggest that gene selection methods which take into account relationships among genes should be superior to methods that assume independence among genes outside their functional contexts.

VL - 6 UR - http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0017474 ER - TY - JOUR T1 - Functional genomics and networks: new approaches in the extraction of complex gene modules. JF - Expert Rev Proteomics Y1 - 2010 A1 - Minguez, Pablo A1 - Dopazo, Joaquin KW - Gene Expression Regulation KW - Gene Regulatory Networks KW - Genomics KW - Protein Binding KW - Proteins KW - Systems biology AB -

The engine that makes the cell work is made of an intricate network of molecular interactions. Nowadays, the elements and relationships of this complex network can be studied with several types of high-throughput techniques. The dream of having a global picture of the cell from different perspectives that can jointly explain cell behavior is, at least technically, feasible. However, this task can only be accomplished by filling the gap between data and information. The availability of methods capable of accurately managing, integrating and analyzing the results from these experiments is crucial for this purpose. Here, we review the new challenges raised by the availability of different genomic data, as well as the new proposals presented to cope with the increasing data complexity. Special emphasis is given to approaches that explore the transcriptome trying to describe the modules of genes that account for the traits studied.

VL - 7 IS - 1 U1 - https://www.ncbi.nlm.nih.gov/pubmed/20121476?dopt=Abstract ER - TY - JOUR T1 - Selection upon Genome Architecture: Conservation of Functional Neighborhoods with Changing Genes JF - PLoS Comput. Biol. Y1 - 2010 A1 - Al-Shahrour, Fátima A1 - Minguez, Pablo A1 - Marqués-Bonet, Tomás A1 - Gazave, Elodie A1 - Navarro, Arcadi A1 - Dopazo, Joaquin VL - 6 UR - http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1000953 ER - TY - JOUR T1 - Gene set internal coherence in the context of functional profiling. JF - BMC Genomics Y1 - 2009 A1 - Montaner, David A1 - Minguez, Pablo A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin KW - Algorithms KW - Breast Neoplasms KW - Carcinoma, Intraductal, Noninfiltrating KW - Computational Biology KW - Databases, Nucleic Acid KW - Female KW - Gene Expression Profiling KW - Genomics KW - Humans KW - Oligonucleotide Array Sequence Analysis KW - Papillomavirus Infections KW - Reproducibility of Results AB -

BACKGROUND: Functional profiling methods have been extensively used in the context of high-throughput experiments and, in particular, in microarray data analysis. Such methods use available biological information to define different types of functional gene modules (e.g. gene ontology -GO-, KEGG pathways, etc.) whose representation in a pre-defined list of genes is further studied. In the most popular type of microarray experimental designs (e.g. up- or down-regulated genes, clusters of co-expressing genes, etc.) or in other genomic experiments (e.g. Chip-on-chip, epigenomics, etc.) these lists are composed of genes with a high degree of co-expression. Therefore, an implicit assumption in the application of functional profiling methods within this context is that the genes corresponding to the modules tested are effectively defining sets of co-expressing genes. Nevertheless, not all the functional modules are biologically coherent entities in terms of co-expression, which will eventually hinder their detection with conventional methods of functional enrichment.

RESULTS: Using a large collection of microarray data we have carried out a detailed survey of internal correlation in GO terms and KEGG pathways, providing a coherence index to be used for measuring functional module co-regulation. An unexpectedly low level of internal correlation was found among the modules studied. Only around 30% of the modules defined by GO terms and 57% of the modules defined by KEGG pathways display an internal correlation higher than expected by chance. This information on the internal correlation of the genes within the functional modules can be used in the context of a logistic regression model in a simple way to improve their detection in gene expression experiments.

CONCLUSION: For the first time, an exhaustive study on the internal co-expression of the most popular functional categories has been carried out. Interestingly, the real level of co-expression within many of them is lower than expected (or even nonexistent), which will preclude their detection by means of most conventional functional profiling methods. If the gene-to-function correlation information is used in functional profiling methods, the results improve on those obtained by conventional enrichment methods.

VL - 10 U1 - https://www.ncbi.nlm.nih.gov/pubmed/19397819?dopt=Abstract ER - TY - JOUR T1 - SNOW, a web-based tool for the statistical analysis of protein-protein interaction networks. JF - Nucleic Acids Res Y1 - 2009 A1 - Minguez, Pablo A1 - Götz, Stefan A1 - Montaner, David A1 - Al-Shahrour, Fátima A1 - Dopazo, Joaquin KW - Computer Graphics KW - Data Interpretation, Statistical KW - Databases, Protein KW - Humans KW - Internet KW - Protein Interaction Mapping KW - Software AB -

Understanding the structure and the dynamics of the complex intercellular network of interactions that contributes to the structure and function of a living cell is one of the main challenges of today's biology. SNOW inputs a collection of protein (or gene) identifiers and, by using the interactome as scaffold, draws the connections among them, calculates several relevant network parameters and, as a novelty among the rest of tools of its class, it estimates their statistical significance. The parameters calculated for each node are: connectivity, betweenness and clustering coefficient. It also calculates the number of components, number of bicomponents and articulation points. An interactive network viewer is also available to explore the resulting network. SNOW is available at http://snow.bioinfo.cipf.es.

VL - 37 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/19454602?dopt=Abstract ER - TY - JOUR T1 - SNOW, a web-based tool for the statistical analysis of protein-protein interaction networks JF - Nucl. Acids Res. Y1 - 2009 A1 - Minguez, Pablo A1 - Gotz, S. A1 - Montaner, David A1 - Al-Shahrour, Fatima A1 - Dopazo, Joaquin KW - interactome KW - network KW - snow AB -

Understanding the structure and the dynamics of the complex intercellular network of interactions that contributes to the structure and function of a living cell is one of the main challenges of today’s biology. SNOW inputs a collection of protein (or gene) identifiers and, by using the interactome as scaffold, draws the connections among them, calculates several relevant network parameters and, as a novelty among the rest of tools of its class, it estimates their statistical significance. The parameters calculated for each node are: connectivity, betweenness and clustering coefficient. It also calculates the number of components, number of bicomponents and articulation points. An interactive network viewer is also available to explore the resulting network. SNOW is available at http://snow.bioinfo.cipf.es.

VL - 37 UR - http://nar.oxfordjournals.org/content/early/2009/05/19/nar.gkp402.full ER - TY - JOUR T1 - GEPAS, a web-based tool for microarray data analysis and interpretation. JF - Nucleic Acids Res Y1 - 2008 A1 - Tárraga, Joaquín A1 - Medina, Ignacio A1 - Carbonell, José A1 - Huerta-Cepas, Jaime A1 - Minguez, Pablo A1 - Alloza, Eva A1 - Al-Shahrour, Fátima A1 - Vegas-Azcárate, Susana A1 - Goetz, Stefan A1 - Escobar, Pablo A1 - Garcia-Garcia, Francisco A1 - Conesa, Ana A1 - Montaner, David A1 - Dopazo, Joaquin KW - Computer Graphics KW - Dose-Response Relationship, Drug KW - Gene Expression Profiling KW - Internet KW - Kinetics KW - Oligonucleotide Array Sequence Analysis KW - Software AB -

Gene Expression Profile Analysis Suite (GEPAS) is one of the most complete and extensively used web-based packages for microarray data analysis. During its more than 5 years of activity it has continuously been updated to keep pace with the state-of-the-art in the changing microarray data analysis arena. GEPAS offers diverse analysis options that include well established as well as novel algorithms for normalization, gene selection, class prediction, clustering and functional profiling of the experiment. New options for time-course (or dose-response) experiments, microarray-based class prediction, new clustering methods and new tests for differential expression have been included. The new pipeliner module allows automating the execution of sequential analysis steps by means of a simple but powerful graphic interface. An extensive re-engineering of GEPAS has been carried out which includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. GEPAS is nowadays the most quoted web tool in its field and is extensively used by researchers in many countries; its records indicate an average usage rate of 500 experiments per day. GEPAS is available at http://www.gepas.org.

VL - 36 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/18508806?dopt=Abstract ER - TY - JOUR T1 - FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments. JF - Nucleic Acids Res Y1 - 2007 A1 - Al-Shahrour, Fátima A1 - Minguez, Pablo A1 - Tárraga, Joaquín A1 - Medina, Ignacio A1 - Alloza, Eva A1 - Montaner, David A1 - Dopazo, Joaquin KW - Amino Acid Motifs KW - Animals KW - Binding Sites KW - Computational Biology KW - Gene Expression Profiling KW - Genes KW - Genomics KW - Humans KW - Internet KW - Oligonucleotide Array Sequence Analysis KW - Programming Languages KW - Software KW - Systems Integration KW - Transcription Factors AB -

The ultimate goal of any genome-scale experiment is to provide a functional interpretation of the data, relating the available information with the hypotheses that originated the experiment. Thus, functional profiling methods have become essential in diverse scenarios such as microarray experiments, proteomics, etc. We present FatiGO+, a web-based tool for the functional profiling of genome-scale experiments, especially oriented to the interpretation of microarray experiments. In addition to different functional annotations (gene ontology, KEGG pathways, Interpro motifs, Swissprot keywords and text-mining based bioentities related to diseases and chemical compounds) FatiGO+ includes, as a novelty, regulatory and structural information. The regulatory information used includes predictions of targets for distinct regulatory elements (obtained from the Transfac and CisRed databases). Additionally, FatiGO+ uses predictions of target motifs of miRNA to infer which of these can be activated or deactivated in the sample of genes studied. Finally, properties of gene products related to their relative location and connections in the interactome have also been used. Also, enrichment of any of these functional terms can be directly analysed on chromosomal coordinates. FatiGO+ can be found at: http://www.fatigoplus.org and within the Babelomics environment http://www.babelomics.org.

VL - 35 IS - Web Server issue U1 - https://www.ncbi.nlm.nih.gov/pubmed/17478504?dopt=Abstract ER - TY - JOUR T1 - From genes to functional classes in the study of biological systems. JF - BMC Bioinformatics Y1 - 2007 A1 - Al-Shahrour, Fátima A1 - Arbiza, Leonardo A1 - Dopazo, Hernán A1 - Huerta-Cepas, Jaime A1 - Minguez, Pablo A1 - Montaner, David A1 - Dopazo, Joaquin KW - Algorithms KW - Chromosome Mapping KW - Computer Simulation KW - Gene Expression Profiling KW - Models, Biological KW - Multigene Family KW - Signal Transduction KW - Software KW - Systems biology KW - User-Computer Interface AB -

BACKGROUND: With the popularization of high-throughput techniques, the need for procedures that help in the biological interpretation of results has increased enormously. Recently, new procedures inspired by systems biology criteria have started to be developed.

RESULTS: Here we present FatiScan, a web-based program which implements a threshold-independent test for the functional interpretation of large-scale experiments that does not depend on the pre-selection of genes based on the multiple application of independent tests to each gene. The test implemented aims to directly test the behaviour of blocks of functionally related genes, instead of focusing on single genes. In addition, the test does not depend on the type of the data used for obtaining significance values, and consequently different types of biologically informative terms (gene ontology, pathways, functional motifs, transcription factor binding sites or regulatory sites from CisRed) can be applied to different classes of genome-scale studies. We exemplify its application in microarray gene expression, evolution and interactomics.

CONCLUSION: Methods for gene set enrichment which, in addition, are independent from the original data and experimental design constitute a promising alternative for the functional profiling of genome-scale experiments. A web server that performs the test described and other similar ones can be found at: http://www.babelomics.org.

VL - 8 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17407596?dopt=Abstract ER - TY - JOUR T1 - Functional profiling of microarray experiments using text-mining derived bioentities. JF - Bioinformatics Y1 - 2007 A1 - Minguez, Pablo A1 - Al-Shahrour, Fátima A1 - Montaner, David A1 - Dopazo, Joaquin KW - Artificial Intelligence KW - Databases, Protein KW - Gene Expression Profiling KW - Information Storage and Retrieval KW - Natural Language Processing KW - Proteins KW - Research Design KW - Systems Integration AB -

MOTIVATION: The increasing use of microarray technologies brought about a parallel demand for methods for the functional interpretation of the results. Beyond the conventional functional annotations for genes, such as gene ontology, pathways, etc., other sources of information are still to be exploited. Text-mining methods allow extracting informative terms (bioentities) with different functional, chemical, clinical, etc. meanings that can be associated with genes. We show how to use these associations within an appropriate statistical framework and how to apply them through easy-to-use, web-based environments to the functional interpretation of microarray experiments. Functional enrichment and gene set enrichment tests using bioentities are presented.

VL - 23 IS - 22 U1 - https://www.ncbi.nlm.nih.gov/pubmed/17855415?dopt=Abstract ER -