SGD Paper Help



Middendorf M, et al. (2004) Predicting genetic regulatory response using classification. Bioinformatics 20 Suppl 1:i232-40

Abstract:

MOTIVATION: Studying gene regulatory mechanisms in simple model organisms through analysis of high-throughput genomic data has emerged as a central problem in computational biology. Most approaches in the literature have focused either on finding a few strong regulatory patterns or on learning descriptive models from training data. However, these approaches are not yet adequate for making accurate predictions about which genes will be up- or down-regulated in new or held-out experiments. By introducing a predictive methodology for this problem, we can use powerful tools from machine learning and assess the statistical significance of our predictions.

RESULTS: We present a novel classification-based method for learning to predict gene regulatory response. Our approach is motivated by the hypothesis that in simple organisms such as Saccharomyces cerevisiae, we can learn a decision rule for predicting whether a gene is up- or down-regulated in a particular experiment based on (1) the presence of binding site subsequences ('motifs') in the gene's regulatory region and (2) the expression levels of regulators such as transcription factors in the experiment ('parents'). Thus, our learning task integrates two qualitatively different data sources: genome-wide cDNA microarray data across multiple perturbation and mutant experiments along with motif profile data from regulatory sequences. We convert the regression task of predicting real-valued gene expression measurements to a classification task of predicting +1 and -1 labels, corresponding to up- and down-regulation beyond the levels of biological and measurement noise in microarray measurements. The learning algorithm employed is boosting with a margin-based generalization of decision trees, alternating decision trees. This large-margin classifier is sufficiently flexible to allow complex logical functions, yet sufficiently simple to give insight into the combinatorial mechanisms of gene regulation. We observe encouraging prediction accuracy on experiments based on the Gasch S. cerevisiae dataset, and we show that we can accurately predict up- and down-regulation on held-out experiments. We also show how to extract significant regulators, motifs and motif-regulator pairs from the learned models for various stress responses. Our method thus provides predictive hypotheses, suggests biological experiments, and provides interpretable insight into the structure of genetic regulatory networks.

AVAILABILITY: The MLJava package is available upon request to the authors.

Supplementary: Additional results are available from http://www.cs.columbia.edu/compbio/geneclas.
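The data preparation described in the abstract (turning real-valued expression measurements into +1/-1 labels and pairing motif presence with regulator state) can be illustrated with a small sketch. This is not the authors' MLJava code. The threshold value, the function names, the toy gene, regulator and motif names, and the choice to leave regulators out of the target set are all assumptions made for illustration, and the boosting step with alternating decision trees is not shown.

```python
from typing import Optional


def discretize(log_ratio: float, threshold: float = 1.0) -> Optional[int]:
    """Map a log expression ratio to +1 (up), -1 (down), or None (within noise).

    The threshold of 1.0 is a hypothetical noise level, not a value from the paper.
    """
    if log_ratio >= threshold:
        return 1
    if log_ratio <= -threshold:
        return -1
    return None


def pair_features(motifs, parents):
    """Build one binary feature per (motif, regulator, state) combination.

    A feature is present when the motif occurs in the gene's regulatory
    region and the regulator ('parent') is up (+1) or down (-1).
    """
    return {
        (motif, parent, state): 1
        for motif in motifs
        for parent, state in parents.items()
        if state != 0
    }


def build_examples(expression, motif_hits, regulators, threshold=1.0):
    """Create (gene, experiment, features, label) tuples for classification."""
    examples = []
    for experiment, ratios in expression.items():
        # Regulator states in this experiment; values within noise become 0.
        parents = {
            reg: discretize(ratios[reg], threshold) or 0
            for reg in regulators
            if reg in ratios
        }
        for gene, ratio in ratios.items():
            label = discretize(ratio, threshold)
            # Assumption of this sketch: skip genes within noise and
            # do not use the regulators themselves as prediction targets.
            if label is None or gene in regulators:
                continue
            features = pair_features(motif_hits.get(gene, ()), parents)
            examples.append((gene, experiment, features, label))
    return examples


# Toy data with hypothetical names.
expression = {
    "heat_shock": {"REG1": 2.0, "geneA": 1.5, "geneB": -0.2, "geneC": -1.8},
}
motif_hits = {"geneA": {"motifX"}, "geneC": {"motifX", "motifY"}}
regulators = {"REG1"}

examples = build_examples(expression, motif_hits, regulators)
```

Each resulting example carries a +1 or -1 label and a sparse set of motif-regulator features, which is the kind of input a boosted classifier could be trained on.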

Status: Published
Type: Journal Article
PubMed ID: 15262804

Topics addressed in this paper

Number of different genes curated to this paper: 22

Topics not linked to genes:
  • Omics

Genes linked to topics (#1 - 10): ADR1, CAT8, CHA4, GAC1, GAL4, GCN20, GCN4, GIS1, HSF1, MET31
  • Additional Literature: all 10 genes
  • Computational analysis: all 10 genes

Genes linked to topics (#11 - 20): MIG1, MSN2, MSN4, PDR3, PPT1, RAP1, REB1, SLT2, TPK1, USV1
  • Additional Literature: 9 of the 10 genes
  • Computational analysis: all 10 genes
  • Primary Literature: 1 gene

Genes linked to topics (#21 - 22): XBP1, YAP1
  • Additional Literature: both genes
  • Computational analysis: both genes
