Regression trees for regulatory element identification

Cited 21 time in webofscience Cited 24 time in scopus
  • Hit : 393
  • Download : 508
Motivation: The transcription of a gene is largely determined by short sequence motifs that serve as binding sites for transcription factors. Recent findings suggest direct relationships between the motifs and gene expression levels. In this work, we present a method for identifying regulatory motifs. Our method makes use of tree-based techniques for recovering the relationships between motifs and gene expression levels. Results: We treat regulatory motifs and gene expression levels as predictor variables and responses, respectively, and use a regression tree model to identify the structural relationships between them. The regression tree methodology is extended to handle responses from multiple experiments by modifying the split function. The significance of regulatory elements is determined by analyzing tree structures and using a variable importance measure. When applied to two data sets of the yeast Saccharomyces cerevisiae, the method successfully identifies most of the regulatory motifs that are known to control gene transcription under the given experimental conditions, and suggests several new putative motifs. Analysis of the tree structures also reconfirms several pairs of motifs that are known to regulate gene transcription in combination.
Publisher
OXFORD UNIV PRESS
Issue Date
2004-03
Language
English
Article Type
Article
Keywords

EXPRESSION; DISCOVERY; NETWORKS; SITES; YEAST; GENES

Citation

BIOINFORMATICS, v.20, no.5, pp.750 - 757

ISSN
1367-4803
DOI
10.1093/bioinformatics/btg480
URI
http://hdl.handle.net/10203/18465
Appears in Collection
BiS-Journal Papers(저널논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 21 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0