Degree Programs
List of Courses
Search
Navigation
Recognition of Gene Acceptor Site via Ensemble Boosting
J.R.L. Micor, E.R.E. Mojica and J.P. PABICO. 10th Eurasia Conference on Chemical Sciences (EuAsC2S-10), Manila, Philippines, 07-11 January 2008.
Abstract
The complete identification of human genes involves determining subsequences that generate proteins (exons) and those that do not (introns). In RNA splicing, the problem of recognizing the gene acceptor site is concerned with the recognition of the boundaries between these regions, where the current procedure employed by researchers is the GU-AG rule: exon/GU-intron-AG/exon. However, the GU-AG motifs occur so frequently that a typical intron will contain several GUs and AGs within it, resulting in many false sites being recognized. This work investigates the use of a boosting ensemble of neural network classifiers for identification of gene acceptor sites in the human genome. A published dataset of primate exon-intron boundaries was used as a training set for the members of the ensemble to perform two recognition tasks: exon/intron boundaries (EI) and intron/exon boundaries (IE). The ensemble uses the boosting algorithm to pool each member’s output. The proposed boosting ensemble has been applied to recognize gene acceptor junctions of tumor suppressor genes p53 and BRCA1, and an artificial mRNA generated by a random function. The ensemble recognized 5%, 6%, and 11% fewer false sites for p53, BRCA1 and artificial mRNA, respectively, compared to that of the individual recognition by each member.
Keywords: exon, intron, ensemble, boosting.
J.R.L. Micor is Assistant Professor in the Institute of Chemistry, College of Arts and Sciences, University of the Philippines Los Baños.
E.R.E. Mojica is Assistant Professor in the Institute of Chemistry, College of Arts and Sciences, University of the Philippines Los Baños and Graduate Research Assistant in the Department of Chemistry, State University of New York, Buffalo, New York.
Submitted 13 August 2007; Accepted 20 August 2007.
Suggested citation for this online article:
_______. Recognition Of Gene Acceptor Site Via Ensemble Boosting. Accessed 08 September 2008. UPLB-ICS webpage (http://www.ics.uplb.edu.ph/node/251).







