Breaking the computational barrier: a divide-conquer and aggregate based approach for Alu insertion site characterisation. (5th January 2010)
- Record Type:
- Journal Article
- Title:
- Breaking the computational barrier: a divide-conquer and aggregate based approach for Alu insertion site characterisation. (5th January 2010)
- Main Title:
- Breaking the computational barrier: a divide-conquer and aggregate based approach for Alu insertion site characterisation
- Authors:
- Zhang, Kun
Fan, Wei
Deininger, Prescott
Edwards, Andrea
Xu, Zujia
Zhu, Dongxiao - Abstract:
- Insertion site characterisation of Alu elements is an important problem in primate-specific bioinformatics research. Key characteristics of this challenging problem include: data are not in the pre-defined feature vectors for predictive model construction; without any prior knowledge, can we discover the general patterns that could exist and also make biological insights?; how to obtain the compact yet discriminative patterns given a search space of 4200? This paper provides an integrated algorithmic framework for fulfilling the above mining tasks. Compared to the benchmark biological study, our results provide a further refined analysis of the patterns involved in Alu insertion. In particular, we acquire a 200nt predictive profile around the primary insertion site which not only contains the widely accepted consensus, but also suggests a longer pattern (T)7AA[G'A]AATAA. This pattern provides more insight into the favourable sequence variations allowed for preferred binding and cleavage by the L1 ORF2 endonuclease. The proposed method is general enough that can be also applied to other sequence detection problems, such as microRNA target prediction.
- Is Part Of:
- International journal of computational biology and drug design. Volume 2:Number 4(2009)
- Journal:
- International journal of computational biology and drug design
- Issue:
- Volume 2:Number 4(2009)
- Issue Display:
- Volume 2, Issue 4 (2009)
- Year:
- 2009
- Volume:
- 2
- Issue:
- 4
- Issue Sort Value:
- 2009-0002-0004-0000
- Page Start:
- 302
- Page End:
- 322
- Publication Date:
- 2010-01-05
- Subjects:
- frequent pattern discovery -- Alu insertion sites -- feature construction -- sequence-based prediction -- data mining -- machine learning -- primate-specific bioinformatics -- Alu elements -- sequence detection -- microRNA target prediction -- retrotransposable elements -- mobile DNA sequences
Computational biology -- Periodicals
Drugs -- Design -- Periodicals
570.285 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijcbdd ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1756-0756
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 8398.xml