New methodology for repetitive sequences identification in human X and Y chromosomes. (February 2021)
- Record Type:
- Journal Article
- Title:
- New methodology for repetitive sequences identification in human X and Y chromosomes. (February 2021)
- Main Title:
- New methodology for repetitive sequences identification in human X and Y chromosomes
- Authors:
- Touati, Rabeb
Tajouri, Asma
Mesaoudi, Imen
Oueslati, Afef Elloumi
Lachiri, Zied
Kharrat, Maher - Abstract:
- Graphical abstract: Highlights: We converted X and Y chromosomes genomic sequences to numerical representation: DNA images. We developed a new algorithm in the goal to localize the repetitive patterns in the DNA images corresponding to the repetitive sequences. Based on Convolutional neural network (CNN), we developed a classification system to predict the repetitive DNA classes. Furthermore, to the best of our knowledge, our work provides the first deep learning methods applied to DNA images classification task. Abstract: Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species' genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion orGraphical abstract: Highlights: We converted X and Y chromosomes genomic sequences to numerical representation: DNA images. We developed a new algorithm in the goal to localize the repetitive patterns in the DNA images corresponding to the repetitive sequences. Based on Convolutional neural network (CNN), we developed a classification system to predict the repetitive DNA classes. Furthermore, to the best of our knowledge, our work provides the first deep learning methods applied to DNA images classification task. Abstract: Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species' genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion or deletion). However, modifying one or a few numbers of pixels in the image doesn't affect the global form of the repetitive pattern. As a consequence, we generated a new repetitive patterns database which contains tandem and dispersed repeated sequences. The highly repetitive sequences, we have identified in X and Y chromosomes, are shown to be located in other human chromosomes or in other genomes. The data we have generated is then taken as input to a Convolutional neural network classifier in order to classify them. The system we have constructed is efficient and gives an average of 94.4% as recognition score. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 64(2021)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 64(2021)
- Issue Display:
- Volume 64, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 64
- Issue:
- 2021
- Issue Sort Value:
- 2021-0064-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02
- Subjects:
- Repetitive sequences -- Satellites -- Wavelet transform -- Canny edge detection -- Human genome -- New repetitions database -- Convolutional neural network (CNN)
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2020.102207 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23002.xml