A new approach in DNA sequence compression: Fast DNA sequence compression using parallel chaos game representation. (February 2019)
- Record Type:
- Journal Article
- Title:
- A new approach in DNA sequence compression: Fast DNA sequence compression using parallel chaos game representation. (February 2019)
- Main Title:
- A new approach in DNA sequence compression: Fast DNA sequence compression using parallel chaos game representation
- Authors:
- poor, Nafise Ramezani
Yaghoobi, Mahdi
- Abstract:
- Highlights: A new fast DNA sequence compression method using parallel chaos game representation is proposed. The mechanism relies on compressing DNA sequences using parallel chaos game representation. The chaos game is an iterative function that creates a fractal image from a DNA sequence. Frequency Chaos Game Representation uses the repetition of characters in DNA. The method demonstrates high efficiency and a good compression rate.
Abstract: A DNA sequence is a long string containing significant hidden genetic information that biological researchers in different laboratories use in genome comparison, medicine, engineering, and other fields. With the growth of DNA research, users face challenges in transferring, maintaining, and storing the data. Because such sequences are large, they need a lot of storage space, so a method is needed to reduce the space required. Data compression may be an efficient way to reduce the size of DNA sequences, lowering storage space and transfer bandwidth requirements. The effectiveness and importance of data compression can be seen in the compression of existing sequences in databases, in image and video compression, and in standards such as DICOM. The proposed algorithm is a hybrid consisting of four phases: in phase 1 it divides the sequence into subsequences and applies a parallel chaos game representation (CGR); in phase 2 it replaces high-frequency substrings using a dictionary method; in phase 3 it applies parallel Huffman coding; and in phase 4 it creates a structure based on the Huffman results. Because the algorithm runs in parallel and creates a dictionary for each subsequence, compression speed increases. In addition, because CGR provides all possible patterns, there is no need to search for patterns, which reduces computational complexity and time. With this method, the benchmark DNA sequence "MPOMTCG" achieved a compression ratio of 1.6.
- Is Part Of:
- Expert systems with applications. Volume 116(2019)
- Journal:
- Expert systems with applications
- Issue:
- Volume 116(2019)
- Issue Display:
- Volume 116, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 116
- Issue:
- 2019
- Issue Sort Value:
- 2019-0116-2019-0000
- Page Start:
- 487
- Page End:
- 493
- Publication Date:
- 2019-02
- Subjects:
- Chaos game representation -- Parallel chaos game representation -- DNA sequence -- Huffman coding
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33
- Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.eswa.2018.09.012
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 7943.xml
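
The abstract describes phase 1 as splitting the sequence into subsequences and building a chaos game representation for each in parallel. The following is a minimal illustrative sketch of that idea only, not the authors' implementation. The corner assignment in `CORNERS`, the function names, the `chunk_size` parameter, and the use of a thread pool are all assumptions made for this example; the dictionary and Huffman phases are not shown.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Assumed corner layout of the unit square; the article may use another one.
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def cgr_points(seq):
    """Chaos game trajectory: start at the centre of the square and, for each
    base, move halfway towards that base's corner."""
    x, y = 0.5, 0.5
    points = []
    for base in seq:
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def fcgr_counts(seq, k):
    """Frequency CGR as k-mer counts: each length-k substring corresponds to
    one cell of a 2^k x 2^k grid, so counting substrings fills the grid."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def chunked_fcgr(seq, k, chunk_size):
    """Split the sequence into fixed-size chunks and count k-mers per chunk
    concurrently. K-mers that span a chunk boundary are not counted here."""
    chunks = [seq[i:i + chunk_size] for i in range(0, len(seq), chunk_size)]
    with ThreadPoolExecutor() as pool:
        return list(pool.map(lambda chunk: fcgr_counts(chunk, k), chunks))
```

Because the per-chunk counts already list every substring of length k with its frequency, a later dictionary step could pick high-frequency substrings from them without a separate pattern search, which is the benefit the abstract attributes to CGR.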