DNA‐Based Concatenated Encoding System for High‐Reliability and High‐Density Data Storage. Issue 4 (10th February 2022)
- Record Type:
- Journal Article
- Title:
- DNA‐Based Concatenated Encoding System for High‐Reliability and High‐Density Data Storage. Issue 4 (10th February 2022)
- Main Title:
- DNA‐Based Concatenated Encoding System for High‐Reliability and High‐Density Data Storage
- Authors:
- Ren, Yubin
Zhang, Yi
Liu, Yawei
Wu, Qinglin
Su, Juanjuan
Wang, Fan
Chen, Dong
Fan, Chunhai
Liu, Kai
Zhang, Hongjie
- Abstract:
- Abstract: Information storage based on DNA molecules provides a promising solution with the advantages of low‐energy consumption, high storage efficiency, and long lifespan. However, there are only four natural nucleotides, and DNA storage is thus limited to 2 bits per nucleotide. Here, artificial nucleotides are introduced into DNA data storage to achieve a coding efficiency higher than 2 bits per nucleotide. To accommodate the characteristics of DNA synthesis and sequencing, two high‐reliability encoding systems suitable for four, six, and eight nucleotides, i.e., the RaptorQ‐Arithmetic‐LZW‐RS (RALR) and RaptorQ‐Arithmetic‐Base64‐RS (RABR) systems, are developed. The two concatenated encoding systems realize the advantages of correcting DNA sequence losses, correcting errors within DNA sequences, reducing homopolymers, and controlling specific nucleotide contents. The average coding efficiencies with error correction and without arithmetic compression achieved by the RALR system using four, six, and eight nucleotides reach 1.27, 1.61, and 1.85 bits per nucleotide, respectively, while the average coding efficiencies of the RABR system are up to 1.50, 2.00, and 2.35 bits per nucleotide, respectively. The coding efficiency, versatility, and tunability of the developed artificial DNA systems might provide significant guidance for high‐reliability and high‐density data storage.
Abstract: Two types of high‐reliability DNA encoding systems suitable for the combination of natural and artificial nucleotides are developed for high‐reliability and high‐density data storage. Several advantages, including ultra‐high coding efficiency, correcting DNA sequence losses, correcting errors within DNA sequences, reducing homopolymers, and controlling specific nucleotide contents, are realized by the concatenated encoding strategies.
- Is Part Of:
- Small methods. Volume 6:Issue 4(2022)
- Journal:
- Small methods
- Issue:
- Volume 6:Issue 4(2022)
- Issue Display:
- Volume 6, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 6
- Issue:
- 4
- Issue Sort Value:
- 2022-0006-0004-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2022-02-10
- Subjects:
- artificial nucleotides -- data storage -- DNA -- encoding systems -- high‐density data storage
Nanotechnology -- Methodology -- Periodicals
Nanotechnology -- Periodicals
Periodicals
620.5028
- Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2366-9608
http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/smtd.202101335
- Languages:
- English
- ISSNs:
- 2366-9608
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 8310.049300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21305.xml
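
Note on the coding efficiencies quoted in the abstract: an alphabet of n equally usable nucleotides can carry at most log2(n) bits per nucleotide, which is where the 2-bit limit for the four natural bases comes from. The sketch below is not part of the catalogue record or the article. It computes that bound for four, six, and eight nucleotides and shows what fraction of it the reported averages reach. The function names and the dictionary layout are illustrative assumptions; the efficiency values are copied from the abstract.

```python
import math

# Average coding efficiencies (bits per nucleotide) as reported in the abstract,
# with error correction and without arithmetic compression.
# The dictionary layout is an illustrative assumption, not the authors' format.
REPORTED = {
    "RALR": {4: 1.27, 6: 1.61, 8: 1.85},
    "RABR": {4: 1.50, 6: 2.00, 8: 2.35},
}


def max_bits_per_nucleotide(alphabet_size: int) -> float:
    """Upper bound on bits per symbol for an unconstrained alphabet: log2(n)."""
    if alphabet_size < 2:
        raise ValueError("alphabet_size must be at least 2")
    return math.log2(alphabet_size)


def fraction_of_limit(efficiency: float, alphabet_size: int) -> float:
    """Ratio of a reported efficiency to the log2(n) bound."""
    return efficiency / max_bits_per_nucleotide(alphabet_size)


if __name__ == "__main__":
    for system, by_alphabet in REPORTED.items():
        for n, eff in by_alphabet.items():
            bound = max_bits_per_nucleotide(n)
            share = fraction_of_limit(eff, n)
            print(f"{system}, {n} nucleotides: {eff:.2f} of {bound:.3f} "
                  f"bits/nt ({share:.0%} of the bound)")
```

The gap between each reported value and the bound reflects the redundancy spent on error correction and on constraints such as homopolymer reduction, which the abstract lists among the systems' features.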