PDC: a highly compact file format to store protein 3D coordinates. (3rd April 2023)
- Record Type:
- Journal Article
- Title:
- PDC: a highly compact file format to store protein 3D coordinates. (3rd April 2023)
- Main Title:
- PDC: a highly compact file format to store protein 3D coordinates
- Authors:
- Zhang, Chengxin
Pyle, Anna Marie
- Abstract:
- Recent improvements in computational and experimental techniques for obtaining protein structures have resulted in an explosion of 3D coordinate data. To cope with the ever-increasing sizes of structure databases, this work proposes the Protein Data Compression (PDC) format, which compresses coordinates and temperature factors of full-atomic and Cα-only protein structures. Without loss of precision, PDC results in 69% to 78% smaller file sizes than Protein Data Bank (PDB) and macromolecular Crystallographic Information File (mmCIF) files with standard GZIP compression. It uses ∼60% less space than existing compression algorithms specific to macromolecular structures. PDC optionally performs lossy compression with minimal sacrifice of precision, which allows reduction of file sizes by another 79%. Conversion between PDC, mmCIF and PDB formats is typically achieved within 0.02 s. The compactness and fast reading/writing speed of PDC make it valuable for storage and analysis of large quantities of tertiary structural data. Database URL: https://github.com/kad-ecoli/pdc
- Is Part Of:
- Database. Volume 2023(2023)
- Journal:
- Database
- Issue:
- Volume 2023(2023)
- Issue Display:
- Volume 2023, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 2023
- Issue:
- 2023
- Issue Sort Value:
- 2023-2023-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-04-03
- Subjects:
- Biology -- Databases -- Periodicals
Bioinformatics -- Periodicals
570.285
- Journal URLs:
- http://database.oxfordjournals.org/
http://ukcatalogue.oup.com/
- DOI:
- 10.1093/database/baad018
- Languages:
- English
- ISSNs:
- 1758-0463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 27013.xml