Transcoding billions of Unicode characters per second with SIMD instructions. (13th October 2021)
- Record Type:
- Journal Article
- Title:
- Transcoding billions of Unicode characters per second with SIMD instructions. (13th October 2021)
- Main Title:
- Transcoding billions of Unicode characters per second with SIMD instructions
- Authors:
- Lemire, Daniel
Muła, Wojciech
- Abstract:
- In software, text is often represented using Unicode formats (UTF‐8 and UTF‐16). We frequently have to convert text from one format to the other, a process called transcoding. Popular transcoding functions are slower than state‐of‐the‐art disks and networks. These transcoding functions make little use of the single‐instruction‐multiple‐data (SIMD) instructions available on commodity processors. By designing transcoding algorithms for SIMD instructions, we multiply the speed of transcoding on current systems (x64 and ARM). To ensure reproducibility, we make our software freely available as an open source library.
- Is Part Of:
- Software, practice & experience. Volume 52:Number 2(2022)
- Journal:
- Software, practice & experience
- Issue:
- Volume 52:Number 2(2022)
- Issue Display:
- Volume 52, Issue 2 (2022)
- Year:
- 2022
- Volume:
- 52
- Issue:
- 2
- Issue Sort Value:
- 2022-0052-0002-0000
- Page Start:
- 555
- Page End:
- 575
- Publication Date:
- 2021-10-13
- Subjects:
- character encoding -- text processing -- Unicode -- vectorization
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/spe.3036
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 20392.xml