Semantics to the rescue of document‐based XML diff: A JATS case study. (12th February 2022)
- Record Type:
- Journal Article
- Title:
- Semantics to the rescue of document‐based XML diff: A JATS case study. (12th February 2022)
- Main Title:
- Semantics to the rescue of document‐based XML diff: A JATS case study
- Authors:
- Cuculovic, Milos
Fondement, Frederic
Devanne, Maxime
Weber, Jonathan
Hassenforder, Michel
- Abstract:
- The writing of digital text documents has become a longer process that usually goes through revision rounds. Document comparison is important for the human reader interested in changes made by the authors. These documents contain structural data using text‐centric XML as one of their main storage systems. Current XML diff algorithms are able to represent differences with a limited number of edit operations: insert, delete, move and update. This approach does not fit the scope of digital text document comparison where the human reader needs to understand actual modifications made by the author. With JATS being a text‐centric XML vocabulary, we propose within this paper a new XML diff algorithm called jats‐diff, able to support bijection between higher‐level modifications made by the authors, such as structural changes and restyling, and the changes detected between XML documents. In addition, jats‐diff provides similarity information between different nodes in order to measure the impact of the text changes on the XML tree.
- Is Part Of:
- Software, practice & experience. Volume 52:Number 6(2022)
- Journal:
- Software, practice & experience
- Issue:
- Volume 52:Number 6(2022)
- Issue Display:
- Volume 52, Issue 6 (2022)
- Year:
- 2022
- Volume:
- 52
- Issue:
- 6
- Issue Sort Value:
- 2022-0052-0006-0000
- Page Start:
- 1496
- Page End:
- 1516
- Publication Date:
- 2022-02-12
- Subjects:
- academic publishing -- change semantics -- diff algorithms -- document comparison -- high‐level changes -- JATS -- text‐centric XML -- XML diff
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/spe.3074
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 21383.xml