Layout logical labelling and finding the semantic relationships between citing and cited paper content. (16th June 2020)
- Record Type:
- Journal Article
- Title:
- Layout logical labelling and finding the semantic relationships between citing and cited paper content. (16th June 2020)
- Main Title:
- Layout logical labelling and finding the semantic relationships between citing and cited paper content
- Authors:
- Parinov, Sergey
Bakarov, Amir
Vodolazcky, Daniil - Abstract:
- Currently, large data sets of in-text citations and citation contexts are becoming available for research and developing tools. Using the "topic model" method to analyse these data, one can characterise thematic relationships between citation contexts from citing and the cited paper content. However, to build relevant topic models and to compare them accurately for papers linked by citation relationships we have to know the semantic labels of PDF papers' layout such as section titles, paragraph boundaries, etc. Recent achievements in papers' conversion from a PDF form into a rich attributed JSON format allow us to develop new approaches for the logical labelling of the papers' layout. This paper presents a re-usable method and open source software for the logical labelling of PDF papers, which gave good quality of a layout element's recognition for a set of research papers. Using these semantic labels we made a precise comparison of topic models built for citing and cited papers and we found some level of similarity between them.
- Is Part Of:
- International journal of metadata, semantics and ontologies. Volume 14:Number 1(2020)
- Journal:
- International journal of metadata, semantics and ontologies
- Issue:
- Volume 14:Number 1(2020)
- Issue Display:
- Volume 14, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 14
- Issue:
- 1
- Issue Sort Value:
- 2020-0014-0001-0000
- Page Start:
- 54
- Page End:
- 62
- Publication Date:
- 2020-06-16
- Subjects:
- Cirtec project -- in-text citation -- citation contexts -- research paper layout recognition -- logical labelling -- hierarchical topic models
Metadata -- Periodicals
Semantic Web -- Periodicals
Ontologies (Information retrieval) -- Periodicals
Data structures (Computer science) -- Periodicals
Information theory -- Periodicals
005.74 - Journal URLs:
- http://www.inderscience.com/browse/index.php?journalID=152 ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1744-2621
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 13094.xml