Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories. (26th January 2021)
- Record Type:
- Journal Article
- Title:
- Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories. (26th January 2021)
- Main Title:
- Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories
- Authors:
- Phiri, Lighton
- Abstract:
- Higher education institutions typically employ Institutional Repositories (IRs) in order to curate and make available Electronic Theses and Dissertations (ETDs). While most of these IRs are implemented with self-archiving functionalities, self-archiving practices are still a challenge. This arguably leads to inconsistencies in the tagging of digital objects with descriptive metadata, potentially compromising searching and browsing of scholarly research output in IRs. This paper proposes an approach to automatically classify ETDs in IRs, using supervised machine learning techniques, by extracting features from the minimum possible input expected from document authors: the ETD manuscript. The experiment results demonstrate the feasibility of automatically classifying IR ETDs and, additionally, ensuring that repository digital objects are appropriately structured. Automatic classification of repository objects has the obvious benefit of improving the searching and browsing of content in IRs and further presents opportunities for the implementation of third-party tools and extensions that could potentially result in effective self-archiving strategies.
- Is Part Of:
- International journal of metadata, semantics and ontologies. Volume 14:Number 3(2020)
- Journal:
- International journal of metadata, semantics and ontologies
- Issue:
- Volume 14:Number 3(2020)
- Issue Display:
- Volume 14, Issue 3 (2020)
- Year:
- 2020
- Volume:
- 14
- Issue:
- 3
- Issue Sort Value:
- 2020-0014-0003-0000
- Page Start:
- 234
- Page End:
- 248
- Publication Date:
- 2021-01-26
- Subjects:
- digital libraries -- Dublin core -- OAI-PMH -- document classification -- automatic classification -- digital objects -- metadata quality -- electronic theses and dissertations -- ETDs -- institutional repositories -- self-archiving
Metadata -- Periodicals
Semantic Web -- Periodicals
Ontologies (Information retrieval) -- Periodicals
Data structures (Computer science) -- Periodicals
Information theory -- Periodicals
005.74 - Journal URLs:
- http://www.inderscience.com/browse/index.php?journalID=152 ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1744-2621
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 14746.xml