A method for automatic analysis Table of Contents in Chinese books. Issue 3 (21st September 2015)
- Record Type:
- Journal Article
- Title:
- A method for automatic analysis Table of Contents in Chinese books. Issue 3 (21st September 2015)
- Main Title:
- A method for automatic analysis Table of Contents in Chinese books
- Authors:
- Chen, Jing
Lu, Quan
- Abstract:
- Purpose: The purpose of this paper is to propose a novel method to analyze the Table of Contents (TOC) in Chinese books automatically, based on hierarchy organization rules obtained through investigation.
Design/methodology/approach: This paper first analyzed the main literature in this field; hierarchy organization rules of Chinese book TOCs were then generated, and a method for parsing TOCs automatically based on these rules was proposed. A prototype system implementing the method was also developed. The method was evaluated by processing a corpus on the prototype system, and the results were checked by calculating precision and recall.
Findings: The experimental results illustrated the superiority of the method (extensive application; recall is 95.34 percent and precision is 94.44 percent).
Practical implications: The result can help Chinese libraries deal with electronic texts in four respects. First, it can be used to complement or enhance current digitization and optical character recognition methods and cut the financial and labor costs of Chinese libraries. Second, it can help libraries keep information on indexing words as well as chapters, sections and subsections in Chinese book databases, which ensures easy retrieval and extraction of any intended portion as demanded by the user. Third, it helps to enrich the services and thereby enhance the user experience in Chinese libraries. Fourth, it improves the specification and policy of digitizing Chinese books.
Originality/value: The paper provides insight into the hierarchy organization of TOCs in Chinese books; the method based on the rules has more extensive application than other methods. This method for automatic analysis of Chinese book TOCs can also serve as a reference for automatic analysis of English book TOCs.
- Is Part Of:
- Library hi tech. Volume 33:Issue 3(2015)
- Journal:
- Library hi tech
- Issue:
- Volume 33:Issue 3(2015)
- Issue Display:
- Volume 33, Issue 3 (2015)
- Year:
- 2015
- Volume:
- 33
- Issue:
- 3
- Issue Sort Value:
- 2015-0033-0003-0000
- Page Start:
- 424
- Page End:
- 438
- Publication Date:
- 2015-09-21
- Subjects:
- Library science -- Technological innovations -- Periodicals
Libraries -- Automation -- Periodicals
Information science -- Periodicals
025.00285
- Journal URLs:
- http://www.emeraldinsight.com/0737-8831.htm
http://www.emeraldinsight.com/
http://firstsearch.oclc.org
- DOI:
- 10.1108/LHT-05-2015-0043
- Languages:
- English
- ISSNs:
- 0737-8831
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 5198.870000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 3852.xml