Semantic indexing with deep learning: a case study. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Semantic indexing with deep learning: a case study. Issue 1 (December 2016)
- Main Title:
- Semantic indexing with deep learning: a case study
- Authors:
- Yan, Yan
Yin, Xu-Cheng
Zhang, Bo-Wen
Yang, Chun
Hao, Hong-Wei
- Abstract:
- Background: Deep learning techniques, particularly convolutional neural networks (CNNs), are poised for widespread application in information retrieval and natural language processing. However, very few publications address semantic indexing with deep learning. In particular, there are few studies of semantic indexing in biomedical literature because of several specific challenges, including the vast number of semantic labels that arise from automatically annotating MEDLINE citations with MeSH terms and a massive collection that provides only title and abstract information. Results: In this paper, we introduce a novel CNN-based semantic indexing method for biomedical abstract document collections. First, we adaptively group word2vec categories into (coarse) subsets by clustering. Next, we construct a high-dimensional space representation with Wikipedia category extension, which contains more semantic information than bag-of-words. Thereafter, we design a hierarchical CNN indexing architecture for learning documents from a coarse- to fine-grained level with several multi-label training techniques. We believe that the low-dimensional representation of the output layer in CNNs should be more compact and effective. Finally, we perform comparative experiments for semantic indexing of biomedical abstract documents. Conclusion: Experimental results on the MEDLINE dataset show that our model outperforms conventional models.
- Is Part Of:
- Big data analytics. Volume 1:Issue 1(2016)
- Journal:
- Big data analytics
- Issue:
- Volume 1:Issue 1(2016)
- Issue Display:
- Volume 1, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 1
- Issue:
- 1
- Issue Sort Value:
- 2016-0001-0001-0000
- Page Start:
- 1
- Page End:
- 13
- Publication Date:
- 2016-12
- Subjects:
- Deep learning -- Semantic indexing -- Convolutional neural networks -- Biomedical documents
Big data -- Periodicals
Biology -- Data processing -- Periodicals
570.28557
- Journal URLs:
- https://bdataanalytics.biomedcentral.com/
http://link.springer.com/
- DOI:
- 10.1186/s41044-016-0007-z
- Languages:
- English
- ISSNs:
- 2058-6345
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 9927.xml