TwitterNEED: A hybrid approach for named entity extraction and disambiguation for tweet*. (10th July 2015)
- Record Type:
- Journal Article
- Title:
- TwitterNEED: A hybrid approach for named entity extraction and disambiguation for tweet*. (10th July 2015)
- Main Title:
- TwitterNEED: A hybrid approach for named entity extraction and disambiguation for tweet*
- Authors:
- HABIB, MENA B.
VAN KEULEN, MAURICE - Abstract:
- Abstract: Twitter is a rich source of continuously and instantly updated information. Shortness and informality of tweets are challenges for Natural Language Processing tasks. In this paper, we present TwitterNEED, a hybrid approach for Named Entity Extraction and Named Entity Disambiguation for tweets. We believe that disambiguation can help to improve the extraction process. This mimics the way humans understand language and reduces error propagation in the whole system. Our extraction approach aims for high extraction recall first, after which a Support Vector Machine attempts to filter out false positives among the extracted candidates using features derived from the disambiguation phase in addition to other word shape and Knowledge Base features. For Named Entity Disambiguation, we obtain a list of entity candidates from the YAGO Knowledge Base in addition to top-ranked pages from the Google search engine for each extracted mention. We use a Support Vector Machine to rank the candidate pages according to a set of URL and context similarity features. For evaluation, five data sets are used to evaluate the extraction approach, and three of them to evaluate both the disambiguation approach and the combined extraction and disambiguation approach. Experiments show better results compared to our competitors DBpedia Spotlight, Stanford Named Entity Recognition, and the AIDA disambiguation system.
- Is Part Of:
- Natural language engineering. Volume 22:Part 3(2016)
- Journal:
- Natural language engineering
- Issue:
- Volume 22:Part 3(2016)
- Issue Display:
- Volume 22, Issue 3, Part 3 (2016)
- Year:
- 2016
- Volume:
- 22
- Issue:
- 3
- Part:
- 3
- Issue Sort Value:
- 2016-0022-0003-0003
- Page Start:
- 423
- Page End:
- 456
- Publication Date:
- 2015-07-10
- Subjects:
- Natural language processing (Computer science) -- Periodicals
Software engineering -- Periodicals
006.35 - Journal URLs:
- http://journals.cambridge.org/action/displayJournal?jid=NLE ↗
- DOI:
- 10.1017/S1351324915000194 ↗
- Languages:
- English
- ISSNs:
- 1351-3249
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 2269.xml