ACRank: a multi-evidence text-mining model for alliance discovery from news articles. Issue 5 (24th June 2020)
- Record Type:
- Journal Article
- Title:
- ACRank: a multi-evidence text-mining model for alliance discovery from news articles. Issue 5 (24th June 2020)
- Main Title:
- ACRank: a multi-evidence text-mining model for alliance discovery from news articles
- Authors:
- Zhou, Yilu
Xue, Yuan - Abstract:
- Abstract : Purpose: Strategic alliances among organizations are some of the central drivers of innovation and economic growth. However, the discovery of alliances has relied on pure manual search and has limited scope. This paper proposes a text-mining framework, ACRank, that automatically extracts alliances from news articles. ACRank aims to provide human analysts with a higher coverage of strategic alliances compared to existing databases, yet maintain a reasonable extraction precision. It has the potential to discover alliances involving less well-known companies, a situation often neglected by commercial databases. Design/methodology/approach: The proposed framework is a systematic process of alliance extraction and validation using natural language processing techniques and alliance domain knowledge. The process integrates news article search, entity extraction, and syntactic and semantic linguistic parsing techniques. In particular, Alliance Discovery Template (ADT) identifies a number of linguistic templates expanded from expert domain knowledge and extract potential alliances at sentence-level. Alliance Confidence Ranking (ACRank)further validates each unique alliance based on multiple features at document-level. The framework is designed to deal with extremely skewed, noisy data from news articles. Findings: In evaluating the performance of ACRank on a gold standard data set of IBM alliances (2006–2008) showed that: Sentence-level ADT-based extraction achieved 78.1%Abstract : Purpose: Strategic alliances among organizations are some of the central drivers of innovation and economic growth. However, the discovery of alliances has relied on pure manual search and has limited scope. This paper proposes a text-mining framework, ACRank, that automatically extracts alliances from news articles. ACRank aims to provide human analysts with a higher coverage of strategic alliances compared to existing databases, yet maintain a reasonable extraction precision. It has the potential to discover alliances involving less well-known companies, a situation often neglected by commercial databases. Design/methodology/approach: The proposed framework is a systematic process of alliance extraction and validation using natural language processing techniques and alliance domain knowledge. The process integrates news article search, entity extraction, and syntactic and semantic linguistic parsing techniques. In particular, Alliance Discovery Template (ADT) identifies a number of linguistic templates expanded from expert domain knowledge and extract potential alliances at sentence-level. Alliance Confidence Ranking (ACRank)further validates each unique alliance based on multiple features at document-level. The framework is designed to deal with extremely skewed, noisy data from news articles. Findings: In evaluating the performance of ACRank on a gold standard data set of IBM alliances (2006–2008) showed that: Sentence-level ADT-based extraction achieved 78.1% recall and 44.7% precision and eliminated over 99% of the noise in news articles. ACRank further improved precision to 97% with the top20% of extracted alliance instances. Further comparison with Thomson Reuters SDC database showed that SDC covered less than 20% of total alliances, while ACRank covered 67%. When applying ACRank to Dow 30 company news articles, ACRank is estimated to achieve a recall between 0.48 and 0.95, and only 15% of the alliances appeared in SDC. Originality/value: The research framework proposed in this paper indicates a promising direction of building a comprehensive alliance database using automatic approaches. It adds value to academic studies and business analyses that require in-depth knowledge of strategic alliances. It also encourages other innovative studies that use text mining and data analytics to study business relations. … (more)
- Is Part Of:
- Information technology & people. Volume 33:Issue 5(2020)
- Journal:
- Information technology & people
- Issue:
- Volume 33:Issue 5(2020)
- Issue Display:
- Volume 33, Issue 5 (2020)
- Year:
- 2020
- Volume:
- 33
- Issue:
- 5
- Issue Sort Value:
- 2020-0033-0005-0000
- Page Start:
- 1357
- Page End:
- 1380
- Publication Date:
- 2020-06-24
- Subjects:
- Strategic alliances -- Knowledge discovery -- Business intelligence -- Web mining -- Text mining -- Information extraction -- Template-based -- Chunk parsing
Information technology -- Periodicals
Management information systems -- Periodicals
Human-computer interaction -- Periodicals
004 - Journal URLs:
- http://info.emeraldinsight.com/products/journals/journals.htm?id=itp ↗
http://www.emeraldinsight.com/0959-3845.htm ↗
http://www.emeraldinsight.com/itp.htm ↗
http://firstsearch.oclc.org ↗
http://www.emeraldinsight.com/ ↗ - DOI:
- 10.1108/ITP-06-2018-0272 ↗
- Languages:
- English
- ISSNs:
- 0959-3845
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.368733
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22340.xml