Adapting natural language processing for technical text. Issue 3 (30th June 2021)
- Record Type:
- Journal Article
- Title:
- Adapting natural language processing for technical text. Issue 3 (30th June 2021)
- Main Title:
- Adapting natural language processing for technical text
- Authors:
- Dima, Alden
Lukens, Sarah
Hodkiewicz, Melinda
Sexton, Thurston
Brundage, Michael P. - Abstract:
- Abstract: Despite recent dramatic successes, natural language processing (NLP) is not ready to address a variety of real‐world problems. Its reliance on large standard corpora, a training and evaluation paradigm that favors the learning of shallow heuristics, and large computational resource requirements, makes domain‐specific application of even the most successful NLP techniques difficult. This paper proposes technical language processing (TLP) which brings engineering principles and practices to NLP specifically for the purpose of extracting actionable information from language generated by experts in their technical tasks, systems, and processes. TLP envisages NLP as a socio‐technical system rather than as an algorithmic pipeline. We describe how the TLP approach to meaning and generalization differs from that of NLP, how data quantity and quality can be addressed in engineering technical domains, and the potential risks of not adapting NLP for technical use cases. Engineering problems can benefit immensely from the inclusion of knowledge from unstructured data, currently unavailable due to issues with out of the box NLP packages. We illustrate the TLP approach by focusing on maintenance in industrial organizations as a case‐study. Abstract : This paper proposes technical language processing (TLP) which brings engineering principles and practices to natural language processing (NLP) specifically for the purpose of extracting actionable information from language generatedAbstract: Despite recent dramatic successes, natural language processing (NLP) is not ready to address a variety of real‐world problems. Its reliance on large standard corpora, a training and evaluation paradigm that favors the learning of shallow heuristics, and large computational resource requirements, makes domain‐specific application of even the most successful NLP techniques difficult. This paper proposes technical language processing (TLP) which brings engineering principles and practices to NLP specifically for the purpose of extracting actionable information from language generated by experts in their technical tasks, systems, and processes. TLP envisages NLP as a socio‐technical system rather than as an algorithmic pipeline. We describe how the TLP approach to meaning and generalization differs from that of NLP, how data quantity and quality can be addressed in engineering technical domains, and the potential risks of not adapting NLP for technical use cases. Engineering problems can benefit immensely from the inclusion of knowledge from unstructured data, currently unavailable due to issues with out of the box NLP packages. We illustrate the TLP approach by focusing on maintenance in industrial organizations as a case‐study. Abstract : This paper proposes technical language processing (TLP) which brings engineering principles and practices to natural language processing (NLP) specifically for the purpose of extracting actionable information from language generated by experts in their technical tasks, systems, and processes. TLP envisages NLP as a socio‐technical system rather than as an algorithmic pipeline. We describe how the TLP approach to meaning and generalization differs from that of NLP, how data quantity and quality can be addressed in engineering technical domains, and the potential risks of not adapting NLP for technical use cases. … (more)
- Is Part Of:
- Applied AI Letters. Volume 2:Issue 3(2021)
- Journal:
- Applied AI Letters
- Issue:
- Volume 2:Issue 3(2021)
- Issue Display:
- Volume 2, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 2
- Issue:
- 3
- Issue Sort Value:
- 2021-0002-0003-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-06-30
- Subjects:
- domain adaptation -- maintenance records -- natural language processing -- technical data -- technical language processing
006.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/ail2.33 ↗
- Languages:
- English
- ISSNs:
- 2689-5595
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18982.xml