UIMA Ruta: Rapid development of rule-based information extraction applications. (8th October 2014)
- Record Type:
- Journal Article
- Title:
- UIMA Ruta: Rapid development of rule-based information extraction applications. (8th October 2014)
- Main Title:
- UIMA Ruta: Rapid development of rule-based information extraction applications
- Authors:
- KLUEGL, PETER
TOEPFER, MARTIN
BECK, PHILIP-DANIEL
FETTE, GEORG
PUPPE, FRANK - Abstract:
- Abstract: Rule-based information extraction is an important approach for processing the increasingly available amount of unstructured data. The manual creation of rule-based applications is a time-consuming and tedious task, which requires qualified knowledge engineers. The costs of this process can be reduced by providing a suitable rule language and extensive tooling support. This paper presents UIMA Ruta, a tool for rule-based information extraction and text processing applications. The system was designed with focus on rapid development. The rule language and its matching paradigm facilitate the quick specification of comprehensible extraction knowledge. They support a compact representation while still providing a high level of expressiveness. These advantages are supplemented by the development environment UIMA Ruta Workbench. It provides, in addition to extensive editing support, essential assistance for explanation of rule execution, introspection, automatic validation, and rule induction. UIMA Ruta is a useful tool for academia and industry due to its open source license. We compare UIMA Ruta to related rule-based systems especially concerning the compactness of the rule representation, the expressiveness, and the provided tooling support. The competitiveness of the runtime performance is shown in relation to a popular and freely-available system. A selection of case studies implemented with UIMA Ruta illustrates the usefulness of the system in real-world scenarios.
- Is Part Of:
- Natural language engineering. Volume 22:Part 1(2016)
- Journal:
- Natural language engineering
- Issue:
- Volume 22:Part 1(2016)
- Issue Display:
- Volume 22, Issue 1, Part 1 (2016)
- Year:
- 2016
- Volume:
- 22
- Issue:
- 1
- Part:
- 1
- Issue Sort Value:
- 2016-0022-0001-0001
- Page Start:
- 1
- Page End:
- 40
- Publication Date:
- 2014-10-08
- Subjects:
- Natural language processing (Computer science) -- Periodicals
Software engineering -- Periodicals
006.35 - Journal URLs:
- http://journals.cambridge.org/action/displayJournal?jid=NLE ↗
- DOI:
- 10.1017/S1351324914000114 ↗
- Languages:
- English
- ISSNs:
- 1351-3249
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 1688.xml