Variable indexing method in rule documents for ship design using extraction of portable document format elements. Issue 6 (16th November 2022)
- Record Type:
- Journal Article
- Title:
- Variable indexing method in rule documents for ship design using extraction of portable document format elements. Issue 6 (16th November 2022)
- Main Title:
- Variable indexing method in rule documents for ship design using extraction of portable document format elements
- Authors:
- Kong, Min-Chul
Roh, Myung-Il
Kim, Ki-Su
Kim, Jongoh
Kim, Ju-Sung
Park, Hogyun
- Abstract:
- Design rules for ships have become more extensive and detailed due to an increase in the sizes of ships. Several variables and equations used in the rules are complex, and the sheer volume of the rules impedes their review. In addition, because these rules are constantly revised, professional investigators may miss these changes. To prevent such confusion, a shipping register, which approves ship drawings, constantly automates the search and review processes of the rules. Consequently, this study proposes a method for recognizing variables in documents to review the rules and build relationships between variables. Each component of a document must be accurately identified. The document containing these rules includes different components such as equations, figures, and strings. Because these rules are mainly converted to a portable document format (PDF) for compatibility, it is challenging to extract each component as raw data. This study used a public library to extract elements from the PDF and utilized the positional relationship between the elements to identify the variables. By applying the Levenshtein distance algorithm, which compares the differences between two strings, the document was partitioned according to the table of contents. Hence, the identified variables were indexed into sections of the table of contents. Additionally, based on the indexed information, a data structure was proposed to show the equations, definition of variables, and relationships. This study applied it to common structural rules, which are widely used in the shipbuilding industry. The effectiveness of the proposed method was confirmed by achieving an F1 score of 0.93 in variable recognition and intuitively visualizing the relationship between the variables.
- Is Part Of:
- Journal of computational design and engineering. Volume 9:Issue 6(2022)
- Journal:
- Journal of computational design and engineering
- Issue:
- Volume 9:Issue 6(2022)
- Issue Display:
- Volume 9, Issue 6 (2022)
- Year:
- 2022
- Volume:
- 9
- Issue:
- 6
- Issue Sort Value:
- 2022-0009-0006-0000
- Page Start:
- 2556
- Page End:
- 2573
- Publication Date:
- 2022-11-16
- Subjects:
- ship design -- design rule -- rule examination -- variable indexing -- chaining rule -- PDF extraction
Engineering -- Data processing -- Periodicals
Computer-aided design -- Periodicals
Computer-aided design
Engineering -- Data processing
Electronic journals
Periodicals
620.0042
- Journal URLs:
- http://bibpurl.oclc.org/web/76338
http://www.jcde.org/
http://www.sciencedirect.com/science/journal/22884300
http://www.journals.elsevier.com/journal-of-computational-design-and-engineering
https://academic.oup.com/jcde
http://www.oxfordjournals.org/
- DOI:
- 10.1093/jcde/qwac123
- Languages:
- English
- ISSNs:
- 2288-4300
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 24814.xml
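
- Illustrative Note:
- The abstract describes partitioning a rule document by comparing its text against the table of contents with the Levenshtein distance. The Python sketch below shows one way that idea could work and is not the authors' implementation. The function names, the normalisation by string length, the 0.2 threshold, and the sample headings are all assumptions made for illustration. The table of contents is assumed to be non-empty.

```python
from typing import Dict, List


def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def normalized_distance(a: str, b: str) -> float:
    """Edit distance scaled to [0, 1] by the longer string (an assumption)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


def partition_by_toc(lines: List[str], toc: List[str],
                     threshold: float = 0.2) -> Dict[str, List[str]]:
    """Assign each line to the most recent heading that fuzzily matches a ToC entry."""
    sections: Dict[str, List[str]] = {title: [] for title in toc}
    current = None
    for line in lines:
        text = line.strip()
        # Find the ToC entry closest to this line.
        best = min(toc, key=lambda t: normalized_distance(text.lower(), t.lower()))
        if normalized_distance(text.lower(), best.lower()) <= threshold:
            current = best  # the line is treated as a section heading
        elif current is not None:
            sections[current].append(text)
    return sections


# Hypothetical extracted lines with small extraction errors in the headings.
toc = ["1 General", "2 Hull girder strength"]
lines = ["1 Genera1", "L is the rule length",
         "2 Hull girder strenght", "M is the bending moment"]
sections = partition_by_toc(lines, toc)
```

Fuzzy matching is used here rather than exact comparison because headings extracted from a PDF may differ slightly from the table of contents entries; the threshold would need tuning on real documents.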