Automated patent classification for crop protection via domain adaptation. Issue 1 (27th February 2023)
- Record Type:
- Journal Article
- Title:
- Automated patent classification for crop protection via domain adaptation. Issue 1 (27th February 2023)
- Main Title:
- Automated patent classification for crop protection via domain adaptation
- Authors:
- Christofidellis, Dimitrios
Lehmann, Marzena Maria
Luksch, Torsten
Stenta, Marco
Manica, Matteo
- Abstract:
- Patents show how technology evolves in most scientific fields over time. The best way to use this valuable knowledge base is to use efficient and effective information retrieval and searches for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, architectures based on Transformers, especially those derived from the BERT family, have already been proposed in the literature, and they have shown remarkable results by setting a new state‐of‐the‐art performance for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by rigorously evaluating and implementing a collection of recent transfer learning techniques, for example, domain‐adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state‐of‐the‐art models with increased precision, recall, and F1‐score. We base our evaluation on both standard patent classification datasets derived from patent‐office‐defined code hierarchies and more practical real‐world use‐case scenarios containing labels from the agrochemical industrial domain. The application of these domain‐adapted techniques to patent classification in a multilingual setting is also examined and evaluated.
- Is Part Of:
- Applied AI Letters. Volume 4:Issue 1(2023)
- Journal:
- Applied AI Letters
- Issue:
- Volume 4:Issue 1(2023)
- Issue Display:
- Volume 4, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 4
- Issue:
- 1
- Issue Sort Value:
- 2023-0004-0001-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2023-02-27
- Subjects:
- BERT -- domain‐adaption -- NLP -- patent analysis -- patent classification -- transformers
006.3
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/ail2.80
- Languages:
- English
- ISSNs:
- 2689-5595
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 26052.xml