Assessment of chemistry knowledge in large language models that generate code. (10th February 2023)
- Record Type:
- Journal Article
- Title:
- Assessment of chemistry knowledge in large language models that generate code. (10th February 2023)
- Main Title:
- Assessment of chemistry knowledge in large language models that generate code
- Authors:
- White, Andrew D.
Hocky, Glen M.
Gandhi, Heta A.
Ansari, Mehrad
Cox, Sam
Wellawatte, Geemi P.
Sasmal, Subarna
Yang, Ziyue
Liu, Kangxin
Singh, Yuvraj
Peña Ccoa, Willmor J. - Abstract:
- Abstract : In this work, we investigate the question: do code-generating large language models know chemistry? Our results indicate, mostly yes. Abstract : In this work, we investigate the question: do code-generating large language models know chemistry? Our results indicate, mostly yes. To evaluate this, we introduce an expandable framework for evaluating chemistry knowledge in these models, through prompting models to solve chemistry problems posed as coding tasks. To do so, we produce a benchmark set of problems, and evaluate these models based on correctness of code by automated testing and evaluation by experts. We find that recent LLMs are able to write correct code across a variety of topics in chemistry and their accuracy can be increased by 30 percentage points via prompt engineering strategies, like putting copyright notices at the top of files. Our dataset and evaluation tools are open source which can be contributed to or built upon by future researchers, and will serve as a community resource for evaluating the performance of new models as they emerge. We also describe some good practices for employing LLMs in chemistry. The general success of these models demonstrates that their impact on chemistry teaching and research is poised to be enormous.
- Is Part Of:
- Digital discovery. Volume 2:Number 2(2023)
- Journal:
- Digital discovery
- Issue:
- Volume 2:Number 2(2023)
- Issue Display:
- Volume 2, Issue 2 (2023)
- Year:
- 2023
- Volume:
- 2
- Issue:
- 2
- Issue Sort Value:
- 2023-0002-0002-0000
- Page Start:
- 368
- Page End:
- 376
- Publication Date:
- 2023-02-10
- Subjects:
- Chemistry -- Data processing -- Periodicals
Medical sciences -- Data processing -- Periodicals
Machine learning -- Periodicals
542.85 - Journal URLs:
- https://www.rsc.org/journals-books-databases/about-journals/digital-discovery/ ↗
http://www.rsc.org/ ↗ - DOI:
- 10.1039/d2dd00087c ↗
- Languages:
- English
- ISSNs:
- 2635-098X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26931.xml