Skip to main navigation Skip to search Skip to main content

Large language models surpass human experts in predicting neuroscience results

Xiaoliang Luo*, Akilles Rechardt, Guangzhi Sun, Kevin K. Nejad, Felipe Yáñez, Bati Yilmaz, Kangjoo Lee, Alexandra O. Cohen, Valentina Borghesani, Anton Pashkov, Daniele Marinazzo, Jonathan Nicholas, Alessandro Salatiello, Ilia Sucholutsky, Pasquale Minervini, Sepehr Razavi, Roberta Rocca, Elkhan Yusifov, Tereza Okalova, Nianlong GuMartin Ferianc, Mikail Khona, Kaustubh R. Patil, Pui Shee Lee, Rui Mata, Nicholas E. Myers, Jennifer K. Bizley, Sebastian Musslick, Isil Poyraz Bilgin, Guiomar Niso, Justin M. Ales, Michael Gaebler, N. Apurva Ratan Murty, Leyla Loued-Khenissi, Anna Behler, Chloe M. Hall, Jessica Dafflon, Sherry Dongqi Bao, Bradley C. Love*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. Here, to evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs indicated high confidence in their predictions, their responses were more likely to be correct, which presages a future where LLMs assist humans in making discoveries. Our approach is not neuroscience specific and is transferable to other knowledge-intensive endeavours.

Original languageEnglish
Pages (from-to)305-315
Number of pages11
JournalNature Human Behaviour
Volume9
Early online date27 Nov 2024
DOIs
Publication statusPublished - 1 Feb 2025

Keywords

  • Neuroscience
  • Scientific community

Fingerprint

Dive into the research topics of 'Large language models surpass human experts in predicting neuroscience results'. Together they form a unique fingerprint.

Cite this