• Dapps:16.23K
  • Blockchains:78
  • Active users:66.47M
  • 30d volume:$303.26B
  • 30d transactions:$879.24M
Change in medicine: ChatGPT successfully passed the neurology exam

Change in medicine: ChatGPT successfully passed the neurology exam

user avatar

by Liza Tanasova

3 years ago


OpenAI's LLM 4.0 passed the Clinical Neurology exam, answering 85% of questions correctly from the American Board of Psychiatry and Neurology. The study authors believe that with some improvements, LLMs could find widespread use in clinical neurology.

The results of the experiment, conducted by a team of researchers from Heidelberg University Hospital and the German Cancer Research Center, were published on December 7. The test, which took place on May 31, involved two LLM models - ChatGPT 3.5 and the more recent ChatGPT 4.0. For the experiment, the researchers used a neuroscience question bank from the American Board of Psychiatry and Neurology, as well as a small group of questions from the European Board.

While the older version of ChatGPT scored 66.8% correct, answering 1,306 out of 1,956 questions correctly, the newer model, ChatGPT 4.0, scored 85% with 1,662 correct answers. It is worth noting that ChatGPT 4.0 outperformed regular users in questions related to behavior, cognition and psychology, and passed the neuroscience exam, as 70% correct answers are considered a passing grade in educational institutions. However, both models showed lower performance in tasks requiring “higher order thinking.”

The researchers who conducted the experiment believe that these results support the promise of using LLMs in clinical neurology, with some modifications. However, they also note that there are still some limitations. The implementation of LLMs in clinical neurology as a documentation and decision support system requires caution, as they are still not mature enough to solve cognitive problems.

0

Rewards

chest
chest
chest
chest

More rewards

Discover enhanced rewards on our social media.

chest

Other news

Ornith10: Tailored for Agentic Coding, Not General AI

chest

Ornith10 is specifically designed for agentic coding tasks, making it unsuitable for general-purpose AI applications.

user avatarKaterina Papadopoulou

DeepReinforce Unveils Ornith10: A Breakthrough in Open Source Coding Models

chest

DeepReinforce has launched Ornith10, a family of open-source coding models available in four sizes, optimized for agentic coding tasks.

user avatarMaya Lundqvist

New Report on Market and Onchain Data Released

chest

A report based on publicly available market and onchain data has been published. This report aims to provide insights into current market trends and dynamics.

user avatarLeo van der Veen

Cryip Emphasizes Commitment to Quality Reporting

chest

Cryip has published a report that emphasizes its strict editorial policy focusing on accuracy, relevance, and impartiality.

user avatarLi Weicheng

Beincrypto's Commitment to Editorial Integrity

chest

Beincrypto has released a report highlighting its strict editorial policy that focuses on accuracy, relevance, and impartiality.

user avatarAisha Farooq

Dailycoin Emphasizes Strict Editorial Policy

chest

Dailycoin emphasizes its strict editorial policy prioritizing accuracy and impartiality.

user avatarTenzin Dorje

Important disclaimer: The information presented on the Dapp.Expert portal is intended solely for informational purposes and does not constitute an investment recommendation or a guide to action in the field of cryptocurrencies. The Dapp.Expert team is not responsible for any potential losses or missed profits associated with the use of materials published on the site. Before making investment decisions in cryptocurrencies, we recommend consulting a qualified financial advisor.