Benchmarking proprietary and open-source language and vision-language models for gastroenterology clinical reasoning

Safavi-Naini SAA, Ali S, Shahab O, Shahhoseini Z, Savage T, Rafiee S, Samaan JS, Al Shabeeb R, Ladak F, Yang JO, Echavarria J, Babar S, Shaukat A, Margolis S, Tatonetti NP, Nadkarni G, El Kurdi B, Soroush A. Benchmarking proprietary and open-source language and vision-language models for gastroenterology clinical reasoning. NPJ Digit Med. 2025 Nov 27. doi: 10.1038/s41746-025-02174-0. Epub ahead of print. PMID: 41310206.


Related Posts