A Decade-Scale Benchmark Evaluating LLMs’ Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations
Andong Tan, Shuyun Dai, Jinglu Wang, Fengtao Zhou, Yan Lu, Xi (Ada) Wang, Ying-Che Chen, Can Yang, Shujie Liu, Hao Chen
March 2026