Publication Toward Consistent World Models with Multi-Token Prediction and Latent Semantic Enhancement Qimin Zhong, Hao Liao, Haiming Qin, Mingyang Zhou, Rui Mao, Wei Chen, Naipeng Chao ACL 2026 | April 2026
Publication Quotient-Space Diffusion Model Yixian Xu, Yusong Wang, Shengjie Luo, Kaiyuan Gao, Tianyu He, Di He, Chang Liu ICLR 2026 | April 2026
Publication A Decade-Scale Benchmark Evaluating LLMs’ Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations Andong Tan, Shuyun Dai, Jinglu Wang, Fengtao Zhou, Yan Lu, Xi (Ada) Wang, Ying-Che Chen, Can Yang, Shujie Liu, Hao Chen March 2026
Publication HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models Huizhi Liang, Yichao Shen, Yu Deng, Sicheng Xu, Zhiyuan Feng, Tong Zhang, Yaobo Liang, Jiaolong Yang CVPR 2026 | March 2026
Publication Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities Vanshaj Khattar, Md. Rafi Ur Rashid, Moumita Choudhury, Jing Liu, T. Koike-Akino, Ming Jin, Ye Wang March 2026
Publication Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Junjie Li, Xinru Guo, Yuhao Wu, Roy Ka-Wei Lee, Hongzhi Li, Yutao Xie March 2026
Publication Online Experiential Learning for Language Models Tianzhu Ye, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei arXiv: Computation and Language | March 2026, Vol. 2603(16856)
Publication Temperature as a Meta-Policy: Adaptive Temperature in LLM Reinforcement Learning Haoran Dang, Cuiling Lan, Hai Wan, Xibin Zhao, Yan Lu ICLR 2026 | February 2026
Publication Improving Long-Context Summarization with Multi-Granularity Retrieval Optimization Xueyu Chen, Kaitao Song, Zifan Song, Dongsheng Li, Cairong Zhao AAAI 2026 | January 2026
Publication DocReward: A Document Reward Model for Structuring and Stylizing Junpeng Liu, Yuzhong Zhao, Bowen Cao, Jiayu Ding, Yilin Jia, Tengchao Lv, Yupan Huang, Shaohan Huang, Nan Yang, Li Dong, Lei Cui, Tao Ge, Xun Wang, Huitian Jiao, Sun Mao, Fnu Kartik, Siqing Chen, Wai Lam, Furu Wei October 2025