Publication: Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search | Yifei Zhang, Xu Yang, Xiao Yang, Bowen Xian, Qizheng Li, Shikai Fang, Jingyuan Li, Jian Wang, Mingrui Xu, Weiqing Liu, Jiang Bian | March 2026
Publication: Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning | Jiebin Zhang, Zhenghan Yu, Liang Wang, Nan Yang, Eugene J. Yu, Zheng Li, Yifan Song, Dawei Zhu, Xingxing Zhang, Furu Wei, Sujian Li | March 2026
Publication: Online Experiential Learning for Language Models | Tianzhu Ye, Li Dong, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei | arXiv: Computation and Language | March 2026, Vol 2603(16856)
Publication: Reasoning-Driven Multimodal LLM for Domain Generalization | Zhipeng Xu, Zilong Wang, Xinyang Jiang, Dongsheng Li, De Cheng, Nannan Wang | February 2026
Publication: TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation | Steven Liu, Jane Luo, Xin Zhang, Aofan Liu, Hao Liu, J. Wu, Ziyang Huang, Yangyu Huang, Yu Kang, Scarlett Li | February 2026
Publication: Beyond Correctness: Learning Robust Reasoning via Transfer | Hyunseok Lee, Soheil Abbasloo, Jihoon Tack, Jinwoo Shin | February 2026
Publication: Reducing the Costs of Proof Synthesis on Rust Systems by Scaling Up a Seed Training Set | Nongyu Di, Tianyu Chen, Shan Lu, Shuai Lu, Yeyun Gong, Peng Cheng, Jay Lorch, Yuan Yao, Xiaoxing Ma | February 2026
Publication: Routing Channel-Patch Dependencies in Time Series Forecasting with Graph Spectral Decomposition | Dongyuan Li, Shun Zheng, Chang Xu, Jiang Bian, Renhe Jiang | ICLR 2026 | February 2026
Publication: Improving Long-Context Summarization with Multi-Granularity Retrieval Optimization | Xueyu Chen, Kaitao Song, Zifan Song, Dongsheng Li, Cairong Zhao | AAAI 2026 | January 2026