Publication | Probabilistic Inference and Learning with Stein’s Method | Qiang Liu, Lester Mackey, C. Oates | March 2026
Publication | Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models | Zongqian Li, Shaohan Huang, Zewen Chi, Yixuan Su, Lexin Zhou, Li Dong, Nigel Collier, Furu Wei | March 2026
Publication | LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis | Tao Zhang, Rui Ma, Shuotao Xu, Yongqiang Xiong, Peng Cheng | March 2026
Publication | Lost in Stories: Consistency Bugs in Long Story Generation by LLMs | Junjie Li, Xinru Guo, Yuhao Wu, Roy Ka-Wei Lee, Hongzhi Li, Yutao Xie | March 2026
Publication | Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity | Di Zhang, Xun Wu, Shaohan Huang, Yudong Wang, Hanyong Shao, Yingbo Hao, Zewen Chi, Li Dong, Ting Song, Yan Xia, Zhifang Sui, Furu Wei | March 2026
Publication | SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity | Hanyong Shao, Yingbo Hao, Ting Song, Yan Xia, Di Zhang, Shaohan Huang, Xun Wu, Songcheng Xu, Le Xu, Li Dong, Zewen Chi, Yinxue Zou, Furu Wei | March 2026
Publication | Latent Policy Steering through One-Step Flow Policies | Hokyun Im, Andrey Kolobov, Jianlong Fu, Youngwoon Lee | March 2026
Publication | Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces | Karan Gupta, Pranav Vajreshwari, Yash Pandya, Raghav Magazine, Akshay Nambi, Ahmed Awadallah | ICLR Agents in the Wild | March 2026
Publication | Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts | Sanae Lotfi, Lucas Caccia, Alessandro Sordoni, Jordan Ash, Miroslav Dudík | March 2026
Publication | Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use | Aradhye Agarwal, Gurdit Siyan, Yash Pandya, Joykirat Singh, Akshay Nambi, Ahmed Awadallah | ICLR Agents in the Wild: Safety, Security, and Beyond | March 2026