Ax Jingzhi Fang, Xiong Gao, Renwei Zhang, Zichun Ye, Lei Chen, Jie Zhao, Chengnuo Huang, Hui Xu, Xuefeng Jin 3/26/2026

DVM: Real-Time Kernel Generation for Dynamic AI Models

DVM enables real-time kernel generation for dynamic AI models, addressing compilation overhead and memory footprint issues in runtime compilation.

Ax Cursor Reseach, :, Aaron Chan, Ahmed Shalaby, Alexander Wettig, Aman Sanger, Andrew Zhai, Anurag Ajay, Ashvin Nair, Charlie Snell, Chen Lu, Chen Shen, Emily Jia, Federico Cassano, Hanpeng Liu, Haoyu Chen, Henry Wildermuth, Jacob Jackson, Janet Li, Jediah Katz, Jiajun Yao, Joey Hejna, Josh Warner, Julius Vering, Kevin Frans, Lee Danilek, Less Wright, Lujing Cen, Luke Melas-Kyriazi, Michael Truell, Michiel de Jong, Naman Jain, Nate Schmidt, Nathan Wang, Niklas Muennighoff, Oleg Rybkin, Paul Loh, Phillip Kravtsov, Rishabh Yadav, Sahil Shah, Sam Kottler, Alexander M Rush, Shengtong Zhang, Shomil Jain, Sriram Sankar, Stefan Heule, Stuart H. Sul, Sualeh Asif, Victor Rong, Wanqi Zhu, William Lin, Yuchen Wu, Yuri Volkov, Yury Zemlyanskiy, Zack Holbrook, Zhiyuan Zhang 3/26/2026

Composer 2 Technical Report

Composer 2 model specialized for agentic software engineering with long-term planning and coding abilities trained via continued pretraining and reinforcement learning.

Ax Dmitrii Krylov, Armin Karamzade, Roy Fox 3/26/2026

Moonwalk: Inverse-Forward Differentiation

Inverse-forward differentiation method to reduce memory requirements for backpropagation by avoiding activation storage.

Ax Kefan Song, Amir Moeini, Peng Wang, Lei Gong, Rohan Chandra, Shangtong Zhang, Yanjun Qi 3/26/2026

Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Research paper demonstrating LLMs perform in-context reinforcement learning during inference. ICRL prompting framework enables inference-time self-improvement.

Ax Divyat Mahajan, Sachin Goyal, Badr Youbi Idrissi, Mohammad Pezeshki, Ioannis Mitliagkas, David Lopez-Paz, Kartik Ahuja 3/26/2026

Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

Proposes future summary pretraining for LLMs as alternative to next-token prediction, addressing limitations in long-horizon reasoning and planning tasks.