Ax Bernd Bohnet, Michael C. Mozer, Kevin Swersky, Wil Cunningham, Aaron Parisi, Kathleen Kenealy, Noah Fiedel 4/6/2026

Analysis of Optimality of Large Language Models on Planning Problems

Analyzes frontier LLMs on classic AI planning problems, examining whether models reason optimally or rely on heuristic strategies in Blocksworld domain.

Ax Qianshan Wei, Yishan Yang, Siyi Wang, Jinglin Chen, Binyu Wang, Jiaming Wang, Shuang Chen, Zechen Li, Yang Shi, Yuqi Tang, Weining Wang, Yi Yu, Chaoyou Fu, Qi Li, Yi-Fan Zhang 4/6/2026

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Benchmark evaluating multimodal LLM agents with tool integration capabilities including visual expansion and web search through agentic reasoning.

Ax Fabian Gloeckle, Ahmad Rammal, Charles Arnal, Remi Munos, Vivien Cabannes, Gabriel Synnaeve, Amaury Hayat 4/6/2026

Automatic Textbook Formalization

AI system automatically formalizes 500+ page graduate-level algebraic combinatorics textbook to Lean, achieving 130K lines of formal code.

Ax Mengzhou Wu, Yuzhe Guo, Yuan Cao, Haochuan Lu, Songhe Zhu, Pingzhe Qu, Xin Chen, Kang Qin, Zhongpu Wang, Xiaode Zhang, Xinyi Wang, Wei Dai, Gang Cao, Yuetang Deng, Zhi Gong, Dezhi Ran, Linyi Li, Wei Yang, Tao Xie 4/6/2026

UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics

Framework for scaling GUI agents using synthetic environmental dynamics and self-supervised learning from ground-truth interaction feedback.

Ax Dun Yuan, Fuyuan Lyu, Ye Yuan, Weixu Zhang, Bowei He, Jiayi Geng, Linfeng Du, Zipeng Sun, Yankai Chen, Changjiang Han, Jikun Kang, Alex Chen, Haolun Wu, Xue Liu 4/6/2026

Beyond Message Passing: Toward Semantically Aligned Agent Communication

Analysis of agent communication protocols for LLM systems organized into communication, syntactic, and semantic layers with systematic evaluation of 18 protocols.

Ax Zachary Bogorad, Ibrahim Elsharkawy, Yonatan Kahn, Andrew J. Larkoski, Noam Levi 4/6/2026

Generative models on phase space

Research on deep generative models (diffusion, flow matching) for high-dimensional distributions on constrained submanifolds in physics data.

Ax Timothy Gould, Sidike Paheding 4/6/2026

Self-Directed Task Identification

Self-Directed Task Identification framework enabling models to autonomously identify target variables in zero-shot learning without pre-training.

Ax Haodong Xie, Yujun Cai, Rahul Singh Maharjan, Yiwei Wang, Federico Tavella, Angelo Cangelosi 4/6/2026

Hierarchical, Interpretable, Label-Free Concept Bottleneck Model

Hierarchical Interpretable Label-Free Concept Bottleneck Model enabling interpretability at multiple abstraction levels unlike single-level existing CBMs.

Ax Darya Kaviani, Alp Eren Ozdarendeli, Jinhao Zhu, Yu Ding, Raluca Ada Popa 4/6/2026

Opal: Private Memory for Personal AI

System for private long-term memory in personal AI using trusted hardware and oblivious RAM to hide data access patterns from providers.