Ax Haonian Ji, Kaiwen Xiong, Siwei Han, Peng Xia, Shi Qiu, Yiyang Zhou, Jiaqi Liu, Jinlong Li, Bingzhou Li, Zeyu Zheng, Cihang Xie, Huaxiu Yao 4/7/2026

ClawArena: Benchmarking AI Agents in Evolving Information Environments

ClawArena benchmark for evaluating AI agents in dynamic environments with evolving information, contradictions, and implicit user feedback.

Ax Mark Braverman, Roi Livni, Yishay Mansour, Shay Moran, Kobbi Nissim 4/7/2026

Learning from Equivalence Queries, Revisited

Revisits learning from equivalence queries model for modern ML systems like generative models and recommendation systems with periodic updates.

Ax Qing Zhou, Bingxuan Zhao, Tao Yang, Hongyuan Zhang, Junyu Gao, Qi Wang 4/7/2026

Batch Loss Score for Dynamic Data Pruning

Batch Loss Score metric for dynamic data pruning using exponential moving averages, accelerating deep learning training.

Ax Asena Karolin \"Ozdemir, Lars H. Heyen, Arvid Weyrauch, Achim Streit, Markus G\"otz, Charlotte Debus 4/7/2026

Sampling Parallelism for Fast and Efficient Bayesian Learning

Sampling parallelism approach for efficient Bayesian neural networks and uncertainty quantification in risk-sensitive applications.