Ax Sebasti\'an Andr\'es Cajas Ord\'o\~nez, Luis Fernando Torres Torres, Mackenzie J. Meni, Carlos Andr\'es Duran Paredes, Eric Arazo, Cristian Bosch, Ricardo Simon Carbajo, Yuan Lai, Leo Anthony Celi 3/26/2026

Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts

Proposes curiosity-driven quantized Mixture-of-Experts framework using Bayesian uncertainty for deploying neural networks on resource-constrained devices.

Ax Ziwei Liu, Borui Kang, Hangjie Yuan, Zixiang Zhao, Wei Li, Yifan Zhu, Tao Feng 3/26/2026

Continual GUI Agents

Introduces continual learning task for GUI agents that must adapt to shifting domains and resolutions over time, identifying failure modes in existing agent methods.

Ax Bjarni Haukur Bjarnason, Andr\'e Silva, Martin Monperrus 3/26/2026

On Randomness in Agentic Evals

Study of variance in agentic system evaluations using 60,000 trajectories on SWE-Bench-Verified, showing pass@1 estimates vary significantly across runs, questioning single-run reliability assumptions.

Ax Gregor Kornhardt, Jannis Chemseddine, Christian Wald, Gabriele Steidl 3/26/2026

Self-Aware Markov Models for Discrete Reasoning

Masked discrete diffusion model with self-aware Markov transition kernels enabling adaptive reasoning and error correction in discrete tasks.

Ax Rohan Shad, Cyril Zakka, Dhamanpreet Kaur, Mrudang Mathur, Robyn Fong, Joseph Cho, Ross Warren Filice, John Mongan, Kimberly Kalianos, Nishith Khandwala, David Eng, Matthew Leipzig, Walter R. Witschey, Alejandro de Feria, Victor A. Ferrari, Euan A. Ashley, Michael A. Acker, Curtis Langlotz, William Hiesinger 3/26/2026

A Generalizable Deep Learning System for Cardiac MRI

Self-supervised deep learning system for cardiac MRI analysis. Vision model trained via contrastive learning from visual concepts and text descriptions.

Ax Nina Corvelo Benz, Stratis Tsirtsis, Eleni Straitouri, Ivi Chatzi, Ander Artola Velasco, Suhas Thejaswi, Manuel Gomez-Rodriguez 3/26/2026

Evaluation of Large Language Models via Coupled Token Generation

Causal framework for evaluating LLMs controlling for randomization in token generation. Proposes coupled generation model for fair model comparison and ranking.

Ax Judy Hanwen Shen, Ellen Vitercik, Anders Wikum 3/26/2026

Algorithms with Calibrated Machine Learning Predictions

Framework integrating ML prediction uncertainty into online algorithm design. Uses calibration to leverage prediction-level confidence in algorithms with predictions.

Ax Leo Zhang, Peter Potaptchik, Jiajun He, Yuanqi Du, Arnaud Doucet, Francisco Vargas, Hai-Dang Dau, Saifuddin Syed 3/26/2026

Accelerated Parallel Tempering via Neural Transports

Neural transport methods to accelerate Parallel Tempering MCMC sampling. Improves sample efficiency on high-dimensional and multimodal distributions.

Ax Andreas Panayiotou, Panayiotis Charalambous, Ioannis Karamouzas 3/26/2026

Gen-C: Populating Virtual Worlds with Generative Crowds

Gen-C: Generative framework for simulating high-level crowd behaviors in virtual environments. Captures agent-agent and agent-environment interactions over time.