Ax Zihan Wang, Chi Gui, Xing Jin, Qineng Wang, Licheng Liu, Kangrui Wang, Shiqi Chen, Linjie Li, Zhengyuan Yang, Pingyue Zhang, Yiping Lu, Jiajun Wu, Li Fei-Fei, Lijuan Wang, Yejin Choi, Manling Li 21d ago

RAGEN-2: Reasoning Collapse in Agentic RL

RAGEN-2 identifies reasoning collapse in RL-trained multi-turn LLM agents: models fall back on input-agnostic templates even while entropy metrics remain stable.

Ax Marzi Heidari, Hanping Zhang, Hao Yan, Yuhong Guo 21d ago

Bi-Level Optimization for Single Domain Generalization

Bi-level optimization framework (BiSDG) for single domain generalization that decouples task learning from domain modeling using surrogate distributions.

Ax Mingchen Zhuge, Changsheng Zhao, Haozhe Liu, Zijian Zhou, Shuming Liu, Wenyi Wang, Ernie Chang, Gael Le Lan, Junjie Fei, Wenxuan Zhang, Yasheng Sun, Zhipeng Cai, Zechun Liu, Yunyang Xiong, Yining Yang, Yuandong Tian, Yangyang Shi, Vikas Chandra, Jürgen Schmidhuber 21d ago

Neural Computers

Proposes Neural Computers (NCs) that unify computation, memory, and I/O in learned runtime states, aiming toward fully neural computing systems that replace explicit programs.

Ax Maojiang Su, Po-Chung Hsieh, Weimin Wu, Mingcheng Lu, Jiunhau Chen, Jerry Yao-Chieh Hu, Han Liu 21d ago

Discrete Flow Matching Policy Optimization

DoMinO: a unified RL framework for fine-tuning discrete flow matching models that treats sampling as a multi-step MDP.

Ax Robert C. Williamson 21d ago

The Rhetoric of Machine Learning

Philosophical examination of machine learning through the lens of rhetoric, arguing that ML is inherently rhetorical rather than objective.

Ax Alessandro Pasqui, Jim Martin Catacora Ocana, Anshuman Sinha, Matthieu Perez, Fabrice Delbary, Giorgio Gosti, Mattia Miotto, Domenico Caudo, Maxence Ernoult, Hervé Turlier 21d ago

VertAX: a differentiable vertex model for learning epithelial tissue mechanics

JAX-based differentiable framework for vertex modeling of epithelial tissue mechanics, with automatic differentiation and GPU acceleration.