Ax Kun Zhang, Jiaqi Sun, Yiqing Li, Ignavier Ng, Namrata Deka, Shaoan Xie 4/6/2026

SEDGE: Structural Extrapolated Data Generation

Framework for generating synthetic data under specified conditions with approximate identifiability guarantees for data distribution extrapolation.

Ax Yasushi Nishida 4/6/2026

AXELRAM: Quantize Once, Never Dequantize

AXELRAM smart SRAM architecture computing attention scores from quantized KV cache without dequantization using orthogonal-transform quantization.

Ax Cristian P\'erez-Corral, Jose I. Mestre, Alberto Fern\'andez-Hern\'andez, Manuel F. Dolz, Jos\'e Duato, Enrique S. Quintana-Ort\'i 4/6/2026

FedSQ: Optimized Weight Averaging via Fixed Gating

Federated learning optimization via fixed gating for weight averaging under statistical heterogeneity.

Ax Xinyu Wang, Hanwei Wu, Jingwei Song, Shuyuan Zhang, Jiayi Zhang, Fanqi Kong, Tung Sum Thomas Kwok, Xiao-Wen Chang, Yuyu Luo, Chenglin Wu, Bang Liu 4/6/2026

Co-Evolution of Policy and Internal Reward for Language Agents

Self-Guide framework for LLM agents using co-evolved internal rewards to address sparse reward problem in long-horizon tasks.

Ax Chenxu Yang, Chuanyu Qin, Qingyi Si, Minghui Chen, Naibin Gu, Dingyu Yao, Zheng Lin, Weiping Wang, Jiaqi Wang, Nan Duan 4/6/2026

Self-Distilled RLVR

On-policy self-distillation training paradigm for LLMs combining dense signals from larger teacher models.