Ax Cristian P\'erez-Corral, Jose I. Mestre, Alberto Fern\'andez-Hern\'andez, Manuel F. Dolz, Jos\'e Duato, Enrique S. Quintana-Ort\'i 1d ago

FedSQ: Optimized Weight Averaging via Fixed Gating

Federated learning optimization via fixed gating for weight averaging under statistical heterogeneity.

Ax Xinyu Wang, Hanwei Wu, Jingwei Song, Shuyuan Zhang, Jiayi Zhang, Fanqi Kong, Tung Sum Thomas Kwok, Xiao-Wen Chang, Yuyu Luo, Chenglin Wu, Bang Liu 1d ago

Co-Evolution of Policy and Internal Reward for Language Agents

Self-Guide framework for LLM agents using co-evolved internal rewards to address sparse reward problem in long-horizon tasks.

Ax Chenxu Yang, Chuanyu Qin, Qingyi Si, Minghui Chen, Naibin Gu, Dingyu Yao, Zheng Lin, Weiping Wang, Jiaqi Wang, Nan Duan 1d ago

Self-Distilled RLVR

On-policy self-distillation training paradigm for LLMs combining dense signals from larger teacher models.

Ax O\u{g}uzhan Ersoy, Nikolay Blagoev, Jona te Lintelo, Stefanos Koffas, Marina Kr\v{c}ek, Stjepan Picek 1d ago

Backdoor Attacks on Decentralised Post-Training

arXiv paper analyzing backdoor attacks on decentralized LLM post-training via pipeline parallelism, examining vulnerabilities from malicious participants.

Ax Guy Blanc 1d ago

Robust Learning with Optimal Error

Optimal error algorithms for adversarial learning using randomized hypotheses, improving deterministic hypothesis bounds by factor of 1/2.

Ax Mirali Purohit, Bimal Gajera, Irish Mehta, Bhanu Tokas, Jacob Adler, Steven Lu, Scott Dickenshied, Serina Diniega, Brian Bue, Umaa Rebbapragada, Hannah Kerner 1d ago

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

MOMO: Foundation model merging multi-sensor Mars remote sensing data (HiRISE, CTX, THEMIS) using Equal Validation Loss alignment.

Ax Justin Reverdi, Sixin Zhang, Fabrice Gamboa, Serge Gratton 1d ago

Lipschitz bounds for integral kernels

Theoretical characterization of Lipschitz constants for kernel feature maps in positive definite kernels.

Ax Inbal Rimon, Oren Gal, Haim Permuter 1d ago

Split and Conquer Partial Deepfake Speech

Split-and-conquer framework for detecting manipulated speech regions via boundary detection and segment-level classification.