Ax Ahsan Bilal, Muhammad Ahmed Mohsin, Muhammad Umer, Asad Aali, Muhammad Usman Khanzada, Muhammad Usman Rafique, Zihao He, Emily Fox, Dean F. Hougen 21d ago

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

S³: stratified scaling search for test-time inference in diffusion language models using classical verifiers to improve generation without additional training.

Ax Apimuk Sornsaeng, Si Min Chan, Wenxuan Zhang, Swee Liang Wong, Joshua Lim, Dario Poletti 21d ago

SMT-AD: a scalable quantum-inspired anomaly detection approach

Quantum-inspired tensor network anomaly detection (SMT-AD) using superposition of bond-dimension-1 matrix product operators with Fourier feature embeddings.

Ax Zihan Wang, Chi Gui, Xing Jin, Qineng Wang, Licheng Liu, Kangrui Wang, Shiqi Chen, Linjie Li, Zhengyuan Yang, Pingyue Zhang, Yiping Lu, Jiajun Wu, Li Fei-Fei, Lijuan Wang, Yejin Choi, Manling Li 21d ago

RAGEN-2: Reasoning Collapse in Agentic RL

RAGEN-2 identifies reasoning collapse in RL-trained multi-turn LLM agents where models use input-agnostic templates despite stable entropy metrics.

Ax Marzi Heidari, Hanping Zhang, Hao Yan, Yuhong Guo 21d ago

Bi-Level Optimization for Single Domain Generalization

Bi-level optimization framework (BiSDG) for single domain generalization that decouples task learning from domain modeling using surrogate distributions.

Ax Mingchen Zhuge, Changsheng Zhao, Haozhe Liu, Zijian Zhou, Shuming Liu, Wenyi Wang, Ernie Chang, Gael Le Lan, Junjie Fei, Wenxuan Zhang, Yasheng Sun, Zhipeng Cai, Zechun Liu, Yunyang Xiong, Yining Yang, Yuandong Tian, Yangyang Shi, Vikas Chandra, J\"urgen Schmidhuber 21d ago

Neural Computers

Proposes Neural Computers (NCs) that unify computation, memory, and I/O in learned runtime states, aiming toward fully neural computing systems that replace explicit programs.

Ax Maojiang Su, Po-Chung Hsieh, Weimin Wu, Mingcheng Lu, Jiunhau Chen, Jerry Yao-Chieh Hu, Han Liu 21d ago

Discrete Flow Matching Policy Optimization

DoMinO: unified RL framework for fine-tuning discrete flow matching models viewing sampling as multi-step MDP.