Ax Haozhen Zhang, Haodong Yue, Tao Feng, Quanyu Long, Jianzhu Bao, Bowen Jin, Weizhi Zhang, Xiao Li, Jiaxuan You, Chengwei Qin, Wenya Wang 15h ago

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

Query-aware runtime memory routing system for LLM agents operating across extended context windows with cost-performance tradeoffs.

Ax Hui Lu, Zheng Chai, Shipeng Bai, Hao Zhang, Zhifang Fan, Kunmin Bai, Ke Sun, Yingwen Wu, Bingzheng Wei, Xiang Sun, Ziyan Gong, Tianyi Liu, Hua Chen, Deping Xie, Zhongkai Chen, Zhiliang Guo, Qiwei Chen, Yuchao Zheng 15h ago

Compute Only Once: UG-Separation for Efficient Large Recommendation Models

UG-Separation technique reducing compute costs in large recommendation models by decoupling user and group feature interactions.

Ax Yannis Montreuil, Le\"ina Montreuil, Axel Carlier, Lai Xing Ng, Wei Tsang Ooi 15h ago

Learning-to-Defer with Expert-Conditional Advice

Framework for routing inputs to experts with dynamic information selection, applicable to LLM systems with retrieved documents and tool outputs.

Ax Lin Song, Wenbo Li, Guoqing Ma, Wei Tang, Bo Wang, Yuan Zhang, Yijun Yang, Yicheng Xiao, Jianhui Liu, Yanbing Zhang, Guohui Zhang, Wenhu Zhang, Hang Xu, Nan Jiang, Xin Han, Haoze Sun, Maoquan Zhang, Haoyang Huang, Nan Duan 15h ago

JoyAI-Image: Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

JoyAI-Image unified multimodal foundation model combines spatially-enhanced MLLM with diffusion transformer for visual understanding and image generation.

Ax Dang Hoang Duy, Yannis Montreuil, Maxime Meyer, Axel Carlier, Lai Xing Ng, Wei Tsang Ooi 15h ago

Online Learning-to-Defer with Varying Experts

First online learning-to-defer algorithm for multiclass classification with dynamically varying expert pools and streaming data.

Ax Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau, Marylou Gabri\'e 15h ago

The critical slowing down in diffusion models

Theoretical analysis of computational sampling behavior in diffusion models using statistical mechanics, providing insights into convergence properties.

Ax Ryan Wei Heng Quek, Sanghyuk Lee, Alfred Wei Lun Leong, Arun Verma, Alok Prakash, Nancy F. Chen, Bryan Kian Hsiang Low, Daniela Rus, Armando Solar-Lezama 15h ago

MeMo: Memory as a Model

MeMo framework encodes new knowledge into dedicated memory models while keeping LLMs frozen, enabling efficient incorporation of domain-specific information without retraining.

Ax Caoliwen Wang, Minghao Guo, Siyuan Chen, Heng Zhang, Mengdi Wang, Xingyu Ni, Hanson Sun, Kunyi Wang, Zherong Pan, Kui Wu, Lingjie Liu, Yin Yang, Chenfanfu Jiang, Taku Komura, Wojciech Matusik, Peter Yichen Chen 15h ago

WorldParticle: Unified World Simulation of Lagrangian Particle Dynamics via Transformer

Transformer-based physics simulator modeling diverse phenomena (cloth, fluids, solids) using unified Lagrangian particle representation and prediction-correction design.