Ax Sara Papi, Javier Garcia Gilabert, Zachary Hopton, Vil\'em Zouhar, Carlos Escolano, Gerard I. G\'allego, Jorge Iranzo-S\'anchez, Ahrii Kim, Dominik Mach\'a\v{c}ek, Patricia Schmidtova, Maike Z\"ufle 3/30/2026

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

Study comparing SpeechLLMs that directly process speech for translation against cascaded transcription pipelines, evaluating speech modality integration effectiveness.

Ax Laura Dietz, Bryan Li, Gabrielle Liu, Jia-Huei Ju, Eugene Yang, Dawn Lawrie, William Walden, James Mayfield 3/30/2026

Incorporating Q&A Nuggets into Retrieval-Augmented Generation

Crucible system augments RAG with Q&A nuggets from documents, preserving citation provenance and improving extraction, selection, and report generation.

Ax Xiangbo Gao, Renjie Li, Xinghao Chen, Yuheng Wu, Suofei Feng, Qing Yin, Zhengzhong Tu 3/30/2026

PISCO: Precise Video Instance Insertion with Sparse Control

Video generation model for precise instance insertion with sparse control in filmmaking applications, moving beyond prompt-engineering toward controllable generation.

Ax Yaolun Zhang, Ruohui Wang, Jiahao Wang, Yepeng Tang, Xuanyu Zheng, Haonan Duan, Hao Lu, Hanming Deng, Lewei Lu 3/30/2026

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

EVA: reinforcement learning framework for video agents using MLLMs with adaptive reasoning to handle long video sequences and temporal dependencies efficiently.

Ax Saswata Bose, Suvadeep Maiti, Shivam Kumar Sharma, Mythirayee S, Tapabrata Chakraborti, Srijitesh Rajendran, Raju S. Bapi 3/30/2026

AI Generalisation Gap In Comorbid Sleep Disorder Staging

Deep learning model for automated sleep staging shows poor generalization to clinical populations with comorbid sleep disorders; proposes iSLEEPS to address limitations.

Ax Tom Marty, Eric Elmoznino, Leo Gagnon, Tejas Kasetty, Mizu Nishikawa-Toomey, Sarthak Mittal, Guillaume Lajoie, Dhanya Sridhar 3/30/2026

A Compression Perspective on Simplicity Bias

Theoretical analysis of simplicity bias in neural networks using minimum description length principle and compression framework.