Ax Miao Yu, Siyuan Fu, Moayad Aloqaily, Zhenhong Zhou, Safa Otoum, Xing fan, Kun Wang, Yufei Guo, Qingsong Wen 3/25/2026

SafeSeek: Universal Attribution of Safety Circuits in Language Models

Mechanistic interpretability framework identifying and attributing safety circuits in LLMs responsible for alignment, jailbreak, and backdoor behaviors.

Ax Peng-Yuan Wang, Ziniu Li, Tian Xu, Bohan Yang, Tian-Shuo Liu, ChenYang Wang, Xiong-Hui Chen, Yi-Chen Li, Tianyun Yang, Congliang Chen, Yang Yu 3/25/2026

Off-Policy Value-Based Reinforcement Learning for Large Language Models

Off-policy value-based reinforcement learning framework for LLMs enabling improved data utilization and sample efficiency for long-horizon tasks.

Ax Jenny Gao (College of Arts and Science, New York University, New York, NY), Yongfeng Zhang (Department of Computer Sciences, School of Arts & Sciences, Rutgers University, Piscataway, NJ), Mary L Disis (UW Medicine Cancer Vaccine Institute University of Washington, Seattle, WA), Lanjing Zhang (Department of Chemical Biology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ, Department of Pathology, Princeton Medical Center, Plainsboro, NJ, Rutgers Cancer Institute, New Brunswick, NJ) 3/25/2026

Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study

Quantitative assessment of reference retrieval errors from 5 LLM platforms on 2,000 medical literature references. Evaluates Grok-2, ChatGPT, Gemini, Perplexity, DeepSeek.

Ax Ricardo Olmedo, Bernhard Sch\"olkopf, Moritz Hardt 3/25/2026

Computational Arbitrage in AI Model Markets

Framework for computational arbitrage in AI model markets where arbitrageurs allocate inference budget across competing providers to undercut pricing.

Ax Tom Ulanovski (Tel Aviv University), Eyal Blyachman (Tel Aviv University), Maya Bechler-Speicher (Meta) 3/25/2026

Improving LLM Predictions via Inter-Layer Structural Encoders

Method leveraging intermediate layer representations in LLMs via Inter-Layer Structural Encoders to improve task-specific predictions beyond final-layer features.

Ax Elisabeth Griesbauer, Leiv R{\o}nneberg, Arnoldo Frigessi, Claudia Czado, Ingrid Hob{\ae}k Haff 3/25/2026

Stepwise Variational Inference with Vine Copulas

Novel stepwise variational inference method using vine copulas for estimating complex latent dependencies in probabilistic models.