Isolater - Feed

Ax Zhangyong Liang, Huanhuan Gao 5/14/2026

Stochastic Dimension-Free Zeroth-Order Estimator for High-Dimensional and High-Order PINNs

Research on Physics-Informed Neural Networks reducing spatial derivative complexity from O(d^k) to O(1) using stochastic dimension-free estimators.

Ax Alberto G. Rodriguez Salgado 5/14/2026

From Pixels to BFS: High Maze Accuracy Does Not Imply Visual Planning

MazeBench evaluates whether multimodal models solve visual mazes through genuine planning or token-space search, finding high accuracy scores misleading.

Ax Dmitrii Seletkov, Paul Hager, Georgios Kaissis, Rickmer Braren, Daniel Rueckert, Raphael Rehms 5/14/2026

Survival In-Context: Amortized Bayesian Survival Analysis via Prior-Fitted Networks

Research on survival analysis for medical applications using prior-fitted Bayesian networks on tabular data with censoring.

Ax Fangxin Wang, Peyman Baghershahi, Langzhou He, Henry Peng Zou, Sourav Medya, Philip S. Yu 5/14/2026

Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning

Online gradient-based data selection and reweighting framework for LLM fine-tuning with optimizer-aware design.

Ax Benjamin Amoh, Geoffrey Parker, Wesley Marrero 5/14/2026

Multi-Agent Decision-Focused Learning via Value-Aware Sequential Communication

SeqComm-DFL: Multi-agent learning framework with value-aware sequential communication optimized for task performance.

Ax Zichao Yan, Yan Wu, Mica Xu Ji, Chaitra Agrahar, Esther Wershof, Marcel Nassar, Mehrshad Sadria, Ridvan Eksi, Vladimir Trifonov, Ignacio Ibarra, Telmo Felgueira, Błażej Osiński, Rory Stark 5/14/2026

PRiMeFlow: Capturing Complex Expression Heterogeneity in Perturbation Response Modelling

PRiMeFlow: Flow matching approach for modeling genetic and drug perturbation effects on single-cell gene expression.

Ax Jennifer Wendland, Nicolas Freitag, Maik Kschischo 5/14/2026

Observable Neural ODEs for Identifiable Causal Forecasting in Continuous Time

Observable neural ODEs framework for continuous-time causal forecasting with identifiability via control-theoretic observability.

Ax Xinyou Wang, Liang Hong, Jiasheng Ye, Zaixiang Zheng, Yu Li, Shujian Huang, Quanquan Gu 5/14/2026

Towards A Generative Protein Evolution Machine with DPLM-Evo

DPLM-Evo: Discrete diffusion protein language model capturing evolutionary constraints for protein design and generation.

Ax Mohammad Al Ridhawi, Mahtab Haj Ali, Hussein Al Osman 5/14/2026

Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback

Behavioral evaluation framework for agentic LLM-based stock prediction systems using closed-loop RL feedback and multi-dimensional metrics.

Ax Nan Jia, Haojin Yang, Xing Ma, Jiesong Lian, Shuailiang Zhang, Weipeng Zhang, Ke Zeng, Xunliang Cai, Zequn Sun 5/14/2026

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level

Asymmetric on-policy distillation: Token-level approach addressing variance, vanishing gradients, and exploration issues in LLM training.

Ax Leonel Aguilar, Jan Nagler, Christoph Hoelscher, Nino Antulov-Fantulin 5/14/2026

Does Your Neural Network Extrapolate? Feature Engineering as Identifiability Bias for OOD Generalization

Study of when neural networks fail at OOD generalization, decoupling feature learning from data-generating-process identifiability.

Ax Joshua Shay Kricheli, Alexander Lawrence Reid, Soumajyoti Sarkar, Venkata Gandikota, Paulo Shakarian 5/14/2026

Tokens-per-Parameter Coverage Is Critical for Robust LLM Scaling Law Extrapolation

Analysis of LLM scaling laws showing tokens-per-parameter ratio affects extrapolation robustness; demonstrates collinearity causes ill-conditioning.

Ax Karim Othman, Jonas Petersen, Matei Ignuta-Ciuncanu, Camilla Mazzoleni, Federico Martelli, Alessandro Lombardi, Riccardo Maggioni, Philipp Petersen 5/14/2026

FactoryNet: A Large-Scale Dataset toward Industrial Time-Series Foundation Models

FactoryNet: First large-scale industrial time-series pretraining corpus with 51M datapoints for foundation models with cross-embodiment transfer.

Ax Musa Cim, Poovaiah Palangappa, Miro Hodak, Ravi Dwivedula, Meena Arunachalam, Mahmut Taylan Kandemir 5/14/2026

Pretraining large language models with MXFP4 on Native FP4 Hardware

Study of MXFP4 quantization for full-pipeline FP4 training of LLMs, analyzing divergence in forward/backward passes on native FP4 hardware.

Ax Daniel Goldstein, Eugene Cheah 5/14/2026

Key-Value Means: Transformers with Expandable Block-Recurrent Compressed Memory

Key-Value Means: A block-recurrent attention mechanism enabling O(N) transformers with fixed or expandable memory for long-context tasks.

Ax Debashis Guha 5/14/2026

Consolidation-Expansion Operator Mechanics: A Unified Framework for Adaptive Learning

Consolidation-Expansion Operator Mechanics: A framework for understanding adaptive learning systems through order-gap formalism.

Ax Huilin Zhou, Jian Zhao, Yilu Zhong, Zhen Liang, Xiuyuan Chen, Yuchen Yuan, Tianle Zhang, Chi Zhang, Lan Zhang, Xuelong Li 5/14/2026

Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization

Metis: A framework using policy optimization to automate jailbreaking of LLMs via red teaming, treating attacks as POMDP inference.

Ax Marcin Kostrzewa, Sebastian Tomczak, Roman Furman, Anna Poberezhna, Michał Furgała, Julia Farganus, Oleksii Furman, Maciej Zięba 5/14/2026

V4FinBench: Benchmarking Tabular Foundation Models, LLMs, and Standard Methods on Corporate Bankruptcy Prediction

V4FinBench: A benchmark with 1M+ company-year records for evaluating tabular foundation models and LLMs on corporate bankruptcy prediction.

Ax Yaxin Du, Xiyuan Yang, Zhifan Zhou, Wanxu Liu, Zixing Lei, Zimeng Chen, Fenyi Liu, Haotian Wu, Yuzhu Cai, Zexi Liu, Xinyu Zhu, WenHao Wang, Linfeng Zhang, Chen Qian, Siheng Chen 5/14/2026

DataMaster: Data-Centric Autonomous AI Research

Autonomous AI system for data-centric machine learning that searches, adapts, and validates datasets to improve model performance.

Ax Jiaming Li, Chenyu Zhu, Nanxi Yi, Youjun Bao, Li Sun, Quanying Lv, Xiang Fang, Daizong Liu, Jianjun Li, Kun He, Bowen Zhou, Zhiyuan Ma 5/14/2026

TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment

Trajectory matching policy optimization for diffusion model alignment that prevents reward hacking through probability distribution constraints.

Ax Jonas Petersen, Gian-Alessandro Lombardi, Riccardo Maggioni, Camilla Mazzoleni, Federico Martelli, Philipp Petersen 5/14/2026

HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series

Self-supervised architecture for rare event prediction in multivariate time series using causal transformers and joint-embedding predictive learning.

Ax Abhishek Moturu, Anna Goldenberg, Babak Taati 5/14/2026

LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection

Lightweight method for selecting targeted synthetic training data by scoring samples on boundary proximity, uncertainty, and real-data density.

Ax Yizhu Jiao, Ruixiang Zhang, Richard Bai, Jiawei Han, Ronan Collobert, Yizhe Zhang 5/14/2026

Primal Generation, Dual Judgment: Self-Training from Test-Time Scaling

Uses the comparative information produced by test-time scaling as a self-training signal in a dual judgment space to improve code generation.

Ax Jeongsol Kim, Hongeun Kim, Jian Wang, Jong Chul Ye 5/14/2026

Gradient-Free Noise Optimization for Reward Alignment in Generative Models

Proposes ZeNO, a gradient-free optimization method for reward alignment in diffusion and flow models using noise-space optimization.

Ax Han Xiao 5/14/2026

Test-Time Compute for Dense Retrieval: Agentic Program Generation with Frozen Embedding Models

Uses agentic program search to apply test-time compute to frozen embedding models, optimizing inference programs for dense retrieval.

Ax DatologyAI: Siddharth Joshi, Haoli Yin, Rishabh Adiga, Haakon Mongstad, Alvin Deng, Aldo Carranza, Alex Fang, Amro Abbas, Anshuman Suri, Brett Larsen, Daniel Zayas, Darren Teh, David Schwab, Diego Kiner, Fan Pan, Jack Urbanek, Jason Lee, Jason Telanoff, Josh Wills, Kaleigh Mentzer, Luke Merrick, Maximilian Böther, Parth Doshi, Paul Burstein, Pratyush Maini, Ties Robroek, Tony Jiang, Vidhi Jain, Vineeth Dorna, Zhengping Wang, Bogdan Gaza, Ari Morcos, Matthew Leavitt 5/14/2026

20/20 Vision Language Models: A Prescription for Better VLMs through Data Curation Alone

Demonstrates data curation alone improves vision-language model performance by 11.7pp without architecture or compute changes.

Ax Yan Jiang, Ruihong Qiu, Zi Huang 5/14/2026

Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models

Investigates block size effects in reinforcement learning for diffusion large language models with semi-autoregressive generation.

Ax Sayantani Ghosh, Amit Kumar Das, Amlan Chakrabarti 5/14/2026

A New Technique for AI Explainability using Feature Association Map

Proposes FAMeX algorithm for AI explainability using graph-theoretic feature association maps.

Ax Jeffrey T. H. Wong, Cheng Zhang, Xinye Cao, Pedro Gimenes, Christos-Savvas Bouganis, George A. Constantinides, Wayne Luk, Yiren Zhao 5/14/2026

A3: an Analytical Low-Rank Approximation Framework for Attention

Low-rank approximation framework for compressing transformer attention layers by analyzing architectural characteristics rather than individual layer outputs.

Ax Liran Ringel, Elad Tolochinsky, Yaniv Romano 5/14/2026

Learning a Continue-Thinking Token for Enhanced Test-Time Scaling

Explores learned continue-thinking tokens to extend reasoning steps and improve LLM performance through test-time compute scaling.

Ax Omer Luxembourg, Haim Permuter, Eliya Nachmani 5/14/2026

Plan for Speed: Dilated Scheduling for Masked Diffusion Language Models

Proposes the Dilated Unmasking Scheduler, which speeds up non-autoregressive text generation in masked diffusion language models by scheduling token unmasking to avoid sequential behavior.

Ax Millicent Li, Alberto Mario Ceballos Arroyo, Giordano Rogers, Naomi Saphra, Byron C. Wallace 5/14/2026

Do Activation Verbalization Methods Convey Privileged Information?

Investigates whether activation verbalization methods (using LLMs to describe internal representations) reveal privileged information or just input information.

Ax Samuel Willis, Paul Duckworth, Jack Simons, Aleksandra Kalisz, Krisztina Sinkovics, Noam Ghenassia, Shikha Surana, Henry T. Oldroyd, Alexandru I. Stere, Dragos D Margineantu, Carl Henrik Ek, Henry Moss, Erik Bodin 5/14/2026

Sample-Efficient Optimisation over the Outputs of Generative Models

O3 method for sample-efficient optimization within generative model outputs (diffusion/flow models) for task-specific criteria.

Ax Kathy Garcia, Leyla Isik 5/14/2026

Behavioral Geometric Supervision Aligns Video Foundation Models with Human Social Perception

Aligns video foundation models with human social perception using geometric supervision of behavioral features.

Ax Hamza Cherkaoui, Hélène Halconruy, Yohan Petetin 5/14/2026

When to Transfer: Adaptive Source Selection for Positive Transfer in Linear Models

Greedy algorithm for selecting which source tasks and sample sizes to transfer in multi-source transfer learning for linear models.

Ax Yuu Jinnai 5/14/2026

Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition

Evaluates Minimum Bayes Risk decoding versus beam search for automatic speech recognition tasks.

Ax Jiaming Qu, Mengtian Guo, Yue Wang 5/14/2026

Why is "Chicago" Predictive of Deceptive Reviews? Using LLMs to Discover Language Phenomena from Lexical Cues

Uses LLMs to explain what linguistic features (e.g., city names) predict deceptive reviews, leveraging language models for feature interpretation.

Ax Mario Markov (INSAIT, Sofia University "St. Kliment Ohridski"), Stefan Maria Ailuro (INSAIT, Sofia University "St. Kliment Ohridski"), Luc Van Gool (INSAIT, Sofia University "St. Kliment Ohridski"), Konrad Schindler (ETH Zurich), Danda Pani Paudel (INSAIT, Sofia University "St. Kliment Ohridski") 5/14/2026