Isolater - Feed

Ax Haotian Xu, Yuning You, Tengfei Ma 5/4/2026

When Structure Doesn't Help: LLMs Do Not Read Text-Attributed Graphs as Effectively as We Expected

Study showing LLMs struggle to effectively reason over text-attributed graphs despite their capabilities in natural language and cross-modal understanding.

Ax Tianwei Ni, Esther Derman, Vineet Jain, Vincent Taboga, Siamak Ravanbakhsh, Pierre-Luc Bacon 5/4/2026

Long-Horizon Model-Based Offline Reinforcement Learning Without Explicit Conservatism

Bayesian approach to offline reinforcement learning using posterior over world models without explicit conservatism constraints.

Ax Haotian Xu, Jiannan Yang, Tian Gao, Tsui-Wei Weng, Tengfei Ma 5/4/2026

Resting Neurons, Active Insights: Robustify Activation Sparsity for Large Language Models

Method to improve activation sparsity in LLMs by addressing representational instability caused by suppressing hidden activations.

Ax Nima Dehmamy, Benjamin Hoover, Bishwajit Saha, Leo Kozachkov, Jean-Jacques Slotine, Dmitry Krotov 5/4/2026

NRGPT: An Energy-based Alternative for GPT

NRGPT reframes GPT inference as energy-based model exploration, proposing minimal architectural modifications to unify with EBM framework.

Ax Markus Mueller, Kathrin Gruber, Dennis Fok 5/4/2026

Cascaded Flow Matching for Heterogeneous Tabular Data with Mixed-Type Features

Cascaded Flow Matching approach for generating tabular data with mixed discrete and continuous features using diffusion models.

Ax Yuanteng Chen, Peisong Wang, Nanxin Zeng, Yuantian Shao, Shuang Qiu, Gang Li, Jing Liu, Jian Cheng 5/4/2026

Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE

Expert-Sample method leverages fine-grained MoE routing patterns for test-time scaling in LLMs without temperature tuning.

Ax Hicham Eddoubi, Umar Faruk Abdullahi, Fadi Hassan 5/4/2026

Beyond Suffixes: Token Position in GCG Adversarial Attacks on Large Language Models

Analysis of GCG adversarial attacks on LLMs examining token position beyond suffix-based approaches for jailbreak robustness evaluation.

Ax Dongyeop Woo, Marta Skreta, Seonghyun Park, Kirill Neklyudov, Sungsoo Ahn 5/4/2026

Riemannian MeanFlow

Riemannian MeanFlow framework that reduces the number of neural network evaluations diffusion models need for generative modeling on Riemannian manifolds.

Ax Amith Bhat, Haipeng Luo, Aadirupa Saha 5/4/2026

One Good Source is All You Need: Near-Optimal Regret for Bandits under Heterogeneous Noise

SOAR algorithm for multi-armed bandits with multiple data sources of heterogeneous noise, using adaptive source selection.

Ax Xiuying Wei, Caglar Gulcehre 5/4/2026

RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference

RAT+ introduces recurrence-augmented attention enabling sparse dilated inference while maintaining accuracy, reducing FLOPs and KV cache with flexible configuration.

Ax Guanzhe Zhang, Shanshan Ding, Zhezhen Jin 5/4/2026

A Comparative Study of UMAP and Other Dimensionality Reduction Methods

Comparative analysis of UMAP against PCA, Kernel PCA, SIR, and t-SNE for dimensionality reduction.

Ax Edward Izgorodin 5/4/2026

Semantic Level of Detail for Knowledge Graphs: Discovering Abstraction Boundaries via Spectral Heat Diffusion

Method using spectral heat diffusion to discover continuous abstraction levels in knowledge graphs and GraphRAG systems.

Ax Gregory N. Frank 5/4/2026

Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails

Analysis of alignment evaluation gaps showing that concept detection differs from routing behavior, using a case study of Chinese language models.

Ax Xueqiao Peng, Andrew Perrault 5/4/2026

Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning

Hierarchical RL approach for allocating limited public health resources across asynchronous disease outbreak clusters.

Ax Andoni Irazusta Garmendia 5/4/2026

A First Guess is Rarely the Final Answer: Learning to Search in the Traveling Salesperson Problem

Learned neural improvement policy for TSP that iteratively applies local modifications conditioned on candidate solutions.

Ax Tao Li, Kaiyuan Hou, Tuan Vinh, Monika Raj, Zhichun Guo, Carl Yang 5/4/2026

Reinforcement Learning with LLM-Guided Action Spaces for Synthesizable Lead Optimization

LLM-guided RL approach for drug lead optimization that ensures synthesizable molecular modifications through action space constraints.

Ax Yangyi Fang, Jiaye Lin, Xiaoliang Fu, Cong Qin, Haolin Shi 5/4/2026

Placing Puzzle Pieces Where They Matter: A Question Augmentation Framework for Reinforcement Learning

Question augmentation framework using partial solutions as hints to optimize RL training efficiency for LLM reasoning.

Ax S. Gratton, Ph. L. Toint 5/4/2026

A unified convergence theory for adaptive first-order methods in the nonconvex case, including AdaNorm, full and diagonal AdaGrad, Shampoo and Muon

Unified convergence theory for adaptive first-order optimization methods including AdaGrad, AdaNorm, Shampoo, and Muon.

Ax Wei Chen, Yubing Wu, Junmei Yang, Delu Zeng, Qibin Zhao, John Paisley, Min Chen, Zhou Wang 5/4/2026

Towards Disentangled Preference Optimization Dynamics: Suppress the Loser, Preserve the Winner

Unified framework for preference optimization revealing common update dynamics and preventing degradation of chosen responses.

Ax Stephan Xie, Ben Cohen, Mononito Goswami, Junhong Shen, Emaad Khwaja, Chenghao Liu, David Asker, Othmane Abou-Amal, Ameet Talwalkar 5/4/2026

ARFBench: Benchmarking Time Series Question Answering Ability for Software Incident Response

ARFBench benchmark evaluating multimodal foundation models on time series anomaly understanding for software incident response.

Ax Emil Ryd, Henning Bartsch, Julian Stastny, Joe Benton, Vivek Hebbar 5/4/2026

Removing Sandbagging in LLMs by Training with Weak Supervision

Training method using weak supervision to prevent LLM sandbagging and elicit best performance without full output verification.

Ax Dharshan Kumaran, Viorica Patraucean, Simon Osindero, Petar Veličković, Nathaniel Daw 5/4/2026

How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals

Investigation of internal confidence signals and error detection mechanisms in LLMs using decision neuroscience frameworks.

Ax Charles Xu, Jost Tobias Springenberg, Michael Equi, Ali Amin, Adnan Esmail, Sergey Levine, Liyiming Ke 5/4/2026

RL Token: Bootstrapping Online RL with Vision-Language-Action Models

Lightweight online RL fine-tuning method for vision-language-action models using RL tokens for robot manipulation.

Ax Wenshuo Zhao, Qi Zhu, Xingshan Zeng, Fei Mi, Lifeng Shang, Yi R. (May) Fung 5/4/2026

Entropy Centroids as Intrinsic Rewards for Test-Time Scaling

Intrinsic reward method using entropy centroids for test-time compute scaling and response selection in large language models.

Ax Jason Wu, Shir-Kang Scott Jin, Yuyang Yuan, Maggie Wigness, Lance M. Kaplan, Hang Qiu, Mani Srivastava 5/4/2026

SWAN: World-Aware Adaptive Multimodal Networks for Runtime Variations

Adaptive multimodal networks that handle runtime variations in modality quality, input complexity, and compute resources.

Ax Boris Shigida, Boris Hanin, Andrey Gromov 5/4/2026

Learning Rate Transfer in Normalized Transformers

Method to achieve learning rate transfer across model sizes in Normalized Transformers using alignment exponents.

Ax Preston Rozwood, Edward Mehrez, Ludger Paehler, Wen Sun, Steven L. Brunton 5/4/2026

Koopman-Assisted Reinforcement Learning

Reinforcement learning algorithms using data-driven Koopman operator to linearize nonlinear dynamics for tractable control.

Ax Hongyi Pan, Emadeldeen Hamdan, Xin Zhu, Ahmet Enis Cetin, Ulas Bagci 5/4/2026

Discrete Cosine Transform Based Decorrelated Attention for Vision Transformers

DCT-based approach to improve Vision Transformer efficiency and initialization for self-attention mechanisms.

Ax Yufei Guo, Muzhe Guo, Juntao Su, Zhou Yang, Mengqiu Zhu, Hongfei Li, Mengyang Qiu, Shuo Shuo Liu 5/4/2026

Bias in Large Language Models: Origin, Evaluation, and Mitigation

Comprehensive review of bias sources, evaluation methods, and mitigation strategies in large language models.

Ax Guangyu Zhao, Kewei Lian, Haoxuan Ru, Borong Zhang, Haowei Lin, Zhancun Mu, Haobo Fu, Qiang Fu, Shaofei Cai, Zihao Wang, Yitao Liang 5/4/2026

Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies

Post-training method using latent control to adapt frozen goal-conditioned policies without discrete text prompts.

Ax Cameron Yetman 5/4/2026

Representation in large language models

Research paper analyzing internal representations and mechanisms in LLMs, addressing theoretical disagreements about how these models function.

Ax Anderson Melchor Hernandez, Davide Pastorello, Giacomo De Palma 5/4/2026

Mean-field limit from general mixtures of experts to quantum neural networks

Mathematical analysis of Mixture of Experts networks using mean-field theory and gradient flow, studying convergence properties as expert count increases.

Ax Kaizhao Liu, Qi Long, Zhekun Shi, Weijie J. Su, Jiancong Xiao 5/4/2026

Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium

Theoretical analysis of fundamental statistical limits in aligning LLMs with diverse human preferences, connecting preference aggregation to game theory concepts.

Ax Hanrui Wang, Shuo Wang, Chun-Shien Lu, Isao Echizen 5/4/2026

DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion

DiffMI, a diffusion-based model inversion attack on face recognition systems that recovers identity information from embeddings without iterative optimization.

Ax Raja Gond, Nipun Kwatra, Ramachandran Ramjee 5/4/2026

TokenWeave: Efficient Compute-Communication Overlap for Distributed LLM Inference

TokenWeave technique for efficient distributed LLM inference via compute-communication overlap in tensor parallelism, reducing 20% overhead.

Ax Zexi Liu, Jingyi Chai, Xinyu Zhu, Shuo Tang, Rui Ye, Bo Zhang, Lei Bai, Siheng Chen 5/4/2026

ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering

ML-Agent framework for autonomous ML engineering using reinforcement learning to improve LLM agents' ability to learn from execution trajectories.

Ax Rahul Ramachandran, Ali Garjani, Roman Bachmann, Andrei Atanov, Oğuzhan Fatih Kar, Amir Zamir 5/4/2026

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Benchmark evaluation of multimodal foundation models (GPT-4o, Gemini, Claude, Llama) on standard computer vision tasks beyond question answering.

Ax Sazzad Hossain, Ponkrshnan Thiagarajan, Shashank Pathrudkar, Stephanie Taylor, Abhijeet S. Gangan, Amartya S. Banerjee, Susanta Ghosh 5/4/2026

Surprisingly High Redundancy in Electronic Structure Data Across Materials Explained by Low Intrinsic Dimensionality

Analysis revealing low intrinsic dimensionality and redundancy in electronic structure datasets, reducing computational requirements for ML model training.

Ax Chenhui Xu, Fuxun Yu, Michael J. Bianco, Jacob Kovarskiy, Raphael Tang, Qi Zhang, Zirui Xu, Will LeVine, Brandon Dubbs, Heming Liao, Cassandra Burgess, Suvam Bag, Jay Patravali, Rupanjali Kukal, Mikael Figueroa, Rishi Madhok, Nikolaos Karianakis, Jinjun Xiong 5/4/2026

Unlocking Zero-Shot Geospatial Reasoning via Indirect Rewards

Method for zero-shot geospatial reasoning in vision-language models using indirect rewards from metadata to overcome supervision scarcity.

Ax Alexius Wadell, Anoushka Bhutani, Victor Azumah, Austin R. Ellis-Mohr, Andrew J. Stier, Kareem Hegazy, Alexander Brace, Hancheng Zhao, Celia Kelly, Anuj K. Nayak, Yuhan Chen, Dimitrios Simatos, Hongyi Lin, Murali Emani, Venkatram Vishwanath, Kevin Gering, Melisa Alkan, Tom Gibbs, Jack Wells, Wesley W. Qian, Richard C. Gerkin, Benjamin Amorelli, Alexander B. Wiltschko, Lav R. Varshney, Bharath Ramsundar, Karthik Duraisamy, Michael W. Mahoney, Arvind Ramanathan, Venkatasubramanian Viswanathan 5/4/2026

Foundation Models for Discovery and Exploration in Chemical Space

MIST family of foundation models for molecular property prediction and chemical space discovery, trained on large unlabeled datasets for materials innovation.

Ax Jackson Hassell, Dan Zhang, Hannah Kim, Tom Mitchell, Estevam Hruschka 5/4/2026

Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation

Memory-augmented framework enabling LLM agents to learn classification functions from labeled examples without parameter updates using semantic and episodic memory.

Ax William Réveillard, Vasileios Saketos, Alexandre Proutiere, Richard Combes 5/4/2026

Minimizing Human Intervention in Online Classification

Active learning framework for LLM classification systems minimizing costly human feedback while maintaining error guarantees through efficient labeling strategies.

Ax Feijie Wu, Weiwu Zhu, Yuxiang Zhang, Soumya Chatterjee, Jiarong Zhu, Fan Mo, Rong Luo, Jing Gao 5/4/2026

PORTool: Importance-Aware Policy Optimization with Rewarded Tree for Multi-Tool-Integrated Reasoning

PORTool algorithm for training tool-use LLM agents with improved credit assignment through importance-aware policy optimization on multi-tool reasoning tasks.

Ax Anne Harrington, A. Sophia Koepke, Shyamgopal Karthik, Trevor Darrell, Alexei A. Efros 5/4/2026

It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models

Method for reducing mode collapse in text-to-image diffusion models through noise optimization rather than guidance mechanisms.

Ax Huan Li, Yiming Dong, Zhouchen Lin 5/4/2026

Convergence Rate Analysis of the AdamW-Style Shampoo: Unifying One-sided and Two-Sided Preconditioning

Theoretical analysis of AdamW-style Shampoo optimizer convergence rates, unifying preconditioning approaches with nuclear norm guarantees.

Ax Zhaoyi Li, Jiatong Li, Gangwei Jiang, Linqi Song, Defu Lian, Ying Wei 5/4/2026

Scaling Reasoning Hop Exposes Weaknesses: Demystifying and Improving Hop Generalization in Large Language Models

Research on chain-of-thought reasoning failures in LLMs when reasoning steps exceed training distributions, analyzing internal mechanisms and proposing improvements.

Ax Dongwon Jo, Beomseok Kang, Jiwon Song, Jae-Joon Kim 5/4/2026

Token Sparse Attention: Efficient Long-Context Inference with Interleaved Token Selection

Token Sparse Attention reduces quadratic complexity of LLM long-context inference via dynamic layer-wise token selection mechanism.

Ax Alejandro Breen Herrera, Aayush Sheth, Steven G. Xu, Zhucheng Zhan, Charles Wright, Marcus Yearwood, Hongtai Wei, Sudeep Das, Danny Nightingale, Meg Watson, Charles Pollnow V 5/4/2026

Build, Judge, Optimize: A Blueprint for Continuous Improvement of Multi-Agent Consumer Assistants

Framework for building and optimizing multi-agent conversational shopping assistants with evaluation metrics for multi-turn interactions.

Ax Zhe Zhang, Jing Li, Wanli Xue, Xu Cheng, Jianhua Zhang, Qinghua Hu, Shengyong Chen 5/4/2026

Adaptive Dual-Teacher Distillation with Subnetwork Rectification for Bridging Semantic Gaps in Black-Box Domain Adaptation

Black-box domain adaptation method using dual-teacher distillation and subnetwork rectification to handle semantic gaps without source data.

Ax Abhishek Bhandwaldar, Mihir Choudhury, Ruchir Puri, Akash Srivastava 5/4/2026

Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?

Agent factory pipeline using general-purpose coding agents to autonomously optimize hardware designs from high-level specifications.