Isolater - Feed

Ax Hongxi Mao, Wei Zhou, Mengting Jia, Tao Fang, Huan Gao, Bin Zhang, Shangyang Li 5/6/2026

Schema-Adaptive Tabular Representation Learning with LLMs for Generalizable Multimodal Clinical Reasoning

LLM-based method for schema-adaptive tabular representation learning to improve generalization across varying EHR schemas in clinical reasoning.

Ax Bronislav Sidik, Lior Rokach 5/6/2026

Beyond Static Sandboxing: Learned Capability Governance for Autonomous AI Agents

Framework for learned capability governance in autonomous AI agents, addressing overprovisioning of tool access across different task types.

Ax Jasmine Brazilek, Miles Tidmarsh 5/6/2026

Alignment midtraining for animals

Study on LLM value alignment via midtraining using synthetic documents; releases ANIMA evaluation dataset for animal compassion reasoning.

Ax Yoo-Min Jung, Leekyung Kim 5/6/2026

MambaSL: Exploring Single-Layer Mamba for Time Series Classification

Single-layer Mamba architecture for time series classification with minimal design modifications.

Ax Antonio Valerio Miceli Barone, Poon Tsz Nok 5/6/2026

Improving LLM Code Reasoning via Semantic Equivalence Self-Play with Formal Verification

Self-play framework with formal verification to improve LLM code reasoning in Haskell; includes a 28k synthetic dataset.

Ax Rajveer Singh Pall 5/6/2026

IndiaFinBench: An Evaluation Benchmark for Large Language Model Performance on Indian Financial Regulatory Text

Benchmark dataset for evaluating LLM performance on Indian financial regulatory documents, addressing the gap in non-Western text.

Ax Derya Cögendez, Verena Zimmermann, Noé Zufferey 5/6/2026

Can LLMs Infer Conversational Agent Users' Personality Traits from Chat History?

Privacy analysis: study of personality trait inference from ChatGPT chat history across 668 users.

Ax Meghyn Bienvenu, Camille Bourgaux, Robin Jean, Giuseppe Mazzotta 5/6/2026

Using ASP(Q) to Handle Inconsistent Prioritized Data

ASP(Q) framework for inconsistency-tolerant querying of prioritized data with optimal repair semantics.

Ax Grigory Sapunov 5/6/2026

Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning

Study of memory tokens as computational scratchpad in Universal Transformers for adaptive recursive reasoning on combinatorial problems.

Ax Varun Totakura, Shayok Chakraborty 5/6/2026

MetaErr: Towards Predicting Error Patterns in Deep Neural Networks

MetaErr: method for predicting error patterns and failure modes in deep neural networks.

Ax Amir Noorizadegan 5/6/2026

Partition-of-Unity Gaussian Kolmogorov-Arnold Networks

PU-GKAN: Kolmogorov-Arnold network using partition-of-unity Gaussian basis functions with normalized activations.

Ax Yifan Zhang, Xiaohan Wang, Yueke Zhang, Yu Huang, Kevin Leach 5/6/2026

Constraint-Guided Multi-Agent Decompilation for Executable Binary Recovery

Multi-agent constraint-guided framework for decompilation recovering executable source code from binaries.

Ax Qiliang Liang, Hansi Wang, Zhong Liang, Yang Liu 5/6/2026

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

Structured skill representation framework converting text-based skill descriptions into machine-usable scheduling, logic, and control structures for LLM agents.

Ax Aditya Sharma, Vinti Agarwal, Rajesh Kumar 5/6/2026

G-Loss: Graph-Guided Fine-Tuning of Language Models

G-Loss: graph-guided loss function incorporating semi-supervised label propagation for fine-tuning language models.

Ax Zichao Wei 5/6/2026

Structural Generalization on SLOG without Hand-Written Rules

Neural cellular automaton for semantic parsing with structural generalization without hand-written compositional rules.

Ax Paraskevas V. Lekeas, Giorgos Stamatopoulos 5/6/2026

What Suppresses Nash Equilibrium Play in Large Language Models? Mechanistic Evidence and Causal Control

Mechanistic analysis of why LLM agents deviate from Nash equilibria in games and methods to reverse deviations.

Ax Neha Nagaraja, Hayretdin Bahsi, Carlo R. da Cunha 5/6/2026

From Prompt to Physical Actuation: Holistic Threat Modeling of LLM-Enabled Robotic Systems

Threat modeling of LLM-enabled robotic systems tracing attack propagation from prompts to physical actuation.

Ax Yuval Domb 5/6/2026

Why Self-Supervised Encoders Want to Be Normal

Analysis of why self-supervised encoders prefer normal distributions in joint-embedding predictive architectures.

Ax Bokai Pan, Mingyue Cheng, Zhiding Liu, Shuo Yu, Xiaoyu Tao, Yuchong Wu, Qi Liu, Defu Lian, Enhong Chen 5/6/2026

CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting

CastFlow: multi-agent system using role-specialized LLM workflows for improved time series forecasting with iterative refinement.

Ax Sihong Wu, Owen Jiang, Yilun Zhao, Tiansheng Hu, Yiling Ma, Kaiyan Zhang, Manasi Patwardhan, Arman Cohan 5/6/2026

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

Survey of LLMs in peer review automation: covers review generation, agent systems, RL methods, and future paradigms.

Ax Alexis Kafantaris 5/6/2026

Attractor FCM

Attractor FCM: gradient descent-based fuzzy cognitive map using residual memory and backpropagation through time.

Ax Sudong Wang, Weiquan Huang, Xiaomin Yu, Zuhao Yang, Hehai Lin, Keming Wu, Chaojun Xiao, Chen Chen, Wenxuan Wang, Beier Zhu, Yunjian Zhang, Chengwei Qin 5/6/2026

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Black-box on-policy distillation for multimodal model alignment, addressing distributional drift in supervised fine-tuning.

Ax Fazle Rabbi, Lin Ling, Song Wang, Jinqiu Yang 5/6/2026

Social Bias in LLM-Generated Code: Benchmark and Mitigation

SocialBias-Bench: benchmark studying social bias in LLM-generated code across 343 real-world tasks with mitigation strategies.

Ax Roberto Tacconelli 5/6/2026

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

StateSMix: lossless compression algorithm using Mamba state space models and n-gram mixing without pre-trained weights.

Ax Pei-Chun Su 5/6/2026

eOptShrinkQ: Near-Lossless KV Cache Compression Through Optimal Spectral Denoising and Quantization

eOptShrinkQ: KV cache compression for transformers using spectral denoising and quantization to reduce memory overhead.

Ax Jingkai He, Pengfei Chen, Chenghui Wu, Shuang Liang, Ye Li, Gou Tan, Xiadao Wen, Chuanfu Zhang 5/6/2026

An End-to-End Framework for Building Large Language Models for Software Operations

OpsLLM: domain-specific LLM framework for software operations with knowledge-based QA and root cause analysis capabilities.

Ax Kazuki Egashira, Mark Vero, Jasper Dekoninck, Florian E. Dorner, Robin Staab, Martin Vechev 5/6/2026

Delay, Plateau, or Collapse: Evaluating the Impact of Systematic Verification Error on RLVR

Analysis of systematic verification errors in RL with verifiable rewards for LLM reasoning improvement.

Ax Robert-Jeron Reifert, Alaa Alameer Ahmad, Hayssam Dahrouj, Aydin Sezgin 5/6/2026

Agentic AI-Based Joint Computing and Networking via Mixture of Experts and Large Language Models

Agentic AI framework using MoE and LLMs for 6G network optimization and resource orchestration.

Ax Rohan Surana, Gagan Mundada, Xunyi Jiang, Chuhan Wang, Zhenwei Tang, Difan Jiao, Zihan Huang, Yuxin Xiong, Junda Wu, Sheldon Yu, Xintong Li, Raghav Jain, Nikki Kuang, Sizhe Zhou, Bowen Jin, Zhendong Chu, Tong Yu, Ryan Rossi, Kuan-Hao Huang, Jingbo Shang, Jiawei Han, Julian McAuley 5/6/2026

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Comprehensive survey of rollout strategies in LLM reinforcement learning for reasoning and tool use.

Ax Ismail Hossain, Sai Puppala, Jannatul Ferdaus, Md Jahangir Alam, Yoonpyo Lee, Syed Bahauddin Alam, Sajedul Talukder 5/6/2026

When Safety Geometry Collapses: Fine-Tuning Vulnerabilities in Agentic Guard Models

Safety collapse in fine-tuned guard models for agentic AI pipelines through domain specialization.

Ax Junhong Lai, Shuzhong Lai, Yanhao Yu, Wanlin Chen, Chenyu Yan, Haifeng Li, Lin Yao, Yueming Wang 5/6/2026

From Synthesis to Clinical Assistance: A Strategy-Aware Agent Framework for Autism Intervention based on Real Clinical Dataset

Strategy-aware AI agent framework for autism intervention using LLMs trained on real clinical ABA data.

Ax Sheng Wong, Ravi Shankar, Beth Albert, Hao Fei, Lin Li, Imane Ben M'Barek, Manu Vatish, Gabriel Davis Jones 5/6/2026

PRISM-CTG: A Foundation Model for Cardiotocography Analysis with Multi-View SSL

Self-supervised foundation model for cardiotocography analysis using multi-view SSL and clinical metadata.

Ax Takato Yasuno 5/6/2026

Heterogeneous Graph Importance Scoring and Clustering with Automated LLM-based Interpretation

Heterogeneous graph analysis and automated LLM-based interpretation for assessing urban bridge infrastructure importance.

Ax Haihan Duan, Tengfei Ma, Yuyang Qin, Runhao Zeng, Wei Cai, Victor C. M. Leung, Xiping Hu 5/6/2026

DeRelayL: Sustainable Decentralized Relay Learning

Decentralized relay learning approach for sustainable large-scale machine learning training.

Ax Fang Wu, Weihao Xuan, Heli Qi, Hanqun Cao, Heng-Jui Chang, Zeqi Zhou, Haokai Zhao, Ma Jian, Carl Ma, Yu-Chi Cheng, Kuan Pang, Xiangru Tang, Zehong Wang, Guanlue Li, Hanchen Wang, Kejun Ying, Pan Lu, Chiho Im, Seungju Han, Peng Xia, Tinson Xu, Yinxi Li, Deyao Zhu, Pheng-Ann Heng, Naoto Yokoya, Masashi Sugiyama, Li Erran Li, Jure Leskovec, Yejin Choi 5/6/2026

Proteo-R1: Reasoning Foundation Models for De Novo Protein Design

Reasoning foundation models for de novo protein design with explicit biochemical reasoning.

Ax Zihan Ding, Ziyuan Yang, Yi Zhang 5/6/2026

From Static Analysis to Audience Dissemination: A Training-Free Multimodal Controversy Detection Multi-Agent Framework

Training-free multi-agent framework for multimodal controversy detection in videos using audience perspectives.

Ax Zihan Ding, Ziyuan Yang, Yi Zhang 5/6/2026

PrismAgent: Illuminating Harm in Memes via a Zero-Shot Interpretable Multi-Agent Framework

Zero-shot interpretable multi-agent framework for detecting harmful content in memes without annotation.

Ax Aya Elgebaly, Joris Fournel, Benjamin Laine Jønch Jurgensen, Kamil Mikolaj, Anders Christensen, Martin Tolsgaard, Claes Ladefoged, Aasa Feragen 5/6/2026

A Framework for Exploring and Disentangling Intersectional Bias: A Case Study in Fetal Ultrasound

Framework for analyzing intersectional bias in fetal ultrasound medical imaging tasks.

Ax Minbyul Jeong 5/6/2026

Healthcare AI GYM for Medical Agents

Comprehensive empirical study of multi-turn agentic RL for medical AI agents across clinical domains.

Ax Xin-Ye Li, Ren-Biao Liu, Yun-Ji Zhang, Hui Sun, Zheng Xie, Ming Li 5/6/2026

Exploring Pass-Rate Reward in Reinforcement Learning for Code Generation

Study of pass-rate rewards in reinforcement learning for LLM code generation, addressing sparse reward problem.

Ax Zhiyuan Xu, Joseph Gardiner, Sana Belguith, Lichao Wu 5/6/2026

RouteHijack: Routing-Aware Attack on Mixture-of-Experts LLMs

Adversarial attack on Mixture-of-Experts LLMs exploiting routing mechanisms to bypass safety alignment.

Ax Mohit Kumar, Somayeh Kargaran, Bernhard A. Moser, Manuela Geiß 5/6/2026

Kernel Affine Hull Machines for Compute-Efficient Query-Side Semantic Encoding

Kernel Affine Hull Machines enabling efficient query-side semantic encoding without repeated neural inference for transformer retrieval systems.

Ax Zhaoyuan Su, Olatunji Ruwase, Karthik Ganesan, Aurick Qiao, Samyam Rajbhandari, Juncheng Yang, Yue Cheng, Yuxiong He 5/6/2026

ZeRO-Prefill: Zero Redundancy Overheads in MoE Prefill Serving

ZeRO-Prefill optimization reducing distributed execution overhead in mixture-of-experts model serving for prefill-only discriminative tasks.

Ax Michael Chertkov 5/6/2026

Analytic Bridge Diffusions for Controlled Path Generation

Bridge diffusion method with closed-form solutions for score functions and drift fields enabling analytical controlled path generation without neural networks.

Ax Barbara Tarantino, Sun Kim, Yijingxiu Lu, Paolo Giudici 5/6/2026

ISAAC: Auditing Causal Reasoning in Deep Models for Drug-Target Interaction

ISAAC framework auditing causal reasoning in deep learning drug-target interaction models via intervention-based structural sensitivity probing.

Ax Kunvar Thaman 5/6/2026

Reward Hacking Benchmark: Measuring Exploits in LLM Agents with Tool Use

Benchmark for evaluating tool-using LLM agents that detects reward hacking exploits through multi-step tasks with naturalistic shortcuts.

Ax Yang Fu, Peng Qin, Liming Chen, Zihao Zhang, Hao Yu, Yifei Wang 5/6/2026

Joint Energy Management and Coordinated AIGC Workload Scheduling for Distributed Data Centers: A Diffusion-Aided Reward Shaping Approach

Diffusion-aided reward shaping approach for scheduling AIGC workloads across distributed data centers while minimizing energy costs.

Ax Xintan Zeng, Yongchao Liu, Yice Luo, Jiajun Zhen 5/6/2026