Isolater - Feed

Ax Hefei Xu, Le Wu, Yu Wang, Min Hou, Han Wu, Zhen Zhang, Meng Wang 3/20/2026

VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models

VC-Soup: Method for aligning LLMs with multiple conflicting human values using value-consistency guidance for trustworthy AI development.

Ax Jing Wang, Jie Shen, Amar Sra, Qiaomin Xie, Jeremy C Weiss 3/20/2026

LLM-Augmented Computational Phenotyping of Long Covid

Grace Cycle: LLM-augmented computational phenotyping framework for discovering clinical subtypes in Long COVID through iterative hypothesis generation and evidence extraction.

Ax Jianwei Zhang 3/20/2026

Intellectual Stewardship: Re-adapting Human Minds for Creative Knowledge Work in the Age of AI

Conceptual framework proposing intellectual stewardship for how humans should adapt their roles in creative knowledge work alongside AI systems.

Ax Yuhao Dong, Zuyan Liu, Shulin Tian, Yongming Rao, Ziwei Liu 3/20/2026

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models

Insight-V++: Multi-agent visual reasoning framework for MLLMs enabling long-chain reasoning with high-quality training data and optimized pipelines.

Ax Juan P Wachs 3/20/2026

Final Report for the Workshop on Robotics & AI in Medicine

Workshop report on advancing robotics and AI in healthcare, highlighting coordination needs between engineering and clinical priorities for safety and reliability.

Ax Marwa Abdulhai, Isadora White, Yanming Wan, Ibrahim Qureshi, Joel Leibo, Max Kleiman-Weiner, Natasha Jaques 3/20/2026

How LLMs Distort Our Written Language

User study demonstrating that extensive LLM use for writing assistance alters the voice, tone, and meaning of human text, with a 70% increase in essay length.

Ax Mohammad Qazim Bhat, Yufan Huang, Niket Agarwal, Hao Wang, Michael Woods, John Kenyon, Tsung-Yi Lin, Xiaodong Yang, Ming-Yu Liu, Kevin Xie 3/20/2026

VLM-AutoDrive: Post-Training Vision-Language Models for Safety-Critical Autonomous Driving Events

Post-training framework adapting vision-language models for safety-critical autonomous driving event detection in dashcam footage through temporal alignment.

Ax Xavier Cadet, Aditya Vikram Singh, Harsh Mamania, Edward Koh, Alex Fitts, Dirk Van Bruggen, Simona Boboila, Peter Chin, Alina Oprea 3/20/2026

Retrieval-Augmented LLMs for Security Incident Analysis

RAG-based system using LLMs for automated cybersecurity incident analysis through targeted log filtering across multiple data sources.

Ax Wenshuo Wang, Fan Zhang 3/20/2026

Gradient-Informed Temporal Sampling Improves Rollout Accuracy in PDE Surrogate Training

Gradient-informed temporal sampling strategy for training neural PDE surrogates, improving rollout accuracy beyond uniform and augmentation-based sampling.

Ax Philippe Formont, Maxime Darrin, Ismail Ben Ayed, Pablo Piantanida 3/20/2026

MolRGen: A Training and Evaluation Setting for De Novo Molecular Generation with Reasoning Models

MolRGen benchmark and training framework for evaluating reasoning-based LLMs on de novo molecular generation for drug discovery without ground-truth molecule pairs.

Ax Jiaxin Liu 3/20/2026

Discovering What You Can Control: Interventional Boundary Discovery for Reinforcement Learning

Interventional Boundary Discovery method using causal inference to identify controllable state dimensions in reinforcement learning with confounded distractors.

Ax Haocheng Luo, Zehang Deng, Thanh-Toan Do, Mehrtash Harandi, Dinh Phung, Trung Le 3/20/2026

Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization

Sharpness-aware minimization technique in logit space addressing squeezing effect in Direct Preference Optimization for LLM alignment.

Ax Tamer Shanableh 3/20/2026

LRConv-NeRV: Low Rank Convolution for Efficient Neural Video Compression

Low-rank convolution optimization for neural video compression (NeRV) reducing computational cost and memory for resource-constrained environments.

Ax Gregory N. Frank 3/20/2026

Detection Is Cheap, Routing Is Learned: Why Refusal-Based Alignment Evaluation Fails

Analysis of LLM alignment through concept routing rather than detection, studying political censorship in Chinese language models across nine open-weight models.

Ax Sara Pohland, Xenofon Foukas, Ganesh Ananthanarayanan, Andrey Kolobov, Sanjeev Mehrotra, Bozidar Radunovic, Ankit Verma 3/20/2026

Offload or Overload: A Platform Measurement Study of Mobile Robotic Manipulation Workloads

Measurement study comparing computational costs of mobile robotic manipulation workloads across onboard, edge, and cloud GPU platforms using foundation models.

Ax Nikhil Gosala, B. Ravi Kiran, Senthil Yogamani, Abhinav Valada 3/20/2026

Sparse3DTrack: Monocular 3D Object Tracking Using Sparse Supervision

Sparse supervised learning framework for monocular 3D object tracking in videos, reducing annotation requirements for autonomous agent perception.

Ax Jasmine Rienecker, Katarina Mpofu, Naman Goel, Siddhartha Datta, Jun Zhao, Oscar Danielsson, Fredrik Thorsen 3/20/2026

Auditing Preferences for Brands and Cultures in LLMs

ChoiceEval framework for auditing brand and cultural preference biases in LLMs used as market intermediaries affecting consumer choices.

Ax Kaiyang Li, Shihao Ji, Zhipeng Cai, Wei Li 3/20/2026

Approximate Subgraph Matching with Neural Graph Representations and Reinforcement Learning

Neural graph representation method using reinforcement learning to solve approximate subgraph matching, an NP-hard problem in graph analysis.

Ax Zilin Huang, Zihao Sheng, Zhengyang Wan, Yansong Qu, Junwei You, Sicong Jiang, Sikai Chen 3/20/2026

DriveVLM-RL: Neuroscience-Inspired Reinforcement Learning with Vision-Language Models for Safe and Deployable Autonomous Driving

Reinforcement learning approach combining vision-language models with neuroscience-inspired reward signals for safe autonomous driving without manual reward engineering.

Ax Zichen Xie, Wenxi Wang 3/20/2026

Can LLMs Reason Like Automated Theorem Provers for Rust Verification? VCoT-Bench: Evaluating via Verification Chain of Thought

Evaluation framework (VCoT-Bench) measuring LLM reasoning ability for Rust program verification through intermediate verification steps, not just pass/fail outcomes.

Ax Yanchuan Tang, Taowen Wang, Yuefei Chen, Boxuan Zhang, Qiang Guan, Ruixiang Tang 3/20/2026

Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model

Uncertainty quantification method for Vision-Language-Action robotic models that detects safety-critical moments during continuous control rather than averaging uncertainty signals.

Ax Ruishuo Chen, Yu Chen, Zhuoran Li, Longbo Huang 3/20/2026

PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

PowerFlow: principled RLIF framework for unsupervised LLM capability elicitation via distribution matching instead of heuristic rewards.

Ax Guangsheng Yu, Qin Wang, Rui Lang, Shuai Su, Xu Wang 3/20/2026

PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents

Privacy-preserving LLM agent planning via abstractions preventing exposure of local environment data to cloud services.

Ax Sam Ganzfried 3/20/2026

Evolutionarily Stable Stackelberg Equilibrium

Game theory: evolutionarily stable Stackelberg equilibrium solution concept with leader-follower dynamics.

Ax Linfeng Zhang, Taoyong Cui, Dongzhan Zhou, Lei Bai, Sufei Zhang, Luca Rossi, Mao Su, Wanli Ouyang, Pheng-Ann Heng 3/20/2026

An SO(3)-equivariant reciprocal-space neural potential for long-range interactions

Neural potential with SO(3) equivariance for molecular systems with long-range electrostatic interactions.

Ax Arushi Rai, Qiang Zhang, Hanqing Zeng, Yunkai Zhang, Dipesh Tamboli, Xiangjun Fan, Zhuokai Zhao 3/20/2026

TARo: Token-level Adaptive Routing for LLM Test-time Alignment

Token-level Adaptive Routing: inference-time alignment method for steering frozen LLMs toward structured reasoning without post-training.

Ax Li Wenxiu, Wen Zhanjie, Xia Jiechang, Guo Jingqiao 3/20/2026

The Spillover Effects of Peer AI Rinsing on Corporate Green Innovation

Economics study analyzing spillover effects of AI washing in corporate sustainability claims via semantic analysis.

Ax Arundhathi Dev, Justin Zhan 3/20/2026

Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration

Automated hyperparameter optimization framework for sparse attention mechanisms using Bayesian optimization and multi-fidelity search.

Ax Yang Liu, Jiyao Yang, Hongjin Zhao, Xiaoyong Li, Yanzhe Ji, Xingjian Li, Runmin Jiang, Tianyang Wang, Saeed Anwar, Dongwoo Kim, Yue Yao, Zhenyue Qin, Min Xu 3/20/2026

Mind the Rarities: Can Rare Skin Diseases Be Reliably Diagnosed via Diagnostic Reasoning?

Benchmark evaluating large vision-language models on rare skin disease diagnosis with long-context reasoning.

Ax Li Wenxiu, Wen Zhanjie, Xia Jiechang, Guo Jingqiao 3/20/2026

The Impact of Corporate AI Washing on Farmers' Digital Financial Behavior Response -- An Analysis from the Perspective of Digital Financial Exclusion

Economics paper analyzing corporate AI washing claims and their impact on farmers' fintech adoption using CHFS data.

Ax Huy Che, Dinh-Duy Phan, Duc-Khai Lam 3/20/2026