Isolater - Feed

Ax Vasily Ilin 4/2/2026

Semi-Autonomous Formalization of the Vlasov-Maxwell-Landau Equilibrium

AI-assisted formalization of Vlasov-Maxwell-Landau system equilibrium in Lean 4 using DeepThink reasoning and Claude Code agent for automated theorem proving.

Ax Yi Nian, Haosen Cao, Shenzhe Zhu, Henry Peng Zou, Qingqing Luan, Yue Zhao 4/2/2026

When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution

Attribution method for multi-agent systems that identifies responsible agents without execution logs by analyzing final text only, addressing privacy-constrained scenarios.

Ax Chung-En Johnny Yu, Brian Jalaian, Nathaniel D. Bastian 4/2/2026

SCoOP: Semantic Consistent Opinion Pooling for Uncertainty Quantification in Multiple Vision-Language Model Systems

Training-free uncertainty quantification framework for combining multiple vision-language models through semantic-consistent opinion pooling to reduce hallucinations.

Ax Zehua Han, Jing Xiao, Yiqi Duan, Mengyu Xiang, Yuheng Ji, Xiaolong Zheng, Chenghanyu Zhang, Zhendong She, Junyu Shen, Dingwei Tan, Shichu Sun, Zhou Cong, Mingxuan Liu, Fengxiang Wang, Jinping Sun, Yangang Sun 4/2/2026

PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision

Foundation multimodal model for electromagnetic domain covering perception, recognition, and decision-making using LLM capabilities adapted for domain-specific applications.

Ax Lvmin Zhang, Maneesh Agrawala 4/2/2026

View-oriented Conversation Compiler for Agent Trace Analysis

Compiler for analyzing and visualizing structured agent traces including nested tool calls, reasoning blocks, and sub-agent invocations for better agentic system understanding.

Ax Davide Di Gioia 4/2/2026

Cognitive Friction: A Decision-Theoretic Framework for Bounded Deliberation in Tool-Using Agents

Decision-theoretic framework (Triadic Cognitive Architecture) for tool-using agents that bounds information-acquisition costs and tool usage to prevent systematic failures.

Ax Manuel Serra Nunes, Atabak Dehban, Yiannis Demiris, Jos\'e Santos-Victor 4/2/2026

Ego-Foresight: Self-supervised Learning of Agent-Aware Representations for Improved RL

Self-supervised learning method for RL agents that models agent and environment separately to improve sample efficiency without requiring supervisory signals.

Ax Benoit Coqueret, Mathieu Carbone, Olivier Sentieys, Gabriel Zaid 4/2/2026

A Divide-and-Conquer Strategy for Hard-Label Extraction of Deep Neural Networks via Side-Channel Attacks

Demonstrates hard-label extraction of deep neural networks via side-channel attacks using divide-and-conquer strategy for DNN intellectual property theft.

Ax Luigi Celona, Simone Bianco, Paolo Napoletano 4/2/2026

Cross-Camera Distracted Driver Classification through Feature Disentanglement and Contrastive Learning

Addresses accuracy loss in distracted driver classification across camera conditions using feature disentanglement and contrastive learning for robustness.

Ax Johnny Chan, Yuming Li 4/2/2026

Enhancing Team Diversity with Generative AI: A Novel Project Management Framework

Project management framework using generative AI agents to address team composition gaps by matching sociologically identified personality patterns and roles.

Ax Na Min An, Eunki Kim, Wan Ju Kang, Sangryul Kim, James Thorne, Hyunjung Shim 4/2/2026

How Blind and Low-Vision Individuals Prefer Large Vision-Language Model-Generated Scene Descriptions

User study with blind and low-vision participants evaluating preferences for LVLM-generated scene descriptions, examining effectiveness and user preferences.

Ax Jialuo Li, Wenhao Chai, Xingyu Fu, Haiyang Xu, Saining Xie 4/2/2026

Science-T2I: Addressing Scientific Illusions in Image Synthesis

ScienceT2I dataset and benchmark evaluating scientific correctness in image synthesis, addressing gap between visual fidelity and physical realism across 16 scientific domains.

Ax Carlos Rodriguez-Pardo, Leonardo Chiani, Emanuele Borgonovo, Massimo Tavoni 4/2/2026

Neural Conditional Transport Maps

Neural framework for learning conditional optimal transport maps with hypernetworks that generate adaptive transport parameters for categorical and continuous variables.

Ax Leon Eshuijs, Archie Chaudhury, Alan McBeth, Ethan Nguyen 4/2/2026

But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors

JUSSA framework uses steering vectors to improve LLM-as-judge reliability by detecting and mitigating subtle dishonesty like sycophancy through contrastive alternatives.

Ax Alejandro Murillo-Gonzalez, Lantao Liu 4/2/2026

Situationally-Aware Dynamics Learning

Framework for online learning of hidden state representations in autonomous robots to handle unobserved factors in complex, unstructured environments.

Ax Chunyang Jiang, Chi-min Chan, Yiyang Cai, Yulong Liu, Wei Xue, Yike Guo 4/2/2026

Graceful Forgetting in Generative Language Models

Proposes graceful forgetting methods to mitigate negative transfer by selectively forgetting detrimental pre-training knowledge during fine-tuning of language models.

Ax Shimao Zhang, Zhejian Lai, Xiang Liu, Shuaijie She, Xiao Liu, Yeyun Gong, Shujian Huang, Jiajun Chen 4/2/2026

How Does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Analyzes language-specific neurons to understand how multilingual alignment transfers capabilities from high-resource to low-resource languages in LLMs.

Ax Ananthu Aniraj, Cassio F. Dantas, Dino Ienco, Diego Marcos 4/2/2026

Two-stage Vision Transformers and Hard Masking offer Robust Object Representations

Two-stage vision transformer with hard masking approach for robust object representations that balance context dependence with distribution shift robustness.

Ax Kellie Yu Hui Sim, Roy Ka-Wei Lee, Kenny Tsu Wei Choo 4/2/2026

"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions

Investigates misalignments between LLM-supported peer supporters and mental health experts, examining quality and safety concerns in AI-driven psychosocial support.

Ax Hexiang Gu, Qifan Yu, Yuan Liu, Zikang Li, Saihui Hou, Jian Zhao, Zhaofeng He 4/2/2026

MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection

MemeMind dataset with chain-of-thought reasoning for detecting harmful memes, addressing implicit harmful content in multimodal text-image combinations.

Ax Rafael Sojo, Javier D\'iaz-Rozo, Concha Bielza, Pedro Larra\~naga 4/2/2026

Binned semiparametric Bayesian networks for efficient kernel density estimation

Introduces binned semiparametric Bayesian networks to reduce computational cost of kernel density estimation using data binning strategies.

Ax Zhenpeng Su, Leiyu Pan, Xue Bai, Dening Liu, Guanting Dong, Jiaming Huang, Minxuan Lv, Wenping Hu, Fuzheng Zhang, Kun Gai, Guorui Zhou 4/2/2026

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Klear-Reasoner model demonstrates long reasoning capabilities with gradient-preserving clipping for policy optimization, achieving strong benchmark performance with reproducible training details.

Ax Po-Hsien Yu, Yu-Syuan Tseng, Shao-Yi Chien 4/2/2026

FedKLPR: KL-Guided Pruning-Aware Federated Learning for Person Re-Identification

Federated learning approach for person re-identification that addresses statistical heterogeneity and communication efficiency in privacy-preserving surveillance systems.

Ax Jubayer Ibn Hamid, Ifdita Hasan Orney, Ellen Xu, Chelsea Finn, Dorsa Sadigh 4/2/2026

Polychromic Objectives for Reinforcement Learning

Addresses mode collapse in reinforcement learning fine-tuning by introducing polychromic objectives that preserve policy diversity and enable better exploration.

Ax Yuanfang Xiang, Lun Ai 4/2/2026

Adaptive Data-Knowledge Alignment in Genetic Perturbation Prediction

Proposes end-to-end integration of data-driven learning and existing knowledge for predicting transcriptional responses to genetic perturbations in biological systems.

Ax Eunki Kim, Na Min An, Wan Ju Kang, Sangryul Kim, James Thorne, Hyunjung Shim 4/2/2026

Are Large Vision-Language Models Ready to Guide Blind and Low-Vision Individuals?

Evaluates whether large vision-language models can effectively guide blind and low-vision individuals, addressing how to measure real-world utility beyond standard metrics.

Ax Shira Schiber, Ofir Lindenbaum, Idan Schwartz 4/2/2026

TempoControl: Temporal Attention Guidance for Text-to-Video Models

TempoControl method enables fine-grained temporal control in text-to-video generative models, allowing specification of when visual elements appear in sequences without retraining.

Ax Jacek Karwowski, Raymond Douglas 4/2/2026

Incoherence in Goal-Conditioned Autoregressive Models

Mathematical analysis of incoherence in goal-conditioned autoregressive models fine-tuned with reinforcement learning.

Ax Elias Hossain, Mehrdad Shoeibi, Ivan Garibay, Niloofar Yousefi 4/2/2026

BIOGEN: Evidence-Grounded Multi-Agent Reasoning Framework for Transcriptomic Interpretation in Antimicrobial Resistance

Multi-agent reasoning framework for interpreting gene clusters in antimicrobial resistance studies using transcriptomic data.

Ax Miko{\l}aj Czarnecki, Micha{\l} Korniak, Oskar Skibski, Piotr Skowron 4/2/2026

Fair Indivisible Payoffs through Shapley Value

Fair division method for indivisible payoffs in coalitional games using Shapley value.

Ax Guneet S. Dhillon, Javier Gonz\'alez, Teodora Pandeva, Alicia Curth 4/2/2026

E-Scores for (In)Correctness Assessment of Generative Model Outputs

Conformal prediction framework for assessing correctness of LLM outputs with user-defined tolerance levels.

Ax Yishan Du, Conrad Borchers, Mutlu Cukurova 4/2/2026

Benchmarking Educational LLMs with Analytics: A Case Study on Gender Bias in Feedback

Benchmarking framework using embeddings to detect gender bias in LLMs used for educational feedback on student essays.

Ax Farheen Ramzan (Cherise), Yusuf Kiberu (Cherise), Nikesh Jathanna (Cherise), Meryem Jabrane (Cherise), Vicente Grau (Cherise), Shahnaz Jamil-Copley (Cherise), Richard H. Clayton (Cherise), Chen (Cherise), Chen (Cherise) 4/2/2026

Seeing Beyond the Image: ECG and Anatomical Knowledge-Guided Myocardial Scar Segmentation from Late Gadolinium-Enhanced Images

Multimodal framework for myocardial scar segmentation combining ECG signals with cardiac MRI imaging.

Ax Rui Lin, Zhiyue Wu, Jiahe Le, Kangdi Wang, Weixiong Chen, Junyu Dai, Tao Jiang 4/2/2026

DuoTok: Source-Aware Dual-Track Tokenization for Multi-Track Music Language Modeling

DuoTok source-aware dual-track tokenizer preserving high-fidelity reconstruction, predictability, and cross-track correspondence for music language models.

Ax Asad Aali, Muhammad Ahmed Mohsin, Vasiliki Bikia, Arnav Singhvi, Richard Gaus, Suhana Bedi, Hejie Cui, Miguel Fuentes, Alyssa Unell, Yifan Mai, Jordan Cahoon, Michael Pfeffer, Roxana Daneshjou, Sanmi Koyejo, Emily Alsentzer, Christopher Potts, Nigam H. Shah, Akshay S. Chaudhari 4/2/2026

Structured Prompts Improve Evaluation of Language Models

Study showing structured prompts significantly improve LLM evaluation accuracy and reduce prompt-dependent variance in benchmark frameworks like HELM.

Ax Sai Koneru, Matthias Huck, Jan Niehues 4/2/2026

OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion

OmniFusion modular approach for simultaneous multilingual multimodal translation combining speech recognition and translation in open-source LLM pipelines.

Ax Isha Chaudhary, Vedaant Jain, Prineet Parhar, Kavya Sachdeva, Avaljot Singh, Sayan Ranu, Gagandeep Singh 4/2/2026

Lumos: Let there be Language Model System Certification

Lumos framework for formally certifying language model system behaviors using imperative probabilistic programming with graph-based prompt generation.

Ax Kai Kohyama, Yoshimitsu Aoki, Guillermo Gallego, Shintaro Shiba 4/2/2026

Geometric-Photometric Event-based 3D Gaussian Ray Tracing

GPERT framework for event-based 3D Gaussian splatting balancing accuracy and temporal resolution using geometric-photometric event camera data.

Ax Md Jahedur Rahman, Ihsen Alouani 4/2/2026

Bypassing Prompt Injection Detectors through Evasive Injections

Study demonstrating evasive injection techniques that bypass ML-based prompt injection detectors in retrieval-augmented LLM systems.

Ax Sohan Venkatesh, Ashish Mahendran Kurapath 4/2/2026

On the Non-Identifiability of Steering Vectors in Large Language Models

Analysis showing steering vectors in LLMs are fundamentally non-identifiable with large equivalence classes, questioning interpretability of activation steering methods.

Ax Isaac Han, Sangyeon Park, Seungwon Oh, Donghu Kim, Hojoon Lee, Kyung-Joong Kim 4/2/2026

FIRE: Frobenius-Isometry Reinitialization for Balancing the Stability-Plasticity Tradeoff

FIRE reinitialization method balancing stability-plasticity tradeoff in continual learning for deep neural networks through Frobenius-isometry constraints.

Ax Arshad Beg, Diarmuid O'Donoghue, Rosemary Monahan 4/2/2026

Evaluating LLM-Generated ACSL Annotations for Formal Verification

Empirical evaluation of LLM-generated ACSL formal specification annotations for C programs, assessing automatic verification without human assistance.

Ax Wenbo Nie, Zixiang Li, Renshuai Tao, Bin Wu, Yunchao Wei, Yao Zhao 4/2/2026

CoCoDiff: Correspondence-Consistent Diffusion Model for Fine-grained Style Transfer

CoCoDiff training-free style transfer framework using diffusion models and correspondence consistency for fine-grained region-wise semantic preservation.

Ax Eason Chen, Sophia Judicke, Kayla Beigh, Xinyi Tang, Isabel Wang, Nina Yuan, Zimo Xiao, Chuangji Li, Shizhuo Li, Reed Luttmer, Shreya Singh, Maria Yampolsky, Naman Parikh, Yvonne Zhao, Meiyi Chen, Scarlett Huang, Anishka Mohanty, Gregory Johnson, John Mackey, Jionghao Lin, Ken Koedinger 4/2/2026

Chat-Based Support Alone May Not Be Enough: Comparing Conversational and Embedded LLM Feedback for Mathematical Proof Learning

Empirical evaluation of GPTutor LLM tutoring system comparing embedded proof-review feedback versus chatbot support for discrete mathematics learning.

Ax Tugrul Gorgulu, Atakan Dag, M. Esat Kalfaoglu, Halil Ibrahim Kuru, Baris Can Cam, Halil Ibrahim Ozturk, Ozsel Kilinc 4/2/2026