MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control
MMEmb-R1 incorporates chain-of-thought reasoning into multimodal embeddings with pair-aware selection and adaptive control mechanisms.
Diffusion model approach for converting low dynamic range video to HDR through scene radiance estimation.
Test-time training method updates LLM fast weights at inference to adapt dynamically to new information streams.
UserCentrix is a hybrid agentic orchestration framework for smart spaces combining memory augmentation with multi-agent coordination.
ARIEL framework pairs expert-vetted biomedical tasks with LLMs for evaluation and optimization of AI research assistants.
Fine-tunes open-source LLMs for smartphone app control by learning action semantics rather than syntax, reducing API costs.
URSA framework enables LLMs to conduct autonomous research through complex reasoning, planning, coding, and multi-agent collaboration.
MedGemma is a medical vision-language foundation model collection designed for healthcare AI tasks with privacy preservation.
Agent-based model framework for simulating cascading climate risks in supply chains with adaptive firm behavior and economic network effects.
Extends Nash learning from human feedback to the multiplayer setting, capturing non-transitive and heterogeneous preferences in LLM alignment.
DeepSearch applies Monte Carlo Tree Search to overcome training plateaus in reinforcement learning from verifiable rewards for language model reasoning.
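DeepSearch's specifics aside, the generic MCTS loop it builds on (UCB1 selection, expansion, rollout, backpropagation) can be sketched on a toy problem. Everything below — the bit-string task, the reward, the constants — is an illustrative assumption, not the paper's setup:

```python
import math, random

class Node:
    def __init__(self, state, parent=None):
        self.state = state          # partial binary string
        self.parent = parent
        self.children = {}          # action ("0"/"1") -> Node
        self.visits = 0
        self.value = 0.0            # sum of rollout rewards

def ucb1(child, parent_visits, c=1.4):
    if child.visits == 0:
        return float("inf")
    return child.value / child.visits + c * math.sqrt(math.log(parent_visits) / child.visits)

def rollout(state, depth):
    # Random playout to a terminal string; reward = fraction of 1s.
    while len(state) < depth:
        state += random.choice("01")
    return state.count("1") / depth

def mcts(depth=8, iters=2000):
    root = Node("")
    for _ in range(iters):
        node = root
        # Selection: descend while the node is fully expanded.
        while len(node.children) == 2 and len(node.state) < depth:
            node = max(node.children.values(), key=lambda ch: ucb1(ch, node.visits))
        # Expansion: add one untried child if non-terminal.
        if len(node.state) < depth:
            action = random.choice([a for a in "01" if a not in node.children])
            node.children[action] = Node(node.state + action, node)
            node = node.children[action]
        # Simulation + backpropagation.
        reward = rollout(node.state, depth)
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    # Greedy extraction along the most-visited path.
    node, out = root, ""
    while node.children:
        action, node = max(node.children.items(), key=lambda kv: kv[1].visits)
        out += action
    return out

random.seed(0)
best = mcts()
# The search should concentrate on high-reward (1-heavy) strings.
assert best.count("1") >= best.count("0")
```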
Introduces Supervised Multi-Dimensional Scaling to analyze and compare feature manifold hypotheses in language models' latent spaces.
TS-Agent enables LLMs to reason over raw time series data directly without converting to text/images, reducing hallucination and knowledge leakage.
DRIFT method automates mathematical theorem formalization for LLMs by decomposing statements and retrieving prerequisite knowledge in formal languages.
Critiques rule-based and reward-based approaches in RL ethics, proposes virtue ethics framework for more robust machine ethics.
Information-theoretic analysis extending Gödel's incompleteness to AI security and alignment, establishing fundamental limitations for robust AI systems.
Framework enabling GUI agents to build actionable memory from past tasks via self-exploration with critic guidance, improving generalization and reducing errors.
Asynchronous reinforcement learning framework for vision-language-action model training, enabling flexible post-training optimization for embodied agents.
Study demonstrating that introspection mechanisms in LLMs are content-agnostic, detecting anomalies without understanding their semantic meaning.
Framework adapting hindsight experience replay to recover training signal from failed LLM agent trajectories, addressing low real-world task success rates.
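Hindsight experience replay, which the framework above adapts to LLM agents, can be illustrated generically: a failed trajectory is relabeled with goals it actually achieved, turning sparse zero rewards into training signal. The tuple layout and the 1-D grid task below are illustrative assumptions, not the paper's setup:

```python
import random

def her_relabel(trajectory, k=4):
    """Generate hindsight transitions from a (possibly failed) trajectory.

    trajectory: list of (state, action, next_state, goal) tuples.
    Returns relabeled transitions where the goal is replaced by a state
    actually reached later in the same trajectory ("future" strategy),
    so the sparse reward becomes positive in hindsight.
    """
    relabeled = []
    for t, (s, a, s_next, _goal) in enumerate(trajectory):
        # Sample up to k achieved states from this transition onward.
        future = [tr[2] for tr in trajectory[t:]]
        for new_goal in random.sample(future, min(k, len(future))):
            reward = 1.0 if s_next == new_goal else 0.0
            relabeled.append((s, a, s_next, new_goal, reward))
    return relabeled

# Toy failed trajectory on a 1-D grid: the agent aimed for 5 but reached 3.
traj = [(0, "+1", 1, 5), (1, "+1", 2, 5), (2, "+1", 3, 5)]
random.seed(0)
extra = her_relabel(traj, k=2)
# Some relabeled transitions now carry reward 1.0 despite the original failure.
assert any(r == 1.0 for (*_, r) in extra)
```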
Diffusion-based surgical video restoration framework using physics- and semantics-guided reinforcement learning to remove surgical smoke.
Two-phase training framework jointly optimizing LLMs for reasoning and self-refinement using group relative policy optimization on correctness rewards.
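Group relative policy optimization replaces PPO's learned value baseline with a per-prompt group statistic: sample several responses, score each for correctness, and normalize within the group. A minimal sketch of the advantage computation (the function name and reward values are illustrative, not the paper's code):

```python
from statistics import mean, pstdev

def grpo_advantages(rewards, eps=1e-8):
    """Group-relative advantages: normalize each sampled response's
    correctness reward against the group sampled for the same prompt.
    No learned value function is needed, unlike PPO."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# Group of 4 sampled answers to one prompt: two correct (1.0), two wrong (0.0).
adv = grpo_advantages([1.0, 0.0, 1.0, 0.0])
# Correct answers get positive advantage, wrong ones negative.
assert adv[0] > 0 and adv[1] < 0
```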
High-fidelity benchmark with rubric-based evaluation assessing LLMs on expert-level, complex open-ended tasks across multiple domains.
LLM-based peer review system that verifies claims by checking related work and executing code, improving review quality beyond manuscript-only analysis.
Theoretical framework for evaluating cyclic non-transitive interactions between LLM-based agents using equilibrium concepts instead of linear rankings.
Framework proposing that ambient AI systems transition from modeling to constituting users' cognitive functions through sustained causal coupling.
Molecular discovery framework combining LLMs with diffusion models to improve generation of chemically valid molecules by relaxing autoregressive constraints.
Memory system for deep research agents that improves trajectory retrieval and memory evolution to enhance LLM reasoning and autonomous learning.
Unsupervised fine-tuning method to improve adversarial robustness and semantic quality of vision-language models through siamese contrastive learning.
LLM-based code translation agent using execution alignment to improve cross-language code generation without parallel training data.
Multimodal LLM fine-tuned for image forgery detection and localization with interpretable visual reasoning capabilities.
Divide-and-conquer proof synthesis approach using LLMs to automate formal verification in proof assistants like Coq, improving software quality verification.
Systematic analysis of challenges in transitioning foundation model systems from demos to production, covering reliability, cost, scalability, and compliance issues.
Edge-cloud collaborative VQA system using aligned vector quantization to split vision-language model computation between edge and cloud devices, reducing bandwidth and utilizing edge resources.
Retrieval-augmented generation applied to time-series foundation models for zero-shot forecasting across domains.
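A generic retrieval-augmented forecasting loop — retrieve historical windows similar to the current context and average their continuations — can be sketched as follows. The Euclidean distance, the function signature, and the naive averaging are illustrative assumptions, not the paper's method:

```python
def retrieve_and_forecast(history, context_len, horizon, k=3):
    """Forecast by retrieving the k past windows most similar (Euclidean
    distance) to the most recent context window and averaging their
    continuations. A generic retrieval-for-time-series sketch."""
    query = history[-context_len:]
    candidates = []
    # Slide over history, excluding windows that overlap the query.
    for i in range(len(history) - 2 * context_len - horizon + 1):
        window = history[i:i + context_len]
        dist = sum((a - b) ** 2 for a, b in zip(window, query)) ** 0.5
        continuation = history[i + context_len:i + context_len + horizon]
        candidates.append((dist, continuation))
    candidates.sort(key=lambda c: c[0])
    top = [c[1] for c in candidates[:k]]
    return [sum(vals) / len(vals) for vals in zip(*top)]

# Periodic toy series: the retrieved continuations track the next cycle.
series = [0, 1, 2, 3] * 10
pred = retrieve_and_forecast(series, context_len=4, horizon=2, k=3)
assert pred == [0.0, 1.0]
```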
VarDrop reduces computational cost in multivariate time series forecasting by eliminating variate token redundancy.
ENTER system uses event graphs for interpretable Video QA with code generation and contextual reasoning.
Entropy-based framework with Transformer for next activity prediction in business process monitoring.
LongSpec enables efficient speculative decoding for long-context LLM inference with lossless acceleration for agent applications.
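LongSpec's long-context machinery aside, the lossless guarantee of greedy speculative decoding in general comes from always emitting the target model's own token: the draft only decides how many positions can be checked per round. The toy integer "models" below are illustrative; a real implementation verifies all drafted tokens in a single batched forward pass rather than one call per position:

```python
def speculative_decode(target, draft, prompt, max_new=12, k=4):
    """Greedy speculative decoding sketch: a cheap draft model proposes k
    tokens, the target model verifies them, and only the target's greedy
    choices are ever appended — so output matches plain greedy decoding
    with the target alone ("lossless"). `target` and `draft` map a token
    sequence to the next token."""
    seq = list(prompt)
    while len(seq) < len(prompt) + max_new:
        # Draft proposes k tokens autoregressively.
        proposal = []
        for _ in range(k):
            proposal.append(draft(seq + proposal))
        # Target verifies; on disagreement, discard the rest of the draft.
        for tok in proposal:
            if len(seq) >= len(prompt) + max_new:
                break
            t = target(seq)
            seq.append(t)          # always the target's token => lossless
            if t != tok:
                break
    return seq

# Toy models over integers: target continues +1; draft errs after multiples of 5.
target = lambda s: s[-1] + 1
draft = lambda s: s[-1] + (2 if s[-1] % 5 == 0 else 1)
out = speculative_decode(target, draft, [0], max_new=10)
assert out == list(range(11))   # identical to pure greedy target decoding
```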
Framework measuring hedging and non-affirmation behaviors in LLM responses on human rights topics across identity groups.
NativQA framework extends to multimodality for culturally grounded LLM/VLM evaluation across languages and regions.
LLM-aided tool automates Universal Verification Methodology testbench generation for RTL IC verification.
CMP-RT diagnostic probe reveals tokenization vulnerabilities in safety-aligned LLMs through phonetic perturbations.
Polar decomposition and matrix sign methods optimized for GPU-friendly deep learning training via Muon optimizer.
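The classic Newton-Schulz iteration for the orthogonal polar factor — which Muon-style optimizers approximate using only matrix multiplies, avoiding SVD on the GPU — can be sketched in pure Python. Note the caveat: Muon uses a tuned odd polynomial; the 1.5/-0.5 coefficients below are the textbook variant, shown here only to illustrate the idea:

```python
def matmul(A, B):
    return [[sum(A[i][t] * B[t][j] for t in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(row) for row in zip(*A)]

def frob(A):
    return sum(x * x for row in A for x in row) ** 0.5

def newton_schulz_polar(A, steps=10):
    """Odd-polynomial iteration X <- 1.5 X - 0.5 X X^T X, which drives the
    singular values of X toward 1 and converges to the orthogonal polar
    factor of A. Matmul-only, hence GPU-friendly."""
    n = frob(A)
    X = [[x / n for x in row] for row in A]   # scale so the iteration converges
    for _ in range(steps):
        XXtX = matmul(matmul(X, transpose(X)), X)
        X = [[1.5 * X[i][j] - 0.5 * XXtX[i][j] for j in range(len(X[0]))]
             for i in range(len(X))]
    return X

# A = 2 * (90-degree rotation): its polar factor is the rotation itself.
U = newton_schulz_polar([[0.0, -2.0], [2.0, 0.0]])
assert abs(U[0][1] + 1.0) < 1e-6 and abs(U[1][0] - 1.0) < 1e-6
```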
Multimodal diffusion models synthesize quantum circuits for efficient compilation with reduced hardware calls and runtimes.
HeartcareGPT suite with 400K ECG dataset enables multimodal medical LLMs for dual signal-image ECG understanding.
BulletGen reconstructs 4D dynamic scenes from monocular video using generative models to complete unseen regions.
Survey of continual reinforcement learning covering sequential decision-making, generalization, and adaptation across dynamic tasks.
LaSM defends GUI agents against pop-up injection attacks using layer-wise scaling on multimodal LLMs for safer screen interaction.
Framework for detecting LLM hallucinations in black-box generators by leveraging future context patterns.