Isolater - Feed

Ax Zhixiong Zhao, Haomin Li, Fangxin Liu, Yuncheng Lu, Zongwu Wang, Tao Yang, Li Jiang, Haibing Guan 3/26/2026

QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

QUARK is an FPGA acceleration framework using quantization to exploit common patterns in transformer nonlinear operations for efficient inference.

Ax Sebastián Andrés Cajas Ordóñez, Luis Fernando Torres Torres, Mackenzie J. Meni, Carlos Andrés Duran Paredes, Eric Arazo, Cristian Bosch, Ricardo Simon Carbajo, Yuan Lai, Leo Anthony Celi 3/26/2026

Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts

Proposes a curiosity-driven quantized Mixture-of-Experts framework that uses Bayesian uncertainty to deploy neural networks on resource-constrained devices.

Ax Perceval Beja-Battais (CB), Alain Grossetête (CB), Nicolas Vayatis (CB) 3/26/2026

Enhancing Nuclear Reactor Core Simulation through Data-Based Surrogate Models

Uses data-driven surrogate models to improve Model Predictive Control for nuclear reactor core simulation.

Ax Radman Rakhshandehroo, Daniel Coombs 3/26/2026

Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning

ContagionRL is a Gymnasium-compatible RL platform for reward engineering in spatial epidemic simulations, enabling systematic study of learned behavioral strategies.

Ax Haichen Hu, David Simchi-Levi 3/26/2026

Perturbing the Derivative: Doubly Wild Refitting for Model-Free Evaluation of Opaque Machine Learning Predictors

Presents a wild refitting method for evaluating excess risk in empirical risk minimization without requiring knowledge of the function class structure.

Ax Mingxing Rao, Bowen Qu, Daniel Moyer 3/26/2026

Latent Diffusion Inversion Requires Understanding the Latent Space

Investigates model inversion attacks on latent diffusion models, showing non-uniform memorization patterns across latent codes.

Ax Xiang Rao, Yina Liu, Yuxuan Shen 3/26/2026

Quantum-Classical Physics-Informed Neural Networks for Solving Reservoir Seepage Equations

Applies quantum-classical physics-informed neural networks to reservoir seepage modeling across multiple flow equations.

Ax Abhisek Ganguly, Santosh Ansumali, Sauro Succi 3/26/2026

Deep Neural Networks as Discrete Dynamical Systems: Implications for Physics-Informed Learning

Analyzes the relationship between deep neural networks and discrete dynamical systems, comparing PINN solutions to standard numerical methods for PDEs.

Ax Sihan Zeng, Sujay Bhatt, Sumitra Ganesh, Alec Koppel 3/26/2026

A Hessian-Free Actor-Critic Algorithm for Bi-Level Reinforcement Learning with Applications to LLM Fine-Tuning

Develops a Hessian-free actor-critic algorithm for bi-level RL optimization with applications to LLM fine-tuning, addressing second-order information requirements in policy optimization.

Ax Ziwei Liu, Borui Kang, Hangjie Yuan, Zixiang Zhao, Wei Li, Yifan Zhu, Tao Feng 3/26/2026

Continual GUI Agents

Introduces a continual learning task for GUI agents that must adapt to shifting domains and resolutions over time, identifying failure modes in existing agent methods.

Ax Bjarni Haukur Bjarnason, André Silva, Martin Monperrus 3/26/2026

On Randomness in Agentic Evals

Study of variance in agentic system evaluations using 60,000 trajectories on SWE-Bench-Verified. Shows that pass@1 estimates vary significantly across runs, calling single-run reliability assumptions into question.

Ax Yuzhu Cai, Zexi Liu, Xinyu Zhu, Cheng Wang, Siheng Chen 3/26/2026

AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

AceGRPO proposes adaptive curriculum learning with group relative policy optimization for autonomous ML engineering agents, addressing behavioral stagnation in LLM-based agents through RL with efficient data selection.

Ax Elias Malomgré, Pieter Simoens 3/26/2026

Interactionless Inverse Reinforcement Learning: A Data-Centric Framework for Durable Alignment

Framework for learning inspectable alignment through inverse RL without direct policy modification, improving reusability and transparency.

Ax Egor Denisov, Svetlana Glazyrina, Maksim Kryzhanovskiy, Roman Ischenko 3/26/2026

Smooth Gate Functions for Soft Advantage Policy Optimization

Soft advantage policy optimization using smooth gate functions instead of hard clipping for stable LLM training and reasoning.

Ax Sunki Hong, Jisoo Lee 3/26/2026

Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting

Comprehensive benchmark comparing state space models, transformers, and recurrent networks for US power grid electricity demand forecasting.

Ax Afshin Khadangi 3/26/2026

Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns

Continual learning architecture for LLMs preventing catastrophic forgetting during sequential updates using thalamically routed cortical columns.

Ax Xiang Li, Yuheng Zhang, Nan Jiang 3/26/2026

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies

Offline reinforcement learning with parametric policies under general function approximation beyond state-wise mirror descent.

Ax Eman M. AbouNassar, Amr Elshall, Sameh Abdulah 3/26/2026

FedPBS: Proximal-Balanced Scaling Federated Learning Model for Robust Personalized Training for Non-IID Data

Federated learning algorithm addressing statistical heterogeneity and non-IID data with proximal-balanced scaling for privacy-preserving training.

Ax Mikoto Kudo, Takumi Tanabe, Akifumi Wachi, Youhei Akimoto 3/26/2026

Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning

Sample-efficient hypergradient estimation for decentralized bi-level reinforcement learning in strategic decision-making and environment design.

Ax Gregor Kornhardt, Jannis Chemseddine, Christian Wald, Gabriele Steidl 3/26/2026

Self-Aware Markov Models for Discrete Reasoning

Masked discrete diffusion model with self-aware Markov transition kernels enabling adaptive reasoning and error correction in discrete tasks.

Ax Lucas Maes, Quentin Le Lidec, Damien Scieur, Yann LeCun, Randall Balestriero 3/26/2026

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Stable end-to-end joint embedding predictive architecture learning world models from raw pixels without representation collapse.

Ax Celal Alagöz, Mehmet Kurnaz, Farhan Aadil 3/26/2026

MRMS-Net and LMRMS-Net: Scalable Multi-Representation Multi-Scale Networks for Time Series Classification

Multi-scale convolutional architectures for time series classification using diverse input representations and multi-representation learning.

Ax Giacomo Borghi, Hyesung Im, Lorenzo Pareschi 3/26/2026

Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training

Theoretical framework for population-based neural network training combining fast within-model optimization with slower population-level adaptation.

Ax Woosung Koh, Jeyoung Jeon, Youngjin Song, Yujin Cheon, Soowon Oh, Jaehyeong Choi, Se-Young Yun 3/26/2026

mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT

Multi-task supervised fine-tuning algorithm addressing heterogeneous overfitting across dataset mixtures with overfitting-aware data allocation.

Ax Yuze Qin, Qingyong Li, Zhiqing Guo, Wen Wang, Yan Liu, Yangli-ao Geng 3/26/2026

Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors

Precipitation nowcasting model combining radar observations with weather foundation model priors to improve long-lead forecasting accuracy.

Ax Eric Czech, Zhiwei Xu, Yael Elmatad, Yixin Wang, William Held 3/26/2026

Problems with Chinchilla Approach 2: Systematic Biases in IsoFLOP Parabola Fits

Analysis of systematic biases in Chinchilla scaling law fitting method applied to LLM training, showing parameter allocation errors in compute-optimal estimates.

Ax Nan Qiao, Shuning Wang, Sijing Duan, Wenpeng Cui, Yuzhe Chen, Qingchen Yang, Xingyuan Hua, Ju Ren 3/26/2026

Cloud-Edge Collaborative Large Models for Robust Photovoltaic Power Forecasting

Cloud-edge collaborative system for photovoltaic power forecasting using large models with latency constraints and robustness to weather distribution shifts.

Ax Donya Jafari, Farzan Farnia 3/26/2026

DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Method for routing prompts to optimal LLMs/generative models using diversity-aware adaptive selection beyond fidelity scores.

Ax Huaming Du, Cancan Feng, Yuqian Lei, Chenyang Zhang, Guisong Liu, Gang Kou, Carl Yang, Yu Zhao 3/26/2026

A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data and LLMs Perspective

Survey on enterprise financial risk prediction using big data and LLMs, covering AI/computer science approaches to finance and management risk analysis.

Ax Rohan Shad, Cyril Zakka, Dhamanpreet Kaur, Mrudang Mathur, Robyn Fong, Joseph Cho, Ross Warren Filice, John Mongan, Kimberly Kalianos, Nishith Khandwala, David Eng, Matthew Leipzig, Walter R. Witschey, Alejandro de Feria, Victor A. Ferrari, Euan A. Ashley, Michael A. Acker, Curtis Langlotz, William Hiesinger 3/26/2026

A Generalizable Deep Learning System for Cardiac MRI

Self-supervised deep learning system for cardiac MRI analysis. Vision model trained via contrastive learning from visual concepts and text descriptions.

Ax Yining Wu, Shengyu Duan, Gaole Sai, Chenhong Cao, Guobing Zou 3/26/2026

Accelerating Matrix Factorization by Dynamic Pruning for Fast Recommendation

Dynamic pruning method to accelerate matrix factorization for recommendation systems. Reduces computational complexity in collaborative filtering with large user/item bases.

Ax Arthur Jacot, Alexandre Kaiser 3/26/2026

Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets

Theoretical study of feature learning in Leaky ResNets via Hamiltonian mechanics. Analyzes representation geodesics and bottleneck structures in infinite-depth limits.

Ax Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring 3/26/2026

Set2Seq Transformer: Temporal and Position-Aware Set Representations for Sequential Multiple-Instance Learning

Set2Seq Transformer for temporal multiple-instance learning with permutation-invariant set representations. Models internal structure and temporal relationships across timesteps.

Ax E-Ro Nguyen, Hieu Le, Dimitris Samaras, Michael S. Ryoo 3/26/2026

Phrase-Instance Alignment for Generalized Referring Segmentation

Instance-level reasoning for generalized referring segmentation. Reformulates GRES to predict instance-aware masks with phrase-to-visual correspondence.

Ax Parsa Moradi, Mohammad Ali Maddah-Ali 3/26/2026

General Coded Computing in a Probabilistic Straggler Regime

Coded computing schemes for distributed systems with probabilistic stragglers. Extends exact computation frameworks to handle approximate recovery scenarios.

Ax Nina Corvelo Benz, Stratis Tsirtsis, Eleni Straitouri, Ivi Chatzi, Ander Artola Velasco, Suhas Thejaswi, Manuel Gomez-Rodriguez 3/26/2026

Evaluation of Large Language Models via Coupled Token Generation

Causal framework for evaluating LLMs controlling for randomization in token generation. Proposes coupled generation model for fair model comparison and ranking.

Ax Judy Hanwen Shen, Ellen Vitercik, Anders Wikum 3/26/2026

Algorithms with Calibrated Machine Learning Predictions

Framework integrating ML prediction uncertainty into online algorithm design. Uses calibration to leverage prediction-level confidence in algorithms with predictions.

Ax Abdullahi Isa Ahmed, Jamal Bentahar, El Mehdi Amhoud 3/26/2026

Energy-Efficient UAV-assisted LoRa Gateways: A Multi-Agent Optimization Approach

Multi-agent optimization for UAV-assisted LoRa IoT gateways. Addresses energy efficiency in next-generation IoT networks.

Ax Leo Zhang, Peter Potaptchik, Jiajun He, Yuanqi Du, Arnaud Doucet, Francisco Vargas, Hai-Dang Dau, Saifuddin Syed 3/26/2026

Accelerated Parallel Tempering via Neural Transports

Neural transport methods to accelerate Parallel Tempering MCMC sampling. Improves sample efficiency on high-dimensional and multimodal distributions.

Ax Merkourios Simos, Alberto Silvio Chiappa, Alexander Mathis 3/26/2026

KINESIS: Motion Imitation for Human Musculoskeletal Locomotion

Model-free RL framework for human motion imitation with musculoskeletal constraints. Improves on torque-controlled humanoids by modeling biomechanical realism.

Ax Andreas Panayiotou, Panayiotis Charalambous, Ioannis Karamouzas 3/26/2026

Gen-C: Populating Virtual Worlds with Generative Crowds

Gen-C: Generative framework for simulating high-level crowd behaviors in virtual environments. Captures agent-agent and agent-environment interactions over time.

Ax Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya 3/26/2026

Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej

VidhikDastaavej: Model-agnostic wrapper for automated legal document generation in Indian context. Introduces large-scale anonymized dataset for long-form legal drafting.

Ax Zechen Li, Lanqing Yang, Yiheng Bian, Hao Pan, Yongjian Fu, Yezhou Wang, Zhuxi Chen, Yi-Chao Chen, Guangtao Xue 3/26/2026

Wideband RF Radiance Field Modeling Using Frequency-embedded 3D Gaussian Splatting

3D Gaussian Splatting technique for wideband RF signal modeling across multiple frequency bands. Extends single-frequency 3DGS to handle diverse RF environments.

Ax Daniele Ravasio, Alessio La Bella, Marcello Farina, Andrea Ballarino 3/26/2026

Recurrent neural network-based robust control systems with regional properties and application to MPC design

RNN-based control system design using linear matrix inequalities for output-feedback and state-feedback. Applies incremental ISS stability for robust tracking.

Ax Rodrigo Pérez Ortiz, Gibbs Nwemadji, Jean Barbier, Federica Gerace, Alessandro Ingrosso, Clarissa Lauditi, Enrico M. Malatesta 3/26/2026

Generalization performance of narrow one-hidden layer networks in the teacher-student setting

Theoretical analysis of generalization in one-hidden-layer neural networks using the teacher-student framework. Provides a complete characterization for generic activation functions.

Ax Zhihao Luo, Wentao Yan, Jingyu Gong, Min Wang, Zhizhong Zhang, Xuhong Wang, Yuan Xie, Xin Tan 3/26/2026

NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks

Unified agent framework (NaviMaster) handling both GUI navigation and embodied navigation tasks via MDP formulation. First model to combine disparate domains with shared training paradigm.

Ax Amirhossein Shahbazinia, Darong Huang, Luis Costero, David Atienza 3/26/2026

CloudFormer: An Attention-based Performance Prediction for Public Clouds with Unknown Workload

Attention-based ML model predicting cloud performance under unknown workload in multi-tenant environments. Addresses resource contention in virtualized infrastructure.

Ax Julian Cremer, Tuan Le, Mohammad M. Ghahremanpour, Emilia Sługocka, Filipe Menezes, Djork-Arné Clevert 3/26/2026

FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction

Flow-matching model for 3D ligand generation and binding affinity prediction in drug discovery. SE(3)-equivariant architecture with multi-endpoint prediction capabilities.

Ax Yuan Wang, Mingyu Li, Haibo Chen 3/26/2026

From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents

Declarative OS interfaces for computer-use agents to replace GUIs, enabling LLMs to execute high-level goals with fewer API calls and less decomposition.

Ax Yijie Xu, Huizai Yao, Zhiyu Guo, Pengteng Li, Aiwei Liu, Xuming Hu, Weiyu Guo, Hui Xiong 3/26/2026

You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs

SyTTA: Label-free test-time adaptation for LLMs in specialized domains using only 4 extra tokens to mitigate distribution shifts.