Isolater - Feed

Ax Azmine Toushik Wasi, Taki Hasan Rafi, Raima Islam, Serbetar Karlo, Dong-Kyu Chae 3/20/2026

CADGL: Context-Aware Deep Graph Learning for Predicting Drug-Drug Interactions

CADGL uses context-aware deep graph learning for predicting drug-drug interactions with improved generalization and robustness.

Ax Benjamin Th\'erien, Charles-\'Etienne Joseph, Boris Knyazev, Edouard Oyallon, Irina Rish, Eugene Belilovsky 3/20/2026

$\mu$LO: Compute-Efficient Meta-Generalization of Learned Optimizers

μLO derives Maximal Update Parametrization for learned optimizers to improve meta-generalization across network widths and unseen tasks.

Ax Yiming Ma, Jianzhi Teng, Xinjie Li, Xin Sun, Zhiyong Wang, Yuzhou Song, Lionel Z. Wang, Bin Chen 3/20/2026

Modeling Inverse Ellipsometry Problem via Flow Matching with a Large-Scale Dataset

Flow matching approach with large-scale synthetic dataset for solving inverse ellipsometry problem of reconstructing optical film properties.

Ax Yakir Yehuda, Kira Radinsky 3/20/2026

ODE-Constrained Generative Modeling of Cardiac Dynamics for 12-Lead ECG Synthesis

ODE-constrained generative model for synthesizing realistic 12-lead ECG training data to address scarcity of labeled medical recordings.

Ax Jakub Grudzien Kuba, Pieter Abbeel, Sergey Levine 3/20/2026

Cliqueformer: Model-Based Optimization with Structured Transformers

Cliqueformer uses structured transformers for model-based optimization in design problems like protein engineering via offline learning.

Ax \.Ilter Onat Korkmaz, Ya\c{s}ar Cahit Y{\i}ld{\i}r{\i}m, \c{C}a\u{g}{\i}n Ararat, Cem Tekin 3/20/2026

Vector Optimization with Gaussian Process Bandits

VOGP algorithm using Gaussian process bandits for black-box vector optimization with incomplete order relations and Pareto optimality guarantees.

Ax Alec S. Xu, Can Yaras, Peng Wang, Qing Qu 3/20/2026

Linearly Separable Features in Shallow Nonlinear Networks: Width Scales Polynomially with Intrinsic Data Dimension

Theoretical analysis showing shallow nonlinear networks learn linearly separable features with polynomial width scaling relative to data dimension.

Ax Di Chai, Pengbo Li, Feiyuan Zhang, Yilun Jin, Han Tian, Kaiqiang Xu, Binhang Yuan, Dian Shen, Junxue Zhang, Kai Chen 3/20/2026

Unlocking Full Efficiency of Token Filtering in Large Language Model Training

Methods to achieve real-world efficiency gains from token filtering in LLM training through improved sparsity and adaptive filtering strategies.

Ax Khawla Elhadri, Tomasz Michalski, Adam Wr\'obel, J\"org Schl\"otterer, Bartosz Zieli\'nski, Christin Seifert 3/20/2026

This looks like what? Challenges and Future Research Directions for Part-Prototype Models

Survey of Part-Prototype Models for explainable AI, examining interpretability mechanisms and competitive limitations versus alternative approaches.

Ax Jie Shi, Aleksej Cornelissen, Siamak Mehrkanoon 3/20/2026

Integrating Weather Station Data and Radar for Precipitation Nowcasting: SmaAt-fUsion and SmaAt-Krige-GNet

Two neural architectures for precipitation nowcasting integrating weather station data and radar measurements for improved forecast skill.

Ax Sindhuja Madabushi, Ahmad Faraz Khan, Haider Ali, Jin-Hee Cho 3/20/2026

OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning

OPUS-VFL addresses privacy-utility tradeoffs and incentive mechanisms in Vertical Federated Learning with heterogeneous client resources.

Ax Dip Roy 3/20/2026

Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability

Causal intervention framework for interpreting Variational Autoencoders mechanistically, addressing interpretability of generative models.

Ax Xiang Shi, Rui Zhang, Jiawei Liu, Yinpeng Liu, Qikai Cheng, Wei Lu 3/20/2026

Modality Equilibrium Matters: Minor-Modality-Aware Adaptive Alternating for Cross-Modal Memory Enhancement

Shapley Value-based alternating training framework for multimodal fusion that balances dominant and minor modalities.

Ax Antonio Ferrara, Francesco Cozzi, Alan Perotti, Andr\'e Panisson, Francesco Bonchi 3/20/2026

Size-adaptive Hypothesis Testing for Fairness

Statistical framework for fairness testing in algorithmic systems that accounts for sampling error and handles intersectional demographic analysis.

Ax Tongtian Zhu, Tianyu Zhang, Mingze Wang, Zhanpeng Zhou, Can Wang 3/20/2026

On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning

Analysis of communication scheduling in decentralized learning showing benefits of concentrating synchronization in later training stages.

Ax Kyeongjin Ahn, Sungwon Han, Seungeon Lee, Donghyun Ahn, Hyoshin Kim, Jungwon Kim, Jihee Kim, Sangyoon Park, Meeyoung Cha 3/20/2026

GeoReg: Weight-Constrained Few-Shot Regression for Socio-Economic Estimation using LLM

GeoReg uses LLMs with satellite imagery and geospatial data for socio-economic indicator estimation in data-scarce regions via few-shot regression.

Ax Zijian Liu 3/20/2026

Online Convex Optimization with Heavy Tails: Old Algorithms, New Regrets, and Applications

Research on Online Convex Optimization algorithms for heavy-tailed gradient distributions, extending beyond finite variance assumptions.

Ax Dhiraj S Kori, Abhinav Chandraker, Syed Abdur Rahman, Punit Rathore, Ankur Chauhan 3/20/2026

Physics-informed neural network for predicting fatigue life of unirradiated and irradiated austenitic and ferritic/martensitic steels under reactor-relevant conditions

Physics-informed neural network framework predicting fatigue life of steels under nuclear reactor conditions.

Ax Mominul Rubel, Adam Meyers, Gabriel Nicolosi 3/20/2026

Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning

Neural network architecture using nonharmonic Fourier series for scientific machine learning applications.

Ax Sepehr Maleki, Negar Pourmoazemi 3/20/2026

Pi-transformer: A prior-informed dual-attention model for multivariate time-series anomaly detection

Transformer architecture with dual attention for multivariate time-series anomaly detection using temporal invariants.

Ax Shuofeng Zhang, Ard Louis 3/20/2026

Closed-form $\ell_r$ norm scaling with data for overparameterized linear regression and diagonal linear networks under $\ell_p$ bias

Theoretical analysis of parameter norm scaling in overparameterized linear regression and diagonal networks.

Ax Pooneh Mousavi, Lovenya Jain, Mirco Ravanelli, Cem Subakan 3/20/2026

Investigating Faithfulness in Large Audio Language Models

Framework evaluating faithfulness of chain-of-thought reasoning in large audio language models for multimodal tasks.

Ax Elaheh Akbari, Shansita Sharma, Ping He, Ahmadreza Moradipari, Kyungtae Han, Hamed Pirsiavash, Yikun Bai, Soheil Kolouri 3/20/2026

OT-MeanFlow3D: Bridging Optimal Transport and Meanflow for Efficient 3D Point Cloud Generation

Flow-matching models for 3D point cloud generation using optimal transport and meanflow for single-step inference acceleration.

Ax Ange-Cl\'ement Akazan, Verlon Roel Mbingui 3/20/2026

Splines-Based Feature Importance in Kolmogorov-Arnold Networks: A Framework for Supervised Tabular Data Dimensionality Reduction

KAN-based feature selection framework for tabular data via spline-based importance scoring. Specialized ML technique.

Ax Maryam Aliakbarpour, Vladimir Braverman, Junze Yin, Haochen Zhang 3/20/2026

Support Basis: Fast Attention Beyond Bounded Entries

Sub-quadratic attention algorithm removing bounded-entry restrictions for LLM inference speedup. Foundational LLM efficiency research.

Ax Seunghyeon Kim, Taesun Yeom, Jinho Kim, Wonpyo Park, Kyuyeun Kim, Jaeho Lee 3/20/2026

Activation Quantization of Vision Encoders Needs Prefixing Registers

Quantization technique for vision encoders using prefix registers to handle outliers. Optimization research for multimodal models.

Ax Haolin Liu, Chen-Yu Wei, Julian Zimmert 3/20/2026

An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs

Theoretical reinforcement learning on decision-estimation coefficients for adversarial MDPs. Pure RL theory.

Ax Bernardo Perrone Ribeiro, Jana Faganeli Pucer 3/20/2026

FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching

Conditional flow matching for precipitation forecasting. Weather prediction ML, not core AI interests.

Ax Ziyue Wang, Yayati Jadhav, Peter Pak, Amir Barati Farimani 3/20/2026

Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model

Diffusion-Transformer model converting images directly to G-code for 3D printing. Applied ML, domain-specific.

Ax Anil K. Saini, Jose Guadalupe Hernandez, Emily F. Wong, Debanshi Misra, Tiffani J. Bright, Jason H. Moore 3/20/2026

Evolved Sample Weights for Bias Mitigation: Effectiveness Depends on the Fairness Objective

Genetic algorithm for sample reweighting to mitigate ML bias. Fairness-focused, not primary tech interests.

Ax Giulia Lanzillotta, Damiano Meier, Thomas Hofmann 3/20/2026

Heads collapse, features stay: Why Replay needs big buffers

Continual learning research on replay buffer size impact on feature retention vs. classifier forgetting. Specialized ML theory.

Ax Patrick Egenlauf, Iva B\v{r}ezinov\'a, Sabine Andergassen, Miriam Klopotek 3/20/2026

Capturing reduced-order quantum many-body dynamics out of equilibrium via neural ordinary differential equations

Neural ODEs for quantum many-body dynamics simulation. Physics-focused ML, not core AI interests.

Ax Yifan Zhang, Wei Bi, Kechi Zhang, Dongming Jin, Jie Fu, Zhi Jin 3/20/2026

Weights to Code: Extracting Interpretable Algorithms from the Discrete Transformer

Algorithm extraction from Discrete Transformers via symbolic program synthesis. Addresses representation entanglement in interpretability.

Ax Jiquan Wang, Sha Zhao, Yangxuan Zhou, Yiming Kang, Shijian Li, Gang Pan 3/20/2026

DeeperBrain: A Neuro-Grounded EEG Foundation Model Towards Universal BCI

EEG foundation model for brain-computer interfaces with biophysical grounding. Neuroscience domain, not AI/tech stack focused.

Ax Injin Kong, Hyoungjoon Lee, Yohan Jo 3/20/2026

Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models

Research analyzing mechanistic changes when post-training autoregressive models into masked diffusion models. Studies model internals via circuit analysis.

Ax Brijesh FNU, Viet Thanh Duy Nguyen, Ashima Sharma, Md Harun Rashid Molla, Chengyi Xu, Truong-Son Hy 3/20/2026

Multimodal Machine Learning for Soft High-k Elastomers under Data Scarcity

Machine learning for materials science: multimodal models predict dielectric elastomer properties under limited data. Domain-specific ML, not AI-focused.

Ax Qinglun Li, Anke Tang, Miao Zhang, Mengzhu Wang, Quanjun Yin, Li Shen 3/20/2026

A Unified Generalization Framework for Model Merging: Trade-offs, Non-Linearity, and Scaling Laws

Unified theoretical framework for model merging explaining effectiveness across heterogeneous fine-tuning hyperparameters with scaling laws.

Ax Rebecca Pelke, Joel Klein, Jose Cubero-Cascante, Nils Bosbach, Jan Moritz Joseph, Rainer Leupers 3/20/2026

Mixed-Precision Training and Compilation for RRAM-based Computing-in-Memory Accelerators

Mixed-precision training and compilation techniques for RRAM-based computing-in-memory ML accelerators with low bit-width constraints.

Ax Aneeqa Mehrab, Jan Willem Van Looy, Pietro Demurtas, Stefano Iotti, Emil Malucelli, Francesca Rossi, Ferdinando Zanchetta, Rita Fioresi 3/20/2026

Sheaf Neural Networks and biomedical applications

Application of sheaf neural networks to biomedical problems comparing performance against GCNs, GATs, and GraphSage.

Ax Rares Grozavescu, Pengyu Zhang, Etienne Meunier, Mark Girolami 3/20/2026

Koopman Autoencoders with Continuous-Time Latent Dynamics for Fluid Dynamics Forecasting

Continuous-time Koopman autoencoder for surrogate modeling of time-dependent PDEs in fluid dynamics.

Ax Jingkun Liu, Yisong Yue, Max Welling, Yue Song 3/20/2026

Krause Synchronization Transformers

Krause Attention: principled attention mechanism addressing representation collapse and attention sink issues in transformers.

Ax Nicolas Zumarraga, Thomas Kaar, Ning Wang, Maxwell A. Xu, Max Rosenblattl, Markus Kreft, Kevin O'Sullivan, Paul Schmiedmayer, Patrick Langer, Robert Jakob 3/20/2026

TS-Haystack: A Multi-Scale Retrieval Benchmark for Time Series Language Models

Multi-scale retrieval benchmark for time series language models addressing long-context temporal localization under computational constraints.

Ax Shruti Joshi, Aaron Mueller, David Klindt, Wieland Brendel, Patrik Reizinger, Dhanya Sridhar 3/20/2026

Causality is Key for Interpretability Claims to Generalise

Position paper on causal inference requirements for valid and generalizable interpretability claims in LLM research.

Ax Ali Saheb Pasand, Johan Obando-Ceron, Aaron Courville, Pouya Bashivan, Pablo Samuel Castro 3/20/2026

Stable Deep Reinforcement Learning via Isotropic Gaussian Representations

Deep reinforcement learning stability improvement through isotropic Gaussian embeddings under non-stationary training dynamics.

Ax Sunki Hong, Jisoo Lee, Yuanyuan Shi 3/20/2026

Benchmarking State Space Models, Transformers, and Recurrent Networks for US Grid Forecasting

Benchmark comparing state space models, transformers, and RNNs for US power grid electricity demand forecasting.

Ax Xuanhao Mu, Jakob Geiges, Nan Liu, Thorsten Schlachter, Veit Hagenmeyer 3/20/2026

Improving Spatial Allocation for Energy System Coupling with Graph Neural Networks

Graph neural network approach for spatial allocation in energy system coupling with mismatched resolutions.

Ax Hung-Hsuan Chen 3/20/2026

CeRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion

CeRA: improved parameter-efficient fine-tuning method that surpasses LoRA's linear constraints via manifold expansion with gating and dropout.

Ax Yongzhong Xu 3/20/2026

Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training

Analysis of transformer training trajectories under AdamW showing low-dimensional drift directions and batch-gradient alignment patterns.

Ax Daniel S. Berman, Brian Merritt, Stanley Ta, Dana Udwin, Amanda Ernlund, Jeremy Ratcliff, Vijay Narayan 3/20/2026

What You Read is What You Classify: Highlighting Attributions to Text and Text-Like Inputs

Explainable AI method for highlighting token attributions in text classification using transformers.

Ax Nilesh Jain, Rohit Yadav, Sagar Kotian, Claude AI 3/20/2026

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

Framework for autonomous neural architecture and hyperparameter search using self-evaluating RL agents without human supervision.