When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction
Systematic study of target context conditioning for molecular property prediction across protein families and data regimes.
TwinLoop: simulation-in-the-loop digital twin framework for online multi-agent reinforcement learning with context shifts.
Physics-driven neural network for estimating wheel polygonal roughness from vibration signals in rail vehicles.
SubFLOT: federated learning method using optimal transport for personalized submodel extraction in heterogeneous settings.
SHAPE framework for LLM reasoning using stage-aware hierarchical advantage estimation to improve process supervision efficiency.
FlowAdam: hybrid optimizer augmenting Adam with geometry-aware soft momentum injection for handling parameter couplings.
GraphWalker: graph-guided in-context learning framework for LLM-based clinical reasoning on electronic health records.
Method for improving classification calibration using generative perspective to regularize cross-entropy loss in deep networks.
Bi-Lipschitz autoencoder with injectivity guarantee for dimensionality reduction while preserving manifold geometry.
Federated learning approach for training time series foundation models using bi-level heterogeneous learning to address gradient conflicts.
Framework for extracting linearized neural network models via knowledge distillation for photonic hardware compatibility.
EmBolic: hyperbolic deep learning architecture for emotion analysis from text using Busemann energy-based attention.
Philosophical examination of machine learning through rhetoric lens, arguing ML is inherently rhetorical rather than objective.
Empirical study of Voronoi tessellations in LLM latent spaces, validating scaling laws of expressibility gaps.
Instance-adaptive variational autoencoder addressing amortization gap in latent variable models through per-instance parametrization.
MoBiE: Binarization framework for efficient inference of Mixture-of-Experts LLMs with post-training quantization techniques.
OmniTabBench: Large-scale benchmark comparing GBDTs, neural networks, and foundation models on tabular data with 100+ datasets.
STQuant framework for adaptive quantization of optimizer states during large multimodal model training to reduce memory costs.
Theoretical analysis of Bellman residual minimization for solving Markov decision processes under linear function approximation.
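As a generic illustration of the setting (not the paper's analysis): with linear features Φ, Bellman residual minimization solves the least-squares problem min_w ||Φw − (r + γPΦw)||². A minimal numpy sketch on a hypothetical 3-state MDP, using tabular (identity) features so the minimizer recovers the exact value function:

```python
import numpy as np

# Toy 3-state MDP (hypothetical example): transition matrix P under a
# fixed policy, reward vector r, discount factor gamma.
P = np.array([[0.5, 0.5, 0.0],
              [0.0, 0.5, 0.5],
              [0.5, 0.0, 0.5]])
r = np.array([1.0, 0.0, -1.0])
gamma = 0.9

# Linear features: identity (tabular) here, so the Bellman residual
# minimizer should recover the exact V = (I - gamma P)^{-1} r.
Phi = np.eye(3)

# Minimize || Phi w - (r + gamma P Phi w) ||^2, i.e. least squares
# in the matrix A = Phi - gamma P Phi.
A = Phi - gamma * P @ Phi
w, *_ = np.linalg.lstsq(A, r, rcond=None)

V_exact = np.linalg.solve(np.eye(3) - gamma * P, r)
print(np.allclose(Phi @ w, V_exact))  # tabular features => exact recovery
```

With non-tabular Φ the same least-squares problem yields the biased but well-defined residual minimizer the theory studies.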
Neural operator enhancement method for dynamical systems combining Fourier-based operators with diffusion-based high-frequency recovery.
JAX-based differentiable framework for vertex-modeling epithelial tissue mechanics with automatic differentiation and GPU acceleration.
Decentralized multi-agent RL approach for vehicle-to-infrastructure systems using equivariant neural networks.
Efficient scaling technique for diffusion RL post-training using low-precision exploration and higher-precision training.
Neural method for learning search policies in Traveling Salesperson Problem, training models to iteratively improve solutions.
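For context, the classical improvement operator such learned policies typically guide is 2-opt local search. A minimal sketch of plain 2-opt on random points (generic illustration, not the paper's learned method):

```python
import itertools
import math
import random

def tour_length(tour, pts):
    """Total length of a closed tour over 2D points."""
    return sum(math.dist(pts[tour[i]], pts[tour[(i + 1) % len(tour)]])
               for i in range(len(tour)))

def two_opt(tour, pts):
    """Classical 2-opt: repeatedly reverse a segment if it shortens the tour.
    A learned search policy would replace this exhaustive move scan."""
    improved = True
    while improved:
        improved = False
        for i, j in itertools.combinations(range(1, len(tour)), 2):
            cand = tour[:i] + tour[i:j][::-1] + tour[j:]
            if tour_length(cand, pts) < tour_length(tour, pts) - 1e-12:
                tour, improved = cand, True
    return tour

random.seed(0)
pts = [(random.random(), random.random()) for _ in range(20)]
init = list(range(20))
best = two_opt(init, pts)
print(tour_length(best, pts) <= tour_length(init, pts))  # True
```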
Frailty assessment framework for elderly oncology patients using multimodal wearable data and multi-instance learning.
Wearable-based stress estimation in elderly cancer patients using multimodal smartwatch and ECG data with multi-instance learning.
Transfer learning formalism using Outcome-Predictive State Representations for knowledge generalization across RL tasks.
Nonstationary classification approach using learned retrieval to condition classifiers on historical examples beyond training cutoff.
Study of expert specialization and routing behavior in sparse Mixture-of-Experts architectures for large language models at small scale.
Computer-assisted proof providing counterexample to open question about AdaBoost convergence to finite cycles.
Production application of residual reinforcement learning to automate electronic control unit calibration in vehicles.
Offline reinforcement learning approach addressing epistemic uncertainty through ensemble-based conservative value estimation.
Evaluation of ensemble deep clustering methods versus traditional approaches for disease subtype detection in electronic health records.
Theoretical analysis of multi-objective bandits comparing computational complexity to single-objective bandit problems.
Method to enhance LLM task performance by amplifying task-relevant neurons at inference time without parameter modification.
Theoretical framework addressing catastrophic forgetting in continual learning through informational structural alignment rather than external mechanisms.
Novel divisive clustering algorithm using mutual reachability minimum spanning trees to detect clusters of varying sizes and densities.
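Mutual reachability distance (familiar from HDBSCAN) is the standard ingredient here: mr(a, b) = max(core_k(a), core_k(b), d(a, b)). A hedged sketch of computing it and building the MST with Prim's algorithm on hypothetical two-blob data; the paper's divisive splitting logic is not reproduced, only the shared substrate:

```python
import numpy as np

def mutual_reachability(X, k=3):
    """Pairwise mutual reachability: max(core_k(a), core_k(b), d(a, b)),
    where core_k is the distance to the k-th nearest neighbour."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    core = np.sort(D, axis=1)[:, k]  # column 0 is the self-distance
    return np.maximum(D, np.maximum(core[:, None], core[None, :]))

def prim_mst_edges(D):
    """Prim's algorithm on a dense distance matrix; returns MST edge weights."""
    n = len(D)
    in_tree = [0]
    best = D[0].copy()
    edges = []
    for _ in range(n - 1):
        best[in_tree] = np.inf
        j = int(np.argmin(best))
        edges.append(best[j])
        in_tree.append(j)
        best = np.minimum(best, D[j])
    return edges

rng = np.random.default_rng(0)
# Two well-separated blobs (synthetic data): the largest MST edge over
# mutual reachability distances is the natural split between clusters.
X = np.vstack([rng.normal(0, 0.1, (10, 2)), rng.normal(5, 0.1, (10, 2))])
edges = sorted(prim_mst_edges(mutual_reachability(X)))
print(edges[-1] > 2 and edges[-2] < 2)  # the bridge edge dominates
```

Divisive variants then cut the largest such edges first, which is what makes variable-density clusters recoverable.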
Research on using multi-turn reasoning LLMs with deep reinforcement learning for task offloading decisions in mobile edge computing systems.
Framework for generating synthetic financial time series that model both distributions and temporal dynamics using Schrödinger-Bass Bridge methods.
Hierarchical reinforcement learning framework for military aviation maintenance and logistics decisions, addressing fleet-scale decision-making under uncertainty.
Research on calibrating uncertainty quantification in LLMs for question-answering through token-level temperature scaling, addressing gaps in existing confidence measures.
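The underlying primitive is classic temperature scaling: fit a scalar T on held-out logits to minimize negative log-likelihood of softmax(logits / T). A minimal numpy sketch on synthetic data (generic calibration, not the paper's token-level scheme; a token-level variant would fit T per token position):

```python
import numpy as np

def nll(logits, labels, T):
    """Average negative log-likelihood of labels under softmax(logits / T)."""
    z = logits / T
    z = z - z.max(axis=1, keepdims=True)  # numerical stabilisation
    logp = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return -logp[np.arange(len(labels)), labels].mean()

def fit_temperature(logits, labels, grid=np.linspace(0.25, 5.0, 200)):
    """Single-parameter calibration: pick T minimising held-out NLL."""
    return min(grid, key=lambda T: nll(logits, labels, T))

# Synthetic overconfident model (hypothetical data): true logits scaled
# by 3, so the recovered temperature should land near 3.
rng = np.random.default_rng(0)
true_logits = rng.normal(size=(5000, 10))
probs = np.exp(true_logits) / np.exp(true_logits).sum(axis=1, keepdims=True)
labels = np.array([rng.choice(10, p=p) for p in probs])
T = fit_temperature(3.0 * true_logits, labels)
print(2.0 < T < 4.0)
```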
Mixture proportion estimation from unlabeled data using conditional independence assumptions. Application to PU learning, label noise, and domain adaptation.
SDE-based method for constructing diffusion processes on implicit data manifolds using only point clouds. Data-driven approach without geometric primitives.
Computational complexity analysis of ML model expressiveness for complex systems. Studies how ML manages complexity through probability on sampleable distributions.
Theoretical study of differential privacy cost for language identification and generation. Establishes algorithms and lower bounds quantifying privacy-utility tradeoff.
Categorical framework formalizing deep learning model architectures using array broadcasting and morphisms. Mathematical notation for neural network composition.
Comparative analysis of SHAP explainability method applied to different ML models. Reviews interpretability for black-box model predictions.
Training method for Android UI agents improving RL efficiency using a single-state, multiple-actions paradigm to reduce sample inefficiency and emulator latency.
Graph neural networks with neural ODEs for thermal-hydraulic forecasting in nuclear reactors under partial observability. Physics-informed surrogate modeling.
Split learning framework with frequency-aware compression reducing communication overhead in distributed neural network training on resource-constrained edge devices.