Isolater - Feed

Ax ZhiShu Jiang, Haibo Liu, Xin Shen, Guanqiang QI, Chenxi Miao, Weikang Li, Liwei Qian, Xin Pei, Jizhou Huang 9d ago

Learning User-Aware Recall: Personalized Retrieval in Long-Term Conversational Memory

arXiv paper on personalized retrieval for long-term conversational agents using profile-guided memory recall in LLM-based systems.

Ax George Stamatelis, Hui Chen, Henk Henk Wymeersch, George C. Alexandropoulos 9d ago

Active Sensing for RIS-Aided Tracking and Power Control: A Hybrid Neuroevolution and Supervised Learning Approach

Hybrid neuroevolution and supervised learning for RIS-aided mobile tracking with power-efficient localization using feedback control.

Ax Santanu Ganguly, Xing Liang, Dimitrios Makris 9d ago

Spectral Geometry and Bosonic-Bloch Probes: Explorations in Quantum Learning

Quantum machine learning study of spectral geometry in graph-regularized quantum networks using two-boson interference probes.

Ax Xiaoxiong Zhang, Xiong Zeng, Wei Zhang 9d ago

From World Models to World Action Models: A Concise Tutorial for Robotics

Tutorial on world models as action-conditioned predictive models for embodied AI, comparing observation-space vs state-space approaches with trade-offs.

Ax Zhishang Xiang, Zerui Chen, Yunbo Tang, Zhimin Wei, Ruqin Ning, Yujie Lin, Qinggang Zhang, Jinsong Su 9d ago

MemSyco-Bench: Benchmarking Sycophancy in Agent Memory

MemSyco-Bench evaluates sycophancy in LLM-agent memory systems where retrieved memories cause over-alignment with users at cost of factual accuracy.

Ax Jingwei Song, Haofeng Xu, Jie Xiao, Chengke Bao, Jingwei Shi, Pengbin Feng, Weixun Wang, Yuhang Han, Chuan Wu, Linfeng Zhang, Bill Shi 9d ago

Staleness-Learning Rate Scaling Laws for Asynchronous RLHF

Study of stale rollout effects in asynchronous GRPO for high-throughput RLHF, analyzing learning-rate scaling laws for decoupled policy optimization.

Ax Yiyao Yang 9d ago

Multilayer Q-Matrix-Embedded Neural Network for Cognitive Diagnosis (M-QCDNet): Structure-Aware Deep Learning Architecture for Psychometric Interpretability

M-QCDNet integrates Q-matrices as structural priors into neural networks for cognitive diagnosis maintaining interpretability with psychometric theory.

Ax Yuan Si, Jialu Zhang 9d ago

Fixed-Set Robustness in Programming by Example: Example Corruption and Semantic Partition Recovery

Study of adversarial robustness in programming-by-example systems when examples are corrupted by adversaries aware of the synthesizer.

Ax Wenting Ma, Zhipeng Zhang, Xiaohang Yuan, Ningwei Xie, Yuxin Xie, Xiaolin Wang, Meng Guo, Xingang Chai, Zhenjie Yao 9d ago

Domain Knowledge Based Temporal-Spatial Graph Convolution Network for ECG Recognition

Domain knowledge-based graph convolution network for ECG recognition emphasizing interpretability in healthcare AI using cardiac landmarks.

Ax Matthew J Liu, Wei Hang Zheng, Vidhan Purohit, Siqi Xie, Chieh-En Li, Jerry Li, Noah Flynn 9d ago

Scaling Laws for Grid-Based Approximate Nearest Neighbor Search in High Dimensions

Scaling analysis of grid-based approximate nearest neighbor search revealing d-scaling crossover behavior on embeddings as dimensionality increases.

Ax Sakthi Prabhu Gunasekar, Prasanna Kumar Rangarajan 9d ago

IonSense-QKG: A Quantum-Readiness Metadata Framework for Lithium-Ion Battery Dataset Discovery

Metadata framework for lithium-ion battery dataset discovery, addressing variability in chemistry, modality, and preprocessing for ML applications.

Ax Paulo R. Ferreira Jr., Lucas Coutinho Freitas, La\'is dos Santos Gon\c{c}alves, William Borges Domingues, Lucas Petitemberte de Souza, Mariana B. Michalowski, Vinicius F. Campos 9d ago

A Novel Machine Learning Approach for Central Nervous System Tumor Classification from DNA Methylation

ML approach for CNS tumor classification using DNA methylation with sparse random projection dimensionality reduction and rigorous cross-cohort evaluation.

Ax Zhilin Zhao 9d ago

From Approximation to Emergence: A Theory of Deep Learning

Comprehensive theoretical book/paper unifying deep learning theory from approximation foundations through overparameterization, transformers, in-context learning, scaling laws, and emergence.

Ax Christopher Ellis, Shreyas Chaudhari, Mei-Yu Wang, Leighton Barnes, Giulia Fanti, Jos\'e M. F. Moura 9d ago

Black-Box Inference of LLM Architectural Properties with Restrictive API Access

Methods to infer LLM architectural properties (hidden dimensions, feed-forward layers) via black-box API access with restricted logits, studying commercial provider protections.

Ax Paimon Goulart, Chansong Lim, N\'icolas Roque dos Santos, Yue Dong, Sheldon Peterson, Jia Chen, Evangelos E. Papalexakis 9d ago

Multi-modal Rail Crossing Safety Analysis

Multi-modal AI system combining images and accident reports for railway crossing safety assessment using vision and structured data.

Ax Maria Elkj{\ae}r Montgomery, Christian Igel, Mikkel Odgaard, Martin Sillesen, Mads Nielsen 9d ago

How Should Transformers Encode Numeric Values in Electronic Health Records?

Systematic comparison of numeric encoding strategies (discrete, continuous, hybrid) for transformers on EHR data, evaluating precision and optimization stability trade-offs.

Ax Mengyu Li, Guoyao Shen, Chad W. Farris, Xin Zhang 9d ago

NeuroBridge: Bridging Multi-Task MRI Knowledge for Neurodegenerative Disease Diagnosis

NeuroBridge framework combines self-supervised MRI pretraining with multi-task learning for Alzheimer's and dementia diagnosis from brain imaging.

Ax Chenxing Liang, Yuchao Lin, Andrii Kryvenko, Wendi Yu, Chuan Li, Jianwen Xie, Xiaofeng Qian, Shuiwang Ji 9d ago

Spin-Weighted Spherical Harmonics Enable Complete and Scalable $\mathrm{E}(3)$-Equivariant Networks

SpinGTP approach improves scalability of E(3)-equivariant networks for 3D molecular modeling by generalizing Gaunt tensor products with spin-weighted spherical harmonics.

Ax Daniel Thi Graviet, Lovre Pesut, Ivan Dagelic, Vedran Jukic, Ivan Burazin 9d ago

The Rollout Infrastructure Tax in Coding-Agent Reinforcement Learning

Analysis of execution infrastructure overhead in coding-agent RL systems, measuring efficiency gains from different container/sandbox substrates for interactive rollouts.

Ax Robert Milletich, Justin Downes, Steve Goley, Newel Hirst 9d ago

Conditional Inference Trees and Forests for Feature Selection

Study of conditional inference trees/forests for feature selection using permutation tests, comparing computational efficiency vs ranking accuracy on benchmarks.

Ax Atsuki Yamaguchi, Szymon Palucha, L\'eo Bijar, Aline Villavicencio, Nikolaos Aletras 9d ago

On the Utility and Factual Reliability of Pruned Mixture-of-Experts Models in the Biomedical Domain

Evaluates factual reliability of pruned mixture-of-experts models in biomedical domain, examining trade-offs between inference speedup and accuracy.

Ax Sergei Kucherenko, Nilay Shah 9d ago

Geometry-Aware R-Structured Kolmogorov-Arnold Networks

GRS-KAN hybrid architecture combines Kolmogorov-Arnold Networks with R-functions to learn smooth structures and encode geometric constraints analytically.

Ax Kathan Shah 9d ago

Token Geometry

Ember lightweight optimizer exploits gradient geometry of embedding tables and LM-heads, improving Pareto frontier for finetuning, RL, and pretraining.

Ax Haemin Park, Diego Klabjan, Martin W. Braun, Xiuqi Li, Balakrishnan Ananthanarayanan 9d ago

Class-Grouped Normalized Momentum and Faster Hyperparameter Exploration to Tackle Class Imbalance in Federated Learning

FedCGNM optimizer for federated learning addresses class imbalance via class-grouped momentum and faster hyperparameter exploration.

Ax Fabian Schaipp 9d ago

How to Allocate Your Tokens? Scaling Laws with Training Steps and Batch Size

Three-term scaling law for LLM training explicitly modeling batch size and training steps, enabling robust fitting with fewer training runs.

Ax Juliette Decugis, Sean O'Brien, Francis Bach, Gabriel Synnaeve, Taco Cohen 9d ago

Don't Let Gains FADE: Breaking Down Policy Gradient Weights in RL

Framework decomposing advantage functions for RL post-training in LLMs, unifying diverse advantage formulations to address training instability.

Ax Thomas Boudou, Batiste Le Bars, Nirupam Gupta, Aur\'elien Bellet 9d ago

Unveiling the Non-Monotonic Effect of Privacy on Generalization under Byzantine Robustness

Shows privacy-generalization relationship in distributed learning depends on noise regime, contradicting prior Byzantine robustness trilemma results.

Ax Kevin Wang, Kevin Yang, Arjun Prakash, Amy Greenwald 9d ago

Towards Learning Representations of Policies in Two-Player Zero-Sum Imperfect-Information Games

Methods for creating policy datasets and learning policy embeddings in two-player zero-sum imperfect-information games with evaluation tasks.

Ax Lukas Haverbeck, Carmen Amo Alonso, Andres Felipe Posada-Moreno, Sebastian Trimpe, Marco Pavone 9d ago

The risk of KV cache compression

Theoretical analysis of KV cache compression in transformer inference showing when compression is impossible and deriving fundamental limits.

Ax Jiatong Li, Samuel Yeh, Sharon Li 9d ago

Multi-Head Recurrent Memory Agents

Multi-Head Recurrent Memory Agents diagnose reliability degradation in long-context LLMs, attributing failures to memory retention rather than capture.

Ax Abdullah Al Tasim, Wei Sun 9d ago

Wind-Aware Reinforcement Learning Control of a Small Quadrotor Using Learned Onboard Wind Estimation in Simulated Atmospheric Turbulence

Two-stage learning pipeline for quadrotor control: estimates wind from onboard sensors, then uses estimates in RL flight controller.

Ax Ege Onur Taga, Yilin Zhuang, M. Emrullah Ildiz, Petros Mol, Abhimanyu Das, Karthik Duraisamy, Samet Oymak 9d ago

Evolutionary Feature Engineering for Structured Data

EFE framework uses LLM-based evolutionary optimization to discover preprocessing transformations for structured data as Python programs.

Ax Leyan Li, Rennong Yang, Zhenxing Zhang, Liping Hu 9d ago

X-LogSMask: Expand Transformer for Graph-Structured Data

X-LogSMask modifies transformer architecture with explainable multi-head attention for improved performance on sparse, structured graph data.

Ax Aria Masoomi, Mahsa Bazzaz, Adel Javanmard, Vahab Mirrokni 9d ago

Geometric Signatures of Reasoning: A Spectral Perspective on Task Hardness

Studies geometric properties of chain-of-thought reasoning trajectories in transformer hidden states to understand task difficulty and reasoning mechanisms.

Ax Zewen Liu 9d ago

BOUNDARY_SYNC: Measuring Communication-Induced Representational Coupling in Multi-Agent LLM Systems

BOUNDARY_SYNC measures representational coupling in multi-agent LLM systems, quantifying how inter-agent communication causes convergence or divergence.

Ax Saoud Aldowaish, Yashwanth Karumanchi, Kai-Chen Chiang, Mohammed Ayman Habib, Finn Murphy, Rishen Cao, Morteza Fayazi 9d ago

SINA: A Fully Automated Circuit Schematic Image to Netlist Generator Using Artificial Intelligence

SINA uses AI to convert circuit schematic images to machine-readable netlists for electronic design automation tasks.

Ax Wenbo Zhang 9d ago

MKGR: Multimodal Knowledge-Graph Representation Learning for Cold-Start Protein-Protein Interaction Prediction

MKGR multimodal framework predicts protein-protein interactions for cold-start scenarios combining knowledge graphs and representation learning.

Ax Haotian Xie, Junlin Chen, Mingkai Zheng, Lishan Yang, Zhao Zhang 9d ago

DeadPool: Resilient LLM Training with Hot-Swapping via Zero-Overhead Checkpoint

DeadPool enables resilient LLM training at scale by implementing hot-swapping with zero-overhead checkpointing for GPU failure recovery.

Ax Jueqi Wang, Zachary Jacokes, John Darrell Van Horn, Kevin A. Pelphrey, Michael C. Schatz, Archana Venkataraman 9d ago

CALM: Interpretable Cross-Modal Alignment for Biomarker Discovery from Unpaired Data

CALM framework learns interpretable associations between brain ROIs and genetic pathways from disjoint populations using cross-modal alignment.

Ax Wei Xu, An Liu 9d ago

Message Passing Based Two-Timescale Bayesian Learning for Joint Channel and Memory Hardware Impairments Tracking

Message-passing Bayesian deep learning framework for joint channel and hardware impairment tracking in MIMO systems.

Ax Hao Zhou, Xiaoyu Wang, Chang Yao, Mingli Song, Yuanyu Wan 9d ago

Revisiting Decentralized Online Convex Optimization with Compressed Communication

Follow-the-regularized-leader algorithms for decentralized online convex optimization with compressed communication.

Ax Ronghui Xu, Tongxin Wu, Guozhen Zhang, Yihan Li, Chenjuan Guo, Bin Yang, Yong Li 9d ago

UniWind: Toward Unified Day-Ahead Wind Power Forecasting via Physics-Informed State Routing

Physics-informed state routing for unified wind power forecasting with meteorological and operational constraints.

Ax Mingkai Zheng, Junlin Chen, Haotian Xie, Zhao Zhang 9d ago

SCAPE: Accurate and Efficient LLM Training with Extreme Sparse Communication

Communication-efficient LLM training via extreme sparse gradient synchronization with stable sparse Adam optimizer.

Ax Tzu-Heng Huang, Aditya Goyal, John Cooper, Frederic Sala 9d ago

WARP: Weight-Space Analysis for Recovering Training Data Portfolios

Method for inferring training data mixture weights and source distributions from released foundation models.

Ax Long Minh Bui, Tuan Anh Le Van, Tung Phi Duc, Phi Le Nguyen, Jana Doppa, Trong Nghia Hoang 9d ago

Model Merging as Probabilistic Inference in Fine-Tuning Parameter Space

Probabilistic inference framework for merging task-specific fine-tuned models into multi-task solutions.

Ax Jianfeng Lu 9d ago

A Mathematical Introduction to Diffusion Models

Mathematical introduction to diffusion models covering sampling dynamics, error analysis, and inference-time control.

Ax Xiong Xiong, Ruonan Zhai, Zheng Zeng, Sheng Zhou, Rongchun Hu, Zichen Deng 9d ago

Frequency Shift Physics-Informed Extreme Learning Machine for Solving High-Frequency Partial Differential Equations

Physics-informed extreme learning machine addressing spectral bias for solving high-frequency PDEs.

Ax Nikolai Smolyanskiy 9d ago

Predicting Closed-Loop Performance of Latent World Models: Offline Checkpoint Selection for MPC and Model-Based RL Under Non-Markovian Rewards in LunarLander

Validation diagnostics for selecting optimal checkpoints of latent world models in model predictive control and model-based RL.

Ax Cheng Wan, Quyu Kong, Feng Zhou 9d ago

Efficient Temporal Point Processes via Monotone Alternating Splines

Monotone alternating splines for efficient temporal point process modeling via cumulative conditional intensity.

Ax He Huang, Lu Shen, Yunfeng Huang, Li Qi 9d ago

Role-Aware Neural Convex Divergence Heads for Asymmetric Representation Learning

Neural divergence heads for asymmetric representation learning in directed relational tasks.