Zigeng Chen, Gongfan Fang, Xinyin Ma, Ruonan Yu, Xinchao Wang 7d ago

DMax: Aggressive Parallel Decoding for dLLMs

DMax enables efficient parallel decoding in diffusion language models through progressive self-refinement.
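As a rough illustration of what threshold-based parallel decoding in a diffusion LM can look like, here is a minimal, hypothetical sketch (not DMax's actual algorithm): at each refinement step, all masked positions whose model confidence clears a threshold are committed at once, rather than unmasking one token per step. The `confidences` and `proposals` inputs stand in for model outputs, which are assumed given.

```python
# Hypothetical sketch of threshold-based parallel decoding for a diffusion LM.
# Not the DMax algorithm itself; confidences/proposals stand in for model output.

MASK = None  # marker for an undecided position

def parallel_decode_step(tokens, confidences, proposals, threshold=0.9):
    """Commit every masked position whose confidence >= threshold."""
    out = list(tokens)
    committed = 0
    for i, tok in enumerate(tokens):
        if tok is MASK and confidences[i] >= threshold:
            out[i] = proposals[i]
            committed += 1
    # Fallback: commit the single most confident masked position so the
    # process still makes progress when nothing clears the threshold.
    if committed == 0:
        masked = [i for i, t in enumerate(tokens) if t is MASK]
        if masked:
            best = max(masked, key=lambda i: confidences[i])
            out[best] = proposals[best]
    return out

def decode(confidences, proposals, length, threshold=0.9):
    """Iterate refinement steps until no masked positions remain."""
    tokens = [MASK] * length
    steps = 0
    while MASK in tokens:
        tokens = parallel_decode_step(tokens, confidences, proposals, threshold)
        steps += 1
    return tokens, steps
```

With confidences `[0.95, 0.5, 0.99]`, two positions are committed in the first step and the low-confidence one via the fallback in the second, so the sequence finishes in two steps instead of three.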

Andrey Bocharnikov, Ivan Ermakov, Denis Kuznedelev, Vyacheslav Zhdanovskiy, Yegor Yershov 7d ago

KV Cache Offloading for Context-Intensive Tasks

A KV cache offloading technique that reduces memory and latency overhead for long-context LLM inference.
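The basic idea behind KV cache offloading can be sketched as follows (a simplified, hypothetical model, not this paper's system): keep only the most recent tokens' key/value entries resident in fast GPU memory and spill older entries to host memory, reading them back over a slower path when attention needs them.

```python
# Hypothetical sketch of KV cache offloading: a fixed GPU-resident budget,
# with older entries spilled to host memory. Dicts stand in for device buffers.

class OffloadingKVCache:
    def __init__(self, gpu_budget):
        self.gpu_budget = gpu_budget  # max entries resident in fast memory
        self.gpu = {}                 # token position -> (key, value), "on GPU"
        self.host = {}                # offloaded entries, "on host"

    def append(self, pos, kv):
        """Add a new KV entry, evicting the oldest resident entry if over budget."""
        self.gpu[pos] = kv
        while len(self.gpu) > self.gpu_budget:
            oldest = min(self.gpu)
            self.host[oldest] = self.gpu.pop(oldest)

    def get(self, pos):
        """Fast path reads from GPU memory; slow path reads an offloaded entry."""
        if pos in self.gpu:
            return self.gpu[pos]
        return self.host[pos]
```

Real systems overlap the host transfers with compute (e.g. prefetching the entries the next attention layer will need), which is where the latency savings come from.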

Mu Nan, Muquan Yu, Weijian Mai, Jacob S. Prince, Hossein Adeli, Rui Zhang, Jiahang Cao, Benjamin Becker, John A. Pyles, Margaret M. Henderson, Chunfeng Song, Nikolaus Kriegeskorte, Michael J. Tarr, Xiaoqing Hu, Andrew F. Luo 7d ago

Meta-learning In-Context Enables Training-Free Cross Subject Brain Decoding

A meta-learning approach that decodes brain signals in-context, without per-subject training.

Xiangru Jian, Hao Xu, Wei Pang, Xinjian Zhao, Chengyu Tao, Qixin Zhang, Xikun Zhang, Chao Zhang, Guanzhi Deng, Alex Xue, Juan Du, Tianshu Yu, Garth Tarr, Linqi Song, Qiuzhuang Sun, Dacheng Tao 7d ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

A benchmark dataset and evaluation suite for multimodal LLMs in manufacturing scenarios.

Mohamed Ehab (Faculty of Computer Science, October University for Modern Science & Arts, Giza, Egypt), Ali Hamdi (Faculty of Computer Science, October University for Modern Science & Arts, Giza, Egypt), Khaled Shaban (Department of Computer Science and Engineering, Qatar University, Doha, Qatar) 7d ago

CAMO: A Class-Aware Minority-Optimized Ensemble for Robust Language Model Evaluation on Imbalanced Data

CAMO is an ensemble technique for imbalanced text classification that optimizes minority class performance through hierarchical voting, confidence calibration, and uncertainty estimation.
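One simple ingredient of minority-aware ensembling can be illustrated with a class-weighted vote (a heavily simplified, hypothetical sketch; CAMO's actual method layers hierarchical voting, confidence calibration, and uncertainty estimation on top of this): each base classifier's vote is weighted by the inverse training frequency of the class it votes for, so rare classes are not drowned out by the majority class.

```python
from collections import Counter

# Hypothetical sketch of minority-weighted ensemble voting, not CAMO itself:
# votes for rare classes carry more weight than votes for frequent ones.

def class_weights(train_labels):
    """Inverse-frequency weights: rarer classes get larger weights."""
    counts = Counter(train_labels)
    total = sum(counts.values())
    return {c: total / n for c, n in counts.items()}

def minority_weighted_vote(predictions, weights):
    """Combine one predicted label per base classifier into a weighted vote."""
    scores = Counter()
    for label in predictions:
        scores[label] += weights.get(label, 1.0)  # unseen classes weight 1.0
    return scores.most_common(1)[0][0]
```

With a 9:1 class imbalance, a single vote for the rare class (weight 10) outweighs two votes for the frequent class (weight ~1.1 each), which is the intended behavior on imbalanced data.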

Mohammad Siavashi, Mariano Scazzariello, Gerald Q. Maguire Jr., Dejan Kostić, Marco Chiesa 7d ago

Blink: CPU-Free LLM Inference by Delegating the Serving Stack to GPU and SmartNIC

Blink is an LLM serving architecture that removes the host CPU from the critical path by delegating orchestration and token control to GPU and SmartNIC, improving inference performance and datacenter resource utilization.