Isolater - Feed

Ax Nikolai Smolyanskiy 9d ago

Predicting Closed-Loop Performance of Latent World Models: Offline Checkpoint Selection for MPC and Model-Based RL Under Non-Markovian Rewards in LunarLander

Diagnostic framework for predicting closed-loop performance of world models in model-based RL without explicit validation metrics.

Ax Stefano Masini, Cecilia Viscardi, Michela Baccini 9d ago

Full Bayesian Reinforcement Learning via LF-IBIS

Bayesian reinforcement learning method addressing data scarcity through prior knowledge and belief updates in sequential decision-making.

Ax Yuan Wang, Shujian Gao, Songtao Jiang, Zhengyu Hu, Zuozhu Liu 9d ago

MedStreamBench: A Time-Aware Benchmark for Streaming and Proactive Medical Video Understanding

Medical video benchmark for time-aware clinical AI predictions. Evaluates when models should answer vs defer in real-world deployment.

Ax Siyuan Zhang, Nachuan Xiao, Xin Liu 9d ago

Decentralized Stochastic Subgradient-type Methods with Communication Compression for Nonsmooth Nonconvex Optimization

Decentralized optimization with communication compression for nonsmooth problems. Distributed computing theory, marginal relevance.

Ax Jae-Ryung Hong, Ho-Joong Kim, Seong-Whan Lee 9d ago

ProCal: Inference-Time Proposal Calibration for Open-Vocabulary Object Detection

Open-vocabulary object detection calibration using frozen VLMs. Vision-language model application with limited novelty.

Ax Lizhou Liu, Xiaohui Chen, Zihan Tang, Mengyao Ma, Wenyi Zhang 9d ago

Scene-Conditioned PINN-GNN for Multipath RF Maps: Cross-Scene Generation and In-Scene Completion

Physics-informed neural networks and GNNs for RF map construction and multipath propagation. Wireless domain-specific, limited relevance.

Ax Ahin Lee, Sehyun Yun, Taesik Gong 9d ago

EPnG: Adaptive Expert Prune-and-Grow for Parameter-Efficient MoE Fine-tuning

Adaptive expert pruning and growing for efficient MoE fine-tuning using LoRA. Parameter-efficient training for large models.

Ax Shenghui Zhang, YuXuan Gao, Songwei Zhao, Jifeng Hu, Zijing Zhang, Hechang Chen 9d ago

Lightweight Safe Reinforcement Learning for End-to-End UAV Navigation

Safe reinforcement learning for UAV navigation with explicit safety mechanisms. Robotics application with limited relevance.

Ax Rowan Hussein, Mohamed Ouf 9d ago

Single-Channel EEG-Based Cognitive Load Assessment in Online Learning: A Hybrid Deep Learning Approach

EEG-based cognitive load assessment for online learning using deep learning. Healthcare/education application, minimal AI research depth.

Ax Rodrigo Mendoza-Smith 9d ago

Expander Sparse Autoencoders: Parameter-Efficient Dictionaries for Mechanistic Interpretability

Expander sparse autoencoders for mechanistic interpretability with reduced parameters. ML research on interpretability and efficient dictionaries.

Ax Weiwei Xu, Xuanning Cui, Hengzhi Ye, Minghui Zhou 9d ago

Decoupling Code Complexity from Newcomer Participation: A Causal Study of AI Coding Agent Adoption in OSS

Causal study of AI coding agent adoption on open-source projects, analyzing impact on newcomer participation. Developer tools and OSS ecosystem.

Ax Yuanzhi Liu, Shousheng Zhao, Bo Zhou, Kongming Liang, Zhanyu Ma 9d ago

MMBench-Live: A Continuously Evolving Benchmark for Multimodal Models

Continuously evolving multimodal benchmark using multi-agent pipeline for VLM evaluation. Developer tools and evaluation frameworks.

Ax Xuan-Phi Nguyen, Shrey Pandit, Yiran Zhao, Semih Yavuz, Silvio Savarese, Shafiq Joty 9d ago

Mixture-of-Parallelisms: Towards Memory-Efficient Training Stack for Mixture-of-Experts Models

Memory-efficient training stack combining parallelism techniques for MoE models. ML research on scalable training infrastructure.

Ax Yewon Kim, Apurva Gandhi, David Chung, Graham Neubig, Chris Donahue 9d ago

Decomposer: Learning to Decompile Symbolic Music to Programs

Music decompilation framework recovering executable programs from MIDI via post-training. Niche domain-specific application.

Ax Valentin J. J. Kreileder, Johannes Reisinger, Andreas Fischer 9d ago

Evaluating Chunking Strategies for Retrieval-Augmented Generation on Academic Texts

Evaluation of chunking strategies for RAG systems on academic texts using RAGAS framework. Directly relevant to LLM applications and retrieval techniques.

Ax Yongyi Ji, Jiaji Wang, Yi Zhou, Fuxiang Chen, Hongji Yang 9d ago

An Exploratory Study on LLM-Generated Code and Comments in Code Repositories

Empirical study of LLM-generated code and comments in real repositories, analyzing prevalence and quality concerns. Developer tools and LLM applications.

Ax Qi Lyu, Jiahua Dong, Baichen Liu, Xudong Wang, Mingfei Han, Yulun Zhang, Fahad Shahbaz Khan, Salman Khan, Lianqing Liu, Zhi Han 9d ago

SAB-LVLM: Significance-Aware Binarization for Large Vision-Language Models

Binarization technique for vision-language models reducing memory/latency for deployment. Model optimization for efficient inference.

Ax Yuriy Maksyuta, George Bredis, Ruslan Rakhimov, Daniil Gavrilov 9d ago

Rank-Then-Act: Reward-Free Control from Frame-Order Progress

Reward-free reinforcement learning from video using VLM as progress scorer and GRPO objective. AI agents and novel training approach.

Ax Yidan Xu, Xiangmin Han, Rundong Xue, Huihui Ye 9d ago

SABER: A Semantic-Aligned Brain Network Analysis Framework via Multi-scale Hypergraphs

Brain disease diagnosis framework integrating LLM semantics with brain connectivity analysis via hypergraphs. LLM application with healthcare focus.

Ax Francisco Sede\~no, Francisco Chicano, Jamal Toutouh 9d ago

Population-Based Multi-Objective Training of Discriminators for Semi-Supervised GANs

Population-based evolutionary training for semi-supervised GANs with multi-objective optimization. Limited relevance to core interests.

Ax Jo\~ao Henrique Inacio de Souza, Mattia Merluzzi, Mateus P. Mota, Beatriz Soret, Petar Popovski 9d ago

Low-Latency Task-Oriented Image Transmission with Opportunistic Spectrum Access

Image transmission framework using VQ-VAE for latency reduction under spectrum constraints. Not directly relevant to user interests.

Ax Baran Bingol, Bahaeddin Turkoglu 9d ago

TUDUM: A Turkish-Thinking Reasoning Pipeline for Qwen3.5-27B

Pipeline for adapting Qwen 27B model to perform reasoning in Turkish rather than English, addressing multilingual LLM reasoning.

Ax Javier Irigoyen, Roberto Daza, Francisco Jurado, Julian Fierrez, Ruben Tolosana, Alvaro Ortigosa, Enrique Blas, Aythami Morales 9d ago

AIriskEval-edu: New Dataset for Risk Assessment in AI-mediated K-12 Educational Explanations

Dataset of 1,639 K-12 science explanations with human and LLM-generated alternatives for training risk assessment auditors.

Ax Nicholas Tagliapietra, Gian Lorenzo Marchioni, Moritz Willig, Juergen Luettin, Lavdim Halilaj, Kristian Kersting 9d ago

CausalSteward: An Agentic Divide-Conquer-Combine Copilot for Causal Discovery

CausalSTeward agentic divide-conquer-combine system for causal discovery integrating prior knowledge to identify causal models from high-dimensional data.

Ax Peng Yun, Shouwang Huang, Hao Li, Jinxi Li, Jianan Wang, Bo Yang 9d ago

PhysMani: Physics-principled 3D World Model for Dynamic Object Manipulation

PhysMani framework combines physics-principled 3D Gaussian world model with action policy for dynamic object manipulation in embodied AI.

Ax Zhiren Gong, Zihao Zeng, Chau Yuen, Wei Yang Bryan Lim 9d ago

Conditional Co-Ablation: Recovering Self-Repair Backups in Transformer Circuits

Conditional co-ablation technique reveals transformer self-repair mechanisms where dormant backups activate after primary component ablation.

Ax Minjong Cheon 9d ago

Robust for the Wrong Reasons: The Representational Geometry of LLM Robustness to Science Skepticism

Analyzes representational geometry showing LLMs become robust to science skepticism through problematic mechanisms rather than genuine understanding.

Ax Jinxi Li, Tianyi Zhang, Yafei Yang, Zihui Zhang, Peng Huang, Koon Wing Macgyver Lin, Bo Yang 9d ago

NeoMap: Training-free Novel-View Synthesis from Single Images and Videos

Novel view video synthesis method from single images using pre-trained video models without task-specific fine-tuning.

Ax Jan Drchal 9d ago

Object Aligner: A Configurable JSON Schema Similarity Score for Graphs, Applied to LLM Prompt Optimization

Object Aligner provides configurable JSON schema similarity scoring for measuring LLM output structure alignment in tool calling and agentic systems.

Ax Sofiane Ouaari, Kevin Vorwalder, Nico Pfeifer 9d ago

Assessing VLM Reliability for Medical Image Quality Evaluation Under Corruption and Bias

Evaluates Vision-Language Model reliability for medical image quality assessment under corruption and bias conditions.

Ax Beile Ning, Jiayi Yu, Zitong Wang, Yufei Hu, Wenjun Xu, Yuanhang Qian, Zhongxin Bai, Gongping Huang 9d ago

A Multi-Branch Hierarchy-Aware Framework for Heterogeneous Audio Classification

DCASE 2026 Challenge system for audio classification using CLAP audio-text representations with taxonomy-aware hierarchy constraints.

Ax Wenda Wang, Yihan Tong, Yuwei Hu, Zhewei Wei 9d ago

MolSight: A Graph-Aware Vision-Language Model for Unified Chemical Image Understanding

MolSight vision-language model combines molecular LLMs with graph-aware visual understanding for molecular structure and drug discovery tasks.

Ax Tasnim Shahriar 9d ago

Do Newer Lightweight CNNs Perform Better Under Resource Constraints? A Controlled Multigenerational Study of Architecture, Initialization, Training Budget, and Efficiency

Controlled study comparing nine lightweight CNN architectures across multiple datasets and hardware to assess efficiency claims.

Ax Shrikara Arun, Anjaly Parayil, Srikant Bharadwaj, Renee St. Amant, Victor R\"uhle 9d ago

Towards Load-Aware Prefill Deflection for Disaggregated LLM Serving

Proposes load-aware prefill deflection technique to improve disaggregated LLM serving efficiency by balancing prefill and decode GPU pools.

Ax Rheeya Uppaal, Seungwoo Lyu, Selina Sung, Junjie Hu 9d ago

OpenSafeIntent: Evaluating Intent-Calibrated Safe Completion Across Dual-Use Prompt Sets

OpenSafeIntent benchmark evaluates whether LLMs calibrate assistance appropriately across benign, dual-use, and malicious intent variants.

Ax Anna Chorna 9d ago

SPLIT: Cross-Lingual Empathy and Cultural Grounding in English and Ukrainian LLM Responses

SPLIT benchmark evaluates LLM cross-lingual empathy and cultural grounding in emotional-support contexts across English and Ukrainian.

Ax Prathamesh Patil, Arpit Jain, Aswanth Krishnan 9d ago

Beyond the Performance Illusion: Structure-Aware Stratified Partitioning and Curriculum Distributionally Robust Optimization for Spatially Correlated Domains

Demonstrates performance evaluation failures in spatiotemporally correlated domains due to data leakage from non-i.i.d. splits.

Ax Florian Tambon, Michael Konstantinou, Cedric Richter, Charles Chenouard, Mark Harman, Mike Papadakis 9d ago

Prompt Coverage Adequacy

Proposes prompt coverage adequacy testing framework to guide LLM and autonomous agent testing when prompts replace traditional code.

Ax Yang Li, Pan Hu, Yan Zhang, Wenfan Yang, Tao Wu, Lianbo Guo 9d ago

SA-HGNN: Sample-Adaptive Hyperbolic Graph Neural Network for EEG-Based Depression Recognition

Graph Neural Network model for EEG-based depression recognition using hyperbolic geometry to capture hierarchical brain network structure.

Ax Mahmoud Abdelfattah, Hamid Nasiri, Peter Garraghan 9d ago

kNNGuard: Turning LLM Hidden Activations into a Training-Free Configurable Guardrail

kNNGuard presents training-free guardrail for LLMs using activation space of off-the-shelf models to detect unsafe/adversarial prompts with minimal labeled data.

Ax Dipika Rajesh, Ahmed Khalifa, Julian Togelius 9d ago

Evolutionary Wave Function Collapse

Combines Wave Function Collapse procedural generation with evolutionary search to evolve input examples for level generation.

Ax Tien-Huy Nguyen, Minh-Nhat Nguyen, Nguyen Nhat Huy, Hung Viet Nguyen, Huy Nguyen Minh Nhat, Thanh-Huy Nguyen, Cuong Tuan Nguyen, Hoang M. Le, Dat Nguyen, Phat Kim Huynh, Min Xu, Ulas Bagci 9d ago

ESC: Emotional Self-Correction for Reliable Vision-Language Models

Introduces emotional self-correction mechanism for vision-language models to improve reasoning reliability without post-training or engineered feedback.

Ax Liuhaichen Yang, Zhuang Jiang, Chenchao Sheng, Zezhi Tang 9d ago

Guided Action Flow: Q-Guided Inference for Flow-Matching Vision-Language-Action Policies

Proposes test-time guidance framework for vision-language-action policies using learned critic to guide flow-matching inference without retraining base models.

Ax Haoran Wang, Jinchuan Tian, Siddhant Arora, Shinji Watanabe 9d ago

An Efficient vLLM-Based Inference Pipeline for Unified Audio Understanding and Generation

Presents vLLM-based inference pipeline for unified audio understanding and generation in speech language models with multi-token prediction support.

Ax William Hackett, Peter Garraghan 9d ago

Behind the Refusal: Determining Guardrail Activation via Behavioral Monitoring

Develops behavioral monitoring techniques to detect and analyze guardrail activations in LLMs, enabling black-box security testing of production AI systems.

Ax Yilie Huang, Wenpin Tang, Xun Yu Zhou 9d ago

ART for Diffusion Sampling: Continuous-Time Control and Actor-Critic Learning

Proposes Adaptive Reparameterized Time (ART) continuous-time control for optimizing timestep allocation in score-based diffusion sampling via actor-critic learning.

Ax Debopriya Ghosh 9d ago

Predicting Early Stages Of Alzheimer's Disease And Identifying Key Biomarkers Using Deep Artificial Neural Network And Ensemble Of Machine Learning Methodologies

Applies deep neural networks and ensemble methods to predict early-stage Alzheimer's disease and identify biomarkers from medical data.

Ax Di Wu, Huan Liu, Zhixiang Chi, Yuanhao Yu, Konstantinos N. Plataniotis, Yang Wang 9d ago