Isolater - Feed

Ax Ye Liu, Srijan Bansal, Bo Pang, Yang Li, Zeyu Leo Liu, Yifei Ming, Zixuan Ke, Shafiq Joty, Semih Yavuz 10d ago

Procedural Memory Distillation: Online Reflection for Self-Improving Language Models

Procedural Memory Distillation: method for language models to retain and reuse procedural information across episodes for self-improvement through online reflection.

Ax Hongyang He, Jiuming Liu, Victor Sanchez 10d ago

Revisiting Chain-of-Thought Reasoning under Limited Supervision: Semi-supervised Chain-of-Thought Learning

Semi-CoT: framework for semi-supervised chain-of-thought learning that reuses generated reasoning traces as learning signals to improve LLM reasoning capabilities.

Ax René Carmona, Mathieu Laurière 10d ago

Mean Field Reinforcement Learning

Monograph introducing mean field reinforcement learning through Markov decision processes and large-population stochastic control with mathematical framework.

Ax Yang Xiang, Philipp Götz, Emanuël A. P. Habets, Andreas Walther, Wenwu Wang, Philip J. B. Jackson 10d ago

Quantifying the Uncertainty of Blindly Estimated Room Embeddings Using a Dispersion-Calibrated Score

Framework for learning robust room embeddings from reverberant speech with uncertainty quantification using dispersion-calibrated scoring without downstream supervision.

Ax David Courtis, Wenhao Li, Scott Sanner 10d ago

OPINE-World: Programmatic World Modeling with Ontology-error-Prioritized Interactive Exploration

OPINE-World: programmatic world modeling using LLMs and counterexample-guided synthesis to generate data-efficient, reusable environment models for agent adaptation.

Ax Jinliang Xu, Liping Ma 10d ago

MMAO-Cls: Metabolic Multi-Agent Optimization for Joint Feature Selection and Classifier Tuning

Proposes MMAO-Cls using metabolic multi-agent optimization as outer-loop optimizer for joint feature selection and classifier hyperparameter tuning.

Ax Zhaoyan Sun, Shan Zhong, Daizhou Wen, Jiaxing Han, Guoliang Li, Ying Yan, Peng Zhang, Yu Su, Xiang Qi, Baolin Sun, Chengyuan Yang, Tao Fang, Huaiyu Ruan 10d ago

AgenticDataBench: A Comprehensive Benchmark for Data Agents

AgenticDataBench: benchmark for evaluating LLM-based data agents on automating data science workflows including data wrangling, analysis, and visualization tasks.

Ax Mona Rajhans, Vishal Khawarey 10d ago

Beyond Gradient-Based Attacks: Adversarial Robustness and Explainability Stability in Cybersecurity Classifiers

Studies adversarial robustness and explainability stability of cybersecurity classifiers using SHAP-based explanations across multiple datasets and attack methods.

Ax Joshua Penman 10d ago

Epistemic Goggles: A Pretrained Module that Induces an Epistemic Frame via Gradient Editing

Introduces Goggles, a learned module using gradient editing to improve language models' ability to recognize fictional content, addressing the negation neglect problem.

Ax Zongxia Li, Dawei Liu, Fuxiao Liu, Yuhang Zhou, Xiyang Wu, Jingxi Chen, Jing Xie, Xiaomin Wu, Lichao Sun 10d ago

COMFYCLAW: Self-Evolving Skill Harnesses for Image Generation Workflows

COMFYCLAW: agentic system with self-evolving skill harnesses for image generation workflows, enabling agents to recall patterns and user preferences from prior runs.

Ax Yiquan Gao 10d ago

Quantum-Inspired Vision: Leveraging Wave-Particle Duality for Low-Illumination Enhancement

Theoretical framework applying wave-particle duality concepts to low-illumination image enhancement via Data Relativistic Uncertainty paradigm.

Ax Stefano Masini, Cecilia Viscardi, Michela Baccini 10d ago

Full Bayesian Reinforcement Learning via LF-IBIS

Full Bayesian reinforcement learning approach via Likelihood-Free Iterative Bayesian Importance Sampling for data-scarce settings.

Ax Siyuan Zhang, Nachuan Xiao, Xin Liu 10d ago

Decentralized Stochastic Subgradient-type Methods with Communication Compression for Nonsmooth Nonconvex Optimization

Decentralized optimization framework for nonsmooth nonconvex problems with communication compression and error compensation.

Ax Andikawati P Widjaja, Yongjun Kim, Hyounghun Kim, Jaeho Lee 10d ago

PARTREP: Learning What to Repeat for Decoder-only LLMs

PARTREP method enabling decoder-only LLMs to learn selective prompt repetition patterns, improving reasoning by redistributing contextual grounding across positions.

Ax Igor Mezić, Jorge Cortés, Karl Worthmann, Mircea Lazar, Armin Lederer 10d ago

Koopman operator theory: fundamentals, control, and applications

Review of Koopman operator theory for linearizing nonlinear dynamical systems, covering data-driven techniques like EDMD and machine learning methods.

Ax Wenchen Han, Gingfung Matthew Yeung, Marco Barletta, William Toner, Amory Hoste, Adam Barker 10d ago

Lynx: Progressive Speculative Quantization for accelerating KV Transfer in Long-Context Inference

Lynx: progressive speculative KV cache quantization technique for accelerating long-context LLM inference in retrieval-augmented generation and agentic systems.

Ax Tristan Kirscher (ICube), Kim-Celine Kahl (DKFZ), Balint Kovacs (DKFZ), Maximilian R. Rokuss (DKFZ), Klaus Maier-Hein (DKFZ), Xavier Coubez (ICube), Philippe Meyer (ICube), Sylvain Faisan (ICube) 10d ago

Rethinking Post-Hoc Calibration in Semantic Segmentation

Study of post-hoc calibration methods for semantic segmentation to improve confidence estimate reliability in safety-critical applications.

Ax Peng Yun, Shouwang Huang, Hao Li, Jinxi Li, Jianan Wang, Bo Yang 10d ago

PhysMani: Physics-principled 3D World Model for Dynamic Object Manipulation

PhysMani framework coupling physics-principled 3D Gaussian world model with action policy for dynamic object manipulation in embodied AI.

Ax Xin Guan 10d ago

Statistical Properties of $k$-means Clustering for Data Missing Completely at Random

Statistical analysis of k-means clustering with missing data, establishing asymptotic risk bounds and convergence guarantees.

Ax Julian Cardenas, Jamie Arjona, Pedro Delicado 10d ago

Autorelevance function and other feature relevance measures for univariate time series

Model-agnostic methodology for measuring lag relevance in time series forecasting using Ghost variables and Shapley values.

Ax Sneha Ray Barman, Neeraj Kumar Sharma, Shakuntala Mahanta 10d ago

Towards a Phonology-Informed Evaluation of Multilingual TTS

Framework auditing multilingual text-to-speech systems against language-specific phonological patterns using classifier-based evaluation.

Ax Jan Drchal 10d ago

Object Aligner: A Configurable JSON Schema Similarity Score for Graphs, Applied to LLM Prompt Optimization

Object Aligner: configurable JSON schema similarity scoring for measuring LLM output alignment with structured schemas, enabling agentic planning and tool calling evaluation.

Ax Sofiane Ouaari, Kevin Vorwalder, Nico Pfeifer 10d ago

Assessing VLM Reliability for Medical Image Quality Evaluation Under Corruption and Bias

Evaluation of Vision-Language Model reliability for medical image quality assessment under image corruption and demographic bias.

Ax Ilie Sarpe, Federico Altieri, Andrea Pietracaprina, Geppino Pucci, Fabio Vandin 10d ago

Scalable and Distributed Silhouette Approximation

Scalable distributed algorithm for computing silhouette coefficients to assess k-clustering quality on large datasets.

Ax Ruiheng Jiang, Thomas Bi, Raffaello D'Andrea, Aswin Ramachandran 10d ago

Cross-Platform Control for Autonomous Surface Vehicles via Adaptive Reinforcement Learning

Adaptive reinforcement learning approach for zero-shot cross-platform control of autonomous surface vehicles with unknown dynamics.

Ax Ya Gao, Pekka Marttinen 10d ago

Evidence-State Rewards for Long-Context Reasoning

Maven RL framework with editable evidence memory for long-context reasoning, rewarding intermediate evidence state changes rather than just final answers.

Ax Navaneeth Sangameswaran, Preetham S, Ashmiya Lenin 10d ago

HaloGuard 1.0: An Open Weights Constitutional Classifier for Multilingual AI Safety

Open-weights constitutional classifier for multilingual AI safety filtering, achieving SOTA on prompt-safety benchmarks at 1/10th the size of competing models.

Ax Tomoshi Iiyama, Masahiro Suzuki, Yutaka Matsuo 10d ago

SUNTA: Hierarchical Video Prediction with Surprise-based Chunking

Hierarchical state-space model for video prediction using surprise-based chunk boundary detection instead of fixed-length or similarity approaches.

Ax Tien-Huy Nguyen, Minh-Nhat Nguyen, Nguyen Nhat Huy, Hung Viet Nguyen, Huy Nguyen Minh Nhat, Thanh-Huy Nguyen, Cuong Tuan Nguyen, Hoang M. Le, Dat Nguyen, Phat Kim Huynh, Min Xu, Ulas Bagci 10d ago

ESC: Emotional Self-Correction for Reliable Vision-Language Models

Emotional Self-Correction method improves vision-language model reliability by activating latent self-correction without post-training.

Ax Wan Song, Wei Zhou, Rui Wang, Jun Yu, Toru Kurihara, Jiajia Xu, Shu Zhan 10d ago

WBMM: Windowed Batch Matrix Multiplication for Efficient Large Receptive Field Convolution

WBMM efficiently implements large kernel depthwise convolutions via windowed batch matrix multiplication.

Ax Yue Zhang, Nandini Amit Gadhia, Georgios Karagiannis, Michalis Smyrnakis 10d ago

Structured Gaussian Processes for Uncertainty-Aware Classification of High-Dimensional, Small-Sampled Omics Data

Structured Gaussian process method for high-dimensional omics classification with small samples and class imbalance.

Ax Jan Ernsting, Gunnar Paul Kordes, Nils Johannaber, Lynn Ogoniak, Wolfgang Roll, Tim Hahn, Alexander Siegfried Busch, Benjamin Risse 10d ago

Population-Scale Segmentation of Penile Tissue in DIXON MRI using Deep Learning for Quantitative Phenotyping in Male Reproductive Health

Deep learning method for medical image segmentation of penile tissue in MRI for reproductive health phenotyping.

Ax Mikołaj Jastrzębski, Dawid Glinkowski, Dawid Zieliński, Daniel Borkowski, Wojciech Kozłowski, Kamil Adamczewski 10d ago