Isolater - Feed

Ax William Yicheng Zhu, Lei Zhu 21d ago

The Planetary Cost of AI Acceleration, Part II: The 10th Planetary Boundary and the 6.5-Year Countdown

Discusses environmental and computational costs of scaling LLM agents beyond human cognitive capacity, framing AI acceleration as paradigm shift.

Ax Xinhong Xu, Yimeng Zhang, Qichen Qian, Yuanlong Zhang 21d ago

Self-Supervised Foundation Model for Calcium-imaging Population Dynamics

CalM: Self-supervised foundation model for calcium-imaging neural data, adaptable to multiple neuroscience analysis tasks.

Ax Chan-Wei Hu, Zhengzhong Tu 21d ago

Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking

Region-R1: Framework for multi-modal retrieval-augmented generation re-ranking using query-side region cropping to improve image-question relevance.

Ax Dominik Blain, Maxime Noiseux 21d ago

Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code

Formal verification study of 3,500 code artifacts from 7 LLMs across 500 security-critical prompts, quantifying exploitable vulnerabilities in AI-generated code.

Ax Tinko Sebastian Bartels, Ruixiang Wu, Xinyu Lu, Yikai Lu, Fanzeng Xia, Haoxiang Yang, Yue Chen, Tongxin Li 21d ago

Bridging Natural Language and Microgrid Dynamics: A Context-Aware Simulator and Dataset

OpenCEM: Open-source digital twin simulator and dataset integrating natural language with renewable energy microgrid dynamics for intelligent energy management.

Ax Amit Vaisman, Gal Pomerants, Raz Lapid 21d ago

On the Robustness of Diffusion-Based Image Compression to Bit-Flip Errors

Analyzes robustness of diffusion-based image compression to bit-flip errors, comparing against classical and learned codecs.

Ax Uloma Okoro, Tammy Mackenzie, Branislav Radeljic 21d ago

Governance and Regulation of Artificial Intelligence in Developing Countries: A Case Study of Nigeria

Qualitative case study examining Nigerian legal professionals' perceptions of AI governance, regulatory gaps, and institutional readiness.

Ax Tashreef Muhammad, Tahsin Ahmed, Meherun Farzana, Md. Mahmudul Hasan, Abrar Eyasir, Md. Emon Khan, Mahafuzul Islam Shawon, Ferdous Mondol, Mahmudul Hasan, Muhammad Ibrahim 21d ago

A Benchmark of Classical and Deep Learning Models for Agricultural Commodity Price Forecasting on A Novel Bangladeshi Market Price Dataset

Introduces AgriPriceBD, a benchmark dataset of 1,779 daily commodity prices from Bangladesh, comparing classical and deep learning forecasting models.

Ax Gregory Magarshak 21d ago

Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse

Probabilistic language tries (PLTs) unify prefix structure representation serving as lossless compressor, decision policy, and execution reuse framework.

Ax Gaurav Narasimhan 21d ago

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

FLeX: Fourier-based low-rank expansion for parameter-efficient cross-lingual code generation transfer from Python to Java using Code Llama 7B.

Ax Yongzhong Xu 21d ago

Spectral Edge Dynamics Reveal Functional Modes of Learning

Analysis of grokking training dynamics showing spectral edge reveals functional modes invisible to mechanistic interpretability tools.

Ax Ahsan Bilal, Muhammad Ahmed Mohsin, Muhammad Umer, Asad Aali, Muhammad Usman Khanzada, Muhammad Usman Rafique, Zihao He, Emily Fox, Dean F. Hougen 21d ago

$S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models

S³: stratified scaling search for test-time inference in diffusion language models using classical verifiers to improve generation without additional training.

Ax Apimuk Sornsaeng, Si Min Chan, Wenxuan Zhang, Swee Liang Wong, Joshua Lim, Dario Poletti 21d ago

SMT-AD: a scalable quantum-inspired anomaly detection approach

Quantum-inspired tensor network anomaly detection (SMT-AD) using superposition of bond-dimension-1 matrix product operators with Fourier feature embeddings.

Ax Zixuan Chen, Heng Zhang, YuPeng Qin, WenPeng Xing, Qiang Wang, Da Wang, Changting Lin, Meng Han 21d ago

MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple MyelomaMO-RiskVAE

Multimodal VAE framework for survival risk modeling in multiple myeloma integrating heterogeneous omics and clinical data with improved latent regularization.

Ax Zihan Wang, Chi Gui, Xing Jin, Qineng Wang, Licheng Liu, Kangrui Wang, Shiqi Chen, Linjie Li, Zhengyuan Yang, Pingyue Zhang, Yiping Lu, Jiajun Wu, Li Fei-Fei, Lijuan Wang, Yejin Choi, Manling Li 21d ago

RAGEN-2: Reasoning Collapse in Agentic RL

RAGEN-2 identifies reasoning collapse in RL-trained multi-turn LLM agents where models use input-agnostic templates despite stable entropy metrics.

Ax Giulia Bertaglia, Raffaella Fiamma Cabini 21d ago

Asymptotic-Preserving Neural Networks for Viscoelastic Parameter Identification in Multiscale Blood Flow Modeling

Neural networks for identifying viscoelastic parameters in multiscale blood flow cardiovascular models using asymptotic-preserving methods.

Ax Lin Mu, Haiyang Wang, Li Ni, Lei Sang, Zhize Wu, Peiquan Jin, Yiwen Zhang 21d ago

TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models

TalkLoRA: communication-aware mixture of LoRA experts for parameter-efficient LLM fine-tuning addressing routing instability in MoE-augmented approaches.

Ax Wenyue Hua, Sripad Karne, Qian Xie, Armaan Agrawal, Nikos Pagonas, Kostis Kaffes, Tianyi Peng 21d ago

AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

AgentOpt framework for client-side optimization of LLM-based agents handling composition of local tools, remote APIs and diverse models with reduced costs.

Ax Suraj Yadav, Siddharth Yadav, Parth Goyal 21d ago

Limits of Difficulty Scaling: Hard Samples Yield Diminishing Returns in GRPO-Tuned SLMs

GRPO preference optimization applied to small language models shows diminishing returns on hard samples, revealing capacity boundaries in math reasoning tasks.

Ax Yi Yang, Ovidiu Daescu 21d ago

BiScale-GTR: Fragment-Aware Graph Transformers for Multi-Scale Molecular Representation Learning

Graph Transformer architecture combining GNNs and Transformers for multi-scale molecular property prediction with fragment-aware representation learning.

Ax Marzi Heidari, Hanping Zhang, Hao Yan, Yuhong Guo 21d ago

Bi-Level Optimization for Single Domain Generalization

Bi-level optimization framework (BiSDG) for single domain generalization that decouples task learning from domain modeling using surrogate distributions.

Ax Rishab Balasubramanian, Pin-Jie Lin, Rituraj Sharma, Anjie Fang, Fardin Abdi, Viktor Rozgic, Zheng Du, Mohit Bansal, Tu Vu 21d ago

The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment

Master Key Hypothesis proposes capabilities correspond to transferable directions in low-dimensional subspace; introduces UNLOCK for training-free cross-model capability transfer.

Ax Sakib Mostafa, Lei Xing, Md. Tauhidul Islam 21d ago

Toward a universal foundation model for graph-structured data

Develops universal foundation model for graph-structured biomedical data including molecular networks and regulatory circuits.

Ax Ruggero Freddi, Nicolas Seseri, Diana Nigrisoli, Alessio Basti 21d ago

Bridging Theory and Practice in Crafting Robust Spiking Reservoirs

Addresses hyperparameter tuning challenges in spiking reservoir computing by introducing robustness interval concept for edge-of-chaos operation.

Ax Xiao Shou 21d ago

ODE-free Neural Flow Matching for One-Step Generative Modeling

OT-NFM enables one-step generative modeling by learning transport maps directly instead of integrating vector fields, achieving single forward pass generation.

Ax Mingchen Zhuge, Changsheng Zhao, Haozhe Liu, Zijian Zhou, Shuming Liu, Wenyi Wang, Ernie Chang, Gael Le Lan, Junjie Fei, Wenxuan Zhang, Yasheng Sun, Zhipeng Cai, Zechun Liu, Yunyang Xiong, Yining Yang, Yuandong Tian, Yangyang Shi, Vikas Chandra, J\"urgen Schmidhuber 21d ago

Neural Computers

Proposes Neural Computers (NCs) that unify computation, memory, and I/O in learned runtime states, aiming toward fully neural computing systems that replace explicit programs.

Ax Yi Xu, Philipp Jettkant, Laura Ruis 21d ago

The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning

Study on latent reasoning limits in LLMs investigating whether models discover multi-step planning strategies without supervision.

Ax Srinidhi Madabhushi, Pranesh Vyas, Swathi Vaidyanathan, Mayur Kurup, Elliott Nash, Yegor Silyutin 21d ago

From Load Tests to Live Streams: Graph Embedding-Based Anomaly Detection in Microservice Architectures

Graph embedding-based anomaly detection for microservice architectures identifying under-represented services in load testing.

Ax Noufa Haneefa, Teddy Lazebnik, Einav Peretz-Andersson 21d ago

Quality-preserving Model for Electronics Production Quality Tests Reduction

Data-driven approach reducing electronics production test costs while adapting to changing defect distributions and controlling escape risk.

Ax Yuanjie Shi, Peihong Li, Zijian Zhang, Janardhan Rao Doppa, Yan Yan 21d ago

Conformal Margin Risk Minimization: An Envelope Framework for Robust Learning under Label Noise

Conformal Margin Risk Minimization framework for robust classification under label noise without privileged knowledge.

Ax Willa Potosnak, Nina \.Zukowska, Micha{\l} Wili\'nski, Dan Howarth, Ignacy St\k{e}pka, Mononito Goswami, Artur Dubrawski 21d ago

MICA: Multivariate Infini Compressive Attention for Time Series Forecasting

MICA architecture for multivariate time series forecasting addressing Transformer scalability with channel-dependent attention.

Ax Dev Arpan Desai, Shaoyi Huang, Zining Zhu 21d ago

Distributed Interpretability and Control for Large Language Models

Multi-GPU implementation of activation-level interpretability and steering techniques for large language models, extending single-GPU methods to distributed settings.

Ax David Cho, Yifan Wang, Fanping Sui, Ananth Grama 21d ago

Inference-Time Code Selection via Symbolic Equivalence Partitioning

Novel inference-time scaling method using symbolic execution to select correct code generation solutions from LLM candidates without expensive external verifiers.

Ax Maojiang Su, Po-Chung Hsieh, Weimin Wu, Mingcheng Lu, Jiunhau Chen, Jerry Yao-Chieh Hu, Han Liu 21d ago

Discrete Flow Matching Policy Optimization

DoMinO: unified RL framework for fine-tuning discrete flow matching models viewing sampling as multi-step MDP.

Ax Andrew Lowy 21d ago

Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails

Theoretical analysis of stochastic convex optimization with heavy-tailed gradients under differential privacy constraints.

Ax Philipp Hellwig, Willem Zuidema, Claire E. Stevenson, Martha Lewis 21d ago

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning

Study of transformers learning analogical reasoning via copying intermediate representations using meta-learning for compositionality.

Ax Peigui Qi, Kunsheng Tang, Yanpu Yu, Jialin Wu, Yide Song, Wenbo Zhou, Zhicong Huang, Cheng Hong, Weiming Zhang, Nenghai Yu 21d ago

VLMShield: Efficient and Robust Defense of Vision-Language Models against Malicious Prompts

VLMShield: defense mechanism for vision-language models against malicious prompt attacks using multimodal feature extraction.

Ax Mohammed Nowaz Rabbani Chowdhury, Kaoutar El Maghraoui, Hsinyu Tsai, Naigang Wang, Geoffrey W. Burr, Liu Liu, Meng Wang 21d ago

Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees

Method for post-training quantization of sparse Mixture-of-Experts models with theoretical generalization guarantees.

Ax Yao Sun, Bo Hu, Jose Principe 21d ago

Time-Series Classification with Multivariate Statistical Dependence Features

Framework for time-series classification using cross density ratio instead of correlation-based statistics.

Ax Bryan Cheng, Jasper Zhang 21d ago

When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction

Systematic study of target context conditioning for molecular property prediction across protein families and data regimes.

Ax Nan Zhang, Zishuo Wang, Shuyu Huang, Georgios Diamantopoulos, Nikos Tziritas, Panagiotis Oikonomou, Georgios Theodoropoulos 21d ago

TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning

TwinLoop: simulation-in-the-loop digital twin framework for online multi-agent reinforcement learning with context shifts.

Ax Xiancheng Wang, Lin Wang, Rui Wang, Zhibo Zhang, Minghang Zhao, Xiaoheng Zhang, Zhongyue Tan, Kaitai Mao 21d ago

PD-SOVNet: A Physics-Driven Second-Order Vibration Operator Network for Estimating Wheel Polygonal Roughness from Axle-Box Vibrations

Physics-driven neural network for estimating wheel polygonal roughness from vibration signals in rail vehicles.

Ax Zheng Jiang, Nan He, Yiming Chen, Lifeng Sun 21d ago