Isolater - Feed

Ax Zinan Tang, Yukun Zhang, Shaomian Zheng, Zhuoshi Pan, Qizhi Pei, Dingnan Jin, Jun Zhou, Yujun Wang, Biqing Huang 11d ago

CausalMix: Data Mixture as Causal Inference for Language Model Training

arXiv paper on CausalMix for optimizing data mixture weights in LLM training using causal inference without retraining.

Ax Miruna Cretu, John Bradshaw, Patricia Suriana, Saeed Saremi, Omar Mahmood, Kirill Shmilovich, Kangway Chuang, Vishnu Sresht, Colin Grambow 11d ago

SynLaD: Latent Diffusion for Generating Synthesizable Molecules Conditioned on 3D Pharmacophore Profiles

arXiv paper on SynLaD latent diffusion framework for generating synthesizable drug molecules with pharmacophore constraints.

Ax Hao Huang 11d ago

Muon as a Residual Connection

arXiv paper proposing Muon optimizer as implicit residual connection during neural network training, explaining its effectiveness.

Ax Xun Dong, Yibo Xu, Naigang Wang, Xin Li, Penghang Yin, Zi Yang 11d ago

ZO-Act: Efficient Zeroth-Order Fine-Tuning via One-Shot Activation-Informed Low-Rank Subspaces

arXiv paper presenting ZO-Act for efficient zeroth-order fine-tuning of large language models using activation-informed low-rank subspaces.

Ax Meenakshi Krishnan, Pranav Pulijala, Ke Chen, Haizhao Yang, Ramani Duraiswami 11d ago

GAIA: Geometry-Adaptive Operator Learning for Forward and Inverse Problems

arXiv paper on GAIA, a geometry-adaptive neural operator learning method for PDEs including boundary value and inverse problems.

Ax Binglin Ji, Anindya Sarkar, Hengchang Lu, Jens Sj\"olund, Yevgeniy Vorobeychik 11d ago

Sequentially-Controlled Interactive Multi-Particle Flow-Maps for Online Feedback-Driven Search

arXiv paper proposing IMPF for reward alignment in generative models with sequential feedback-driven exploration.

Ax Siwon Kim 11d ago

A Lightweight Self-Supervised Learning Framework for Multivariate Time Series using Hierarchical-JEPA on ECG Data

ER-JEPA lightweight self-supervised learning framework using hierarchical joint-embedding predictive architecture for ECG signal analysis.

Ax Landon Dyken, Sharmistha Chakrabarti, Nathan Debardeleben, Steve Petruzza, Qi Wu, Will Usher, Sidharth Kumar 11d ago

Efficient Compression of Structured and Unstructured Volumes via Learned 3D Gaussian Representation

Learned 3D Gaussian representation for efficient compression of structured and unstructured volume data with reduced memory footprint.

Ax Kornelius Raeth, Nicole Ludwig 11d ago

Decision-Aware Training for Sample-Based Generative Models

Decision-aware training framework for generative models in forecasting that optimizes for downstream decision maker costs instead of standard scoring rules.

Ax Michael Y. Li, Anthony Zhan, Kanishk Gandhi, Noah D. Goodman, Emily B. Fox 11d ago

QuasiMoTTo: Quasi-Monte Carlo Test-Time Scaling

Quasi-Monte Carlo test-time scaling method for language models reducing redundancy in parallel sampling while improving inference efficiency.

Ax Mehul Damani, Isha Puri, Idan Shenfeld, Jacob Andreas 11d ago

Right in the Right Way: LM Training with Verifiable Rewards and Human Demonstrations

RLVR framework combining verifiable rewards and human demonstrations for LM training, addressing diversity collapse from objective-only optimization.

Ax Jingyi Chen, Xinyuan Zhang, Xinwu Qian 11d ago

Neural Certificate Pricing for Combinatorial Optimization Problems

Neural Certificate Pricing applies unsupervised learning to combinatorial optimization by leveraging asymmetry between search and verification complexity.

Ax Chuanming Yu, Jiaming Liu, Zihao Ge, Xiongfei Wu, Lulu Zhu, Pengzhan Zhao, Jianjun Zhao 11d ago

Quantum vs. Classical Machine Learning: A Unified Empirical Comparison

Empirical comparison of quantum machine learning models versus classical machine learning approaches across benchmarks.

Ax Patrick Podest, Marco Pichler, Elias B\"urger, Levente Z\'olyomi, Bernhard Voggenberger, Wilhelm Berghammer, Daniel Klotz, Sebastian B\"ock, G\"unter Klambauer, Sepp Hochreiter 11d ago

TiRex-2: Generalizing TiRex to Multivariate Data and Streaming

TiRex-2 extends univariate time series foundation model to multivariate forecasting using recurrent xLSTM with streaming capability.

Ax Chih-Han Yang, Dai-Jie Wu, Yun-Ping Huang, Ping-Chun Hsieh, Kenneth Marino, Shao-Hua Sun 11d ago

Language-Critique Imitation Learning from Suboptimal Demonstrations

Language-critique framework for imitation learning from suboptimal demonstrations using natural language feedback instead of scalar signals.

Ax Zijian Zhang, Rizhen Hu, Athanasios Glentis, Dawei Li, Chung-Yiu Yau, Hongzhou Lin, Mingyi Hong 11d ago

Is One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL Training

Study showing single transformer layer training matches full-parameter RL fine-tuning for LLMs, revealing unequal layer-wise contribution during RL post-training.

Ax Zhichao Geng, Yang Yang 11d ago

Why Advanced Encoders Lag on Sparse Retrieval? The Answer and an Approach to Bridging Vocabulary Gaps

Identifies vocabulary gap in modern encoders for sparse retrieval and proposes approach to bridge gap between dense and sparse retrieval.

Ax Eni Solomon Laughter 11d ago

Urban Deceleration Behavior Modes Under Scene Context: An Early-Kinematic Classifier from Argoverse 2 Multi-Agent Trajectories

Kinematic classifier for urban vehicle deceleration behaviors using K-means clustering on Argoverse 2 trajectory data.

Ax Ahmadreza Chokhachian, V. Roshan Joseph, Yu Ding 11d ago

Spatio-Temporal Gaussian Process for Building Terrain-Incorporating Wind Power Curves

Spatio-temporal Gaussian process model for wind turbine power curves incorporating terrain covariates.

Ax Nishant Subramani 11d ago

Harnessing the Latent Space: From Steering Vectors to Model Calibrators for Control and Trust

Framework using steering vectors and latent space analysis to control and calibrate language model behavior for trustworthy deployment.

Ax Julia Machnio, Mads Nielsen, Mostafa Mehdipour Ghazi 11d ago

A Mechanism-Driven Theory of Phase Transitions in Active Learning

Theoretical characterization of active learning budget regimes as shifts in dominant generalization mechanisms.

Ax Kai Hu, Akash Bharadwaj, Weichen Yu, Matt Fredrikson 11d ago

Steal the Patch Size: Adversarially Manipulate Vision-Language Models

Black-box attack recovering private vision-tokenizer configurations of vision-language models through side-channel analysis.

Ax Luke Chen, Cheng-Ju Wu, David R. Martin, Qilin Ye, Pramod Khargonekar, Mohammad Abdullah Al Faruque 11d ago

HydraCollab: Adaptive Collaborative-Perception for Distributed Autonomous Systems

Multi-robot collaborative perception system balancing communication bandwidth and perception accuracy for autonomous systems.

Ax Fabrizzio Sabelli 11d ago

Homogenization of $\ell_2$-Adversarial Training in High-Dimensions: Exact Dynamics under Stochastic Gradient Descent

Theoretical framework analyzing adversarial training dynamics for single-index models on Gaussian mixtures using SGD in high dimensions.

Ax Ruikang Zhao, Zhenting Wang, Han Gao, Ligong Han 11d ago

SLIM-RL: Risk-Budgeted Random-Masking RL for Diffusion LLMs Without Trajectory Slicing

SLIM-RL proposes risk-budgeted random-masking reinforcement learning for diffusion LLMs, eliminating trajectory slicing overhead from prior TraceRL method.

Ax Shuwen Chai, Qiaosen Wang 11d ago

Sample Complexities of Estimating Gumbel--Max Watermark Proportions with and without Reduction to Pivotal Statistics

Analysis of sample complexity for estimating watermark proportions in documents under Gumbel-max LLM watermarking mechanism.

Ax Ved G. Shah, Nabeel Rehemtulla, Adam A. Miller, Sushant Sharma Chaudhary, Michael W. Coughlin, Antoine Le Calloch, Matthew J. Graham, Joahan Castaneda Jaimes, Theophile Jegou du Laz, Ashish A. Mahabal, Frank J. Masci, Josiah Purdum, Reed Riddle, Jesper Sollerman, Anastasia Wei, Mansi M. Kasliwal 11d ago

Leveraging Multimodality for Real-Time Classification of Transients and Variables found by the Zwicky Transient Facility

Multimodal machine learning for real-time classification of transient astronomical objects from Zwicky Transient Facility survey.

Ax Masen Bachleda, Peter Lalor 11d ago

Computer vision-based neural networks for radioisotope identification in urban environments

Computer vision neural networks for radioisotope identification from gamma-ray spectrograms in urban environments.

Ax Riley Acker, Aman Desai, Garrett Kenyon, Frank Barrows 11d ago

Self-Organized Learning in Oscillatory Neural Networks with Memristive Signed Couplings

Self-organized learning in oscillatory neural networks with memristive couplings for associative memory and optimization.

Ax Xiangyue Liu, Zijian Zhang, Miles Yang, Zhao Zhong, Liefeng Bo, Ping Tan 11d ago

Rosetta: Composable Native Multimodal Pretraining

Rosetta: Composable multimodal pretraining approach addressing gradient conflicts when integrating new modalities without catastrophic forgetting.

Ax Hengyu Fu, Tianyu Guo, Zixuan Wang, Hanlin Zhu, Jason D. Lee, Jiantao Jiao, Stuart Russell, Song Mei 11d ago

DiscoLoop: Looping Discrete Embeddings and Continuous Hidden States for Multi-hop Reasoning

DiscoLoop: Method for internalizing multi-hop reasoning in LLMs within single forward pass using discrete embeddings and continuous states.

Ax Yujin Huang, Xin Zheng, Xingliang Yuan, Kwok-Yan Lam 11d ago

SoK: Attack and Defense Landscape of Mobile On-device AI Systems

Systematization of knowledge on attack and defense landscape for mobile on-device AI systems.

Ax Adeel Yousaf, Soumik Ghosh, James Beetham, Amrit Singh Bedi, Mubarak Shah 11d ago

The Illusion of High Utility in Safety Alignment of Text-to-Image Diffusion Models

Structured evaluation showing text-to-image diffusion safety alignment methods create illusion of high utility through coarse metrics.

Ax Zhenhang Li, Xin Zhou, Hao Deng, Lijun Yin 11d ago

MindAU: EEG-Conditioned Facial Action Unit Editing via Dual-Stream Manifold Alignment

EEG-conditioned facial action unit editing using dual-stream manifold alignment.

Ax Emil Joswin, Srujananjali Medicherla, Priyanka Mary Mammen 11d ago

A Mechanistic View of Authority Hierarchy in LLM Sycophancy

Mechanistic investigation of authority bias in LLMs showing how models prioritize source credibility over factual consistency.

Ax Guohao Sun, Xiaofang Wang, Yash Patel, Mengchen Liu, Zhiqiang Tao, Praveen Krishnan 11d ago

Information-Regularized Attention for Visual-Centric Reasoning

Information-regularized attention mechanism to improve visual grounding and reduce hallucination in vision-language models.

Ax Yuan Qing, Chengzhi Mao, Boqing Gong 11d ago

StochasT: Learning with Stochastic Turn Depth for Visual Instruction Tuning

StochasT: Visual instruction tuning method addressing visual attention decay in multi-turn vision-language model conversations.

Ax Sagnik Ghosh 11d ago

Predicting Lethal Outcome (Cause) And Understanding Key Biomarkers Linked With Acute Myocardial Infarction Using Deep Artificial Neural Network And Ensemble Of Machine Learning Methodologies

Deep neural networks and ensemble methods to predict mortality outcomes and identify biomarkers in acute myocardial infarction.

Ax Dilusha Chandrasiri, Maneesha Herath, Yasith Hewarathna, Muditha Herath, Gishan Bandara, Madara Mendis, Nathali Athukorala, Nisansa de Silva, Sandareka Wickramanayake 11d ago

How Environment and Urbanization Shape Bird Diversity in Sri Lanka

Analysis of bird diversity in Sri Lanka using spatial and environmental data.

Ax P. Dobson, J. M. Sanz-Serna, K. C. Zygalakis 11d ago

Optimal scaling of MCMC algorithms: exploiting the symmetry of the Metropolis-Hastings formula

Theoretical study of MCMC scaling properties using Metropolis-Hastings symmetry.

Ax Arya Raeesi, Hanna Roed 11d ago

Auditing Forgetting in Limited Memory Language Models

Causal auditing framework to detect whether deleted facts persist in limited memory language models through parametric memory or retrieval artifacts.

Ax Roberto Capobianco (Sony AI, Zurich, Switzerland), Harm van Seijen (Sony AI, North America, various locations), Nolan D. Bard (Sony AI, North America, various locations), Neil Burch (Sony AI, North America, various locations), Fatima Davelouis (Sony AI, North America, various locations), Josh Davidson (Sony AI, North America, various locations), Alisa Devlic (Sony AI, Zurich, Switzerland), Yunshu Du (Sony AI, North America, various locations), Ishan Durugkar (Sony AI, North America, various locations), Siddhant Gangapurwala (Sony AI, North America, various locations), Daniel Hernandez (Sony AI, North America, various locations), G. Zacharias Holland (Sony AI, North America, various locations), Sahil Jain (Sony AI, North America, various locations), Kenta Kawamoto (Sony AI, Tokyo, Japan), Raksha Kumaraswamy (Sony AI, North America, various locations), Patrick MacAlpine (Sony AI, North America, various locations), Dustin R. Morrill (Sony AI, North America, various locations), Declan Oller (Sony AI, North America, various locations), Francesco Riccio (Sony AI, Zurich, Switzerland), Akanksha Saran (Sony AI, North America, various locations), Craig Sherstan (Sony AI, Tokyo, Japan), Kaushik Subramanian (Sony AI, Zurich, Switzerland), Thomas J. Walsh (Sony AI, North America, various locations), Samuel Barrett (Sony AI, North America, various locations), Kizza N. Frisbee (Sony AI, North America, various locations), Mady Govil (Sony AI, North America, various locations), Johannes G\"unther (Sony AI, North America, various locations), Varun R. Kompella (Sony AI, North America, various locations), James A. MacGlashan (Sony AI, North America, various locations), Maxwell Svetlik (Sony AI, North America, various locations), Michael D. Thomure (Sony AI, North America, various locations), Jaden B. Travnik (Sony AI, North America, various locations), Kevin Waugh (Sony AI, North America, various locations), Elahe Aghapour (Sony AI, North America, various locations), Florian Fuchs (Sony AI, Zurich, Switzerland), Andreanne Lemay (Sony AI, North America, various locations), Shruti Mishra (Sony AI, Zurich, Switzerland), Takuma Seno (Sony AI, Tokyo, Japan), Peter Stone (Sony AI, North America, various locations), Michael Spranger (Sony AI, Tokyo, Japan), Peter R. Wurman (Sony AI, North America, various locations) 11d ago