AdaHOP: Fast and Accurate Low-Precision Training via Outlier-Pattern-Aware Rotation
Low-precision training method for LLMs using adaptive Hadamard transforms based on outlier patterns in weights, activations, and gradients.
Low-precision training method for LLMs using adaptive Hadamard transforms based on outlier patterns in weights, activations, and gradients.
Research on using LLM-generated synthetic data to warm-start contextual bandits, examining alignment between LLM choices and actual user preferences.
Spectral framework for multi-scale nonlinear dimensionality reduction balancing global-local structure preservation and expressiveness-transparency.
Optimized NF4 dequantization kernels for fast LLM inference on NVIDIA GPUs, addressing FP16 conversion bottleneck.
Communication-efficient distributed learning algorithm with differential privacy using local training and gradient clipping.
ROMAN operator for time series that creates multiscale representations by building antialiased pyramids for convolutional classifiers.
VoxelCodeBench platform benchmarking code generation models for 3D spatial reasoning with execution in Unreal Engine.
Analysis of function vectors in LLMs showing they steer behavior beyond logit lens interpretability across 4,032 cross-template transfer pairs.
Complex-valued GNNs for distributed control of networked systems with basis-invariance for GPS and compass-denied environments.
Continual graph learning without storing exemplars, using class-level prototypes and analytic continual learning to address catastrophic forgetting.
AXELRAM smart SRAM architecture computing attention scores from quantized KV cache without dequantization using orthogonal-transform quantization.
Distributed GNN training with communication-free sampling and 4D hybrid parallelism for scaling mini-batch learning on large graphs.
Study of generalization limits in RLHF alignment, proposing compound jailbreaks targeting LLM safety through redistribution of existing capabilities.
Theoretical analysis of gradient descent training at edge of stability with product-stability property for convergence guarantees.
Low-rank compression of pretrained models using randomized subspace iteration for efficient SVD-based model reduction.
Physics-informed neural networks coupled with finite difference methods for thermal-hydraulic system simulation.
Muscle fatigue detection from sEMG signals using adversarial and contrastive learning with neural networks.
Mechanistic interpretability research on whether LLMs encode belief geometries like transformers trained on hidden Markov models.
Token-space attacks on reward models used in RLHF, introducing TOMPA framework for adversarial optimization beyond semantic manipulation.
Semantic communication for wireless image transmission using mixture-of-experts to adapt to diverse image contents and channel conditions.
Algebraic-geometric framework for quantum neural networks addressing barren plateaus and noise robustness.
FluxMoE system decouples expert residency to improve inference serving throughput in Mixture-of-Experts LLMs.
Technical analysis of evaluation methodology challenges for diffusion language models at scale.
Theoretical analysis of diffusion model degradation in latent spaces using Fisher geometry framework.
Physics-guided deep learning framework combining microscopic and macroscopic approaches for crowd simulation.
Free-flow class-incremental learning framework handling realistic variable-sized task streams without fixed task schedules.
Study comparing active preference learning vs random sampling for online DPO data selection in modern LLMs.
Learning-guided approach for reducing computational burden in network-constrained unit commitment optimization.
GNN-based surrogate model for accelerating hydraulic flood forecasting simulations.
Graph-based ML approach for detecting money laundering transactions in financial networks.
Computational analysis of exponential weights algorithm for online logistic regression with improved complexity bounds.
Neural routing algorithms for network traffic optimization using live telemetry data with near-real-time latency.
ML analysis of NASA space biology dataset revealing thermogenic reprogramming in female mouse adipose tissue.
Addresses reward hacking in RLHF by using advantage sign robustness to prevent policy degradation.
Federated learning optimization via fixed gating for weight averaging under statistical heterogeneity.
Self-Guide framework for LLM agents using co-evolved internal rewards to address sparse reward problem in long-horizon tasks.
On-policy self-distillation training paradigm for LLMs combining dense signals from larger teacher models.
Graph domain adaptation method addressing structural discrepancies in graph neural networks under topology shifts.
arXiv paper analyzes hallucination's role in RL post-training of multimodal LLMs, investigating whether models genuinely learn visual information under RL.
arXiv paper proposes PRISM, combining LLM-guided labeling with semantic clustering for interpretable topic modeling with low computational cost.
arXiv paper studies context space optimization for generally capable agents, examining credit assignment, forgetting, and overfitting in context learning.
arXiv paper on surrogate modeling for real-time blood flow prediction and hemodynamic cardiovascular analysis.
arXiv paper exploring server learning to enhance federated learning robustness against malicious attacks with non-IID client data.
arXiv paper on federated class-incremental learning for LEO satellites addressing non-IID data, memory, and communication constraints.
arXiv paper proposes AI-agent-augmented DNS blocking to prevent LLM access during student evaluations, addressing academic integrity concerns.
arXiv paper introducing TRACE, ML pipeline for detecting Internet routing changes using traceroute latency data with ensemble learning.
arXiv paper analyzing backdoor attacks on decentralized LLM post-training via pipeline parallelism, examining vulnerabilities from malicious participants.
arXiv paper proposing fully photonic convolutional neural network with pre-trained in-situ training to overcome CMOS energy bottlenecks.
PlayGen-MoG: Mixture-of-Gaussians framework for diverse multi-agent trajectory generation in team sports with spatial coordination.
Guideline2Graph: LLM/VLM-based parser converting multimodal clinical guidelines into executable decision graphs with cross-page control flow.