FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving
FluxMoE system decouples expert residency to improve inference serving throughput in Mixture-of-Experts LLMs.
FluxMoE system decouples expert residency to improve inference serving throughput in Mixture-of-Experts LLMs.
Technical analysis of evaluation methodology challenges for diffusion language models at scale.
Theoretical analysis of diffusion model degradation in latent spaces using Fisher geometry framework.
Physics-guided deep learning framework combining microscopic and macroscopic approaches for crowd simulation.
Free-flow class-incremental learning framework handling realistic variable-sized task streams without fixed task schedules.
Study comparing active preference learning vs random sampling for online DPO data selection in modern LLMs.
Learning-guided approach for reducing computational burden in network-constrained unit commitment optimization.
GNN-based surrogate model for accelerating hydraulic flood forecasting simulations.
Graph-based ML approach for detecting money laundering transactions in financial networks.
Computational analysis of exponential weights algorithm for online logistic regression with improved complexity bounds.
Neural routing algorithms for network traffic optimization using live telemetry data with near-real-time latency.
ML analysis of NASA space biology dataset revealing thermogenic reprogramming in female mouse adipose tissue.
Addresses reward hacking in RLHF by using advantage sign robustness to prevent policy degradation.
Federated learning optimization via fixed gating for weight averaging under statistical heterogeneity.
Self-Guide framework for LLM agents using co-evolved internal rewards to address sparse reward problem in long-horizon tasks.
On-policy self-distillation training paradigm for LLMs combining dense signals from larger teacher models.
Graph domain adaptation method addressing structural discrepancies in graph neural networks under topology shifts.
arXiv paper analyzes hallucination's role in RL post-training of multimodal LLMs, investigating whether models genuinely learn visual information under RL.
arXiv paper proposes PRISM, combining LLM-guided labeling with semantic clustering for interpretable topic modeling with low computational cost.
arXiv paper studies context space optimization for generally capable agents, examining credit assignment, forgetting, and overfitting in context learning.
arXiv paper on surrogate modeling for real-time blood flow prediction and hemodynamic cardiovascular analysis.
arXiv paper exploring server learning to enhance federated learning robustness against malicious attacks with non-IID client data.
arXiv paper on federated class-incremental learning for LEO satellites addressing non-IID data, memory, and communication constraints.
arXiv paper proposes AI-agent-augmented DNS blocking to prevent LLM access during student evaluations, addressing academic integrity concerns.
arXiv paper introducing TRACE, ML pipeline for detecting Internet routing changes using traceroute latency data with ensemble learning.
arXiv paper analyzing backdoor attacks on decentralized LLM post-training via pipeline parallelism, examining vulnerabilities from malicious participants.
arXiv paper proposing fully photonic convolutional neural network with pre-trained in-situ training to overcome CMOS energy bottlenecks.
PlayGen-MoG: Mixture-of-Gaussians framework for diverse multi-agent trajectory generation in team sports with spatial coordination.
Guideline2Graph: LLM/VLM-based parser converting multimodal clinical guidelines into executable decision graphs with cross-page control flow.
Examination of confirmation bias in LLMs through rule-discovery tasks, evaluating and mitigating systematic reasoning failures.
Projection-free adaptive SGD for matrix optimization achieving dimension-independent convergence without costly quadratic projections.
Statistical survey of RLHF for LLM alignment, examining noisy subjective feedback for reward models and policy optimization.
Neural posterior estimation for Bayesian parameter inference in Li-ion batteries using probabilistic calibration.
AQVolt26: High-temperature halide dataset with r²SCAN calculations for training universal ML potentials for solid-state batteries.
Deep reinforcement learning for element-level bridge lifecycle optimization using condition state proportions.
Feature Attribution Stability Suite: Evaluates post-hoc attribution method stability under realistic input perturbations with improved metrics.
Synapse: LLM-guided two-phase retrieval and genetic algorithm optimization for job-person matching under information imbalance.
Empirical study of confidence calibration and hallucination in vision-language models for medical VQA, spanning three model families.
UniScene3D: Transformer-based 3D scene encoder jointly modeling appearance and geometry from multi-view colored pointmaps with CLIP alignment.
Optimal error algorithms for adversarial learning using randomized hypotheses, improving deterministic hypothesis bounds by factor of 1/2.
WSVD: Weighted SVD technique for efficient low-precision execution of vision-language models with reduced computational burden.
Study examining safety alignment bypass methods in LLMs via jailbreak-tuning and weight orthogonalization, analyzing disabled safety guardrails.
Trajectory-free self-test loss function for learning interacting particle system potentials from unlabeled discrete time-point data.
Gromov-Wasserstein optimal transport methods for geometry-aware multi-view data embedding under heterogeneous geometries and nonlinear distortions.
AutoVerifier: LLM-based agentic framework automating end-to-end verification of technical claims across scientific literature without domain expertise required.
RL framework using LLM-as-judge for label-free knowledge distillation and reasoning improvement in small and large language models without ground truth labels.
Placebo-anchored transport framework for meta-analysis addressing covariate shift across randomized controlled trials.
Systematic evaluation of LLM formal reasoning capabilities using Chomsky hierarchy and computation theory complexity metrics.
MOMO: Foundation model merging multi-sensor Mars remote sensing data (HiRISE, CTX, THEMIS) using Equal Validation Loss alignment.
Variational Bayesian adaptive Kalman filter for state estimation in sensor networks with packet dropouts and corrupted observations.