ECTO: Exogenous-Conditioned Temporal Operator for Ultra-Short-Term Wind Power Forecasting
Deep learning method for ultra-short-term wind power forecasting using exogenous meteorological variables and time-series prediction.
Deep learning method for ultra-short-term wind power forecasting using exogenous meteorological variables and time-series prediction.
Research on reward allocation principles for LLM post-training, comparing sparse sequence-level vs dense token-level rewards for different model stages.
WriteSAE introduces sparse autoencoders for matrix updates in recurrent language models like Mamba-2 and RWKV-7, learning rank-1 matrix atoms to directly replace model writes.
Safety guarantees for multi-task Bayesian optimization with uncertain co-regionalization matrices in linear models.
MoRe: modular representation learning approach for continual learning on sequential data with minimal catastrophic forgetting.
Lagrangian flow matching framework for designing probability paths in generative models beyond standard straight-line transport.
SEED: data selection method formulated as weighted independent set problem to balance quality and diversity in training datasets.
PULSE: generative model for non-stationary time series forecasting using phase evolution to handle distribution shifts.
Proposes Jacobian-guided noise reshaping for local differential privacy to improve data utility by task-aware noise injection.
Structured neural marked point process model for interpretable multi-class event stream analysis and inter-event dependency discovery.
Pre-training method for sparse text-attributed graphs using LLM alignment to improve transferability in sparse node text scenarios.
General Preference RL framework unifying online RL and preference optimization for LLM alignment across both structured and open-ended tasks.
Exact Linear Attention mechanism achieving linear computational complexity for Transformers through kernel decomposition without approximation error.
Analysis of neural network redundancy using spectral structural distortion to identify prunable neurons beyond weight/activation magnitude.
Few-shot machine unlearning method for removing sensitive information from LLMs without retraining or degrading model utility.
CODA optimization technique rewriting Transformer blocks as GEMM operations to reduce memory-bound operator bottlenecks in training.
Continual Model Merging framework using ODE perspective to control capacity allocation across sequential tasks, reducing catastrophic forgetting.
Study of how inference backend software systems impact LLM reproducibility and benchmark results at fractions of percentage point differences.
Online sequential optimization for driver subsidies in ride-hailing platforms balancing supply-demand dynamics with multiple constraints.
LLM-based framework for predicting missing survey responses in repeated cross-sectional studies using question/respondent embeddings.
Computational-statistical trade-offs in kernel two-sample testing using random Fourier features to reduce MMD test complexity.
Learning-to-Defer framework allocating extractive QA queries to specialized experts with theoretical guarantees for resource-constrained LLM deployment.
Statistical theory on testing probability distribution support size with minimal samples.
Privacy-preserving language modeling approach to prevent memorization and exposure of sensitive personal information in fine-tuned models.
Bayesian optimization algorithm using kernel regression and density-based exploration to reduce Gaussian process computational complexity.
Framework converting LLM-simulated survey responses into confidence sets with uncertainty quantification for population parameters.
Research on approximating optimal transport equations using neural differential equations (Neural ODEs) for unbalanced transport problems.
Plugin reasoner module for frozen LLMs that enables reasoning without retraining, compatible across different model architectures.
Study of how instruction-tuned multimodal LLMs align with brain activity patterns under naturalistic stimuli.
Benchmark dataset for evaluating out-of-distribution robustness of vision models on web-scale corruptions.
arXiv paper on control and optimization methods for neural PDEs in supervised learning contexts.
arXiv paper on machine-learned force fields for lattice dynamics at coupled-cluster accuracy levels.
arXiv paper on efficient long-context LLM inference using retrospective sparse attention to optimize KV cache.
Analyzes anti-establishment sentiment on TikTok and its relationship to institutional distrust through social media influence dynamics.
Proposes BALLAST, an active learning methodology for placing ocean drifters to infer spatio-temporal vector fields using physics-informed Gaussian processes.
Presents TimeRewarder, a method for learning dense reward signals from unlabeled videos by estimating frame-wise task progress for reinforcement learning.
Investigates quantum reservoir computing using Jaynes-Cummings models for time-series prediction tasks with nonlinear memory capabilities.
Introduces TelecomTS, a multi-modal dataset of zero-inflated observability time series and language data from enterprise system monitoring for benchmarking.
Analyzes adversarial robustness in learning-to-defer systems where inputs are routed to predictors or experts, extending prior two-stage analyses to one-stage joint training.
Performance study and optimization strategies for multi-node distributed LLM inference across GPU clusters and communication bottlenecks.
Possibilistic variational inference framework using possibility theory for Bayesian learning with epistemic uncertainty.
Bayesian approach for learning nonlinear system dynamics from infrequent measurements for uncertainty-aware optimal control.
Interpretable regression framework combining random Fourier features with localized additive models for heterogeneous data.
Agentic physical AI framework for nuclear reactor control as domain-specific foundation model alternative to general-purpose models.
Analysis of less discriminatory algorithm adoption requirements under U.S. discrimination law for high-stakes decision systems.
CPPDD framework for secure multi-client data aggregation using consensus and privacy-preserving mechanisms.
Sequential knowledge distillation methodology for training compact image compression autoencoders with reduced computational requirements.
Graph imitation learning approach addressing optimization and representation challenges in neural graph generation tasks.
Self-refining video sampling method improving physical realism in generated videos without external verifiers or augmented training.
Framework for learning incentive mechanisms enabling cooperative resilience in multi-agent systems under social dilemma conditions.