SPORE: Skeleton Propagation Over Recalibrating Expansions
SPORE is a classical clustering algorithm handling arbitrary geometry without rigid assumptions on cluster structure.
SPORE is a classical clustering algorithm handling arbitrary geometry without rigid assumptions on cluster structure.
Transformer-based symbolic regression method for discovering interpretable mathematical expressions from observed data.
Two-stage entropy approach for noise-tolerant multimodal LLM training using reinforcement learning with verifiable rewards.
Object-centric world models for reinforcement learning using decomposed representations to improve sample efficiency in multi-object environments.
UniGame addresses inconsistency in unified multimodal models between understanding and generation through adversarial framework.
SAFLe framework enabling scalable non-linear federated learning in a single round with heterogeneous data distribution invariance.
Domain adaptive retrieval using prototype-based semantic consistency alignment to transfer knowledge from labeled to unlabeled domains.
Hybrid physics and ML approach for crop yield projections combining gridded crop models with machine learning to improve agricultural forecasting.
Research on measuring noise in LLM evaluations using statistical methods to separate signal from noise in prediction, data, and combined noise.
Day-ahead electricity price forecasting combining linear models, neural networks and online learning for volatile market prediction.
Symbolic regression with partial parameter sharing for discovering expressions describing related phenomena with varying parameters.
Hellinger multimodal VAEs using probabilistic opinion pooling to aggregate unimodal inference distributions.
Sparse-RL addresses memory bottleneck in LLM reinforcement learning by reducing KV cache overhead during long-horizon rollouts.
Dual-prototype disentanglement framework for context-aware time series forecasting using dynamic temporal pattern learning.
Generalized framework for adaptive grid allocation in Kolmogorov-Arnold Networks accounting for target function complexity.
Theoretical framework explaining memorization in diffusion models through weighted sum of empirical score functions.
TextBFGS applies case-based reasoning to iterative code generation with LLMs, using past solutions to guide optimization.
Benchmarks Echo State Networks for univariate time series forecasting against traditional statistical methods on M4 dataset.
Domain adaptive diffusion policy for control that generalizes to unseen transition dynamics through domain representation learning.
Analyzes GRPO limitations in exploration and difficulty adaptation for LLM reasoning, proposing improvements to advantage symmetry.
VJE framework for self-supervised learning using reconstruction-free latent variables with symmetric conditional ELBO optimization.
Applies tabular foundation models to knowledge tracing for real-time student learning prediction without extensive offline training.
Interpretable image classification using hierarchical concept embeddings recovered from vision-language models.
φ-DPO addresses fairness in continual learning for multimodal models when training data is imbalanced across tasks.
Reduces transformer KV cache by using low-dimensional keys for attention selection while maintaining full-dimensional values.
Proposes using proper scoring rules to evaluate probabilistic predictions from tabular foundation models instead of point-estimate metrics.
Replaces dense attention projections with Walsh Hadamard Transforms to reduce transformer parameters by 25% while maintaining performance.
Theoretical analysis of multi-armed bandits under memory and batch constraints, studying regret bounds.
Autonomous driving framework using Dirichlet process mixture models and causal adjustment to address catastrophic forgetting and spurious correlations in lifelong learning.
LLM agent framework with causal scratchpad for open-ended scientific discovery through iterative program evolution.
Theoretical analysis of dataset distillation for encoding low-dimensional task representations from gradient-based learning.
Mixture-of-Experts foundation model for cross-species EEG signal decoding using spectral analysis.
Off-policy learning framework for contextual bandits with constrained item supply in recommendation and advertising.
Theoretical analysis of reverse diffusion sampling error under weak log-concavity using metric mismatch exploitation.
Backdoor attack method on text-attributed graphs by injecting malicious cues into node text.
Parameter-efficient vector-quantized UNet for weather precipitation nowcasting using deep learning.
RL approach decoupling exploration and policy optimization using uncertainty-guided tree search for hard exploration problems.
Sparse Feature Attention (SFA) method reducing transformer attention complexity via feature-level sparsity for ultra-long contexts.
Analysis of systematic biases in Chinchilla Approach 2 neural scaling law fits for compute-optimal LLM allocation.
Skill routing system for LLM agents managing and selecting from large skill ecosystems at inference time.
Synthetic Mixed Training method combining synthetic QAs and documents to scale LLM knowledge acquisition beyond RAG limitations.
Study comparing LLM agents against classical hyperparameter optimization algorithms using autoresearch framework for hyperparameter tuning.
Attack method (PEANUT) exploiting GNN vulnerabilities through graph topology perturbations in message passing.
Framework using Shapley values to measure and explain unfairness in ML models under group fairness criteria.
Image reconstruction using adaptive GANs to improve sample quality for complex image classes like human faces.
Study on adversarial evasion attacks against ML-based network intrusion detection systems, showing attacks remain impractical against dynamic systems.
Multi-GPU GNN training system optimizing data transfer for large-scale graphs exceeding GPU memory capacity via storage-based approaches.
Neural renderer for real-time visualization of large-scale scientific point cloud datasets using neural deferred rendering.
Framework for variational inference that optimizes posterior distributions under model misspecification for better predictive accuracy.
Transfer learning method for precision matrix estimation leveraging related source data with limited target samples.