Less is More: Rethinking Few-Shot Learning and Recurrent Neural Nets
Study on few-shot learning and RNNs applying asymptotic equipartition property from information theory to machine learning.
Study on few-shot learning and RNNs applying asymptotic equipartition property from information theory to machine learning.
Theoretical analysis of inexact Langevin algorithm convergence for score-based generative models with KL divergence guarantees.
Survey on continual graph learning covering incremental learning from streaming graph data with experience and generative replay approaches.
Mathematical analysis of auto-differentiation reliability in neural-ODE training with high-order numerical methods.
Novel prior learning method for neural networks using structured posteriors to improve generalization and uncertainty estimation.
Proves asymptotic optimality of new restless bandit policies with O(1/√N) gap under unichain and aperiodicity conditions.
Theoretical analysis of sample complexity for model-based Q-learning, establishing finite-time convergence bounds for model-learning algorithms.
Paper proposing Explaining-Away Variational Autoencoders to improve uncertainty representations in deep generative models for visual inference tasks.
Survey of multimodal continual learning methods that enable models to learn from new data across multiple modalities while retaining previous knowledge without catastrophic forgetting.
Transformers learn variable-order Markov chains in-context with finite-sample accuracy analysis using context-tree weighting.
Analysis of Sharpness-Aware Minimization robustness to label noise through gradient down-weighting at element-wise level.
Scalable neural network verification using branch-and-bound with inferred cutting planes instead of external MIP solvers.
Wavelet subspace compression for optimizer states reduces memory during LLM training, improving upon low-rank approaches.
Dataset distillation for credit scoring models addressing class imbalance in pretrained models on tabular financial data.
Binned spectral power loss function for improved deep learning predictions of chaotic multiscale dynamical systems.
Multimodal drug-aware diffusion model for ECG generation in virtual clinical trials with demographic disentanglement.
Steering vectors applied to LLM activations for bias mitigation across social dimensions like age, gender, and race.
Large graph dataset and measurement framework for evaluating long-range interactions in graph representation learning.
Methods to measure faithfulness of concept-based explanations in deep vision models using surrogate models.
Training-free audio-visual segmentation using foundational models for open-vocabulary pixel-level mask prediction.
Survey of LLM integration with Computer-Aided Design tools, covering applications in 3D modeling and design workflows.
Birch SGD framework represents distributed SGD methods as computation trees to unify analysis and design of optimization algorithms.
Deep latent variable models for vertical federated learning with flexible alignment and labeling across feature-partitioned data.
Foundation models for time-series prediction often use simple parroting strategies rather than learning physics, revealing shared failure modes.
FlowPure uses continuous normalizing flows for adversarial purification to remove perturbations from ML model inputs at inference time.
Structured Agent Distillation compresses large LLM-based ReAct agents into smaller models while preserving reasoning and action consistency.
CoDec kernel optimizes LLM decoding by sharing prefix computation across multiple prompts to reduce memory-intensive KV cache access.
Decentralized multi-player multi-armed bandits problem with unknown arm capacities and no collision sensing.
Physics-informed neural networks compute 3D magnetohydrodynamic equilibria by parametrizing Fourier modes and minimizing force residuals.
User-centric evaluation metrics for counterfactual explanations in ML models, focusing on actionability and end-user preferences.
MicroMix: mixed-precision quantization method using microscaling formats for efficient LLM inference on NVIDIA Blackwell hardware.
Novel fine-tuning mechanism for LLMs that addresses data quality/volume issues through controlled forgetting to improve domain adaptation.
PENGUIN: Transformer variant with periodic-nested group attention mechanism for improved long-term time series forecasting.
Empirical study of initialization schemes for Kolmogorov-Arnold Networks, proposing theory-driven approaches to improve training of spline-based KANs.
Training-free framework for deferring predictions to multiple experts using conformal prediction without retraining.
ReTrack enables data unlearning in diffusion models via importance sampling to remove memorized training data influence.
GaussianPSL framework for multi-objective optimization with soft partitioning handling complex discontinuous and degenerate Pareto frontiers.
Neural network approach to learning modular genetic circuit functions in synthetic biology from input/output data.
Algorithms for distributed RL with policy gradients under asynchronous parallel computation and communication.
Benchmark for LLM-assisted emergency triage from MIMIC-IV-ED database with preprocessing for rapid patient deterioration prediction.
Method for diagnosing when data augmentation and equivariant architectures improve or harm generalization under distribution asymmetry.
Q-learning algorithm for non-stationary RL with distribution shifts under both episodic and infinite-horizon settings.
Uses LLMs to programmatically synthesize anomaly detectors for tabular data without direct processing of raw data for privacy.
ACE framework evolves context for self-improving LLM agents, addressing brevity bias and context collapse in iterative refinement.
Mitigates premature exploitation in particle filtering for inference-time scaling of language models using process reward models.
TabPFN-Wide extends prior-data fitted networks for tabular data with extreme feature counts in biomedicine applications.
Constraints-of-Thought framework enables LLMs to perform constrained multi-step reasoning while satisfying symbolic constraints and user intent.
PANTHER applies generative pretraining to model user behavior sequences beyond language, using multi-dimensional action attributes.
Bandit algorithm for high-stakes sequential decision-making that learns when to abstain from actions with irreparable consequences.
RL algorithm for learning policies that maximize return while inducing dispersed state distributions across multiple reward sources.