MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal Scaling of Diffusion Language Models
MDM-Prime-v2: Improvements to masked diffusion language models through binary encoding and index shuffling.
MDM-Prime-v2: Improvements to masked diffusion language models through binary encoding and index shuffling.
Multi-view learning framework handling dimensional disparities across different feature views.
Federated learning approach using representation geometry to handle noisy annotations in distributed scenarios.
FIPO: RL algorithm improving token-level credit assignment for reasoning in LLMs beyond outcome-based rewards.
Safety-aware offline RL method using budget-conditioned reachability analysis for constrained decision-making.
SkillRouter: System for routing LLM agent requests to relevant skills from large skill libraries at inference time.
GNN layer using cost-sensitive neighborhood aggregation for heterophilous graph classification.
Theoretical analysis of online convex optimization with two-point bandit feedback and high-probability regret bounds.
GNN architecture addressing oversquashing through cross-attentive cohesive subgraph embedding.
ITQ3_S: 3-bit LLM quantization method using interleaved ternary quantization and rotation-domain smoothing for efficient inference.
Interpretability method for reinforcement learning using principal prototype analysis on manifolds.
Federated learning approach for livestock growth prediction addressing privacy and data scarcity in farm management.
Framework unifying gradient descent and Newton-type methods through quadratic gradient with synthesized Hessians.
Methods for uncertainty quantification in stochastic gradient descent using cheap resampling-based confidence intervals.
Neural network approach for prime number classification using sparse encoding.
Metrics for measuring predictability of recommender systems via structural complexity analysis.
Vision transformer optimization for image segmentation with adaptive computation per input image.
LLM-driven conversational recommender system for leisure event discovery with user-centric evaluation in SME context.
Image segmentation approach using divisive normalization for autonomous driving under diverse environmental conditions.
Framework for tightening convex relaxations of trained neural networks with convex and S-shaped activations for optimization incorporation.
Real-time operator takeover paradigm allowing seamless human intervention and correction during visuomotor diffusion policy execution.
Neural network method combining Gaussian processes with pre-trained priors to accelerate spatiotemporal inference on large datasets.
German-language LLM pre-training dataset curated via heuristic filtering, model-based selection, and synthetic data generation.
Meta-learning framework using LLMs to automatically design selection operators for evolutionary symbolic regression algorithms.
AVA-Bench systematically evaluates atomic visual abilities of vision foundation models independent of LLM instruction tuning.
SlowFast Sampling optimizes inference efficiency in diffusion-based language models through dynamic, flexible token generation strategies.
Streaming transformer architecture inspired by autoregressive LLMs for real-time 3D geometry perception and reconstruction from video.
Statistical methods for constructing confidence intervals for optimal treatment policy values using softmax smoothing in causal inference.
NES is an instruction-free code editing framework that learns from historical editing trajectories to suggest next edits with low latency.
Test-time adaptation method using domain augmentation and model ensembles to handle weather-related domain shifts in autonomous driving.
Knowledge distillation and self-supervised learning approach for continual learning with class-incremental learning and external unlabeled data.
Interpretability framework for understanding how components of particle swarm optimization algorithms affect performance.
Benchmark evaluating how large vision-language models handle object recognition in contextually incongruent scenes and manage uncertainty.
Wireless sensor network localization using distributed optimization algorithms for cooperative and non-cooperative positioning.
ProxyAttn method using representative attention heads to enable efficient sparse attention in LLMs for long-text processing with minimal performance degradation.
Multi-Stream Generative Policy framework for robot learning that combines multiple object-centric policies at inference to improve sample efficiency and generalization.
Algorithm for smooth quasar-convex optimization with general convex constraints achieving nearly optimal first-order query complexity.
Representation alignment technique for multimodal medical object detection addressing heterogeneous statistics across imaging modalities.
Transfer learning approach for assessing face image recognizability in unconstrained conditions without relying on visual heuristics.
Neuro-symbolic AI overview connecting neural networks with symbolic reasoning to satisfy constraints, addressing reliable trustworthy AI development.
Efficient local causal discovery method for identifying adjustment sets without learning the full causal graph.
Nonparametric neural network-based method for drift function estimation in diffusion processes with convergence rate analysis.
One-shot adaptation framework improving vision-language-action model generalization to novel camera viewpoints through spatial representation recalibration.
Evaluation of LLM performance on Indian language maternal healthcare triage, comparing native scripts versus romanized text in real-world deployment.
Guidance strategy for diffusion transformers using internal model dynamics to improve image generation quality without external classifiers.
Unsupervised segmentation approach for wind turbine blade inspection using region growing and classification without significant AI/ML innovation.
Explainable AI framework using multiple instance learning for survival prediction from glioblastoma histomorphology images without code or tool contributions.
Analysis of 25k chain-of-thought trajectories showing neural scaling triggers domain-specific phase transitions in reasoning rather than uniform capability improvements across 8B-70B parameter models.
V0 is a generalist value model for policy gradient methods that scales efficiently with LLM training, replacing large critic models in actor-critic methods like PPO.
Test-time guidance technique for diffusion models enabling fast text-driven image and video editing through inpainting with reduced computational cost.