Uncertainty-Aware Foundation Models for Clinical Data
Framework for uncertainty-aware foundation models on clinical data, addressing incomplete and irregular measurements in healthcare.
Framework for uncertainty-aware foundation models on clinical data, addressing incomplete and irregular measurements in healthcare.
Copula-based method for generating synthetic educational data that preserves marginal distributions while protecting student privacy.
ClawArena benchmark for evaluating AI agents in dynamic environments with evolving information, contradictions, and implicit user feedback.
Graph-assisted retrieval framework for reasoning about defects in laser powder bed fusion manufacturing using structured scientific knowledge.
Framework using Temporal Behavior Trees to repair suboptimal trajectories before using them for robot control policy learning.
Analysis of token routing in Mixture-of-Experts models reveals three-phase training trajectory for load balance evolution.
Method for constrained model steering of LLMs addressing safety/privacy requirements via spectral subspace optimization.
Calibration audit of multimodal cancer survival models fusing histopathology images with genomic data.
Two-stage ML framework predicting E. coli presence in household drinking water for microbial contamination screening.
Risk scoring system optimizing net benefit using sparse integer linear programming for high-stakes decision-making.
Study of vulnerabilities in large reasoning models when applying machine unlearning techniques to remove influence of specific data.
Federated RLHF method for fair LLM alignment across diverse human preferences without centralizing preference data.
Dual-step generative framework combining causal structure learning with tabular data synthesis using directed acyclic GANs.
Computer vision and neural network approach for classifying human brain activity from EEG data during hand movement.
Pipeline combining LSTM, synthetic data, and fine-tuning for EEG classification on implicit visual stimuli tasks.
Analysis of distributional reinforcement learning for complex domains like healthcare, addressing heterogeneous groups under uncertainty.
Tutorial on using flow- and score-based generative models for decision-making under distributional shift in operations research.
Neural architectures for learning to approximate Wasserstein-2 distances using Kuratowski embedding theorem.
System for generating editable design variations using decoder-only language model with Creative Markup Language representation.
Theoretical analysis of Q-value iteration convergence in multi-agent Stackelberg games using control-theoretic perspective.
Research on aligning LLMs with human preferences using relative density ratio optimization without assuming specific preference models, improving statistical consistency.
Study questioning necessity of prompt selection in task-free online continual learning for non-stationary data streams.
Ablation framework to estimate contributions of central, peripheral, and temporal visual information to human decision-making in Atari games.
TinyNina: edge-AI framework for satellite super-resolution applied to NO2 air quality monitoring with resource constraints.
DP-OPD: differentially private on-policy distillation method for compressing LLMs on sensitive data while maintaining privacy guarantees.
MAVEN: mesh-aware volumetric encoding network for simulating 3D flexible deformation using graph neural networks on mesh structures.
Discrete Prototypical Memories approach for federated time series foundation models using LLMs while preserving data privacy.
External validation study on ECG biometrics using Inception-v1 with ArcFace on MIMIC and HEEDB datasets.
Isokinetic Flow Matching introduces pathwise acceleration regularization to improve few-step sampling in flow-based generative models.
SLaB: sparse-lowrank-binary decomposition framework for efficient LLM compression maintaining performance at high compression ratios.
Multi-objective controllable language models framework enabling personalized alignment with varying human preferences beyond fixed reward optimization.
GAIN: multiplicative modulation technique for domain adaptation in LLMs, preventing catastrophic forgetting through feature re-emphasis.
Reproducibility study on spurious correlations and shortcut learning in DNNs, comparing frameworks for ensuring models use causally relevant features.
Revisits learning from equivalence queries model for modern ML systems like generative models and recommendation systems with periodic updates.
FlashSAC: off-policy reinforcement learning algorithm for stable, fast robot control in high-dimensional action spaces.
Detection method for free-riders in federated learning via simulated attack patterns, improving the WEF-based approach.
Deep learning approach for clinical mortality prediction from incomplete multimodal Electronic Health Records using point cloud paradigm.
Method to mitigate reward hacking in Best-of-N sampling for language models using pessimism, addressing inference-time compute scaling challenges.
Novel Anticipatory Reinforcement Learning framework for non-Markovian decision processes with jump-diffusions and structural breaks, designed for single trajectory learning.
Batch Loss Score metric for dynamic data pruning using exponential moving averages, accelerating deep learning training.
Explainable ML models for sepsis prediction using Romanian EHR dataset with 12,286 hospitalizations and 600 lab test types.
Quantization method for LLMs combining mixed-precision and low-rank decomposition for efficient INT computation on NPU devices.
Sampling parallelism approach for efficient Bayesian neural networks and uncertainty quantification in risk-sensitive applications.
Mechanistic analysis decomposing GPT-2 Small's final MLP into legible exception handler with 27 named neurons routing decisions.
Method using task reformulation to enable LLMs to learn from difficult problems via reinforcement learning from verifiable rewards.
Complete pipeline for federated unlearning with evaluation framework, enabling models to forget deleted data in distributed learning.
Algorithms for automatic concept selection in interpretable reinforcement learning policies without manual domain expertise.
Research on how generator access constraints affect autoregressive post-training and learning from rollouts vs prefix queries.
Python toolkit for intersectional fairness analysis in clinical ML models, addressing compounded disparities beyond single-axis comparisons.
Empirical robustness analysis of TabPFN's attention mechanisms for in-context learning on tabular data, examining noise immunity without retraining.