Adversarial Latent-State Training for Robust Policies in Partially Observable Domains
Research on robust policy training in partially observable reinforcement learning under adversarial latent state distribution shifts.
Research on robust policy training in partially observable reinforcement learning under adversarial latent state distribution shifts.
Theoretical analysis connecting drifting models and score-based generative models through kernel-based transport discrepancy.
Systematic study of jailbreak attack scaling laws across LLM methods and model families using compute-bounded optimization framework.
Research on parameter-efficient fine-tuning for continual learning using representation-level optimization instead of weight-level black-box methods.
Zero-shot surgical duration prediction combining retrieval-augmented LLMs with Bayesian averaging for resource management.
Physics-informed autoencoder with frozen PDE solver for tracking continuum mechanics dynamics in video.
Survey of privacy-preserving machine learning mechanisms for IoT devices covering federated learning and edge computing approaches.
Analysis of transformer training dynamics via Spectral Edge Dynamics, identifying coherent optimization directions vs stochastic noise.
Virtual cell perturbation prediction model using optimal transport for in silico experimentation on genetic/chemical perturbations.
Diffusion-based reinforcement learning policy using flow matching with direct entropy regularization and efficient gradient computation.
Comprehensive review of AI methods in fashion including aesthetics, personalization, virtual try-on, and forecasting.
Novel framework for estimating heterogeneous causal contrasts combining T-learning and DR-learning approaches.
Deep learning framework to compensate for perception latency in vision-based autonomous vehicle lane-keeping.
Detects hallucinations in virtually-stained histology using latent space analysis and neural precursor method.
Method for evaluating synthetic chest X-ray quality using embedded characteristic scores.
Universal sparse autoencoders for discovering and aligning interpretable concepts across multiple neural networks.
Multifidelity simulation-based inference framework for parameter estimation with expensive simulators.
Algorithm for finding game equilibria under differential privacy constraints in polymatrix games.
Survey of AI-based methods for detecting and mitigating distributed denial-of-service attacks.
Ensemble of language models for automated tumor classification in cancer registry pathology reports.
Physics-informed neural networks for learning transferable friction models in robotics simulation.
Visualization techniques and task definitions for graphs without node labels.
Hardware-aware neural architecture search for encrypted traffic classification on IoT edge devices.
First-order optimization methods for sparse convex problems with improved convergence rates.
RNN-based control system design with stability guarantees and model predictive control applications.
Transfer learning with GANs to generalize neutrino scattering predictions across different nuclear targets.
AI pipeline for automated deployment of coral reseeding devices using real-time image analysis and robotics.
Testbed for evaluating AI reasoning with causal world models in low-data and out-of-distribution settings.
Universal distillation method for training efficient one-step generators from diffusion and flow models without GANs.
Improves one-step image generation from masked diffusion models using soft embeddings to enable gradient flow for fine-tuning.
Transformer model for predicting cellular signal rates across multiple frequency bands in mobile handsets.
Artificial Age Score framework modeling memory aging patterns in large language models across conversational contexts.
Lightweight disentangled concept bottleneck model for improved interpretability in neural networks.
Survey on high altitude platform systems integration in 6G non-terrestrial networks.
Evaluation of machine learning interatomic potentials across chemical spaces, assessing generalization and transferability.
Transformer model for AP clustering and power allocation optimization in cell-free MIMO networks.
Theoretical analysis of clipped gradient optimization under heavy-tailed noise with refined convergence bounds.
Novel synthetic data generation method for wireless network traffic forecasting to augment training datasets.
Multi-preconditioned LBFGS algorithm for training physics-informed neural networks using domain decomposition.
Onboard flood detection using History Injection Transformer for continuous change detection on resource-constrained satellites.
Study showing arbitrary token generation order in diffusion LLMs doesn't improve reasoning despite flexibility.
Theoretical analysis of hierarchical quantization in VQ-VAEs questioning necessity for reconstruction performance.
One-shot data augmentation technique for few-shot learning combining geometric perturbations with noise injection.
Federated learning approach over wireless channels using over-the-air aggregation without channel state information.
Theoretical study of sparse representation models for density and mode estimation with optimal convergence rates.
Theoretical and interpretability analysis of quantum extreme learning machines using Pauli-transfer matrix approach.
Multi-agent game theory study formalizing coordination dynamics in repeated games as Markov games.
Theoretical analysis of quantum diffusion models showing noiseless-to-noisy transition for score reversal.
Method for efficient LLM reasoning balancing overthinking and underthinking in resource-constrained settings.
Learning-to-Defer framework extended to select experts and conditionally provide information like documents or tool outputs.