A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling
End-to-end reinforcement learning framework for heterogeneous DAG scheduling with gap-aware generation enabling rapid schedule adaptation across environments.
End-to-end reinforcement learning framework for heterogeneous DAG scheduling with gap-aware generation enabling rapid schedule adaptation across environments.
Diffusion model for unconditional molecular generation using permutation symmetry on quotient manifolds to enforce invariance in point-cloud generation.
Mechanistic interpretability framework identifying and attributing safety circuits in LLMs responsible for alignment, jailbreak, and backdoor behaviors.
Comparative study of seven ML models (XGBoost, LSTM, CNN-LSTM, etc.) for hourly air temperature and humidity forecasting in Chongqing.
Off-policy value-based reinforcement learning framework for LLMs enabling improved data utilization and sample efficiency for long-horizon tasks.
Energy-based model for graph generation using transport-aligned sampling to improve efficiency and quality in discrete domain generation.
Length-aware scheduling method accelerating reinforcement learning training for LLMs by optimizing rollout phase efficiency during chain-of-thought generation.
Continual learning framework using mixture-of-experts with similarity awareness for data-efficient adaptation to new tasks with limited samples.
Computationally efficient reinforcement learning algorithm for linear function approximation in MDPs satisfying linear Bellman completeness.
Federated learning approach combining differential privacy and Byzantine robustness to protect against both data leakage and adversarial server attacks.
Deep learning method for estimating aerodynamic variables (velocity, angle-of-attack) from piezoelectric sensor measurements on aircraft structures.
Systematic evaluation of prompting strategies (zero-shot, few-shot, chain-of-thought) for chart question answering across GPT-3.5, GPT-4, and GPT-4o models on ChartQA dataset.
TIPS framework improves RL training for search-augmented LLMs via turn-level reward shaping, addressing sparse rewards and credit assignment in reasoning tasks.
Multi-agent reinforcement learning agents develop efficient private communication protocol; performance drops with human-comprehensible language enforced.
CHANRG benchmark reveals limited generalization of RNA secondary-structure prediction models. 170K structured RNA families dataset.
Quantitative assessment of reference retrieval errors from 5 LLM platforms on 2,000 medical literature references. Evaluates Grok-2, ChatGPT, Gemini, Perplexity, DeepSeek.
Theoretical paper on thermodynamic principles and computational costs of maintaining symbolic interpretability in AI systems.
Theoretical analysis of low-rank knowledge distillation for LLMs with convergence and generalization guarantees. Covers compression techniques for efficient deployment.
Quantum-enhanced graph neural network for network intrusion detection exploiting relational dependencies in network traffic.
Quantum federated autoencoder for anomaly detection in IoT networks using distributed learning without centralizing raw data.
Multimodal fusion framework for predicting synthetic lethality in cancer drug development. Domain-specific bioinformatics research.
Quantum Wasserstein GAN for de novo drug design using generative AI. Focuses on drug discovery rather than ML applications or tools.
Quantum computing approach for probabilistic modeling over permutation-structured data using super-exponential symmetric group Fourier transform speedup.
Framework for computational arbitrage in AI model markets where arbitrageurs allocate inference budget across competing providers to undercut pricing.
First system enabling fully homomorphic encryption for end-to-end mmWave radar sensing with composable FHE kernels for signal processing and ML inference.
Token-level analysis of distributional shifts during RLVR fine-tuning of LLMs, examining mechanisms underlying reasoning improvements.
Data-driven approach using memory-augmented neural networks to model fluid wake effects for autonomous aerial and aquatic robots.
Functional component ablation framework analyzing specialization in hybrid language models combining attention with state space models or linear attention.
Verifiable synthetic benchmark for LLM-based insider threat detection using deterministic simulation engine to maintain ground truth and cross-artifact consistency.
Differential privacy framework for RLHF fine-tuning that decouples reward learning to preserve user privacy in LLM preference-based training.
Systematic benchmark comparing four multi-agent LLM orchestration architectures for financial document processing with cost-accuracy tradeoffs and scaling strategies.
Method leveraging intermediate layer representations in LLMs via Inter-Layer Structural Encoders to improve task-specific predictions beyond final-layer features.
Active learning approach using Rashomon ensemble for interpretable decision tree induction with direct hypothesis space characterization.
Quantitative model predicting when independently fine-tuned specialist LLMs can be fused post-hoc for improved performance using divergence metric.
Method addressing over-fragmentation in video object-centric learning through reconstruction-guided slot curriculum training approach.
Retrospective study using continuous AI monitoring to measure bed and chair fall rates in healthcare settings over exposure time.
Research on whether LLMs' step-by-step reasoning is genuinely used or post-hoc narrative generation through step-level evaluation of frontier models.
arXiv paper on brain-inspired object detection co-design. Algorithm-architecture optimization for CLIP-based task-oriented detection on edge devices.
arXiv paper analyzing hierarchical reasoning models for LLMs. Mathematical theory of recursive networks for algorithmic reasoning.
arXiv paper on off-policy evaluation for survival outcomes with censored data. Applied ML research for decision-making systems.
arXiv paper on black-box domain adaptation using dual-teacher distillation. Technical ML research on knowledge transfer without source access.
Computer vision method integrating expert eye gaze trajectories into transformer models for improved chest X-ray classification in radiology.
Human-AI co-design approach for privacy-preserving transformation of electronic health records using geometric operators to enable secure data sharing.
Novel stepwise variational inference method using vine copulas for estimating complex latent dependencies in probabilistic models.
Dataset and harm-aware model for Dari-language misinformation detection on YouTube with information type and harm level annotations.
Critical review framework evaluating membership inference attacks and conditions under which they pose genuine privacy threats to ML models.
Investigation of LLMs' reasoning and optimization capabilities under physical and operational constraints using Optimal Power Flow problems.
Object detection framework combining YOLOv10 with Kolmogorov-Arnold networks and vision-language models for interpretability.
Generative AI approach for lung CT synthesis across full Hounsfield Unit range to address medical imaging data scarcity.
Framework for model evaluation analyzing performance and reliability trade-offs when target KPI levels are unknown.