SleepVLM: Explainable and Rule-Grounded Sleep Staging via a Vision-Language Model
Vision-language model that stages sleep from polysomnography waveforms and generates AASM-compliant clinical rationales with auditable reasoning.
Controlled evaluation of how LLM model choice, size, and prompting strategies affect political text annotation, challenging conventional wisdom.
GNN technique using cross-attention and cohesive subgraph embedding to address the oversquashing problem.
3-bit weight quantization method for LLMs using rotation-domain smoothing via Fast Walsh-Hadamard Transform, improving precision in extreme quantization.
Benchmark dataset for evaluating vision-language models on Japanese scene text understanding, addressing multilingual complexity challenges.
EvidenceNet framework uses LLM-assisted pipeline to extract structured, evidence-grounded biomedical findings from full-text literature into knowledge graphs.
Compression technique for 3D Gaussian Splatting using geometry-aware hierarchical context models to reduce storage overhead.
Adaptive resolution framework for multimodal LLMs that reduces visual token overhead through input-side compression before encoding.
Physics-informed neural network framework for discovering constitutive models in thermomechanics using internal energy and dissipation potentials.
Framework for post-training compression of generative AI models with single-line implementation, addressing quantization and calibration challenges.
Analyzes football passes using spatio-temporal tracking data to understand tactical organization and defensive impact.
Derives time-varying momentum schedule for neural network training from physics principles, eliminating need for manual tuning.
Neural network learns tension parameters for curve subdivision across different geometries, replacing global parameter with per-edge predictions.
Operator learning framework combining linear radial and periodic angular components through polar geometry.
Privacy attack analysis of model reprogramming for membership inference against deep learning models.
Hybrid CPU-GPU framework combining differentiable optimization with ILP solving for combinatorial scheduling.
Multi-agent LLM framework for Bayesian optimization, examining the exploration-exploitation trade-off through implicit reasoning.
Spectral analysis of neural network training phase transitions through rolling-window Gram matrix spectral gap.
Training-free flow matching between Gaussian mixture models with explicit velocity fields and Wasserstein bounds.
LLM agents for GPU kernel optimization using domain-specific language and speed-of-light guidance to reduce design space.
ML investigation of zodiac-based personality prediction, testing astrology claims empirically.
Hierarchical latent risk model for predicting clinical trial success using operational data from trial design phase.
Amortized analog circuit generation system combining graph VAE and flow-matching models with SPICE validation.
Analysis of long-range dependency in integer multiplication for neural networks through computational spacetime perspective.
Gymnasium-compatible RL trading environments with realistic nonlinear market impact models for agent evaluation.
Hierarchical world model with object-centric decomposition and causal latent dynamics for video prediction.
Bilevel optimization using KFAC-based hypergradients for efficient inverse Hessian-vector product computation.
Active learning with Gaussian processes for autonomous microscopy to improve data quality in structure-property learning.
Graph coarsening method for scalable Graph Convolutional Networks on large-scale node classification tasks.
Attack method exposing vulnerabilities in dummy class-based adversarial defenses through weighted attack strategies.
Physics-informed neural networks for modeling cell-induced phase transitions with causal gating mechanisms.
Method enforcing exact conservation laws in high-dimensional physics-informed neural networks using stochastic dimension implicit projections.
Deep neural model predicting item price elasticity for revenue management from historical sales and pricing data.
LGN-KM lifts nonlinear PDE dynamics into linear latent space by learning continuous-time Koopman generator decomposition.
Surrogate model framework for electro-thermal optimization of through-substrate vias replacing computationally expensive FEM simulations.
LGFNet fuses CFD, wind tunnel, and flight test data for aerodynamic modeling using local-global fusion with fidelity gap learning.
Three deep learning approaches for spacecraft telemetry anomaly detection optimized for edge device deployment using neural architecture search.
Theoretical finite-time convergence analysis of multi-timescale stochastic optimization algorithms for simulation-based optimization.
Federated learning method for graph neural networks on dynamic spatio-temporal graphs addressing heterogeneity across decentralized clients.
Hybrid quantum-classical method for 3D cloud field forecasting using spatiotemporal prediction models for weather analysis.
Open-source Python library for machine learning on medical time-series data, addressing heterogeneous clinical data and reducing friction for ML practitioners in healthcare applications.
Lightweight uncertainty quantification for neural networks using gradient norms and an isotropy assumption, without access to training data.
Prior-fitted tabular foundation model using in-context learning for survival analysis with limited and censored data.
Analysis showing cosine similarity between label representations in softmax classifiers does not reliably indicate model behavior.
Target-Aligned RL (TARL) framework addressing stability-recency tradeoff in target networks through selective emphasis of aligned transitions.
Vine copulas and neural density estimation for modeling multivariate dependencies in EV charging event data.
Mathematical framework for polynomial group convolutional neural networks using graded group algebras and neuromanifold analysis.
Graph prompt-based method for out-of-distribution detection in neural networks using disentangled representations.
Information decomposition framework measuring information spectrum in vision-language models to assess multimodal fusion vs unimodal priors.
Framework analyzing pitfalls in active learning for multimodal data, addressing missing modalities and varying interaction structures.