Detecting low left ventricular ejection fraction from ECG using an interpretable and scalable predictor-driven framework
Develops interpretable ML framework for detecting low left ventricular ejection fraction from ECG data.
Develops interpretable ML framework for detecting low left ventricular ejection fraction from ECG data.
Applies Vision-Language Models to chip floorplanning macro placement optimization tasks.
Introduces HyperP, hypersphere parameterization for LLM scaling with improved stability and hyperparameter transfer.
Proposes time-varying momentum schedule derived from critically damped harmonic oscillator for neural network training optimization.
Research on membership inference attacks against deep learning models using model reprogramming to reduce computational costs of privacy auditing.
arXiv paper analyzing integer multiplication as hard problem for neural networks. Theoretical analysis challenging assumptions about long-range dependencies in neural computation.
arXiv paper on realistic market impact modeling for RL trading agents. Gymnasium-compatible environments with nonlinear transaction costs for algorithmic trading research.
arXiv paper on personalized federated fine-tuning of language models. Federated learning approach for task-centric LLM adaptation on private distributed data.
arXiv paper on Byzantine-resilient distributed optimization with probabilistic edge dropout. Convergence analysis for distributed learning with adversarial agents.
arXiv paper on memory-efficient LLM pre-training via truncated SVD factorization. Method reduces memory footprint for training large language models on consumer hardware.
arXiv paper on O(1) complexity label prediction for neural networks with millions of classes. Optimization technique for efficient classification in high-dimensional spaces.
arXiv paper on human-AI cooperation via fatigue-aware deferral systems. ML method modeling human fatigue to optimize when AI should defer to humans.
arXiv paper introducing Multiscreen attention mechanism for language models. Alternative to softmax attention enabling absolute relevance scoring in transformers.
arXiv paper on reinforcement learning post-training for reasoning models. Open-weight model training using verifiable rewards across diverse reasoning domains.
arXiv paper on opponent modeling in game-theoretic reinforcement learning using tree-search and generative models. Research on scalable multi-agent RL methods.
Theoretical analysis of accelerated gradient methods for nonconvex optimization and convergence to local minima.
Multi-agent reinforcement learning framework for HIV prevention policy optimization across U.S. regions.
Deep CNN model trained on Portuguese native flora dataset for species identification in citizen science.
Black-box visual prompting method for parameter-efficient transfer learning of foundation models without full parameter access.
SPRIG: Genetic algorithm for optimizing system prompts in LLMs to improve task performance.
MissNODAG: Framework for learning cyclic causal graphs from incomplete data using differentiable methods.
Sparse Gradient Descent algorithm for variable selection in convex piecewise linear regression models.
Score-matching causal discovery algorithm extended for temporal data on networks.
XAI-based method combining explainability with concept drift detection for monitoring model performance degradation.
Framework for constructing confidence sets for changepoints in sequential analysis using data-dependent stopping times.
World models using disentangled representations to transfer semantic knowledge from distracting videos for RL agents.
Digital twins framework for optimizing CI/CD build processes to reduce duration, failures, and flakiness.
Online test-time adaptation for spiking neural networks on neuromorphic chips to handle distribution shifts.
FSD framework combining vision-language models with robotic action models for zero-shot manipulation in novel scenarios.
Review of ML/AI applications in food processing, classification systems, and food informatics.
Neural network surrogate for learning evolution operators in time-dependent Schrödinger equations with unitarity constraints.
Gaussian mixture models as computationally efficient proxy for LLM+RAG systems combining multiple models.
COinCO dataset with 97,722 images created via diffusion-based inpainting for training context-aware vision models.
Machine learning methods for learning Hamiltonian components of open quantum systems.
Large deviations approach to accelerate constrained sampling algorithms for probability distributions.
Technique to recover LLM training on decentralized/spot nodes from partial model loss without full checkpoints.
Method for LLMs to reliably cite source documents seen during training without external retrievers at inference time.
Vision Transformer framework reconstructs cloud-obscured satellite imagery using time-series data for crop mapping.
SciGA-145k dataset for training models to automatically design graphical abstracts for academic papers using visual data.
CATNet applies graph convolutional networks to predict catastrophe bond spreads using relational data structures.
Modification of Whisper ASR model to enable low-latency streaming transcription through architectural and training changes.
Vision-language model for robotic manipulation using embodiment-agnostic pointing representation to address generalization in embodied AI.
System co-design for efficient on-device LLM inference on NPU hardware, optimizing attention operations for privacy-preserving deployment.
Diffusion-based causal inference method for spatio-temporal data with unmeasured confounders and multi-resolution observations.
Theoretical work on distributed mean estimation with 1-bit communication constraints using interval queries, achieving near-optimal sample complexity.
Google developer tool using deep learning to automatically fix copy/paste code, predicting required edits from formatting to cross-language translation.
Knowledge editing method for LLMs enabling sequential updates through null-space alignment, improving robustness in continual model editing scenarios.
AI system for credit scoring of Malaysian MSMEs using bank statement data as alternative to traditional credit bureau data.
Scientific machine learning approach using implicit neural representations for 3D gravity inversion, modeling subsurface density as continuous field.
Data envelopment analysis method for dynamic efficiency evaluation across multiple organizational dimensions with regularization for large-scale settings.