Mix-and-Match Pruning: Globally Guided Layer-Wise Sparsification of DNNs
Mix-and-Match Pruning globally-guided layer-wise sparsification framework for compressing DNNs with minimal accuracy loss on edge devices.
Mix-and-Match Pruning globally-guided layer-wise sparsification framework for compressing DNNs with minimal accuracy loss on edge devices.
FastPFRec federated recommendation framework using GNNs with enhanced convergence speed and privacy-preserving secure aggregation.
Knowledge distillation approach mitigating catastrophic forgetting in incremental hyperspectral image classification without storing old samples.
VGS-Decoding training-free method to reduce hallucinations in medical vision-language models using visual grounding scores during inference.
Empirical study of how executable tool access impacts safety alignment in LLM agents. Shows tool affordance increases capability-safety misalignment.
Demonstrates mathematical isomorphism between ant colony decision-making and random forest ensemble learning under stochastic ensemble intelligence framework.
Mathematical study of forward and inverse problems for time-dependent probability measures in Bayes-Hilbert spaces.
Federated recommender system addressing subgraph structural imbalance in decentralized training on client-specific user-item graphs.
G2DR framework for genotype-first therapeutic target and drug discovery using genetic variants and transcriptomics data.
Theory for data-driven operator learning methods in smoothing and forecasting of dynamical systems and data assimilation.
Framework for representing and propagating measurement uncertainty beyond Gaussian assumptions in control and measurement systems.
Nexerra-R1 chemical language model for metal-organic framework design. Systematic discovery of materials with targeted properties via LLM.
Multi-agent optimization framework for resource allocation in heterogeneous LoRa IoT networks combining ground and underground sensors.
Introduces Meta-Persuasion algorithms applying meta-learning to repeated Bayesian persuasion games with theoretical guarantees.
CERN uses Hidden Markov Models to correct raw electrical signals from nanopore DNA sequencing for improved genome assembly.
Hawkeye system reproduces GPU matrix operations on CPU for verifiable ML without precision loss. Enables reproducible ML inference analysis.
Derives computable state-estimation error bounds for physics-informed neural network KKL observers in control systems.
Study evaluating detectability of LLM-assisted peer reviews. Tests five detectors on dataset simulating human-AI collaboration levels.
Feature attribution method for explaining machine learning decisions in ECG signal analysis with shift-invariant properties.
Goal-oriented learning of surrogate models for stochastic dynamical systems with error bounds on path-dependent observables.
Simulation-based inference framework enabling rapid neural network fitting across varying cognitive modeling assumptions and parameterizations.
Study finding language models report highest confidence when fabricating, with formal proof this is observational not capability limitation.
SC-Net operator learning framework for regularized inverse problems using spectral filtering with improved interpretability and generalization.
World model-based reinforcement learning approach for training Vision-Language-Action robotic models without costly real-world interaction.
Neural architecture incorporating five inductive biases for improved performance on tabular data compared to tree-based models.
Dense associative memory system for empirical measures using Sinkhorn divergence and spherical Hellinger Kantorovich gradient flows for pattern retrieval.
Framework for integrating structural and functional brain connectomes using hierarchical multiscale learning across nested modular organizations.
Asynchronous decomposition framework for high-dimensional online learning with dynamic regularization avoiding error bound divergence.
mmWave-Diffusion framework using conditional diffusion models for contactless respiration sensing with micromotion interference removal.
Non-Differential Transformer architecture for improved sentiment analysis in text, inspired by Differential Transformer variants.
RoboECC edge-cloud collaborative deployment framework for Vision-Language-Action models enabling real-time embodied AI inference.
Multi-RF Fusion ensemble method achieving top OGB leaderboard rank for molecular property prediction via Random Forests and GNNs.
Evaluation and solution for improving general QA performance of LLMs trained with reinforcement learning from verifiable rewards.
Study of visual representation degradation in MLLMs with predictive regularization technique to preserve visual competence during language training.
Compass framework for optimizing compound AI workflows across multiple specialized models with dynamic adaptation under varying loads.
Dodgersort system for efficient pairwise ranking via CLIP-based pre-ordering, neural ranking, and uncertainty-aware pair selection.
HiCI hierarchical attention module for long-context language modeling with segment-level and global context integration.
Lightweight ensemble approach for white blood cell classification addressing class imbalance in medical imaging.
Time-series forecasting method for financial markets handling delayed/stale observations via residual latency-aware mixing.
RubricRAG system for interpretable LLM evaluation via domain knowledge retrieval to generate detailed rubrics instead of scalar scores.
Framework for jointly learning state, dynamics, and filtering algorithm parameters in data assimilation via auto-differentiable filtering.
Router mechanism for LLMs using internal prefill activations and Encoder-Target Decoupling to route queries to best-performing models.
Benchmark study of deep learning model efficiency comparing Conv6, VGG16, and other architectures under computational constraints.
Machine learning approach for adaptive rate allocation in 5G+ networks handling rapid wireless link quality fluctuations in mobile environments.
Framework applying Active Inference and Free Energy Principle to physical AI agents and robots operating under resource constraints in real-world environments.
Theoretical analysis of coordinate ascent variational inference stability differences in sequential vs parallel variants for high-dimensional linear regression.
MOELIGA multi-objective evolutionary algorithm for feature selection balancing subset size and classification accuracy.
DiscoUQ framework extracts semantic structure from disagreement patterns in multi-agent LLM ensembles for improved uncertainty quantification.
Intelligent Disobedience Game formalizes shared autonomy scenarios where systems must override human instructions for safety using game theory.
ALL-FEM framework fine-tunes LLMs to automatically generate and analyze finite element method code for engineering simulations.