Early Exiting Predictive Coding Neural Networks for Edge AI
Early exiting predictive coding neural networks optimized for edge AI devices with resource constraints and privacy requirements.
GenOL framework for online learning given only concept names (the name-only setup), enabling real-time adaptation to data distribution shifts in continual learning scenarios.
Introduces WEATHER-5K dataset and benchmarks physics-informed time-series forecasting models for global weather prediction.
Control-theoretic approach to reinforcement learning with convergence guarantees, new gradient theorem, and gradient ascent algorithm.
Information-theoretic analysis of transformer in-context learning on variable-order Markov chains with finite-sample accuracy bounds.
Diffusion sampler using value functions with invariant symmetries for sampling from unnormalized target densities.
Equivariant neural networks for analyzing scalar and vector fields on spheres using group convolutions in Fourier space.
Critical evaluation of model inversion attack assessment frameworks, identifying flaws in standard evaluation methodology.
Neural Graduated Assignment method for solving Maximum Common Edge Subgraph problem with improved scalability.
Training-free framework for compiling sparse Mixture-of-Experts variants with predicted expert utility metric for deployment optimization.
Theoretical analysis of concentration properties for fractional quasi p-norms in high-dimensional spaces.
Framework for characterizing epistemic errors in uncertainty-aware multitask learners under distribution shift.
Implicit neural representations for efficient exploration of large-scale simulation ensembles with interpretability focus.
Probabilistic inference speedup for Hidden Markov Models by filtering low-probability states in temporal sequences.
Classical polynomial chaos expansion technique for surrogate modeling and uncertainty quantification in physical simulation.
Research on dynamic reward weighting for multi-objective RL alignment in LLMs, addressing non-convex Pareto fronts in preference learning.
Comparative study of neural network classifiers and optimizers for EEG frequency band classification across brain hemispheres.
Theoretical convergence analysis of Muon optimizer for matrix-structured parameters in neural network training.
Deep learning for predicting shock propagation in porous materials with multi-field, spatio-temporal modeling.
Out-of-distribution detection for regression tasks in scientific AI using score-based diffusion models for joint likelihood estimation.
Transformer-based inter-atomic potential model for molecular simulations without explicit equivariance constraints.
Analysis of implicit models with infinite-depth weight-tied networks that match explicit models while reducing memory consumption.
Framework for learning graphon mixtures from graph data using motif moments for clustering graphs from multiple distributions.
Empirical study comparing message passing neural networks and graph transformers for atomistic property prediction.
Theoretical analysis of how attention head count influences transformer approximation properties and expressive power.
Adaptive rollout and routing method for data-driven weather forecasting with improved spatiotemporal modeling.
Novel evaluation metrics for generative models in crystal/material discovery assessing stability, uniqueness, and novelty.
Learning-to-optimize Transformer framework for scalable beamforming in multi-user wireless systems.
Automated algorithm design using machine learning to optimize hyperparameter auto-tuning for high-performance applications.
Geo-Foundation Models framework for flood hazard mapping from SAR satellite imagery in data-scarce regions.
Multi-objective optimization approach for balancing training dynamics across multiple sensor modalities in learning-enabled control systems.
Continual Transformers architecture for real-time low-latency inference on streaming data with reduced redundant computation.
Survey of deep unfolding techniques combining classical optimization algorithms with neural networks for signal processing.
Research on normalization-free transformer architectures using Dynamic Tanh as alternative to standard normalization layers.
Learning-theoretic approach to extracting interpretable features from superposition in complex ML models.
Comparative study of ML methods for forecasting electric vehicle charging demand across different time horizons.
Multimodal VAE method using Hellinger distance and probabilistic opinion pooling for weakly supervised generative learning.
Split learning system using hybrid-order optimization to reduce memory overhead for collaborative LLM training on edge devices.
Architecture separating energy-based world models from language generation in LLMs to improve understanding vs. fluency tradeoff.
PAIR-Former applies budgeted relational multi-instance learning to miRNA target prediction with compute constraints and instance-level relational processing.
Addresses machine unlearning for sparse LLMs, removing memorized sensitive information while preserving the efficiency benefits of sparsification for deployment.
ECHO-2 is a distributed RL framework for LLM post-training that optimizes the cost-efficiency of rollout generation across distributed resources.
VJE introduces a reconstruction-free latent-variable framework for self-supervised learning using a symmetric conditional ELBO on paired embeddings.
LP-FNO uses Fourier Neural Operators as a surrogate model for laser welding simulations, enabling faster parametric solution learning for industrial process optimization.
Proposes the radVI algorithm for variational inference, which optimizes radial profiles to better approximate high-dimensional distributions beyond standard Gaussian surrogates.
DGPO: Graph diffusion model for neural architecture search, steered by reinforcement learning.
Study using finetuned LLMs for topic-conditional sentiment extraction to forecast aluminum commodity prices.
Survey of privacy-preserving ML techniques for IoT including federated learning and differential privacy approaches.
SafeDriver-IQ: Framework using inverse crash probability modeling for real-time driver safety scoring.
Decentralized bi-level RL algorithm for environment design with sample-efficient hypergradient estimation.