GLASS: Geometry-aware Local Alignment and Structure Synchronization Network for 2D-3D Registration
Vision network for 2D-3D image-to-point-cloud registration using geometry-aware local alignment and structural synchronization.
Vision network for 2D-3D image-to-point-cloud registration using geometry-aware local alignment and structural synchronization.
Approach for merging multiple LoRA modules while preserving subspace coverage and addressing directional anisotropy to maintain task representation in general-purpose systems.
Semi-structured discrete-time model combining additive predictors with neural networks for mortgage delinquency analysis and default prediction.
Benchmark for evaluating machine unlearning in multimodal models like CLIP, introducing SALMUBench with 60K persona-attribute associations for fine-grained forgetting evaluation.
Method for merging independently fine-tuned LoRA adapters across heterogeneous tasks using null-space compression, addressing classification-regression task combinations.
Tensor-network surrogate for efficient option pricing in portfolio risk management using tensor-train approximation.
Graph-learning algorithm (MED-MAGMA) for fitting Kronecker-sum-structured models with multiplicative noise in genomics applications.
Power-weighted noncentral complex Gaussian distribution for signal processing and communications with non-Gaussian amplitude characteristics.
Generative approach for uncertainty quantification in multimodal supervised learning combining images and text data.
Theoretical analysis of Kantorovich-kernel neural network operators with density results, convergence estimates, and Korovkin theorems.
Diffusion models for reconstructing quantum dot charge stability diagrams to accelerate quantum processor characterization.
Meta-learning framework for human mesh recovery from images using optimization-friendly initializations and uncertainty-aware updates.
UNIFERENCE: discrete-event simulation framework for developing and benchmarking distributed AI inference algorithms across heterogeneous devices and networks.
Conditional Neural Bayes Ratio Estimation (cNBRE) for experimental design optimization in frontier physics, applied to 21-cm radio cosmology.
Low-rank RNN approach using flows to infer neural connectivity structure from population recordings while addressing degeneracy in neural dynamics.
AMALIA: fully open source LLM trained on high-quality European Portuguese data with native evaluation benchmark and improved pt-PT representation.
ALBA: linguistically grounded benchmark for evaluating LLM performance on European Portuguese, addressing underrepresentation in existing benchmarks.
ToothCraft: diffusion model for automated dental crown completion using synthetic training data and contextual generation from incomplete teeth.
Experimental pipeline profiling energy consumption, latency, and quality trade-offs for deploying LLMs on edge devices with hardware constraints.
Study evaluating ML feature compatibility and transferability across malware detection datasets under distribution shifts.
Video benchmark for complex perception reasoning requiring multiple temporally separated visual evidence pieces and compositional logic.
Framework for constructing soft equivariant computer vision models by projecting weights into designed subspaces with theoretical bounds.
Deep symbolic regression using policy gradients with complexity awareness for interpretable data-driven mathematical expression discovery.
Approximate dynamic programming approach for global optimization of expensive black-box functions as alternative to Gaussian process Bayesian optimization.
Projection-free algorithms for online convex optimization with time-varying adversarial constraints and regret bounds.
Theoretical analysis of how iteration order affects convergence and stability in deep neural network training without learning rate schedules.
Methodological commentary on robust predictive modeling under distribution shifts in real-world deployment scenarios.
Task Tokens method adapts behavior foundation models to specific tasks via learnable tokens while preserving zero-shot generalization capabilities.
FastCache accelerates Diffusion Transformer inference through learnable linear approximation and spatial-aware token selection for hidden-state caching.
Defends RAG systems against knowledge poisoning attacks by detecting and mitigating adversarial text injections in external knowledge sources.
Masked training approach for robust arrhythmia detection from digitalized ECG images with temporal asynchrony and missing signal segments.
PepThink-R1 integrates LLMs with chain-of-thought supervised fine-tuning and reinforcement learning for interpretable cyclic peptide design optimization.
Generative model for molecular dynamics trajectories using Markov State Models to accelerate computational protein simulations.
LLMs perform automatic wireless modulation classification via discretized self-supervised candidate retrieval, avoiding distribution shift issues of supervised models.
Control-theoretic framework for LLM activation steering with feedback controllers, connecting empirical steering methods to proportional control theory for safety alignment.
NeST-BO proposes Newton-step targeting Bayesian optimization using Gaussian processes to learn gradient and Hessian information for expensive black-box problems.
Analyzes cryptanalytic model extraction attacks on ReLU-based DNNs with hard-label oracle access and polynomial-time complexity.
Sequence-level TopK (SeqTopK) improves Mixture-of-Experts routing in LLMs by adapting expert assignment per sequence rather than per token without retraining.
Cascading Bandits analyzes decision-making policies for edge inference with multiple models, providing theoretical regret guarantees for Explore-then-Commit and Thompson Sampling approaches.
LiteCache optimizes KVCache memory management for LLM inference using GPU-centric query similarity-driven approach to reduce memory overhead and improve CUDA Graph execution.
Repulsive Bayesian Prompt Learning addresses overfitting in prompt learning for foundation models using Bayesian inference framework for improved out-of-distribution generalization.
Balanced Fine-Tuning aligns LLMs with biomedical knowledge through confidence-weighted token-level optimization and adaptive reward mechanisms.
FedRE proposes a representation entanglement framework enabling federated learning across clients with heterogeneous model architectures and data.
SonicMoE optimizes Mixture of Experts inference through IO-aware and tile-aware techniques for high-granularity, sparse MoE language models.
Deep learning approach for radio path loss prediction in 5G networks with improved generalization across multi-transmitter scenarios and distribution shifts.
Concurrent training enhancements for Kolmogorov-Arnold networks using Newton-Kaczmarz method with FPGA implementation for improved efficiency.
Dual-State Action Pair (DSAP) primitive couples stochastic LLM generation with deterministic verification for reliable code generation agents.
Analyzes decentralized federated learning convergence with user mobility and data heterogeneity in next-gen wireless networks.
Provides theoretical framework explaining why diffusion models prefer direct data prediction over noise/velocity prediction in high-dimensional settings.
Extends Puzzle neural architecture search to reasoning LLMs, producing gpt-oss-puzzle-88B through MoE expert pruning and inference optimization.