Universal Hypernetworks for Arbitrary Models
ArXiv paper introducing Universal Hypernetworks that generate weights for arbitrary model architectures using descriptors.
ArXiv paper introducing Universal Hypernetworks that generate weights for arbitrary model architectures using descriptors.
ArXiv paper applying diffusion denoising objectives to causal structure learning from observational data.
ArXiv paper on model-based reinforcement learning for control systems with time-varying dynamics.
ArXiv paper on in-context agentic reinforcement learning enabling LLM agents to internalize skills at inference time.
ArXiv paper on lightweight diffusion transformer for crystal structure generation using subatomic tokenization.
ArXiv paper unifying group-relative and self-distillation policy optimization for LLM post-training with improved credit assignment.
ArXiv paper proposing Head-Calibrated Clipped-Linear Softmax as efficient surrogate for attention softmax in edge inference.
ArXiv paper on exact parameterization of doubly stochastic matrices for learned mixing in neural networks.
Single-stage training paradigm for efficient LLM reasoning that reduces token consumption in chain-of-thought without degrading quality.
Neural-assisted physics-based model for interpretable battery aging prediction via 2D aging fingerprints without additional diagnostics.
Learning-based cooperative coevolution framework addressing heterogeneous large-scale global optimization via adaptive low-dimensional optimizers.
Lightweight deep learning architecture for brain tumor classification from MRI images with comparative analysis of different approaches.
Machine learning optimization applied to solve modular bootstrap equations for exploring 2D conformal field theories.
Physics-informed neural network and finite volume hybrid approach for modeling UAV traffic patterns in 3D anisotropic wind fields.
Evolutionary multi-objective optimization framework for fusing deepfake speech detectors to balance accuracy and system complexity using NSGA-II.
Neural-symbolic framework for discovering constitutive closures in nonlinear PDEs from spatiotemporal data while avoiding spurious physical recovery.
Research on regularizing attention scores in vision transformers using bootstrapping to improve interpretability and reduce noisy attention maps.
Analysis of safety, security, and cognitive risks in world models used for autonomous decision-making in robotics, autonomous vehicles, and agentic AI systems.
Neural architecture for predicting odorant intensity perception by combining graph convolutional networks with domain-informed design for molecular structure analysis.
Mathematical study of optimal coupling in causal dynamical systems using Schrödinger Bridge framework for input-output distributional data.
Framework for conceptualizing and generating intentional event streams to evaluate stream processing and mining algorithms.
Generative approach for characterizing task timing in real-time systems across varying hardware resource contexts.
Study of reliability gaps in AI-assisted medication systems, highlighting risks in healthcare decision support.
Framework combining LLMs with infeasibility detection for NP-hard combinatorial optimization problems.
Equivariant transformer architecture for modeling agent behaviors in autonomous driving with SE(2) symmetry.
New optimizer deriving design principles from Muon, improving LLM training efficiency through surrogate model analysis.
Method for efficiently adapting closed-box LLM APIs to target tasks by priming followed by local optimization.
Theoretical physics study of topological gaps in spin models and critical phenomena using persistent homology.
Research on conformal risk control under non-monotonic loss functions for distribution-free prediction guarantees.
Benchmark dataset for evaluating AI coding agents based on production workloads, addressing language distribution and codebase structure gaps.
EXHIB benchmark for binary function similarity detection supporting vulnerability analysis and malware classification.
Study of interactions between normalization methods and optimizers in LLM training at 1B parameters.
LiteInception: lightweight interpretable deep learning framework for fault diagnosis on edge devices.
LiveMathematicianBench: benchmark for evaluating LLM mathematical reasoning capabilities with proof sketches.
Prophet inequality problem with noisy observations and unknown reward distributions using linear models.
Study on language pre-training bias improving performance on general vision tasks through cross-modality transfer.
Analysis of permutation-invariant discrete representation learning for spatially aligned images using vector quantization.
Learning spatial structure from automotive radar pre-beamforming data using cross-modal supervision.
Woosh: open-source sound effects foundation model from Sony AI with architecture, training details, and benchmarks.
Multi-class classification approach using possibilistic supervision and Kullback-Leibler projection.
Theoretical analysis for clustering heteroscedastic Gaussian data without prior knowledge of cluster count.
Physics-informed transformer for reconstructing wideband channel frequency response in multi-band wireless systems.
NeuroPose-AHM dataset integrating kinematic measurements and clinical scores for neurological disorder diagnosis.
AWS system for automated generation of detection rules for web vulnerability identification from CVE data.
Theoretical analysis of multi-head self-attention transformers using particle systems and homogenization limits.
Curia-2 foundation model using self-supervised learning for medical imaging analysis on CT and MRI data.
Comparison of centralized and decentralized RL controllers for traffic signal control in urban corridor networks.
Multi-scene indoor dataset for human detection, segmentation, and tracking across campus locations with automated annotation pipeline.
Reinforcement learning approach for speculative trading formulated as sequential optimal stopping with Cox process-driven intensity controls.
Mining instance-centric vision-language contexts for human-object interaction detection, leveraging VLMs to improve semantic understanding and contextual reasoning.