Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition
Zero-shot late fusion method combining audio-language models with specialist models for speech emotion recognition.
Zero-shot late fusion method combining audio-language models with specialist models for speech emotion recognition.
Systematic literature review of machine learning approaches for early detection of burnout in software engineers.
Scalable foundation model for automated knowledge graph generation from scientific literature using domain-specific optimization.
Active learning method leveraging vision-language foundation models for data-efficient visual recognition.
Safety monitoring approach for LLMs using activation watermarking to detect adaptive adversarial attacks during inference.
Method for weighted conformal anomaly detection under distribution shifts in low-data regimes.
Evaluation of LLMs' ability to mimic authorial styles of literary and political figures using zero-shot prompting.
Framework for sharing memory systems across heterogeneous LLM-based agents via contrastive trajectory distillation to improve knowledge reuse.
Study evaluating whether six LLMs can emulate emotional expression and personality traits across English and Arabic languages.
Research on query-efficient jailbreak fuzzing for LLMs that identifies token importance during prompt mutation to reduce redundant searching under query constraints.
Perceptual optimization strategies for 3D Gaussian Splatting using distortion losses validated via large-scale human evaluation.
Information-theoretic analysis of contextual graph matching with correlated Gaussian features deriving recovery thresholds.
ARGENT: vision-language model using hyperbolic geometry to capture hierarchical structure of visual and linguistic concepts.
Supervised contrastive metric learning for point cloud segmentation in particle detectors using density-based clustering.
VTAM: video-tactile-action model for embodied AI combining visual and tactile signals for contact-rich physical interaction.
VISOR: efficiency method for vision-language models using sparse dynamically selected interactions instead of visual token reduction.
Data-driven and empirical formula models to quantify momentum in competitive tennis matches.
ML model for virtual screening in drug discovery handling out-of-distribution regions with extrapolatory pseudo-label matching.
Minimal Frame Averaging: framework for constructing provably minimal frames achieving exact equivariance in ML systems efficiently.
Continuous action representation for 3D floorplanning addressing scalability bottlenecks from discrete canvas coordinates.
Convergence analysis of linear temporal difference learning in reinforcement learning without requiring linearly independent features.
HFLDD: hybrid federated learning framework using dataset distillation to address non-IID data heterogeneity and label distribution skew.
DART-Eval: benchmark for evaluating DNA language models on regulatory DNA prediction, interpretation and design tasks.
MSA-CNN: lightweight multi-scale CNN with attention mechanism for automatic sleep stage classification from signal data.
BalanceKV: streaming algorithm for approximating attention computations in LLMs using geometric process to reduce memory requirements for long-context generation.
Federated learning approach for designing data-driven feedforward control in vehicle lateral dynamics using distributed data across multiple systems.
Expectation Reflection: multiplicative parameter update paradigm for ML optimization using observation-prediction ratios instead of additive gradient descent.
Analysis of geometric properties and flatness in neural architecture search spaces.
Information-theoretic framework for characterizing and measuring information leakage in concept-based models.
Federated LoRA fine-tuning method for large language models with communication-efficient sparsified updates.
Meta-optimization approach for generating generalizable heuristics using LLMs for combinatorial optimization.
Multi-experiment equation learning method for deriving analytical models from agent-based simulation data.
State representation learning from trajectories using minimum action distance metric for MDPs.
Foundation model adapter for time series forecasting with heterogeneous covariates and multimodal data.
Red teaming framework for systematically discovering diverse vulnerabilities in large language models.
Privacy-preserving graph structure learning using differential privacy at data publishing stage.
Spiking neural network framework with hyperparameter optimization for fraud detection.
Testing framework for deep learning systems using topographical feature discrimination.
Fourier-embedded operator learning framework for solving partial differential equations.
Deep temporal graph method for correcting GNSS positioning errors from jamming attacks.
Graph neural network architecture for modeling spatio-temporal signals with dynamic structure.
Federated learning framework for training deep models on resource-constrained edge devices.
Method for identifying counterfactuals from observational data using optimal transport theory.
Novel sampling algorithm for masked diffusion models improving generation quality and efficiency.
Deep learning model for channel state information prediction in MIMO systems with robustness testing.
Optimizes on-device semantic selection with cross-encoder rerankers for retrieval, agent memory, and recommendations via monolithic forwarding.
Scalable framework for automated desktop UI exploration to generate training data for LLM-based GUI understanding and automation.
Parameter-free clustering framework using self-supervised consensus maximization without requiring hyperparameter tuning.
Pathlet dictionary learning approach for robust and interpretable trajectory generation in privacy-preserving urban mobility applications.
Studies model inversion attacks on latent diffusion models, showing non-uniform memorization patterns in latent space.