Quantization-Robust LLM Unlearning via Low-Rank Adaptation
Combines low-rank adaptation with quantization-aware unlearning to ensure LLM knowledge removal survives post-training 4-bit quantization.
Combines low-rank adaptation with quantization-aware unlearning to ensure LLM knowledge removal survives post-training 4-bit quantization.
Golden Layers method improves LLM knowledge editing via layer gradient analysis to identify optimal depth for updating model predictions per query.
cc-Shapley extends Shapley values for multivariate feature importance by incorporating causal context to address spurious associations.
TRC² architecture for continual learning in LLMs preventing catastrophic forgetting through decoder-only thalamic routing of cortical columns.
Web-Knowledge-Web pipeline iteratively crawls domain sources and knowledge graphs to discover small/medium enterprise suppliers with improved database coverage.
Establishes theoretical connection between drifting generative dynamics and Sinkhorn divergence-induced gradient flows with cross-minus-self decomposition.
AgentTrace framework for post-hoc root cause analysis in deployed multi-agent systems via causal graph reconstruction from execution logs.
Exploits massive redundancy in gradient transport to reduce real-time recurrent learning computational cost from O(n^4) via random sparsity patterns.
Connects adversarial robustness and LLM hallucinations through shared geometric principle formalized as Neural Uncertainty Principle with irreducible uncertainty bounds.
Benchmarks physics-guided and deep learning models for air quality index forecasting on region-specific datasets.
mSFT algorithm addresses overfitting in multi-task supervised fine-tuning by dynamically adjusting data mixture ratios based on task-specific learning dynamics.
Decouples exploration from policy optimization in RL using uncertainty-guided tree search for efficient autonomous exploration without intrinsic motivation.
Online learning algorithm balancing regret guarantees in adversarial/stochastic settings with safety constraints via COMPASS-Hedge method.
Architecture for aircraft health monitoring balancing accuracy and computational constraints under class imbalance and environmental uncertainty.
Deep learning approach for automated sleep staging in stroke patients with analysis of generalization gaps in clinical populations using Grad-CAM interpretations.
Method for steering code LLMs toward specific programming languages and libraries by manipulating activation space directions at inference time, tested on five language/library pairs across three open-weight models.
Analysis of response homogenization in RLHF-aligned LLMs showing reduced uncertainty estimation and implications for sampling.
Multimodal fusion approach for microservice incident detection handling missing modalities without static imputation.
Uncertainty-guided rebalancing technique for safety monitoring in cyber-physical systems with imbalanced time-series data.
Analysis of generalization in audio deepfake detection across datasets and model architectures.
Actor-critic reinforcement learning approach combining trajectory optimization with Sobolev learning for optimal control.
Knowledge-guided pretraining framework for multimodal foundation models applied to remote sensing applications.
Reproducibility analysis of 10 graph-based neural recommender papers from SIGIR 2022 assessing methodology and impact.
Investigation of channel-triggered backdoor attacks on semantic communication image reconstruction systems.
Mathematical framework defining curved Bregman divergences and their properties for statistical analysis.
Reproducibility study of diffusion-based recommender systems identifying methodological issues and limited actual progress.
Theoretical analysis showing supervised learning can be decomposed into unsupervised parameter selection plus label addition.
LSTM-based machine learning models for USD/BDT exchange rate forecasting using historical financial data.
Real-time streaming text-to-video generation model using transformer-based diffusion for interactive applications.
Multimodal approach for trajectory prediction with sensor fusion and tracking for embodied agents in occluded scenarios.
Benchmark (ORIC) examining vision-language model failures in object recognition under contextual incongruity scenarios.
Study on alternative training objectives for LLM fine-tuning beyond negative log likelihood to improve generalization.
Multi-dimensional autoscaling platform for stream processing services on edge devices with limited resources.
Framework for decentralized peer-to-peer ride-sharing with altruistic incentive structures to reduce congestion and emissions.
Bayesian optimization algorithms on metric graphs using Gaussian process surrogates for expensive black-box function evaluation.
GUI-AIMA alignment method for MLLMs to ground natural language instructions to UI regions, enabling computer-use agents via visual grounding.
Econometric study of Netflix personalized recommendation value using discrete choice modeling, not ML development focused.
PriVi foundation model for primate behavior analysis in video, data-centric computer vision approach for non-human animal research.
Framework combining LLMs and conformance checking for detecting control-flow anomalies in software monitoring, security application.
WorldMM memory-augmented video LLM agent for reasoning over hours-long videos with multimodal memory, addressing long-context understanding.
SELVA model for text-conditioned selective video-to-audio generation, enabling fine-grained audio control from multimodal video input.
Nemotron-Cascade framework scaling reinforcement learning for general-purpose reasoning models, addressing heterogeneity in response lengths and verification latency.
Neural network interpretability approach for identifying EEG patterns associated with cybersickness in VR, application of ML for neuroscience.
Deep learning method for single image reflection separation using dual-stream architecture with cross-scale gated fusion, computer vision application.
Framework preserving ambiguity in LLM inference through non-collapsing state spaces, addressing premature semantic commitment in dialogue systems.
Physics-based imaging technique extending ptychography to single-shot overlap-free coherent diffractive imaging, not AI/ML related.
Analysis of pooling strategies for aggregating pixel-level embeddings from geospatial foundation models to patch-level representations.
Diagnostic approach using entropy trajectory shapes to predict reasoning reliability in chain-of-thought LLM outputs, practical for uncertainty quantification.
KALAVAI protocol predicting when independently trained specialist LLMs can be fused post-hoc, with quantitative formula for cooperative value estimation.
MDKeyChunker pipeline for structure-aware document chunking and single-call LLM enrichment to improve RAG accuracy, addressing semantic fragmentation in retrieval.