Extracting Money Laundering Transactions from Quasi-Temporal Graph Representation
Graph-based ML approach for detecting money laundering transactions in financial networks.
Graph-based ML approach for detecting money laundering transactions in financial networks.
Computational analysis of exponential weights algorithm for online logistic regression with improved complexity bounds.
Neural routing algorithms for network traffic optimization using live telemetry data with near-real-time latency.
ML analysis of NASA space biology dataset revealing thermogenic reprogramming in female mouse adipose tissue.
Addresses reward hacking in RLHF by using advantage sign robustness to prevent policy degradation.
Federated learning optimization via fixed gating for weight averaging under statistical heterogeneity.
Self-Guide framework for LLM agents using co-evolved internal rewards to address sparse reward problem in long-horizon tasks.
On-policy self-distillation training paradigm for LLMs combining dense signals from larger teacher models.
Graph domain adaptation method addressing structural discrepancies in graph neural networks under topology shifts.
arXiv paper analyzes hallucination's role in RL post-training of multimodal LLMs, investigating whether models genuinely learn visual information under RL.
arXiv paper proposes PRISM, combining LLM-guided labeling with semantic clustering for interpretable topic modeling with low computational cost.
arXiv paper studies context space optimization for generally capable agents, examining credit assignment, forgetting, and overfitting in context learning.
arXiv paper on surrogate modeling for real-time blood flow prediction and hemodynamic cardiovascular analysis.
arXiv paper exploring server learning to enhance federated learning robustness against malicious attacks with non-IID client data.
arXiv paper on federated class-incremental learning for LEO satellites addressing non-IID data, memory, and communication constraints.
arXiv paper proposes AI-agent-augmented DNS blocking to prevent LLM access during student evaluations, addressing academic integrity concerns.
arXiv paper introducing TRACE, ML pipeline for detecting Internet routing changes using traceroute latency data with ensemble learning.
arXiv paper analyzing backdoor attacks on decentralized LLM post-training via pipeline parallelism, examining vulnerabilities from malicious participants.
arXiv paper proposing fully photonic convolutional neural network with pre-trained in-situ training to overcome CMOS energy bottlenecks.
PlayGen-MoG: Mixture-of-Gaussians framework for diverse multi-agent trajectory generation in team sports with spatial coordination.
Guideline2Graph: LLM/VLM-based parser converting multimodal clinical guidelines into executable decision graphs with cross-page control flow.
Examination of confirmation bias in LLMs through rule-discovery tasks, evaluating and mitigating systematic reasoning failures.
Projection-free adaptive SGD for matrix optimization achieving dimension-independent convergence without costly quadratic projections.
Statistical survey of RLHF for LLM alignment, examining noisy subjective feedback for reward models and policy optimization.
Neural posterior estimation for Bayesian parameter inference in Li-ion batteries using probabilistic calibration.
AQVolt26: High-temperature halide dataset with r²SCAN calculations for training universal ML potentials for solid-state batteries.
Deep reinforcement learning for element-level bridge lifecycle optimization using condition state proportions.
Feature Attribution Stability Suite: Evaluates post-hoc attribution method stability under realistic input perturbations with improved metrics.
Synapse: LLM-guided two-phase retrieval and genetic algorithm optimization for job-person matching under information imbalance.
Empirical study of confidence calibration and hallucination in vision-language models for medical VQA, spanning three model families.
UniScene3D: Transformer-based 3D scene encoder jointly modeling appearance and geometry from multi-view colored pointmaps with CLIP alignment.
Optimal error algorithms for adversarial learning using randomized hypotheses, improving deterministic hypothesis bounds by factor of 1/2.
WSVD: Weighted SVD technique for efficient low-precision execution of vision-language models with reduced computational burden.
Study examining safety alignment bypass methods in LLMs via jailbreak-tuning and weight orthogonalization, analyzing disabled safety guardrails.
Trajectory-free self-test loss function for learning interacting particle system potentials from unlabeled discrete time-point data.
Gromov-Wasserstein optimal transport methods for geometry-aware multi-view data embedding under heterogeneous geometries and nonlinear distortions.
AutoVerifier: LLM-based agentic framework automating end-to-end verification of technical claims across scientific literature without domain expertise required.
RL framework using LLM-as-judge for label-free knowledge distillation and reasoning improvement in small and large language models without ground truth labels.
Placebo-anchored transport framework for meta-analysis addressing covariate shift across randomized controlled trials.
Systematic evaluation of LLM formal reasoning capabilities using Chomsky hierarchy and computation theory complexity metrics.
MOMO: Foundation model merging multi-sensor Mars remote sensing data (HiRISE, CTX, THEMIS) using Equal Validation Loss alignment.
Variational Bayesian adaptive Kalman filter for state estimation in sensor networks with packet dropouts and corrupted observations.
Transfer learning approach for loan recovery prediction addressing data scarcity and distribution shifts across portfolio domains.
Theoretical characterization of Lipschitz constants for kernel feature maps in positive definite kernels.
Procedural geometry data generation and visual grounding with vision-language models for Referring Image Segmentation in education.
Split-and-conquer framework for detecting manipulated speech regions via boundary detection and segment-level classification.
GPU-accelerated solver combining randomized subspace embedding and Nesterov acceleration for large-scale portfolio optimization.
Learning from synthetic data using provenance-based input gradient guidance to teach models discriminative regions.
Inversion-free stochastic natural gradient method for optimization on Riemannian manifolds with implicit parameter constraints.
Fredholm Integral Neural Operators framework for learning non-expansive integral operators with universal approximation proofs.