Latent Semantic Manifolds in Large Language Models
Mathematical framework interpreting LLM hidden states as points on latent semantic manifolds with Riemannian geometry and Voronoi partitions.
Mathematical framework interpreting LLM hidden states as points on latent semantic manifolds with Riemannian geometry and Voronoi partitions.
Training-free hallucination detector for LLMs using sample transform cost to measure output distribution complexity without fine-tuning.
Chinese financial news dataset and benchmark for evaluating LLM-based agents in macro and sector asset allocation decision-making.
Conditional flow-matching framework using diffusion Transformers to unify learning of PDE solution operators across varying dimensionality.
CNN-LSTM framework with attention and focal loss for detecting falls in elderly individuals from multimodal sensor data.
AI-based tropical cyclone track and intensity forecasting with systematic bias correction on weather data.
Decision Transformer approach for optimizing emergency vehicle signal preemption using offline, return-conditioned sequence modeling.
Diffusion model for generating synchronized group dance choreography from music with spatial coordination for film/gaming.
Geometric Mixture-of-Experts framework for graph representation learning using curvature-guided routing on heterogeneous topologies.
Physics-informed Schrödinger bridge approach for data assimilation from sparse observations in PDE-governed systems.
Dataset aligning instruction manuals with assembly videos for evaluating multimodal LLMs on real-world technical tasks.
AEGIS infrastructure for governance of adaptive medical AI systems under FDA and EU regulations with continuous improvement.
Multi-task deep learning framework for predicting lithium-ion battery state-of-health and remaining useful life.
Delta-Aware Quantization framework for post-training LLM compression that preserves knowledge from alignment fine-tuning.
Classification approach for wind power ramp event forecasting under severe class imbalance for grid stability.
AgentSLR uses agentic AI to automate systematic literature reviews in epidemiology from retrieval through synthesis.
Method for adding trained persistent memory to frozen decoder-only LLMs without cross-attention mechanisms.
Applies conformal prediction for formal safety guarantees in wildfire spread prediction using tabular, spatial, and graph models.
Comprehensive study of LLM-based data imputation across multiple models and datasets, analyzing hallucination effects and control mechanisms.
Combines graph signal processing with Mamba2 state-space models to create adaptive filter banks for language modeling.
Causal Direct Preference Optimization method for training LLMs to generate recommendations while mitigating spurious correlations.
Graph RAG framework combining labeled property graphs and RDF for retrieval-augmented generation over structured and semi-structured data.
T-MAP uses evolutionary search to red-team LLM agents by exploiting multi-step tool execution vulnerabilities in MCP ecosystems.
Analysis of feature importance bias in gradient boosting models under multicollinearity, affecting SHAP-based explanations.
WIST framework uses web-grounded iterative self-play with reinforcement learning to improve LLM reasoning in specific domains.
Review of neuroscience and language technologies for aphasia rehabilitation using personalized, culturally sensitive AI tools.
Theoretical analysis of full-waveform inversion using neural tangent kernel framework for geophysical and medical imaging.
Study on using LLMs for algorithm synthesis with provable guarantees, combining mathematical reasoning with practical performance.
Research on improving conditional modeling in diffusion models, establishing equivalence between classifier-free guidance and alignment objectives.
Quantum-enhanced graph neural network for network intrusion detection exploiting relational dependencies in traffic flows.
Quantum federated autoencoder framework for anomaly detection in IoT networks using quantum computing and federated learning.
Proposes Reasoner-Executor-Synthesizer architecture for LLM agents that maintains O(1) context window while avoiding hallucination and token cost scaling.
Research evaluating Vision-Language Models' ability to detect misleading data visualizations and deceptive captions in charts.
Multimodal fusion framework for synthetic lethality prediction in cancer drug development addressing modality laziness problem.
FAAR quantization method for NVFP4 ultra-low-bit format that adapts rounding to non-uniform numerical grid for efficient LLM edge deployment.
Study on multimodal fusion strategies for time series forecasting showing naive fusion fails and proposing constrained fusion approach.
MTEO method for few-step diffusion sampling by distilling layer-wise, step-wise time embeddings to accelerate inference.
AI Co-Scientist framework combining LLM agents with cloud computing to automate search ranking research from ideation through GPU training.
PhD thesis on classification and segmentation of gastrointestinal tract images for real-time medical diagnosis applications.
Cross-task evaluation study of LoRA adapters showing nominal instruction-tuning labels don't reliably predict realized instruction-following capabilities.
Symbolic Graph Network framework for discovering partial differential equations from noisy sparse data without numerical differentiation.
Adaptive temporal control system for autonomous agents that learns optimal action intervals using hyperbolic geometry predictive signals.
Quantum Wasserstein GAN with latent style representation for de novo drug design using generative AI.
Open-source framework (CaP-X) for benchmarking and improving code-as-policy agents for robot manipulation tasks.
Token-level analysis of distributional shifts in RLVR fine-tuning of LLMs to understand mechanisms underlying reasoning improvements.
LLM-guided headline rewriting system that enhances reader engagement while maintaining editorial integrity and avoiding clickbait.
Edge AI video sensing paradigm using grayscale capture with selective RGB frames to reduce bandwidth and computational requirements.
Online adaptation method for neural closed-loop control systems that preserves stability while updating controllers during operation.
Ablation study analyzing specialization patterns in hybrid language models combining attention with state space models on sub-1B parameter models.
Framework for building language model general capabilities via automatic curriculum of cross-entropy game tasks for relevant skill discovery.