Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure
Investigation of how user personalization and mental health disclosure affect harmful behavior in tool-using LLM agents.
Investigation of how user personalization and mental health disclosure affect harmful behavior in tool-using LLM agents.
Benchmark for evaluating continual learning in biomedical NLP across task-diverse datasets with robustness and efficiency metrics.
Study of reproducibility in AI coding agents, showing agent-to-agent variation produces nonstandard errors in empirical results.
Two-stage RL framework training multimodal agents for anticipatory reasoning and long-term planning in multi-step tasks.
Pipeline integrating forecasting models and ML regressors with inventory optimization, evaluated on M5 Walmart dataset.
Evaluation of conformal factuality as reliability guarantee for RAG-based LLMs with novel metrics and robustness analysis.
Large-scale multimodal surgical dataset and foundation models for cross-procedure generalization in surgical AI tasks.
Study of cultural bias in LLMs and prompt-based methods to improve cultural alignment for policy and decision-making tasks.
RL environment where LLM agents learn to generate professional presentations through research, planning, and tool use with multi-component reward system.
Method for training LLM agents to leverage rich environment feedback through reflective experience and post-training, improving long-horizon planning.
Benchmark evaluating audio-visual social interactivity capabilities of omni-modal LLMs in dynamic dialogue settings.
RL framework using Soft Actor-Critic to learn adaptive ray sampling policies for efficient neural radiance field rendering.
Multimodal AI search framework combining vector search, hybrid retrieval, and reasoning for pharmaceutical data across text, images, audio, and video.
Evaluation of VLMs (GPT-4V, Gemini, Claude, Llava) for navigation assistance tasks for people with vision impairments.
Framework extending RLHF using multi-dimensional rubric-based rewards instead of scalar signals for RL training.
Inference-time governance approach for LLMs using adaptive prompt routing to enable social alignment without retraining.
Federated learning framework integrating knowledge graphs and temporal transformers for early sepsis prediction in multi-center ICUs.
Study on recursive language models with self-reflective program search for long-context handling, addressing information extraction challenges.
Analysis of Gini Index role in prompt-based classification for detecting and optimizing class accuracy disparities in long-tailed datasets.
Defense mechanism against steganographic collusion in multi-agent reinforcement learning using dynamic representational circuit breaking.
Model rectification framework using attribution-guided rank-one editing to fix unreliable neural network behaviors on corrupted samples.
Open-source pipeline extending single-agent AI orthodontic treatment planning to dual-agent framework with improved tooth segmentation and landmarks.
Application of quantum amplitude estimation to catastrophe insurance tail-risk pricing with convergence analysis and NISQ noise effects.
AI agent system for hardware design reviews using LLMs to verify semantic correctness of component connections against datasheets.
Framework for LLM application release management using automated self-testing with evidence-based quality gates across five dimensions.
Analysis of transformer training dynamics using Spectral Edge Dynamics to measure coherent optimization directions versus stochastic noise.
Context-aware safety framework for personalized text-to-image models that prevents misuse without broad concept erasure.
Analysis of multi-turn safety failures in LLMs through state-space perspective, showing structured contextual evolution enables jailbreaks.
Token compression method for omnimodal LLMs using dynamic audio-driven semantic chunking to reduce inference costs for audio-visual processing.
Domain adaptation approach for remaining useful life prediction using evidential learning under incomplete degradation trajectories.
Study on engineering challenges in LLM-based multi-agent systems, addressing context pressure, coordination errors, and system drift at scale.
Defense framework against backdoor attacks in LLMs using trigger generation and inversion to locate and mitigate malicious triggers.
Study on over-smoothing in hypergraph neural networks using Ricci flow theory to improve message passing and layer depth handling.
Research on using inference time as a proxy to estimate LLM energy consumption, addressing opacity in API-based model access and environmental impact.
SEMAG: self-evolutionary multi-agent code generation framework that decomposes programming tasks into planning, coding, debugging stages with adaptive workflow selection.
Uncertainty-guided multi-expert framework for imbalanced sequence learning addressing poor expert specialization and prediction conflicts in long-tailed data.
Retrieval-augmented generation framework using GPT-4 to accelerate CO2 reduction catalyst discovery by exploring chemical spaces and interpreting results.
Method bridging learned embeddings and handcrafted features in event sequences for financial systems, addressing interpretability and latency constraints in production ML.
Large-scale competition analysis revealing LLM agents' vulnerability to indirect prompt injection attacks through adversarial instructions in external content sources.
Framework and prototype for navigable dataset map in engineering design and systems engineering to improve data accessibility and research reproducibility.
Meta-TTRL: metacognitive test-time reinforcement learning framework for unified multimodal models enabling knowledge accumulation across similar prompts in text-to-image generation.
MiroThinker-1.7 and H1: research agents with enhanced verification and multi-step reasoning via structured planning and contextual reasoning for long-horizon tasks.
ClawWorm: first documented self-propagating attack across LLM agent ecosystems, demonstrating security vulnerabilities in OpenClaw platform with 40,000+ active instances.
Technique improving pretrained diffusion/flow-matching robot policies by replacing sampled noise with optimized constant vectors for better downstream reward performance.
Simulation Distillation method enabling sim-to-real transfer in robotics by pretraining world models in simulation for rapid real-world adaptation with low data.
CorrectionPlanner: autonomous driving planner using reinforcement learning with explicit self-correction mechanism in propose-evaluate-correct loop for unsafe action handling.
Evaluation of how LLMs and tokenizers handle Arabic root-pattern morphology, testing whether models capture genuine morphological structure or rely on surface memorization.
Differentiable framework for computing geodesics on 3D meshes with parallelization support to improve machine learning on non-Euclidean geometric domains.
OMNIFLOW: multimodal agent combining LLMs with physics-grounded reasoning to handle spatiotemporal PDE dynamics without domain-specific fine-tuning, reducing non-physical hallucinations.
Security framework for LLM-based multi-agent systems addressing manipulation risks from malicious agents in interactive agentic networks through communication channel exploitation.