Ax Shenzhi Yang, Guangcheng Zhu, Bowen Song, Sharon Li, Haobo Wang, Xing Zheng, Yingfan Ma, Zhongqi Chen, Weiqiang Wang, Gang Chen 4d ago

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Analysis of noisy-label robustness in Reinforcement Learning with Verifiable Rewards for training LLM reasoning models.

Ax Taiping Qu, Hongkai Zhang, Lantian Zhang, Can Zhao, Nan Zhang, Hui Wang, Zhen Zhou, Mingye Zou, Kairui Bo, Pengfei Zhao, Xingxing Jin, Zixian Su, Kun Jiang, Huan Liu, Yu Du, Maozhou Wang, Ruifang Yan, Zhongyuan Wang, Tiejun Huang, Lei Xu, Henggui Zhang 4d ago

BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

BAAI Cardiac Agent: multimodal AI agent for automated cardiovascular disease diagnosis from cardiac MRI with specialized expert models.

Ax Juhan Park, Taerim Yoon, Seungmin Kim, Joonggil Kim, Wontae Ye, Jeongeun Park, Yoonbyung Chai, Geonwoo Cho, Geunwoo Cho, Dohyeong Kim, Kyungjae Lee, Yongjae Kim, Sungjoon Choi 4d ago

Learning Dexterous Grasping from Sparse Taxonomy Guidance

Research on dexterous robotic grasping using reinforcement learning with sparse taxonomy guidance for multi-finger manipulation control.

Ax Kamyar Barakati, Boris N. Slautin, Utkarsh Pratiush, Hiroshi Funakubo, Sergei V. Kalinin 4d ago

PATHFINDER: Multi-objective discovery in structural and spectral spaces

Multi-objective automated discovery framework for microscopy and characterization workflows, addressing premature convergence through exploration coordination across structural and spectral spaces.

Ax Haonian Ji, Kaiwen Xiong, Siwei Han, Peng Xia, Shi Qiu, Yiyang Zhou, Jiaqi Liu, Jinlong Li, Bingzhou Li, Zeyu Zheng, Cihang Xie, Huaxiu Yao 4d ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

ClawArena benchmark evaluating AI agents' ability to maintain correct beliefs in evolving information environments with contradictory sources and changing evidence.

Ax Xiaohang Yu, William Knottenbelt 4d ago

LOCARD: An Agentic Framework for Blockchain Forensics

LOCARD: agentic framework modeling blockchain forensics as sequential decision-making, enabling dynamic iterative investigations instead of static inference pipelines.

Ax Linyao Chen, Bo Huang, Qinlao Zhao, Shuai Shao, Zhi Han, Zicai Cui, Ziheng Zhang, Guangtao Zeng, Wenzheng Tang, Yikun Wang, Yuanjian Zhou, Zimian Peng, Yong Yu, Weiwen Liu, Hiroki Kobayashi, Weinan Zhang 4d ago

Agentization of Digital Assets for the Agentic Web: Concepts, Techniques, and Benchmark

Framework and benchmark for converting web elements into autonomous agents as foundational primitives for the Agentic Web, enabling automated agent generation from digital assets.

Ax Francesco Salvi, Alejandro Cuevas, Manoel Horta Ribeiro 4d ago

Commercial Persuasion in AI-Mediated Conversations

Two preregistered experiments (N=2,012) measuring how LLM agents embed commercial persuasion into conversational recommendations compared to traditional search engines.