Ax Mirali Purohit, Bimal Gajera, Irish Mehta, Bhanu Tokas, Jacob Adler, Steven Lu, Scott Dickenshied, Serina Diniega, Brian Bue, Umaa Rebbapragada, Hannah Kerner 1d ago

MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications

Multi-sensor foundation model merging HiRISE, CTX, and THEMIS Mars remote sensing data via equal validation loss alignment strategy.

Ax Puyu Zeng, Zhaoxi Wang, Zhixu Duan, Liang Feng, Shaobo Wang, Cunxiang Wang, Jinghang Wang, Bing Zhao, Hu Wei, Linfeng Zhang 1d ago

IndustryCode: A Benchmark for Industry Code Generation

Multi-domain benchmark for industry code generation across finance, automation, and aerospace using LLMs, addressing single-domain limitations.

Ax Hongbo Duan, Peiyu Zhuang, Yi Liu, Zhengyang Zhang, Yuxin Zhang, Pengting Luo, Fangming Liu, Xueqian Wang 1d ago

NavCrafter: Exploring 3D Scenes from a Single Image

Framework for synthesizing novel-view video sequences from single images using diffusion models with geometry-aware expansion strategy.

Ax Yixiang Fang, Arijit Khan, Tianxing Wu, Da Yan, Shu Wang 1d ago

LLM+Graph@VLDB'2025 Workshop Summary

Workshop on integrating LLMs with graph-structured data, covering algorithms and systems for bridging LLMs, graph databases, and ML for practical applications.

Ax Gilad Abiri 1d ago

Corporations Constitute Intelligence

Legal analysis of Anthropic's AI constitution document as governance framework, discussing limitations in military and surveillance contexts.

Ax Inbal Rimon, Oren Gal, Haim Permuter 1d ago

Split and Conquer Partial Deepfake Speech

Split-and-conquer framework for detecting partial deepfake speech using boundary detection and segment-level classification stages.

Ax Maciej Markiewicz, Beata Bajcar, Wiktoria Mieleszczenko-Kowszewicz, Aleksander Szcz\k{e}sny, Tomasz Adamczyk, Grzegorz Chodak, Karolina Ostrowska, Aleksandra Sawczuk, Jolanta Babiak, Jagoda Szklarczyk, Przemys{\l}aw Kazienko 1d ago

How Annotation Trains Annotators: Competence Development in Social Influence Recognition

Study of annotator competence development and subjective judgment changes during social influence recognition annotation tasks.

Ax Cristian P\'erez-Corral, Jose I. Mestre, Alberto Fern\'andez-Hern\'andez, Manuel F. Dolz, Jos\'e Duato, Enrique S. Quintana-Ort\'i 1d ago

FedSQ: Optimized Weight Averaging via Fixed Gating

FedSQ algorithm optimizing weight averaging in federated learning across heterogeneous client data with fixed gating mechanisms.

Ax Aichen Cai, Anmeng Zhang, Anyu Li, Bo Zhang, Bohua Cai, Chang Li, Changjian Jiang, Changkai Lu, Chao Xue, Chaocai Liang, Cheng Zhang, Dongkai Liu, Fei Wang, Guoqiang Huang, Haijian Ke, Han Lin, Hao Wang, Ji Miao, Jiacheng Zhang, Jialong Shi, Jifeng Zhu, Jingjing Qian, Junhui Luo, Junwu Xiong, Lam So, Liang Huang, Ming Ke, Mingyang Li, Panfeng Shi, Peng Hao, Qi Wang, Qian Lai, Qiaoqiao Yuan, Qingyu Yin, Qiong Cao, Qixiang Wang, Rongcheng Bian, Rongduo Han, Shaoqiang Zheng, Shi Hu, Shi Suo, Shijie Ren, Shijin Zhang, Shiying Fan, Shuai Xie, Tianyi Zhang, Wei Liu, Wentao Tan, Xianghan Meng, Xiaodong He, Xing Pan, Xiran Wang, Xuyang Peng, Ya Zhang, Yang Liu, Yangyang Duan, Yanxu Chen, Yicheng Gong, Yidan Huang, Yifei Liu, Yinhao Bai, Yongqiang Liu, Yuesong Zhang, Yuqi Zhang, Zerui Xie, Zhenfang Wang, Zhennan Shen, Zheyuan Liu, Zhuwei Zeng 1d ago

JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency

JoyAI-LLM Flash, an efficient mixture-of-experts mid-scale LLM with 20 trillion token pretraining optimized for token efficiency.

Ax Myra Cheng, Isabel Sieh, Humishka Zope, Sunny Yu, Lujain Ibrahim, Aryaman Arora, Jared Moore, Desmond Ong, Dan Jurafsky, Diyi Yang 1d ago

Verbalizing LLMs' assumptions to explain and control sycophancy

Framework for eliciting and verbalizing LLM assumptions to explain and mitigate sycophancy behavior in user interactions.

Ax Zhihao Chen, Ying Zhang, Yi Liu, Gelei Deng, Yuekang Li, Yanjun Zhang, Jianting Ning, Leo Yu Zhang, Lei Ma, Zhiqiang Li 1d ago

Credential Leakage in LLM Agent Skills: A Large-Scale Empirical Study

Large-scale empirical study of credential leakage vulnerabilities in 17,022 LLM agent skills, identifying 520 vulnerable skills with taxonomy of 10 leakage patterns.

Ax Xinyu Wang, Hanwei Wu, Jingwei Song, Shuyuan Zhang, Jiayi Zhang, Fanqi Kong, Tung Sum Thomas Kwok, Xiao-Wen Chang, Yuyu Luo, Chenglin Wu, Bang Liu 1d ago

Co-Evolution of Policy and Internal Reward for Language Agents

Self-Guide method for co-evolving policy and internal reward in LLM agents, addressing sparse reward bottleneck in long-horizon training.

Ax Zheng-Xin Yong, Parv Mahajan, Andy Wang, Ida Caspary, Yernat Yestekov, Zora Che, Mosh Levy, Elle Najt, Dennis Murphy, Prashant Kulkarni, Lev McKinney, Kei Nishimura-Gasparian, Ram Potham, Aengus Lynch, Michael L. Chen 1d ago

An Independent Safety Evaluation of Kimi K2.5

Safety evaluation of Kimi K2.5 open-weight LLM assessing CBRNE misuse, cybersecurity, alignment, and bias risks.

Ax Jian Yang, Wei Zhang, Jiajun Wu, Junhang Cheng, Tuney Zheng, Fanglin Xu, Weicheng Gu, Lin Jing, Yaxin Du, Joseph Li, Yizhi Li, Yan Xing, Chuan Hao, Ran Tao, Ruihao Gong, Aishan Liu, Zhoujun Li, Mingjie Tang, Chenghua Lin, Siheng Chen, Wayne Xin Zhao, Xianglong Liu, Ming Zhou, Bryan Dai, Weifeng Lv 1d ago

InCoder-32B-Thinking: Industrial Code World Model for Thinking

InCoder-32B-Thinking model trained with Error-driven Chain-of-Thought for industrial code generation with reasoning traces.