Ax Elena Villalobos (Tecnol\'ogico de Monterrey, Mexico City, Mexico), Adolfo De Un\'anue T. (Tecnol\'ogico de Monterrey, Mexico City, Mexico), Fernanda Sobrino (Tecnol\'ogico de Monterrey, Mexico City, Mexico), David Ak\'e (Tecnol\'ogico de Monterrey, Mexico City, Mexico), Stephany Cisneros (Tecnol\'ogico de Monterrey, Mexico City, Mexico), Jorge Lecona (Container Terminal Operations, Veracruz, Mexico), Alejandra Matadamaz (Container Terminal Operations, Veracruz, Mexico) 20d ago

Toward Reducing Unproductive Container Moves: Predicting Service Requirements and Dwell Times

Data science study using machine learning to predict container service requirements and dwell times at terminals to reduce unproductive moves.

Ax Jianhong Pang, Ruoxi Cheng, Ziyi Ye, Xingjun Ma, Zuxuan Wu, Xuanjing Huang, Yu-Gang Jiang 20d ago

Steering the Verifiability of Multimodal AI Hallucinations

Framework for steering verifiability of multimodal LLM hallucinations, distinguishing between obvious and elusive hallucinations to guide mitigation strategies.

Ax Seongwoo Jeong, Seonil Son 20d ago

How Much LLM Does a Self-Revising Agent Actually Need?

Empirical study decomposing LLM-based agent competence to identify which capabilities derive from the language model versus explicit structural design in self-revising agents.

Ax Nguyen Phuc Tran, Brigitte Jaumard, Oscar Delgado, Tristan Glatard, Karthikeyan Premkumar, Kun Ni 20d ago

LLM-Augmented Knowledge Base Construction For Root Cause Analysis

Evaluates LLM-augmented knowledge base construction for root cause analysis in network communications to enable rapid failure diagnosis and outage resolution.

Ax Peijie Yu, Wei Liu, Yifan Yang, Jinjian Li, Zelong Zhang, Xiao Feng, Feng Zhang 20d ago

Benchmarking LLM Tool-Use in the Wild

Benchmark for evaluating LLM tool-use agents on multi-turn, multi-step interactions addressing compositional tasks, implicit intent, and instruction transitions in real user behavior.

Ax Asif Azad, MD Sadik Hossain Shanto, Mohammad Sadat Hossain, Bdour Alwuqaysi, Sabri Boughorbel, Yahya Bokhari, Abdulrhman Aljouie, Ayah Othman Sindi, Ehsan Hoque 20d ago

Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment

Modular system for phoneme-level Arabic pronunciation assessment combining speech-to-phoneme models with clinical-scale scoring metrics for language learning and therapy.