Pramana: Fine-Tuning Large Language Models for Epistemic Reasoning through Navya-Nyaya
Research on fine-tuning LLMs for epistemic reasoning using Navya-Nyaya logic. Addresses hallucination and brittleness in LLM reasoning capabilities.
Theoretical framework exploring order effects in sequential cognitive processes and non-commutativity in metacognition using operational methods.
Proximity measure quantifies feature similarity of multi-source information objects for entity identification and matching across heterogeneous data sources.
ReVEL hybrid framework uses LLM-guided iterative evolution with structured performance feedback to design effective heuristics for NP-hard problems.
Framework identifies algebraic structures in combinatorial optimization problems, constructs quotient spaces to reduce search space and improve solution quality.
PaperOrchestra multi-agent framework automates AI research paper writing by transforming unstructured materials into submission-ready LaTeX manuscripts.
MMORF multi-agent framework uses language models with specialized agents for multi-objective retrosynthesis planning balancing quality, safety, and cost.
MedGemma 1.5 4B model expands medical capabilities with high-dimensional imaging (CT/MRI/histopathology), anatomical localization, and improved document understanding.
LLM-based sequential clinical diagnosis system models uncertainty-guided evidence acquisition over time using diagnostic trajectory learning.
Kolmogorov-Arnold Fuzzy Cognitive Maps extend neuro-symbolic modeling to handle non-monotonic causal dependencies in complex dynamic systems.
IntentScore is a plan-aware reward model trained on 398K offline GUI interactions to evaluate and score actions for computer-use agents across multiple operating systems.
Multi-agent reinforcement learning replaces channel modeling with spatial intelligence for autonomous control of reconfigurable intelligent surface arrays.
Hierarchical multi-agent reinforcement learning optimizes reconfigurable intelligent surfaces for mmWave networks without channel state information estimation.
Instruction-tuned LLMs parse and mine unstructured HPC system logs from heterogeneous sources to extract patterns and diagnose operational issues.
ClawsBench benchmark evaluates LLM agents on realistic productivity tasks (email, scheduling, documents) in simulated multi-service environments with stateful workflows.
AttriBench: Demographically-balanced benchmark for measuring attribution bias in LLMs when attributing quotes to original authors.
Framework for translating governance norms into enforceable runtime guardrails for agentic AI systems with multi-step execution.
Graph neural network approach for predicting delivery delays in logistics networks using warehouse and transportation data.
Evolutionary theory simulation of how alignment affects populations of AI models over time and belief propagation dynamics.
Reward decomposition approach to disentangle pressure capitulation from evidence blindness in LLM sycophancy behavior.
Theoretical analysis and solutions for value factorization convergence to suboptimal stable points in multi-agent reinforcement learning.
Graph of Skills: Dependency-aware skill retrieval system for managing and scaling thousands of reusable skills in agent systems.
TRACE: Framework for targeted training of LLM agents on capability gaps identified in specific environments and task distributions.
Agentic AI system that profiles user expertise levels to adapt interaction depth using LLaMA-based modular architecture.
RETINA-SAFE benchmark and ECRT framework for detecting hallucination risks in medical LLMs with insufficient or conflicting evidence.
ETR: Training method for efficient chain-of-thought reasoning by optimizing entropy trends rather than global uncertainty reduction.
LatentAudit: White-box monitoring system for RAG hallucination detection using Mahalanobis distance on residual stream activations.
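The LatentAudit summary names Mahalanobis distance on residual-stream activations; the paper's actual features, layers, and thresholds are not given here, so the following is a generic, illustrative sketch of that scoring idea in 2-D with a hand-coded covariance inverse (all data and dimensions are stand-ins):

```python
import math
import random

random.seed(0)
# Stand-in for 2-D "activations" from grounded answers: correlated
# components, x2 tracking x1 plus small noise.
grounded = []
for _ in range(1000):
    x1 = random.gauss(0.0, 1.0)
    x2 = x1 + random.gauss(0.0, 0.3)
    grounded.append((x1, x2))

n = len(grounded)
m1 = sum(p[0] for p in grounded) / n
m2 = sum(p[1] for p in grounded) / n
# Sample covariance entries and the explicit 2x2 inverse.
c11 = sum((p[0] - m1) ** 2 for p in grounded) / (n - 1)
c22 = sum((p[1] - m2) ** 2 for p in grounded) / (n - 1)
c12 = sum((p[0] - m1) * (p[1] - m2) for p in grounded) / (n - 1)
det = c11 * c22 - c12 * c12
i11, i22, i12 = c22 / det, c11 / det, -c12 / det

def mahalanobis(x1: float, x2: float) -> float:
    """Distance of one activation vector from the grounded distribution."""
    d1, d2 = x1 - m1, x2 - m2
    return math.sqrt(i11 * d1 * d1 + 2 * i12 * d1 * d2 + i22 * d2 * d2)

# A point on the correlated ridge scores low even far from the origin;
# a point off the ridge scores high despite a similar Euclidean norm.
on_ridge = mahalanobis(2.0, 2.0)
off_ridge = mahalanobis(2.0, -2.0)
assert off_ridge > on_ridge
```

The covariance-aware distance is what distinguishes this from a plain norm check: activations consistent with the grounded distribution's correlation structure score low, while equally large but off-distribution activations are flagged.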
TFRBench: Benchmark for evaluating reasoning capabilities of time-series forecasting systems beyond numerical accuracy metrics.
Using LLMs as judges to evaluate lightweight segmentation models for drone-based power line inspection under distribution shift.
Domain-invariant neurons approach for cross-domain knowledge transfer to boost LLM reasoning in expertise-scarce specialized domains.
Empirical study on using cross-domain demonstrations to improve in-context learning when expert annotations in target domain are scarce.
HYVE framework for LLMs to better process machine data (logs, metrics, traces) through hybrid structured/unstructured representations.
CODESTRUCT: LLM-based code agents using structured AST action spaces instead of text matching for reliable code editing and repository interaction.
Research on multi-agent pathfinding algorithms handling non-unit edge costs and continuous-time actions for real-world robotic/logistics scenarios.
PRISM-MCTS learning approach using reasoning trajectories with metacognitive reflection, inspired by reasoning models such as OpenAI o1, enabling efficient reasoning in low-resource NLP settings.
Automated framework using locally-deployed LLMs to audit hospital discharge summaries at scale, enforcing transition-of-care documentation requirements for patient safety.
Adaptive serverless resource management framework using slot-survival prediction and event-driven architecture to optimize cold start latency and utilization.
OntoTKGE model for temporal knowledge graph extrapolation leveraging ontological knowledge to handle sparse historical interactions and enable behavioral pattern inheritance.
GMRL-BD algorithm using bias-diffusion and multi-agent RL to detect untrustworthy topic boundaries of LLMs, identifying domains where model answers cannot be reliably trusted.
Auditable Agents framework establishing accountability, auditability, and auditing definitions for LLM agents with external effects, addressing post-deployment answerability.
SCMAPR stage-wise multi-agent refinement framework for complex scenario text-to-video generation that refines and self-corrects ambiguous prompts through agent collaboration.
Thinking Diffusion method adding reasoning penalization and guidance to diffusion multimodal LLMs, combining Chain-of-Thought reasoning with parallel generation capabilities.
OmniDiagram unified framework for code generation across diverse diagram types and languages using visual interrogation reward for alignment with visual specifications.
UniCreative approach using reference-free reinforcement learning to balance long-form coherence and short-form expressiveness in LLM-based creative writing generation.
Market-Bench comprehensive benchmark evaluating LLM capabilities in economically-relevant tasks via configurable multi-agent supply chain model with LLM retailer agents.
ActivityEditor dual-LLM-agent framework for zero-shot cross-regional human trajectory generation, synthesizing physically valid mobility patterns without region-specific historical data.
Analysis of 12,007 rank-invariant pseudo-Boolean landscapes, introducing a stronger notion of rank landscape equivalence under translation and rotation symmetries.
Echo memory framework for multimodal LLM agents enabling transfer of reusable knowledge across Minecraft tasks by decomposing experience into five interpretable dimensions.
SignalClaw framework using LLMs as evolutionary skill generators to synthesize interpretable traffic signal control strategies balancing effectiveness and explainability.
Introduces Tree Decision Diagrams generalizing OBDD for Boolean function representation with improved succinctness and tractable operations like model counting and conditioning.
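The Tree Decision Diagram construction itself is not detailed in this summary; as background, here is standard model counting on an ordered BDD, the kind of operation the summary says the generalization keeps tractable. The node encoding is illustrative: internal nodes are `(var_index, low_child, high_child)` tuples and leaves are the Python booleans.

```python
N_VARS = 3

# OBDD for x0 XOR x1 over variables x0, x1, x2 (x2 unused).
node_x1_pos = (1, True, False)   # reached when x0 = 1: true iff x1 = 0
node_x1_neg = (1, False, True)   # reached when x0 = 0: true iff x1 = 1
root = (0, node_x1_neg, node_x1_pos)

def count_models(node, level=0):
    """Count satisfying assignments over variables level..N_VARS-1."""
    if node is True:
        return 2 ** (N_VARS - level)   # remaining variables are free
    if node is False:
        return 0
    var, low, high = node
    skipped = 2 ** (var - level)       # variables jumped over are free
    return skipped * (count_models(low, var + 1) +
                      count_models(high, var + 1))

# x0 XOR x1 has 2 satisfying assignments over (x0, x1); x2 is free,
# giving 4 models over 3 variables.
print(count_models(root))  # → 4
```

The key idiom is accounting for skipped variable levels with powers of two, which is what makes counting linear in the diagram size rather than exponential in the variable count.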