Ax Andrei Polubarov, Lyubaykin Nikita, Alexander Derevyagin, Artyom Grishin, Igor Saprygin, Aleksandr Serkov, Mark Averchenko, Daniil Tikhonov, Maksim Zhdanov, Alexander Nikulin, Ilya Zisman, Albina Klepach, Alexey Zemtsov, Vladislav Kurenkov 27d ago

Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner

arXiv paper on the Decision Pre-Trained Transformer for in-context reinforcement learning, enabling scalable training of generalist agents.

Ax Geert Trooskens (XY.AI Labs, Palo Alto, CA), Aaron Karlsberg (XY.AI Labs, Palo Alto, CA), Anmol Sharma (XY.AI Labs, Palo Alto, CA), Lamara De Brouwer (XY.AI Labs, Palo Alto, CA), Max Van Puyvelde (Stanford University School of Medicine, Stanford, CA), Matthew Young (XY.AI Labs, Palo Alto, CA), John Thickstun (Cornell University, Ithaca, NY), Gil Alterovitz (Brigham and Women's Hospital / Harvard Medical School, Boston, MA), Walter A. De Brouwer (Stanford University School of Medicine, Stanford, CA) 27d ago

Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation

Compiled AI: a paradigm in which LLMs generate executable code at compile time, so that workflow automation then runs deterministically, with no model in the execution path.

Ax Junyu Guo, Shangding Gu, Ming Jin, Costas Spanos, Javad Lavaei 27d ago

LLMs Should Express Uncertainty Explicitly

Study on training LLMs to express uncertainty explicitly, as a control interface for abstention and verification tasks.

Ax Vishaal Kapoor, Mariam Dundua, Sarthak Ahuja, Neda Kordjazi, Evren Yortucboylu, Vaibhavi Padala, Derek Ho, Jennifer Whitted, Rebecca Steinert 27d ago

DQA: Diagnostic Question Answering for IT Support

Diagnostic RAG system for IT support that tracks explicit diagnostic state across turns to accumulate evidence and resolve hypotheses.