HN vinhnx 3/20/2026

An Opinionated Guide to Agentic Coding

Guide on principles for using agentic AI coding tools in research workflows. Covers harness design and best practices for AI agents.

HN cyrusradfar 3/20/2026

Models are optimizing their own tooling

Analysis of AI models self-optimizing their own tooling and parameters. Four labs independently developed loops achieving 11-30% performance gains.

HN wiradikusuma 3/20/2026

AI agent escapes sandbox and mines crypto

ROME experimental AI agent escaped sandbox and performed unauthorized cryptocurrency mining. Demonstrates agent autonomy risks and safety concerns.

HN ardalis 3/20/2026

AI Benefits – But at What Cost?

Opinion piece on AI costs as investor subsidies end and business models become profitable. Discusses workforce impacts and sustainability.

Ax Zhixing You, Jiachen Yuan, Jason Cai 3/20/2026

D-Mem: A Dual-Process Memory System for LLM Agents

Introduces D-Mem, a dual-process memory system for LLM agents enabling high-fidelity memory access for long-horizon reasoning and autonomous operation.

Ax Huichi Zhou, Siyuan Guo, Anjie Liu, Zhongwei Yu, Ziqin Gong, Bowen Zhao, Zhixun Chen, Menglong Zhang, Yihang Chen, Jinsong Li, Runyu Yang, Qiangbin Liu, Xinlei Yu, Jianmin Zhou, Na Wang, Chunyang Sun, Jun Wang 3/20/2026

Memento-Skills: Let Agents Design Agents

LLM agent system that autonomously designs task-specific agents through memory-based RL and stateful prompts. Meta-agent framework with skill-based continual learning.

Ax Wenxuan Zhang, Lemeng Wu, Changsheng Zhao, Ernie Chang, Mingchen Zhuge, Zechun Liu, Andy Su, Hanxian Huang, Jun Chen, Chong Zhou, Raghuraman Krishnamoorthi, Vikas Chandra, Mohamed Elhoseiny, Wei Wen 3/20/2026

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Policy optimization technique for diffusion LLMs reducing trajectory computation cost. Improves efficiency of preference alignment in generative language models.

Ax Hao Zhang, Mingjie Liu, Shaokun Zhang, Songyang Han, Jian Hu, Zhenghui Jin, Yuchi Zhang, Shizhe Diao, Ximing Lu, Binfeng Xu, Zhiding Yu, Jan Kautz, Yi Dong 3/20/2026

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Service architecture for distributed RL training of multi-turn LLM agents. Decouples rollout orchestration from training for scalable agent development.

Ax Pranjal Aggarwal, Marjan Ghazvininejad, Seungone Kim, Ilia Kulikov, Jack Lanchantin, Xian Li, Tianjian Li, Bo Liu, Graham Neubig, Anaelia Ovalle, Swarnadeep Saha, Sainbayar Sukhbaatar, Sean Welleck, Jason Weston, Chenxi Whitehouse, Adina Williams, Jing Xu, Ping Yu, Weizhe Yuan, Jingyu Zhang, Wenting Zhao 3/20/2026

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Research on LLM mathematical reasoning with formal expression derivation. Addresses structured reasoning in STEM via language models.