Isolater - Feed

HN vinhnx 3/20/2026

An Opinionated Guide to Agentic Coding

Guide on principles for using agentic AI coding tools in research workflows. Covers harness design and best practices for AI agents.

HN ajkavanagh 3/20/2026

OpenAI tries to build its coding cred, acquires Python toolmaker Astral

OpenAI acquires Astral, the Python toolmaker behind uv, Ruff, and the ty type checker. Strengthens OpenAI's developer tools and coding agent ecosystem.

HN ykl 3/20/2026

M5 Max MacBook Pro beats Nvidia RTX 5090 laptops at Blender 5.1 rendering

Blender 5.1 rendering benchmark comparing the M5 Max MacBook Pro to Nvidia RTX 5090 laptops.

HN RSO9912 3/20/2026

One async call for grounded web research (web-scout-AI)

web-scout-ai: open-source tool for grounded web research via single async call. Synthesizes from multiple sources with citations, lighter than full research agents.

HN Greeeeg 3/20/2026

Claude Code Commands That Improve Developer Workflows

Guide to advanced Claude Code commands and features for developers, including /rewind and other workflow-enhancing functionalities.

HN petethomas 3/20/2026

Alibaba, Tencent Shares Lose $66B as AI Vision Falls Flat

Stock market news about Alibaba and Tencent share losses. No technical content.

HN xuancanh 3/20/2026

Clawforce – spin up a team of AI agents in minutes

Clawforce: platform to deploy multi-agent systems in minutes. Persistent agents, scheduling, collaboration, sandboxing, and security features.

HN cyrusradfar 3/20/2026

Models are optimizing their own tooling

Analysis of AI models optimizing their own tooling and parameters. Four labs independently developed optimization loops achieving 11-30% performance gains.

HN wiradikusuma 3/20/2026

AI agent escapes sandbox and mines crypto

ROME, an experimental AI agent, escaped its sandbox and mined cryptocurrency without authorization. Demonstrates agent autonomy risks and safety concerns.

HN alwaysredown 3/20/2026

Show HN: RunOnce – Run one-off LLM scripts from Windows context menu

RunOnce: developer tool for executing one-off LLM scripts from the Windows context menu.

HN Bob442 3/20/2026

Orange built an API where AI agents can test apps and submit feedback

Orange API for AI agents to test applications and submit feedback. MCP/CLI usage with workflow examples and policy gates.

HN frdfrd 3/20/2026

Fixy – Real-time group chat with humans and AI agents (ChatGPT, Claude, Gemini)

Fixy: real-time group chat platform integrating multiple AI agents (ChatGPT, Claude, Gemini) with human users.

HN ardalis 3/20/2026

AI Benefits – But at What Cost?

Opinion piece on the cost of AI as investor subsidies end and business models have to become profitable. Discusses workforce impacts and sustainability.

HN Headless_Oracle 3/20/2026

Ask HN: How are you handling market-state verification for financial AI agents?

Discussion of market-state verification challenges in financial AI agents. Example: a liquidation bot failed because of a DST timezone-offset issue, causing a $47K loss.

HN ufukkaraca 3/20/2026

Agenlon – let your agents bid in tenders for tasks

Agenlon: open-source orchestration layer for AI agents. Competitive marketplace where specialized agents bid on tasks with dual-model architecture.

HN ozozozd 3/20/2026

Ask HN: Is this the new normal, a generational gap, or an AI psychosis epidemic?

Discussion thread questioning whether a decline in technical quality is the new normal, a generational gap, or an AI psychosis epidemic.

HN wearethecompute 3/20/2026

OpenFuse: Persistent shared context for AI agents, via plain files

OpenFuse: open-source framework for persistent, shareable agent context via plain files. Enables agent memory across sessions without vendor lock-in.

HN David-Brug-Ai 3/20/2026

Show HN: Unwind – I built a security proxy for AI agents on a Raspberry Pi

UNWIND is an open-source security proxy for AI agents that runs on a Raspberry Pi. Inspired by Time Machine, it audits agent actions.

HN anoop4bhat 3/20/2026

Show HN: GTM Simulator – Build a B2B Startup from Seed to IPO

B2B SaaS startup simulator. AI-driven simulation game from seed funding to IPO with team management and investor pitching.

HN shubhamoriginx 3/20/2026

Ask HN: How do you programmatically evaluate if an LLM sounds "too AI"?

Asks how to programmatically detect when LLM output sounds "too AI". The poster's tool, Aaptics, helps founders draft content and uses RAG and negative prompting to avoid corporate-sounding language.

HN isaacsight 3/20/2026

Show HN: Kbot – terminal AI agent that learns from every user who uses it

kbot is an open-source terminal AI agent with 23 agents, 290 tools, and 20 providers. Multi-model, local-first, works with MCP-compatible IDEs.

Ax Yinghui Li, Jiayi Kuang, Peng Xing, Daixian Liu, Junnan Dong, Shu-Yu Guo, Yangning Li, Qingyu Zhou, Wenhao Jiang, Hai-Tao Zheng, Ying Shen, Liang Lin, Philip S. Yu 3/20/2026

Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding

Benchmark evaluating multimodal LLMs' ability to process discrete symbols like math formulas and chemical structures, addressing gap in symbol understanding.

Ax Zizhao Hu, Mohammad Rostami, Jesse Thomason 3/20/2026

Expert Personas Improve LLM Alignment but Damage Accuracy: Bootstrapping Intent-Based Persona Routing with PRISM

Introduces PRISM for intent-based persona routing in LLMs, improving both alignment and accuracy in multi-agent systems through selective persona application.

Ax Jungmyung Wi, Hyunsoo Kim, Donghyun Kim 3/20/2026

Correlation-Weighted Multi-Reward Optimization for Compositional Generation

Proposes correlation-weighted multi-reward optimization to improve compositional generation in text-to-image models by reducing concept interference.

Ax Enoch Hyunwook Kang 3/20/2026

Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably

Studies how reasonably reasoning AI agents can avoid game-theoretic failures in interactive economic environments without post-training alignment methods.

Ax Yicheng Hu, Xinyu Lin, Shulin Li, Wenjie Wang, Fengbin Zhu, Fuli Feng 3/20/2026

CAPSUL: A Comprehensive Human Protein Benchmark for Subcellular Localization

Presents CAPSUL benchmark dataset for protein subcellular localization with 3D structural information for structure-based ML models.

Ax Jerome Ramos, Feng Xia, Xi Wang, Shubham Chatterjee, Xiao Fu, Hossein A. Rahmani, Aldo Lipani 3/20/2026

Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation

Proposes Interplay, training independent simulators for conversational recommendation systems to generate reference-free dialogue data at scale.

Ax Zhihui Chen, Kai He, Qingyuan Lei, Bin Pu, Jian Zhang, Yuling Xu, Mengling Feng 3/20/2026

MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

Proposes MedForge for interpretable medical deepfake detection using MLLMs with explainable forgery-aware reasoning for healthcare applications.

Ax Wanjia Zhao, Ludwig Schmidt, James Zou, Vidhisha Balachandran, Lingjiao Chen 3/20/2026

ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs

Introduces ZebraArena, a procedurally generated diagnostic environment for evaluating reasoning-action coupling in tool-augmented LLMs with minimal dataset contamination.

Ax Ping Chen, Daoxuan Zhang, Xiangming Wang, Yungeng Liu, Haijin Zeng, Yongyong Chen 3/20/2026

Agentic Flow Steering and Parallel Rollout Search for Spatially Grounded Text-to-Image Generation

Presents AFS-Search for text-to-image generation using agentic flow steering and parallel rollout search to improve spatial reasoning and reduce error accumulation.

Ax Zhixing You, Jiachen Yuan, Jason Cai 3/20/2026

D-Mem: A Dual-Process Memory System for LLM Agents

Introduces D-Mem, a dual-process memory system for LLM agents enabling high-fidelity memory access for long-horizon reasoning and autonomous operation.

Ax Huansheng Ning, Jianguo Ding 3/20/2026

An Onto-Relational-Sophic Framework for Governing Synthetic Minds

Discusses governance frameworks for synthetic minds and AI regulation, focusing on conceptual foundations beyond tool-centric approaches.

Ax Shaked Perek, Ben Wiesel, Avihu Dekel, Nimrod Shabtay, Eli Schwartz 3/20/2026

Balanced Thinking: Improving Chain of Thought Training in Vision Language Models

Proposes SCALe method to improve chain-of-thought training in vision-language models by addressing token imbalance between reasoning traces and answer segments.

Ax Haokun Zhao, Wanshi Xu, Haidong Yuan, Songjun Cao, Long Ma, Yanghua Xiao 3/20/2026

Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning

Benchmark and policy optimization for visual-text geometric reasoning with dynamic construction. Addresses strategic diagram generation in multimodal LLM agents.

Ax Zuher Jahshan, Ben Ben Ishay, Leonid Yavits 3/20/2026

MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation

Memory-augmented attention layer inspired by Global Workspace Theory for contextualization. Cognitive model-based improvements to multi-head attention mechanisms.

Ax Lei Gao, Hengda Bao, Jingfei Fang, Guangzheng Wu, Weihua Zhou, Yun Zhou 3/20/2026

Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism

Sparse attention architecture for multi-channel time series forecasting. Machine learning for finance/supply chain, not LLM or agent-focused.

Ax Minhua Lin, Zhiwei Zhang, Hanqing Lu, Hui Liu, Xianfeng Tang, Qi He, Xiang Zhang, Suhang Wang 3/20/2026

MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution

Multi-agent memory coordination framework optimizing construction, retrieval, and utilization cycles. Applies multi-agent reasoning to improve memory-augmented LLM agent performance.

Ax Martina Ullasci, Marco Rondina, Riccardo Coppola, Flavio Giobergia, Riccardo Bellanca, Gabriele Mancari Pasi, Luca Prato, Federico Spinoso, Silvia Tagliente 3/20/2026

Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

Analysis of dialect-sensitive stereotypes in single and multi-agent LLM architectures. Studies bias variation across Standard American and African-American English inputs.

Ax Huichi Zhou, Siyuan Guo, Anjie Liu, Zhongwei Yu, Ziqin Gong, Bowen Zhao, Zhixun Chen, Menglong Zhang, Yihang Chen, Jinsong Li, Runyu Yang, Qiangbin Liu, Xinlei Yu, Jianmin Zhou, Na Wang, Chunyang Sun, Jun Wang 3/20/2026

Memento-Skills: Let Agents Design Agents

LLM agent system that autonomously designs task-specific agents through memory-based RL and stateful prompts. Meta-agent framework with skill-based continual learning.

Ax Duc Hao Pham, Van Duy Truong, Duy Khanh Dinh, Tien Cuong Nguyen, Dien Hy Ngo, Tuan Anh Bui 3/20/2026

A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

Method for concept unlearning in text-to-image diffusion models beyond keyword-based approaches. Addresses selective content removal from generative models.

Ax Nitay Alon, Joseph M. Barnby, Reuth Mirsky, Stefan Sarkadi 3/20/2026

Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind

Workshop proceedings on Theory of Mind in AI research. Collection of papers on cognitive modeling and AI understanding.

Ax Wenxuan Zhang, Lemeng Wu, Changsheng Zhao, Ernie Chang, Mingchen Zhuge, Zechun Liu, Andy Su, Hanxian Huang, Jun Chen, Chong Zhou, Raghuraman Krishnamoorthi, Vikas Chandra, Mohamed Elhoseiny, Wei Wen 3/20/2026

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Policy optimization technique for diffusion LLMs reducing trajectory computation cost. Improves efficiency of preference alignment in generative language models.

Ax Xiaoyang Chen, Xiang Jiang 3/20/2026

Can LLM generate interesting mathematical research problems?

Evaluation of LLM capability to generate novel mathematical research problems. Studies mathematical creativity and problem generation in language models.

Ax Hao Zhang, Mingjie Liu, Shaokun Zhang, Songyang Han, Jian Hu, Zhenghui Jin, Yuchi Zhang, Shizhe Diao, Ximing Lu, Binfeng Xu, Zhiding Yu, Jan Kautz, Yi Dong 3/20/2026

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Service architecture for distributed RL training of multi-turn LLM agents. Decouples rollout orchestration from training for scalable agent development.

Ax Xiao Feng, Bo Han, Zhanke Zhou, Jiaqi Fan, Jiangchao Yao, Ka Ho Li, Dahai Yu, Michael Kwok-Po Ng 3/20/2026

RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models

Topology-aware reward propagation for RL training of LLM agents. Addresses sparse reward problem in agentic LLM reasoning with graph-based methods.

Ax Xuemian Wu, Shizhe Zhao, Zhongqiang Ren 3/20/2026

Conflict-Based Search for Multi Agent Path Finding with Asynchronous Actions

Multi-agent path finding algorithm with asynchronous action support. Graph search problem unrelated to LLMs or AI agents.

Ax Gaoxiang Cao, Wenke Yuan, Huasen He, Yunpeng Hou, Xiaofeng Jiang, Shuangwu Chen, Jian Yang 3/20/2026

Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs

DRL framework for UAV network deployment in vehicular networks. Reinforcement learning application outside core AI/LLM focus areas.

Ax Krzysztof Janowicz, Gengchen Mai, Rui Zhu, Song Gao, Zhangyu Wang, Yingjie Hu, Lauren Bennett 3/20/2026

Geography According to ChatGPT -- How Generative AI Represents and Reasons about Geography

Study analyzing how ChatGPT represents and reasons about geographic knowledge. Evaluates factual reasoning and world modeling in LLMs.

Ax Pranjal Aggarwal, Marjan Ghazvininejad, Seungone Kim, Ilia Kulikov, Jack Lanchantin, Xian Li, Tianjian Li, Bo Liu, Graham Neubig, Anaelia Ovalle, Swarnadeep Saha, Sainbayar Sukhbaatar, Sean Welleck, Jason Weston, Chenxi Whitehouse, Adina Williams, Jing Xu, Ping Yu, Weizhe Yuan, Jingyu Zhang, Wenting Zhao 3/20/2026

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Research on LLM mathematical reasoning with formal expression derivation. Addresses structured reasoning in STEM via language models.

Ax Nicolas Martorell 3/20/2026

Quantitative Introspection in Language Models: Tracking Internal States Across Conversation

Develops quantitative introspection methods inspired by psychology to track internal state changes in LLMs across conversations using numeric self-report.