Isolater - Feed

Ax Zachary Bamberger, Till R. Saenger, Gilad Morad, Ofra Amir, Brandon M. Stewart, Amir Feder 4/1/2026

STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts

STATe presents an interpretable inference-time-compute method using structured action templates to improve output diversity and reasoning control in tree-of-thoughts approaches for LLMs.

Ax Ruxiao Duan, Alex Wong 4/1/2026

Evidential Neural Radiance Fields

Neural radiance fields with evidential uncertainty quantification separating aleatoric and epistemic uncertainty.

Ax Wisdom Ikezogwo, Mehmet Saygin Seyfioglu, Ranjay Krishna, Karim Bouyarmane 4/1/2026

When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On

Error enumeration as reward signal for reference-free RL post-training in virtual try-on with multiple valid outputs.

Ax Ernie Chu, Vishal M. Patel 4/1/2026

Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling

Face-to-Face dataset: 70 hours of video of two-person conversations with multi-person tracking for interaction modeling.

Ax Dharshan Kumaran, Arthur Conmy, Federico Barbero, Simon Osindero, Viorica Patraucean, Petar Velickovic 4/1/2026

How do LLMs Compute Verbal Confidence

Study investigating how LLMs compute verbal confidence: timing of computation and relationship to answer quality.

Ax Hui Wen Goh, Jonas Mueller 4/1/2026

Real-Time Trustworthiness Scoring for LLM Structured Outputs and Data Extraction

CONSTRUCT: real-time uncertainty estimator for LLM structured outputs and data extraction with field-level trustworthiness scoring.

Ax Zhi Sun, Wenming Zhang, Yi Wei, Liren Yu, Zhixuan Zhang, Dan Ou, Haihong Tang 4/1/2026

KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao

KARMA: fine-tuning LLMs for e-commerce personalized search via knowledge-action regularization addressing semantic-behavior gaps.

Ax Toluwani Aremu, Daniil Ognev, Samuele Poppi, Nils Lukas 4/1/2026

Robust Safety Monitoring of Language Models via Activation Watermarking

Activation watermarking technique for detecting adaptive adversarial attacks against LLMs during inference monitoring.

Ax Pronob Kumar Barman, Tera L. Reynolds, James Foulds 4/1/2026

Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off

Neural collaborative filtering for health community recommendation under extreme interaction sparsity using intake vectors.

Ax In-Chang Baek, Jiyun Jung, Geum-Hwan Hwang, Sung-Hyun Kim, Kyung-Joong Kim 4/1/2026

Multiverse: Language-Conditioned Multi-Game Level Blending via Shared Representation

Language-conditioned multi-game level generation via shared representation learning across multiple game domains.

Ax Lorcan McLaren, James Cross, Zuzanna Krakowska, Robin Rauner, Martijn Schoonvelde 4/1/2026

Magic Words or Methodical Work? Challenging Conventional Wisdom in LLM-Based Political Text Annotation

Controlled study comparing LLM model choice, size, and prompt styles for political text annotation; challenges best practices.

Ax Julio C. Serrano, Joonas Kevari, Rumy Narayan 4/1/2026

A Multi-Agent Rhizomatic Pipeline for Non-Linear Literature Analysis

Multi-agent pipeline for non-linear literature analysis using rhizomatic approach grounded in process-relational ontology.

Ax Linqian Fan, Peiqin Sun, Tiancheng Wen, Shun Lu, Chengru Song 4/1/2026

$R_\text{dm}$: Re-conceptualizing Distribution Matching as a Reward for Diffusion Distillation

Research on diffusion model distillation using distribution matching as reward with reinforcement learning optimization.

HN allenleee 4/1/2026

The state of AI safety in four fake graphs

Opinion piece on AI safety progress presented through informal graphs and intuitions.

HN miacycle 4/1/2026

Anthropic open sourced Claude Code

Brief mention of Anthropic open-sourcing Claude Code with no technical details provided.

HN gmays 4/1/2026

OpenClaw: The complete guide to building, training and living with your AI agent

Newsletter announcement with promotional offers for various tools and courses.

HN 0x1997 4/1/2026

Burning Tokens Fast

User complaint about rapid token consumption in VS Code extension after update.

HN handfuloflight 4/1/2026

AI Agent Traps

Guide on building, training and deploying AI agents. Limited technical depth in provided excerpt.

HN matt_d 4/1/2026

Rethinking Language Model Scaling Under Transferable Hypersphere Optimization

Research on language model scaling using transferable hypersphere optimization techniques for improved training efficiency.

HN jbarrow 4/1/2026

LFM2.5-350M: No Size Left Behind

LFM2.5-350M model released with 28T token pre-training, optimized for inference on CPUs and GPUs with tool use capabilities.

HN prea 4/1/2026

After 8 years of Gatsby.js, I built my own static site generator

Developer built custom static site generator using AI assistance instead of Gatsby.js.

HN phwbikm 4/1/2026

Crypto Investment Management Claude Skill

Claude Code skill suite for crypto investment management demonstrating multi-agent system patterns.

HN agumza1 4/1/2026

Tumor control from 4 constants – no oncology programmed

Rust/Bevy-based simulation engine for tumor modeling and therapeutic strategy design from first principles.

HN Shriansh05 4/1/2026

AI Native Tool for Product Managers – Zeaota.ai

Announcement of AI-native product management tool.

HN jskopek 4/1/2026

OpenScreen is an open-source alternative to Screen Studio

Open-source screen recording tool providing free alternative to paid Screen Studio for creating product demos.

HN latentlie_747 4/1/2026

DefenseClaw: Secure Your OpenClaw

Enterprise governance layer for OpenClaw agents providing security controls for skills, MCP servers, and code execution.

HN shenli3514 4/1/2026

How Claude Code memory works

Explanation of how Claude Code memory system persists project context across sessions using disk-based file loading.

HN jwilliams 4/1/2026

Early Observations from Interviews with Engineering Teams Adopting AI

Analysis of engineering teams successfully adopting AI coding tools; workflow patterns identified.

HN wontopos 4/1/2026

Show HN: WMB-100K – Open benchmark for AI memory systems at 100K turns

WMB-100K: Enterprise benchmark for AI memory systems with 4.3M tokens, 2,708 questions, 100K turns.

HN doener 4/1/2026

Nvidia AI Ecosystem Expands as Marvell Joins Forces Through NVLink Fusion

Nvidia and Marvell announce strategic partnership for AI infrastructure via NVLink Fusion.

HN viftode4 4/1/2026

Show HN: Live simulation of AI agents scamming each other (and getting caught)

Live simulation showing AI agents scamming each other; demonstrates trust and verification gaps in agent economies.

HN walterbell 4/1/2026

CMU Best Practices for Large Language Models

CMU guide on best practices for integrating LLMs into workflows with expert recommendations.

HN antiviral0075 4/1/2026

The price of intelligence: what legal AI agents cost

Article on variable and hidden costs of AI legal agents vs. traditional flat-fee legal tech.

HN LumiTharMan 4/1/2026

Show HN: Cross Domain Intelligence – The Translation Problem in American R&D

Essay on translation gap between scientific research and practical application in U.S. R&D.

HN novbox 4/1/2026

Cellular Gateways and 5G Failover: Why Every Business Needs a Backup Connection

Cisco Meraki cellular gateways and 5G failover for business internet redundancy.

HN y1n0 4/1/2026

Autoscaling CI for Gitea in Rust

Gitea-ci-autoscaler: Rust service for on-demand provisioning of CI runner nodes for Gitea Actions.

HN smusamashah 4/1/2026

DreamLite: Lightweight On-Device Unified Model for Image Generation and Editing

DreamLite: Compact 0.39B diffusion model for real-time text-to-image generation and editing on-device without cloud.

HN bsgeraci 4/1/2026

Claude Code Interactive Architecture

Overview of Anthropic's Claude CLI architecture showing system layers and prompt execution flow.

HN jiruitao 4/1/2026

Turn any idea into a printable coloring page with AI

Coloring page generator tool using AI prompts for printable line art.

HN Ishymoto 4/1/2026

Codey-V2 is out – stable release

Codey-V2: Local AI coding agent for Android with daemon mode, RAG, git tools, voice, and self-refinement using three purpose-built models served via llama.cpp.

HN LelouBil 3/31/2026

Reverse engineering GTA San Andreas with autonomous LLM agents [video]

Video demonstration of using autonomous LLM agents to reverse engineer GTA San Andreas game engine.

HN AUDAZIA 3/31/2026

Mission Control for AI Agents – Cyberpunk dashboard, zero deps, one HTML file

Mission Control is a dashboard for monitoring AI agents, built as a single HTML file with zero dependencies. Cyberpunk-themed UI for agent oversight and control.

HN haaz 3/31/2026

Show HN: macOS app to ensure package managers only allow packages 1+ week old

macOS application that ensures package managers install only packages that are at least one week old.

HN sarkory 3/31/2026

Veo 3.1 Lite – Turn Any Idea into AI Videos Instantly

Veo 3.1 Lite announcement for AI video generation. Lacks technical detail or original content.

HN cg505 3/31/2026

GitHub has DMCA'd nearly all forks of the official Claude-code repo

Report of GitHub DMCA takedowns targeting forks of Claude Code repository.

HN MohamedMabrouk 3/31/2026

Software Pipelining for GPU Kernels: Part 1 – The Pipeline Problem

Technical deep-dive into software pipelining and synchronization challenges in GPU kernel optimization, using Flash Attention as case study.

HN danoandco 3/31/2026

Agent skills for desktop automation and video recording

Reusable agent skills for desktop automation and video recording, extracted from Twill workflows for Claude integration.

HN mooreds 3/31/2026

The Oil Crisis Is About to Get Physical

Analysis of global oil supply disruptions through the Strait of Hormuz and impact on futures markets.

HN MayCXC 3/31/2026

Show HN: MCP server that generates macOS tools via Open Scripting Architecture

MCP server enabling Claude to control macOS applications via Open Scripting Architecture as alternative to computer use.

HN rpst 3/31/2026

Show HN: Claude Code rewritten as a bash script

Bash implementation of Claude Code editor functionality using curl and jq, in 1,500 lines versus 380K lines of TypeScript.