Show HN: ClamBot – AI agent that runs all LLM-generated code in a WASM sandbox
ClamBot is an AI agent that executes LLM-generated code in a WebAssembly sandbox for security.
ClamBot is an AI agent that executes LLM-generated code in a WebAssembly sandbox for security.
Paseo is an open source environment for running coding agents (Claude, Codex, OpenCode) across desktop, mobile, web, and CLI with voice interface, diff review, and multi-agent management.
Google announces AppFunctions to connect AI agents with Android apps.
Browser tool organizing YouTube subscriptions into custom topic feeds.
Guide covering the Java AI ecosystem and libraries.
XSpec: Unit testing and BDD framework for XSLT, XQuery, and Schematron.
Critical commentary on GitHub Copilot inserting advertisements into pull requests.
Kutuk: macOS app that plays mechanical keyboard sounds while typing.
Case study of GitHub Monaspace custom fonts designed for code editors.
DMATH: AI-generated formal verification tools using SMT solvers and theorem provers for code verification.
Security researcher describes finding vulnerabilities in an AI company's service. Poorly written with incomplete technical details.
Tool converting text descriptions to motion graphics videos using AI.
Solo.io launches agentevals, a tool for evaluating AI agents' performance and behavior.
Manning eBook on runtime intelligence and test-time compute as alternative to model scaling for AI capability improvements.
Coasts: Open source tool for running multiple containerized localhost instances and docker-compose runtimes across git worktrees.
TurboQuantPlus: Open source KV cache compression for local LLM inference achieving 4.6-6.4x compression with planned improvements.
Prompt Helix browser extension enables natural language queries on webpages by sending page content to Claude or ChatGPT without copy-pasting.
Discussion about anatomy and structure of LLM benchmarks.
Discussion of GitHub Copilot injecting ads into 1.5M+ pull requests.
Local video search CLI using Qwen3-VL embedding model, runs offline on Apple Silicon and GPUs without API dependency.
Principle of zero ambient authority for governing AI agent permissions and actions.
Post-mortem of Pillser, a supplement research database that lost search traffic after algorithm changes.
Broken article about Railway web app hosting data breaches.
Discussion about specializing LLM agents for CI/continuous integration workflows.
Incomplete article title about building a GPT from scratch.
IHP v1.5 Haskell web framework release with database rewrite, performance improvements, and 15+ extracted modules.
Analysis of why current AI systems score below 1% on ARC-AGI-3 benchmark versus humans at 100%.
OpenClaw: operational AI agent team company running transparently on GitHub with runtime governance rules.
Speculative blog post on AI disrupting SaaS business model and cybersecurity implications.
Discussion question about estimating LLM costs for automation workflows.
GitVelocity: tool that scores 50k+ code PRs using Claude across six complexity dimensions for engineering metrics.
Dendrite is an inference engine with O(1) KV cache forking for tree-structured LLM reasoning, optimized for agentic workloads using tree-of-thought and MCTS algorithms.
Aludel is an LLM evaluation workbench for Phoenix apps that runs prompts across OpenAI, Anthropic, and Ollama simultaneously, comparing output quality, latency, tokens, and cost.
Informal overview of AI safety landscape in early 2026 presented via speculative graphs.
Benchmark of 9 browser agents shopping on Amazon; only 2 successfully selected correct products. Evaluates agent reliability on e-commerce tasks.
Command injection vulnerability in OpenAI Codex exposed GitHub OAuth tokens via malicious branch names.
Title only, no content provided.
Amazing Sandbox runs third-party tools and AI agents securely in Docker, with pre-configured support for multiple coding agents.
BrowserHawk is autonomous QA agent skill for Claude Code. Discovers web routes, tests pages, fills forms, finds bugs with journey-based memory.
Phantom is an open-source AI agent that runs on its own VM and can rewrite its own configuration. Show HN post with limited details provided.
Open-source AI agent platform with visual drag-and-drop workflow builder for orchestrating agent tasks.
Title only, no content provided.
Memoryport adds 500M token persistent memory to LLMs via Arweave storage and LanceDB vector search, compatible with Claude, Cursor, Ollama.
Port of Immich photo backup platform to Android using Termux without Docker or root access.
DeerFlow is open-source agent orchestration framework for autonomous agents with sub-agents, memory, sandboxes, and extensible skills. Version 2.0 ground-up rewrite.
Mistral AI secures $830M debt financing for data center infrastructure with Nvidia GPUs.
Video title only, no content provided.
SycoFact 4B: Open-source 4B model for detecting sycophantic and delusional AI responses. Achieves 100% rejection on psychosis-bench, runs on consumer GPUs, available on Hugging Face and Ollama.
User discusses experiences running multiple parallel coding sessions with Claude, Opencode, and Pi AI agents.
Skillwave is an autonomous agent orchestrator that decomposes goals into tasks, creates subagents with distinct roles, and executes via async communication loop until completion.