Show HN: We scored 50k PRs with AI – what we learned about code complexity
GitVelocity: tool that scores 50k+ code PRs using Claude across six complexity dimensions for engineering metrics.
Dendrite is an inference engine with O(1) KV cache forking for tree-structured LLM reasoning, optimized for agentic workloads using tree-of-thought and MCTS algorithms.
Aludel is an LLM evaluation workbench for Phoenix apps that runs prompts across OpenAI, Anthropic, and Ollama simultaneously, comparing output quality, latency, tokens, and cost.
Informal overview of AI safety landscape in early 2026 presented via speculative graphs.
Benchmark of 9 browser agents shopping on Amazon; only 2 successfully selected the correct products. Evaluates agent reliability on e-commerce tasks.
Command injection vulnerability in OpenAI Codex exposed GitHub OAuth tokens via malicious branch names.
Title only, no content provided.
Amazing Sandbox runs third-party tools and AI agents securely in Docker, with pre-configured support for multiple coding agents.
BrowserHawk is an autonomous QA agent skill for Claude Code. It discovers web routes, tests pages, fills forms, and finds bugs using journey-based memory.
Phantom is an open-source AI agent that runs on its own VM and can rewrite its own configuration. Show HN post with limited details provided.
Open-source AI agent platform with visual drag-and-drop workflow builder for orchestrating agent tasks.
Title only, no content provided.
Memoryport adds 500M token persistent memory to LLMs via Arweave storage and LanceDB vector search, compatible with Claude, Cursor, Ollama.
Port of Immich photo backup platform to Android using Termux without Docker or root access.
DeerFlow is an open-source orchestration framework for autonomous agents with sub-agents, memory, sandboxes, and extensible skills. Version 2.0 is a ground-up rewrite.
Mistral AI secures $830M debt financing for data center infrastructure with Nvidia GPUs.
Video title only, no content provided.
SycoFact 4B: Open-source 4B model for detecting sycophantic and delusional AI responses. Achieves 100% rejection on psychosis-bench, runs on consumer GPUs, available on Hugging Face and Ollama.
User discusses experiences running multiple parallel coding sessions with Claude, Opencode, and Pi AI agents.
Skillwave is an autonomous agent orchestrator that decomposes goals into tasks, creates subagents with distinct roles, and executes via an async communication loop until completion.
macOS menu bar app that auto-detects screen sharing and blurs desktop.
Open-source GPS running app with on-device data storage, no backend or subscriptions.
Career advice article about knowledge transfer and team transitions.
Customermates is an open-source, self-hostable CRM with an AI-first design.
Chardet character encoding library rewritten from scratch using Claude. Detailed technical account with conversation transcripts showing AI-assisted development process.
AgentLair is a credential vault for AI agents that prevents environment variable exfiltration in supply chain attacks.
Discussion thread on limitations of current LLM code generation models across different complexity scenarios.
News brief on DeepSeek AI chatbot outage in China.
Explores credential management approaches for AI agents requiring secure access to passwords and authentication systems.
Tool for streaming Twitch/YouTube/Kick content inside Claude Code with live chat integration.
Google DeepMind research study on AI's persuasion and manipulation capabilities.
Video title only with no content or context provided.
Zinc is an LLM inference engine written in Zig that enables 35B model inference on consumer AMD GPUs via Vulkan.
API design principles for LLM consumers. An agent-focused redesign reduced Claude's healthcare API calls from 72 to 8.
Title only, no content. General discussion about critical thinking with LLMs.
Web app for community location sharing using Claude Opus for text editing. Minor AI assistance, not primary focus.
Analysis of AI-generated patches that pass CI tests but fail in production. Reports a 20% breakage rate in vulnerability fixes.
Title only, no content. Tool for detecting LLM-generated text.
Brief mention of AMD Ryzen AI 300 processor inference capabilities without technical details.
Open source email infrastructure for AI agents. Send, receive, search, extract codes. Deploy on Cloudflare. Integrates with Claude Code and AI agent platforms.
Open source library of 450+ modular agent skills for medical research. Works with OpenClaw and Claude, with scientific integrity constraints.
Open source macOS terminal multiplexer for running AI agents in parallel with notifications. Built for agent workflows.
Founder dispute over Stripe account closure for AI image/video generation platform citing payment reversal policy.
Analysis of accelerating AI tool/framework releases tracked via HN, GitHub, npm, PyPI showing ecosystem growth rate.
Philosophy preprint on mathematical methods and AI's role in mathematics formalization and human thought.
Neovim GUI for macOS using Metal GPU rendering with multi-window support and IME for CJK input.
Analysis of how AI agents integrate third-party tools into code generation and product decision workflows.
Marketing content for commercial AI image upscaler/enhancement tool with no technical details.
Website redesign benchmark comparing four AI models for generating website designs from URLs.
Empirical study showing verification steps degraded AI agent performance across 29 tests. Original experimental research.