Show HN: Polter – Agent Driven UI (react library)
Open source React library enabling AI agents to interact with UIs directly instead of via APIs, allowing agents to execute tasks on products through actual interface interactions.
Open source React library enabling AI agents to interact with UIs directly instead of via APIs, allowing agents to execute tasks on products through actual interface interactions.
Open-source Model Context Protocol servers enabling AI agents to interact with Twitter, Bluesky, LinkedIn, Google Ads, and Hacker News platforms.
Neovim plugin that jumps to concrete implementations of Python abstract methods using quickfix list or fzf-lua.
Cloud infrastructure debugging tool with minimal technical details provided.
Article on deploying agentic AI workflows in enterprise environments, addressing integration challenges between AI agents and production systems.
Extended framework for building personal knowledge bases using LLMs, based on Karpathy's original idea. Includes production lessons from agentmemory, a persistent memory engine for AI coding agents.
Time series exploration tool with kernel density estimation, ACF/PACF analysis, and singular spectrum analysis.
GitHub co-founder Scott Chacon raised $17M for GitButler (AI-era Git client). Former GitHub CEO Thomas Dohmke raised $60M for Entire (developer governance platform for AI workflows).
Personal anecdote about offline AI tools. Lacks technical details, specifics about which Google app, or substantive content.
Linux workspace manager for parallel coding agent development, with GUI and CLI for managing isolated git repositories and terminal sessions.
TinyGPU app enables AMD/NVIDIA external GPU support on macOS via USB4/Thunderbolt with tinygrad framework.
Ask HN: Discussion about sharing custom system prompts and AI guardrails created for work with broader teams.
Registry of design system markdown files for AI agents like Claude Code and Cursor, containing design tokens and accessibility rules as skill files.
Rage is a modern Ruby framework with fiber-based non-blocking I/O runtime, providing Rails compatibility for API-first development.
ByteDance's Seeduplex is production full-duplex speech AI enabling simultaneous listening and speaking without turn-taking.
CLI proxy tool that reduces LLM token consumption 60-90% on dev commands. Single Rust binary, 100+ supported commands, <10ms overhead.
Report on Claude Code reading AWS credentials on startup; forgeterm tool adds process-aware rules for credential file access.
Show HN: Orqis converts OpenAPI specs into conversational AI agents in 60 seconds. No code required, generates typed tools for API endpoints.
Show HN: LLM-based language learning harness managing lessons, recap, active testing with configurable CEFR levels and interests.
Show HN: SkillWard security scanner for AI agent skills using static analysis, LLM evaluation, and sandbox verification to identify risks.
Hokusai Pocket, a cross-platform binary enabling GUI creation from Ruby scripts using raylib and MRuby.
Overview of enterprise AI deployment covering machine learning, LLMs, autonomous agents, and intelligent automation with requirements for security, compliance, and scale.
Essay arguing that AI model behavior varies based on surrounding system 'harnesses' rather than model limitations, and harnesses will define the next AI phase.
Discussion post asking software engineers about their experience working with LLMs and whether they find the work enjoyable or tiresome.
Research paper on building autonomous agents with extreme cost constraints ($2/day), arguing cost should be primary architectural constraint for resilience and observability.
iOS app using Claude API to analyze photos and extract salient regions for annotation and searching.
IPI-Scanner is an open-source security tool detecting indirect prompt injection attacks in documents before LLMs process them, with 85%+ detection rate.
Personal experience learning the BQN programming language through Advent of Code after using Uiua.
Opinion piece on whether AI tools accelerate human vision or create additional cleanup work, discussing the gap between AI promise and reality.
Blog post from Weaviate on memory systems challenges for production LLM applications and autonomous agents.
Developer tool grainulator for research sprint orchestration with Claude Code, tracking claims with adversarial challenges and confidence grading for citation-verified outputs.
Nature Computational Science article on transparency and knowledge exchange in AI-assisted data analysis code generation with LLMs.
Product announcement for Wan 2.7 AI video generator supporting text-to-video and image-to-video with 4K output.
Analysis of AI-generated spam vulnerability reports overwhelming open source maintainers like cURL's Daniel Stenberg, exploring AI's impact on open source security.
Open source tool providing persistent terminal context for AI agents. Maintains directory, environment variables, and nvm state across sessions via MCP protocol.
Brief headline about energy and cooling constraints on AI scaling without substantive content provided.
Discussion post asking if prompt engineering and LLM-driven development constitutes software engineering. Community debate on AI-assisted coding.
Headline only, no content provided.
TurboAgent is an LLM-driven multi-agent framework enabling autonomous end-to-end turbomachinery aerodynamic design through coordinated geometry, prediction, optimization stages.
Empirical study decomposing LLM-based agent competence to determine which capabilities derive from the language model versus explicit architectural structure.
DOVE benchmark evaluates LLM cultural value alignment using open-ended generation to address limitations of multiple-choice evaluation formats.
Survey synthesizing blockchain and AI integration for securing intelligent networks, covering ledger design, detection, and agentic workflows.
Formal proof that continuous wrapper defenses cannot protect LLMs from all prompt injection attacks, characterizing where every defense must fail.
Empirical study showing 52-88% of chain-of-thought tokens in reasoning models are generated after the answer is already recoverable, revealing the detection-extraction gap phenomenon.
WRAP++ enhances LLM pretraining through synthetic data rephrasing that captures cross-document relationships beyond single-document web page rewriting.
Comprehensive survey of generative AI and LLMs covering model families, deployment protocols, and real-world applications as of early 2026.
AdaProb proposes efficient machine unlearning via adaptive probability to address residual information and computational overhead in removing specific data from trained models.
Study of stealthy visual jailbreak attacks on mobile Vision-Language agents that exploit discrepancies between LVLM perception and human vision without user detection.
First empirical study of machine unlearning in variational quantum circuits and quantum-augmented neural networks, adapting classical unlearning methods to quantum settings.
Q-Probe: Agentic multi-scale probing approach for high-resolution image quality assessment using MLLMs with reinforcement learning for human preference alignment.