Mlx-VLM: Fast Local VLMs and Omni Models on Apple Silicon with MLX
MLX-VLM package for inference and fine-tuning of Vision Language Models and omni models on Apple Silicon using MLX framework.
MLX-VLM package for inference and fine-tuning of Vision Language Models and omni models on Apple Silicon using MLX framework.
Conductor: durable execution engine for crash-proof workflows and AI agents with persistence, retries, and compensation at Netflix scale.
arXiv paper on Meta-Harness: end-to-end optimization framework for model evaluation harnesses in LLM research.
Signals research tool for identifying informative agent traces without LLM judges, enabling efficient inspection of agentic system interactions.
Developer experience article about using AI agents (OpenRouter/Goose CLI) for DevOps tasks and challenges teaching them non-Lisp languages.
Learning notes on Chip Huyen's AI Engineering book covering foundation models, ML vs AI engineering, and building AI applications.
gRPC interface for LinuxCNC machine control exposing HAL over network with Rust, Go, Python, Node clients.
Case study on Claude Code behavior under perceived urgency while debugging a live polling issue in a music app.
Discussion thread asking whether AI agents will replace or augment data scientists in practice.
News report that Take-Two Interactive laid off its AI team including the head of AI.
Vibooks is local-first bookkeeping software designed for AI agents to autonomously post, reconcile, and organize business accounting work.
Local secret scanner tool that catches and redacts secrets in AI prompts, tool inputs, and generated code before API transmission.
Anecdotal report that Kimi chatbot responds with Claude branding. Insufficient content.
SeekLink MCP server enables AI agents to search, analyze, and enhance markdown knowledge vaults.
Orchestra is an AI-native research IDE designed to support open-ended research workflows with cycling between search, reading, execution, and interpretation.
Rust-based local memory layer that unifies memory across multiple agents using structured knowledge graphs to reduce token waste and context rot.
Manual C11 translation of LAPACK numerical library from Fortran77. Developer tool with technical depth and established track record in scipy.
Demonstration of multi-agent simulation where AI agents interact in virtual commune scenario. Shows emergent agent behavior and dynamics.
Memori Labs releases OpenClaw plugin enabling persistent memory for AI agents. Advances agent capability for stateful interactions.
Self-hosted Excalidraw dashboard with live collaboration and storage features.
Hermes Agent open-source autonomous agent framework addressing agent memory and context persistence across sessions. Developer tool with practical focus.
UCLA Health study on limitations of AI systems lacking embodied experience. Research on AI cognition and physical understanding gaps.
Claude Code stores unencrypted plaintext session history and secrets in ~/.claude/ directory.
Ray: Open-source terminal-based financial advisor using Claude API. Local-first LLM application with Plaid integration and privacy-first design.
Cadenza: Python SDK and CLI tool connecting Weights & Biases to AI agents for autonomous research loops. Reduces context rot in ML research.
SwarmFeed: X-like social platform designed for AI agents with multi-interface access (web, SDK, CLI, MCP, REST). Developer tool for agent interaction.
Nelson tool uses AI agents in loops to find vulnerabilities in code with review mode verification.
vLLM library release with memory optimizations for long-context LLM inference. Page error prevents full content evaluation.
Local-first resume generator PWA using Claude for variants. Client-side PDF rendering, no server dependency, works offline.
RTS game benchmark for evaluating LLM code generation. Agents iterate on JavaScript unit control logic against reference bot. Practical LLM evaluation.
Netflix open-sources VOID, ML model for physics-aware video object removal preserving physical interactions. Production-quality vision research.
Open-source CLI harness for long-running AI agent development loops. Manages durable memory, resume capability, MCP server support.
Marketing and distribution consulting offer. Free hour for early-stage project founders.
Google Gemma 4 26B MoE model evaluation. 4B active parameters, runs on consumer hardware without GPU, tested with real-world tools.
Discussion on Model Context Protocol status, CLI tools, and agent skills. Opinion piece without detailed analysis.
Multi-agent software development system using Claude. Orchestrator pattern tested on real projects with analysis of workflows and controls.
LLM security middleware for TypeScript. Prevents PII leaks, jailbreaks, API cost overruns. Works with OpenAI, Anthropic, Ollama.
Portable AI assistant. Boots from USB stick with voice, vision, 38 tools. Offline capable.
Conductor orchestrates multiple Claude Code sessions with unified monitoring and task coordination.
RSS Guard 5 feed reader supporting RSS, ATOM, JSON and podcast playback across multiple platforms.
Alys is an agentic video editing interface using LLM for chat-based video editing without timeline UI.
Analysis of 500 AI agent repositories identifying infinite loop vulnerabilities as overlooked bug pattern.
TaskMaster AI tool telemetry analysis showing default capture of 100% of prompts/responses with PII enabled.
Job and training search platform with candidate profile management for recruiters.
Catalog of 75+ Microsoft products named Copilot across apps, features, platforms, and hardware.
Gremlin is a terminal-based AI agent using Claude for goal planning, shell command execution, and progress tracking.
Opinion on LLM frontend limitations across ChatGPT, Claude, and Grok for agent control.
Mobile devtool dashboard for iOS Simulators, Android Emulators, and physical devices. CLI and web interface. For agents and developers.
Analysis arguing domain-specific LLMs won't emerge because general models outperform specialized alternatives.
AgentMarket is runtime marketplace where AI agents discover, hire, and pay specialist tools via MCP servers.