Simulating societies with LLM agents in TypeScript
WorldSim is a TypeScript multi-agent simulation engine for modeling how communities respond to policy changes using LLM-powered agent reasoning.
WorldSim is a TypeScript multi-agent simulation engine for modeling how communities respond to policy changes using LLM-powered agent reasoning.
Hybrid question-answering system combining LLMs with Wikidata and Prolog for structured reasoning over knowledge graphs instead of pure RAG.
Financial overview of OpenAI and Anthropic ahead of potential IPOs.
Openbrowser is a headless browser built for AI agents using structured semantic state with element IDs instead of screenshots.
Semantik: vector message broker routing by semantic meaning using embeddings and SemQL query language for high-dimensional space.
Discussion of observability vs predictability testing approach for complex distributed systems.
Seedance 2.0: multimodal text-to-video and image-to-video AI generator with unified architecture.
m2a is a CLI tool for connecting to Agent2Agent protocol agents via HTTP+JSON/JSON-RPC with Bubble Tea chat UI and debug traces.
WraithVector governance layer intercepts OpenClaw agent tool calls to enforce policies, block dangerous actions, and generate audit evidence.
Per-user isolated environments infrastructure for AI agents. HN post with limited content.
KarpathyTalk is an open developer community platform with markdown posts, social features, and LLM-accessible APIs for builders and agents.
Small online sellers using AI tools like Alibaba Accio to automate product research and supplier discovery.
Geopolitical analysis of US-China AI dominance race across research labs, academia, and startups.
TriAttention uses trigonometric KV cache compression to reduce memory by 10.7x while maintaining accuracy for long-context LLM agent reasoning.
Discussion of building commercial company on open-source AI agent platform wrapping Claude Code with custom MCP servers for infrastructure/operations.
Production database safety concerns with AI-generated SQL. Models produce plausible but unreliable code lacking performance/locking awareness.
Discussion on production strategies for controlling costs and enforcing limits for LLM/agent systems to prevent unexpected loops and runaway expenses.
PocketTeam: autonomous AI team with COO, QA engineer, security auditor roles using Claude Code. Self-healing, real-time monitoring, daily reports.
Meta plans to open-source its new AI models despite previous setbacks in AI leadership, investing over $600B in AI infrastructure.
Model-as-a-Service platforms abstract operational overhead of self-hosted inference. Discussion of hpc-ai.com built on Colossal-AI vs control tradeoffs.
Video discussion of animal consciousness parallels to AI consciousness. Title only.
Research on using choreographic languages as a formalism for describing multi-agent LLM workflow coordination, framing it as a distributed systems problem.
QitOS is a research-first framework for building reproducible LLM agents with clean module design, benchmarks, and built-in observability.
Research on choreographic languages for managing multi-agent LLM coordination as a distributed systems problem with new programming language design.
Business impact of AI search engines. HubSpot experienced 140M lost visits as search behavior shifts toward AI-powered tools.
Production-grade skills framework for AI coding agents. Encodes workflows, quality gates, and engineering best practices as reusable skills activated via slash commands.
Title only with minimal metadata. No substantive content provided.
AEGIS: Scaling homomorphic encrypted transformer inference via hybrid parallelism on multi-GPU. Privacy-preserving ML optimization, niche application.
Circuit duplication technique for frozen vision transformer inference on marine species classification. ML optimization off-topic domain.
MetaSAEs: Introduces decomposability penalty for training sparse autoencoders with atomic latents. Improves alignment and safety-relevant applications.
Compares RAG vs standard approaches for Agile story point estimation in sprint planning. arXiv study on LLM application.
TRACE: Study on how LLMs allocate trust between conflicting code, documentation, and tests. Evaluates trustworthiness in AI-assisted software engineering.
ExpressEdit: Photoshop plugin using diffusion models for facial expression editing. Computer vision application, off-topic for interests.
RDFace benchmark dataset for rare disease facial phenotype analysis in children with synthetic data generation. ML research but off-topic domain.
Introduces vocabulary dropout technique to solve diversity collapse in co-evolutionary LLM self-play curriculum learning. arXiv paper with novel method.
LLM-powered evolutionary search automatically discovers unsupervised uncertainty quantification methods as Python programs for claim verification.
Fine-tuning approach adapting DeepSeek-OCR-2 for optical chemical structure recognition by formulating task as image-to-text.
Study of brain-LLM alignment during creative divergent thinking tasks, measuring correlation between model performance and human neural activity.
VisionClaw wearable AI agent on Meta Ray-Ban glasses combining egocentric perception with speech-driven task execution via OpenClaw agents.
Sim2Real-AD framework for zero-shot sim-to-real transfer of VLM-guided RL policies from CARLA simulation to physical autonomous vehicles.
Dynamic model analyzing productivity-skill tradeoffs when workers use AI tools, decomposing productivity effects into expertise-dependent and independent channels.
Taxonomy of LLM-based coding agent architectures analyzing scaffolding code patterns including control loops, tool definitions, and context strategies.
Novel salient object detection method based on user needs rather than visual stimuli alone.
LangFIR uses sparse autoencoders on monolingual data to discover language-specific features for steering LLM output language without parallel corpora.
AgenticFlict dataset of merge conflicts from AI coding agent pull requests on GitHub, studying integration challenges in collaborative AI-assisted development.
Video diffusion framework (CRAFT) for generating synthetic bimanual robot manipulation demonstrations with temporal coherence.
Phase-aware suppression method to reduce hallucinations in Vision-Language Models without iterative optimization overhead.
SecPI framework for secure code generation using reasoning LLMs through security reasoning internalization, addressing inference-time vulnerability mitigation.
Actor-critic reinforcement learning approach for multi-robot task allocation with asymmetric arrivals and switching delays.
Neural method for black-box global optimization using iterative refinement from noisy samples, addressing multi-modal function optimization.