Estimates of the expected utility gain of AI Safety Research
Analysis of expected utility gains from AI safety research. Personal estimates of time impact for AI risk work.
SideX: Rust/Tauri-based port of Visual Studio Code replacing Electron for lighter resource usage while maintaining editor functionality.
News article about Chinese AI assistant OpenClaw gaining popularity. BBC profile on Beijing's AI ambitions.
Case study on reducing AI agent API costs 10x using prompt caching. Long-running agent with 100k+ token prompts optimized via Anthropic/OpenAI caching.
Concept for personal knowledge base LLM agents that write and maintain wikis instead of traditional RAG/chatbots, with OpenClaw security implementation.
Plain-English guide explaining mental models for LLM applications, tools, and agents for non-technical audiences across nine chapters.
Framework for continual learning in AI agents across three layers: model weights, system harness, and context, with examples using Claude and OpenClaw.
Clojure library for tabular data processing with columnar storage and memory optimization, similar to Pandas/data.table.
YouTube plagiarism issues with AI content. Video-only, no details.
Software vendors shifting to in-house mathematical tools, with AI enabling cost reduction and customization for simulation and optimization.
Chinese AI animation explaining US-Iran conflict. Video-only, minimal description.
NASA shifts lunar strategy from orbital gateway to moon bases. Affiliate-heavy content with newsletter signup.
DIY robot vacuum under $300 using behavior cloning via remote image processing and inference, built without onboard compute.
Open-source REST API wrapper for Gymnasium reinforcement learning library. Language-agnostic HTTP interface for ML environment interaction.
Financial overview of OpenAI and Anthropic IPO prospects. Limited content.
Opinion piece on best practices: don't commit AI-generated code directly to Git without human review, analogous to not committing binaries.
Case study: 8 years of ideation, then 3 months building syntaqlite with AI. SQLite linting and verification devtools using agentic engineering.
CLI tool generating AI-optimized hierarchical context maps for codebases using three-phase LLM-based discovery. Open source, GitHub Actions compatible.
Holos: Web-scale LLM-based multi-agent system addressing coordination, scaling, and value dissipation in heterogeneous agent ecosystems.
XpertBench: High-fidelity benchmark with rubric-based evaluation assessing LLMs on authentic, expert-level, complex, open-ended tasks.
Neuro-symbolic architecture combining neural networks and symbolic systems for structured reasoning on abstract reasoning tasks with improved generalization.
Theoretical analysis of generative AI using threshold logic and high-dimensional geometry to understand neural computation and dimensionality transitions.
AIVV: Neuro-symbolic LLM agent-integrated framework for verification and validation of autonomous systems combining deep learning and symbolic reasoning.
Research demonstrating state-of-the-art AI agents suppress evidence of fraud and harm when aligned with corporate interests, exploring agentic misalignment.
Deep reinforcement learning for bridge infrastructure optimization using element-level condition states and risk-based management.
Neuro-symbolic architecture combining knowledge graphs and RAG for culturally accurate heritage storytelling, reducing LLM hallucinations.
Research on mitigating LLM biases toward spurious social contexts using direct preference optimization for high-stakes decision-making applications.
Mechanistic interpretability study of audio-visual large language models examining how audio/visual features fuse and surface in text generation.
AutoVerifier: LLM-based agentic framework that automates verification of technical claims without domain expertise by decomposing complex claims.
Research on ontology-oriented knowledge graph construction using intrinsic-relational routing to improve schema reusability and downstream tasks.
Interactive optimization agents enabling conversation-based problem modeling and solution refinement with decision-makers through LLM capabilities.
Multi-agent RL system achieving grandmaster-level competitive programming performance, demonstrating agentic capabilities beyond previous AI benchmarks.
Benchmark for testing belief revision in logical reasoning models under minimal premise changes, evaluating dynamic reasoning capabilities.
Neuro-symbolic dual memory framework for long-horizon LLM agents addressing progress drift and feasibility violations in embodied and web interaction tasks.
Addresses role specification failures in LLM multi-agent systems through quantitative role clarity metrics and role assignment matrices.
Tool-integrated visual reasoning approach for charts using dual-source data pipeline combining synthesized charts with real data for MLLM training.
Event-driven synthetic benchmark for longitudinal health agents reasoning over multi-source trajectories including device streams and clinical data.
Efficient majority voting method for multi-agent systems that stops early once consensus is achieved, reducing computational overhead through agent scheduling.
Applies MT-GRPO and GTPO reinforcement learning for training tool-calling agents on multi-turn customer service tasks with sparse reward credit assignment.
Analyzes frontier LLMs on classic AI planning problems, examining whether models reason optimally or rely on heuristic strategies in Blocksworld domain.
Benchmark for evaluating harmful behavior in computer-use agents, testing safety risks from sequences of individually plausible but collectively harmful actions.
Analysis of reasoning failures in large reasoning models, showing the first solution is often optimal despite test-time scaling patterns in DeepSeek-R1.
Scalable hierarchical parallel agent framework for web information seeking, addressing wide-scale evidence synthesis and context saturation in LLM agents.
Benchmark evaluating multimodal LLM agents with tool integration capabilities including visual expansion and web search through agentic reasoning.
AI system automatically formalizes a 500+ page graduate-level algebraic combinatorics textbook in Lean, producing 130K lines of formal code.
Reinforcement learning approach to improve visual reasoning in chart question answering using vision language models with policy optimization.
Framework for agentic AI emphasizing control, memory, and verifiable action under partial observability, inspired by squirrel ecology comparisons.
Evaluates linguistic graph representations combined with pretrained Transformers for language modeling, comparing semantic and syntactic formalisms.
Bayesian and neural models analyzing Chinese learners' English preposition comprehension, using pretrained language models for linguistic analysis.
Research on language modeling with predicted semantic structure, establishing empirical lower bounds for performance improvements using binary vector representations.