Ax Amirmohammad Farzaneh, Osvaldo Simeone 8d ago

Post-Selection Distributional Model Evaluation

Research on formal evaluation methods for machine learning models, focusing on test-time performance-reliability trade-offs when target KPI levels are unknown.

HN mesto1 9d ago

Distributed AI Agents

Research on using distributed AI agents with independent context windows to improve reasoning on complex multi-perspective questions.

HN FredrikMeyer 9d ago

AI Is Like TV

Opinion piece comparing AI adoption to TV, discussing shift to AI-assisted programming and loss of challenging side projects.

HN Olshansky 9d ago

Core views on AI safety (March 2023)

Anthropic's 2023 statement on AI safety risks and impact, discussing concerns about powerful AI development in coming decade.

HN ninjagoo 9d ago

The race to train AI robots

Article on training AI robots to understand physical movement through human trainers in India and global industrial settings.

HN xngbuilds 9d ago

LLM-Wiki

LLM-Wiki: Obsidian-based persistent agent memory system with searchable indexed documents, inspired by Andrej Karpathy's concept.