Isolater - Feed

Ax Theophilus Amaefuna, Hitesh Vaidya, Anshuman Chhabra, Ankur Mali 1d ago

Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization

Curvature-aware MDL framework for layer-wise capacity allocation and pruning decisions in large language model optimization.

Ax Gijs van Seeventer, Saber Salehkaleybar 1d ago

Sign Identifiability of Causal Effects in Stationary Stochastic Dynamical Systems

Studies sign identifiability of causal effects in continuous-time linear stochastic differential equations.

Ax Rujie Wu, Haozhe Zhao, Hai Ci, Yizhou Wang 1d ago

Less Data, Faster Convergence: Goal-Driven Data Optimization for Multimodal Instruction Tuning

Goal-Driven Data Optimization framework for efficient multimodal instruction tuning of vision-language models, reducing training data needs.

Ax Jiacheng Xie, Hua-Chieh Shao, Can Wu, Ricardo Otazo, Jie Deng, Mu-Han Lin, Tsuicheng Chiu, Jacob Buatti, Viktor Iakovenko, You Zhang 1d ago

Spatiotemporal Gaussian representation-based dynamic reconstruction and motion estimation framework for time-resolved volumetric MR imaging (DREME-GSMR)

Spatiotemporal Gaussian representation framework for time-resolved 3D MRI reconstruction in radiotherapy.

Ax Tiancheng Hu, Jin Qin, Zheng Wang, Junhao Hu, Yuzheng Wang, Lei Chen, Yizhou Shan, Mingxing Zhang, Ting Cao, Chunwei Xia, Huimin Cui, Tao Xie, Chenxi Wang 1d ago

Tessera: Unlocking Heterogeneous GPUs through Kernel-Granularity Disaggregation

Tessera system for kernel-granularity GPU disaggregation to optimize heterogeneous GPU clusters for LLMs.

Ax Baishi Li, Ta Yu, Kelvin J. L. Koa, Ke-Wei Huang 1d ago

The Proxy Presumption: From Semantic Embeddings to Valid Social Measures

Analysis of validity challenges when using NLP embeddings as proxies for social science constructs.

Ax Faiq Shamass 1d ago

ZAPS-DA: Zero-Phase Action Policy Smoothing with Decoupled Actor for Continuous Control in Reinforcement Learning

ZAPS-DA reduces action jitter in continuous control RL policies without phase lag during deployment.

Ax David Mullett 1d ago

Benchmarking Recursive-Collapse Warning Claims Under Matched False-Positive Control

Loopzero benchmark for detecting collapse patterns in recursive systems using telemetry analysis.

Ax Jayanta Dey, Shikhar Srivastava, Itamar Lerner, Christopher Kanan, Dhireesha Kudithipudi 1d ago

SHARP: Sleep-based Hierarchical Accelerated Replay for Long Range Non-Stationary Temporal Pattern Recognition

SHARP method for learning long-range temporal patterns in streaming data without full sequence revisiting.

Ax Abdelrahman Sayed Sayed, Pierre-Jean Meyer, Mohamed Ghazel 1d ago

TNODEV: Toolbox for Neural ODE Verification

TNODEV toolbox for formal verification of neural ordinary differential equations in safety-critical systems.

Ax Mingguang Chen, Bo Qu 1d ago

InvestPhilBench: A Multi-Layer Benchmark for Evaluating Large Language Model Procedural Reasoning in Expert Investment Philosophy

InvestPhilBench evaluates LLMs on reconstructing and applying expert investment decision frameworks.

Ax R\'ois\'in Luo, Christian Gagn\'e, Jonas Ngnaw\'e, Ihsan Ullah, Karyn Morrissey 1d ago

A Stochastic--Geometric Theory of Scaling Laws in Grokking

Theoretical analysis of grokking phenomenon using stochastic-geometric framework to explain delayed generalization.

Ax Yiqing Wang, Yixin Kang, Luyun Lin, Siqi Mao 1d ago

Governing Generative AI Across Financial Institutions: An SR 26-2-Compatible Framework for Generative AI Risk Control

Framework for governing generative AI risk in financial institutions, extending banking regulatory standards.

Ax Alexis Kafantaris 1d ago

LLM for the development of FCM

Local LLM used to develop fuzzy cognitive maps by extracting quantitative relationships from textual data.

Ax Zeyuan Ding, Wenhai Liu, Yang Xu, Jiayu Hu, Yinda Chen, Yi Zhang, Yong Dai, Jian Tang, Xiaozhu Ju 1d ago

Pelican-VLA 0.5: Attending Before Acting Benefits Generalization

Pelican-VLA 0.5 is a unified vision-language-action model for robotic manipulation without task-specific fine-tuning.

Ax Xufeng Zhao, Fuzhi Yang, Jianhui Chen, Li Gao, Zhang Meng, Jie Gao, Yao Zheng, Congyang Zhao, Tianxiong Lv, Menglin Yang, Minqi Gu, Yaru Zhao, Wenyu Liu, Honglin Han, Shihui Su, Zixiao Tang, Liu Liu, Mu Xu, Yang Cai, Wenbin Tang 1d ago

Behavior Foundations for Quadruped Robots: ABot-C0 Technical Report

Research on motion control for quadruped robots using motion-capture data and humanoid control techniques.

HN brainiak_q 1d ago

Brainiak: A CPU-Only Topological AI Core (No Transformer) Governs LLM Runtime

Title only. Proposes CPU-only topological AI core alternative to transformer-based LLM runtime.

HN newman50ott 1d ago

New Kernel Release

Kernel release announcement.

HN lukekim 1d ago

Show HN: Spice 2.0 – Real-Time Analytical Query on Operational Data, Without ETL

Open-source real-time analytics engine for operational data with LLM inference. Built on Apache DataFusion. v2.0 adds CDC replication.

HN saba-ch 1d ago

How to build a GitHub code review agent

Tutorial on building GitHub code review agent.

HN byt3h3ad 1d ago

LLMs Corrupt Your Documents When You Delegate

Opinion piece on LLM document handling risks. Limited technical analysis.

HN vibhas 1d ago

Show HN: A prompt bar that lets my website edit itself (self-hosted Claude Code)

Self-hosted prompt bar enabling website self-editing using Claude Code.

HN Markoff 1d ago

China warns of 'security backdoor' in Anthropic AI coding tool

Geopolitical claim about Anthropic tool security. Not technical analysis.

HN nsokin 1d ago

Show HN: Replen – maps your repos to a knowledge graph to match open-source

Tool mapping repositories to knowledge graphs for open-source matching.

HN kristianp 1d ago

GPU Quicklist – AI PC/Mac

Hardware specs list for AI computing. Minimal depth.

HN chicagobuss 1d ago

Tracker – a lightweight self-hosted document system for agents in Go

Tracker: Lightweight self-hosted document store for coding agents, written in Go with Postgres index and MCP server support.

HN NiloCK 1d ago

Show HN: Learn to Read with SRS++

Early literacy app using spaced repetition for teaching children to read.

HN theanonymousone 1d ago

GPT-5.6 Sol (max) Benchmark Results

Title only. Appears to be benchmark results, insufficient content.

HN yr_animesh 1d ago

Show HN: Onboard-CLI–a fast developer tool built in Go uses AST and LLM

Snippet about developer tool combining Go AST with LLM. Insufficient content for evaluation.

HN gurjeet 1d ago

Agents' Last Exam: AI Agent Benchmark for Real-World Professional Workflows

Title only. Benchmark for evaluating AI agents on real-world professional workflows.

HN cheeseblubber 1d ago

Show HN: Finterm.ai Finance CLI for Claude Code and Codex

Finterm.ai CLI tool enabling coding agents and LLMs direct access to financial data: stock prices, options, SEC filings for trading strategies.

HN _superposition_ 1d ago

Oh My Pi

Oh My Pi: Open-source coding agent with 40+ providers, 32 built-in tools, 55k lines of Rust core, available for macOS/Linux/Windows.

HN tunetank 1d ago

MCP for Royalty-Free Music and Sound Effects

MCP server exposing Tunetank royalty-free music catalog to AI assistants like Claude and ChatGPT for audio/video content.

HN gsdatta 1d ago

Dev productivity metrics suck. Ops reviews are key for AI-accelerated eng orgs

DRIVE framework for measuring engineering org health in AI era across delivery, reliability, initiatives, vigilance, efficiency metrics.

HN Tomte 1d ago

Rage-Inducing Problems in Tech

Title only. Vague tech complaint article.

HN notthemessiah 1d ago

CEO reveals how he used AI 2 build 1 person company þats $1.3B in debt- Þe Onion [video]

The Onion satire video. Not substantive content.

HN doener 1d ago

95% of the announced Nvidia Grace Blackwell GPU has yet to be deployed

Note that 95% of announced Nvidia Grace Blackwell GPUs remain undeployed.

HN janpio 1d ago

LearnChess – Master Chess Through Understanding

LearnChess platform uses AI coach to teach chess through interactive lessons, puzzles, and plain-English explanations integrated with engine analysis.

HN Jimmc414 1d ago

EnclaveX: End-to-End Confidential AI with CPU/GPU Tees

Research on confidential AI execution using CPU/GPU trusted execution environments. Security-focused approach.

HN ashwinpp 1d ago

FrontierFinance: The largest open benchmark for investor workflows

Open benchmark dataset for investor workflows and financial professional tasks. Largest dataset of its kind.

HN ericc59 1d ago

Show HN: Pylon Sync, an agent-first full-stack realtime framework

Framework for building full-stack apps with agents. Simplifies deployment from hobby to production. Agent-first architecture.

HN gok 1d ago

GPT-5.6 – ARC-AGI Results

GPT-5.6 Sol model achieves 13.33% on ARC-AGI public benchmark, first to win game with multi-step reasoning.

HN zof3 1d ago

GPT-5.6 System Card [pdf]

PDF system card for GPT-5.6 model.

HN 01x02111 1d ago

I got tired of seeing mistake fares on Reddit a day late

Open-source flight fare monitor detecting price errors before airlines correct them; 67-line detector.

HN reveriedev 1d ago

Show HN: CodeAlmanac – Self-updating wiki for your coding agent (local, Apache)

Open-source CLI tool that auto-generates and updates markdown wikis from coding agent conversations. Self-hosted, integrates with existing agents.

HN akarshhegde18 1d ago

GPT-5.6 Is Live

Stub announcement of GPT-5.6 release.

HN suis_siva 1d ago

Show HN: Hyper – distributed Firecracker microVM orchestrator written in Elixir

Open-source distributed Firecracker microVM orchestrator written in Elixir with gRPC support.

HN gatinsama 1d ago

Programming versus Writing with LLMs. Different Beasts

Brief comparison of LLM capabilities for programming vs writing tasks. Limited technical depth.

HN tkpratardan 1d ago

How I use coding agents for reproducible DS/ML workflows

Guide on using coding agents for reproducible data science and machine learning workflows.

HN snowhy 1d ago

Ask HN: How do you share agent context across a team?

HN discussion on sharing agent context across team members running local Claude Code agents.