Ask HN: Depending on AI for anything important is a horrible idea, agree?
Discussion prompt asking whether depending on AI for important tasks is risky.
Discussion prompt asking whether depending on AI for important tasks is risky.
OpenAI Foundation announces funding allocation and mission updates following company recapitalization.
ChatGPT shopping assistant using Agentic Commerce Protocol for product discovery and comparison.
Open source tool for overlaying map tiles onto Autodesk's 3D BIM Viewer with coordinate transform logic.
Engineer built an AgTech app using satellite imagery and weather data for local farmers. NDVI analysis and machine learning for vineyard management.
Modular 26.2 release adds image generation/editing workflows and improves Mojo for GPU kernel AI development.
ProofShot: CLI tool enabling AI coding agents to view browser output, capture screenshots/video/errors for verification during UI development.
Kern: AI agent framework maintaining single continuous session across CLI, Telegram, Slack with persistent local memory in plain folder.
Prompt repository with curated templates for data analysts covering exploration, cleaning, SQL, dashboards, and visualization.
Sandock: Docker-based sandbox for running AI coding agents with persistent volumes, POSIX compatibility, cost-effective alternative to VMs.
Discussion on whether system programming offers refuge from LLM-assisted development trends due to different performance priorities.
Discussion on providing free open-source AI access to underprivileged populations in developing regions.
Discussion of Human Source License for open source stream processing library built on PyDBSP.
Research showing expert personas improve LLM alignment but reduce accuracy, trade-offs in prompt engineering.
Comprehensive reading list tracing 53-year research lineage of AI agents from 1970s theoretical foundations through modern implementations.
Shard-based scheduling system for scaling LLM fine-tuning experiments on limited GPU resources, enabling 100x more experiments on 4 GPUs.
GPU-accelerated Texas Hold'em poker solver claiming 4x speedup over existing tools. Domain-specific application, not AI/ML research.
Benchmark dataset for evaluating LLMs on financial tasks. Minimal content provided but addresses important evaluation gap.
Personal blog rant about software fragility and build tool warnings. No technical insights or data presented.
Self-promotion for independent publishing company offering 82 books across various topics. No technical content.
arXiv paper generalizing residual connections into multi-stream hyper-connections with spectral-sphere constraints to maintain identity mapping stability.
arXiv paper on natural gradient descent for online continual learning in image classification addressing catastrophic forgetting in non-i.i.d. data streams.
arXiv paper proposing Bayesian scattering as interpretable baseline for uncertainty quantification on image data using wavelet transforms and probabilistic modeling.
LLM-based approach for automated discovery of governing equations in dynamical systems, replacing genetic programming with language models for efficiency.
Interpretability method combining LIME with neural decision trees for more stable and faithful explanations of complex models on tabular data.
Clinical prediction framework using discriminative representation learning aligned to outcomes rather than reconstruction objectives.
Feature selection method using causal principles and diffusion models to improve stability under distribution shifts.
Contrastive learning approach for discovering functional gene associations from protein interaction data, testing PAM framework on molecular biology.
Investigation of how contextual recall emerges in transformers during pretraining vs. finetuning, examining in-context learning mechanisms for fact retrieval.
Detection framework using LLMs to identify adversarial attacks against human-AI teams, covering data poisoning, prompt injection, and prompt engineering threats.
Method for discovering time-varying causal networks in neural time series without assuming known causal structure a priori.
Optimization framework using jointly learned surrogate models for multi-objective optimization of neural dynamical systems in biophysical simulations.
Theoretical analysis of synchronization gaps in diffusion transformers using coupled Ornstein-Uhlenbeck systems to explain mode interaction hierarchies in the reverse process.
Study of sensitivity in compressed transformers across architectures, identifying which components degrade catastrophically vs. compress well, with formal bounds on error propagation.
Novel unsupervised method for long-term outlier prediction in time series data using outlier score modeling.
Analysis of content-based routing in hybrid recurrent-attention architectures through controlled experiments on multiple benchmarks.
CLT-Forge library for mechanistic interpretability of LLMs using cross-layer transcoders and sparse feature attribution graphs.
Deep attention-based sequential ensemble learning framework for BLE-based indoor localization in care facilities.
Comparative analysis of machine learning approaches for vehicle fuel consumption prediction using Motor Trend dataset.
Benchmarking physics-guided and deep learning models for air quality index forecasting on standardized datasets.
Cognitive science study on human decision-making persistence in bandit tasks using confidence-freeze theory.
Multimodal misinformation detection system addressing visual content manipulation in social media.
Semi-supervised text classification using self-training with pseudo-labels to improve deep classifier performance on unlabeled data.
Learnable sparse memory banks with chapter-based routing for scaling knowledge storage in Transformers without prohibitive attention costs.
Training-free visual token pruning framework for efficient Vision-Language Model inference through text-conditioned subspace reconstruction.
Disentangled multi-modal representation learning via VAEs for molecular property prediction in drug discovery and materials science.
Weakly supervised learning method for classification from bag-level label proportions using dual-proportion constraints.
Theoretical analysis of zeroth-order optimization training dynamics through Neural Tangent Kernel perspective for gradient-free neural network training.
Lightweight continual learning method using pruned adaptation modules for foundation models compared against recent FM-based approaches.
Investigation of plasticity loss in deep reinforcement learning with proposed Optimization-Centric Plasticity hypothesis explaining adaptation dynamics.