SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP
Technique to improve CLIP few-shot classification by addressing modality gap through semantic bridging between image and text embeddings.
Technique to improve CLIP few-shot classification by addressing modality gap through semantic bridging between image and text embeddings.
Benchmark for evaluating LLMs on detecting demographic-targeted social biases across diverse content types and demographics.
Method to improve LLM performance in multi-turn conversations by reinforcing long-term planning and goal tracking through prompting.
Protein structure alignment using optimal transport for identifying and comparing local structural motifs.
Lightweight Disentangled Concept Bottleneck Model addressing bias in input-to-concept mapping for interpretable multimedia recognition.
Framework enabling diffusion models to adapt generation quality based on real-time network bandwidth constraints in cloud-to-device scenarios.
Minimax optimal algorithm for best arm identification under fixed sampling budget with applications to A/B testing.
Convex optimization framework for robust scheduling of aggregated EV battery storage under uncertainty.
Study of task transfer in Vision-Language Models examining how finetuning on one perception task affects performance on others.
Philosophical analysis arguing static value alignment approaches cannot ensure robust AI alignment under capability scaling and distribution shift.
PINNs applied to source inversion in advection-diffusion equations with sparse measurements for scientific computing.
OxEnsemble: Fair classification approach for low-data, imbalanced settings with demographic group constraints.
Method for determining singular value thresholds in DNN weight compression using random matrix theory.
Parameter-efficient transfer learning with neural operators for microseismic phase picking across varying signal conditions.
Study on selecting minimal training data subsets for example-based explanations of language model predictions using influence estimation.
Wireless ML inference via programmable metasurfaces for over-the-air extreme learning machines in MIMO systems.
Accordion-Thinking: Framework enabling LLMs to self-regulate reasoning step granularity through dynamic summarization for efficient inference.
Neuro-symbolic framework using differentiable logic programming to design and optimize quantum circuits.
Complexity analysis of accelerated proximal-gradient methods for ℓ1-regularized PageRank computation.
Theoretical analysis of flow matching generative models' adaptation to data manifold structures.
ZipMap: Stateful 3D reconstruction model achieving linear-time complexity for large image collections via test-time training.
Evaluation of 17 LLMs showing diagnostic reasoning degrades across multi-turn conversations compared to single-turn benchmarks.
HiCI: Hierarchical attention module for long-context language modeling, organizing information from local to global levels.
tBayes-MICE: Bayesian approach to multiple imputation for time-series data with missing values via MCMC sampling.
CodecSight optimizes streaming vision-language model inference by leveraging video codec signals for end-to-end efficiency.
DOVE benchmark for evaluating LLM cultural value alignment using open-ended generation, addressing limitations of multiple-choice formats.
Study investigating saturation points in recommender system performance as training dataset size increases, with reproducible Python implementation.
Theoretical work on Schrödinger bridge problem with mean-field dynamics for multi-agent systems control.
Research: Chain-of-thought models generate 52-88% of tokens after answers are already recoverable, revealing inefficiency in reasoning.
ML research on cadence-aware encoding for next-basket recommendation in retail, modeling repurchase timing patterns.
JSON extension format supporting trailing commas and comments. Minor format specification unrelated to AI.
Brief mention of LLM routers injecting malicious tool calls as security issue. Insufficient detail.
Bittensor governance controversy and token price impact.
Personal experience report on using AI video generation models. Notes Gemini Nano, Veo3, Sora2, and Chinese models. Anecdotal observations.
Analysis of AI agents as SaaS replacement with integrated database, logic, and UI.
Linter tool for AI agent context files and MCP configs. Detects stale references, token waste, hardcoded secrets. Cites research on context bloat reducing agent performance.
Open-source platform for managing AI coding agents as teammates. Agents autonomously handle task assignment, code writing, and progress tracking without prompt copying.
User complaint about Claude signup experience and support quality. Opinion-based critique without technical depth.
Research on video generation models struggling with multi-subject action binding. Introduces ActionParty with per-subject state tokens to improve action accuracy.
Study reports Google's Gemini models generate inaccurate search results 9-15% of the time across 8,652 queries. Evaluates LLM reliability in production systems.
xAI lawsuit against Colorado AI anti-discrimination law. Legal/regulatory news without technical substance.
Facebook Marketplace MCP integration allowing Claude to search and monitor deals via command line. Demonstrates AI agent tool use.
Profile of Black Forest Labs, 70-person German AI image startup competing in image generation. Business/market focus, minimal technical substance.
Discussion question about managing token costs and data exposure in production AI agent systems. Operational concern without detailed analysis.
arXiv framework page for collaborative feature development. No technical content about the claimed compiler-LLM optimization research.
Tend: lightweight infrastructure for managing multiple concurrent AI agents and projects. Handles agent coordination, context switching, and multi-project workflows.
Forum question asking for AI video upscaling tool recommendations. No technical depth or research.
Five LLM-based agents play Texas Hold'em with distinct personalities and reasoning styles.
News report about police misuse of AI for non-consensual deepfakes. No technical or research relevance.
Discussion of LLM application to code patch review. Limited content, but relevant to developer tools and LLM applications.