Central AI platform teams promise standardization and governance but routinely become bottlenecks, knowledge silos, and sources of the fragmentation they were meant to prevent. Here's what the failure looks like and what federation actually requires.
Adding more training examples is the default response to a fine-tuning plateau — and often the wrong one. How to detect data saturation early, and the four alternatives that actually break through the plateau.
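
A minimal sketch of one way to spot saturation before burning more labeling budget, assuming you log held-out eval scores at each doubling of training data; the function names and the noise threshold are illustrative, not the article's method:

```python
# Track eval-score gains per doubling of training data and flag saturation
# when recent doublings buy less than your eval's noise floor.

def gain_per_doubling(scores_by_size: dict[int, float]) -> list[float]:
    """scores_by_size maps training-set size -> held-out eval score."""
    sizes = sorted(scores_by_size)
    return [scores_by_size[sizes[i]] - scores_by_size[sizes[i - 1]]
            for i in range(1, len(sizes))]

def is_saturated(scores_by_size: dict[int, float], noise_floor: float = 0.005) -> bool:
    """Saturated when the two most recent doublings each gained less than noise."""
    gains = gain_per_doubling(scores_by_size)
    return len(gains) >= 2 and all(g < noise_floor for g in gains[-2:])

# Example: gains collapse after 4k examples -> more data is the wrong lever.
history = {1_000: 0.71, 2_000: 0.76, 4_000: 0.79, 8_000: 0.793, 16_000: 0.794}
print(is_saturated(history))  # True
```
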
Moving fast in AI can kill your product faster than any competitor. A practical decision framework for timing AI feature launches based on the gap vs. layer distinction, moat accumulation, and model improvement velocity.
Early AI differentiators — custom fine-tunes, bespoke retrieval pipelines, hand-crafted prompt chains — calcify into technical debt as base models improve. Here's how to recognize the transition and build a framework for retiring them.
Most agent benchmark papers measure function selection accuracy. The production tradeoffs that actually matter — safety surface, debugging cost, parsing failures, and irreversibility — are rarely compared. Here's the framework engineers need.
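
As a sketch of what such a comparison might look like, here is a weighted scorecard over those four dimensions; the designs, weights, and scores below are placeholders to be replaced with measurements from your own system, not benchmark results:

```python
from dataclasses import dataclass

@dataclass
class ToolCallingDesign:
    name: str
    safety_surface: float      # 0 (tiny) .. 1 (every call can mutate state)
    debugging_cost: float      # 0 (cheap traces) .. 1 (opaque failures)
    parse_failure_rate: float  # observed fraction of malformed calls
    irreversibility: float     # 0 (all calls undoable) .. 1 (none are)

WEIGHTS = {"safety_surface": 0.35, "debugging_cost": 0.20,
           "parse_failure_rate": 0.15, "irreversibility": 0.30}

def risk_score(d: ToolCallingDesign) -> float:
    """Lower is better; a weighted production-risk sum, not selection accuracy."""
    return sum(getattr(d, k) * w for k, w in WEIGHTS.items())

designs = [
    ToolCallingDesign("free-form code execution", 0.9, 0.7, 0.05, 0.8),
    ToolCallingDesign("constrained JSON tool calls", 0.4, 0.3, 0.15, 0.3),
]
for d in sorted(designs, key=risk_score):
    print(f"{d.name}: {risk_score(d):.2f}")
```
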
Persistent agent memory stores accumulate contradictory facts over time — and most systems retrieve them together without warning. Here's what that failure looks like in production and the patterns that prevent it.
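
One prevention pattern, sketched under the assumption that each memory is keyed by the entity and attribute it asserts: resolve-on-read, where the newest value wins and conflicts are surfaced rather than silently co-retrieved. All names here are illustrative:

```python
from dataclasses import dataclass

@dataclass
class Memory:
    key: str        # e.g. "user:42/employer"
    value: str
    timestamp: float

def resolve_on_read(hits: list[Memory]) -> tuple[list[Memory], list[str]]:
    """Collapse retrieval hits to one fact per key; report contradictions."""
    newest: dict[str, Memory] = {}
    conflicts: list[str] = []
    for m in sorted(hits, key=lambda m: m.timestamp):
        if m.key in newest and newest[m.key].value != m.value:
            conflicts.append(f"{m.key}: '{newest[m.key].value}' vs '{m.value}'")
        newest[m.key] = m  # later timestamp wins
    return list(newest.values()), conflicts

hits = [
    Memory("user:42/employer", "Acme Corp", 1_700_000_000),
    Memory("user:42/employer", "Initech", 1_720_000_000),
    Memory("user:42/timezone", "UTC+2", 1_710_000_000),
]
facts, conflicts = resolve_on_read(hits)
print(conflicts)  # ["user:42/employer: 'Acme Corp' vs 'Initech'"]
```
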
Factual hallucination gets the headlines, but there's a more insidious failure mode: AI agents that are directionally plausible but operationally wrong. Wrong API flag, stale method signature, the right concept applied to the wrong instance — and your evals won't catch it.
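
A toy demonstration of the eval gap: a similarity-based check scores an operationally wrong answer as a near-perfect match, while an exact functional check fails it. The CLI and flags below are invented for illustration:

```python
from difflib import SequenceMatcher

expected = "deployctl rollout --strategy canary --dry-run"
produced = "deployctl rollout --strategy canary --dryrun"  # flag doesn't exist

similarity = SequenceMatcher(None, expected, produced).ratio()
print(f"similarity eval: {similarity:.2f}")        # ~0.99 -> "pass"
print(f"functional eval: {expected == produced}")  # False -> actually broken
```
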
Inference is only 20–30% of the true cost of running AI features in production. A full-stack breakdown — from vector DBs and embedding pipelines to human review and prompt engineering labor — and how to build a cost model before launch.
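
A back-of-the-envelope version of such a cost model; every line item and dollar figure below is a placeholder, and the point is the structure: sum the full stack, then compute inference's share.

```python
monthly_costs = {
    "inference (LLM API / GPU)":  4_000,
    "vector DB hosting":          1_500,
    "embedding pipeline":         1_200,
    "logging & observability":    1_000,
    "human review labor":         5_500,
    "prompt engineering labor":   2_800,
}

total = sum(monthly_costs.values())
share = monthly_costs["inference (LLM API / GPU)"] / total
print(f"total: ${total:,}/mo, inference share: {share:.0%}")  # 25%
```
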
Human-in-the-loop review is often the right safety design — until your reviewers become the slowest microservice in the system. A practical guide to queue design, multi-signal routing, and SLOs that keep human oversight meaningful at scale.
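
A sketch of multi-signal routing under assumed signals (model confidence, blast radius, reversibility): auto-approve the safe tail, and keep reviewers on the riskiest items first. Thresholds and field names are illustrative:

```python
import heapq
from dataclasses import dataclass, field

@dataclass(order=True)
class ReviewItem:
    priority: float                      # lower value pops first
    payload: dict = field(compare=False)

def route(confidence: float, blast_radius: float, reversible: bool) -> float | None:
    """Return a queue priority, or None to auto-approve."""
    risk = (1 - confidence) * blast_radius * (0.4 if reversible else 1.0)
    if risk < 0.05:   # safe tail: skip human review entirely
        return None
    return -risk      # riskiest items reach reviewers first

queue: list[ReviewItem] = []
for item in [
    {"id": 1, "confidence": 0.98, "blast_radius": 0.1, "reversible": True},
    {"id": 2, "confidence": 0.60, "blast_radius": 0.9, "reversible": False},
]:
    p = route(item["confidence"], item["blast_radius"], item["reversible"])
    if p is not None:
        heapq.heappush(queue, ReviewItem(p, item))

print([i.payload["id"] for i in queue])  # only item 2 reaches reviewers
```
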
Engineers reach for temperature first when LLM outputs feel wrong. It's almost never the right move. Here's the evidence-backed tuning order that actually moves the needle.
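
The discipline behind any such order can be sketched as a one-change-at-a-time ablation loop; the specific sequence below (prompt first, temperature last) is illustrative rather than the article's exact order, and `run_eval` is a stand-in for a real eval harness:

```python
from typing import Callable

def run_eval(config: dict) -> float:
    """Stand-in for your eval harness; fake scores here for the demo."""
    return (0.70 + 0.05 * (config.get("prompt") == "v2")
                 + 0.04 * config.get("json_mode", False))

TUNING_ORDER: list[tuple[str, Callable[[dict], dict]]] = [
    ("rewrite prompt instructions", lambda c: {**c, "prompt": "v2"}),
    ("add few-shot examples",       lambda c: {**c, "few_shot": 3}),
    ("constrain output format",     lambda c: {**c, "json_mode": True}),
    ("adjust temperature (last)",   lambda c: {**c, "temperature": 0.2}),
]

def tune(base: dict) -> dict:
    """Apply interventions in order; keep only changes that move the eval."""
    best, best_score = base, run_eval(base)
    for name, apply in TUNING_ORDER:
        candidate = apply(best)
        score = run_eval(candidate)
        if score > best_score:
            best, best_score = candidate, score
            print(f"kept: {name} ({score:.3f})")
    return best

# Temperature comes last and, here, never needs touching at all.
tuned = tune({"prompt": "v1", "json_mode": False, "temperature": 0.7})
```
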
A practical guide for engineers who inherit LLM features without documentation — how to reconstruct intent, audit guardrails, and refactor safely.
Only 4.9% of tokens in a typical AI pipeline actually need a large model. A layered lazy evaluation strategy — semantic caching, complexity routing, early exit, and deferred generation — can cut LLM costs by 30–70% without sacrificing quality.
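
A condensed sketch of three of the four layers (semantic caching, complexity routing, early exit; deferred generation omitted for brevity), with placeholder components standing in for a real cache, classifier, and model calls:

```python
cache: dict[str, str] = {}

def semantic_cache_lookup(prompt: str) -> str | None:
    return cache.get(prompt)  # real version: nearest-neighbor over embeddings

def looks_complex(prompt: str) -> bool:
    return len(prompt.split()) > 40  # real version: a trained complexity router

def small_model(prompt: str) -> tuple[str, float]:
    return f"[small] answer to: {prompt}", 0.9  # (answer, confidence)

def large_model(prompt: str) -> str:
    return f"[large] answer to: {prompt}"

def answer(prompt: str) -> str:
    if (hit := semantic_cache_lookup(prompt)) is not None:
        return hit                                  # layer 1: cache
    if not looks_complex(prompt):
        response, confidence = small_model(prompt)  # layer 2: route small
        if confidence >= 0.8:                       # layer 3: early exit
            cache[prompt] = response
            return response
    response = large_model(prompt)                  # last resort: big model
    cache[prompt] = response
    return response

print(answer("what is our refund policy?"))  # served by the small model
```
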