Yvo.Book
Back to archive016 entries

An index for the ideas inside the writing.

Each concept points at the exact places where it appears across essays. Tags bucket posts; this page indexes the thinking.

A
Agents

A program that uses an LLM to take actions against a goal — not just generate text.

How to build your first AI agent in an afternoon
B
Bounded task

A task with a clear goal, input sources, allowed tools, rules, expected output, and explicit stop points.

Use ChatGPT agents like a careful operator
C
ChatGPT agent

A ChatGPT mode for longer tasks where ChatGPT can reason, browse, use tools, work with files, and take actions under user supervision.

Use ChatGPT agents like a careful operator
Client hype-cycle triage

Diagnosing whether a stalled project needs technical work, ownership work, or narrative work — and refusing to do all three at once.

Why your AI pilot is stuck (and what to do about it)
E
Evals as contracts

Reframing evaluation as a binding agreement between engineering, business, and compliance — rather than a test suite nobody owns.

Evals as contracts: a better way to know if your agent is working
F
Few-shot examples as spec

Using two or three input/output examples as a task specification, in place of rule-based instructions.

The three prompt patterns I actually use in production
G
Guardrails

Explicit constraints on an agent's behavior; the antidote to 'it should figure it out autonomously'.

How to build your first AI agent in an afternoon
I
Iteration over architecture

Shipping a v1 with hardcoded data beats designing a perfect v∞ that never ships.

How to build your first AI agent in an afternoon
L
LangGraph

A stateful workflow framework for LLM applications built around cyclic graphs.

How to build your first AI agent in an afternoon
P
Pilot purgatory

The state where an AI initiative has technically succeeded in a demo but has no clear path to production use.

Why your AI pilot is stuck (and what to do about it)
Prompt injection

Instructions hidden in websites, emails, docs, or comments that try to make the agent ignore the user's real task.

Use ChatGPT agents like a careful operator
Q
Quality contract

A one-page document naming the specific failure modes, acceptable rates, measurement windows, and automatic actions for an AI system.

Evals as contracts: a better way to know if your agent is workingWhy your AI pilot is stuck (and what to do about it)
R
Role priming with negatives

System prompts that specify what the assistant is forbidden to do, with concrete examples. Does more work than telling it what to do.

The three prompt patterns I actually use in production
S
Structured output

Treating the model's response as a typed API payload rather than free-form text to be heuristically parsed.

The three prompt patterns I actually use in production
T
The three-failures rule

Start any eval design by listing the three failure modes you would personally be fired for. Those are the contract's load-bearing clauses.

Evals as contracts: a better way to know if your agent is working
W
Write action

Anything that changes the outside world: sending, editing, booking, submitting, deleting, sharing, or changing permissions.

Use ChatGPT agents like a careful operator