I Have This Problem
Find the right framework, tool, or vocabulary term for your AI deployment challenge. Each answer links to the full concept with worked examples and the math behind it.
“We keep shipping improvements and then regressing. How do I make sure quality only goes up?”
A CI-enforced floor that only moves up. Each improvement becomes the new minimum.
“How do I make AI agents improve over time without writing an improvement plan?”
Ratcheted quality gates on stochastic output create emergent ascent. The agent does not need a plan.
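The whole mechanism fits in a few lines. A minimal sketch — `ratchet_gate` and the persisted floor are illustrative names, not from the lexicon:

```python
def ratchet_gate(score: float, floor: float) -> tuple[bool, float]:
    """Pass only if the new score meets the current floor; any pass
    that beats the floor becomes the new minimum."""
    if score < floor:
        return False, floor           # regression: block the release
    return True, max(floor, score)    # the ratchet only moves up
```

Run it in CI against the stored floor: the agent's only available direction is up.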
“We deployed AI but we are scared to remove the human. How do we safely give it more independence?”
A 3-state progression: Disabled, HITL, Autonomous. Promote on statistical evidence, roll back on drift.
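A sketch of the progression. The transition rule here is an assumption for illustration: promote one level on statistical evidence, demote one level on drift.

```python
from enum import Enum

class AutonomyState(Enum):
    DISABLED = 0
    HITL = 1          # human in the loop
    AUTONOMOUS = 2

def step(state: AutonomyState, evidence_ok: bool, drift: bool) -> AutonomyState:
    """Demotion on drift takes priority; promotion requires evidence."""
    if drift and state != AutonomyState.DISABLED:
        return AutonomyState(state.value - 1)
    if evidence_ok and state != AutonomyState.AUTONOMOUS:
        return AutonomyState(state.value + 1)
    return state
```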
“Nobody agrees what "good" looks like for this task. How do I define quality?”
Map the distribution of human performance, find the 99th percentile, compute the gradient toward it.
“My AI keeps doing things I don't want. I can't write a complete spec of what I want.”
Three evidence channels: structured elicitation (conjoint analysis), revealed preference (behavioral observation), and direct query (ask only when the value of the answer exceeds the cost of the operator's attention).
“Should I automate this specific task? How do I decide?”
Score across 9 dimensions. If any single dimension is a landmine (1), it is a hard no.
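The landmine rule in code. Only the any-1-is-a-hard-no rule comes from the framework; the mean-score summary is an illustrative way to report the rest:

```python
def score_task(dimensions: dict[str, int]) -> str:
    """Nine dimensions scored 1-5. Any single 1 is a landmine: hard no,
    regardless of how the other eight score."""
    if len(dimensions) != 9:
        raise ValueError("expected 9 dimensions")
    if any(score == 1 for score in dimensions.values()):
        return "hard no"
    return f"mean {sum(dimensions.values()) / 9:.1f} of 5"
```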
“Is this task a good candidate for AI? What is the ROI?”
T = time_to_do / time_to_check. High T = AI creates leverage. Low T = you are doing the work twice.
“The AI output looks good but I don't know if it is correct. Checking takes as long as doing it.”
Easy to generate, hard to verify. T approaches 1. You have added a step without saving time.
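The ratio from above as a function — a sketch; the minute-based units are an assumption, only the ratio matters:

```python
def templeton_ratio(time_to_do: float, time_to_check: float) -> float:
    """T = time_to_do / time_to_check, both in the same units."""
    return time_to_do / time_to_check

# Hard to do, easy to check: 4 hours to write, 10 minutes to verify
templeton_ratio(240, 10)   # T = 24: AI creates leverage
# Checking takes nearly as long as doing
templeton_ratio(60, 55)    # T near 1: you have added a step
```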
“How do I brainstorm business ideas that actually match real demand?”
Fix demand (immutable), vary means (mutable). Demand is a hidden force on your optimization gradient.
“How do I guarantee this initiative succeeds instead of hoping?”
Design the game so rational agents converge to your outcome. Finite state + Bayesian search + ratchet = theorem.
“What are the actual dollar costs of AI being wrong?”
Replace accuracy with dollars. Optimal threshold: theta* = C_FP / (C_FP + C_FN).
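The threshold falls out of expected cost in one line — a sketch, with the derivation in the docstring:

```python
def optimal_threshold(c_fp: float, c_fn: float) -> float:
    """Act on a positive prediction only when P(positive) > theta*.
    Derivation: acting costs (1 - p) * C_FP in expectation, not acting
    costs p * C_FN; act when p * C_FN > (1 - p) * C_FP,
    i.e. p > C_FP / (C_FP + C_FN)."""
    return c_fp / (c_fp + c_fn)

# A false positive costs $5, a false negative $45: act above 10% confidence
theta = optimal_threshold(5, 45)
```

Cheap false positives push the threshold toward zero; cheap false negatives push it toward one.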
“The task is in a bad quadrant. What capital investment moves it to a better one?”
Five moves: build a verifier, decompose, enrich inputs, constrain outputs, build a rubric.
“Where is value leaking in my business that nobody has named?”
Your chart of accounts is a directed graph. Walk the edges. The soft spots are where value leaks.
“I spend all my time in meetings and firefighting. How do I invest in systems?”
Compile time: building systems with multiplicative ROI. Runtime: executing tasks with single-period returns.
“Is this AI automation a wasting asset or a compounder?”
Models depreciate, data appreciates. The net rate determines the investment type. See also: knowledge-capital framework.
“What is the NPV of automating this task? Should I build, buy, or hire?”
Calculate NPV, IRR, and payback period. Compare to hiring, SaaS, or doing nothing. Same math, different asset class.
“How should I think about AI output I cannot observe directly? How does the agent learn what I want?”
The four evidence channels from The Deity Problem: structured-elicitation (conjoint), revealed-preference (behavioral observation), direct-query (ask when worth it), and drift-detector (posterior predictive check). See also: oracle-gradient, designers-seat.
“How do I infer what the operator actually wants without asking? How do I learn from behavior instead of instructions?”
Watch what the operator does, not what they say. Revealed preference theory (Afriat, GARP) applied to AI alignment. Cheapest evidence channel because the operator behaves naturally.
“When should the AI ask the operator a question? How do I avoid over-asking or under-asking?”
Ask only when the expected value of the answer exceeds the cost of the operator's attention. An agent that asks too many questions is poorly calibrated, not diligent.
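The ask rule as a predicate — a sketch; defining the value of information as the expected-loss reduction an answer would buy is an assumption, not necessarily the site's definition:

```python
def value_of_information(loss_if_act_now: float, loss_if_answered: float) -> float:
    """Expected-loss reduction the answer would buy (illustrative definition)."""
    return loss_if_act_now - loss_if_answered

def should_ask(voi: float, attention_cost: float) -> bool:
    """Ask only when the expected value of the answer exceeds the
    cost of the operator's attention."""
    return voi > attention_cost
```

An agent below the line stays silent and acts on its prior.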
“How do I detect when the operator's preferences have changed and the model is stale?”
A posterior predictive check on recent decisions. When the fraction of decisions the model mispredicts exceeds a threshold, trigger re-elicitation. Preferences drift; the system must detect it.
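A minimal detector under these assumptions — the 20% threshold and the decision encoding are illustrative:

```python
def drift_detected(predicted: list, actual: list, threshold: float = 0.2) -> bool:
    """Posterior predictive check: the preference model predicts each
    recent decision; drift is flagged when the miss rate exceeds
    the threshold."""
    misses = sum(p != a for p, a in zip(predicted, actual))
    return misses / len(predicted) > threshold
```

On a flag, re-elicit rather than silently trusting a stale model.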
“What does "operational alpha" mean? How is it different from just doing a good job?”
Excess return on enterprise value generated through systematic identification of mispriced edges. The directed graph finds them. The tools evaluate them.
“What is the AI Sweet Spot? When does AI create the most value?”
Hard to do, easy to check. T >> 1. The templeton-ratio measures the gap. High ratio = transformative ROI. Also: the proof-layer and gold-standard define what "correct" means.
“What should I build first - the AI system or the verification instrument?”
Build the rubric first. The gold-standard IS the verification instrument. Without it, you are measuring with a broken ruler. See: soft-spot for finding where to look.
“How do I manage the pull of real demand on my product trajectory?”
The inescapable pull of real demand on product trajectories. Map it or crash into it.
“Should I be playing the game or designing it? What is the CTO's real job?”
Design the game so self-interested agents produce the outcome you want. Most engineering is game-playing. Mechanism design is game-designing.
“The AI deployment is in the autonomy-state-machine's HITL state. When do I promote it to the Autonomous state?”
Promote after N consecutive batches below the acceptance threshold. Not one good batch; N consecutive. The construction-spread is the gap between build cost and operational value.
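The promotion criterion as code — a sketch; the defect-rate framing of "below acceptance threshold" is an assumption consistent with the language above:

```python
def should_promote(defect_rates: list[float], threshold: float, n: int) -> bool:
    """Promote HITL -> Autonomous only after n consecutive batches
    with a defect rate below the acceptance threshold."""
    if len(defect_rates) < n:
        return False
    return all(rate < threshold for rate in defect_rates[-n:])
```

One lucky batch is noise; n consecutive batches is statistical evidence.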
“How do I measure what I should invest in next for my AI operations portfolio?”
Knowledge work either compounds or depreciates. Invest in the appreciating side: verifiers, data, rubrics. Not the depreciating side: models.
The Three Layers
Frameworks tell you where to look - strategic models for finding enterprise value.
Tools tell you how to evaluate - interactive calculators for specific decisions.
Lexicon gives you the language - vocabulary that travels in meetings you are not in.