TaskVector
Status: field-tested at scale across PE portfolio companies. Formal validation pending.
Before automating a task, score it across 9 dimensions. Each dimension measures a structural property that determines whether automation is viable. If any single dimension scores 1, it is a landmine - a hard no regardless of the composite score.
Drag across a row to score that dimension (far left = landmine, far right = strong).
How predictable is the correct output given the input?
Does the system have access to everything it needs?
How often does this task occur?
How easy is it to check whether the output is correct?
If the AI gets it wrong, how bad is the damage?
Will the people affected accept AI doing this?
Could automation introduce or amplify systematic bias?
How much surrounding context is needed?
What is the worst-case outcome of an error?
The Landmine Rule
A task with 5s across the board but a 1 on Consequence Severity - say, “AI decides whether to administer medication” - is not automatable. Period. The composite score is irrelevant when a single dimension represents a structural blocker.
The landmine rule exists because automation failures are not normally distributed. They are fat-tailed. The expected cost of a worst-case error on a landmine dimension dominates all other considerations.
What Comes Next
TaskVector tells you whether to automate. The rest of the toolkit tells you how:
Verification Quadrant - plot the task by calculation vs. verification difficulty. Calculate the Templeton Ratio.
Dollarized Confusion Matrix - price the error costs. Compute the optimal threshold.
The Promotion Protocol - deploy in HITL, gather evidence, promote to autonomous on statistical proof.
When to Use This
Use when
- +Evaluating one specific task for AI deployment, not an entire function
- +You have real examples to score against, not vibes
- +Multiple stakeholders need a shared scoring frame to align
- +You want to surface landmine dimensions before deploying, not after
Skip when
- -The task is obviously too immature (no data, no users, no rubric)
- -You already have deployment experience with this exact task class
- -Budget is the primary binding constraint - go straight to Automation NPV
- -The scoring itself would take longer than running a small pilot
Rosetta Stone
Four circles, four readings of the same object. Each role reads the artifact through its own lens.
Multi-factor scoring for operating instruments. Analogous to factor models in equities: decompose the task into orthogonal-ish dimensions, score each, rank by composite.
A screener for the next hundred tasks on the backlog. Run the vector, sort, work top-down. Keeps the team from optimizing for the loudest project instead of the most automatable.
A checklist with teeth. Nine questions that usually catch the "sounds automatable, isn't" failure mode before it eats a quarter.
A feature vector in R^9 embedding the task. Similarity in this space predicts automation viability. With enough labeled history, a supervised classifier outperforms hand-scoring.
See also
Which bets to make. Capital allocation, M&A due diligence, portfolio construction.
How to execute at scale. Multi-brand portfolio, turnarounds, P&L ownership.
Builds it, ships it, owns it. Solo full-stack, DevOps, production systems.
Proves it, models it, publishes it. Mathematical modeling, Bayesian frameworks.
TaskVector