Discussion about this post

User's avatar
Nick Burnett's avatar

Really helpful and important post Matt

Basil Puglisi's avatar

Matt, the missing ruler is the whole problem, and it is the exact gap I built an instrument to close. Productivity has had a measure for a century. The effect of the work on the doer's skill has had almost none, so organizations optimize the measured quantity and quietly spend down the unmeasured one. The joint optimization target you framed with Lepine and Kim is the right one.

The instrument is the Human Enhancement Quotient, and its composite is the Augmented Intelligence Score. The design choice that matters for your question is that it scores the process, not the answer. It never asks whether the human reached the correct conclusion. It asks whether the human governed the work, whether they directed the tool, surfaced and challenged conflict, verified against sources, and owned what was released. It cannot tell you who is right. It can tell you who governed. That is what makes it a measure of skill rather than a test of correctness, and what lets it hold across value systems, since good governance looks the same whether the values feeding it are religious or secular.

It reads as four behavioral competencies that map onto what the AI literacy frameworks already call for. Collaborative Intelligence is management itself, directing the tool and arbitrating sources and conflict. Ethical Alignment is responsible use, owning the output and knowing the law and culture that bind the work. Cognitive Agility is fluency. Adaptive Growth carries your concern directly, getting better at managing the machine over time and turning failures into method. That growth dimension is where the developmental claim lives, and I keep it grounded in the learning science you and I both draw on, scaffolding inside a zone of proximal development, metacognitive monitoring, active over passive engagement, and the recent finding that the same tool produces opposite cognitive outcomes depending on how its use is governed.

There is a point in this for your field specifically. Every major AI literacy framework, UNESCO, the OECD AILit work, the US Department of Labor, names governing or managing AI as a competence and then stops at the naming. None measures whether a particular person actually does it, in a real decision, with an accountable record. HEQ is built for exactly that deferred competence, the governance competence at the center of AI literacy that the frameworks agree matters and none of them scores.

It is deployed and running now, scored from real work with an auditable record behind every result, and the next move is the one you would recognize on sight, inter-rater reliability and a human-alone, machine-alone, governed-together comparison that shows the score predicts the lift. That is the kind of study your transcripts-and-interviews method is built to run. You are building a ruler. I have one running from the governance side, and I would value putting the two side by side.

No posts

Ready for more?