Frameworks

Role Operating Systems: The Missing Layer Between Personal Productivity and Team Execution

Intelligence is getting cheaper. Coordination is not. Role Operating Systems close the gap between individual productivity and team execution.

Intelligence Got Cheaper. Coordination Didn't.

Stuart Winter-Tear nails it: "...intelligent got cheaper. Coordination did not. People do not want agents. They want outcomes with less friction."

Everyone's building personal operating systems. Daniel Miessler has Telos. Nate Jones has his Chief of Staff/OpenBrain system. Productivity Twitter pushes second brains, zettelkasten, and AI-augmented workflows. The tools keep improving. Claude, GPT, local LLMs, agent frameworks.

Individual capability is through the roof. Coordination overhead hasn't budged.

You optimized for your productivity. But you work on a team. The more productive you got individually, the more the team friction showed. Your brilliant notes stayed in your system. Your insights never reached the people who needed them. You got faster at working alone while the handoff gaps widened.

Here's the math that nobody talks about: every person on the team got 3x faster, and the team got maybe 1.2x faster. The delta is coordination waste.

Role Operating Systems exist to close that gap.

The Composition Problem

I watched a feature die at NeuroBlu because of a handoff. Our PM spent three weeks with pharma clients, understood exactly what they needed for their schizophrenia trial data, wrote a detailed brief. The architect got a 30-minute walkthrough and built what he thought he heard. Six weeks and roughly $40K in engineering time later: technically solid feature, completely wrong problem. The PM's context about why the client needed cohort-level breakdowns (not patient-level) never made it across the gap. I was the PM. The brief was mine.

My NeuroBlu break wasn't unusual. It's the rule. Every handoff is a compression step:

PM to Architect: rich customer context becomes a 2-page brief and 30 minutes. 80% of why-this-matters, gone.
Architect to Engineer: technology trade-offs become diagrams and ticket descriptions. The alternatives considered, gone.
Engineer to QA: edge cases and assumptions become a test environment and stale docs. The mental model that explains why it works, gone. This isn't just a feeling. Jeremy McEntire ran controlled experiments with multi-agent AI systems, no humans involved, and watched performance collapse with each boundary added.

Compression Cascade: bar chart showing tasks correct dropping from 28 to 18 to 9 to 0 as boundaries are added

One agent, no boundaries: 28 out of 28 tasks correct. Add one boundary layer: 18. Add concurrent boundaries: 9. An 11-stage pipeline: 0 out of 28. Zero. No ego, no fatigue, no office politics. Pure architecture.

Information theory has a name for this: the Data Processing Inequality. In any chain X → Y → Z, Z can never have more information about X than Y does. Every boundary is a compression step. Information gets destroyed. Adding more process layers can't fix it.

I wish I'd known this three years ago. I kept scheduling more review meetings, thinking more touchpoints would fix the context loss. They didn't. I was adding overhead to the compression.

The underlying problem: individual workflows produce outputs optimized for the individual, not for the team.

Role Operating Systems: A Different Approach

What if individual workflows were designed to compose?

A Role Operating System flips the productivity question. Instead of "how do I work more effectively?" it asks "how does my work feed the team?"

Each role gets their own OS: agents, context scaffolds, workflows. But here's the twist: each OS is designed with explicit interfaces. What context does this role consume? What does it produce? At which lifecycle stages?

I call this composable individuality. (I made that up. I'm not 100% sure it's the right term but nobody's proposed a better one yet.)

You still work how you work. Your agents assist with your tasks. Your workflows match your thinking style. But the outputs conform to shared contracts that other roles can consume.

Winter-Tear offers a clean test for whether you're actually building something composable: "If you can name the workflow owner, the source of truth, the service level, and the audit trail, you are building an agent specialist. If you cannot, you are probably building a copilot."

Role OSes are how you answer those four questions for every role on your team.

The Lifecycle: Six Stages

Skipping Decide costs more than skipping Discover. By the time you've built the wrong thing, you've burned the team's trust along with the budget. The lifecycle isn't bureaucracy. It's where compounding mistakes get caught.

DPOS uses a six-stage lifecycle:

1. Discover

What happens: Problem identification, market research, user feedback synthesis, opportunity assessment.

Key question: What problem are we solving, and is it worth solving?

Output: Problem brief with validated customer need, market context, and preliminary feasibility signals.

2. Decide

What happens: Architecture selection, scope definition, risk assessment, resource allocation.

Key question: Should we build this, and how?

Output: Shaped pitch with defined approach, appetite (time budget), and go/no-go decision.

3. Build

What happens: Implementation, data pipelines, feature development, integration work.

Key question: Can we deliver this within our appetite?

Output: Working components with documentation and test coverage.

4. Test

What happens: Validation, quality checks, ethical review, user acceptance.

Key question: Does this work correctly and responsibly?

Output: Validated release candidate with quality metrics and risk assessment.

5. Ship

What happens: Deployment, monitoring setup, launch coordination, customer communication.

Key question: Is this ready for production?

Output: Live product with observability and rollback capability.

6. Evaluate

What happens: Usage analysis, feedback synthesis, impact assessment, iteration planning.

Key question: Did we solve the problem, and what's next?

Output: Learnings document with validated outcomes and next cycle priorities.

The Role x Lifecycle Matrix

Different roles engage at different stages. Here's how they compose:

Role × Lifecycle Engagement heatmap. Rows are the six DPOS lifecycle stages (Discover, Decide, Build, Test, Ship, Evaluate). Columns are the six Role OSes (PM, Engineer, Architect, QA, Designer, DevOps). Cells encode engagement intensity: Lead (warm accent), Support (mid), Consult (soft), Not engaged (·).

This matrix defines where each Role OS engages. But engagement isn't enough. We need explicit context flows.

What Makes a Role OS

A Role Operating System has four components. Most teams start with agents. Wrong order. Start with handoffs, end with agents. I'm presenting them in the order most readers want to read them, not the order you should build them. The "Building Your Role OS" section at the end has the right sequence.

1. Agents

AI assistants tuned to the role's work. Not generic chatbots. Specific agents for specific tasks.

Example (Data PM):
- Stakeholder Synthesizer: Processes meeting notes, extracts themes, identifies conflicting requirements
- Market Scanner: Monitors competitor activity, synthesizes industry trends
- Feedback Analyzer: Aggregates customer feedback, identifies patterns across channels
- Pitch Shaper: Helps structure problem/appetite/solution for betting table

Each agent has defined inputs, outputs, and oversight requirements. The PM reviews stakeholder synthesis before it feeds decision-making. The feedback analyzer flags low-confidence patterns for human verification.

But oversight isn't optional decoration. Nate Jones documented a case where an experienced engineer's coding agent deleted 2.5 years of production data. Not a junior mistake. An architectural one: too much write access, no rollback verification, no blast radius limits.

Five constraints every Role OS agent needs:

Defined blast radius before running any write operation
Read-broad, write-narrow access by default
Irreversibility assumed unless explicitly proven otherwise
Staged environments for mutations (staging first, production second)
Explicit confirmation for destructive actions (never implicit)

An agent that can do damage at the speed of AI needs guardrails that work at the speed of AI. Reviewing the output after doesn't help when the data is already gone.

2. Context Scaffolds

The boring component that decides whether the OS works. What the role knows (consumes) and what the role provides (produces).

Context In (Data PM):
- Business strategy and portfolio priorities (from leadership)
- Technical constraints and platform capabilities (from Architect)
- Quality metrics and trust status (from QA/Data Lead)
- Customer feedback and usage patterns (from Product Analytics)

Context Out (Data PM):
- Problem briefs (to all roles)
- Shaped pitches (to Architect, Engineering)
- Prioritization decisions (to all roles)
- Customer context summaries (to Designer, Engineer)

Context scaffolds make dependencies explicit. If the PM needs technical constraints from the Architect, that's a defined interface. Not a Slack message at 4pm on Friday.

3. Workflows

Workflows fail where they're least visible, in the parts you assumed everyone knew. How the role moves through lifecycle stages.

Example (Data PM Discover Workflow):

1. Trigger: New cycle begins OR significant feedback pattern detected
2. Gather: Pull latest customer feedback, market data, stakeholder inputs
3. Synthesize: Run Stakeholder Synthesizer and Feedback Analyzer agents
4. Validate: Review agent outputs, flag uncertainties, verify with sources
5. Frame: Draft problem brief with customer need + market context
6. Share: Publish problem brief to Context Out interfaces
7. Handoff: Schedule Decide stage kickoff with Architect

Workflows define the rhythm. They adapt to circumstances. But they keep key steps from getting skipped and make handoffs explicit instead of assumed.

4. Handoff Contracts

This is where most teams fall apart, and where the biggest gains live.

Jones puts it bluntly: "Most teams are treating agents like interns: throw tasks over the wall, hope the summary is honest." He's talking about AI agents, but the same pattern describes most human handoffs.

The fix is making handoff contracts checkable, not aspirational. Jones draws a critical distinction: constraints can be checked, preferences cannot. A handoff contract full of preferences ("provide good context," "include relevant details") is just a suggestion. A contract with constraints ("problem brief must include at least 3 customer data points," "no blocking technical constraints identified") is enforceable.

Before: "Write a problem brief for the architect."

After: "Write a problem brief. Must include: validated customer problem with at least 3 data points, business priority confirmed by portfolio owner, success criteria that are measurable, known technical constraints documented. No blocking dependencies identified."

Side-by-side comparison: aspirational preferences versus checkable constraints in handoff contracts

Example (PM → Architect handoff at Discover → Decide boundary):

Handoff: Discover → Decide
From: PM OS
To: Architect OS

Required Context:
  - Problem brief with customer validation (min 3 data points)
  - Business priority signal confirmed by portfolio owner
  - Success criteria (measurable, not aspirational)
  - Known constraints (technical, regulatory, timeline)

Quality Gates:
  - [ ] Problem validated with customer evidence
  - [ ] Priority confirmed (not assumed)
  - [ ] No blocking dependencies identified
  - [ ] Ethical considerations documented

Trigger for Next Stage:
  - Architect acknowledges receipt
  - Decide stage scheduled within 48 hours

The contract works because each gate is a constraint, not a preference. You can check whether customer evidence exists. You can't check whether the brief is "good enough."

When to Delegate vs. Coordinate

Not every task within a Role OS needs the same level of oversight. Jones provides three sorting criteria:

Factor	Delegate Autonomously	Coordinate Tightly
Error tolerance	Mistakes are cheap to fix	Correctness is non-negotiable
Tool span	Single tool/environment	Multiple tools, integrations
Independence	Task is self-contained	Pieces shape each other

High error tolerance + single tool + independent = delegate and verify after. Low error tolerance + multiple tools + interdependent = coordinate with contracts.

Most teams default to one mode. The sorting framework lets you be deliberate about which tasks get which treatment. Your PM's market scanner can run autonomously with weekly human review. Your architect's infrastructure changes need tight coordination with handoff contracts.

Deep Dive: The Data PM Operating System

Let me show you what a complete Role OS looks like. I'll use the Data PM role. It's what I know best.

PM OS Overview

Purpose: Drive product strategy, prioritize work, synthesize stakeholder needs, and ensure team focuses on valuable problems.

Lifecycle Engagement:
- Discover: Lead
- Decide: Lead
- Build: Support
- Test: Support
- Ship: Consult
- Evaluate: Lead

PM OS Agents

1. Stakeholder Synthesizer
- Input: Meeting transcripts, Slack threads, email threads, survey responses
- Process: Extract themes, identify conflicts, map to stakeholder priority
- Output: Stakeholder landscape document with requirement matrix
- Oversight: PM reviews before sharing; flags conflicting requirements for human resolution
- Blast radius: Read-only. Cannot modify source data or send communications.

2. Market Scanner
- Input: RSS feeds, competitor announcements, industry publications
- Process: Summarize developments, identify relevance to current products
- Output: Weekly market brief with actionable signals
- Oversight: PM curates sources; marks high-importance signals for team attention
- Delegation mode: Autonomous with weekly review (high error tolerance, single tool, independent)

3. Feedback River
- Input: Support tickets, NPS responses, usage analytics, sales call notes
- Process: Aggregate, categorize, identify patterns, surface emerging themes
- Output: Continuous feedback synthesis with pattern alerts
- Oversight: PM validates patterns before they inform prioritization
- Delegation mode: Autonomous collection, coordinated interpretation (patterns need human judgment)

4. Pitch Shaper
- Input: Problem brief, team input, technical constraints
- Process: Structure into Problem → Appetite → Solution framework
- Output: Shaped pitch for betting table
- Oversight: PM drives; Architect reviews technical feasibility section
- Delegation mode: Coordinated (low error tolerance, multi-input, interdependent)

PM OS Context Scaffolds

The PM consumes context from six sources (leadership strategy quarterly, architect constraints per cycle, QA metrics weekly, analytics daily, sales weekly, support daily) and produces five outputs (problem briefs, shaped pitches, priority decisions, customer context summaries, and cycle learnings).

The key design decision: every input and output has a defined format and cadence. No ad-hoc "can you send me that thing?" The context scaffold turns informal information flow into a defined interface. (Full tables in the Data PM OS deep-dive guide, coming after this series.)

PM OS Handoff Contract

Here's what the PM → Architect handoff looks like as a checkable contract:

Contract: Problem Brief Handoff
From: PM OS → Architect OS
Stage: Discover → Decide

Required:
  - customer_problem: "Validated by evidence, not assumption"
  - evidence: "Min 3 data points (quotes, usage data, market signals)"
  - business_rationale: "Why this matters to portfolio"
  - success_criteria: "Measurable, not aspirational"
  - constraints: "Technical, regulatory, timeline"

Quality Gates:
  - [ ] Problem validated with 3+ customer data points
  - [ ] Priority confirmed by portfolio owner
  - [ ] Success criteria include measurable threshold
  - [ ] Ethical considerations documented

Trigger: PM publishes → Architect acknowledges within 24h → Decide begins within 48h

Every gate is checkable. "Validated with 3+ customer data points" is a constraint you can verify. "Good enough context" is a preference you can argue about forever.

The Org Structure Reality

Winter-Tear names the hidden cost: AI Translation Debt. The unpriced work of repairing meaning as it crosses organizational seams. For years, humans absorbed this cost. They carried context. They made judgment calls. They knew who to ask and when to stop.

Agents don't remove that dependence. They remove the time you used to have to hide it.

Role OSes are the mechanism for paying down Translation Debt explicitly. Instead of hoping experienced people will patch the gaps between roles, you design the interfaces so the gaps don't exist.

But none of this works if you pretend org charts don't matter.

Dylan Anderson puts it directly: "Fitting data into an organisation's structure is proving more complicated than most companies can handle." He identifies five structural challenges: ownership ambiguity, role evolution faster than org charts, executives who don't understand data capabilities, leadership vacuums (only 27% of leading organizations have a CDO), and corporate politics.

That last one is the killer. Anderson: "Corporate politics is the single biggest hindrance to an organisation making progress in its data capability."

Role Operating Systems don't solve politics. But they sidestep the problem. They define interfaces between roles, not between reporting lines. The PM OS doesn't care whether the PM reports to Engineering or Product. It cares about what context flows in and what flows out.

Build for interfaces, not for org charts. I've watched three reorgs at NeuroBlu. The reporting lines changed every time. The handoffs between PM and Architect? Same every time.

Building Your Role OS

You don't need to implement everything at once. Start with:

Step 1: Map Your Lifecycle Engagement

Which stages do you lead? Which do you support? This defines where your OS needs depth.

Step 2: Identify Your Key Handoffs

Where do you receive context from others? Where do you hand off to others? These are your critical interfaces. Pick the one where context loss hurts most.

Step 3: Build One Agent

Pick your biggest pain point. Build an agent that addresses it. Define its blast radius and oversight requirements before you define its capabilities. (I learned this order the hard way. My first agent was a feedback aggregator with write access to our product backlog. It helpfully created 47 duplicate tickets in one afternoon.) Run it for 2-3 cycles. Iterate.

Step 4: Define One Handoff Contract

Pick your most problematic handoff. The one where context always gets lost. Make it checkable. Document what constraints must be met, not what preferences you'd like. Hold both sides accountable.

Step 5: Sort Your Tasks

Use the delegation framework. Which tasks in your Role OS can run autonomously? Which need tight coordination? Be deliberate. Default to coordination for anything with low error tolerance or cross-role dependencies.

Step 6: Expand Gradually

Add agents as pain points emerge. Add handoff contracts as you encounter friction. Let the OS grow organically from real problems, not from a comprehensive design document.

Common Failure Modes

Over-Engineering

Building elaborate agents before proving simple ones work. Your first agent should be embarrassingly simple. Add complexity when you've earned it.

Under-Specifying Handoffs

Defining what context is "expected" without defining checkable constraints. If there's no gate, there's no accountability. Handoffs without gates are just suggestions.

Agents Without Guardrails

Jones again: "Either you can name what would make you say 'not yet,' or you're going to keep discovering 'not yet' after it's already shipped." This applies to agent design. If you haven't defined what "done" looks like in checkable terms, your agent will cheerfully produce work that looks complete and isn't.

Solo Optimization

Building a Role OS that makes you faster but doesn't help the team. If your outputs aren't in formats others can consume, you've just built a personal productivity tool. The whole point is composability.

Context Hoarding

Consuming context from others without producing context for them. Role OSes are bilateral contracts. If you take, you give.

What's Next

Part 4 covers The Integration Layer: how individual Role OSes compose into team execution through DPOS's five layers. Role OSes solve the coordination problem between two roles. The Integration Layer solves it across the whole team.

After the core series wraps, I'm publishing individual Role OS guides. The Data PM guide is first (it's half-written already, which tells you something about which role I think about most).

Read the rest of the DPOS series

Part 0: Is Your Data Team a Dashboard Factory? — the diagnostic that started the series
Part 1: The Data Product Operating System — the framework introduction
Part 2: The DPOS Kernel — the principles every Role OS depends on
Part 4: The Integration Layer (publishing soon)
Part 5: Implementation (publishing soon)

Monday, 4 May 2026

, & Kevin Holland

dpos, Frameworks, Data Products, Product Management

Is Your Data Team a Dashboard Factory?

Part 0 of 5: The Problem. The first dashboard I built at HCA got opened four times. Total. Not four times a day. Four times ever.

The first dashboard I built at HCA got opened four times. Total. Not four times a day. Four times ever.

I'd spent eleven days on it. Custom SQL, well-designed layout, clear KPIs. The clinical ops director had requested it specifically. She opened it twice, her deputy opened it once, and someone from finance clicked on it by accident. (I checked the access logs like a psycho.)

175 hospitals in that system. $1.2M in annual cost savings from the reporting automation program I eventually built there. But that first dashboard? Dead on arrival. Not because the data was wrong. Because nobody had asked what decision it was supposed to change.

I was a dashboard factory of one.

Feature factory vs dashboard factory comparison

The Dashboard Factory

In 2016, John Cutler wrote "12 Signs You're Working in a Feature Factory" and product teams collectively flinched. The diagnosis was brutal: shipping features nobody asked for, measuring output instead of outcomes, no connection between what gets built and what moves the business.

Marty Cagan put it even sharper: "it doesn't matter how good your engineering team is if they are not given something worthwhile to build."

Data teams have the same disease. We just call it something different.

Instead of shipping features nobody uses, we ship dashboards nobody opens. Instead of a Jira backlog driven by the loudest stakeholder, we have a queue of ad hoc requests driven by whoever pinged us last on Slack. The metrics change (tickets closed per sprint instead of story points) but the dysfunction is identical.

Nick Zervoudis calls it a "service-oriented data team." Joe Horgan calls it the "dashboard factory operating model." I've lived in both versions. The pattern is the same: you build what people ask for instead of what they need. You measure how fast you build it instead of whether it changed anything. And quarter by quarter, the gap between your output and your impact gets wider.

The Diagnostic

Zervoudis published 13 signs of a service-oriented data team. I've adapted them here with my own commentary. Score yourself honestly. If you hit 8 or more, you're deep in it.

1. Stakeholders ask for specific visualizations instead of help with business questions. They say "I need a bar chart of monthly revenue by region." they should be saying "we're losing market share in the southeast and I don't know why." the first request gets you a dashboard. The second gets you a data product.

2. You've built a large number of reports and dashboards, and suspect most aren't being used. At NeuroBlu, I audited our analytics platform and found that nearly half of dashboards hadn't been opened in 90 days. Nobody noticed they were gone when we archived them. Nobody.

3. You find out about upstream schema changes after something breaks. I got paged at 2am once because a vendor changed a field name in their API. No migration notice. No changelog. Our pipeline broke, and three downstream dashboards showed zeroes for six hours before anyone noticed. When your team is the last to know about changes that directly affect your work, you're not part of the system. You're downstream of it.

4. You spend hours every week playing "number detective." The CFO's number doesn't match the VP's number, which doesn't match yours. Same metric, three different definitions, and your morning is now a forensic accounting exercise instead of building the thing that would prevent this from happening again. At NeuroBlu, I spent more time reconciling conflicting metrics across teams than building the analytics that would have standardized them.

5. Stakeholders have forgotten they requested something by the time you deliver it. Three-week turnaround on a "critical" request. You deliver it, send the link, get back "oh, we already handled this a different way." the urgency was real when they asked. By the time you delivered, the decision had already been made without data. That's the tax of a reactive queue.

6. Your workload is ad hoc requests, not strategic projects. Look at your last two weeks. Count the hours spent on requests that came in via Slack DM versus hours spent on work your team planned. If the ratio is worse than 60/40 in favor of ad hoc, you're not running a team. You're running a help desk with SQL skills.

7. You don't know what stakeholders do with the data you give them. This one haunts me. I'll dig into this more in Part 1, but the short version: a join bug inflated our cohort numbers by 40% at NeuroBlu and a client's analyst found it in a board meeting before we did. They'd built their quarterly forecast on our numbers. I had no idea how the data was being used downstream because nobody's workflow included that feedback loop.

8. Your team doesn't own its own roadmap. If your roadmap is just a list of things other people asked for, it's not a roadmap. It's a delivery queue.

9. Your success is measured by outputs, not outcomes. Tickets closed. Dashboards shipped. Reports delivered. None of which tell you whether anyone made a better decision because of your work.

10. You call people outside the data team "the business." The language tells you where the divide is. If there's "the data team" and "the business," you've already lost. You're a service bureau, not a product org.

11. Your quarterly priorities are decided by someone else. Someone in leadership sets your team's goals. You execute them. You don't get to say "actually, the highest-impact thing we could do this quarter is X." your roadmap is a delivery queue with a fancier name.

12. You get requests after decisions are made. "we decided to launch this. Can you build the reporting?" that's the gut-punch version of every sign on this list. The decision happened without data. Your team gets called in for the cleanup. You're not informing strategy. You're documenting it after the fact.

13. You can't quantify the monetary value your team adds. If the CFO asked you right now what your team is worth in dollar terms, could you answer? Most data teams can't. And that's why they get treated as cost centers.

Why This Happens

Five root causes. All of them are organizational, not technical.

You're positioned as a service function. The data team reports into IT or engineering instead of product. That reporting line defines your purpose: you exist to serve requests, not to drive outcomes. Your budget gets justified by how many tickets you close, not by what decisions you changed.

Nobody applied product thinking to data. Cagan's four risks (value, usability, feasibility, business viability) apply to data products the same way they apply to software products. Most data teams skip the first two. They validate feasibility ("can we build this pipeline?") and viability ("does the infrastructure support it?") but never ask "will anyone actually change their behavior because of this?"

Your methodology was designed for software, not data. Scrum, Kanban, SAFe. All designed for teams that ship code. Data teams have different rhythms. A data scientist needs three to five weeks of exploration before she can even scope the problem. You force that into a two-week sprint and she either rushes the exploration or carries it across sprints until everyone loses track. The methodology doesn't fit, so the team works around it, and the workarounds become the process.

Handoffs are informal. Context travels through Slack threads, meeting notes, and tribal knowledge. When work moves from the PM to the data scientist to the engineer, critical assumptions evaporate at each boundary. The PM knows why this cohort was chosen. The data scientist knows what edge cases were excluded. The engineer knows neither. I've watched entire quarters of work collapse because of one implicit assumption that nobody documented. (I'll dig into the worst version of this in Part 1.)

Nobody owns the fifth risk. Cagan's framework includes four risks. Data products need a fifth: data ethics and quality. Who's accountable when a model performs differently across demographic groups? Who catches the bias before it ships? In most teams, that job belongs to everyone, which means it belongs to no one.

Here's what makes all of this hard to fix. Joe Horgan nails it: "operating as a dashboard factory is often hard-wired into data teams. Ad hoc changes won't stick in an operating model that's designed to frustrate them." you can't fix it by hiring better analysts or buying better tools. Chad Sanderson argues that data's core problem is trust and communication, not technology. He's right, but you also can't fix it by just telling people to communicate better. You need the structure that makes good communication the default.

The dashboard factory reinforcement cycle

The Escape Route

I've been building a system around this since late 2023.

It started with the handoff problem. Every failure I described above traces back to the same root: context disappearing when work crosses from one person to another. What if handoffs were contracts instead of conversations? What if every time work moved between roles, there was an explicit agreement about what context transfers, what assumptions are in play, and what needs to be true before the next person builds on it?

I call it DPOS, the Data Product Operating System. It's an operating model: the principles, practices, and contracts that define how a data team discovers, builds, ships, and evaluates data products. Each role keeps its own workflow. The contracts connect them. This is a five-part series covering the kernel principles, role-specific operating systems, the integration layer that connects them, and practical implementation paths for teams at different maturity levels.

Part 1 starts here.

So What

The hard part isn't building better dashboards. It's recognizing that the dashboards were never the product. The decisions they were supposed to enable? Those are the product.

Dashboard factories don't need better dashboards. They need to stop being factories.

Next in series: Part 1: Introduction — The Operating System for Data Products

Series Navigation:
- Part 0: The Problem — You are here
- Part 1: Introduction
- Coming soon: Part 2 — The Kernel (Principles)
- Coming soon: Part 3 — Role Operating Systems
- Coming soon: Part 4 — The Integration Layer
- Coming soon: Part 5 — Implementation

Friday, 27 February 2026

, & Kevin Holland

data-products, Frameworks, dpos, Blog

DPOS: The Operating System for Data Products in the AI Era

Your day looks like this: standup, sync, planning, review, requirements. Somewhere in there you're supposed to actually analyze data. The system was designed for software teams, not data teams. DPOS fixes that.

I once watched a three-month project fail because one team assumed UTC and another assumed local time. Nobody wrote it down.

That's not a code bug. That's a handoff failure. The PM's requirements became the engineer's spec, and somewhere in the gap, a critical assumption about date formatting vanished. Three months of work. One implicit handoff. Dead.

Data teams don't fail because they're slow. They fail because the space between roles is where context disappears.

The Pattern I Keep Seeing

I've built $9M+ in data product ARR across healthcare analytics, real-world evidence platforms, and clinical decision tools. Nine years. (Feels like twenty.) Multiple companies. Different team sizes, different tech stacks, different org charts.

The failure pattern is always the same: the handoffs.

At Revive, six months into leading the data product team, my data scientist looked at me mid-sprint and said: "We constantly plan but never finish. We keep pushing the interesting modeling work to the next sprint." Our designer added: "The UI keeps changing because we don't have time to explore visualization options properly." Our tech lead just nodded. He was too frustrated to talk.

Two-week sprints were fragmenting work that needed deep exploration. But the real problem wasn't the sprint length. It was that every time work crossed from one person to another, context leaked. Ask me how many retros it took to figure that out.

In a different role, I shipped a cohort analysis with a join bug that inflated numbers by 40%. Our client's analyst found it before we did. In a board meeting. They'd built their quarterly forecast on our numbers. Wrong cohort, wrong forecast, wrong resource allocation downstream. That's what happens when the handoff between building and testing doesn't include explicit quality contracts. Nobody's job to catch it, so nobody caught it.

It's Not Velocity. It's the Gaps.

Every methodology I've tried shares the same blind spot: they treat handoffs as informal conversations.

Scrum optimizes for delivery cadence. I've run Scrum with data teams at three different companies. Here's the pattern: the data scientist needs four weeks of exploration before they can even scope the problem. You force that into a two-week sprint and they either rush the exploration (bad) or carry it across sprints (which defeats the point of sprints). Meanwhile the engineer is waiting for a spec that doesn't exist yet because the scientist hasn't finished exploring. Context leaks at every handoff because nobody has time to document what they learned before the next ceremony starts.

Kanban optimizes for flow. But data products need quality gates, ethical review, and validation that don't fit a pull-based model. You can't just "pull" the next data quality check when you're ready for it. Sometimes the data quality check needs to block everything.

CRISP-DM assumes linear progression: business understanding, data understanding, preparation, modeling, evaluation, deployment. In practice, you discover requirements while building. You find data quality issues in production. You loop back constantly.

SAFe tries to solve coordination at scale. But the coordination overhead suffocates the exploratory work that makes data products valuable.

Each of these gets something right. Scrum's retrospective discipline. Kanban's work-in-progress limits. CRISP-DM's insistence on business understanding before modeling. Good ideas, all of them.

But they all assume handoffs happen through conversations. Tickets get passed. People sync up in meetings. Context travels through Slack messages that disappear in a week.

For software products, that works well enough. The code is the artifact. It either compiles or it doesn't.

For data products, the context IS the artifact. Why this cohort was defined this way. What assumptions went into the model. Which edge cases were intentionally excluded. Lose that context in a handoff, and you get a UTC bug that kills three months of work.

What If Handoffs Were Contracts?

That question changed how I run data teams.

What if every time work crossed from one person to another, there was an explicit agreement about what context transfers? Not a meeting. Not a ticket description. A contract: here's what I produced, here's what it assumes, here's what you need to know before you build on it.

For the engineer, this means not reverse-engineering stakeholder intent from a Jira ticket. The contract tells you what the PM's discovery actually found, what it assumes, and what edge cases were already considered. You build from what's written down, not from guessing what the PM meant.

I've been building a system around this idea for the past two years. I call it DPOS, the Data Product Operating System.

DPOS works at two levels:

Individual. Each role on the team has their own operating system. The PM's workflow for synthesizing stakeholder feedback is different from the engineer's workflow for building pipelines. That's fine. Each person works the way they actually work, not the way a methodology prescribes.

Team. DPOS connects those individual systems through shared context at lifecycle boundaries: Discover, Decide, Build, Test, Ship, Evaluate. A set of principles that don't change (the kernel), and explicit handoff contracts that connect individual workflows into coordinated execution. The stages aren't the interesting part. The handoffs between them are.

Why Now?

DPOS is about explicit handoff contracts. You could implement these with shared docs, templates, or well-structured meeting notes. Before AI, I did exactly that. It worked, but the documentation overhead was brutal.

What's changed is that AI makes this cheap. At Holmusk, I ran 315 customer queries + questions through an AI analysis pipeline (Gemini) and got structured themes, segment breakdowns, and a prioritized issue list in 20 minutes. Before that, the same synthesis took me two hours of manual categorization. That synthesis becomes the handoff artifact my engineer actually reads, because it's specific enough to build from.

The documentation now happens as a byproduct of the work itself. AI makes DPOS faster. But the principles come first. The tooling is up to you.

What Makes This Different

I've tried forcing teams into Scrum. I've watched CRISP-DM die slow deaths in organizations that adopted it on paper and ignored it in practice. I've seen teams "customize" Kanban until it was just a wall of sticky notes with no structure.

Stop making people fit the framework. Let the framework emerge from how people actually work.

What's Coming

This is Part 1 of 5.

Part 2: The Kernel. Principles that never change. Trust over Features. Judgment over Automation. The philosophical and strategic foundation.
Part 3: Role Operating Systems. How to build an OS for each team role. Deep dive on the Data PM OS as proof of concept.
Part 4: The Integration Layer. How individual operating systems connect into team execution. Where this table gets the nuance it deserves.
Part 5: Implementation. Getting started patterns for greenfield, brownfield, and AI integration paths.

After the core series: practical Role OS guides for each core data product role, with templates and real workflows.

So What?

Explicit handoff contracts sound obvious. So why doesn't every team do this?

Because the hard part isn't the contract format. It's the principles underneath. When a data scientist's exploration contradicts the PM's roadmap commitment, whose context wins? When a quality check reveals a problem two days before launch, does trust outweigh the timeline? (Spoiler: it does. I learned that one the expensive way.)

Part 2 is about the kernel: the principles that make those calls before the pressure hits.

Next in series: Coming soon: Part 2 — The Kernel (Principles)

Series Navigation:
- Part 0: The Problem
- Part 1: Introduction — You are here
- Coming soon: Part 2 — The Kernel (Principles)
- Coming soon: Part 3 — Role Operating Systems
- Coming soon: Part 4 — The Integration Layer
- Coming soon: Part 5 — Implementation

Sunday, 22 February 2026

, & Kevin Holland

Data Products, Frameworks, dpos, Thought Leadership, Blog