本手册补齐的是 portfolio governance 层。已有文档分别解决高管沟通、平台产品、能力评估和董事会治理, 本文件把它们连接成一个企业级 AI 投资组合运营系统。
843 行AI_TRANSFORMATION_VALUE_OFFICE_PLAYBOOK.md
AI Transformation Value Office / Portfolio Governance Playbook
目标: 训练从 AI PM / BA / Architect 升级到能管理企业 AI 投资组合、价值实现、采用、成本、风险和停项决策的能力。
适用对象: AI PM Lead、AI Transformation Consultant、Enterprise Architect、AI Portfolio Lead、AI Governance PM。
核心原则: 企业 AI 转型不是不断启动 POC, 而是用可审计的 portfolio operating model 把资金投向能被采用、能复用、能控制风险、能证明收益的 AI 能力。
1. 定位与相邻资产连接
本手册补齐的是 portfolio governance 层。已有文档分别解决高管沟通、平台产品、能力评估和董事会治理, 本文件把它们连接成一个企业级 AI 投资组合运营系统。
feedback closure time、eval regression、prompt/model version performance
5.5 Unit Economics
AI portfolio 不能只报总收益, 必须看单位经济:
Net value per unit =
gross benefit per unit
- model/token cost
- platform cost allocation
- license cost
- human review cost
- QA/control cost
- support and change cost
- incident/risk reserve where material
| Use case | Unit | Gross benefit | Cost-to-serve | Net value signal |
|---|---|---|---|
| Customer service RAG | assisted case | 节省处理时间、降低 QA rework | token、RAG、license、supervisor review | 只有当 suggestion acceptance 和 quality gate 同时达标才成立 |
| AML copilot | alert review | 缩短 evidence gathering、降低 rework | retrieval、case integration、senior review、audit sampling | 不能用减少 SAR/STR 数量作为直接收益 |
| Fraud prioritization | alert / transaction | 提升 fraud capture、降低 false positive handling | real-time latency、model monitoring、appeal support | 必须同时看 fraud loss 和客户误拦截 |
| Inventory insight | SKU-store-week | 降低缺货和库存积压 | demand model、planner review、integration | 更适合快速规模化, 但要防止推荐无人采用 |
AI Portfolio Operating Pack
1. Executive summary
2. Value Office charter
3. Portfolio backlog and scoring
4. Financial retail use case map
5. Benefits realization register
6. Risk and governance mapping
7. Reuse and dependency map
8. Monthly executive dashboard
9. Scale/Stop decision memos
10. Interview storytelling appendix
10. Templates
10.1 AI Portfolio Kanban
Stage
Entry rule
Exit evidence
Example card
Intake
Business owner, workflow, pain, rough data source identified
Intake card complete
Customer service policy RAG
Prioritization
10-dimension scoring completed
Ranked and capacity checked
Fraud alert prioritization
Discovery
Discovery funding and owner confirmed
Baseline, target, data readiness, risk tier
AML investigator copilot
Pilot
Pilot gate and stop rule approved
Pilot evidence: quality, adoption, cost, risk
Payment dispute triage
Release
Release gate passed
Limited production release and monitoring
Service RAG for 8 policy topics
Scale
Benefits signed off and support model ready
Expanded users/process/regions
Inventory demand insight
Retire
Stop trigger or replacement approved
Retirement record and lessons learned
Legacy standalone chatbot
Kanban card fields:
Field
Example
Use case
Customer service policy RAG
Business owner
Head of Contact Center Operations
Stage
Pilot
Decision needed
Limited release approval
Next gate date
2026-08-15
Value hypothesis
Reduce AHT and QA rework for 8 high-volume policy topics
Risk tier
Medium-high
Platform reuse
RAG ingestion, citation, model gateway, audit log
Stop rule
Stop if unsupported claim rate remains above 3% after remediation cycle
10.2 Use Case Scoring Matrix
Dimension
Weight
Score
Evidence example
Interpretation
Strategic fit
12%
4
Contact center efficiency is FY priority
Strong business alignment
Value size
18%
4
42,000 monthly policy cases, 13.8 min AHT
Material operating value
Feasibility
12%
4
CRM and knowledge base API available
Pilot feasible
Data readiness
10%
3
80% policy docs owned, 20% needs cleanup
Discovery risk remains
Reuse potential
8%
5
Same RAG pattern for compliance reporting
Platform candidate
Time-to-value
8%
4
8 policy topics can pilot in 6 weeks
Fast evidence
Risk tier
10%
3
Customer-facing draft, human approval
Medium constraint
Cost-to-serve
8%
2
Cost per assisted case projected at 0.32 USD
Manageable
Adoption burden
8%
3
Agents need workflow change and supervisor cadence
Needs change plan
Regulatory sensitivity
6%
3
Policy advice can affect complaints
Moderate sensitivity
Decision interpretation:
Fund pilot if value + feasibility + reuse are strong, but require quality gate, source freshness control, supervisor approval for high-risk topics, and adoption proof before release.
10.3 Benefits Realization Register
Field
Example
Benefit ID
BR-CSR-RAG-001
Use case
Customer service policy RAG
Benefit type
Productivity capacity + quality improvement
Baseline
42,000 monthly policy cases, 13.8 min AHT, 7.2% QA fail
productivity capacity counted after staffing plan or backlog reduction is confirmed
Confidence
Medium until 4-week production sample
Sign-off
Business owner and finance reviewer sign at monthly value review
10.4 Scale / Stop Decision Memo
# Scale / Stop Decision Memo
## Decision Requested
Approve limited scale of customer service policy RAG from 30 agents to 180 agents across two contact center teams, with weekly monitoring and a stop trigger tied to unsupported claim rate, adoption, and cost per assisted case.
## Current Stage
Pilot completed for 8 high-volume policy topics over 6 weeks.
## Evidence Summary
- Baseline: 42,000 monthly policy cases, 13.8 min AHT, 7.2% QA fail.
- Pilot result: assisted cases reached 10.9 min AHT, QA fail 4.9%, unsupported claim 2.4%.
- Adoption: 76% target agents used the tool weekly, median 14 assisted cases per agent per week.
- Cost: all-in cost per assisted case 0.38 USD, within 0.45 USD gate.
- Risk: high-risk topics required supervisor approval; no severe incident during pilot.
## Recommendation
Scale to two teams, not enterprise-wide. Keep topic scope fixed for 30 days while monitoring production quality and support load.
## Conditions
- Keep all production calls through model gateway.
- Weekly manager review of override reasons and low-confidence answers.
- Source freshness control reviewed every Friday.
- Finance validates realized benefit after 4 production weeks.
## Stop / Rollback Trigger
Pause expansion if unsupported claim rate exceeds 3% for two consecutive weekly samples, stale source hit exceeds 1%, cost per assisted case exceeds 0.45 USD, or weekly active usage falls below 60% of target users.
## Next Review
Monthly value review after 4 production weeks.
10.5 Executive Portfolio Dashboard
Section
Metric
Green
Yellow
Red
Portfolio flow
% use cases with named business owner
>= 95%
80-94%
< 80%
Value
validated benefits / forecast benefits
>= 70%
40-69%
< 40%
Adoption
pilots meeting adoption gate
>= 70%
50-69%
< 50%
Quality
releases with eval report and regression pass
100%
90-99%
< 90%
Risk
high-risk systems with current risk owner sign-off
100%
95-99%
< 95%
Cost
use cases with cost per unit reported
>= 90%
70-89%
< 70%
Reuse
new pilots using platform gateway/eval/logging
>= 80%
60-79%
< 60%
Decision discipline
POCs stopped or scaled based on gate evidence
documented for all gate decisions
partial documentation
decisions made without evidence
Monthly narrative:
This month the portfolio moved from 42 to 38 active use cases after stopping 5 low-evidence POCs and adding 1 platform capability. Validated benefits increased from 41% to 55% of forecast as customer service RAG and inventory demand insight passed adoption gates. Two controlled bets remain constrained by data readiness: AML copilot and wealth guardrail. The decision request is to fund platform eval harness expansion because 7 high-risk pilots now depend on consistent release evidence.
11. 面试表达
11.1 30 秒版本
我会把 AI 转型从“做很多 POC”升级成 AI Value Office。它不只是 PMO, 而是管理 use case portfolio、funding gate、benefits realization、risk gate、platform reuse、change adoption 和 kill/scale decision。每个 AI use case 先通过 intake 和 scoring, 再用 baseline、target、quality proof、adoption proof、unit economics 和 risk evidence 决定是否继续投。这样 CFO 能看到收益口径, COO 能看到流程采用, CIO/CTO 能看到平台复用和成本, risk 能看到控制证据。
11.2 2 分钟版本
企业 AI 最大的问题不是缺少 idea, 而是缺少 portfolio discipline。我的做法是建立 AI Transformation Value Office。
第一层是 intake 和 prioritization。每个 use case 必须有 business owner、流程、baseline、数据来源和风险信号, 然后按 strategic fit、value size、feasibility、data readiness、risk tier、reuse potential、time-to-value、cost-to-serve、adoption burden、regulatory sensitivity 打分。
第二层是 stage gate。Discovery 证明问题和数据, pilot 证明质量、采用、成本和风险, release 证明控制和运维, scale 证明单位经济和 adoption 可扩张。每一阶段都有 stop rule。
第三层是 benefits realization。我不会只说节省小时数, 而是建立 baseline、target、leading/lagging indicators、unit economics、adoption proof、quality proof、risk reduction proof, 最后让 finance sign off 收益口径。
第四层是 platform reuse。凡是多个 use case 重复需要 model gateway、RAG、eval、audit log、cost dashboard, 就进入平台 backlog, 降低重复建设和治理风险。
最终输出包括 portfolio backlog、funding memo、benefits register、reuse map、dependency map、monthly exec dashboard 和 decision log。这样 AI 转型就从试点集合变成可投资、可治理、可停止、可规模化的企业能力。