返回 Papers
AI 扩展计划 / Playbooks

AI JTBD / Outcome-Driven Innovation Playbook

以下来源作为 JTBD、ODI 和 AI 风险治理的学习锚点。本文把它们转成金融零售 AI use case 选择、需求证据、自动化边界和产品架构设计语言, 不构成法律、合规、审计、模型验证或监管解释意见。访问日期按 2026-06-29 记录。

848AI_JTBD_OUTCOME_DRIVEN_INNOVATION_PLAYBOOK.md

AI JTBD / Outcome-Driven Innovation Playbook

面向对象: 已具备 CBAP / 高级 BA / AI PM / Product Architect / Solution Architect / AI Platform PM / 金融零售 AI 转型负责人。 核心问题: 如何用 Jobs-to-be-Done 和 Outcome-Driven Innovation 选择真正值得做的 AI use case, 而不是被 demo、模型能力、供应商话术或内部热点牵着走。 使用方式: 每个候选 AI use case 至少产出一份 AI JTBD Canvas、一张 Outcome Opportunity Scorecard、一份 Automation Boundary Contract、一份 Evidence-to-Architecture Traceability Matrix 和一页 Use Case Selection Memo。 训练定位: 本文不讲基础用户访谈、persona 入门或需求收集套路, 重点训练 CBAP 之后的高级问题定义、outcome 建模、证据驱动取舍、自动化边界和产品架构判断。


Source Anchors

以下来源作为 JTBD、ODI 和 AI 风险治理的学习锚点。本文把它们转成金融零售 AI use case 选择、需求证据、自动化边界和产品架构设计语言, 不构成法律、合规、审计、模型验证或监管解释意见。访问日期按 2026-06-29 记录。

AnchorOfficial / primary source本文使用方式
HBR Jobs to Be Done - Christensen et al.https://hbr.org/2016/09/know-your-customers-jobs-to-be-done用“客户做选择是为了完成某个 job”的视角, 把 AI idea 从功能愿望改写成稳定的 job、struggle、progress 和 switching trigger。
Strategyn JTBD / ODI officialhttps://strategyn.com/jobs-to-be-done/用 ODI 的 job map、desired outcome、importance / satisfaction 和 underserved opportunity 思路, 建立 AI use case 选择和排序方法。
NIST AI Risk Management Frameworkhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 组织 AI 风险、证据、评估、上线门禁、持续监控和责任边界。
NIST AI RMF Generative AI Profilehttps://www.nist.gov/publications/artificial-intelligence-risk-management-framework-generative-artificial-intelligence用 GenAI 特有风险补强 hallucination、data leakage、prompt injection、over-reliance、tool misuse、content provenance 和 misuse 的筛选门槛。
CFPB Chatbots in consumer financehttps://www.consumerfinance.gov/data-research/research-reports/chatbots-in-consumer-finance/用于客服 AI 的 outcome 和风险边界: AI 不能以效率为名降低客户获得及时、清晰、人工支持和救济的能力。
CFPB Circular 2022-03 complex algorithms and adverse actionhttps://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/用于信贷申请人场景: 复杂算法不免除提供具体、准确 adverse action reason 的义务, 因此 LLM 不能替代可追溯的 reason pipeline。

Source-to-artifact mapping:

Source lens可以产出的 artifact高级面试表达
HBR JTBDJob definition、switching trigger map、progress narrative“我不会从 AI 功能开始, 我先定义客户或员工真正要完成的 job, 再判断 AI 是否能让 progress 更快、更可靠或更低风险。”
Strategyn ODIJob map、outcome catalog、importance / satisfaction score、underserved opportunity map“我用 desired outcome 和 underserved outcome 排序 AI use cases, 避免把资源投到看起来炫但结果已经被满足的场景。”
NIST AI RMFAI risk map、measurement plan、release gate、monitoring gate、evidence binder“我把 use case selection 同时放进价值和风险闭环, 不把 AI 选择简化成 ROI 或准确率排序。”
CFPB consumer finance sources客服与信贷客户影响边界、人工升级、具体原因链路“金融零售 AI 的好 use case 不是只降低处理成本, 还要保护客户获得解释、人工支持、纠错和救济的能力。”

1. One-Sentence Positioning / 一句话定位

JTBD / ODI for AI Use Case Selection 的核心不是问“哪里能用 AI”, 而是:

Job to be done
-> job steps and desired outcomes
-> underserved outcomes with evidence
-> AI intervention point
-> automation boundary
-> product architecture and eval gate
-> portfolio decision: scale / pilot / redesign / stop

一句话:

AI JTBD / ODI = 用稳定的 job 和可度量的 desired outcomes 选择 AI 投资点, 用 underserved outcome 证明机会, 用 automation boundary 限定模型权力, 用产品架构和证据链把 idea 变成可上线、可审计、可持续改进的能力。

在金融零售场景里, 一个合格的 AI use case selection 必须同时回答:

  1. 这个 job 是谁在什么情境下必须完成的, 不是哪个部门想要 AI。
  2. 当前 job 的哪些 outcome 重要但满足度低, 有什么业务、风险或客户证据。
  3. AI 介入 job 的哪一步, 是 read、locate、summarize、draft、recommend、monitor、act 还是 decide。
  4. 哪些 decision、tool action、客户沟通、合规判断和责任边界不能自动化。
  5. 需要哪些数据、知识、规则、权限、workflow、UI、eval、monitoring 和 audit 组件。
  6. 什么时候 scale, 什么时候只做受控 pilot, 什么时候用规则/流程/数据质量改造替代 AI。

2. 为什么 CBAP 之后还需要 JTBD / ODI

CBAP 已经证明你掌握 stakeholder、elicitation、requirement lifecycle、strategy analysis、solution evaluation 和 change management。AI 时代的升级点不是再学习“怎么访谈用户”, 而是把已有 BA 能力提升为 outcome-first、evidence-first、architecture-aware 和 risk-aware 的选择系统。

CBAP 已有能力AI use case 选择中的新挑战JTBD / ODI 升级方式
Business need analysisAI idea 常由 demo、vendor、内部热点或模型能力驱动用 job 和 desired outcome 过滤 solution-first 需求
Stakeholder requirement同一 AI 输出可能影响客户、员工、主管、模型风险、合规和审计区分 job executor、buyer、operator、risk owner、affected customer 和 platform customer
Requirements elicitation业务方容易说“我要智能助手”, 而不是说清 job struggle从 job step、outcome、baseline、exception、evidence 和 switching trigger 提问
Solution evaluationAI 指标、业务 KPI、风险指标和 adoption 指标可能互相冲突用 ODI opportunity + NIST risk tier + architecture feasibility 做组合判断
Traceabilityprompt、RAG、model、tool、policy、knowledge source 都会变化建 requirement -> outcome -> eval case -> architecture component -> release gate 的 lineage
Change managementAI 上线后行为会漂移, 组织也会过度信任或绕过流程把 monitoring、feedback、override、incident 和 eval update 写入生命周期
Strategy analysisAI portfolio 容易碎片化成一堆 point solution用 stable job 和 reusable capability 把 use cases 聚合到产品架构和平台路线图

高级表达:

CBAP 让我能把业务问题和 stakeholder concern 讲清楚;JTBD / ODI 让我能进一步证明哪个 outcome 真的 underserved, AI 应该介入哪一步, 自动化边界在哪里, 以及需要什么架构和证据才能把 use case 安全地 scale。


3. 核心框架: JOA Selection Loop

JOA = Job, Outcome, Architecture。它把 AI use case selection 从灵感筛选变成可复核的决策系统。

1. Define market as a job executor + job
2. Map the job steps
3. Write desired outcome statements
4. Score importance, satisfaction, evidence and risk
5. Identify underserved outcomes
6. Decide AI intervention point and no-AI alternative
7. Define automation boundary and human accountability
8. Map outcome to product architecture
9. Build eval and release gate
10. Make portfolio decision

3.1 JOA 分层

Layer关键问题主要产出
L1 Market / Job Definition哪类人要完成哪个稳定 jobJob definition、job executor map、affected stakeholder map
L2 Job Mapjob 的理想完成过程是什么Define / Locate / Prepare / Confirm / Execute / Monitor / Modify / Conclude map
L3 Outcome Catalog每一步如何衡量“更好地完成”Desired outcome statements、outcome taxonomy
L4 Opportunity Scoring哪些 outcome 重要但满足度低Importance / satisfaction / evidence / risk-adjusted opportunity score
L5 AI TranslationAI 在哪一步提供 read、summarize、draft、recommend、actAI intervention map、no-AI alternative
L6 Automation BoundaryAI 能做什么, 不能做什么, 何时升级人Automation Boundary Contract、decision rights matrix
L7 Product Architecture需要哪些系统、数据、模型、规则、工具和治理组件C4 / sequence sketch、control points、component responsibility
L8 Eval / Gate如何证明可以 pilot、scale 或 stopEval contract、release gate memo、monitoring gate
L9 Portfolio Decision投资、暂停、合并、平台化或退役Use case selection memo、portfolio heatmap

3.2 选择原则

Principle操作化解释反例
Job before AI先定义稳定 job, 再选择 AI pattern“我们要做一个财富 GPT”
Outcome before feature先定义 desired outcome, 再写功能“加一个总结按钮”
Evidence before prioritization没有 baseline、案例、日志、QA、投诉、风险或财务证据, 不进入 scale只凭高管兴趣排优先级
Boundary before autonomy先定义 AI 不能做什么, 再讨论自动化程度先做 agent, 之后补审批
Architecture before pilot scalepilot 之前就要知道未来如何接入权限、日志、eval、monitoring 和工具网关PoC 能跑, 生产再说
Portfolio before platform看到多个 job 共享能力后再平台化第一个 demo 就建设大平台

4. JTBD 到 AI 的转译

JTBD 不是 persona, 也不是旅程图。它问的是: job executor 在一个情境中要完成什么进展, 他们用什么替代方案, 现有方案在哪些 outcome 上表现不足。AI 转译的关键是把 job step 改写成 AI 可以帮助或不能帮助的工作类型。

4.1 Job Step 到 AI Intervention

JTBD stepjob executor 在做什么AI 可以帮助AI 不应直接替代
Define明确目标、限制、成功标准解释政策、识别约束、生成检查清单、提示缺失信息定义受监管决策目标、替代业务 owner 的风险接受
Locate找到完成 job 所需输入检索政策、交易、客户资料、案例、产品资料、监管文本绕过权限、使用未经授权或过期来源
Prepare整理、清洗、结构化输入摘要、抽取字段、去重、实体匹配、预填表单默默修改正式记录、隐藏不确定性
Confirm确认信息足够、证据可信、约束满足completeness check、policy check、citation check、risk flags把缺证据判断成事实、跳过必要人工确认
Execute完成核心任务起草 memo、生成回复草稿、创建 case draft、建议下一步高影响 approve / decline / file / waive / trade / close
Monitor观察执行是否正常异常检测、队列优先级、SLA 风险、quality drift无人监督地改变客户权益或案件结论
Modify根据异常调整方案推荐补件、升级、重路由、请求二审、更新草稿自动覆盖人工决定、绕开 maker-checker
Conclude完成记录、通知、归档、复盘生成 case note、evidence bundle、QA summary、learning loop生成不可追溯的正式通知、删除或重写审计证据

4.2 AI Pattern 选择

Outcome 类型更可能适合的 AI pattern架构含义
信息查找慢RAG / semantic search / policy assistantsource-of-truth、权限过滤、版本生效日期、citation validator
文档和案例太长Summarization / structured extractionschema、evidence reference、human edit log、coverage check
资料缺失多Completeness checker / rules + modelrule engine、document parser、missing information workflow
判断不一致Recommendation with rubric / decision supportpolicy rules、case similarity、explanation、reviewer calibration
草稿耗时高Drafting copilotapproved templates、tone policy、edit diff、approval before send
队列优先级不清Predictive triage / anomaly detectionfeature store、risk score、monitoring、bias and drift checks
工具动作频繁且低风险Agent with approval or constrained automationtool gateway、least privilege、idempotency、rollback、kill switch
客户权益影响大Human decision with AI evidence supportstrong HITL、reason pipeline、formal notice boundary、audit binder

4.3 JTBD 转译公式

For [job executor],
when trying to [job],
at job step [step],
they struggle to [desired outcome not sufficiently achieved],
because [evidence-backed constraint],
so AI should [read / locate / summarize / draft / recommend / monitor / act-with-approval],
but must not [decision / action / communication boundary],
and success will be evaluated by [outcome metric + risk metric + adoption evidence].

例子:

场景弱表达JTBD-to-AI 表达
AML做一个 AML Copilot当 AML analyst 调查 alert 时, 在 locate / prepare / confirm 阶段难以及时找到跨账户证据和关键 red flags;AI 应汇总证据、生成引用和起草 narrative, 但不能关闭 alert、决定 SAR 或替代二审。
客服做更智能的客服机器人当客服处理费用、争议和投诉问题时, 在 define / locate / execute 阶段需要快速识别 regulated intent、引用有效政策并生成可审核回复;AI 可做 routing、知识检索和草稿, 但不能阻断人工支持或承诺未经授权的补偿。
信贷AI 帮客户申请贷款当申请人完成信贷申请时, 在 prepare / confirm 阶段容易漏传资料和误解下一步;AI 可解释材料要求和预填非决策字段, 但不能给出正式批准概率、拒绝理由或替代 adverse action reason pipeline。

5. Outcome Statement 设计

ODI 的核心资产不是 pain point list, 而是 desired outcome statement。AI use case 必须从 outcome statement 开始, 否则后续 eval、架构和价值证明都会漂移。

5.1 Outcome statement grammar

建议使用下面语法:

Direction [minimize / increase / reduce / improve]
+ metric object [time, effort, likelihood, accuracy, completeness, risk, rework, variance]
+ job step object [to locate evidence, confirm policy applicability, complete application]
+ context [when / while / before / after]
+ constraints [without increasing customer harm, compliance risk, manual rework, latency, cost]

常用模式:

Outcome typeGrammar金融零售例子
TimeMinimize the time it takes to...降低 AML analyst 找到关键交易证据和关联实体的时间。
EffortMinimize the effort required to...降低客服为同一客户问题跨系统查找政策、账户状态和历史 case 的努力。
LikelihoodIncrease the likelihood that...提高信贷申请人在提交前发现资料缺失和字段不一致的可能性。
CompletenessIncrease the completeness of...提高财富顾问会前准备中 suitability checklist 和风险揭示覆盖度。
AccuracyIncrease the accuracy of...提高客服回答中政策引用、费用规则和生效日期的准确性。
Risk reductionReduce the risk that...降低 AI 或员工遗漏 AML red flag、误导客户、生成不可解释拒贷原因的风险。
VarianceReduce variance in...降低不同分析员、客服或顾问对同类政策问题处理口径的差异。

5.2 好 outcome 与弱 outcome

弱 outcome问题高质量 outcome
提升客服效率不知道 job step、指标、风险边界在信用卡费用咨询中, 降低客服查找有效政策和生成可审核回复草稿的时间, 同时 unauthorized fee waiver commitment 为 0。
AML 更准确“准确”未定义提高 AML alert investigation 中关键 red flag 被识别并附上 source evidence 的覆盖度, 同时 final disposition 仍由授权 analyst 完成。
信贷体验更好缺少可测对象提高申请人在提交前发现缺失材料、身份信息不一致和资格限制的可能性, 同时不输出正式批准承诺或拒绝原因。
财富顾问更会卖产品销售导向且风险高提高顾问在客户会谈前完成 suitability evidence、费用风险解释和替代产品比较的完整性, 同时不让 AI 直接向客户生成个性化交易建议。

5.3 Outcome statement 到 eval 的连接

OutcomeEval questionRelease metricEvidence
降低证据查找时间AI 是否减少 analyst 跨系统查找次数Median evidence collection time、click path lengthCase telemetry、pilot shadow study
提高引用准确性AI 输出的关键事实是否有有效来源Citation precision、unsupported claim rateGolden set、citation validator
降低误导性客服承诺AI 是否生成未授权承诺Critical policy violation countRed-team cases、QA review
提高补件完整性申请人是否更少因缺资料返工Missing document return rateApplication funnel log
提高 suitability 覆盖顾问准备是否覆盖风险、费用、客户约束Checklist completeness、supervisor QAAdvisor desktop log、review sample

6. Underserved Outcome: 机会不是痛点, 是高重要低满足

Underserved outcome 是 AI use case selection 的核心。痛点可能很吵, 但不一定重要;重要 outcome 可能已经被规则、流程、培训或现有系统满足;AI 只有在某些 outcome 重要、满足度低、证据充分且风险可控时才值得进入投资队列。

6.1 Opportunity scoring

建议把 ODI 的 importance / satisfaction 思路扩展成 AI 适用的风险调整评分:

AI Opportunity Score =
Importance
+ max(Importance - Satisfaction, 0)
+ Evidence Strength
+ Strategic / Regulatory Relevance
+ Reuse Potential
- Automation Risk
- Data / Architecture Gap
- Adoption Friction

评分用 1-5 分即可, 不要制造虚假的小数精度。

Dimension1 分3 分5 分
Importance边缘便利明显影响团队效率或客户体验影响收入、风险、合规、客户权益或核心运营
Satisfaction已被良好满足局部可用但不稳定满足度很低, 返工、投诉、遗漏或延迟明显
Evidence strength只有主观意见有案例样本或专家判断有 baseline、日志、QA、投诉、损失、SLA 或审计发现
Strategic / regulatory relevance非核心支持部门目标支持企业战略能力或监管关注点
Reuse potential单一页面功能可复用到相邻流程可沉淀为企业 AI 平台能力
Automation risk低风险可逆有客户或运营影响高影响、不可逆、监管敏感
Data / architecture gap源清楚, 接入简单部分源需要治理owner 不清、质量差、权限复杂
Adoption friction嵌入现有流程需要局部流程改变需要大规模角色、激励或政策改变

解释:

Score pattern决策
高 importance、低 satisfaction、强 evidence、低/中风险优先 pilot, 同步设计 eval 和 architecture runway
高 importance、低 satisfaction、强 evidence、高风险可做 decision support / copilot, 自动化边界必须强, 先影子模式或受控 pilot
高 importance、低 satisfaction、弱 evidence先做 measurement sprint, 不直接 build
高 satisfaction、低 friction不优先 AI, 可用流程优化或现有系统增强
高 risk、低 evidence、低 reuse停止或重定义 job / outcome

6.2 Underserved outcome map

Outcome status含义AI 策略
Underserved重要但当前满足度低候选 AI opportunity, 进入证据和边界评审
Overserved满足度高但成本或复杂度过高考虑简化、自动化、self-service 或规则化, 不一定需要 LLM
Appropriately served重要且满足度合理保持现状, 只做小改进或监控
Mis-served看似满足, 但通过绕流程、人工补救或风险转移实现优先做流程/数据/控制改造, AI 只能作为辅助
Unserved / non-consumer现有方案让某群体无法完成 job可探索 disruptive / simpler AI-enabled workflow, 但必须先确认客户影响和合规边界

7. AI Automation Boundary

Automation boundary 是把 JTBD / ODI 变成金融零售 AI 产品的关键控制。它定义 AI 可以影响 job 到什么程度, 哪些动作必须由人、规则系统或正式决策引擎完成。

7.1 自动化等级

LevelAI role允许行为金融零售边界
L0 No AI不使用 AI用流程、规则、培训、数据质量或系统改造解决高风险且证据不足时优先考虑
L1 Retrieve找资料检索政策、案例、产品资料、客户可见 FAQ必须权限过滤、版本控制和来源引用
L2 Summarize总结资料摘要通话、case、交易、文档不能省略关键风险、反向证据和不确定性
L3 Draft草拟内容草拟客服回复、AML narrative、信贷 memo、顾问会前材料外发、归档或提交前必须审核
L4 Recommend建议下一步建议补件、升级、二审、调查方向、产品比较维度必须给证据和理由, 高影响建议由人负责
L5 Act with approval生成动作待批准创建 case、发内部请求、更新低风险字段tool gateway、审批、日志、可撤销
L6 Constrained autonomous act在强边界内自动执行低风险、可逆、规则明确、可监控的内部动作不适合自动拒贷、自动投资交易、自动关闭 AML case、自动阻断投诉

7.2 Boundary contract 必填项

Boundary field必须写清
Job scopeAI 支持哪个 job、哪一步、哪类用户和哪类案件
Allowed inputsAI 可访问的数据源、权限范围、版本、生效日期和保留要求
Allowed outputsAI 可生成的字段、格式、草稿、建议和证据
Blocked outputs禁止生成的承诺、判断、建议、通知、原因、法律/合规结论
Tool authority可调用工具、读写权限、审批要求、幂等性、回滚方式
Human accountability谁审核、谁批准、谁覆盖、谁承担最终业务决定
Escalation trigger低置信、缺证据、高风险意图、客户投诉、政策冲突、敏感属性、异常工具动作
Evidence requirement每个关键输出需要哪些 citation、trace、score、review record
Monitoring trigger哪些线上信号触发停用、降级、回归评估或 incident
Change controlmodel、prompt、RAG、policy、tool 或 UI 变更如何触发回归 eval

7.3 高风险边界红线

Red line原因更稳妥设计
LLM 自动批准或拒绝信贷申请客户权益、fair lending、adverse action reason 和模型风险高度敏感AI 做资料完整性、政策引用、memo 草稿, 决策由正式模型/规则/人审完成
AI 直接向客户给个性化投资交易建议适当性、销售合规、许可和客户损失风险AI 支持顾问准备材料和风险揭示, 客户建议由授权渠道输出
AI 自动关闭 AML alert 或决定不报 SAR金融犯罪风险、审计和监管责任AI 提供证据包、red flag checklist、narrative draft, final disposition 由 analyst / supervisor 决定
客服 AI 阻断投诉或人工支持客户救济、服务可得性和合规风险regulated intent 检测、显著人工入口、warm handoff、case ID
AI 平台默认允许业务 team 接任意模型和任意数据源数据泄露、不可审计、供应商和模型风险model gateway、data classification、policy-as-code、eval gate、usage registry

8. 需求证据与产品架构

高级 AI BA / PM 的差异化能力不是“写得出 AI user story”, 而是能把 outcome evidence 映射到 architecture decision。

8.1 Requirement evidence stack

Evidence type可证明什么例子
Workflow telemetry哪一步耗时、返工、等待、跳出AML case handling time、客服转接次数、申请漏件率
QA / audit sample哪类错误真实发生错误政策引用、漏 red flag、误导性客服承诺
Complaint / escalation log客户或员工在哪些结果上不满意chatbot 无法转人工、信贷申请状态不清、顾问材料解释不足
Policy / control evidence哪些边界不能被 AI 越过adverse action、suitability、AML escalation、fee waiver authority
Financial baselineoutcome 与成本、损失、收入、资本或运营容量的关系平均处理成本、SLA breach penalty、manual review backlog
Expert calibration专家对 outcome 的重要性和满足度判断senior analyst review、credit risk SME、compliance opinion
System feasibility evidence数据、API、权限、日志、工具是否可用source inventory、API readiness、data quality profile
Pilot / shadow evidenceAI 在受控环境是否改善 outcomeshadow mode comparison、human override analysis、eval report

8.2 Evidence-to-architecture traceability

Outcome evidence架构决策需要的 control
关键事实经常找不到来源RAG + citation + source freshnesssource registry、retrieval permission、citation validator
不同员工处理口径不一致policy rules + approved response templatesversioned policy, template governance, QA sampling
资料缺失导致返工extraction + completeness checker + workflow task creationschema validation、human confirmation、exception queue
高风险建议需要审核copilot + review queue + maker-checkerdecision log、override reason、approval authority
工具动作可能造成客户影响tool gateway + least privilege + approvalidempotency、rollback、kill switch、action audit
模型行为随版本变化model gateway + eval harness + regression suitemodel registry、prompt version、release gate
业务方要证明价值outcome dashboard + cohort comparisonbaseline、pilot design、adoption telemetry

8.3 Product architecture minimum set

Component作用选择问题
Experience surface员工 copilot、客户 chatbot、advisor desktop、platform portalAI 是嵌入工作流还是新入口
Workflow integrationcase management、CRM、LOS、advisor platform、ticketingAI 输出如何进入下一步, 谁负责确认
Knowledge / data layerpolicy docs、product master、transaction history、customer profile、case historysource-of-truth、权限、版本和数据最小化是否清楚
Model gatewaymodel routing、fallback、rate limit、logging、vendor abstraction是否需要多模型、成本控制、可替换性
Retrieval / context servicequery rewrite、authorization-aware retrieval、citation、freshness是否能证明输出基于有效来源
Rules / policy engineeligibility、authority、regulated intent、sales boundary哪些必须由确定性规则控制
Tool gatewayread / write tools、approval、rollback、idempotencyAI 能执行哪些动作, 怎样限制
Eval harnessgolden set、rubric、threshold、critical failureoutcome 能否被离线和在线评估
Observabilitytrace、latency、cost、quality、drift、human override上线后如何发现问题
Evidence binderrequirement、source、eval、approval、incident、change record审计和复盘如何重放

9. 金融零售案例

9.1 AML Analyst: Alert Investigation Copilot

维度设计
Job executorAML analyst / financial crime investigator
Core job在有限时间内调查 alert, 判断是否有足够证据支持 escalation、closure、SAR consideration 或 further review
Job stepsLocate transaction / customer / counterparty evidence -> prepare entity timeline -> confirm red flags and policy applicability -> draft narrative -> supervisor review -> conclude and archive
Underserved outcomes降低查找跨系统证据时间;提高关联实体、异常交易模式和 prior case 被发现的可能性;降低 narrative 中缺 source evidence 的风险;减少 analyst 之间口径差异
AI interventionRetrieve、summarize、entity linking、case timeline、red flag checklist、narrative draft、QA pre-check
Automation boundaryAI 不关闭 alert, 不决定 SAR, 不替代 supervisor review, 不删除反向证据, 不把未引用事实写入正式 narrative
Product architectureCase management integration + transaction graph + customer / counterparty profile + AML policy RAG + typology library + citation validator + analyst review UI + audit log + eval harness
EvidenceCase duration baseline、QA defect types、missed red flag sample、senior analyst calibration、shadow mode comparison、override reason analysis
Selection decision高价值但高风险;适合 L2-L4 copilot 和受控 pilot, 不适合 autonomous disposition

Outcome examples:

Outcome statementEval / metric
Minimize the time required for an analyst to locate transaction evidence and related entities when investigating an alert, without reducing evidence completeness.Median evidence collection time, entity recall on golden cases, missing evidence defects
Increase the likelihood that key red flags are surfaced with source evidence before narrative drafting.Red flag coverage, citation validity, senior analyst review score
Reduce variance in case narrative quality across analysts while preserving analyst accountability.Narrative rubric score, edit distance, supervisor rework rate

9.2 客服: Regulated Service Resolution Copilot

维度设计
Job executorContact center agent / digital service customer / supervisor
Core job快速、准确、合规地解决客户问题, 并在投诉、争议、困难援助、信贷、财富、欺诈等高风险意图出现时正确升级
Job stepsDefine intent -> authenticate / locate account context -> locate policy -> prepare answer or action -> confirm authority -> execute response / case creation -> monitor resolution -> conclude with record
Underserved outcomes降低跨系统查找政策和账户状态时间;提高 first-contact resolution;降低错误承诺和错误拒绝人工升级;提高投诉和争议识别率
AI interventionRegulated intent detection、policy RAG、customer context summarization、reply draft、next best action suggestion、handoff summary
Automation boundaryAI 不阻断人工支持, 不承诺未经授权的 fee waiver / credit / compensation, 不关闭投诉, 不替代正式通知, 不在未认证状态披露账户信息
Product architectureOmnichannel routing + identity / entitlement + CRM + policy RAG + approved response template + regulated intent classifier + human handoff + QA monitoring + complaint trigger log
EvidenceCall handle time、transfer rate、complaint keywords missed、QA policy defects、customer effort score、chatbot containment harm review
Selection decision适合从内部 agent assist 起步;客户可见 AI 必须按风险层级限制输出和人工入口

Outcome examples:

Outcome statementEval / metric
Reduce the time required for an agent to locate the current policy and account context needed to answer fee and dispute questions.Average handle time, policy lookup time, citation accuracy
Increase the likelihood that complaint, fraud, hardship and regulated credit intents are routed to the correct human or formal workflow.Intent recall on high-risk class, false negative count, handoff SLA
Reduce unauthorized commitments in AI-drafted customer responses.Critical policy violation count, QA defect rate, red-team cases

9.3 信贷申请人: Application Completion and Explanation Assistant

维度设计
Job executorCredit applicant / loan officer / credit operations team
Core job让申请人准确提交申请材料、理解流程状态和下一步, 同时确保正式信贷决策、理由和客户通知来自受控流程
Job stepsDefine product / eligibility context -> locate required documents -> prepare application -> confirm completeness -> submit -> monitor status -> modify with additional information -> conclude with decision / next steps
Underserved outcomes降低资料缺失和字段不一致;提高申请人理解补件要求的可能性;降低重复联系;降低用模糊解释替代正式 adverse action 的风险
AI interventionDocument checklist、field explanation、document extraction draft、missing info detection、status explanation、loan officer memo draft
Automation boundaryAI 不承诺批准概率, 不生成正式拒绝原因, 不替代 adverse action reason pipeline, 不使用敏感属性做非授权推理, 不绕过 fair lending 和 credit policy controls
Product architectureApplication portal + document ingestion + OCR / extraction + rule-based completeness check + LOS integration + credit policy rules + adverse action reason service + human review + audit evidence
EvidenceApplication fallout, missing document rate, call reasons, rework rate, fair lending control requirements, credit policy exception sample
Selection decision客户体验价值高;AI 适合辅助完成申请和解释流程, 不适合 LLM-only decisioning

Outcome examples:

Outcome statementEval / metric
Increase the likelihood that an applicant submits all required documents before formal review begins.Missing document return rate, first-pass completeness, abandonment rate
Reduce applicant effort required to understand what information is missing and why it is needed.Repeat contact rate, clarification request rate, customer effort score
Reduce the risk that customer-facing explanations conflict with formal credit decision reasons.Reason consistency check, compliance QA, formal notice boundary violations

9.4 财富顾问: Advisor Suitability Preparation Copilot

维度设计
Job executorWealth advisor / relationship manager / supervisor / client service associate
Core job在客户会谈前准备适当性证据、产品资料、风险揭示、费用说明和可讨论选项, 并在合规边界内与客户沟通
Job stepsDefine client objective -> locate profile / holdings / constraints -> prepare product comparison -> confirm suitability evidence -> draft meeting notes / disclosure -> execute advisor-led conversation -> monitor follow-up -> conclude record
Underserved outcomes降低会前准备时间;提高客户限制、风险偏好、集中度、费用、流动性和替代方案覆盖度;降低材料过期或产品风险遗漏
AI interventionClient profile summarization、portfolio concentration flags、product literature retrieval、suitability checklist、disclosure draft、meeting note draft
Automation boundaryAI 不直接向客户给个性化买卖建议, 不承诺收益, 不替代持牌顾问判断, 不生成未经批准的营销材料, 不在缺少适当性信息时推荐产品
Product architectureAdvisor desktop + CRM / portfolio system + risk profile + product master + approved literature RAG + suitability rules + supervision review + communication archive
EvidencePrep time baseline、supervisor QA defects、missing suitability evidence、product literature version issues、client complaint themes
Selection decision适合顾问内部 copilot;客户可见输出必须经过批准模板、顾问审核和记录保存

Outcome examples:

Outcome statementEval / metric
Minimize the effort required for an advisor to assemble current product, fee and risk evidence before a client meeting.Prep time, source freshness, missing document defects
Increase the completeness of suitability considerations captured before discussing product options.Checklist coverage, supervisor QA score, exception count
Reduce the risk of unapproved claims or outdated product information entering client communication.Approved source citation rate, marketing compliance defects

9.5 AI 平台内客户: Enterprise AI Platform Consumer

维度设计
Job executorBusiness AI product team / domain PM / solution architect / engineer / risk reviewer
Core job在符合企业治理、数据、安全、模型风险和审计要求的前提下, 更快交付可复用、可评估、可监控的 AI capability
Job stepsDefine use case -> locate approved model / data / pattern -> prepare integration -> confirm risk tier and eval -> execute pilot -> monitor usage / cost / quality -> modify prompt / model / knowledge -> conclude scale / stop decision
Underserved outcomes降低团队找到批准模型、RAG pattern、eval template 和合规门禁的时间;提高复用控制和证据完整度;降低重复建设和 shadow AI
AI interventionPlatform catalog assistant、pattern recommendation、eval template generation、architecture checklist, policy-as-code guidance, evidence binder automation
Automation boundary平台不替业务 owner 接受风险, 不自动批准高风险 use case, 不允许绕过数据权限和模型评审, 不把通用 eval 当成业务验收
Product architectureDeveloper portal + model gateway + RAG service + prompt / policy registry + eval harness + tool gateway + observability + cost controls + evidence binder + ARB workflow
EvidenceUse case onboarding time、重复组件建设、risk review cycle time、eval adoption、production incident and drift data、platform NPS from internal teams
Selection decision当多个业务 use case 反复需要相同控制时, 平台化价值高;第一天不应建设全量平台, 应从 repeated control points 开始

Outcome examples:

Outcome statementEval / metric
Reduce the time required for a domain team to configure an approved model, retrieval source and eval gate for a new AI use case.Onboarding lead time, configuration defect rate, eval completion rate
Increase the likelihood that every production AI use case has traceable requirements, model version, prompt version, data source and release approval.Evidence completeness, audit sample pass rate, registry coverage
Reduce duplicate implementation of model access, retrieval, logging, eval and policy controls across teams.Reuse rate, redundant service count, platform adoption

10. Templates

10.1 AI JTBD Intake Canvas

FieldGuidance
Job executor谁真正执行 job, 谁受影响, 谁购买/批准, 谁运营/支持
Job不写产品或功能, 写要完成的稳定进展
Context什么时候、在哪个渠道、什么触发条件下必须完成
Current alternatives现在用系统、表格、人工、规则、外包、客户自助或 workaround 完成
Job stepsDefine / Locate / Prepare / Confirm / Execute / Monitor / Modify / Conclude
Desired outcomes每一步如何衡量更快、更准、更低风险、更少努力
Evidencebaseline、日志、QA、投诉、审计、SME calibration、财务影响
Underserved outcomes重要高、满足低、证据强的 outcome
AI interventionread、retrieve、summarize、draft、recommend、monitor、act-with-approval
No-AI alternative流程、规则、系统集成、数据治理、培训、模板是否更合适
Automation boundaryallowed / blocked decisions, actions, outputs, handoff
Architecture implicationdata、RAG、rules、tool、workflow、eval、monitoring、audit
Portfolio decisionscale candidate、controlled pilot、measurement sprint、redesign、stop

10.2 Job Map to AI Pattern Template

Job stepCurrent struggleOutcome statementEvidenceAI patternBoundaryArchitecture component
Define
Locate
Prepare
Confirm
Execute
Monitor
Modify
Conclude

使用规则:

  • 空白单元格在正式交付中必须填入具体内容或明确标记为 not applicable with rationale。
  • Outcome statement 不能写成“提升体验”或“更智能”, 必须有 metric object。
  • Boundary 必须包含 blocked output 或 blocked action。
  • Architecture component 必须能追溯到至少一个真实 outcome 或 control need。

10.3 Outcome Statement Template

Outcome ID:
Job:
Job step:
Job executor:
Outcome statement:
Direction:
Metric object:
Context:
Current baseline:
Target movement:
Risk / compliance constraints:
Evidence source:
Potential AI intervention:
No-AI alternative:
Eval method:
Monitoring signal:
Owner:

10.4 Underserved Outcome Scorecard

Outcome IDImportanceSatisfaction gapEvidence strengthStrategic / regulatory relevanceReuse potentialAutomation riskArchitecture gapAdoption frictionDecision
O-00154554322Controlled pilot
O-00242232443Measurement sprint

Decision vocabulary:

Decision含义
Controlled pilot有足够 evidence 和价值, 风险可通过边界和人审控制
Measurement sprint价值假设合理但证据不足, 先建立 baseline 和样本
Workflow / rules first问题主要来自流程、规则不清或系统断点, AI 不是第一解
Platform candidate多个 use case 共享模型接入、RAG、eval、tool gateway 或证据组件
Stop低重要、弱证据、高风险或已有满足度高

10.5 Automation Boundary Contract

Section内容
Use case name清晰命名, 不用“智能助手”泛称
Job and outcome绑定到 job、job step 和 underserved outcome
AI roleretrieve / summarize / draft / recommend / monitor / act-with-approval
Allowed datasource、owner、permission、freshness、retention
Blocked datasensitive attributes、unapproved notes、expired policy、unconsented data
Allowed outputs结构化摘要、引用、草稿、建议、风险提示、checklist
Blocked outputs正式信贷理由、个性化投资建议、法律结论、SAR decision、投诉拒绝、未经授权承诺
Allowed toolsread-only API、case draft、task creation、approved workflow transition
Blocked toolsapprove / decline、file / close、trade、fee waiver、customer lockout、delete evidence
Human reviewreviewer、approver、override、dual control、SLA
Escalationlow confidence、missing source、policy conflict、high-risk intent、customer harm、tool error
Eval gatedataset、rubric、threshold、critical failure
Monitoring gateonline signals、alert threshold、rollback、incident owner

10.6 Evidence-to-Architecture Traceability Matrix

OutcomeEvidenceRequirementArchitecture componentEval caseRelease gateMonitoring signal
降低 AML 证据查找时间Case telemetryAI 必须聚合交易、实体、历史 case 并显示来源Entity graph + RAG + case UIGolden alert setEvidence completeness >= thresholdMissing citation / analyst override
降低客服错误承诺QA defectsAI 草稿不得承诺未经授权的补偿Policy rules + approved templatesRed-team fee waiver setCritical violation = 0QA critical defect
提高申请完整性Funnel log提交前提示缺失材料Completeness checker + portal UIApplication test setMissing doc detection >= thresholdReturn-for-incomplete rate

10.7 One-Page Use Case Selection Memo

Use case:
Job executor:
Job-to-be-done:
Top underserved outcomes:
Evidence summary:
AI intervention:
Automation boundary:
No-AI alternative considered:
Architecture pattern:
Required controls:
Pilot design:
Eval gate:
Monitoring gate:
Risk owner:
Business owner:
Decision:
Rationale:

11. Review Checklist

11.1 JTBD / ODI quality

CheckPass signal
Job is stable写的是要完成的进展, 不是产品、渠道、模型或功能
Job executor is clear区分执行者、受影响客户、审批者、平台内客户和风险 owner
Job map is complete至少覆盖 define、locate、prepare、confirm、execute、monitor、modify、conclude
Outcomes are measurable每个 outcome 有 direction、metric object、context 和 constraint
Underserved is evidenced有 importance、satisfaction、baseline 或强专家证据
No-AI alternative considered明确为什么不是只做流程、规则、培训或系统集成

11.2 AI suitability

CheckPass signal
AI role is specificretrieve、summarize、draft、recommend、monitor、act-with-approval 中至少选定一种
AI does not own forbidden decisions高影响金融决策仍由正式系统、人或授权流程负责
Boundary is explicitblocked data、blocked output、blocked tool、escalation trigger 都写清
Human accountability worksreviewer 有证据、权限、时间、训练和覆盖路径
Architecture supports evidenceRAG、rules、tool、eval、observability、audit 能支撑 outcome 和 boundary
Eval matches outcome不只测模型准确率, 还测业务 outcome、风险 failure 和人审质量

11.3 Portfolio quality

CheckPass signal
Selection is comparable每个 use case 用同一评分框架和决策词汇
Scale rule existspilot 成功、受限扩展、停止、平台化的条件明确
Reuse is real平台候选来自多个 use case 的重复控制点, 不是抽象愿望
Risk is priced人审成本、合规成本、监控成本和 incident 成本进入价值判断
Evidence can be audited决策、数据、模型、prompt、eval、审批和变更可追溯

12. 反模式

Anti-pattern表现修正方式
AI feature hunting到处问“哪个流程能加 AI”从 job 和 outcome portfolio 开始
Persona-only selection用“客服小王很忙”决定用例改为 job executor + job step + measurable outcome
Pain point inflation把所有抱怨都当机会用 importance / satisfaction / evidence 过滤
Demo-driven roadmap供应商 demo 好看就排进 roadmap要求 evidence、boundary、architecture 和 eval gate
Chatbot as default UX所有场景都做聊天入口按 job step 选择 search、form assist、copilot、workflow automation 或 agent
Automation-first先追求无人化先定义 forbidden decisions 和 human accountability
Accuracy-only eval只看回答准确率增加 outcome、critical failure、citation、human override、customer harm
Weak HITL写“人工审核”但不给证据和时间设计 reviewer UI、action set、capacity、training 和 audit
No baseline没有当前耗时、缺陷、投诉或成本先做 measurement sprint
Platform theater没有重复 use case 就建平台从 repeated controls: model gateway、RAG、eval、tool gateway、evidence binder 开始
Risk transfer to disclaimer用免责声明替代控制把边界落实到权限、规则、UI、handoff 和 release gate
LLM reason pipeline用 LLM 生成正式拒贷原因建正式 reason service, LLM 只解释流程或辅助员工理解
Shadow AI normalization允许团队私接模型和数据建 AI registry、gateway、policy-as-code 和 exception process

13. 30 天训练计划

目标: 30 天内形成一套可展示的 AI JTBD / ODI portfolio pack, 聚焦金融零售高价值场景, 不做基础访谈训练。

Day训练主题产出
1选定 5 个金融零售 AI 候选场景: AML、客服、信贷申请、财富顾问、AI 平台内客户Use case inventory
2为每个场景定义 job executor、affected stakeholder、business owner、risk ownerStakeholder and job executor map
3把每个场景从 solution name 改写成 job-to-be-doneJob definition sheet
4为 AML 场景画 define / locate / prepare / confirm / execute / monitor / modify / conclude job mapAML job map
5为客服场景画 job map, 区分客户可见和 agent assistCustomer service job map
6为信贷申请人场景画 job map, 标出正式决策和原因链路边界Credit applicant job map
7为财富顾问和 AI 平台内客户画 job mapAdvisor and platform job maps
8为每个 job step 写 3-5 条 outcome statementOutcome catalog v1
9清理弱 outcome, 替换“提升体验、提升效率、更准确”等模糊表达Outcome quality review
10收集每个 outcome 的 evidence: 日志、QA、投诉、审计、专家校准、成本、SLAEvidence inventory
11为 outcome 打 importance / satisfaction / evidence strength 分ODI scorecard v1
12加入 automation risk、architecture gap、adoption frictionRisk-adjusted scorecard
13选出每个场景前 2 个 underserved outcomesUnderserved outcome map
14为低证据但高潜力 outcome 设计 measurement sprintMeasurement plan
15将 underserved outcome 转成 AI intervention pointAI intervention map
16为每个 intervention 判断 no-AI alternativeNo-AI alternative memo
17为 AML 和客服写 automation boundary contractBoundary contracts v1
18为信贷、财富和平台场景写 automation boundary contractBoundary contracts v2
19标出 blocked data、blocked output、blocked tool 和 escalation triggerBoundary redline matrix
20把 outcome evidence 映射到架构组件Evidence-to-architecture matrix
21为每个场景画 C4 context / container sketch 或 sequence sketchArchitecture sketches
22设计 eval cases: golden set、rubric、threshold、critical failureEval case inventory
23设计 release gate: go / limited go / no-go / rollbackRelease gate memo
24设计 monitoring gate: quality、risk、cost、latency、override、incidentMonitoring gate spec
25写 AML use case selection memoAML selection memo
26写客服和信贷 use case selection memoCustomer service and credit memos
27写财富顾问和平台内客户 selection memoAdvisor and platform memos
28做 portfolio heatmap, 选择 scale、pilot、measurement、workflow-first、stopPortfolio decision map
29准备 8 个高级面试答案, 每个用 30 秒和 2 分钟版本Interview answer pack
30整理作品集: canvas、scorecard、boundary、architecture、eval、memo、executive narrativeAI JTBD / ODI portfolio pack

每周检查:

Week成功标准
Week 15 个场景都能从 solution name 改写成 job map
Week 2每个场景至少有 8 条高质量 outcome statement 和证据来源
Week 3每个候选 use case 都有 AI intervention、no-AI alternative 和 automation boundary
Week 4能用 portfolio heatmap 和 selection memo 向管理层解释为什么做、怎么做、做到哪里停

14. 面试答案

14.1 你如何选择一个 AI use case 是否值得做

30 秒版本:

我不会从模型能力或 demo 开始。我会先定义 job executor 和 job-to-be-done, 画 job map, 写 desired outcome statement, 用 importance、satisfaction 和证据找 underserved outcomes。然后判断 AI 适合介入哪一步, 定义 automation boundary, 映射到数据、RAG、rules、tool、eval、monitoring 和 audit 架构。最后用 risk-adjusted scorecard 决定 scale、pilot、measurement sprint、workflow-first 或 stop。

2 分钟版本:

Step说明
Job明确谁要完成什么稳定进展, 避免 solution-first
Outcome把痛点写成可度量 desired outcome, 如降低证据查找时间、提高引用准确性、降低错误承诺
Opportunity用 importance / satisfaction / evidence strength 识别 underserved outcome
Boundary判断 AI 是检索、总结、草拟、建议还是执行, 明确 blocked decisions 和 human accountability
Architecture反推 source-of-truth、RAG、rules、workflow、tool gateway、eval、monitoring、audit
Decision用价值、风险、复用、架构可行性和 adoption 共同决定投资方式

14.2 JTBD 和传统用户旅程有什么区别

30 秒版本:

用户旅程描述用户今天怎么走流程, JTBD 描述用户真正要完成的稳定 job 和每一步想达到的 outcome。AI 项目里这个区别很重要, 因为如果只优化现有旅程, 很容易给旧流程加 chatbot;如果从 job 和 outcome 出发, 我能判断是 RAG、copilot、规则、流程重构还是不该用 AI。

高级追问:

追问回答
旅程图还要不要要, 但旅程图服务于 workflow integration;JTBD 决定机会是否真实
persona 还要不要要, 但 persona 不能替代 job executor、decision rights 和 risk owner
job map 是不是流程图不是。流程图描述组织今天怎么处理, job map 描述理想完成 job 的步骤

14.3 什么是好的 outcome statement

30 秒版本:

好的 outcome statement 必须包含方向、度量对象、job step、情境和约束。例如“降低 AML analyst 查找交易证据和关联实体的时间, 同时不降低证据完整性”。它比“提升效率”强, 因为它能直接连接 baseline、eval、架构和上线门禁。

2 分钟版本:

要素例子
Directionreduce、increase、minimize、improve
Metric objecttime、effort、likelihood、completeness、risk、variance
Job steplocate evidence、confirm policy、prepare application
Contextinvestigating an alert、answering fee dispute、submitting application
Constraintwithout unauthorized commitment、without missing critical evidence

14.4 如何判断 underserved outcome

30 秒版本:

我看三个核心信号: 重要性高、当前满足度低、证据足够。AI 场景还要加风险和架构可行性。比如客服“错误承诺补偿”虽然频率可能不高, 但客户和合规影响高, 如果 QA 和投诉证据显示真实发生, 就是高优先级 outcome;相反, “回复语气更自然”如果没有客户结果证据, 不应排在前面。

14.5 金融零售 AI 的 automation boundary 怎么定

30 秒版本:

我先把 AI role 分成 retrieve、summarize、draft、recommend、act-with-approval 和 autonomous act。然后对每个 use case 写 blocked data、blocked output、blocked tool 和 escalation trigger。高影响金融决策, 比如拒贷、投资建议、SAR 判断、投诉关闭和 fee waiver, 不让 LLM 单独自动化, 只让 AI 提供证据、草稿、检查和建议, 最终责任在授权人或正式系统。

14.6 如何把 JTBD / ODI 转成产品架构

30 秒版本:

每个 underserved outcome 都会产生架构要求。证据找不到来源, 就需要 source registry、authorization-aware RAG 和 citation validator;错误承诺多, 就需要 policy rules 和 approved templates;工具动作有客户影响, 就需要 tool gateway、approval、rollback 和 audit log。架构不是先画系统图, 而是从 outcome 和 control 反推组件。

14.7 为什么不直接用 LLM 做信贷决策

30 秒版本:

信贷决策影响客户权益, 涉及公平信贷、可解释理由和正式通知。复杂模型也不能免除提供具体、准确原因的义务。LLM 可以帮助申请人理解材料要求、帮助 loan officer 起草 memo、检查资料完整性, 但正式 approve / decline 和 adverse action reason 应来自受控的决策、规则和 reason pipeline。

14.8 什么时候把 AI use case 平台化

30 秒版本:

我不会第一个 demo 就建大平台。平台化要来自重复证据: 多个 use case 都需要模型接入、RAG、权限、eval、tool gateway、observability、cost control 和 evidence binder。只有当这些 control points 重复出现, 并且平台能降低交付时间和风险 review 成本时, 才进入 platform roadmap。


15. 作品集交付物

一套完整的 AI JTBD / ODI portfolio pack 应包含:

Deliverable内容面试价值
Source Anchor MemoHBR JTBD、Strategyn ODI、NIST AI RMF、金融监管锚点如何转成方法证明不是凭感觉设计 AI
AI JTBD Canvasjob executor、job、context、alternatives、job steps、outcomes、evidence证明能从业务问题而不是模型能力出发
Job Map PackAML、客服、信贷申请人、财富顾问、AI 平台内客户五张 job map证明具备金融零售场景深度
Outcome Catalog每个场景 8-15 条 desired outcome statement证明能写可度量需求
Underserved Outcome Scorecardimportance、satisfaction、evidence、risk、architecture、adoption 评分证明能做投资排序
Automation Boundary Contractallowed / blocked data、outputs、tools、decisions、handoff证明知道 AI 风险和控制边界
Evidence-to-Architecture Matrixoutcome evidence 到架构组件、eval、gate、monitoring 的追踪证明能连接 BA、PM 和架构
Use Case Selection Memo每个场景一页 scale / pilot / measurement / stop 决策证明能向管理层表达取舍
Eval and Release Gate Packgolden set、rubric、threshold、critical failure、go / no-go证明能把需求转成上线门禁
Portfolio Heatmap价值、风险、证据、复用、可行性矩阵证明能管理 AI portfolio, 不是单点项目
Executive Narrative为什么选这些 use cases, 为什么现在做, 为什么这个边界证明能进入 AI PM / Architect / Transformation Lead 级别讨论

作品集叙事:

I selected AI use cases through a JTBD / ODI lens:
first defining stable jobs and desired outcomes,
then identifying underserved outcomes with evidence,
then deciding the right AI intervention and automation boundary,
then translating the decision into architecture, eval, release gates and monitoring.
This prevents demo-driven AI adoption and creates a portfolio that is valuable, controlled, reusable and auditable.

16. 最终判断准则

一个 AI use case 值得进入 pilot, 至少应满足:

  1. Job 清楚: 不是功能名, 是稳定 job。
  2. Outcome 清楚: 不是“更智能”, 是可度量 desired outcome。
  3. Underserved 清楚: 重要、满足低、有证据。
  4. AI role 清楚: 介入哪个 job step, 做 retrieve、summarize、draft、recommend、monitor 或 act-with-approval。
  5. Boundary 清楚: blocked decision、blocked output、blocked tool 和 handoff 写明。
  6. Architecture 清楚: 数据、知识、规则、工具、workflow、eval、monitoring、audit 能支撑。
  7. Gate 清楚: pilot、scale、stop 和 rollback 条件可执行。
  8. Owner 清楚: business、risk、data、platform、model、operations 和 audit responsibility 不混淆。

一句话收束:

好的 AI use case selection 不是证明 AI 能做什么, 而是证明某个重要 job 的 underserved outcome 值得被改善, AI 是合适但受控的介入方式, 并且组织有架构、证据和责任体系把它安全地交付和运营。