AI 扩展计划 / Playbooks

AI Knowledge Work Redesign / Role-Task Architecture Playbook

以下来源作为本文的设计锚点。本文把它们转成知识工作、角色任务架构、产品控制、流程治理和运行指标语言。访问日期按 2026-06-29 记录。

722 行AI_KNOWLEDGE_WORK_REDESIGN_ROLE_TASK_ARCHITECTURE_PLAYBOOK.md

AI Knowledge Work Redesign / Role-Task Architecture Playbook

定位: 面向 AI BA / AI PM / Product Architect / Enterprise Architect / Operations Transformation Lead 的知识工作重设计手册。

核心问题: AI 时代的金融零售知识工作, 不能只靠给员工发一个 Copilot。真正要设计的是 role、job、task、decision、artifact、handoff、control、metric 之间的新架构。

适用范围: AML / KYC / credit underwriting / collections / complaints / customer service / branch operations / payment disputes / retail banking operations / insurance and wealth servicing / retail merchandising and supply chain operations。

读者假设: 读者已经具备 CBAP 级业务分析能力。本文不讲基础组织变革、普通培训沟通或通用流程优化, 重点放在 AI-era knowledge work redesign、角色任务边界、责任设计、产品和架构控制、金融零售落地证据。

重要说明: 本文是学习、作品集和内部方案训练材料, 不构成法律意见、合规结论、审计意见、模型验证报告或监管解释。正式项目必须由 Legal、Compliance、Risk、Model Risk、Privacy、Security、Data Owner、Business Owner、Operations Owner、HR、Works Council / Employee Relations 和管理层结合机构类型、司法辖区、员工政策、客户影响和内部控制框架确认。

1. Source Anchors

以下来源作为本文的设计锚点。本文把它们转成知识工作、角色任务架构、产品控制、流程治理和运行指标语言。访问日期按 2026-06-29 记录。

Anchor	Official source	本文使用方式
WEF Jobs of Tomorrow: Large Language Models and Jobs	https://www.weforum.org/publications/jobs-of-tomorrow-large-language-models-and-jobs/	用 job exposure / task exposure 视角提醒: AI 冲击的是任务组合, 不是简单替换职位名称。角色重设计必须先做 task decomposition。
Microsoft Work Trend Index	https://www.microsoft.com/en-us/worklab/work-trend-index/	用 workplace research 视角提醒: AI adoption 的瓶颈不是工具可用性, 而是工作结构、管理节奏、员工负荷、协作方式和组织能否围绕 AI 重构工作。
NIST AI Risk Management Framework	https://www.nist.gov/itl/ai-risk-management-framework	用 Govern / Map / Measure / Manage 组织 role-task architecture 的风险识别、控制设计、指标验证、上线门禁和持续改进。
Microsoft Guidelines for Human-AI Interaction	https://www.microsoft.com/en-us/research/publication/guidelines-for-human-ai-interaction/	用 Human-AI Interaction 原则设计预期、证据、不确定性、纠错、反馈、控制、恢复和长期适应, 避免员工对 AI 过度信任或低效抵触。

Source-to-artifact mapping:

Source lens	直接产出的 artifact	高级表达
WEF task exposure	Role-task inventory、AI exposure heatmap、skill shift map	“我会先拆任务和判断点, 再决定岗位如何演进, 而不是讨论哪些职位会被 AI 取代。”
Work Trend Index	Adoption operating rhythm、manager dashboard、employee load metric、frontier workflow experiment	“AI 转型的核心是重新组织工作, 不是把个人生产力工具分发给员工。”
NIST AI RMF	Use case risk tier、control matrix、release gate、monitoring dashboard、evidence binder	“我会把任务分配和责任设计纳入 AI 风险管理闭环。”
Microsoft HAI Guidelines	Human-AI responsibility matrix、review UI requirement、feedback taxonomy、override workflow	“我会设计人如何理解、质疑、纠正和覆盖 AI, 而不是只设计 AI 输出内容。”

2. One-Sentence Positioning

AI Knowledge Work Redesign =
用 task decomposition 拆开知识工作,
用 role-task architecture 重新分配人、AI、规则、系统和控制点的职责,
用 accountability、metrics、training、workflow 和 architecture controls
证明工作更快、更准、更可控、员工负荷更健康、客户和监管风险更低。

中文记忆:

AI 时代的工作重设计, 不是“员工 + Copilot”, 而是“任务、判断、证据、责任、交接、控制、指标”的重新架构。

在金融零售里, 这件事尤其重要:

场景	不能只看效率的原因	需要一起设计的边界
AML alert review	错误关闭 alert 可能形成金融犯罪风险和监管暴露	investigator 判断权、AI narrative 草稿、证据引用、supervisor review、SAR escalation
Credit underwriting	AI 建议可能影响客户授信、拒绝理由、公平信贷和模型风险	underwriter accountability、policy reason code、override、adverse action evidence
Customer service	AI 回复可能成为客户理解中的正式承诺	agent assist 边界、客户可见文案、投诉升级、知识库来源、质量抽检
Payment disputes	自动化加速不等于争议处理正确	scheme rules、case evidence、merchant response、temporary credit approval、reversal path
Collections	话术、时机和客户脆弱性有合规和声誉风险	contact strategy、hardship routing、human empathy、script control、call audit
Retail operations	AI 推荐补货或促销会影响库存、门店执行和利润	buyer / planner 决策权、forecast confidence、exception reason、execution feedback

3. 为什么 AI 转型不是“给员工一个 Copilot”

给员工一个 Copilot 只能改变个人界面, 不能自动改变工作系统。金融零售的知识工作有强约束: 客户影响、监管证据、审批授权、系统记录、员工能力、队列管理、质量抽检、风险升级和审计复盘。AI 工具如果不进入这些工作边界, 只能制造局部效率, 甚至把错误规模化。

3.1 Copilot-only 反模式

反模式	表面收益	深层问题
Personal productivity only	员工写得更快、查得更快	没有改变流程瓶颈、审批队列、证据字段和下游系统动作
Shadow workflow	员工把 AI 输出复制到正式系统	正式系统看不到 AI 来源、版本、证据、人工修改和责任链
Rubber-stamp review	AI 生成结论, 人点击确认	管理指标逼迫员工默认采纳, review 变成行政动作
Task automation without decision design	AI 自动分类、摘要、路由	分类规则、误判成本、人工覆盖和客户影响没有显式设计
Skill erosion	员工越来越依赖 AI 起草和判断	新人无法形成业务判断, 资深员工变成低价值复核员
Metric illusion	AHT、处理量、文档产出提升	质量、风险、返工、客户伤害、员工负荷和长期学习没有被测量
Control theater	文档写着“人负责最终决定”	系统没有 evidence view、override、escalation、audit log 和 stop control

3.2 真正要重设计的不是工具, 而是工作架构

成熟设计要同时回答 12 个问题:

问题	应落到的 artifact
哪个 role 对业务结果负责	Role accountability map
哪个 job 被拆成哪些任务	Job-task decomposition
哪些 task 适合 AI 辅助、建议、决策或自动化	Task allocation matrix
哪些 decision 必须由人作出	Decision accountability matrix
AI 生成的 artifact 是否可进入正式记录	Artifact lifecycle map
人和 AI 之间如何 handoff	Handoff contract
质量和风险控制点在哪里强制执行	Control matrix
谁能 override AI 或人工决定	Override authority table
证据由谁保存、保存什么、保存多久	Evidence retention schema
员工需要什么新技能	Skill shift map
管理层如何看采用和负荷	Adoption and workload dashboard
架构如何防止越权、泄露、幻觉和不可复盘	Architecture control plane

3.3 设计原则

原则	含义	金融零售落地
Task before title	先拆任务, 再谈岗位变化	AML analyst 不是被替换, 而是从资料整理转向风险判断、例外处理和监管叙事质量
Decision before automation	先定义决策权和证据, 再定义自动化	Credit assistant 可以建议 reason code, 但拒贷决定和解释责任仍需授权人员承担
Control before scale	先证明控制有效, 再扩大用户	Customer service AI 先通过知识来源、合规话术、升级路径和抽检指标, 再开放更多场景
Metrics before claims	先建立 baseline, 再声称效率收益	不只看处理时长, 还看 first pass quality、override quality、rework、complaint trend
Learning before deskilling	设计技能升级路径, 防止判断能力退化	新员工用 AI 解释案例, 但必须完成无 AI case drill 和校准评审

4. Role-Task Architecture

Role-task architecture 是把知识工作拆成可分配、可控制、可衡量、可训练、可审计的结构。它连接八个对象:

Role -> Job -> Task -> Decision -> Artifact -> Handoff -> Control -> Metric

4.1 八个核心对象

对象	定义	设计问题	金融零售示例
Role	业务责任和权限的承载者	谁对结果负责, 谁有授权, 谁接受风险	AML investigator、credit underwriter、contact center agent、branch manager
Job	一个 role 在组织中的岗位组合	这个岗位由哪些任务、目标和约束组成	Credit underwriter job 包含资料审查、政策判断、额度建议、拒绝理由、case note
Task	可被分配和评估的工作单元	任务输入、输出、判断难度、风险和可自动化程度是什么	摘要客户收入证明、识别缺失材料、草拟 underwriting memo
Decision	对业务结果有影响的判断点	决策依据、授权、解释、复核和申诉路径是什么	是否升级 EDD、是否批准授信、是否发起 dispute temporary credit
Artifact	工作产生或消费的证据对象	输出能否进入正式记录, 版本和来源如何追踪	AML case narrative、credit memo、customer reply、QA checklist
Handoff	人、AI、系统、团队之间的交接	交接时传递什么上下文, 谁接收, SLA 是什么	Chatbot 转人工时传递身份验证状态、客户诉求、AI 回复、来源
Control	防止错误、越权、泄露和不可复盘的机制	控制点在流程中是否强制执行	tool gateway、maker-checker、policy engine、override reason、audit log
Metric	衡量价值、质量、风险、学习和负荷的指标	指标是否会鼓励错误行为	AHT + QA pass + escalation quality + employee cognitive load

4.2 Role、job、task 的区别

层级	错误问法	正确问法
Role	“AI 会不会替代 AML analyst?”	“AML analyst 的哪些任务会被增强, 哪些判断权保留, 哪些新责任出现?”
Job	“这个岗位还能剩多少工作量?”	“岗位目标、任务组合、绩效指标、技能要求和责任边界如何变化?”
Task	“这个任务能不能自动化?”	“这个任务的输入证据、输出形式、误判成本、可逆性和复核要求是什么?”

4.3 Task decomposition 层级

层级	示例: AML alert review	AI redesign 关注点
Value stream	从 alert 触发到关闭、升级或 SAR 叙事	控制金融犯罪风险, 降低低价值整理工作, 提高 narrative 质量
Process	Alert intake、case enrichment、investigation、decision、supervisor review、filing	AI 插入点、人工节点、队列、SLA、例外路径
Activity	收集客户资料、分析交易模式、检查名单、写 case note	哪些是 evidence gathering, 哪些是 judgment
Task	汇总 90 天交易、识别 round-dollar pattern、草拟 narrative	AI extraction / summary / recommendation / drafting
Micro-task	提取交易对手、按金额聚合、标记异常描述	可自动化但必须记录来源和转换逻辑
Decision	close、continue monitoring、escalate、SAR consideration	人类责任、复核、原因码、证据包
Artifact	risk summary、case note、narrative、supervisor packet	版本、引用、人工修改、审批记录

4.4 Work architecture 的四条流

Flow	说明	典型控制
Work flow	工作如何从入口到结果流动	BPMN、queue management、SLA、exception routing
Decision flow	判断如何被作出、复核和覆盖	DMN、policy engine、authority matrix、reason code
Evidence flow	事实、来源、模型输出和人工修改如何被保存	evidence store、trace id、source metadata、audit trail
Accountability flow	谁负责结果、风险接受、复核和持续改进	RACI、maker-checker、business owner sign-off、control owner

4.5 Product and architecture controls

Role-task architecture 必须落到产品和架构控制, 否则只是流程图。

User task UI
-> role and entitlement check
-> task context package
-> policy / DMN decision service
-> AI service with retrieval and tool boundaries
-> evidence and uncertainty display
-> human action controls
-> workflow state transition
-> audit event
-> quality, risk and workload monitoring

Control component	责任	典型 evidence
Identity and entitlement	确认员工角色、权限、客户关系、数据用途	actor id、role、resource、purpose、access decision
Task context package	把 case facts、policy、history、constraints 打包给 AI 和人	context version、source list、data freshness
Policy / DMN service	把可解释规则和 reason code 从 prompt 中分离	decision id、rule id、version、input facts hash
AI orchestration layer	限定模型、检索、工具、输出格式和失败路径	model id、prompt version、tool call log、guardrail result
Evidence view	让人看到 AI 输出背后的来源、缺口和冲突	citation、source owner、effective date、missing evidence flag
Human control gateway	管理 accept、edit、reject、override、escalate、rollback	action log、diff、reason code、reviewer id
Workflow engine	强制状态转换、审批、SLA 和交接	case state、transition rule、queue event
Observability and audit	监控质量、风险、采用、成本和员工负荷	dashboard、sample review、incident record、audit replay

5. Task Allocation

任务分配不是“AI 能做就让 AI 做”。要按风险、可逆性、数据质量、政策清晰度、时间压力、客户影响、员工能力和控制成本来决定。

5.1 六类 task allocation

Allocation pattern	定义	适合任务	不适合任务	控制要求
Human-only	人必须完成, AI 不能生成实质性判断或行动	高同理心沟通、重大例外、风险接受、监管签字、纪律处分、复杂伦理判断	高频低风险资料整理	强授权、记录理由、必要时双人复核
AI-assisted	AI 帮人检索、摘要、提取、改写, 人作判断	case summary、政策查找、证据抽取、文案润色	人没有时间或能力检查的高风险结论	来源展示、编辑差异、人工确认
AI-recommended	AI 给出建议、排序、下一步, 人选择是否采纳	alert prioritization、missing document suggestion、customer intent routing	直接影响客户权益且缺少解释的决策	解释、替代方案、override reason、采纳率监控
AI-decided with human review	AI 生成初步决定, 人必须复核后生效	低到中风险分类、规则清晰的初审、批量 QA 初筛	高影响不可逆动作、歧视敏感决策、正式拒绝原因	pre-effect review、sample calibration、false accept / false reject 监控
AI-automated	AI 在授权范围内自动完成任务或系统动作	低风险、可逆、规则清晰、高频的状态更新、任务创建、资料分类	冻结账户、拒贷、关闭投诉、发正式监管材料	tool allowlist、limit、rollback、exception queue、audit trace
AI-monitored	AI 不直接处理任务, 只监控异常、趋势、负荷和控制偏离	SLA breach prediction、quality drift、queue anomaly、employee overload signal	需要正式个案判断的结论	alert owner、threshold governance、no punitive use without review

5.2 Allocation decision criteria

Criterion	Human-only 倾向	AI-assisted 倾向	AI-automated 倾向
Customer impact	高, 影响授信、账户、资金、投诉、法律权利	中, 帮助员工解释或准备材料	低, 不直接改变客户权益
Reversibility	不可逆或补救成本高	可人工修正	可自动回滚或影响很小
Evidence quality	证据冲突、缺失、上下文强	有权威来源但需要判断	来源结构化、稳定、规则明确
Policy clarity	需要专家解释或例外处理	规则清楚但有少量灰区	规则完全可编码、版本稳定
Bias / fairness sensitivity	高	中	低
Time criticality	可等专家判断	需要加速但可排队	必须实时且低风险
Human skill value	需要专业判断、同理心、谈判、风险接受	人负责最终判断, AI 减少整理负担	人的参与主要增加延迟, 不增加判断质量
Audit expectation	需要完整审批和解释	需要来源、修改和确认记录	需要自动日志和抽样复核

5.3 金融零售任务分配示例

Use case	Task	Allocation	原因	控制
AML	汇总客户 90 天交易和对手方	AI-assisted	资料多、规则明确, 但可能遗漏异常	source-linked summary、investigator confirmation
AML	判断是否提交 SAR consideration	Human-only	高监管影响, 需要专业判断和责任承担	senior review、decision memo、evidence binder
Credit	提取收入证明字段	AI-assisted	文档处理适合 AI, 但字段需验证	OCR confidence、source highlight、manual correction
Credit	额度审批和拒绝理由确认	Human-only 或 AI-recommended	影响客户权益和公平信贷	reason code validation、underwriter accountability
Customer service	根据政策草拟费用解释	AI-assisted	员工可编辑, 来源可展示	approved knowledge source、agent edit log
Customer service	自动回答普通营业时间、状态查询	AI-automated	低风险、可逆、来源稳定	intent confidence、fallback、interaction log
Payment disputes	路由到 correct dispute type	AI-decided with human review	分类可加速, 误分有成本	review queue、scheme rule source、override metric
Retail planning	异常缺货预警	AI-monitored	AI 发现模式, 采购经理决定行动	threshold owner、exception annotation、buyer decision log

5.4 Allocation anti-patterns

Anti-pattern	问题	修正方式
Automate what is broken	把混乱流程自动化	先做 process mining / workflow baseline 和 decision inventory
Human review as magic	所有风险都写“人工审核”	明确 reviewer 能看到什么、多久看完、如何覆盖、如何追责
AI for everything low-value	员工变成 AI 输出搬运工	保留高价值学习任务, 把员工转向 judgment、exception、quality improvement
No task retirement	AI 新增任务, 旧任务不减少	明确哪些手工整理、重复查询、重复录入被移除
Manager metrics unchanged	仍按旧 AHT / volume 管理	改为 balanced metrics, 防止员工为速度牺牲质量和复核

6. Accountability Design

AI 可以执行任务, 但不能承担组织责任。Accountability design 的目标是让每个结果都能回答:

谁对业务结果负责?
谁负责复核?
谁能 override?
谁保留证据?
谁接受残余风险?
谁在失效时停机、补救和改进?

6.1 Accountability principles

原则	说明	失败表现
Accountability stays human and organizational	责任落在授权角色和组织治理上	“AI 建议错了”被当成事故根因终点
Review authority must match risk	复核人必须具备能力、权限和时间	junior agent 被要求批准复杂合规结论
Override must be legitimate	人可以覆盖 AI, 也可以覆盖低层级人工决定, 但要有理由和审计	override 被用来绕过控制或追求 KPI
Evidence must be durable	证据能复盘当时谁看见什么、AI 输出什么、人改了什么	只保存最终文本, 无来源和修改记录
Risk acceptance is explicit	上线和例外风险由授权 owner 接受	产品团队默认把风险推给一线员工

6.2 Accountability matrix

Decision / action	Accountable owner	Reviewer	Override authority	Evidence owner	Evidence retained
AML alert close	AML operations manager	AML investigator / QA sample reviewer	AML supervisor	Compliance operations	alert evidence、AI summary、investigator rationale、override reason、case status
SAR escalation consideration	AML compliance officer	Senior investigator	MLRO / BSA officer equivalent by jurisdiction	Financial crime compliance	narrative draft、supporting transactions、policy basis、approval record
Credit approval recommendation	Credit policy owner / lending business owner	Underwriter	Credit committee or delegated authority	Credit operations	application facts、policy rule id、model score, AI memo、human decision、reason code
Adverse action explanation	Lending compliance owner	Underwriter / compliance review sample	Compliance lead	Credit compliance	final reason codes、AI suggestions、human edits、customer communication
Customer service response	Service operations owner	Frontline agent / QA reviewer	Team lead / complaint specialist	Contact center operations	transcript、knowledge sources、agent edits、customer visible message
Payment dispute temporary credit	Dispute operations owner	Dispute analyst	Supervisor by amount and rule	Payment operations	scheme rule source、case evidence、approval, reversal path

6.3 Who is responsible for the result

Result type	Responsible party	Design implication
Customer-visible communication	Business process owner and acting employee	AI draft must be editable, source-linked, reviewed by risk tier
Formal decision	Delegated decision owner	AI can support or recommend, but decision log must name human authority or approved automated policy
Automated low-risk action	Product / operations owner plus control owner	Automation boundary, rollback, monitoring and audit evidence must be approved before scale
Model or AI behavior	AI product owner / platform owner / model owner	Eval, drift, prompt / model change, incident response and vendor change control are required
Workflow redesign outcome	Transformation lead and business owner	Baseline, adoption, quality, risk and workload metrics must be managed together

6.4 Who reviews

Review design must specify coverage, sampling, skill, SLA and calibration.

Review pattern	用法	控制
100% pre-effect review	高影响、不可逆、客户权益相关输出	reviewer checklist、evidence view、approval log
Risk-tiered review	中等风险高量任务	risk score threshold、queue prioritization、sample expansion trigger
Post-action QA sample	低风险、可逆、海量场景	statistically meaningful sample、defect taxonomy、feedback loop
Peer calibration	专业判断一致性训练	blind review、disagreement analysis、rubric refinement
Supervisor exception review	例外、高金额、政策冲突、客户投诉	escalation packet、decision SLA、override log

6.5 Who can override

Override 不是绕过系统, 而是授权控制。

Override target	谁能覆盖	必填证据	系统动作
AI recommendation	Acting employee within delegation	reason code、free-text rationale for high risk	log diff、track adoption and disagreement
AI automated action	Supervisor or control owner by threshold	customer impact、rollback feasibility、root cause	pause automation if pattern repeats
Human frontline decision	Supervisor / specialist	policy basis、case evidence、customer impact	reopen case、notify downstream if needed
Policy rule output	Policy owner / risk owner	rule exception、approval scope、expiry date	exception registry、sample review
Production AI capability	Business owner / platform owner / risk owner	incident or threshold breach	feature flag off、model route freeze、fallback workflow

6.6 Who retains evidence

Evidence retention is a product requirement, not an afterthought.

Evidence object	Why it matters	Retention owner	Architecture requirement
Input facts	证明 AI 和人依据什么做判断	Data owner / system owner	immutable snapshot or reproducible reference
Retrieved sources	证明引用政策和知识是否有效	Knowledge owner	source id、effective date、version
AI output	证明模型给出什么建议	AI platform owner	model id、prompt version、output hash
Human edit / decision	证明人如何使用或覆盖 AI	Operations owner	diff、reason code、reviewer id
Tool action	证明系统状态如何改变	Application owner	actor、resource、action、before / after state
Monitoring signal	证明控制是否持续运行	Risk / platform / ops	metric snapshot、threshold、alert disposition

7. Metrics

AI 知识工作重设计不能只用 productivity 指标。金融零售至少要平衡 productivity、quality、risk、learning、adoption 和 employee load。

7.1 Balanced metric model

Metric family	问题	示例指标	反向信号
Productivity	是否更快、更少返工、更高吞吐	cycle time、touch time、cases per FTE、first contact resolution、automation yield	速度提升但 QA defect、complaint、override 同时上升
Quality	输出是否更准确、更完整、更一致	first pass quality、source-supported claim rate、case note completeness、calibration agreement	AI 文本更长但关键证据缺失
Risk	是否降低或至少不增加客户、合规、模型和运营风险	false close、false approve、under-escalation、policy violation、incident count、audit replay success	低风险队列处理快, 高风险积压
Learning	员工和系统是否在变聪明	calibration improvement、challenge case pass rate、feedback-to-fix cycle、new skill attainment	员工无法解释 AI 错误原因
Adoption	员工是否在正确任务中正确使用	active qualified users、task-level adoption、accepted with edit、rejected with reason、fallback use	高 adoption 但低 edit / challenge, 可能是过度信任
Employee load	员工负荷是否健康	queue load、interruptions、review time sufficiency、after-hours work、cognitive load survey、burnout risk signal	AI 增加监控和复核负担, 旧工作没有减少

7.2 Productivity metrics

Metric	Good use	Risk
Cycle time	衡量端到端客户等待时间	如果只优化单点, 可能把瓶颈转移到复核队列
Touch time	衡量人工实际处理时间	不能替代质量和客户影响
First pass yield	衡量一次通过率和返工减少	需要清楚 defect taxonomy
Automation yield	衡量 AI 自动处理的有效比例	不能把高风险自动化率当成目标
Queue age	衡量积压和 SLA 风险	需要按 risk tier 分层

7.3 Quality metrics

Metric	Definition	金融零售用法
Source-supported claim rate	输出中关键陈述有可追溯来源的比例	客服、信贷、AML narrative 必须减少无来源断言
Human edit distance	人对 AI 草稿的修改程度	高修改可能代表 AI 质量低, 低修改也可能代表 rubber stamp
Calibration agreement	多个 reviewer 对同一 case 的一致性	underwriter、AML investigator、QA reviewer 训练
Critical omission rate	AI 摘要遗漏关键 red flag 的比例	AML、投诉、困难援助、欺诈场景
Formal defect rate	正式记录中被 QA / audit 发现的缺陷	case note、reason code、customer letter

7.4 Risk metrics

Metric	Definition	Escalation trigger
False close / false approve	被错误关闭或错误批准的比例	高风险队列出现连续缺陷
Under-escalation rate	应升级但未升级的 case 比例	投诉、欺诈、AML、困难援助显著上升
Override concentration	某团队或个人 override 异常集中	可能存在培训、激励或控制绕过问题
Policy violation rate	输出违反政策或合规话术	触发知识源和 prompt / policy review
Audit replay success	审计能否复现当时依据和决策	低于门槛时暂停 scale
Incident escape rate	问题被客户、监管或下游团队先发现	说明内部 QA / monitoring 失效

7.5 Learning metrics

Metric	Why it matters
Challenge case pass rate	证明员工能识别 AI 错误、缺证据、过度自信和越权建议
Feedback-to-fix lead time	衡量一线反馈进入知识、流程、模型或政策修复的速度
Skill mix shift	衡量员工从资料整理转向判断、例外处理、客户沟通和质量改进
Reviewer calibration drift	发现复核标准随着 AI 输出风格变化而漂移
New hire ramp quality	防止新人只会操作 AI, 不懂业务原则

7.6 Adoption metrics

Metric	Good interpretation	Warning
Task-level adoption	哪些任务中 AI 被使用	只看登录人数没有意义
Accepted with no edit	AI 输出可直接使用的比例	在高风险场景过高可能是默认采纳
Accepted with edit	员工有效利用 AI 并加入判断	需要分析修改类型
Rejected with reason	员工识别 AI 不适用	过低代表不敢拒绝或 UI 不支持
Fallback / escalation use	员工知道何时退出 AI	过低可能代表隐藏风险, 过高可能代表 AI 适用范围过宽

7.7 Employee load metrics

Metric	设计意图
Review time per case	复核时间是否足够支持真实判断
AI interruption count	AI 建议是否增加干扰和上下文切换
Queue pressure by risk tier	高风险任务是否被低风险高量任务挤压
After-hours correction work	员工是否把 AI 返工带回非工作时间
Cognitive load pulse	员工是否理解新责任、证据和控制界面
Manager coaching time	主管是否有时间做校准和反馈, 而不是只追产能

7.8 Metrics anti-patterns

Anti-pattern	风险
只看节省小时数	会忽略复核、返工、质量、风险和员工负荷
只看 adoption	高使用率可能来自管理压力, 不代表价值
只看模型准确率	模型指标不能替代流程结果和客户结果
只看平均值	高风险尾部和少数客户伤害会被掩盖
用旧 KPI 管新工作	员工会用 AI 提速旧流程, 而不是改变工作方式

8. Financial Retail Case: AML Analyst / Credit Underwriter / Customer Service Agent

8.1 AML Analyst 工作重设计

AS-IS pain

Pain point	影响
资料收集和系统切换占用大量时间	investigator 判断时间被压缩
Case narrative 质量不一致	supervisor review 和 audit 压力大
Alert 优先级不清	高风险 case 可能被低价值队列拖延
新人依赖 SOP 但缺少判断训练	calibration 不稳定
证据链分散	复盘和监管询问成本高

Redesigned role-task architecture

Task	Allocation	Human role shift	Artifact	Control
Alert intake and enrichment	AI-assisted	从手工查资料转向验证证据完整性	Enrichment packet	source list、data freshness、missing evidence flag
Transaction pattern summary	AI-assisted	从汇总交易转向识别解释和异常	Pattern summary	linked transactions、aggregation rule
Risk hypothesis generation	AI-recommended	从空白分析转向挑战假设	Risk hypothesis list	alternative hypothesis、unsupported claim flag
Case narrative draft	AI-assisted	从写初稿转向判断叙事是否完整	Draft narrative	edit diff、citation requirement
Close / escalate decision	Human-only	investigator 承担风险判断	Decision memo	reason code、supervisor sample
SAR consideration	Human-only	senior investigator / compliance 承担正式升级	Escalation packet	approval workflow、evidence binder
Quality monitoring	AI-monitored	QA 从抽样检查转向趋势和校准	Quality dashboard	defect taxonomy、calibration session

Metrics

Family	Metric
Productivity	alert cycle time by risk tier、touch time reduction、queue age
Quality	narrative completeness、source-supported claim rate、supervisor return rate
Risk	false close sample rate、under-escalation、SAR escalation quality
Learning	investigator calibration agreement、challenge case pass rate
Employee load	review time sufficiency、high-risk queue pressure

Architecture controls

AML case UI
-> entitlement check
-> customer / transaction evidence package
-> typology and policy retrieval
-> AI summary and narrative draft
-> investigator evidence view
-> close / escalate decision log
-> supervisor review queue
-> audit evidence binder
-> QA and calibration dashboard

8.2 Credit Underwriter 工作重设计

AS-IS pain

Pain point	影响
文档审查和资料缺口识别耗时	审批周期长
政策解释和 reason code 使用不一致	合规和客户沟通风险
Underwriter 被迫在速度和质量之间权衡	可能出现浅层复核
模型分数、政策规则和人工判断分散	难以解释最终决定
新人训练依赖经验传授	标准化难

Redesigned role-task architecture

Task	Allocation	Human role shift	Artifact	Control
Document extraction	AI-assisted	从录入字段转向验证关键事实	Verified fact sheet	source highlight、confidence flag
Missing document detection	AI-recommended	从人工逐项查清单转向确认缺口	Requirement checklist	DMN completeness rule
Policy eligibility check	AI-decided with human review or AI-recommended	从查政策转向解释例外	Eligibility result	rule id、version、input facts
Credit memo draft	AI-assisted	从写作转向风险判断和客户解释	Underwriting memo	citation、edit diff
Approval / decline decision	Human-only within delegated authority	保留最终授信责任	Decision record	authority check、reason code
Adverse action reason	Human-only with AI assistance	确认原因准确、可解释、合规	Customer communication	compliance review sample
Portfolio quality monitoring	AI-monitored	管理层看趋势而非只追产能	Quality dashboard	exception trend、fairness review input

Metrics

Family	Metric
Productivity	application cycle time、document completeness first pass、touch time
Quality	policy rule consistency、reason code accuracy、memo completeness
Risk	fair lending review signal、override concentration、appeal / complaint trend
Learning	underwriter calibration、policy quiz and case drill score
Employee load	queue pressure、manual evidence search time、review interruption count

Architecture controls

Credit workflow
-> applicant data and document package
-> OCR / extraction with source highlight
-> DMN policy decision service
-> AI memo drafting service
-> underwriter decision UI
-> authority and reason-code validation
-> customer communication control
-> audit log and quality sample

8.3 Customer Service Agent 工作重设计

AS-IS pain

Pain point	影响
Agent 在多个知识库和系统间切换	AHT 高, 体验不一致
话术和政策更新不同步	客户可见错误和投诉风险
转人工或转二线时上下文丢失	客户重复描述问题
复杂投诉、欺诈、困难援助识别不稳定	高风险客户旅程延误
QA 事后发现问题, 修复慢	同类错误重复发生

Redesigned role-task architecture

Task	Allocation	Human role shift	Artifact	Control
Intent detection	AI-decided with human review for uncertain cases	agent 从识别入口转向确认意图和风险	Intent tag	confidence threshold、fallback
Knowledge retrieval	AI-assisted	从搜索知识库转向判断来源适用性	Source-linked answer	knowledge owner、effective date
Response drafting	AI-assisted	从写话术转向同理心、客户上下文和合规确认	Agent response draft	approved language、edit log
Simple status answer	AI-automated	agent 从低价值查询中释放	Customer response	low-risk scope、audit transcript
Complaint / fraud / hardship routing	AI-recommended with mandatory human confirmation	agent 负责识别和保护客户权益	Escalation case	trigger list、handoff packet
Handoff summary	AI-assisted	二线获得完整上下文	Handoff note	transcript summary、unresolved issue
QA monitoring	AI-monitored	QA 从随机听录音转向风险抽样和模式修复	QA dashboard	defect taxonomy、feedback-to-fix

Metrics

Family	Metric
Productivity	first contact resolution、AHT by intent、handoff repeat rate
Quality	policy-correct response rate、source-supported answer rate、QA defect rate
Risk	complaint under-escalation、privacy incident、wrong commitment rate
Learning	agent correction quality、knowledge feedback closure
Employee load	context switching count、AI suggestion interruption、after-call work

Architecture controls

Contact center desktop
-> customer identity and consent state
-> intent and risk classification
-> RBAC-filtered knowledge retrieval
-> AI draft with source and limitation
-> agent edit / send / escalate controls
-> handoff context packet
-> transcript and audit record
-> QA, knowledge and policy feedback loop

8.4 Cross-case lessons

Lesson	AML	Credit	Customer service
AI should absorb low-value cognitive load	evidence summary	document extraction	knowledge retrieval
Humans should own judgment and exceptions	close / escalate	approve / decline	complaint / fraud / hardship
Controls must be in workflow	supervisor review	authority check	escalation trigger
Metrics must balance speed and risk	false close	reason accuracy	wrong commitment
Training must preserve expertise	typology drills	policy calibration	complex case simulation

9. Artifact Templates

以下模板不是空表, 而是可直接改写的结构和示例。正式项目应把示例行替换为真实 use case 记录, 并保留 owner、版本和批准记录。

9.1 Role-task matrix

用途: 把岗位拆成任务, 标注 AI 分配模式、风险、产物、控制和指标。

Role	Job outcome	Task	Current pain	AI allocation	Human responsibility	Artifact	Control	Metric
AML investigator	正确处置 alert 并形成可审计结论	汇总客户交易模式	多系统查询耗时	AI-assisted	验证摘要, 识别异常, 作出处置判断	Pattern summary	source-linked transaction list	narrative completeness、false close sample
Credit underwriter	在授权范围内作出可解释授信决定	识别资料缺口	手工 checklist 易漏项	AI-recommended	确认缺口并决定是否要求补件	Completeness checklist	DMN rule id、source highlight	first pass completeness
Service agent	快速、准确、可升级地解决客户问题	草拟政策解释	知识库搜索慢且话术不一致	AI-assisted	确认来源、语气和客户上下文后发送	Customer response draft	approved source、agent edit log	QA defect、wrong commitment

字段说明:

Field	写法
Role	使用真实授权角色, 不使用泛泛的“用户”
Job outcome	写业务结果, 不写工具使用目标
Task	写可观察的工作单元
Current pain	写基于流程事实的痛点
AI allocation	从六类 task allocation 中选择
Human responsibility	写人保留的判断、复核或责任
Artifact	写正式或半正式输出
Control	写强制执行的流程、产品或架构控制
Metric	同时覆盖效率、质量或风险

9.2 Human-AI responsibility matrix

用途: 明确人、AI、规则服务、系统和控制 owner 的边界。

Work object	AI responsibility	Human responsibility	Rule / policy service	System control	Accountable owner
AML case summary	生成来源链接摘要和异常候选	验证关键事实、补充上下文、决定处置	typology guidance、escalation criteria	citation required、unsupported claim flag	AML operations manager
Credit eligibility	解释政策条件和候选 reason code	确认事实、处理例外、作出授权决定	DMN eligibility rules	authority check、reason-code validation	Lending business owner
Customer response	草拟基于知识库的答复	编辑语气、确认适用性、识别升级	approved knowledge base、compliance language rules	send control、handoff trigger	Service operations owner
Automated status update	在低风险范围内更新 case 状态	处理异常和客户异议	workflow transition policy	tool gateway、rollback log	Application owner / ops owner

责任边界句式:

AI is responsible for producing a bounded, source-linked draft or recommendation.
The human role is responsible for validating evidence, applying judgment, handling exceptions, and accepting the business decision within delegated authority.
The system is responsible for enforcing permissions, workflow transitions, evidence capture, monitoring, rollback and auditability.

9.3 Training / adoption plan

用途: 把 adoption 从“培训完成率”升级为任务级能力迁移和管理节奏。

Phase	Audience	Capability goal	Training method	Adoption evidence	Manager cadence
Role readiness	AML / credit / service team leads	理解 role-task changes、风险边界和指标变化	leader workshop with case walkthrough	signed operating model、metric baseline	weekly calibration review
Task skill build	frontline employees	能使用 AI 摘要、建议、草稿, 并识别错误	guided task simulation、side-by-side old vs new workflow	simulation score、challenge case pass	daily floor support for first 2 weeks
Judgment calibration	specialists and reviewers	对高风险 case 保持一致判断	blind review、disagreement discussion	calibration agreement、override analysis	biweekly case clinic
Control operation	supervisors / QA / risk liaison	会处理 override、incident、rollback 和 quality trend	control drill、audit replay exercise	audit replay success、incident response time	monthly risk / quality forum
Continuous learning	product, ops, knowledge owner	把一线反馈变成知识、流程和模型改进	feedback triage、release note review	feedback-to-fix lead time、defect reduction	monthly improvement backlog

Adoption 不能只用登录率衡量。每个 phase 都要有:

Evidence type	示例
Task usage	交易摘要、政策检索、草稿生成、升级包生成的任务级使用
Quality proof	QA defect 下降、source-supported answer 上升
Risk proof	under-escalation 没有上升, audit replay 能复现
Learning proof	challenge case pass rate、calibration agreement 提升
Load proof	员工没有因复核和反馈增加不可持续负荷

9.4 Workload-risk dashboard

用途: 管理层同时看工作量、风险、采用、质量和员工负荷, 防止 AI 项目只报告效率收益。

Dashboard section	Metric	Slice	Action when signal worsens
Workload	queue age、cases per FTE、touch time	risk tier、team、channel、product	调整队列、暂停低价值自动化实验、补充 reviewer capacity
Quality	QA defect、critical omission、source-supported claim	task type、AI feature、reviewer group	触发知识源修复、prompt / workflow change、retraining
Risk	false close、under-escalation、policy violation、incident	high-risk customer segment、decision type	扩大人工复核、降低自动化范围、开 incident review
Adoption	accepted with edit、rejected with reason、fallback	role、task、manager group	做 floor coaching, 修正 UI 或 incentive
Learning	challenge case pass、calibration agreement	tenure、team、role	增加 case clinic, 调整培训和 rubric
Employee load	review time sufficiency、interruptions、after-hours correction	team、shift、queue	移除旧任务, 降低 AI interruption, 调整 staffing
Control health	audit replay success、override reason completeness、rollback success	use case、release version	暂停 scale, 修复 evidence capture 和 workflow gate

Dashboard interpretation rules:

Pattern	Interpretation	Response
Productivity up, quality down	AI 提速但判断或证据质量受损	缩小自动化范围, 加强 evidence view 和 QA
Adoption high, override low	可能是真正好用, 也可能是过度信任	抽样检查 no-edit cases, 做 challenge case drill
Workload down, employee load up	旧工作减少但复核、反馈、监控负荷上升	重新分配任务, 移除重复记录和无价值审批
Risk defects concentrated by manager group	指标、培训或管理压力导致行为差异	manager coaching, metric correction, targeted audit
Audit replay failure	证据链不完整	停止扩大范围, 修复 trace schema 和 evidence retention

10. Interview Answers

10.1 30 秒版本

AI 转型不是给员工一个 Copilot, 而是重新设计知识工作的 role-task architecture。我会先做 job / task decomposition, 判断每个任务适合 human-only、AI-assisted、AI-recommended、AI-decided with human review、AI-automated 还是 AI-monitored。然后明确谁对结果负责、谁复核、谁能 override、谁保留证据, 再用 productivity、quality、risk、learning、adoption 和 employee load 指标验证。金融零售场景尤其要把 workflow、decision accountability、human-AI interaction 和 architecture controls 串起来, 否则 AI 只会制造局部效率和系统性风险。

10.2 2 分钟版本

我不会从“哪个岗位会被 AI 替代”开始, 而会从任务架构开始。一个 AML analyst、credit underwriter 或 service agent 的工作, 其实由资料收集、摘要、判断、客户沟通、系统记录、复核和升级组成。AI 对这些任务的适配程度不同: 摘要、检索、抽取、草拟通常适合 AI-assisted；排序和下一步建议可以是 AI-recommended；低风险、可逆、规则明确的状态更新可以 AI-automated；但高影响决策、风险接受、客户权益和监管签字通常必须 human-only 或至少 human review。

设计时我会把 role、job、task、decision、artifact、handoff、control、metric 连起来。比如信用审批中, AI 可以提取收入证明、发现缺失资料、草拟 underwriting memo, 但 underwriter 仍要在授权范围内确认事实、选择 reason code、作出批准或拒绝决定。系统必须保存来源、模型输出、人工修改、审批人、政策版本和下游动作。管理指标也要从单纯 AHT 或处理量, 扩展到 first pass quality、source-supported claim、override pattern、audit replay success、employee load 和 learning curve。这样 AI 才是工作系统的一部分, 不是旁路工具。

10.3 Transformation Lead 版本

作为 Transformation Lead, 我会把 AI knowledge work redesign 当成 operating architecture 项目, 而不是工具上线项目。第一步是建立 use case portfolio 和流程 baseline: 哪些工作量大、返工多、风险高、员工负荷重, 哪些任务有清晰证据和规则。第二步是 role-task architecture: 拆 role、job、task、decision、artifact、handoff、control、metric, 并为每个任务选择 human-only、AI-assisted、AI-recommended、AI-decided with human review、AI-automated 或 AI-monitored。

第三步是 accountability design: 业务 owner 对结果负责, reviewer 对复核质量负责, policy owner 对规则负责, platform owner 对 AI 行为和日志负责, risk / compliance 对控制有效性提出要求但不替业务接受结果。每个高风险动作都要说明谁能 override、谁保留证据、谁有停机权。第四步是 architecture controls: identity and entitlement、DMN / policy service、AI orchestration、tool gateway、evidence store、workflow engine、audit trail、monitoring dashboard 和 rollback。第五步是 adoption operating rhythm: manager calibration、case clinic、challenge cases、QA feedback-to-fix、workload-risk dashboard。

我会用 balanced metrics 管理收益: productivity 看 cycle time 和 touch time, quality 看 source-supported claim 和 first pass quality, risk 看 false close、under-escalation 和 audit replay, learning 看 calibration 和 challenge case, adoption 看 task-level usage 和 reject / edit 行为, employee load 看 review time、interruptions 和 after-hours correction。这样才能证明 AI 不是把压力转嫁给员工, 而是真正重构了金融零售知识工作。

10.4 常见追问

追问	回答要点
如何判断任务能不能自动化	看客户影响、可逆性、证据质量、政策清晰度、bias / fairness 敏感性、时间压力、人工判断价值和审计要求。
如何避免员工被 AI 降维成审核按钮	保留 judgment、exception、customer empathy、quality improvement 和 risk escalation 任务, 同时减少旧的重复查找和录入任务。
如何防止 adoption 指标误导	不只看使用率, 还看 accepted with edit、rejected with reason、fallback、quality defect、override pattern 和 employee load。
如何向高管解释 ROI	ROI 必须连接 baseline、workflow change、adoption proof、quality proof、risk proof 和 unit economics, 不能只用节省小时数。
如何处理员工担忧	不用泛泛安抚, 而是明确任务如何变化、哪些责任保留、哪些技能升级、哪些旧任务移除、绩效指标如何调整。
架构师在这里的价值是什么	把角色任务边界转成 workflow gate、policy service、tool gateway、evidence store、audit trail、monitoring 和 rollback, 让责任和控制可执行。

11. 作品集表达方式

如果把本文能力放进作品集, 最好展示一套可交付证据:

Artifact	证明的能力
AS-IS / TO-BE role-task architecture	能从岗位视角重构知识工作
Task allocation matrix	能判断 AI、人、规则和系统的任务边界
Accountability and override matrix	能设计责任、授权、复核和证据
Workflow and architecture control diagram	能把组织设计落到产品和系统控制
Balanced metric dashboard	能证明效率、质量、风险、采用和负荷一起改善
Training and adoption plan	能让员工能力迁移, 而不是只完成工具培训
Financial retail case pack	能把 AML、credit、customer service 等场景讲成可落地方案

一句 portfolio pitch:

I redesigned AI-assisted financial retail knowledge work by decomposing roles into tasks and decisions, assigning human-AI responsibilities by risk, embedding accountability and evidence controls into workflow architecture, and measuring productivity, quality, risk, adoption, learning and employee load together.