AI 扩展计划 / Playbooks

AI Active Learning / Human Feedback Operations Playbook

以下来源是本文的技术和治理锚点。本文把它们转成产品架构、运营流程、评估门禁和模型风险证据要求，不把任何论文、框架或工具文档直接等同于监管合规结论。

1,214 行AI_ACTIVE_LEARNING_HUMAN_FEEDBACK_OPERATIONS_PLAYBOOK.md

AI Active Learning & Human Feedback Operations Playbook

定位：面向高级 AI PM / AI BA / AI Architect / Model Risk / 金融零售产品与运营团队，把 active learning、human-in-the-loop labeling、生产反馈、审核队列、标签质量、模型风险证据和客户伤害边界组合成可上线、可监控、可审计的 AI 反馈运营系统。

适用边界：本文面向 fraud、KYC、AML / financial crime、complaints、collections、customer servicing、RAG QA、内部 copilot、风险运营和模型持续改进。它不把“人工反馈”当成临时标注外包，而是把它设计成 AI 产品的控制平面、学习飞轮和风险证据链。

重要说明：本文是学习、作品集和内部方案训练材料，不构成法律意见、合规结论、模型验证报告或监管解释。正式项目必须由 Legal、Compliance、Model Risk、Fair Lending、Privacy、Security、Business Owner、Operations、Customer Experience 和管理层结合机构类型、司法辖区、业务用途、客户影响和内部政策确认。

Source Anchors

Anchor	Link	本文使用方式
Burr Settles, Active Learning Literature Survey	https://burrsettles.com/pub/settles.activelearning.pdf	建立 active learning 的主线：学习器在标签预算有限时选择最有信息量的样本向 oracle 查询；覆盖 pool-based、stream-based、uncertainty sampling、query-by-committee、expected model change、expected error reduction、density weighting 等方法。
NIST AI Risk Management Framework 1.0	https://www.nist.gov/itl/ai-risk-management-framework 和 https://doi.org/10.6028/NIST.AI.100-1	用 Govern / Map / Measure / Manage 组织人工反馈系统的风险治理：业务边界、客户影响、数据质量、监控、事件处置、审计证据和责任归属。
NIST AI RMF Playbook	https://airc.nist.gov/airmf-resources/playbook/	作为治理动作锚点：把 AI RMF 的核心功能转成 release gate、review cadence、evidence binder 和 change management 操作。
Label Studio Active Learning Documentation	https://docs.humansignal.com/guide/active_learning 和 https://labelstud.io/guide/ml	作为工程实现锚点：用标注任务、模型后端、预测结果、人工审核和主动学习循环说明 labeling platform 如何接入 ML pipeline。
AWS Well-Architected Machine Learning Lens, MLPER-18 Human-in-the-loop Monitoring	https://docs.aws.amazon.com/wellarchitected/latest/machine-learning-lens/mlper-18.html	作为 MLOps / HITL 监控锚点：把人工标注模型输出作为生产质量监控的一种可靠信号，连接采样、审核、性能监控和持续改进。

1. 一句话定位

Active learning 的产品目标不是“让人帮模型标几个不确定样本”，而是：

Active Learning Operations =
在标签预算、专家时间、客户伤害边界和模型风险要求之内，
系统性选择最值得人工判断的样本，
通过高质量审核、校准、裁决和数据版本治理，
把生产反馈转成可复现的训练数据、评估数据、策略调整和审计证据。

高级 PM / 架构师要能回答五个问题：

哪些样本值得花专家时间标注？
哪些样本不应该进入训练，因为它们属于 eval set、客户申诉、法律保全或模型风险证据？
谁有资格标注，什么时候需要 double review 或 adjudication？
人工反馈如何进入模型、规则、RAG、taxonomy 和运营 SOP，而不是变成无治理的数据垃圾桶？
生产反馈闭环是否会放大偏差、伤害客户或让模型只学习到现有策略的盲区？

金融零售里的 active learning 是一个运营系统，不只是一个算法：

维度	错误理解	正确设计
标签预算	把预算平均分给所有业务线	按风险、学习价值、客户影响、segment 覆盖和运营 SLA 分配
不确定样本	只看模型 confidence 最低的样本	结合 uncertainty、representativeness、business value、harm boundary 和 reviewer capacity
人工审核	让 analyst 在业务系统里随手改结果	结构化 labeling queue、rubric、双人复核、裁决、审计日志
生产反馈	所有 override 都拿来训练	区分训练标签、评估标签、申诉证据、异常样本和 policy feedback
模型迭代	标签一够就自动重训上线	数据版本、label quality、eval protection、release gate、shadow/pilot、监控

2. 为什么重要

2.1 标签不是越多越好，而是越可控越好

金融零售 AI 常见的真实约束：

约束	影响
专家时间稀缺	Fraud investigator、KYC analyst、complaint specialist、collections strategy SME 都有真实业务队列，不是无限 oracle
标签存在延迟	chargeback、违约、AML disposition、投诉升级、催收回收需要天到月级别结果
标签本身有噪声	不同审核员对投诉意图、KYC 风险、RAG 答案充分性可能判断不一致
错误有客户伤害	误拒交易、错误 KYC 阻断、投诉误分类、催收不当联系、RAG 错答都可能带来实际损害
反馈会改变分布	模型拦截、人工升级和运营策略会改变未来被观察到的标签
高风险场景需要证据	模型风险、审计和内部治理需要证明数据来源、标签质量和变更理由

Active learning 的价值不是减少人工，而是把专家判断用在最能降低风险和提升学习效率的位置。

2.2 从模型中心到反馈运营中心

传统模型迭代路径：

Collect labels
-> train model
-> evaluate
-> deploy

金融零售生产 AI 需要的路径：

Production decisions and model uncertainty
-> candidate selection policy
-> labeling queue with risk and SLA
-> skill-based reviewer allocation
-> reviewer calibration and adjudication
-> label quality gates
-> dataset versioning and eval set protection
-> model / policy / taxonomy / RAG improvement
-> release gate and monitoring
-> feedback loop risk review

这条路径把 human feedback 从“数据准备工作”提升为 AI 产品能力：

产品能力	业务价值
低标签成本学习	用较少专家标签获得更快模型改善
人工队列优先级	让最有客户伤害或模型学习价值的 case 先被处理
taxonomy 进化	投诉、KYC、RAG 等场景可以发现新意图、新错误类型和新政策缺口
模型风险证据	能证明谁标了什么、为什么标、标签质量如何、如何影响上线
反馈闭环治理	避免模型用自身输出和偏差反馈强化错误

2.3 Active learning 不等于 selective prediction

Active learning 和 selective prediction 经常一起出现，但解决的问题不同：

概念	核心问题	输出动作
Active learning	哪些未标注样本最值得请人标注来提升模型	label request、review queue、dataset update
Selective prediction	模型在哪些样本上足够可靠，可以自动处理	auto decision、abstain、clarify、human escalation
Human oversight	哪些 AI 输出需要人工确认或监督	approve、override、escalate、incident
Continuous evaluation	生产系统是否仍然有效	QA sample、gold review、monitoring alert

高级设计要把它们连接起来，但不能混为一谈：

Low confidence customer-facing answer
-> selective prediction says: do not answer automatically
-> active learning says: send to SME for label if it is representative and useful
-> human oversight says: protect customer now through escalation
-> evaluation says: add similar cases to future QA and regression suites

3. 核心概念

3.1 Active Learning Scenarios

Scenario	含义	金融零售适用性
Pool-based sampling	从一批未标注样本池中挑选最值得标注的样本	最常见，适合投诉语料、KYC exceptions、RAG conversations、historical fraud cases
Stream-based selective sampling	样本流入时实时决定是否请求标签	适合生产交易、实时客服会话、RAG 回答 QA、欺诈告警抽检
Membership query synthesis	系统生成合成样本请求标签	高风险金融场景谨慎使用，适合低风险 taxonomy 边界探索、synthetic eval 设计，不宜直接混入真实训练证据

Pool-based 是产品团队最容易落地的第一步：

Unlabeled pool
-> model scores and embeddings
-> sampling policy
-> reviewer queue
-> labels and rationales
-> quality gate
-> dataset version
-> model and eval update

Stream-based 更接近生产控制：

Event arrives
-> risk and uncertainty scoring
-> immediate business action
-> decide whether to request label
-> async review if not blocking customer
-> fast escalation if customer harm boundary is crossed

3.2 Query Strategy Frameworks

Strategy	核心思想	常见指标	优点	风险
Uncertainty sampling	选择模型最不确定的样本	least confidence、margin、entropy	简单、便宜、可解释	容易选到噪声、异常点或重复边界样本
Query-by-committee	多个模型或 prompt / retriever 版本分歧越大，越值得标注	vote entropy、KL divergence、disagreement rate	能捕捉 epistemic uncertainty	成本高，committee 多样性不足会虚假一致
Expected model change	选择标注后最可能改变模型参数或梯度的样本	expected gradient length	更贴近训练影响	工程复杂，对模型结构依赖强
Expected error reduction	选择预期能最大降低泛化错误的样本	expected future loss	目标最直接	计算昂贵，近似假设多
Variance reduction	选择能降低预测方差的样本	posterior variance、ensemble variance	适合概率模型和不确定性治理	对模型假设敏感
Density-weighted / representative sampling	样本既不确定又位于高密度区域	uncertainty * density	避免只选孤立异常点	需要 embedding / clustering 质量可靠
Diversity sampling	避免队列被相似 case 占满	cluster coverage、dedup distance	提高标签覆盖面	可能牺牲短期模型收益
Cost-sensitive active learning	把标注成本、错误成本、客户影响纳入选择	utility per label dollar	贴近运营预算	需要业务 cost matrix 和治理接受

金融零售中，单纯 uncertainty sampling 很少足够。更稳健的策略通常是：

Selection utility =
learning_value
+ business_risk_weight
+ segment_coverage_weight
+ drift_signal_weight
+ complaint_or_harm_weight
- duplicate_penalty
- reviewer_cost_penalty
- eval_leakage_penalty
- customer_harm_exclusion

其中 customer_harm_exclusion 是硬边界：某些 case 首先要保护客户、保留证据或走正式流程，不能为了模型学习而改变处理。

3.3 Uncertainty Signals

Signal	适用模型	用法
Class probability margin	分类模型	top1 和 top2 类别接近时进入 review
Entropy	多分类模型	意图分类、投诉 taxonomy、KYC document type
Calibrated confidence	已校准模型	接入 risk-based routing 和 label priority
Ensemble disagreement	多模型、committee、多个 prompt	欺诈、RAG、KYC matching、collections strategy
Embedding distance	文本、图像、RAG query	OOD、novel intent、knowledge gap
Retrieval support weakness	RAG	citation 弱、来源冲突、answerability 低
Rule-model conflict	决策系统	模型建议与政策规则或 analyst override 冲突
Segment drift	生产监控	某渠道、产品、地区、语言或客户生命周期变化
Outcome surprise	结果回流	高置信错误、客户申诉、chargeback、QA fail

3.4 Human Feedback 类型

Feedback 类型	示例	能否直接训练
Ground-truth label	confirmed fraud、complaint final category、KYC verified outcome	可进入训练，但要处理标签延迟和质量
Expert judgment	SME 判断 RAG 答案是否充分、KYC exception 是否合理	可进入训练或 eval，需 reviewer calibration
Analyst disposition	AML / fraud analyst 处理结论	可用，但要区分调查结论、运营动作和最终事实
Override reason	人工推翻模型或规则	适合作为 error taxonomy 和 policy feedback，不应无条件当作 truth
Customer complaint / appeal	客户提出异议	重要 harm signal，不等于标签本身，需 case review
Pairwise preference	两个回答哪个更好	适合 RAG / copilot ranking 和 response style
Free-text critique	审核员解释错误原因	适合 taxonomy、prompt、retrieval 和 guideline 改进
Implicit behavior	用户重试、转人工、放弃流程	适合监控和候选样本选择，通常不能直接当标签

4. Active Learning Platform 架构

4.1 总体架构

Production systems / batch pools
  - fraud alerts
  - KYC exceptions
  - complaint cases
  - collections outcomes
  - RAG conversations
  - analyst overrides
  - customer appeals
        |
        v
Event and data ingestion
        |
        v
Feature, embedding and evidence builder
        |
        v
Uncertainty and disagreement scoring
        |
        v
Sampling policy engine
  - learning value
  - risk tier
  - segment coverage
  - budget allocation
  - eval protection
  - customer harm exclusions
        |
        v
Labeling queue orchestrator
  - priority
  - SLA
  - reviewer skill routing
  - double review
  - adjudication
        |
        v
Reviewer workspace
  - source evidence
  - model output
  - policy rubric
  - label options
  - rationale capture
        |
        v
Quality control and adjudication
        |
        v
Feedback registry and dataset versioning
        |
        v
Model / taxonomy / RAG / policy improvement pipeline
        |
        v
Evaluation, release gate, monitoring and audit binder

4.2 架构组件

Component	主要职责	关键设计点
Event ingestion	接收生产事件、批量样本、人工 override、客户反馈	保留 decision context、model version、policy version、channel、timestamp
Evidence builder	聚合交易、客户、文档、对话、检索来源和历史处理记录	严格权限控制，避免 reviewer 看到不必要敏感信息
Uncertainty scorer	输出 uncertainty、disagreement、OOD、retrieval weakness、segment drift	分数版本化，记录计算方式和输入
Sampling policy engine	决定哪些样本进入标注队列及优先级	可配置、可审计、可回放，不让临时 SQL 成为生产策略
Budget manager	管理每日、每周、每 use case、每 segment 的标签预算	支持 hard quota、reserve budget、incident surge
Queue orchestrator	分配任务、SLA、双审、裁决和返工	与 reviewer skills、业务日历、队列负载连接
Reviewer workspace	展示证据、rubric、模型输出、历史相似案例和标签控件	避免 anchoring bias，必要时隐藏模型答案直到 reviewer 提交初判
Quality control	gold tasks、inter-rater agreement、adjudication、reviewer drift	控制标签可靠性，而不只看 throughput
Feedback registry	存储标签、理由、审核轨迹、冲突、裁决和使用许可	区分 train、eval、monitoring、audit、legal hold
Dataset versioning	生成训练集、校准集、评估集和分析集版本	防止 eval leakage 和标签污染
Improvement pipeline	触发模型重训、prompt 调整、retriever 修复、taxonomy 变更	所有变更要进入 release gate
Monitoring and audit	跟踪 learning curve、label quality、segment coverage、customer harm	输出模型风险和运营 evidence binder

4.3 数据对象模型

对象	关键字段	治理要求
Candidate	`candidate_id`、source system、risk tier、segment、model version、uncertainty score、sampling reason	可回放采样理由
Review task	`task_id`、priority、SLA、review type、assigned role、rubric version	支持双审和裁决
Label	label value、confidence、rationale、evidence references、reviewer id、timestamp	不允许无理由高风险标签
Adjudication	disagreement type、final label、adjudicator rationale、policy interpretation	用于 guideline 改进
Feedback usage	train / eval / monitor / audit / policy / exclude	防止同一标签被错误复用
Dataset version	source candidates、label quality threshold、split policy、created by、approval	支持模型风险审查
Release decision	model version、dataset version、eval result、risk acceptance、monitoring plan	与变更管理连接

4.4 Labeling Queue 产品能力

一个成熟的 labeling queue 不只是任务列表，而是 AI 反馈运营台：

能力	说明
Priority bands	按客户伤害、SLA、学习价值、segment 缺口和 incident 状态排序
Skill routing	把 fraud、KYC、complaints、collections、RAG QA 分给不同 reviewer pool
Blind review	高风险或校准任务隐藏模型答案，减少 anchoring
Double review	对边界样本、政策敏感样本、低一致性 taxonomy 启用双人复核
Adjudication lane	对 reviewer disagreement 进入 senior SME 裁决
Gold injection	混入已知答案任务监控 reviewer drift
Label rationale	要求结构化原因码和证据引用
Eval lock	标记 gold / eval / legal hold 样本，禁止进入训练
Feedback usage tag	明确该标签用于训练、监控、审计或策略修复
Audit playback	能重放当时模型、证据、rubric、审核和裁决过程

5. Sampling Policy 设计

5.1 从“最不确定”到“最值得标”

一个样本进入 review queue 的理由通常来自多个信号：

信号	问题	示例
Uncertainty	模型是否不知道	投诉意图 top1 0.42、top2 0.39
Disagreement	不同模型是否冲突	fraud model 拦截，规则引擎放行
Expected impact	标注后是否可能改善关键边界	信用卡争议交易和 fraud block 的交叉区
Representativeness	是否代表一批相似样本	新投诉主题在 embedding cluster 中快速增长
Segment coverage	是否补齐弱覆盖群体	西班牙语客服 RAG QA 样本不足
Drift	是否反映分布变化	新商户 MCC 的 fraud false positive 激增
Harm signal	是否涉及客户损害	客户申诉 AI 答案误导或 KYC 阻断
Label cost	标注是否昂贵	AML case 需要 senior investigator，成本高
Legal / audit status	能否用于训练	申诉证据可审计但可能不能直接训练

5.2 Utility Score 示例

label_priority_score =
  0.25 * calibrated_uncertainty
+ 0.20 * committee_disagreement
+ 0.15 * segment_coverage_gap
+ 0.15 * production_drift_signal
+ 0.15 * expected_business_value
+ 0.10 * customer_harm_signal
- 0.15 * duplicate_similarity_penalty
- 0.10 * reviewer_cost_penalty

Hard exclusions:
  - protected eval sample
  - legal hold or formal complaint evidence restricted from training
  - insufficient customer permission or data minimization breach
  - case requires immediate customer remediation before learning use

这个公式是产品规格的表达方式，不是通用最优公式。正式项目要用历史数据、reviewer capacity、业务风险偏好和模型风险要求校准权重。

5.3 Pool-Based Selection 工作流

1. Build candidate pool
   从最近 7 / 14 / 30 天生产样本、历史未标注样本和 drift clusters 生成候选池。

2. Score candidates
   计算 uncertainty、disagreement、embedding cluster、segment、risk tier、customer harm signal。

3. Apply exclusions
   移除 eval protected、legal hold、权限不足、重复样本和不适合训练的 case。

4. Allocate budget
   在 use case、segment、risk tier 和 reviewer pool 之间分配标签预算。

5. Create queue
   生成任务优先级、SLA、review type、rubric version 和 reviewer skill requirement。

6. Review and adjudicate
   进行单审、双审、QA、gold check 和 senior adjudication。

7. Register feedback
   写入 feedback registry，标记用途、质量等级和数据版本。

8. Evaluate learning curve
   比较主动选择样本与随机样本的模型改善、标签成本和风险影响。

5.4 Stream-Based Selection 工作流

Stream-based 适合生产事件持续到达的场景，但要区分同步阻断和异步学习。

类型	场景	处理方式
Synchronous high harm	交易拦截、账户冻结、KYC 阻断	先按风险策略保护客户或机构，再把 case 进入 expedited review
Asynchronous QA	RAG 回答、客服 copilot 建议、投诉初分类	不阻断客户流程，抽样进入 QA / learning queue
Continuous monitoring	fraud drift、collections contact outcome、complaint taxonomy drift	按时间窗口和 segment quota 入队
Incident surge	某模型版本高置信错误激增	临时提高相关 cluster / segment 的标注预算和 senior review 比例

Stream-based 的关键日志：

字段	用途
decision time model output	证明当时模型如何判断
policy action	区分模型建议和实际业务动作
propensity / sampling probability	支持后续偏差分析
customer impact marker	标记客户是否受到不利影响
reviewer timing	区分实时复核和事后 QA
final disposition	连接生产结果、人工结论和真实标签

5.5 Budget Allocation

标签预算不是单一池子，而是 portfolio allocation：

Budget bucket	目标	示例比例表达
Boundary learning	改善模型最不确定或阈值附近区域	fraud score near block threshold
Segment coverage	补齐弱覆盖客户、渠道、语言、产品	Spanish servicing、new merchant segment
Drift response	处理生产分布变化	新诈骗模式、新 KYC 文档类型
Harm and complaint review	处理客户伤害信号	wrong RAG answer、false decline appeal
Random sentinel	保留无偏监控样本	每日固定比例随机 QA
Gold calibration	监控 reviewer 质量	已知答案 case 混入队列

没有 random sentinel，active learning 很容易只看模型已知边界，失去对整体生产分布的感知。没有 segment coverage，模型可能在整体上变好，但在弱势 segment 上变差。

5.6 Eval Set Protection

Eval set 是模型风险证据，不是训练燃料。

风险	控制
eval leakage	eval 样本和近重复样本禁止进入训练
reviewer contamination	reviewer 不应在训练 guideline 中看到 gold 答案模式
prompt overfitting	RAG / LLM prompt 调整不能反复针对同一 eval case
taxonomy drift	eval label 的语义随 taxonomy 变更失效时要版本化解释
production replay	从生产回放生成 eval 时要记录采样窗口和排除规则
benchmark gaming	团队不能只优化公开 dashboard 指标而忽视客户伤害指标

实践上要维护至少四类数据：

Dataset	用途	保护级别
Training set	训练模型或改进 prompt / retriever	可扩充，可清洗
Calibration set	校准 confidence、threshold、routing policy	独立于训练
Evaluation set	release gate 和回归测试	严格冻结或版本化冻结
Monitoring sample	生产抽检和漂移监控	保持采样概率和时间窗口记录

6. Human-in-the-Loop Labeling Operations

6.1 Reviewer 角色模型

Role	主要职责	不能混淆的边界
Frontline reviewer	按 rubric 标注常规 case	不负责最终 policy 解释
SME reviewer	处理复杂、边界或高风险 case	不应单独改变模型上线门禁
Senior adjudicator	裁决 reviewer disagreement	需要记录裁决理由和 guideline 更新建议
QA lead	监控 reviewer agreement、gold accuracy、返工率	不应只用 throughput 管理质量
Taxonomy owner	管理投诉意图、错误类型、KYC reason code、RAG failure mode	变更 taxonomy 必须版本化
Model risk reviewer	评估数据、标签、评估和上线证据	不替代业务 owner 风险接受
Product owner	决定队列优先级、客户体验和 release scope	不应绕过质量门禁追求上线速度
Operations manager	管理 SLA、人员排班和产能	不应把复杂 case 压给低技能队列

6.2 Reviewer Allocation

Reviewer allocation 要同时优化质量、速度和风险：

规则	示例
Skill-based routing	KYC beneficial owner exceptions 只分给通过认证的 KYC analyst
Risk-based escalation	高客户伤害或政策敏感样本强制 senior review
Blind double review	对 taxonomy 不稳定或争议样本启用双盲审核
Load balancing	避免少数 reviewer 承担所有复杂 case 导致疲劳
Conflict control	参与原始业务处理的 analyst 不裁决自己的 override
Reviewer calibration	每周或每版本进行 gold set 校准
Language / domain fit	多语言投诉、地区政策、产品线规则要匹配 reviewer 能力

6.3 Label Quality 控制

Quality control	说明	指标
Labeling guideline	明确定义标签、边界、反例、证据要求和冲突处理	guideline version adoption
Gold tasks	混入已知答案任务	gold accuracy、false pass rate
Inter-rater agreement	多 reviewer 对同一 case 的一致性	Cohen's kappa、Fleiss' kappa、agreement rate
Adjudication	对分歧做 senior 裁决	disagreement rate、overturn rate
Rationale review	检查标签理由是否支撑结论	rationale completeness
Drift review	监控 reviewer 随时间偏移	rolling agreement、gold decay
Error taxonomy	把标签错误分为 guideline ambiguity、evidence missing、reviewer mistake、policy conflict	root cause mix
Rework loop	对失败标签返工并记录原因	rework rate、cycle time

高质量标签通常包含三件事：

label value
+ evidence reference
+ reviewer rationale

缺少 evidence 和 rationale 的标签难以用于高风险模型治理。它也无法解释模型为什么学到某个模式。

6.4 Reviewer Calibration

Reviewer calibration 不是培训签到，而是让人类 oracle 自己可测量。

Calibration round
-> reviewers label same gold / boundary cases
-> compare labels and rationales
-> identify disagreement patterns
-> update guideline examples
-> certify reviewer scope
-> monitor drift in production queue

Calibration 输出：

Artifact	内容
Reviewer calibration report	gold accuracy、agreement、典型分歧、行动计划
Guideline change log	新增定义、边界案例、反例、证据要求
Certification matrix	reviewer 可处理的 use case、risk tier、language、product
Quality exception log	质量低于阈值时的补训、降级或移出队列

6.5 Adjudication

Adjudication 的目标不是“找一个人拍板”，而是把分歧变成 taxonomy 和 policy 的学习信号。

Disagreement 类型	处理
Evidence missing	补充资料或标记不可判定
Rubric ambiguity	Taxonomy owner 更新 guideline
Policy conflict	Product / Compliance / Legal 介入解释
Reviewer mistake	QA 返工和培训
True business ambiguity	保留多标签、uncertain label 或升级专家
Model-induced anchoring	调整 UI，隐藏模型建议或改变展示顺序

裁决日志必须记录：

original labels
reviewer rationales
evidence considered
final label
adjudicator rationale
guideline implication
feedback usage permission

7. 金融零售 Use Cases

7.1 Fraud Investigator Labeling

目标：用 investigator 的高价值判断改善 fraud model、规则、step-up 策略和客户伤害控制。

设计项	方案
Candidate source	fraud alerts、declined transactions、step-up challenges、customer appeals、chargebacks、merchant disputes
Active learning signal	fraud score margin、model-rule conflict、new merchant cluster、high-confidence false positive、chargeback surprise
Reviewer	fraud investigator、senior investigator for high amount / vulnerable customer / repeat false positive
Label schema	confirmed fraud、not fraud、account takeover suspected、merchant dispute、customer confusion、insufficient evidence
Quality control	double review for high-dollar cases、gold tasks from closed investigations、appeal outcome reconciliation
Customer harm boundary	不为学习目的延迟必要的客户解冻、申诉处理或损失补救
Feedback usage	confirmed labels 进训练，appeal cases 进 harm analysis，investigator rationale 进规则和特征改进

关键指标：

Metric	含义
false positive appeal overturn rate	客户误伤
investigator-model disagreement	模型和专家冲突
label lag by fraud type	标签可用性
high-confidence error rate	高风险校准问题
segment false decline rate	公平和客户体验
learning curve per 1,000 labels	标签投资回报

7.2 KYC Exception Review

目标：把 KYC exception handling 从人工经验转成可学习、可审计、可分段监控的反馈系统。

设计项	方案
Candidate source	document OCR uncertainty、name mismatch、address mismatch、beneficial owner ambiguity、sanctions / PEP fuzzy match
Active learning signal	low margin document class、entity resolution disagreement、new document type、analyst override、region-specific error cluster
Reviewer	KYC analyst、financial crime SME、senior adjudicator for policy-sensitive cases
Label schema	verified match、mismatch、requires customer document、requires enhanced due diligence、false positive screening、insufficient evidence
Quality control	policy-versioned rubric、double review for high-risk entities、audit sampling
Customer harm boundary	不让未验证模型自动作出不利 KYC 结论；客户沟通和正式记录遵循既定流程
Feedback usage	entity matching labels 进训练，policy interpretations 进 guideline，ambiguous cases 进 taxonomy

风险点：

风险	缓解
不同地区证件和命名习惯覆盖不足	segment quota 和 language / region specialist review
模型把历史人工偏差学进去	random sentinel、fairness slicing、adjudication review
政策变更导致旧标签失效	policy version 和 label validity window
analyst override 被误当成 truth	区分 operational override、evidence label 和 final disposition

7.3 Complaints Intent Taxonomy

目标：让投诉分类模型和 taxonomy 随业务、渠道和客户语言演进，同时保留正式投诉处理边界。

设计项	方案
Candidate source	contact center transcripts、secure messages、social escalations、complaint intake forms、agent notes
Active learning signal	top intent margin 小、new embedding cluster、agent correction、complaint escalation、regulatory keyword conflict
Reviewer	complaints specialist、taxonomy owner、compliance reviewer for formal complaint boundary
Label schema	primary intent、secondary intent、root cause、product、channel、harm type、formal complaint flag、insufficient context
Quality control	taxonomy calibration、multi-label agreement、regular drift review
Customer harm boundary	模型分类不能替代正式投诉识别、响应时限或客户补救流程
Feedback usage	taxonomy labels 进训练，new clusters 进 taxonomy backlog，formal complaint decisions 进受控证据库

高级 PM 要关注：

问题	设计
投诉经常多意图	支持 primary / secondary label 和 hierarchical taxonomy
客户语言变化快	用 embedding cluster 发现新主题，不只看已知 intent
agent note 有主观性	优先使用客户原话和完整上下文
taxonomy 改动影响趋势	保留 taxonomy version map，支持跨版本对比

7.4 Collections Contact Strategy

目标：用人工反馈和结果标签改进催收联系策略，同时控制公平性、客户脆弱性和不当联系风险。

设计项	方案
Candidate source	contact attempts、promise-to-pay、hardship signals、complaints、agent notes、repayment outcomes
Active learning signal	strategy uncertainty、treatment-outcome surprise、segment drift、agent override、complaint after contact
Reviewer	collections strategy analyst、customer care SME、compliance reviewer for sensitive cases
Label schema	successful contact、promise kept、hardship identified、wrong party contact、complaint risk、do-not-contact constraint
Quality control	outcome lag handling、case note review、policy-sensitive double review
Customer harm boundary	不用 active learning 探索不当联系策略；客户脆弱性和限制偏好是硬约束
Feedback usage	outcome labels 用于策略评估，complaint / hardship signals 用于 exclusion and guardrail

Collections 场景特别要防止 feedback loop：

模型选择联系某类客户
-> 只有被联系客户产生结果标签
-> 模型以为未联系客户没有响应机会
-> 策略越来越偏向历史高接触群体

控制方式：

控制	说明
propensity logging	记录每个客户被某策略选择的概率
randomized holdout where appropriate	在低风险策略范围内保留可评估样本
customer constraints first	联系限制、脆弱性、投诉风险优先于学习价值
segment outcome review	按客户生命周期、渠道、产品和地区看结果

7.5 RAG Answer QA Review

目标：把 RAG 回答质量从“用户点踩”升级为结构化 QA、主动学习和知识治理闭环。

设计项	方案
Candidate source	customer questions、agent copilot responses、low citation support answers、thumbs down、escalations、policy-sensitive intents
Active learning signal	retrieval confidence low、source conflict、answerability uncertainty、LLM judge disagreement、customer retry、agent edit distance
Reviewer	servicing SME、policy owner、RAG QA analyst、compliance reviewer for high-risk answers
Label schema	correct and supported、partially supported、unsupported claim、wrong source、outdated source、permission issue、should refuse、should escalate
Quality control	citation-level review、gold question set、answer rubric calibration
Customer harm boundary	不让模型对信贷、投资、投诉、费用争议等高风险事项给出未支撑的确定答案
Feedback usage	supported labels 进 eval，failure modes 进 retriever / prompt / source governance backlog

RAG QA 的审核单位不是整段回答，而是 claim：

question
-> answer
-> claims
-> cited evidence
-> reviewer verdict per claim
-> answer-level decision
-> feedback route: source fix / retriever fix / prompt fix / policy escalation

8. Feedback Loop 治理

8.1 Feedback Loop 生命周期

1. Capture
   记录生产事件、模型输出、业务动作、客户反馈、人工 override 和结果标签。

2. Select
   用 sampling policy 选择候选样本，并记录采样概率和理由。

3. Review
   由合适 reviewer 按 rubric 标注，必要时双审或裁决。

4. Validate
   检查标签质量、权限、用途、数据泄露和 eval protection。

5. Register
   写入 feedback registry，连接 candidate、label、rationale、dataset version。

6. Improve
   改进模型、规则、taxonomy、prompt、retrieval、workflow 或客户体验。

7. Evaluate
   在独立 eval / calibration / monitoring set 上验证变化。

8. Release
   通过 release gate 后灰度、shadow 或 limited launch。

9. Monitor
   监控 performance、label quality、segment coverage、customer harm 和 drift。

8.2 Feedback 不是单一路径

同一个人工反馈可能进入不同改进轨道：

Feedback signal	改进轨道
模型分类错误	training data、threshold、calibration
rubric 边界不清	labeling guideline、taxonomy governance
RAG 引用过期	source governance、document lifecycle
reviewer 频繁分歧	reviewer calibration、policy clarification
客户申诉成立	customer remediation、harm analysis、model risk issue
agent 大幅编辑 copilot 输出	prompt / tool / retrieval improvement
analyst override 与规则冲突	decision policy review

8.3 防止反馈闭环偏差

生产 AI 最危险的反馈闭环是“模型只学习自己看见的世界”。

偏差类型	例子	控制
Selection bias	只有被模型拦截的交易有 fraud review 标签	random sentinel、propensity logging、unreviewed sample audit
Automation bias	reviewer 被模型建议影响	blind review、delayed reveal、model output masking
Confirmation bias	只把模型高置信错误拿来学习	保留 random QA 和 segment coverage
Survivorship bias	只看完成流程客户，不看放弃、转人工、投诉客户	capture abandonment、handoff、complaint signals
Historical bias	旧人工决策中的偏差被模型复制	fairness slicing、adjudication、policy review
Taxonomy lock-in	新投诉类型被强行塞进旧标签	novelty detection、new class review
Eval leakage	训练反复见过评估样本	eval lock、near-duplicate detection、dataset lineage

8.4 Customer Harm Boundaries

Active learning 不能为了学习而扩大客户风险。

Boundary	设计原则
No adverse action from unvalidated uncertainty	不因“模型想学习”对客户做不利动作
Remediation first	客户误伤、申诉、账户限制、错误回答先按业务流程补救
Data minimization	reviewer 只看完成标签所需证据
High-risk human escalation	高客户影响的低置信或冲突 case 强制人工
No dark experimentation	collections、credit、KYC、complaints 不做无治理探索
Explainable feedback use	能说明某类反馈如何用于训练、评估或策略修复

9. Governance and Model Risk

9.1 NIST AI RMF 映射

AI RMF Function	Active Learning / HITL 落地
Govern	定义 owner、reviewer role、label policy、data use、risk appetite、change management、evidence binder
Map	明确 use case、客户影响、业务动作、反馈来源、标签延迟、segment 和潜在伤害
Measure	测量 label quality、model performance、learning curve、fairness、queue SLA、reviewer agreement、feedback bias
Manage	用 sampling policy、human escalation、release gate、incident response 和 remediation 管理风险

9.2 Release Gate

Gate	通过标准
Use case boundary	明确 AI 输出是建议、排序、草稿、客户沟通、自动动作还是人审辅助
Sampling policy	有书面 policy，说明 uncertainty、disagreement、coverage、harm、budget 和 exclusion
Label guideline	标签定义、边界、反例、证据要求和 rubric 版本已批准
Reviewer readiness	reviewer 完成 calibration，角色权限和风险等级匹配
Label quality	gold accuracy、agreement、adjudication 和 rework 指标达到内部门槛
Eval protection	train / calibration / eval / monitoring 数据分离，近重复和泄露已检查
Segment coverage	关键客户、产品、渠道、语言和风险 segment 有覆盖和差异分析
Customer harm controls	高风险场景有 abstention、escalation、remediation 和 incident trigger
Feedback usage	每类反馈用途清楚：训练、评估、监控、审计、策略或排除
Monitoring	上线后有模型、队列、标签质量、客户影响和公平性监控

9.3 Audit Trail

审计轨迹要能回答：

为什么这个样本被选中标注？
当时模型、规则、prompt、retriever、taxonomy 和 policy 版本是什么？
reviewer 看到了哪些证据，没看到哪些敏感信息？
reviewer 如何判断，理由是什么？
是否存在分歧，谁裁决，裁决依据是什么？
该标签被允许用于训练、评估、监控、审计还是只用于 case handling？
它进入了哪个 dataset version，影响了哪个模型或产品变更？
上线后是否监控了相关风险和客户影响？

9.4 Evidence Binder

Artifact	内容
Use Case Feedback Intake	业务目标、客户影响、反馈来源、标签延迟、风险等级
Sampling Policy Spec	query strategy、priority formula、budget allocation、exclusion rules
Labeling Guideline	标签定义、例子、反例、证据要求、rubric version
Reviewer Calibration Report	gold accuracy、agreement、分歧类型、补训动作
Adjudication Log	reviewer disagreement、final label、裁决理由、guideline implication
Label Quality Dashboard	throughput、SLA、gold accuracy、IAA、rework、overturn rate
Dataset Lineage Report	candidate source、split policy、usage rights、version、near-duplicate check
Eval Protection Memo	训练、校准、评估和监控数据隔离证明
Segment Coverage Review	关键 segment 的样本量、错误率、升级率和标签质量
Release Gate Memo	模型 / policy / RAG 变更、评估结果、残余风险和批准
Monitoring Plan	drift、label quality、customer harm、fairness、incident triggers
Feedback Change Log	每次反馈驱动的 taxonomy、prompt、retriever、rule 或 model 改动

9.5 Incident Triggers

Trigger	处置
高客户影响场景出现高置信错误	暂停自动化或收紧阈值，启动 root cause review
reviewer agreement 突然下降	暂停相关标签进入训练，进行 calibration round
某 segment 标签质量或模型表现显著恶化	提高该 segment human review，限制自动动作
RAG unsupported claim 进入客户可见回答	停用相关 intent 自动回答，修复 source / retriever / prompt
active learning 队列集中在少数重复 cluster	调整 diversity 和 density policy
eval leakage 被发现	作废受污染评估结果，重建 eval set 和 release evidence
客户投诉指向反馈驱动模型变更	case review、客户补救、模型风险 issue 和变更回溯

10. 指标体系

10.1 Active Learning 指标

Metric	含义	使用方式
Learning curve	模型表现随标签数增长	比较主动采样、随机采样和业务规则采样
Label efficiency	每 100 / 1,000 个标签带来的性能提升	评估标签预算 ROI
Acquisition precision	被选中样本中真实有学习价值的比例	优化 sampling policy
Boundary improvement	阈值附近错误率是否下降	fraud block、KYC exception、complaint routing
Diversity coverage	采样覆盖多少 cluster / segment	防止重复标注
Drift capture rate	drift cluster 被标注覆盖比例	生产监控连接
High-confidence error capture	主动发现高置信错误能力	模型风险和校准治理

10.2 Label Operations 指标

Metric	含义
Queue backlog	未处理任务量
SLA attainment	按风险等级和业务线达成 SLA
Reviewer throughput	每 reviewer 每小时有效任务数
Rework rate	被 QA 或 adjudication 打回比例
Gold accuracy	已知答案任务正确率
Inter-rater agreement	多人标注一致性
Adjudication overturn rate	裁决推翻初审的比例
Label age	标签从事件发生到可用于训练的时间
Rationale completeness	标签理由和证据引用完整度
Reviewer drift	reviewer 质量随时间变化

10.3 Model and Product 指标

Metric	场景
Precision / recall by segment	fraud、KYC、complaints
Calibration and high-confidence error	风险分数和分类置信度
Automation coverage	selective prediction 和人审队列容量
Human override rate	模型和运营冲突
Customer appeal overturn rate	客户伤害和误判
Complaint after AI interaction	customer-facing AI 风险
RAG groundedness / citation support	RAG QA
Taxonomy unknown / new cluster rate	投诉和客服语料变化
Collections contact harm signal	催收策略客户影响

10.4 Fairness and Coverage 指标

Metric	解释
Segment sample coverage	每个关键 segment 的标注样本量和比例
Segment label quality	gold accuracy、agreement、adjudication by segment
Segment error rate	模型错误是否集中
Segment automation rate	自动处理率是否不均衡
Segment escalation rate	人工升级是否异常集中
Segment customer harm	误拒、投诉、申诉、错误回答、联系限制问题
Segment feedback lag	某些群体是否更晚获得真实标签

11. 模板

11.1 Use Case Feedback Intake

字段	要求	金融零售示例
Use case	明确业务流程、渠道和 AI 输出	Credit card servicing RAG answer QA
Customer impact	说明是否影响资金、账户、权益、投诉、信贷、KYC 或催收	客户可能把费用和争议回答理解为正式承诺
Feedback source	生产事件、人工 override、客户反馈、结果标签	thumbs down、agent escalation、SME QA
Label type	ground truth、expert judgment、preference、rationale、appeal signal	claim-level support label
Label lag	结果标签多久可用	QA 当日可用，投诉 outcome 需要后续回流
Active learning signal	uncertainty、disagreement、drift、harm、coverage	citation weak、retriever conflict、new intent cluster
Reviewer role	谁能标注，谁能裁决	servicing SME 初审，policy owner 裁决
Customer harm boundary	哪些 case 先保护客户或升级人工	信贷、投诉、费用争议不自动给最终结论
Feedback usage	训练、评估、监控、审计或排除	unsupported claim 进 RAG eval 和 source backlog
Evidence owner	模型、产品、运营、风险的 owner	Product owner + RAG platform owner + Model Risk

11.2 Sampling Policy Spec

模块	设计要求	示例
Candidate universe	进入候选池的数据范围	最近 14 天客服 RAG 会话和 agent escalations
Exclusions	不进入学习队列的样本	legal hold、eval locked、权限不足、已补救但受限 case
Priority signals	用于排序的信号	answerability uncertainty、citation support low、customer retry
Budget allocation	标签预算如何分配	40% uncertainty、20% segment coverage、20% random sentinel、20% harm signals
Diversity control	如何避免重复样本	embedding cluster 每日上限和 near-duplicate filter
Reviewer routing	谁处理什么任务	billing SME、credit SME、complaint SME 分池
Double review rule	何时双审	high-risk intent、policy conflict、new cluster
Adjudication rule	何时裁决	reviewer disagreement 或 label confidence low
Monitoring	采样策略是否有效	learning curve、QA fail capture、segment coverage

11.3 Labeling Queue Schema

Field	说明	示例值
task_id	标注任务唯一 ID	`rag_qa_2026_000184`
candidate_id	源样本 ID	`conversation_93281_turn_07`
use_case	业务场景	`credit_card_servicing_rag`
risk_tier	客户影响等级	`high_customer_visible`
priority_reason	入队原因	`unsupported_claim_risk + segment_coverage_gap`
sampling_probability	被采样概率	`0.18`
model_version	生产模型或 prompt 版本	`rag_router_v4.2`
policy_version	业务策略版本	`servicing_answer_policy_2026_05`
rubric_version	审核 rubric 版本	`rag_claim_support_rubric_v3`
reviewer_skill	所需 reviewer 能力	`credit_card_fee_policy_sme`
review_mode	单审、双审、盲审或裁决	`blind_double_review`
evidence_refs	审核证据引用	`policy_doc_17#section_4.2`
due_at	SLA 时间	`2026-06-30T16:00:00-05:00`

11.4 Reviewer Calibration Sheet

项目	评估方式	合格解释
Gold accuracy	已知答案 case 正确率	reviewer 能稳定识别明确样本
Boundary agreement	边界 case 与 senior adjudicator 一致性	reviewer 理解 rubric 的灰区
Rationale quality	标签理由是否引用证据	高风险标签能支撑审计
Policy sensitivity	是否识别需升级 Legal / Compliance 的 case	不把政策问题当普通标签
Drift awareness	是否识别新类型或 taxonomy 不足	能提出 guideline 更新
Rework trend	返工和推翻是否下降	培训有效

11.5 Adjudication SOP

Step	动作	输出
1	收集初审标签、理由和证据	disagreement packet
2	判断分歧类型	evidence、rubric、policy、reviewer mistake、true ambiguity
3	Senior adjudicator 给出最终标签	final label + confidence
4	记录裁决理由和证据	adjudication rationale
5	决定反馈用途	train、eval、monitor、audit、exclude
6	判断是否需要更新 guideline	guideline change request
7	将分歧模式计入质量 dashboard	disagreement root cause metrics

11.6 Eval Set Protection Checklist

检查项	通过标准
Split lineage	每个样本知道来自训练、校准、评估还是监控
Near-duplicate detection	eval 样本近重复不进入训练
Prompt / retriever tuning isolation	调参不反复针对同一 eval failure
Reviewer access control	gold 答案和 eval labels 不暴露给无关 reviewer
Versioned taxonomy	taxonomy 变化时 eval label 有版本解释
Release evidence	每次 release 使用的 eval set 版本可追溯
Contamination response	污染发生时有作废和重建流程

11.7 Feedback-to-Model Change Memo

字段	示例表达
Change driver	最近 30 天 RAG QA 显示 fee waiver policy 引用错误集中在 mobile app servicing intent
Feedback evidence	126 个 QA labels，gold accuracy 94%，adjudication overturn 6%，unsupported claim rate 18%
Root cause	Retriever 优先返回旧政策摘要，prompt 未要求版本优先级
Proposed change	更新 source ranking，加入 effective date filter，调整 answer template 的 citation requirement
Eval impact	Gold RAG set unsupported claim rate 从 12% 降到 4%，high-risk intent abstention rate 从 21% 到 24%
Customer harm control	fee dispute、credit decision、formal complaint intent 继续人工升级
Rollout	10% shadow，48 小时 QA，随后 limited launch
Monitoring	unsupported claim、customer escalation、source freshness、complaint after answer
Approval	Product owner、RAG platform owner、Model Risk reviewer、Servicing SME

11.8 Model Risk Evidence Binder Map

Evidence	Owner	Update cadence
Use case risk assessment	Product + Model Risk	New use case or scope change
Sampling policy	AI platform PM + Data Science	Monthly or strategy change
Label guideline	SME + Taxonomy owner	Each taxonomy or policy change
Reviewer calibration report	QA lead	Per release and monthly
Dataset lineage	Data platform	Each dataset version
Eval protection memo	Model validation / Model Risk	Each release
Segment coverage dashboard	Analytics + Fairness reviewer	Monthly
Customer harm review	CX + Compliance + Product	Monthly and incident-driven
Release gate memo	Product + Model Risk	Each release
Incident postmortem	Risk owner	Incident-driven

12. 30 天训练计划

目标：30 天内把 active learning 和 human feedback 从概念训练成可展示的金融零售 AI 产品、架构和治理资产。训练默认读者已具备高级 BA / PM / 架构能力。

Day	主题	产出
1	精读 Settles active learning survey 的场景和 query strategy	1 页 active learning method map
2	对比 pool-based、stream-based、membership query synthesis	金融零售适用边界表
3	设计 uncertainty sampling、margin、entropy 的产品解释	sampling signal cheat sheet
4	设计 query-by-committee 用于 RAG / fraud / KYC 的 committee 方案	disagreement scoring spec
5	研究 expected model change / expected error reduction 的产品取舍	高级面试解释卡
6	设计 active learning platform 总体架构	架构图和组件表
7	设计 labeling queue 数据模型	task schema + feedback registry schema
8	设计 reviewer role 和 skill routing	reviewer certification matrix
9	设计 reviewer calibration round	gold set plan + calibration report
10	设计 adjudication workflow	adjudication SOP
11	设计 label budget allocation	budget portfolio memo
12	设计 eval set protection	eval protection checklist
13	设计 feedback usage tags	train / eval / monitor / audit / exclude policy
14	设计 feedback loop bias controls	selection bias and propensity logging memo
15	Fraud case drill	fraud active learning queue + harm boundary
16	KYC case drill	KYC exception review feedback policy
17	Complaints taxonomy case drill	taxonomy drift and multi-label design
18	Collections strategy case drill	contact strategy feedback and fairness guardrails
19	RAG QA case drill	claim-level QA rubric
20	设计 active learning metrics	learning curve + label efficiency dashboard
21	设计 label operations metrics	queue SLA + reviewer quality dashboard
22	设计 segment fairness metrics	coverage and harm dashboard
23	用 NIST AI RMF 映射治理	Govern / Map / Measure / Manage evidence map
24	设计 release gate	active learning release gate memo
25	设计 incident triggers	feedback loop incident playbook
26	写 Feedback-to-Model Change Memo	完整变更说明
27	准备 architecture review	平台架构、数据流、权限和审计
28	准备 executive memo	标签预算、风险降低、客户保护、ROI
29	准备 interview story	STAR-T 面试答案
30	完成 portfolio package	case study、架构图、queue spec、evidence binder、dashboard mock

13. 面试答案

13.1 Active learning 和普通人工标注有什么区别？

30 秒回答：

普通标注通常是把样本批量交给人；active learning 是模型在标签预算有限时选择最有学习价值的样本向人查询。金融场景还要把客户影响、segment 覆盖、审核质量和 eval set 保护纳入选择策略。

2 分钟展开：

Active learning 的核心是样本选择策略。模型不是被动等待全量标签，而是根据 uncertainty、disagreement、expected impact、drift、representativeness 和 label cost 决定哪些 case 值得请专家判断。金融零售里不能只追求模型提升，还要控制客户伤害和模型风险证据。因此我会把 active learning 设计成平台能力：sampling policy、labeling queue、reviewer calibration、adjudication、feedback registry、dataset versioning 和 release gate。这样它既提升数据效率，也能支撑审计和治理。

13.2 Uncertainty sampling 有什么限制？

30 秒回答：

Uncertainty sampling 简单有效，但容易选到噪声、异常点、重复边界样本或低业务价值样本。高风险业务要结合代表性、多样性、segment 覆盖、客户伤害和 reviewer 成本。

2 分钟展开：

如果只选模型最不确定样本，队列可能被低质量数据、罕见异常或 taxonomy 模糊 case 占满，专家时间消耗很大但模型泛化提升有限。金融场景还会出现客户伤害边界，比如 fraud false decline 申诉要先补救客户，不是简单拿来训练。我会用 composite utility：uncertainty 加上 committee disagreement、density、segment coverage、drift 和 business value，同时设置 hard exclusions，保护 eval set、legal hold 和高风险客户处理流程。

13.3 Query-by-committee 适合什么场景？

30 秒回答：

Query-by-committee 适合模型不确定性来自知识不足而不是纯噪声的场景。多个模型、prompt、retriever 或规则版本分歧越大，越值得人工审核。

2 分钟展开：

在 fraud、KYC 和 RAG 中，单一模型 confidence 可能不可靠。QBC 用多个视角产生分歧信号，例如 fraud ensemble、KYC entity matching 模型加规则、RAG 的多个 retriever 或 prompt 版本。如果 committee 一致认为低风险，可以降低审核优先级；如果分歧很大，说明模型知识或证据不足，值得 SME 标注。关键是 committee 必须真正多样，否则只是多个相似模型给出虚假一致。治理上要记录 committee version 和 disagreement reason。

13.4 Expected model change 和 expected error reduction 为什么高级但难落地？

30 秒回答：

它们比简单 uncertainty 更接近“标这个样本能让模型变好多少”，但计算复杂、依赖模型假设和候选标签分布。产品上通常把它们转成近似 learning value 信号，而不是硬算理论最优。

2 分钟展开：

Expected model change 关注标注后模型参数或梯度会改变多少；expected error reduction 关注标注后未来泛化错误能降多少。它们目标更贴近学习效率，但在深度模型、LLM、RAG 和复杂金融特征下计算成本高，而且需要估计未知标签。实践中我会用近似方法，比如 ensemble disagreement、gradient embedding、cluster representativeness、threshold proximity 和历史 learning curve，形成可解释的 priority score。正式方案要用 A/B 或回放比较主动采样与随机采样的收益。

13.5 如何设计一个 active learning labeling platform？

30 秒回答：

我会设计从生产事件到反馈 registry 的闭环：ingestion、uncertainty scoring、sampling policy、labeling queue、skill routing、reviewer QA、adjudication、dataset versioning、release gate 和 monitoring。

2 分钟展开：

平台第一层是数据和证据：保留模型版本、业务动作、客户影响和采样概率。第二层是选择策略：综合 uncertainty、disagreement、segment coverage、drift、harm 和 budget。第三层是运营：按风险和技能分配 reviewer，支持 blind review、double review、gold tasks 和 adjudication。第四层是治理：feedback registry 标记标签用途，dataset versioning 保护 eval set，release gate 验证模型、RAG 或 policy 变更。最后监控 learning curve、label quality、客户伤害和 fairness。

13.6 如何保护 eval set？

30 秒回答：

Eval set 要和训练、校准、监控数据分离，并做版本、权限和近重复控制。任何 active learning 标签进入训练前，都要检查是否污染评估证据。

2 分钟展开：

Active learning 会不断挑选模型最关心的样本，这很容易污染评估集。如果团队反复用同一批失败 case 调 prompt 或模型，指标会变好但真实泛化不一定改善。我会维护 train、calibration、eval、monitoring 四类数据，给样本加 usage tag 和 eval lock，并做 near-duplicate detection。taxonomy 或 policy 改动时，eval label 也要版本化解释。若发现 leakage，要作废受污染结果，重建评估证据。

13.7 如何处理 reviewer disagreement？

30 秒回答：

分歧不是噪声，它通常暴露证据不足、rubric 模糊、政策冲突或真实业务 ambiguity。要通过双审、裁决、root cause 和 guideline 更新把分歧转成系统学习。

2 分钟展开：

我不会简单用多数投票结束分歧。第一步看分歧类型：证据缺失、标签定义不清、reviewer 失误、政策解释冲突还是 case 本身模糊。第二步由 senior adjudicator 记录最终标签和理由。第三步把分歧反馈给 taxonomy owner 和 guideline。指标上看 inter-rater agreement、adjudication overturn rate 和 disagreement root cause。高风险场景还要决定该标签能否用于训练，还是只能用于审计或监控。

13.8 如何避免 production feedback loop bias？

30 秒回答：

要记录采样概率和业务动作，保留随机 sentinel 样本，并按 segment 监控。否则模型只会学习被自己拦截、升级或展示过的世界。

2 分钟展开：

比如 fraud 模型只审核被拦截交易，就看不到放行交易里的漏报；collections 只观察被联系客户的还款，就无法判断未联系客户可能怎样反应。我的设计会记录 propensity、policy action、model output 和 customer outcome，保留随机 QA 样本，按 segment 分析错误和覆盖。对于客户伤害高的策略，不用无治理探索；必要时用低风险范围内的 controlled holdout 或 shadow evaluation 支撑因果判断。

13.9 如何把 active learning 用在 RAG QA？

30 秒回答：

RAG active learning 不是只挑用户点踩答案，而是选择 citation weak、answerability uncertain、source conflict、agent edit distance 大和高风险 intent 的回答做 claim-level QA。

2 分钟展开：

RAG 的标签对象应拆到 claim 和 evidence。Reviewer 判断每个 claim 是否被权威来源支持、来源是否当前有效、用户是否有权限、答案是否应该拒答或升级。Sampling policy 结合 retrieval score、LLM judge disagreement、用户重试、agent 修改和高风险意图。反馈不能只微调 prompt，也要进入 source governance、retriever ranking、taxonomy 和 escalation policy。高风险客户可见回答必须有人工升级和 unsupported claim 事件处理。

13.10 如何向高管解释标签预算 ROI？

30 秒回答：

我会用 learning curve、label efficiency、客户伤害降低、人工队列效率和模型风险证据来解释，而不是只说“标了多少条数据”。

2 分钟展开：

标签预算的价值不是数量，而是每个专家小时带来的风险降低和产品改善。我会对比主动采样和随机采样：同样 1,000 个标签，模型边界错误、高置信错误、RAG unsupported claim 或 fraud false positive 降低多少。还要看运营指标：队列 SLA、返工率、裁决率、reviewer calibration。对金融机构，高质量反馈还能形成 evidence binder，支持模型上线、审计和 incident response。这比单纯采购标注量更可解释。

14. 作品集表达

如果把本文转成作品集，可以用一个“金融零售 AI 反馈运营平台”案例展示：

Case: Active Learning and Human Feedback Platform for Customer-Facing Banking AI

Problem:
  银行在欺诈、KYC、投诉分类、催收策略和客服 RAG 中部署多个 AI 系统。
  模型团队缺少高质量标签，运营团队缺少统一队列，模型风险团队缺少反馈证据链。

Risk:
  人工反馈分散在业务系统、客服备注、申诉记录和 QA 表格中。
  直接拿生产 override 重训会放大 selection bias、automation bias 和 eval leakage。
  客户可见 AI 错误可能造成误导、误拒、投诉升级或补救成本。

Design:
  - event ingestion with model / policy / customer impact context
  - uncertainty and disagreement scoring
  - sampling policy engine with budget and harm exclusions
  - skill-based labeling queue
  - blind double review for high-risk cases
  - reviewer calibration and adjudication
  - feedback registry with usage tags
  - dataset versioning and eval set protection
  - release gate and evidence binder
  - monitoring for label quality, segment coverage, learning curve and customer harm

Use cases:
  - fraud investigator labeling for false positive and chargeback learning
  - KYC exception review with policy-versioned labels
  - complaints intent taxonomy drift detection
  - collections contact strategy feedback with customer constraints
  - RAG answer QA at claim and citation level

Evidence:
  - active learning learning curve versus random sampling
  - reviewer calibration report
  - adjudication root cause analysis
  - segment coverage and fairness dashboard
  - eval protection memo
  - model / RAG release gate memo
  - incident trigger and remediation workflow

Outcome:
  AI teams learn faster with fewer labels,
  operations teams review the right cases with better quality,
  product teams control customer harm,
  and model risk teams receive reproducible evidence for feedback-driven changes.

面试中的高级表达：

我不会把 human feedback 设计成一个“标注后台”。在金融零售里，它是 AI 产品的学习系统、运营系统和治理系统。真正高级的设计是同时回答：标什么、谁来标、怎么保证标签质量、哪些反馈能进训练、如何保护评估集、怎么防止反馈偏差，以及如何证明这套闭环没有伤害客户。