返回 Papers
AI 扩展计划 / Playbooks

AI Data Contracts / Lineage / Quality Playbook

这些来源作为方法锚点, 不替代企业内部 legal、compliance、model risk、privacy 和 architecture review 的正式判断。

897AI_DATA_CONTRACTS_LINEAGE_QUALITY_PLAYBOOK.md

AI Data Contracts / Lineage / Quality Playbook

定位: 面向 AI Data Product Manager / AI Product Architect / Data Architect / AI Governance / Risk Tech 的高级数据治理与产品化手册。 目标: 把 AI data contracts、lineage、metadata product、schema evolution、data quality SLO、drift、label governance 和 data incident response 连接成可上线、可审计、可运营的企业 AI 数据控制面。 核心观点: AI 数据治理不是“数据清洗”和“字段说明”, 而是为 AI use case 提供可签约、可追溯、可测试、可变更、可问责、可复盘的数据产品能力。


Source Anchors

这些来源作为方法锚点, 不替代企业内部 legal、compliance、model risk、privacy 和 architecture review 的正式判断。

AnchorLink本手册使用方式
OpenLineage Docshttps://openlineage.io/docs/将 job、run、dataset、facet 和 runtime lineage event 转成 AI 数据链路观测设计。
OpenLineage Object Modelhttps://openlineage.io/docs/spec/object-model/区分 runtime lineage、design-time job metadata、dataset metadata, 支撑训练、评测、RAG 和 feature pipeline 的证据链。
OpenLineage Facetshttps://openlineage.io/docs/spec/facets/用 facet 扩展 source code、schema、quality、model、prompt、RAG corpus、label batch 等 AI 元数据。
DataHub Data Contractshttps://docs.datahub.com/docs/generated/metamodel/entities/datacontract将 contract 表达为 schema、freshness、quality、SLA assertions, 并接入 CI/CD 和 quality 工具。
DataHub Lineagehttps://docs.datahub.com/docs/api/tutorials/lineage参考 table-level、column-level、data job、dashboard、chart 的 lineage 表达方式。
OpenMetadata Data Contractshttps://docs.open-metadata.org/v1.13.x/how-to-guides/data-contracts用 schema、semantics、security、quality assertions、SLA、terms of use 和 status 组织 contract。
OpenMetadata Data Lineagehttps://docs.open-metadata.org/v1.13.x/how-to-guides/data-lineage参考 table、column、pipeline、dashboard、ML model 的可视化 lineage 和 impact analysis。
OpenMetadata Quality Observabilityhttps://docs.open-metadata.org/v1.13.x/how-to-guides/data-quality-observability将 tests、profiler、alerts、incident manager 和 anomaly detection 纳入数据运营闭环。
Great Expectations GX Corehttps://docs.greatexpectations.io/docs/core/introduction/gx_overview/用 Expectation Suite、Validation Definition、Validation Result 和 Checkpoint 建立数据质量测试与报告。
Great Expectations Checkpointshttps://docs.greatexpectations.io/docs/core/trigger_actions_based_on_results/create_a_checkpoint_with_actions/将 validation results 转成 notification、Data Docs、custom action 和发布门禁。
NIST AI RMF Corehttps://airc.nist.gov/airmf-resources/airmf/5-sec-core/用 Govern / Map / Measure / Manage 组织 AI 数据风险治理。
NIST AI RMF GenAI Profilehttps://www.nist.gov/publications/artificial-intelligence-risk-management-framework-generative-artificial-intelligence将 GenAI 生命周期中的数据来源、评估、监控和风险响应转成 artifact。
本仓库 AI Data Product Management Playbookdocs/AI_DATA_PRODUCT_MANAGEMENT_PLAYBOOK.md本手册继续展开 contract、metadata、lineage、quality SLO 和 feedback loop。
本仓库 AI Requirements-to-Eval Cookbookdocs/AI_REQUIREMENTS_TO_EVAL_COOKBOOK.md将数据契约和 lineage 接到 eval contract、release gate、monitoring 和 incident loop。

1. 定位: AI 数据控制面的高级能力

这份手册不重复基础 BA、传统数据仓库或普通数据治理概念。它补的是 AI 产品和架构进入生产后必须具备的四类能力:

能力高级问题可交付证据
AI data contractsAI 工作流依赖的数据是否有 schema、语义、质量、权限、刷新、用途和变更承诺data contract、contract tests、schema change approval
Lineage一个模型输出、RAG 回答、eval 结果或特征值能否追溯到源记录、转换、版本、权限和质量检查lineage map、OpenLineage event、impact analysis
Metadata product元数据是否从被动目录升级为 AI runtime 的控制面metadata product canvas、catalog policy、ownership model
Quality operations数据质量下降、漂移、标签争议、语料过期和 schema 破坏能否被检测、拦截、分级和复盘quality SLO matrix、incident playbook、drift dashboard

一句话:

AI data product 不是“可以被模型使用的数据”, 而是能以 contract 形式对 AI 消费者负责的数据能力。


2. 为什么重要

企业 AI 的很多事故表面上是模型问题, 根因却是数据产品失控。

表面问题深层数据根因业务风险
KYC 文档抽取结果频繁错字段文档类型、OCR 版本、字段语义、置信度阈值没有 contract客户重复补件、开户延迟、监管抽查解释困难
AML copilot 总结 case 结论偏差case label 没有来源、审核人、时间点、jurisdiction 和分歧记录错误风险叙事、调查遗漏、模型评测失真
信贷模型表现突然下降上游申请渠道、收入字段、拒绝原因、客户分群发生 drift不公平结果、审批质量下降、模型风险升级
RAG 政策助手引用旧政策语料库版本、生效日期、下线日期和 index refresh 没有 SLO客服误导、员工错误执行、合规风险
客服对话 trace 无法复盘prompt、retrieval、tool call、human edit 和用户反馈没有统一 trace无法定位事故、无法沉淀 golden set
推荐系统转化波动特征定义、窗口口径、归因事件和实验分桶发生 schema 或 distribution 变化错误营销、客户体验下降、收入归因失真

AI 数据控制面要解决的不是单点数据质量, 而是跨 source、pipeline、metadata、contract、eval、runtime、feedback 和 incident 的端到端可证明性。


3. 能力地图

层级关键对象核心职责成熟交付物
Source layercore banking、CRM、KYC、AML、policy repo、call center、feature events明确 source-of-truth、owner、权限、保留期、业务语义source inventory、ownership map
Contract layerAI data contract、schema assertion、freshness assertion、usage policy让 producer 和 AI consumer 对可用性、质量和用途达成机器可测试承诺data contract、contract testing suite
Metadata product layerDataHub / OpenMetadata、glossary、domain、classification、data product让 metadata 成为 discovery、governance、runtime filtering 和 audit 的控制面metadata product canvas、catalog policy
Lineage layerOpenLineage event、column lineage、feature lineage、RAG corpus lineage记录数据从源系统到 AI 输出的证据链和影响范围lineage map、impact analysis report
Quality SLO layerGreat Expectations、OpenMetadata tests、DataHub assertions将 completeness、freshness、accuracy、consistency、validity、coverage、access correctness 转成 SLOquality SLO matrix、quality dashboard
Drift layerfeature drift、data drift、label drift、policy drift、corpus freshness drift识别训练、线上、评测和检索语料分布变化drift report、retraining or recrawl decision
Incident layerdata incident、AI incident、contract violation、schema break统一分级、止血、根因、修复、复盘和证据保全data incident playbook、post-incident review
Governance layerRACI、approval、release gate、model risk、privacy、audit把数据责任嵌入 AI 生命周期governance RACI、release evidence pack

4. Reference Architecture

flowchart LR
  S1[KYC / CRM / Core Banking] --> I[Ingestion and Transformation]
  S2[AML Case System] --> I
  S3[Credit LOS / Bureau / Ledger] --> I
  S4[Policy Repository] --> RAG[RAG Corpus Builder]
  S5[Contact Center / Trace Store] --> I
  S6[Digital Events / Feature Logs] --> F[Feature Pipeline]

  I --> OL[OpenLineage Events]
  F --> OL
  RAG --> OL

  OL --> M[Metadata Platform: DataHub / OpenMetadata]
  M --> C[AI Data Contracts]
  C --> Q[Quality Validation: GX / Assertions / Tests]

  Q --> DP[AI Data Products]
  DP --> FS[Feature Store]
  DP --> TS[Training Dataset Registry]
  DP --> ES[Eval Dataset Registry]
  DP --> KB[RAG Corpus Registry]
  DP --> LS[Label Store]

  FS --> AI[AI Services / Models / Agents]
  TS --> AI
  ES --> Gate[Eval and Release Gate]
  KB --> AI
  LS --> Gate

  AI --> T[AI Trace and Feedback]
  T --> M
  T --> Incident[Data and AI Incident Response]
  Incident --> C
  Incident --> Q

4.1 Control Plane

AI 数据控制面至少包含六个中心:

中心负责什么不能缺的能力
Metadata catalog发现、owner、domain、classification、glossary、usage、lineageDataHub 或 OpenMetadata 作为治理入口
Contract registry数据契约、schema、SLO、allowed AI use、change policy版本、审批、contract testing、violation 状态
Lineage backendjob、run、dataset、column、feature、corpus、eval sample 的证据链OpenLineage event、impact analysis、trace linkage
Quality runnerexpectation、assertion、checkpoint、anomaly detectionGreat Expectations、OpenMetadata tests、DataHub assertions
AI asset registrytraining set、eval set、RAG corpus、feature set、label setprovenance card、version、risk tier、owner
Incident workflowbreach detection、severity、containment、root cause、postmortem数据事故和 AI 输出事故联动

4.2 Runtime Instrumentation

OpenLineage 适合记录 pipeline job 的运行事实, 但 AI 数据链路需要扩展 facet。一个金融零售 AI pipeline 的 event 最少应能表达:

Facet字段用途
source facetsource_system、source_record_count、extract_window、jurisdiction解释数据来自哪里和覆盖什么
contract facetcontract_id、contract_version、assertion_status、breaking_change_flag连接 producer commitment 和 pipeline 状态
quality facetvalidation_suite_id、checkpoint_id、failed_expectations、severity将 quality results 进入 lineage graph
model data facettraining_dataset_id、feature_set_id、snapshot_time、sampling_policy追溯训练数据
eval data faceteval_dataset_id、golden_set_version、reviewer_pool、rubric_version追溯评测数据
RAG corpus facetcorpus_id、document_version、effective_date、index_version、chunking_policy追溯检索语料
label facetlabel_batch_id、label_source、review_status、inter_annotator_agreement管理标签质量和来源

4.3 Release Gates

AI 数据 release gate 不只检查模型分数, 还要检查数据契约是否可依赖。

Gate必过条件失败动作
G1 Contract Gatecontract active、owner assigned、allowed AI use 明确、schema assertions 通过阻止进入生产 pipeline
G2 Lineage Gatesource -> curated -> feature/training/eval/RAG -> AI output 有可查询 lineage不允许进入高风险 use case
G3 Quality Gatecritical SLO 通过, severe quality breach 为 0降级、回滚、人工复核
G4 Drift Gatefeature/data drift 在阈值内, label drift 有解释暂停自动化决策辅助或触发重评估
G5 Incident Readiness Gateseverity、owner、communication、containment、postmortem 流程已演练延迟扩大用户范围

5. 架构模式

5.1 AI Data Contract as API Boundary

把数据契约当成 AI 系统的 API 边界。模型、RAG、eval、feature pipeline 和 agent tool 不直接信任表或文件, 只信任通过 contract 的数据产品。

Contract areaAI-specific requirement金融零售例子
Schema字段类型、必填性、枚举、嵌套结构、nullable policydocument_type 只允许 passport、driver_license、utility_bill、bank_statement
Semantics字段含义、时间口径、单位、状态定义、业务优先级case_status=closed 表示调查结束, 不等于 SAR 已提交
Freshnesssource event 到 AI 消费可见的最大延迟KYC 文档抽取结果 P95 在 15 分钟内进入 review queue
Qualitycompleteness、validity、consistency、accuracy、coverage关键身份字段完整率 >= 99.5%
Permissionfield-level、row-level、retrieval entitlement、trace redaction客服可检索公开政策, 不可检索 AML investigation notes
Allowed AI useRAG、eval、training、routing、analytics、monitoring 的边界客服对话可用于质量分析和 eval 抽样, 不直接进入训练集
Change policyschema evolution、deprecation、backfill、dual-run、consumer approvalcredit feature 改口径必须重跑 validation 和 model risk impact review
Incident trigger什么 violation 触发停用、降级或人工复核政策库 freshness breach 超过 24 小时, RAG 只允许回答并提示人工确认

成熟设计不是“文档里有 contract”, 而是 contract 能被 CI、pipeline、catalog、quality runner 和 release gate 自动读取。

5.2 Metadata Product as AI Control Plane

Metadata product 的职责不是收藏描述, 而是让 AI runtime 能使用 metadata 做过滤、路由、解释和审计。

Metadata typeAI runtime 用途例子
Business metadata场景路由、用户意图映射、指标切片product_line、risk_type、customer_segment
Technical metadatafreshness、schema version、job run、source checksumingestion_time、contract_version、index_version
Governance metadataPII、consent、retention、policy classificationPII.Sensitive、retention_7y、no_model_training
Quality metadatavalidation result、failure history、SLO statusexpectation_suite_id、quality_score、breach_count
Lineage metadatasource records、transform jobs、downstream AI assetssource_table -> feature_set -> training_dataset
Eval metadatascenario、severity、rubric、expected behaviorAML_typology、critical_failure、expert_reviewed
Feedback metadatahuman edit、override reason、accepted suggestionwrong_policy、missing_evidence、tone_issue

AI Product Architect 要把 metadata platform 设计成 active metadata layer: RAG filter、model routing、access control、incident triage、impact analysis 和 release evidence 都从这里读取。

5.3 OpenLineage for Runtime and Design-Time Evidence

OpenLineage 的 job、run、dataset 模型适合把分散的 pipeline 变成统一 lineage graph。AI 场景要同时记录:

Lineage type记录对象关键问题
Runtime lineage每次 pipeline run、input dataset、output dataset、status、timestamp哪次运行生产了被模型使用的数据
Design-time job metadatajob 源码位置、声明输入输出、owner、代码版本如果代码变更, 影响哪些 AI assets
Dataset metadataschema、owner、documentation、classification、contract数据本身的定义和治理状态是什么
Column lineage字段级转换、派生字段、敏感字段传播risk_score 来自哪些源字段
AI asset lineagefeature set、training set、eval set、RAG corpus、label batch模型和回答依赖哪些版本

实践原则:

  • pipeline 的每个关键边界都要发 event: source extract、curation、quality validation、feature materialization、training snapshot、eval build、RAG indexing。
  • 对高风险字段保留 column-level lineage, 例如 income、risk_rating、customer_type、adverse_action_reason。
  • 对 RAG corpus 记录 document id、version、effective date、approval status、chunking policy、embedding model、index version。
  • 对 eval dataset 记录 query source、gold evidence、reviewer、rubric、risk tier、sample inclusion reason。

5.4 Schema Evolution without AI Breakage

schema evolution 在 AI 系统里比普通报表更危险, 因为模型可能静默吸收错误语义。

Change type兼容性处理方式AI 风险
Additive field通常兼容contract 增加字段, consumer 可选择使用新字段进入 prompt 或 training 前需用途审批
Rename field高风险建立 alias、dual-run、deprecation window、consumer migrationprompt、feature code、eval parser 静默失效
Type change高风险contract test 阻断, migration plan, backfill validation数值被当字符串、日期解析错误
Enum expansion中高风险新枚举进入 glossary 和 routing policy, 更新 eval cases模型未学习新状态, rule fallback 漏掉
Semantic change最高风险新 contract major version, impact analysis, model/eval rerun字段名不变但业务含义改变
Nullability change中高风险completeness SLO 重算, consumer fail-fast摘要缺字段、模型输入缺失偏差
Aggregation window change高风险versioned metric / feature definition, drift comparison特征分布变化, 线上离线不一致

版本策略:

Version适用示例
patch文档描述、owner、非行为性 metadata 修正kyc_doc_contract_v1.0.1
minor兼容增加字段、增加质量测试、增加标签credit_feature_contract_v1.1
major字段删除、重命名、语义改变、SLO 降级、用途边界改变aml_label_contract_v2

5.5 Data Quality SLO as Product Commitment

Great Expectations、OpenMetadata tests 和 DataHub assertions 都应服务于同一个目标: 让数据质量成为 AI 产品的 SLO, 不是事后报告。

SLO dimensionAI-specific measureRelease relevance
Freshnesssource event 到 feature/RAG/eval 可用的延迟防止旧政策、旧账户状态、旧标签进入 AI
Completeness关键字段、关键文档、关键标签覆盖率防止模型基于缺证据输出
Validity类型、范围、枚举、格式、业务规则防止输入解析错误
Consistency跨系统、跨表、跨版本口径一致防止客户、case、产品状态冲突
Accuracy与 source-of-truth 或人工审核对账防止事实错误
Coverage场景、分群、边界样本覆盖防止 eval 或训练样本偏
Access correctness权限过滤、脱敏、用途限制防止数据泄露
Trace completenessAI request 是否有 source、version、prompt、retrieval、tool、review支撑事故复盘

5.6 Feature/Data Drift and Policy Drift

AI 数据漂移需要按消费方式分层处理。

Drift type检测对象触发问题产品动作
Feature driftnumerical / categorical feature distribution线上申请渠道变化、收入字段分布异常触发 model performance review 和 feature owner review
Data driftsource records、document types、conversation topics新 KYC 文档类型、客服投诉主题变化扩展 contract、更新 parser、补 eval cases
Label driftlabel distribution、reviewer agreement、case outcome lagAML disposition 标准变化、审核团队口径不一致重新校准 label rubric 和 adjudication
Corpus driftpolicy docs、effective dates、approval status旧政策未下线、新政策未入库recrawl、reindex、RAG freshness warning
Schema driftcolumns、types、nested structure、enumupstream 发布破坏性变更contract test fail-fast
Usage driftAI consumer、新下游、prompt 变更数据被用于未批准训练或外部分享purpose limitation review

5.7 Training / Eval / RAG Corpus Lineage

训练数据、评测数据和 RAG 语料都叫数据, 但治理目标不同。

AssetLineage 必填项质量重点典型事故
Training dataset lineagesource snapshot、feature definition、label source、sampling、exclusion、PII handlingrepresentativeness、leakage、label validity、feature consistency训练集混入未来信息或不合规数据
Eval dataset lineagescenario source、gold evidence、expected behavior、reviewer、rubric、severitycoverage、review quality、rubric stability、regression valueeval 集和线上风险不匹配
RAG corpus lineagesource doc、approval status、effective date、chunking、embedding、index versionfreshness、authority、permission、citation granularity引用旧政策或未批准草稿

5.8 Label Governance

标签是 AI 风险系统里的决策资产, 不是普通字段。

Label areaGovernance requirement金融零售例子
Label definitionlabel ontology、positive/negative criteria、exclusionsAML suspicious_activity_confirmed 的定义和排除项
Label sourceexpert、operation outcome、customer feedback、LLM-assisted、syntheticSAR filing、case closure、QA outcome
Reviewer modelreviewer role、training、dual review、adjudicator高风险 AML case 双人复核
Agreement metricinter-annotator agreement、conflict rate、review latencyAML typology 标签一致性 >= 0.85
Versioninglabel rubric version、policy version、jurisdictionBSA/AML 规则更新后标注版本变化
Leakage control标签是否引用未来信息、人工结论时间点信贷违约标签不能泄露审批后才知道的信息
Auditabilitylabel event、reviewer、timestamp、evidence监管或模型风险审查可复现

5.9 Contract Testing

contract testing 要覆盖三层:

Test layer测试内容运行时机
Producer contract testsschema、semantic metadata、quality assertions、freshness、row/column rulespipeline build、release、daily run
Consumer contract testsprompt parser、feature code、eval loader、RAG indexer 对 contract 的依赖AI service build、model retrain、index rebuild
Change impact testsschema evolution、downstream lineage、model/eval/RAG assets impactpull request、catalog contract change、source release

高风险 AI use case 的 contract test fail 应直接阻断生产发布, 不能只发通知。


6. 金融零售案例蓝图

场景数据链路关键 contractLineage 重点Quality / Drift 重点
KYC 文档抽取数据链路upload -> OCR -> extraction -> validation -> review queue -> customer profile文档类型、字段置信度、PII 分类、review 状态、人工修正文档版本、OCR 模型、extractor 版本、reviewer editextraction accuracy、critical field completeness、stale document detection
AML case labelstransaction monitoring -> alert -> case investigation -> disposition -> label store -> eval/traininglabel definition、jurisdiction、case status、reviewer、evidence completenesslabel batch、case evidence、SAR decision、QA samplelabel agreement、case outcome lag、typology distribution drift
信贷模型训练数据application -> bureau -> income verification -> feature pipeline -> training snapshotfeature definition、time window、leakage exclusion、allowed usesource snapshot、feature code、model dataset versionfeature drift、missing income, adverse action reason coverage
RAG 政策库版本policy repo -> approval -> chunk -> embed -> index -> answer citationapproval status、effective date、expiry date、jurisdiction、role accesssource doc、chunk id、embedding model、index versioncorpus freshness、citation correctness、permission correctness
客服对话 trace 数据conversation -> intent -> retrieval/tool -> answer -> human edit -> feedbackconsent、PII redaction、logging purpose、retention、feedback labelsrequest trace、prompt version、retrieval docs、tool calls、human editstrace completeness、unsafe response, policy drift, topic drift
推荐系统特征数据event stream -> identity resolution -> feature aggregation -> model scoring -> campaign responseevent schema、identity match、window definition、opt-out policyevent source、feature window、experiment assignmentfeature drift、event loss、consent change、conversion label lag

6.1 KYC 文档抽取

架构重点:

  • 对每类文件建立 document contract: 文件类型、字段集合、置信度阈值、可接受缺失、人工复核规则。
  • OCR 和 extractor 版本进入 lineage, 人工修正作为 feedback label 回流。
  • 身份字段进入 customer profile 前必须通过 critical field SLO, 例如姓名、出生日期、证件号、地址。
  • 文档有效期和地区规则进入 metadata, 不能只存在 prompt 或人工 SOP 中。

关键交付物:

  • kyc_document_extraction_v1 data contract。
  • source document -> OCR run -> extraction result -> validation -> reviewer edit 的 lineage map。
  • extraction accuracy、field completeness、review override rate、cycle time 的 SLO matrix。

6.2 AML Case Labels

架构重点:

  • AML label 必须区分 alert outcome、case disposition、SAR filed、typology、QA finding, 不能合成一个模糊 label 字段。
  • 标签要记录 reviewer role、decision timestamp、evidence references、rubric version、jurisdiction。
  • 训练和 eval 使用不同 label snapshot, 并明确 case closure lag, 避免将未来结论泄露到模型训练。

关键交付物:

  • AML label governance rubric。
  • label batch provenance card。
  • label drift dashboard: typology mix、agreement、dispute rate、late outcome update。

6.3 信贷模型训练数据

架构重点:

  • 特征定义必须版本化, 包括时间窗口、聚合口径、排除规则、source priority。
  • income、employment、bureau score、existing relationship 等字段必须有 source-of-truth 和 reconciliation rule。
  • adverse action reason、reject reason 和 model reason code 要在 lineage 中能追溯到特征和规则。

关键交付物:

  • credit feature contract。
  • training dataset lineage card。
  • feature/data drift report 和 model risk impact memo。

6.4 RAG 政策库版本

架构重点:

  • 政策文档进入 RAG index 前必须是 approved 状态, 并带生效日期、失效日期、jurisdiction、business unit、role access。
  • chunking policy、embedding model、index version 必须进入 corpus lineage。
  • 引用必须能回到文档版本和段落级证据, 并能识别新旧政策冲突。

关键交付物:

  • RAG corpus freshness policy。
  • policy corpus lineage map。
  • citation support 和 stale source incident playbook。

6.5 客服对话 Trace 数据

架构重点:

  • 每次对话 trace 要连接 user intent、prompt version、retrieved evidence、tool call、model version、human edit、feedback。
  • 对话日志进入 eval 或训练前必须完成 consent、PII redaction、purpose limitation 和 retention 检查。
  • 用户不满意和人工改写要分类为 factual error、missing evidence、tone issue、policy issue、tool issue。

关键交付物:

  • customer service trace contract。
  • trace completeness SLO。
  • feedback-to-eval conversion policy。

6.6 推荐系统特征数据

架构重点:

  • 事件 schema、identity resolution、feature window、campaign assignment 和 conversion label 要分别建 contract。
  • opt-out、consent、sensitive segment exclusion 必须作为 feature pipeline 的 hard control。
  • 特征漂移和事件丢失要与 campaign performance 一起看, 不能只看模型 AUC 或 CTR。

关键交付物:

  • feature contract 和 event contract。
  • recommendation feature lineage map。
  • feature/data drift monitor 和 campaign rollback rule。

7. 产品决策框架

决策选项推荐判断
Contract orientationproducer-owned、consumer-specific、hybrid高复用核心数据用 producer-owned; 高风险 AI consumer 可追加 consumer-specific assertions
Enforcement modewarn、quarantine、block高风险合规、信贷、客户影响场景对 critical breach 使用 block
Metadata platformDataHub、OpenMetadata、组合使用以组织现有生态和自动化能力为准; 不把 catalog 当静态 wiki
Lineage granularitytable、column、record、chunk、feature高风险字段、RAG citation、model training 需要 column/chunk/feature 级
Schema evolutionbackward compatible、versioned break、dual-run语义变化和字段删除必须 major version + impact analysis
Quality toolingGX、OpenMetadata tests、DataHub assertions、dbt testscontract 层统一表达, 执行工具可组合
Drift actionmonitor、review、retrain、rollback、human-onlydrift 影响客户权益或合规义务时, 先降级再评估
Label sourceexpert、operational、LLM-assisted、synthetic高风险 label 以 expert 或 audited operational label 为主
RAG recrawltime-based、event-based、approval-based政策类 corpus 以 approval event 和 effective date 为核心
Incident severitydata only、AI output affected、customer/regulatory affected一旦影响客户决策、监管义务或权限泄露, 升到高严重度

ADR 问题清单:

ADR section问题
Context哪个 AI use case 依赖这份数据, 风险等级是什么
Decisioncontract、lineage、quality、drift、incident 的边界如何定义
Alternatives只做 catalog、只做 quality tests、只做 pipeline monitoring 的不足是什么
Consequences对发布速度、数据 producer 成本、审计证据、AI 质量的影响是什么
Review trigger哪些 schema、policy、model、regulatory 或 incident 事件会触发 ADR 复核

8. Governance and Operating Model

8.1 RACI

Artifact / ActivityAI DPMAI Product ArchitectData ArchitectData OwnerRisk / ComplianceMLOps / Data EngSecurity / Privacy
AI data contractACRACRC
Lineage mapCARCCRC
Quality SLO matrixACRACRC
Schema change approvalCARACRC
Eval dataset provenance cardACCCRRC
RAG corpus freshness policyACRACRC
Label governance rubricACCRARC
Data incident responseAARRARR

8.2 NIST AI RMF Mapping

Function数据控制面动作Evidence
Govern定义 ownership、policy、allowed use、RACI、approval、third-party data controlsgovernance charter、RACI、policy mapping
Map识别 AI use case、source-of-truth、data flows、risk tier、affected stakeholdersdata flow map、lineage map、risk tier memo
Measure建立 contract tests、quality SLO、drift metrics、label agreement、trace completenessvalidation results、drift report、quality dashboard
Manage运行 release gates、incident response、rollback、data repair、post-incident reviewrelease evidence pack、incident report、change log

8.3 Governance Cadence

Cadence会议对象输入输出
WeeklyAI DPM、Data Owner、Data Eng、MLOpscontract violations、quality breach、schema changesrepair actions、release blockers
MonthlyAI Product Architect、Risk、Compliance、Privacydrift trends、incident trends、label disputes、RAG freshnesscontrol updates、eval refresh decisions
QuarterlyGovernance board、Model Risk、Security、Business Ownerhigh-risk AI portfolio、audit findings、SLO historyrisk acceptance、funding, decommission decisions
Event-drivenIncident respondersdata breach、schema break、wrong output、policy corpus stalecontainment、customer/regulatory impact assessment

9. 可落地交付物模板

9.1 Data Contract Template

以下是 kyc_document_extraction_v1 的完整样例, 可作为 AI data contracts 的结构模板。

contract_id: kyc_document_extraction_v1
contract_version: 1.0.0
status: active
domain: retail_banking_kyc
producer:
  team: kyc_data_platform
  owner: kyc_data_owner
  steward: kyc_operations_steward
consumers:
  - kyc_document_extraction_service
  - kyc_review_queue
  - kyc_quality_eval_harness
source_of_truth:
  system: enterprise_document_management
  source_dataset: kyc_uploaded_documents
  conflict_rule: reviewed_extraction_overrides_raw_ocr
schema:
  document_id:
    type: string
    required: true
    pii_class: internal_identifier
  customer_id:
    type: string
    required: true
    pii_class: direct_identifier
  document_type:
    type: enum
    required: true
    allowed_values:
      - passport
      - driver_license
      - utility_bill
      - bank_statement
  extracted_fields:
    type: object
    required: true
    fields:
      full_name:
        type: string
        required: true
        confidence_min: 0.92
      date_of_birth:
        type: date
        required: true
        confidence_min: 0.95
      address:
        type: string
        required: false
        confidence_min: 0.90
  extraction_model_version:
    type: string
    required: true
  review_status:
    type: enum
    required: true
    allowed_values:
      - auto_accepted
      - human_review_required
      - human_corrected
      - rejected
semantics:
  full_name: customer legal name extracted from approved identity evidence
  review_status: operational status after automated validation and human review
freshness_slo:
  p95_minutes_from_upload_to_extraction: 15
  p99_minutes_from_review_to_profile_update: 60
quality_slo:
  critical_field_completeness_min: 0.995
  document_type_validity_min: 0.999
  reviewer_override_rate_review_threshold: 0.08
permissions:
  row_level_rule: assigned_branch_or_kyc_operations
  field_redaction:
    date_of_birth: masked_outside_kyc_and_compliance
    document_id: visible_to_support_with_case_context
allowed_ai_use:
  rag: false
  eval: true
  training: true
  routing: true
  customer_facing_generation: false
retention:
  raw_document: governed_by_kyc_record_policy
  extracted_fields: governed_by_customer_profile_policy
  eval_samples: redacted_and_reviewed
change_policy:
  additive_field: minor_version_with_consumer_notice
  semantic_change: major_version_with_risk_review
  enum_change: minor_version_with_eval_refresh
incident_triggers:
  permission_leakage: severity_0
  critical_field_completeness_below_slo: severity_1
  freshness_p95_breach_two_runs: severity_2
contract_tests:
  - schema_assertion_required_columns
  - enum_assertion_document_type
  - completeness_assertion_critical_fields
  - freshness_assertion_upload_to_extraction
  - permission_assertion_branch_scope

9.2 Lineage Map Template

Node IDAssetTypeOwnerContractQuality GateDownstream AI UseEvidence
src_kyc_docskyc_uploaded_documentssource table / object storeKYC Data Ownerkyc_document_extraction_v1upload integrity checkextraction, reviewsource checksum、upload timestamp
job_ocrOCR extraction jobOpenLineage jobData Engineeringocr_input_contract_v1OCR success ratedocument parsingrun id、OCR model version
ds_extractedkyc_extracted_fieldscurated datasetKYC Data Platformkyc_document_extraction_v1GX checkpoint kyc_extract_dailyprofile update、evalvalidation result、failed rows
job_reviewhuman review workflowoperational jobKYC Operationskyc_review_contract_v1reviewer completenesslabel feedbackreviewer id hash、edit reason
ai_evalkyc_extraction_eval_v2026_06eval datasetAI DPMeval_dataset_contract_v1provenance card approvedmodel release gaterubric version、sample reasons
flowchart LR
  A[Uploaded KYC Document] --> B[OCR Run]
  B --> C[Extraction Result]
  C --> D[Quality Validation]
  D --> E[Human Review]
  E --> F[Customer Profile Update]
  E --> G[Eval Dataset]
  C --> H[Training Snapshot]
  D --> I[Contract Violation Dashboard]

Lineage map 审核问题:

QuestionPass evidence
每个 AI 输出能否回到 source record 和 transform runtrace_id、run_id、dataset version
高风险字段是否有 column-level lineagecolumn transform、source field、derived rule
数据变更能否做 downstream impact analysiscatalog lineage query、consumer list
质量检查结果是否在 lineage 中可见validation result linked to run
人工修正是否成为 feedback lineagereviewer edit event、reason code

9.3 Quality SLO Matrix

Data ProductSLOTargetMeasurementBreach SeverityAutomated ActionOwner
KYC extractioncritical field completeness>= 99.5%GX expectation over extracted fieldsS1quarantine failed batch and route to human reviewKYC Data Owner
AML labelsreviewer agreement>= 0.85agreement score by typology and jurisdictionS2pause new label ingestion for disputed typologyAML QA Lead
Credit training setfeature null rate for income<= 1.0%feature validation checkpointS1block training snapshot promotionCredit Data Owner
Policy RAG corpusapproved active document coverage100% for in-scope policy setcorpus registry vs policy repositoryS1block index promotionPolicy Owner
Customer service tracetrace completeness>= 98%spans with prompt, retrieval, model, response, feedback fieldsS2exclude incomplete trace from eval conversionAI Platform Owner
Recommendation featuresevent ingestion freshness P95<= 10 minutesevent time to feature availabilityS2switch campaign scoring to last stable feature snapshotGrowth Data Owner
All high-risk AI datasetspermission assertion pass rate100% sampled passaccess test and redaction testS0disable affected endpoint and open security incidentSecurity Owner

SLO 设计原则:

  • SLO 要按 AI use case 风险等级分层, 不能全局统一。
  • critical fields 的 breach 动作要在 contract 中预定义。
  • low-risk analytics 可以 warn, regulated decision support 必须 block 或 human-only。
  • 平均值不能掩盖分群风险, 需要按 jurisdiction、product、channel、customer segment 切片。

9.4 Schema Change Approval

Section内容
Change IDschema_change_credit_features_2026_06_income_window
Requested byCredit Data Platform
Affected contractcredit_feature_contract_v2.3
Change typesemantic change and aggregation window change
Current definitionverified_income_90d_avg 使用过去 90 天 verified income records
New definitionverified_income_180d_avg 使用过去 180 天 verified income records, 排除 disputed records
Affected consumerscredit scoring model, adverse action reason service, model monitoring, portfolio analytics
Downstream impactfeature distribution shift expected; adverse action reason mapping requires rerun; existing eval slices require refresh
Required testsschema assertion, null rate check, distribution comparison, model backtest, fairness slice review, reason code consistency
Rollbackkeep verified_income_90d_avg materialized for 60 days and preserve model route switch
ApproversData Owner, AI Product Architect, Model Risk, Credit Policy Owner
Decisionapproved for shadow mode and blocked from production promotion until SLO and backtest pass

Approval checklist:

CheckEvidence
Contract version updatedmajor or minor version chosen with reason
Lineage impact generateddownstream AI assets listed
Quality tests updatedexpectations and assertions changed
Eval and model impact reviewedbacktest and regression report available
Communication completedproducers and consumers informed through catalog and release notes
Rollback path verifiedprevious dataset and feature definition available

9.5 Eval Dataset Provenance Card

FieldValue
Eval dataset IDaml_case_narrative_eval_v2026_06
PurposeEvaluate AML copilot case summary grounding, completeness, escalation, and policy compliance
Source systemsAML case management, transaction monitoring alerts, QA review outcomes
Source time windowcases closed from 2025-07-01 to 2026-05-31
Sampling policystratified by typology, jurisdiction, risk tier, case complexity, historical failure mode
Exclusion policyactive investigations, legally restricted cases, cases with unresolved QA dispute
PII handlinganalyst-facing eval uses masked customer identifiers; expert reviewers access source evidence through approved case system
Gold evidencetransaction summary, alert rationale, analyst notes, approved disposition, QA findings
Label sourceexpert AML review and audited operational outcomes
Reviewer modeltwo expert reviewers for high-risk cases, adjudicator for disagreement
Rubric versionaml_summary_rubric_v3
Severity tagsunsupported_claim, missing_evidence, under_escalation, wrong_typology, policy_violation
Known limitationsclosed-case distribution underrepresents newly emerging typologies; drift review monthly
Refresh cadencemonthly minor refresh, quarterly rubric calibration
OwnerAI DPM for AML Copilot
Risk sign-offAML Compliance and Model Risk

Provenance card quality bar:

  • 每条 eval case 都有 source reference、expected behavior、unacceptable behavior、severity 和 reviewer record。
  • 评测集不混入未授权生产 PII 副本。
  • LLM-assisted labels 只能作为 reviewer 工作辅助, 不能作为高风险 golden label 的唯一来源。
  • eval dataset lineage 能连接到 source case、label batch、rubric version 和 release gate。

9.6 RAG Corpus Freshness Policy

Policy areaRule
Corpus IDretail_policy_rag_corpus
Source-of-truthapproved policy repository, not shared drive, email attachment, or draft folder
Inclusion ruledocument status must be approved, effective date active, jurisdiction and business unit assigned
Exclusion ruledraft, expired, superseded, legal-review-only, no-RAG-use documents
Freshness SLOapproved policy change indexed within 4 hours; emergency regulatory bulletin indexed within 60 minutes
Version handlingold version remains retrievable only for historical-date questions and is excluded from current-policy answers
Chunking policysection-aware chunks with page, section, paragraph, table row, effective date and policy owner metadata
Embedding policyembedding model version recorded; re-embed triggered by model change, chunking change, or major policy format change
Permission policyrole, region, business unit and document classification filter applied before context assembly
Citation policyanswer must cite current active policy version and section; conflicting policies trigger escalation message
Staleness actionif corpus freshness SLO breached, customer-facing answers disabled and employee-facing answers display stale-source warning
Auditevery answer stores corpus_id、index_version、doc_ids、chunk_ids、effective_date、retrieval filters

9.7 Data Incident Playbook

PhaseActionOwnerEvidence
Detectcontract violation, quality SLO breach, lineage gap, drift alert, wrong AI output, access failureMonitoring Owneralert id、dataset、contract、time window
Classifyassign severity based on data scope, AI output impact, customer impact, regulatory impactAI DPM + Riskseverity decision log
Containblock pipeline, quarantine batch, roll back feature/RAG index, disable affected route, switch to human reviewData Eng + MLOpscontainment action record
Assess impactidentify downstream models, eval sets, RAG corpora, traces, reports, customers, analystsData Architectlineage impact report
Repairfix data, rerun validation, rebuild asset, refresh eval or labels, re-run release gateData Owner + MLOpsrepair run id、validation result
Communicatenotify business, risk, compliance, security, affected product ownersIncident Leadcommunication log
Decide restartverify SLO pass, downstream impact cleared, risk sign-off completedAI Product Architectrestart approval
Reviewroot cause, missed detection, control improvement, contract update, training updateGovernance Boardpost-incident review

Severity examples:

SeverityTriggerRequired response
S0unauthorized PII exposed through RAG context or trace logsimmediate endpoint disablement, security incident, legal/privacy review
S1stale policy corpus causes wrong customer-facing answerdisable customer-facing response path, notify compliance, rebuild corpus
S2credit feature freshness breach affects scoring shadow pipelineblock model promotion and rerun validation
S3non-critical metadata owner missing for low-risk analytics datasetcatalog correction within governance cadence

9.8 Contract Testing Matrix

TestExampleTooling patternBlock condition
Schema required columnscustomer_id, document_type, review_status existDataHub schema assertion, OpenMetadata contract, GX expectationmissing critical column
Type and enum validationdocument_type in approved valuesGX expectation, dbt test, OpenMetadata no-code testinvalid enum in production batch
Freshness validationpolicy doc indexed within 4 hoursDataHub freshness assertion, custom checkpointfreshness breach for high-risk RAG
Completeness validationcredit income feature null rate <= 1%GX checkpoint with actioncritical feature breach
Semantic metadata validationowner、domain、description、PII tag presentOpenMetadata semantics and governance checkshigh-risk dataset lacks owner or PII tag
Permission validationanalyst cannot retrieve cases outside assignmentintegration test against retrieval endpointunauthorized access possible
Downstream compatibilityprompt parser, feature loader, eval loader pass contract versionCI consumer testsconsumer fails against new contract
Lineage completenessrun emits source, input, output, contract and quality facetsOpenLineage event validationmissing lineage for regulated AI asset

10. 30天训练计划

Day训练主题当日产出
1选择一个金融零售 AI use case, 明确 risk tier 和 AI data productsone-page use case data map
2盘点 source-of-truth、系统 owner、数据消费者和 allowed AI usesource inventory
3为核心数据产品写第一版 AI data contractcontract draft
4把 schema、semantics、security、quality、SLA、terms of use 拆成 assertionsassertion catalog
5设计 producer contract tests 和 consumer contract testscontract testing matrix
6绘制 source -> curated -> AI asset 的 lineage maplineage map
7设计 OpenLineage event 和 AI-specific facetslineage event spec
8定义 metadata product: domain、owner、classification、glossary、data productmetadata product canvas
9用 DataHub 或 OpenMetadata 思路设计 catalog policycatalog governance policy
10为一个数据产品设计 Great Expectations / assertion suitequality validation suite
11建立 quality SLO matrix, 分 low / medium / high riskSLO matrix
12设计 schema evolution policy 和 approval workflowschema change approval template
13用 KYC 文档抽取场景演练 contract + lineage + SLOKYC case artifact pack
14用 AML case labels 场景设计 label governancelabel rubric and provenance card
15用信贷模型训练数据设计 training dataset lineagetraining data lineage card
16用 RAG 政策库设计 corpus freshness 和 citation policyRAG corpus freshness policy
17用客服 trace 数据设计 trace contract 和 feedback taxonomytrace data contract
18用推荐特征数据设计 feature drift monitorfeature drift report format
19设计 eval dataset provenance card 和 golden set refresh cadenceeval provenance card
20设计 feature/data drift、label drift、corpus drift 指标drift metric catalog
21设计 data incident severity matrix 和 containment actionsdata incident playbook
22做一次 schema breaking change tabletop exerciseimpact analysis memo
23做一次 stale RAG corpus incident tabletop exerciseRAG incident report
24做一次 label dispute and adjudication exerciselabel governance decision log
25将 NIST AI RMF Govern / Map / Measure / Manage 映射到 artifactsAI data risk control matrix
26设计 release gates: contract、lineage、quality、drift、incident readinessrelease gate checklist
27准备 DataHub / OpenMetadata / OpenLineage / GX 的架构选型说明tooling ADR
28把一个案例整理成作品集叙事: problem -> architecture -> controls -> outcomesportfolio story
29练习 10 道面试题, 每题 30 秒和 2 分钟版本interview answer sheet
30做综合复盘: artifacts、风险、缺口、下一轮深化方向30-day capstone pack

训练交付物最低标准:

  • 至少一个完整 data contract。
  • 至少一张 lineage map, 覆盖 source、transform、quality、AI asset、runtime trace。
  • 至少一张 quality SLO matrix, 带 breach action。
  • 至少一个 eval dataset provenance card。
  • 至少一个 RAG corpus freshness policy。
  • 至少一个 data incident playbook。
  • 至少一个 schema change approval。
  • 至少一份 tooling ADR。

11. 面试题与回答

Q1: AI data contracts 和传统数据字典有什么本质区别?

30秒版本: 数据字典解释字段, AI data contract 承诺 AI consumer 可以如何可靠、安全、合规地使用数据。它覆盖 schema、语义、质量 SLO、freshness、权限、allowed AI use、变更流程和 incident trigger, 并且应该能被 contract tests 自动验证。

2分钟版本: 在 AI 系统里, 数据变化不会只影响报表, 还会影响模型训练、RAG 引用、eval 结果、agent routing 和客户/员工决策。传统数据字典通常是描述性资产, 无法阻止 schema breaking change、旧政策入库、标签口径漂移或权限泄漏。AI data contract 是 producer 和 AI consumer 之间的可执行承诺: 生产者承诺结构、语义、刷新、质量和用途; 消费者声明如何使用以及破坏时如何降级。金融零售中, 例如 KYC 文档抽取字段如果没有 contract, 模型可能把 OCR 低置信度地址写入客户资料; 有 contract 后, completeness、confidence、review_status 和 human review 都会成为上线门禁。

Q2: 你会如何设计 AI lineage, 才能支持事故复盘?

30秒版本: 我会把 lineage 设计到 source、pipeline、contract、quality、AI asset 和 runtime trace 六层。每个模型输出或 RAG 回答都能回到 source records、transform run、contract version、quality result、index/model/eval version 和用户可见证据。

2分钟版本: 只做 table-level lineage 不够。AI 事故常发生在字段、chunk、feature、label 或 prompt 级别。我会使用 OpenLineage 记录 pipeline run、input/output datasets 和 facets, 在 metadata catalog 中连接 DataHub/OpenMetadata 的 dataset、column、job、contract 和 owner。对 RAG, lineage 要记录 source doc、effective date、chunk id、embedding model、index version、retrieval filters 和 citations。对训练, 要记录 source snapshot、feature definitions、label source 和 sampling policy。对 eval, 要记录 gold evidence、rubric、reviewer 和 severity。这样 stale policy、错误 label、schema break 或权限泄漏都能通过 lineage impact analysis 找到受影响模型、答案和客户流程。

Q3: Schema evolution 在 AI 项目里为什么需要更严格?

30秒版本: AI consumer 可能静默吸收字段语义变化, 不像普通 API 那样立刻报错。字段名不变但口径变了, 可能导致训练数据偏差、eval 失真、RAG 错引和特征漂移。

2分钟版本: 我会把 schema evolution 分为 additive、rename、type change、enum expansion、semantic change、nullability change 和 aggregation window change。新增字段一般 minor version, 但语义变化、字段删除和窗口口径变化必须 major version, 触发 downstream impact analysis、contract tests、model/eval rerun 和 risk review。例如信贷 verified_income_90d_avg 改成 180 天窗口, 字段类型不变, 但模型特征分布和 adverse action reason 都会变化。正确做法是 dual-run、backfill validation、drift comparison、shadow mode 和明确 rollback。

Q4: Training data lineage、eval dataset lineage 和 RAG corpus lineage 怎么区分?

30秒版本: 训练数据 lineage 证明模型学到了什么; eval lineage 证明测试样本和 gold evidence 是否可信; RAG corpus lineage 证明回答检索了哪个版本的权威知识。三者都要有来源和版本, 但治理重点不同。

2分钟版本: 训练数据重点是 source snapshot、feature definitions、label source、sampling、exclusion、PII handling 和 leakage control。eval 数据重点是 scenario source、gold evidence、expected behavior、reviewer、rubric、severity 和 refresh cadence。RAG 语料重点是 source document、approval status、effective date、expiry date、chunking policy、embedding model、index version、permission filters 和 citation support。金融场景中, 把它们混在一起会出问题: 客服日志可以用于 eval 抽样, 但未必能用于训练; 过期政策可用于历史问题, 但不能支撑当前政策回答。

Q5: 如何设定 data quality SLO?

30秒版本: 从 AI use case 风险和业务后果倒推 SLO。高风险场景设置 hard-stop 指标, 如权限正确率 100%、关键字段完整率 >= 99.5%、RAG 当前政策覆盖 100%。低风险场景可用 warn 和人工修复。

2分钟版本: 我不会只设通用 null rate。AI data quality SLO 至少覆盖 freshness、completeness、validity、consistency、accuracy、coverage、access correctness 和 trace completeness。KYC 抽取关注关键字段完整率、置信度和 reviewer override rate; AML labels 关注 reviewer agreement、dispute rate 和 outcome lag; RAG 政策库关注 approved active document coverage、index freshness 和 citation correctness; 推荐特征关注事件 freshness、identity match 和 feature drift。每个 SLO 都要有 measurement、target、severity、automated action 和 owner。

Q6: Feature/data drift 发现后产品经理应该如何决策?

30秒版本: 先判断漂移是否影响客户权益、合规义务或关键业务决策。低风险可监控和解释; 中高风险要降级、暂停自动化、切到人工复核、重跑 eval 或触发模型/数据修复。

2分钟版本: 漂移不是单一模型指标。feature drift 可能来自渠道变化、上游 schema 变化或真实业务变化; data drift 可能来自新文档类型、新客户群或新政策; label drift 可能来自审核口径变化; corpus drift 可能来自政策更新。产品决策要看 drift 的原因和影响范围。例如信贷收入特征漂移影响审批辅助, 应进入 model risk review; 政策库 corpus freshness breach 影响客服回答, 应禁用客户可见回答并显示人工确认; 推荐系统事件漂移影响营销, 可以回滚到稳定特征快照并暂停相关 campaign。

Q7: DataHub、OpenMetadata、OpenLineage、Great Expectations 在架构里分别承担什么角色?

30秒版本: OpenLineage 记录 lineage events; DataHub/OpenMetadata 作为 metadata catalog 和 governance control plane; Great Expectations 做数据质量 expectations、validation 和 checkpoints。它们不是互斥工具, 而是控制面的不同层。

2分钟版本: OpenLineage 适合从 pipeline runtime 采集 job、run、dataset 和 facets, 让 lineage graph 有执行证据。DataHub 和 OpenMetadata 管理 data assets、owner、domains、glossary、classification、contracts、lineage、quality signals 和 governance workflow。Great Expectations 把数据假设转成可运行的 Expectation Suites、Validation Definitions、Validation Results 和 Checkpoints, 并通过 actions 发通知或更新文档。在成熟架构里, contract 定义在 catalog, validation 由 GX 或内置 assertions 执行, 结果回写 catalog, lineage 记录每次运行和数据产物。

Q8: AML case labels 为什么需要 label governance?

30秒版本: AML label 影响训练、eval、风险叙事和模型上线判断。没有 label definition、reviewer、jurisdiction、evidence、rubric 和 disagreement handling, 模型会学习错误或不一致的调查结论。

2分钟版本: AML 的 suspicious_activity_confirmed 不是普通标签, 它可能代表不同调查阶段、司法辖区和 QA 口径。label governance 要明确 ontology、positive/negative criteria、case status、SAR decision、typology、reviewer role、evidence reference、timestamp、rubric version 和 adjudication。还要监控 inter-annotator agreement、label drift、dispute rate 和 case outcome lag。高风险 label 不应由 LLM 自动生成作为唯一真值, LLM 可以辅助预分类或生成 reviewer draft, 但最终 golden label 要有专家或审计过的运营结论。

Q9: 如果 RAG 政策库引用了过期政策, 你如何处理?

30秒版本: 先止血: 禁用受影响客户可见回答或切到人工确认。再用 lineage 找到 source doc、chunk、index version、retrieval filter 和受影响 trace。修复后重建 index、重跑 citation eval、更新 freshness policy 和 incident controls。

2分钟版本: 我会按 data incident playbook 做。第一步分类严重度: 是否客户可见、是否影响合规义务、是否造成错误执行。第二步 containment: affected corpus/index 下线, route policy 切到 current approved index 或 human-only。第三步 impact analysis: 用 corpus lineage 找出过期 doc version、chunk ids、answers、users、business units。第四步 repair: 修正 source approval/expiry metadata, 重新 chunk、embed、index, 运行 freshness、permission、citation support 和 regression eval。第五步 post-incident review: 为什么 policy repo approval event 没触发 reindex, 为什么 freshness SLO 没拦截, contract 和 alert 如何更新。

Q10: 如何把 metadata product 做成作品集亮点?

30秒版本: 不要只展示 catalog 截图, 要展示 metadata 如何驱动 AI 控制: RAG permission filter、contract testing、lineage impact analysis、quality SLO、drift dashboard、incident response 和 release gate。

2分钟版本: 作品集可以选择一个金融零售 AI use case, 例如 KYC extraction 或 policy RAG。展示完整链路: data product canvas、data contract、lineage map、quality SLO matrix、schema change approval、eval provenance card、RAG freshness policy 和 incident playbook。然后用一个具体事故演练说明 metadata 的价值: 旧政策进入 corpus 后, catalog 的 effective_date、approval_status、lineage、index_version 和 trace 让团队能定位影响、禁用错误路径、修复并证明恢复。这样体现的是 AI Product Architect 和 Data Architect 能力, 不是简单数据整理。


12. 上线自检清单

CheckPass condition
Contract每个高风险 AI data product 有 active contract、owner、allowed use、change policy
Contract testsproducer、consumer、change impact tests 已接入 CI 或 pipeline
Lineagesource -> transform -> quality -> AI asset -> runtime trace 可追溯
Metadataowner、domain、classification、glossary、PII、retention、access policy 已登记
Quality SLOcritical metrics 有 target、measurement、owner、breach action
Driftfeature/data/label/corpus/schema/usage drift 有指标和决策规则
Training datasnapshot、feature definition、label source、sampling、exclusion、PII handling 可复现
Eval dataprovenance card、gold evidence、reviewer、rubric、severity、refresh cadence 完整
RAG corpusapproval status、effective date、chunking、index version、permission filters、citation policy 完整
Label governancelabel definition、reviewer model、agreement metric、adjudication、versioning 完整
Incident responseseverity、containment、impact analysis、repair、restart、post-incident review 已演练
GovernanceRACI、release gate、risk sign-off、audit evidence pack 可展示

13. 参考来源链接