返回 Papers
AI 底层逻辑 / 经典论文

AI Closed-Loop Learning:纠正行动架构

一句话:

197ai-foundations/papers/110-ai-closed-loop-learning-corrective-action-architecture.md

AI Closed-Loop Learning / Corrective Action Architecture 解读

面向对象: AI PM / AI BA / AI Architect / Model Risk Partner / Customer Operations Lead / 金融零售 AI 治理负责人。 核心问题: 生产反馈、人工覆盖、投诉、专家审核、eval 失败、漂移信号、事件和审计发现如果只进入 dashboard 或 backlog, AI 系统不会真正变好。 学习目标: 区分 active learning 与 CAPA-like corrective action loop, 掌握 issue taxonomy、root cause、change linkage、更新治理和 effectiveness verification。


Source Anchors

SourceLink用途
NIST AI Risk Management Frameworkhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 组织风险、测量、处置、责任和持续改进证据
ISO/IEC 42001 AI management systemhttps://www.iso.org/standard/42001把 AI 改进闭环放进管理体系、目标、控制、记录、内审和持续改进
Federal Reserve SR 26-2https://www.federalreserve.gov/supervisionreg/srletters/SR2602.htm2026 年模型风险管理锚点。Nuance: SR 26-2 替代 SR 11-7 / SR 21-8, 采用更风险导向和机构规模分层的模型风险框架; 但正式 scope 聚焦 banking organization 的 model risk management, 生成式 / agentic AI 仍需结合更广 AI governance、消费者合规、隐私、安全、第三方和业务控制。
CFPB Consumer Complaint Databasehttps://www.consumerfinance.gov/data-research/consumer-complaints/把投诉主题、客户叙述、公司响应和趋势作为外部 customer harm / corrective action 信号
一句话:

Closed-loop learning 是把 AI 失败和弱信号从“可观察”推进到“已纠正、已验证、可审计、可防复发”。


1. Thesis

Active learning 问的是:

Which samples should humans label next to improve the model?

Closed-loop corrective action 问的是:

Which production issue harmed quality, customers, controls or risk assumptions?
What was the root cause?
Which data, model, prompt, RAG, tool, workflow or policy changed?
What evidence proves the fix worked and did not create new risk?

两者可以连接, 但不能混同。Active learning 是学习样本选择机制; corrective action architecture 是治理过的生产改进机制。它覆盖:

可能修复对象
Data标签定义、样本覆盖、数据质量、lineage、privacy filter
Model重训、重校准、阈值、segment policy、fallback
Promptsystem instruction、rubric、refusal、handoff、format contract
RAGsource authority、freshness、chunking、retriever、citation gate
Tool / agentpermission、tool schema、approval gate、state machine
Workflow / policyhuman review、case SLA、reason code、risk appetite
Controlmonitoring threshold、release gate、audit evidence、owner cadence

2. Why It Matters

金融零售 AI 的生产失败往往不是单一模型错误:

  • 客服 RAG 引用旧费用政策, 客户错过申诉窗口。
  • 欺诈模型误拦截上升, 员工大量 override, 但没有结构化原因。
  • 信贷解释生成器使用不完整 reason code, 客户投诉“原因不对”。
  • 审计发现模型更新后没有证明旧缺陷已被关闭。
  • 漂移 dashboard 报警, 但没有 owner、SLA、修复证据和复发监控。 高成熟度组织会把这些信号汇聚成 corrective action ledger:
signal -> issue classification -> severity and customer impact
  -> root cause analysis -> corrective and preventive action
  -> linked change artifacts -> release gate
  -> effectiveness verification -> recurrence monitoring -> closure evidence

3. Core Concepts

3.1 Feedback Is Not Automatically Truth

Feedback价值不能直接做什么
Human override暴露模型、政策或 UI 信任问题不等于 ground truth
Customer complaint暴露可感知伤害和旅程断点不等于最终责任认定
Expert review可形成标签、eval 或 policy interpretation需要 calibration 和证据
Eval failure暴露 release gate 缺口不等于只改 prompt
Drift signal暴露上线假设变化不等于自动重训
Audit finding暴露控制缺口需要 root cause 和整改验证

3.2 CAPA-like Loop

Step高级问题
Detect信号是否覆盖投诉、override、eval、drift、incident 和 audit
Classify这是模型、数据、RAG、prompt、tool、流程、政策还是治理问题
Contain是否需要暂停自动化、提高人工复核或客户补救
Root cause是直接缺陷、控制缺口、owner 缺失还是风险假设错误
Change哪个 artifact 被修改, 版本和审批是什么
Verify用什么 eval、回放、shadow、QA 或生产 KRI 证明有效
Close谁接受残余风险, 复发监控窗口多久

3.3 Change Linkage

issue_id -> root_cause_id -> corrective_action_id -> change_request_id
  -> dataset_version / prompt_version / model_version / retriever_version / tool_policy_version
  -> eval_run_id -> approval_record -> post_fix_monitoring_window

没有 change linkage 的“已修复”通常只是口头关闭。

4. Architecture Diagram

Feedback and risk signals
  complaints | appeals | overrides | expert QA | eval failures | drift alerts | incidents | audit findings
        |
        v
Signal normalization and AI contribution matching
  system id | model/prompt/RAG/tool version | journey step | customer impact | segment
        |
        v
Issue taxonomy and severity engine
  model | data | policy | prompt | retrieval | tool | workflow | governance | vendor
        |
        v
CAPA ledger
  containment | RCA | corrective action | preventive action | owner | SLA | evidence status
        |
        v
Change linkage layer
  dataset registry | eval registry | prompt repo | model registry | vector index | tool policy | SOP
        |
        v
Release and approval gate
  risk review | model validation | compliance | operations readiness | rollback plan
        |
        v
Effectiveness verification and closure
  regression eval | replay | segment metrics | complaint trend | override rate | recurrence

5. Financial Retail Case

Case: Credit card servicing RAG gives stale fee-waiver guidance

Signals: complaint says chatbot gave wrong fee waiver condition; internal “fee explanation wrong” complaints rise; agent override rate increases; RAG eval shows policy effective-date failures; drift monitor finds a new annual-fee cluster after campaign launch.

Root cause layerFinding
RetrievalOld policy summary ranked above current policy
PromptCitation required, but effective-date priority missing
Knowledge governanceSuperseded source was not inactive
MonitoringUnsupported claim KRI had no complaint linkage threshold
WorkflowConflicting citation answers did not force handoff
Corrective actions:
  • Deactivate superseded source and rebuild vector index.
  • Add effective-date filter and source authority ranking.
  • Update prompt to require current policy citation for fee claims.
  • Add eval cases for fee waiver, annual fee, refund and dispute intents.
  • Route conflicting citation answers to human review. Effectiveness evidence:
  • Unsupported fee-policy claim rate drops from 14% to 3% on locked eval set.
  • Human QA confirms 50 sampled production answers cite current policy.
  • Fee explanation complaints return to baseline for two consecutive weeks.
  • Agent override rate for fee answers drops below pre-incident average.
  • No repeat H2+ harm cases in 30-day post-fix window.

6. PM / BA / Architect Checklist

AreaQuestions
IntakeDoes every feedback source capture AI system, version, journey, segment and customer impact
TaxonomyCan the team distinguish data, model, prompt, retrieval, tool, workflow, policy and governance issues
RCAIs root cause deeper than “model hallucinated” or “user asked unusual question”
ContainmentCan Product or Risk pause the harmful path before the permanent fix ships
Change linkageIs each issue linked to exact dataset, prompt, model, RAG, tool, policy or SOP version
VerificationIs fix effectiveness tested on locked evals, replay, segment metrics and production KRIs
GovernanceDo Model Risk, Compliance, Operations and Business Owner have clear challenge and approval roles
EvidenceCan an auditor reconstruct signal, decision, change, approval, rollout, monitoring and closure

7. Code-lite Experiment

Goal: show that closed-loop learning needs traceability, not just a better model score.

issues = [
    {"id": "I-101", "signal": "complaint", "theme": "wrong_fee_policy", "severity": "H2"},
    {"id": "I-102", "signal": "agent_override", "theme": "wrong_fee_policy", "severity": "H1"},
]
changes = [
    {"id": "CR-77", "artifact": "retriever_config", "version": "v5.3", "root_cause": "superseded_source_ranked_first"},
    {"id": "CR-78", "artifact": "rag_eval_set", "version": "fee_policy_gold_v2", "root_cause": "missing_effective_date_filter"},
]
verification = [
    {"change": "CR-77", "metric": "unsupported_claim_rate", "before": 0.14, "after": 0.03, "pass": True},
    {"change": "CR-78", "metric": "fee_policy_eval_pass_rate", "before": 0.82, "after": 0.97, "pass": True},
]
closed = all(v["pass"] for v in verification)

8. Interview Questions

Q1: Closed-loop learning 和 active learning 的区别?

30 秒回答: Active learning 关注选择哪些样本让专家标注来提升模型。Closed-loop learning 关注生产问题如何被分类、找根因、链接到具体变更并验证有效。它更像 AI 版 CAPA, 覆盖数据、模型、prompt、RAG、tool、流程、政策和控制。

Q2: 如何证明一个 AI 修复真的有效?

30 秒回答: 我会要求四类证据: locked regression eval 通过, 生产 replay 或 shadow 结果改善, 相关投诉 / override / QA defect / drift KRI 回到阈值内, 且在定义好的 post-fix monitoring window 没有复发。只说“prompt 已更新”不够。

Q3: SR 26-2 对闭环学习有什么启发?

30 秒回答: SR 26-2 是 2026 年银行模型风险管理的重要锚点, 强调风险导向、生命周期治理、独立挑战和问题整改。Nuance 是它的正式 scope 是模型风险管理, 生成式和 agentic AI 的闭环学习还要叠加 AI 管理体系、消费者保护、隐私、安全、第三方和运营控制。

9. Pitfalls

Pitfall表现修正
Feedback swamp投诉、override、QA 都进表, 没有 taxonomy 和 owner建 CAPA ledger 和 severity routing
Prompt-only fix所有 RAG 问题都改 prompt同时检查 source、retrieval、policy、handoff 和 eval
No effectiveness test变更完成即关闭 issue增加 before / after、locked eval、生产 KRI 和复发窗口
Eval leakage把失败样本反复调到通过使用 locked eval、near-duplicate control 和新样本验证
Root cause too shallow“模型不准确”成为 RCA追到数据、政策、workflow、control 或 owner 缺口
Customer harm ignored只看模型指标, 不看投诉和补救把 complaint / appeal / recourse 指标纳入 closure gate
Audit gap无法证明谁改了什么、为什么改、谁批准维护 issue-to-change-to-evidence traceability

10. Portfolio Task

做一个 “AI Corrective Action Control Pack”:

Artifact内容
Feedback taxonomycomplaints、appeals、overrides、eval failures、drift、audit findings 的分类
CAPA workflowdetect、classify、contain、RCA、change、verify、close
Root cause taxonomydata、model、prompt、retrieval、tool、workflow、policy、governance、vendor
Change linkage mapissue 到 dataset / prompt / model / RAG / tool / SOP 的 traceability
Effectiveness dashboardbefore / after、locked eval、production KRI、recurrence window
Executive memo哪些高风险 issue 已关闭, 哪些 residual risk 被接受
最终要能讲清楚: 真正的闭环不是“模型持续学习”, 而是组织持续证明 AI 系统从失败中学到、修到、验证到、并防止复发。