AI Regulatory Reporting:风险数据聚合与签证架构
重要说明: 本文是学习、作品集和内部架构训练材料, 不构成法律意见、监管解释、合规结论、审计意见、财务报告结论、资本充足性结论、模型验证结论或监管报送建议。具体适用范围、报送义务、口径解释、重要性判断、提交时限、更正或重报判断由 Legal / Compliance / Regulatory Affairs / Finance / Risk / Internal Audit / Managemen
AI Regulatory Reporting / Risk Data Aggregation / Attestation Architecture 解读
面向对象: CBAP+ Financial Retail PM / Senior BA / Regulatory Reporting Product Lead / Risk Data Architect / Enterprise Architect / Data Governance Lead / Finance-Risk Technology Partner。 核心问题: 金融机构如何把 Call Report、FR Y-14、内部风险报告、BCBS 239 风险数据聚合、报表血缘、转换控制、指标定义、对账、签署、提交证据、问题整改和更正重报做成可追溯、可复算、可审计、可被 AI 辅助但不被 AI 接管的 reporting data architecture? 学习目标: 建立 source-to-report lineage、metric contract、attestation evidence graph、reporting control plane、AI assist boundary、restatement loop 和 senior management reporting readiness 的高级架构心智模型。
重要说明: 本文是学习、作品集和内部架构训练材料, 不构成法律意见、监管解释、合规结论、审计意见、财务报告结论、资本充足性结论、模型验证结论或监管报送建议。具体适用范围、报送义务、口径解释、重要性判断、提交时限、更正或重报判断由 Legal / Compliance / Regulatory Affairs / Finance / Risk / Internal Audit / Management 授权角色结合机构类型、牌照、监管关系、报告说明和内部政策确认。访问日期按 2026-06-30 记录。
Source Anchors
| Source | Official link | 本文使用方式 |
|---|---|---|
| BCBS 239 Principles for effective risk data aggregation and risk reporting | https://www.bis.org/publ/bcbs239.pdf | 用 governance、data architecture、accuracy、completeness、timeliness、adaptability、report accuracy、review 和 remedial action 组织 reporting data architecture。 |
| BCBS implementation update on BCBS 239 | https://www.bis.org/publ/bcbs_nl36.htm | 用持续实施、集团层级、子公司层级、数据治理和风险报告实践的官方语境校准 RDARR 成熟度。 |
| FDIC Current Quarter Call Report Forms, Instructions, and Related Materials | https://www.fdic.gov/bank-financial-reports/current-quarter-call-report-forms-instructions-and-related-materials | 作为 Call Report forms、instructions、updates 和 filing material 的官方入口之一。 |
| FFIEC 041 Current Information | https://www.ffiec.gov/resources/reporting-forms/ffiec041 | 用 FFIEC 报表页面作为 Call Report form/instruction 入口锚点, 支持报表版本和说明书管理。 |
| Federal Reserve FR Y-14A Reporting Form | https://www.federalreserve.gov/apps/reportingforms/Report/Index/FR_Y-14A | 用 annual capital assessment / stress testing reporting 的 instructions、forms 和 schedules 组织 capital planning reporting data architecture。 |
| Federal Reserve FR Y-14Q Reporting Form | https://www.federalreserve.gov/apps/reportingforms/Report/Index/FR_Y-14Q | 用 quarterly detailed asset、capital、PPNR、retail、wholesale、operational risk 等 schedules 组织高粒度风险数据聚合和报送控制。 |
| Federal Reserve FR Y-14M Reporting Form | https://www.federalreserve.gov/apps/reportingforms/Report/Index/FR_Y-14M | 用 monthly loan portfolio data collection 语境设计 loan-level / account-level reporting lineage。 |
| Federal Reserve SR 26-2 Revised Guidance on Model Risk Management | https://www.federalreserve.gov/supervisionreg/srletters/SR2602.htm | 用 risk-based model risk、development/use、validation/monitoring、governance/control 语言校准报表模型、估计和 AI 辅助能力。 |
| OCC Bulletin 2026-13 Model Risk Management: Revised Guidance | https://www.occ.gov/news-issuances/bulletins/2026/bulletin-2026-13.html | 与 SR 26-2 交叉锚定银行模型风险管理、第三方产品和治理控制。 |
| OCC Corporate and Risk Governance booklet | https://www.occ.gov/publications-and-resources/publications/comptrollers-handbook/files/corporate-risk-governance/index-corporate-and-risk-governance.html | 用 board/senior management、risk governance、roles/responsibilities 的官方监督语言支撑 reporting accountability。 |
| NIST AI Risk Management Framework | https://www.nist.gov/itl/ai-risk-management-framework | 用 Govern / Map / Measure / Manage 管理 AI-assisted reporting 的风险、边界、度量和处置。 |
| ISO/IEC 42001 AI management systems | https://www.iso.org/standard/81230.html | 用 AI management system、traceability、transparency、reliability、performance evaluation 和 continual improvement 组织 AI 辅助报送的管理体系。 |
一句话:
Regulatory reporting architecture is not a report factory. It is a governed source-to-report evidence system where every reported fact has definition, lineage, control, reconciliation, owner, attestation and correction path.
1. Thesis: 报送不是填表, 是事实产品化
低成熟度 regulatory reporting 通常长这样:
源系统抽数
-> 数据团队拼表
-> 报表团队调 Excel
-> Finance / Risk 人工检查
-> 管理层签字
-> 提交后靠邮件和共享盘解释差异
这种模式在 AI 时代会更危险。LLM 可以让 narrative 更快、mapping 更像样、异常解释更流畅, 但如果底层事实链没有工程化, AI 只会把不可靠数据包装成更有说服力的文本。
成熟 reporting data architecture 的目标是:
obligation / instruction
-> report metric contract
-> source-of-record and as-of snapshot
-> controlled transformation
-> reconciled reporting data product
-> schedule / cell / narrative output
-> layered attestation
-> submission package
-> post-submit issue and correction loop
-> audit-ready evidence graph
PM / BA / Architect 的高级价值不在于解释每个报表字段, 而在于把监管说明书、业务事实、数据结构、口径控制、责任签署、问题管理和 AI 辅助边界连成一个可运行系统。
2. Reporting Data Architecture 心智模型
2.1 从 Source-to-Report 到 Evidence-to-Action
| 层级 | 核心对象 | 关键问题 | 架构产物 |
|---|---|---|---|
| Obligation layer | Call Report item、FR Y-14 schedule、内部风险报告要求 | 哪些报告、哪些实体、哪些期间、哪些说明书版本适用 | obligation inventory、instruction version map |
| Definition layer | metric、schedule item、line item、cell、dimension | 口径、粒度、分母分子、排除项、as-of 时间、合并范围是什么 | report metric contract、semantic layer |
| Source layer | GL、subledger、loan servicing、cards、deposits、treasury、CRM、risk engines | 哪个系统是权威来源, 数据何时可用, 如何冻结版本 | source-of-record register、as-of snapshot |
| Transformation layer | ETL/ELT、rules、allocations、classification、manual adjustment | 每一步转换是否可解释、可测试、可复跑 | transformation spec、control tests、code version |
| Control layer | DQ、edit check、reasonableness check、reconciliation、variance analysis | 是否准确、完整、及时、一致, 差异如何解释 | control matrix、reconciliation engine、exception workflow |
| Attestation layer | data owner、report owner、finance/risk owner、executive signer | 谁证明什么事实, 基于哪些证据, 覆盖哪个版本 | attestation packet、sign-off ledger |
| Submission layer | final files、templates、validation results、confirmation、Q&A | 提交了什么, 何时提交, 用哪个版本, 有哪些反馈 | submission manifest、checksum、confirmation archive |
| Issue layer | data defect、instruction interpretation issue、late adjustment、correction | 影响哪些报告、期间、指标、客户/账户/组合和管理结论 | issue ledger、impact assessment、correction/restatement evidence |
2.2 三个不能混淆的对象
| 对象 | 定义 | 常见误区 | 正确架构要求 |
|---|---|---|---|
| Regulatory obligation | 来自报表说明、监管规则、监管问询或内部政策的报告要求 | 把说明书 PDF 当成静态附件 | 建成 versioned obligation inventory, 每个 line item 有 owner、解释记录和变更影响 |
| Reportable metric | 可填入 schedule/cell 或管理报告的一项可计算事实 | 把 dashboard 指标直接拿去报送 | 必须有 metric contract、source、transformation、DQ、reconciliation、approval |
| Attested report | 经授权责任人确认并提交或用于管理层的报告包 | 把签字当形式 | 签署必须绑定数据版本、控制结果、例外、解释和提交证据 |
2.3 BCBS 239 到架构语言
| BCBS 239 lens | Product / architecture translation | 可验收证据 |
|---|---|---|
| Governance | reporting data owner、report owner、risk/finance owner、technology owner、issue owner 明确 | RACI、committee decision、attestation ledger |
| Data architecture and IT infrastructure | 集成 taxonomy、metadata、single identifier、as-of data store、resilient pipeline | data model、lineage graph、BCP evidence |
| Accuracy and integrity | 自动化聚合、源头控制、编辑校验、对账、EUC 控制 | validation rule inventory、reconciliation report |
| Completeness | 覆盖所有 material exposure、legal entity、business line、off-balance sheet relevant population | completeness control、population reconciliation |
| Timeliness | 正常和 stress/crisis 情境下按时生成更新数据 | close calendar、SLA/SLO、late data log |
| Adaptability | 支持 ad hoc supervisory query、scenario subset、new instruction mapping | query workbench、semantic layer、change impact analysis |
| Report accuracy | 报告结果与风险数据对账, 近似值有可靠性标准 | report-to-source reconciliation、approximation rationale |
| Comprehensiveness / clarity / frequency / distribution | 报告覆盖重要风险, 口径清楚, 频率适合, 分发受控 | report catalog、distribution log、recipient entitlement |
| Review and remedial action | 独立评审、缺陷整改、监管反馈闭环 | control test, audit issue, CAPA, closure evidence |
3. Reference Architecture
Regulatory instructions / reporting obligations
Call Report | FR Y-14A/Q/M | internal risk reports | supervisory requests
|
v
Obligation and metric contract registry
schedule item | cell | definition | materiality | scope | owner | effective date
|
v
Source-of-record and as-of data layer
GL | subledger | deposits | loans | cards | treasury | risk engines | model outputs
legal entity | product | customer | account | collateral | scenario | calendar
|
v
Transformation and control plane
extraction | mapping | classification | aggregation | allocation | adjustment
DQ rules | edit checks | reconciliations | variance analytics | lineage events
|
v
Reporting data products
Call Report mart | FR Y-14 mart | risk aggregation mart | management reporting mart
|
v
Attestation and evidence layer
owner sign-off | exception disclosure | issue linkage | evidence manifest | audit log
|
v
Submission and post-submit lifecycle
final package | checksum | validation result | submission confirmation | regulator Q&A
correction / restatement | remediation | control improvement
|
v
AI assist layer with hard boundaries
mapping suggestions | anomaly explanations | narrative drafts | evidence retrieval | QA triage
架构原则:
No reported number without metric contract.
No metric contract without owner and lineage.
No lineage without transformation version.
No transformation without control evidence.
No attestation without exceptions and issue disclosure.
No AI-generated narrative without grounded source facts and human approval.
No submission without immutable package manifest.
4. Canonical Data Entities
| Entity | Minimum fields | 设计理由 |
|---|---|---|
| Reporting obligation | obligation_id, regulator, report_family, report_form, schedule, item, instruction_version, effective_date, applicability_owner | 把说明书和内部解释变成可管理对象 |
| Report metric contract | metric_id, line_item, definition, numerator, denominator, grain, dimensions, source_systems, exclusions, materiality, owner, version | 防止同名指标多口径 |
| Reporting period | period_id, as_of_date, close_calendar, freeze_time, late_data_policy, submission_due_date | 区分业务日期、可用日期、冻结日期和提交日期 |
| Source dataset | dataset_id, source_system, owner, refresh_time, cutoff, record_count, checksum, retention_class, DQ_status | 证明报表输入版本 |
| Transformation rule | rule_id, input, output, logic, code_version, parameter_version, approver, effective_date, test_suite | 证明口径转换可解释可复跑 |
| Reconciliation rule | recon_id, source_a, source_b, tolerance, frequency, owner, break_category, escalation_path | 把差异从邮件争论变成流程对象 |
| Manual adjustment | adjustment_id, report_item, amount, reason_code, preparer, approver, evidence_ref, reversal_policy | 控制人工覆盖和最后一刻改数 |
| Report cell | report_id, schedule, row, column, metric_id, value, unit, precision, data_version, validation_status | 细到 cell 的追溯能力 |
| Attestation | attestation_id, scope, signer_role, signer, data_version, control_results, exceptions, sign_time | 签署绑定事实和例外 |
| Submission package | package_id, report_id, files, checksums, validation_results, submitter, submission_time, confirmation_ref | 可证明提交的 exact artifact |
| Issue / correction | issue_id, severity, impacted_reports, impacted_periods, root_cause, owner, action, correction_decision, closure_evidence | 支持更正、重报和整改闭环 |
| AI assist event | ai_event_id, use_case, model, prompt_version, retrieved_sources, output_type, reviewer, approval_status | 证明 AI 只在授权边界内辅助 |
5. Report Metric Contract: 报送口径的产品契约
报表指标不是普通 BI metric。它直接连接监管说明、财务/风险事实、提交证据和签署责任。
| Contract field | 必须写清楚 | 反例 |
|---|---|---|
| Regulatory anchor | report family、schedule、line item、instruction version | 只写"贷款余额" |
| Business definition | 业务含义、产品范围、风险范围、客户/账户/交易口径 | 不区分信用卡 loan、commitment、unused line |
| Grain | account、loan、customer、legal entity、product、scenario、period | 聚合后无法回溯 population |
| Time basis | as-of date、event time、posting time、available time、freeze time | 把月末后补录数据混入月末报表 |
| Source of record | GL、subledger、servicing、risk engine、model output 的权威优先级 | 多系统哪个准靠会议判断 |
| Transformation logic | mapping、classification、aggregation、allocation、rounding、currency、consolidation | 只有 SQL, 没有业务解释 |
| Exclusions and inclusions | 排除项、例外、materiality threshold、materiality rationale | 人工过滤没有 reason code |
| Reconciliation | 对账对象、容忍度、break 分类、处理 SLA | 差异只写"immaterial" |
| Quality SLO | completeness、accuracy、validity、freshness、consistency、uniqueness | DQ 失败仍能静默出数 |
| Attestation | data owner、report owner、risk/finance owner、executive signer | 签署人不知道签的是哪个版本 |
| AI usage boundary | 是否允许 AI 辅助解释、检测异常、起草 narrative | 让 AI 解释口径并直接进入报表 |
示例:
metric_id: RPT-CREDITCARD-UTILIZED-BALANCE
report_mapping: FR Y-14M credit card portfolio schedule, internal regulatory reporting mart
grain: account x legal_entity x product x reporting_period
time_basis: month-end as-of, source freeze at T+2 18:00 CST
source_priority: card servicing ledger, reconciled to GL control account
transformation: exclude charged-off accounts after charge-off effective date; include posted purchases, fees and interest per approved mapping
reconciliation: subledger-to-GL tolerance 0.05% or USD 50k, lower of absolute/materiality rule
attestation: card data owner, finance reporting owner, regulatory reporting owner
AI_boundary: AI may draft variance explanation using approved recon facts; AI cannot change mapping or approve break
6. Source-to-Report Lineage
6.1 Lineage 深度分层
| Level | 适用场景 | 需要追溯到 |
|---|---|---|
| L1 Report lineage | 低风险内部管理汇总 | report -> schedule -> metric -> source system |
| L2 Dataset lineage | 常规监管报表和风险报告 | report -> metric -> curated dataset -> source dataset -> job run |
| L3 Column lineage | 重要监管报表、关键风险指标、重大管理决策 | report cell -> formula -> columns -> transformations -> source fields |
| L4 Record lineage | 高影响贷款级、账户级、交易级或监管问询样本 | report cell -> contributing records -> source record versions |
| L5 Evidence lineage | 提交、签署、更正、审计、监管问询 | record/metric -> control evidence -> attestation -> submission -> issue/action |
6.2 Report Lineage 应能回答的问题
| Question | 合格答案 |
|---|---|
| 这个 cell 为什么是这个值? | 能看到 metric definition、input population、transformation、aggregation 和 rounding |
| 来源数据何时冻结? | 有 source extract time、as-of date、available time、late arrival policy |
| 哪些记录贡献了这个值? | material schedules 至少可 drill down 到 contributing population 或样本 |
| 哪些控制通过或失败? | DQ、edit check、reconciliation、variance check 的结果和例外 |
| 谁批准了例外? | 例外 owner、审批人、理由、有效期和补救动作 |
| 提交的是哪个版本? | package manifest、checksum、submission timestamp 和 confirmation |
| 提交后发现问题怎么办? | issue impact assessment、correction decision、resubmission evidence |
7. Transformation Controls
报表转换控制的目标不是让每条规则都复杂, 而是让每条规则都能被理解、测试、变更和签署。
| Control area | 控制设计 | Evidence |
|---|---|---|
| Instruction mapping | 报表说明书 line item 映射到 metric contract, 解释争议由 Legal/Compliance/Reg Reporting 记录 | mapping approval, interpretation memo |
| Source extraction | 抽取窗口、过滤条件、record count、checksum、late data policy 固化 | extract log, record-count reconciliation |
| Classification | product、counterparty、legal entity、risk segment、collateral、geography 使用受控 taxonomy | taxonomy version, unmapped item report |
| Aggregation | group-by key、currency、rounding、netting、consolidation 明确 | aggregation test, sample replay |
| Allocation | shared balance、expense、capital、exposure allocation 有 approved methodology | allocation methodology, sensitivity review |
| Manual adjustment | adjustment reason、evidence、dual approval、reversal/roll-forward policy | adjustment register |
| EUC / spreadsheet | inventory、access control、version control、change log、independent check | EUC control report |
| Code change | transformation code 走 SDLC、peer review、test、deployment approval | pull request, release note, test result |
| Parameter change | threshold、mapping table、calendar、scenario、materiality 独立审批 | parameter change log |
| Runtime monitoring | job failure、data delay、quality breach、unexpected distribution shift 告警 | run log, alert, incident ticket |
AI 可以帮助发现映射异常、生成测试候选、解释差异模式, 但不能绕过 transformation control。
8. Reconciliation Architecture
对账不是最后一道人工检查, 而是 reporting data product 的核心能力。
8.1 对账类型
| Reconciliation type | 目标 | 例子 |
|---|---|---|
| Source-to-source | 多源系统同一事实是否一致 | loan servicing balance vs GL control account |
| Source-to-report | 报表值能否回到权威来源 | Call Report line item vs reporting mart vs GL |
| Cross-schedule | 同一报告不同 schedule 是否一致 | FR Y-14 retail totals vs balances / capital schedules |
| Cross-report | 监管报表、财务报表、内部风险报告是否口径可解释 | Call Report vs internal risk exposure dashboard |
| Period-over-period | 环比/同比变化是否合理 | deposit balances, delinquency, allowance, capital components |
| Population | 记录集合是否完整 | active accounts included/excluded with reason |
| Model-output-to-input | 模型或估计输出是否能回到输入和参数 | stress projection, allowance estimate, PPNR model output |
| Narrative-to-fact | 文字解释是否被结构化事实支持 | variance narrative cites approved recon facts |
8.2 Break Management
| Break severity | 标准 | 处理动作 |
|---|---|---|
| Critical | 影响提交准确性、关键报告项、管理层签署或监管问询回应 | freeze affected item, escalate to Reg Reporting / Finance / Risk / Legal as required, no silent submission |
| High | 超过容忍度, 但可通过批准调整或解释控制 | issue ticket, owner assigned, management exception approval |
| Medium | 未超容忍度但趋势异常或重复发生 | RCA, monitor, backlog remediation |
| Low | 文档、映射说明或非关键阈值优化 | batch fix through normal change process |
每个 break 必须有:
break_id
impacted_metric / schedule / period
amount / count / percentage
tolerance
root_cause
owner
decision: correct now / disclose exception / monitor / remediate after filing
approver
evidence_ref
9. Attestation Architecture
签署不是"我看过这个报表"。签署应明确:
I attest to this scope,
for this reporting period,
based on this data version,
with these controls passed,
with these disclosed exceptions,
within this responsibility boundary.
9.1 Layered Attestation
| Attestation layer | 签署对象 | 典型签署人 | 不能签什么 |
|---|---|---|---|
| Source data attestation | 源数据完整性、抽取窗口、冻结版本、已知限制 | source system owner / data owner | 不签最终报表解释 |
| Transformation attestation | mapping、rules、job run、test、parameter version | data engineering / reporting technology owner | 不签监管适用性 |
| Control attestation | DQ、edit check、reconciliation、variance、exception disposition | reporting control owner | 不签业务事实源头 |
| Business attestation | 业务口径、产品分类、异常解释、手工调整理由 | business / finance / risk owner | 不签技术运行细节 |
| Regulatory reporting attestation | report package completeness、instruction version、filing readiness | regulatory reporting owner | 不替代 executive certification |
| Executive attestation | 管理层批准、重大例外知情、提交授权 | authorized executive | 不消除底层 owner 责任 |
9.2 Attestation Packet
| Packet section | 内容 |
|---|---|
| Scope | report family、period、legal entity、schedule、metric set |
| Version | instruction version、data freeze、transformation release、semantic layer version |
| Control results | DQ status、reconciliation status、edit checks、variance review |
| Exceptions | open issues、approved breaks、manual adjustments、late data, EUC dependency |
| Material changes | source change、mapping change、model/estimate change、methodology change |
| AI usage disclosure | 哪些 AI assist 用于分析、叙事或证据检索, reviewer 和 approval |
| Sign-off | signer、role、time、delegation authority、comments |
| Evidence | links to logs、reports、tickets、meeting decisions、submission manifest |
10. Submission Evidence
提交证据要能证明"提交过"、"提交的是什么"、"基于什么版本"、"验证结果如何"、"之后发生了什么"。
| Evidence item | 说明 |
|---|---|
| Final report files | 报送文件、模板、schema、forms, 保存 exact final version |
| Package manifest | 文件名、hash/checksum、row count、cell count、reporting period、creator、approver |
| Validation output | regulator portal validation、internal edit checks、error/warning disposition |
| Attestation ledger | 所有 required sign-offs 和 exception acknowledgements |
| Submission confirmation | portal confirmation、timestamp、submitter、tracking number |
| Regulator Q&A | questions、responses、source facts、approvals、versions |
| Post-submit issue log | submitted issues, discovered defects, correction decisions |
| Retention and access log | 证据保留期限、访问权限、下载/查看记录 |
Submission evidence vault 的设计要求:
- immutable versioning: final package 不被覆盖。
- access segregation: preparer、approver、auditor、viewer 权限分离。
- searchability: 可按 report、period、metric、issue、signer、source system 查找。
- retention alignment: 按内部 records retention 和 legal hold 执行。
- evidence integrity: hash、timestamp、owner、approval chain 可验证。
11. Issue, Correction and Restatement Loop
提交后发现问题时, 架构必须支持事实快速定位, 但判断是否更正、重报或沟通监管属于授权治理流程。
detection
-> issue classification
-> impacted reports / periods / metrics / entities
-> quantitative and qualitative impact
-> Legal / Compliance / Regulatory Reporting / Finance / Risk decision
-> correction or no-correction rationale
-> resubmission package if required
-> root cause remediation
-> control enhancement
-> closure attestation
11.1 Issue Taxonomy
| Issue type | 例子 | 架构响应 |
|---|---|---|
| Source data defect | servicing 系统余额回灌错误 | identify impacted extracts, rebuild as-of snapshot, rerun reconciliations |
| Mapping defect | 产品映射到错误 Call Report category | update mapping table, impact historical periods, enhance mapping tests |
| Transformation defect | aggregation / allocation rule 代码错误 | fix code, regression test, version release, quantify impact |
| Manual adjustment error | adjustment reason 不充分或金额错误 | reverse / correct adjustment, dual approval, strengthen threshold |
| Instruction interpretation change | 内部口径与说明书解释不一致 | document interpretation, Legal/Compliance/Reg Reporting decision |
| Model / estimate issue | stress projection、allowance estimate、PPNR model output 发现缺陷 | model risk review, output replacement or disclosure decision |
| AI narrative issue | variance explanation 引用错误事实或过度推断 | revoke narrative, correct source references, prompt/control update |
| Submission artifact issue | 文件版本、format、portal warning、confirmation gap | package reconstruction, portal evidence, submitter review |
11.2 Correction Decision Record
| Field | 内容 |
|---|---|
| issue_id | 与 issue ledger 一致 |
| discovered_by | control, audit, regulator, business, AI anomaly detection |
| impacted_scope | report, schedule, item, period, legal entity, amount/count |
| materiality analysis | quantitative and qualitative facts, owner, reviewer |
| decision | correct, resubmit, disclose, monitor, no correction with rationale |
| approval | authorized roles and timestamps |
| evidence | recalculation, revised package, communications, closure proof |
| preventive action | control, data, transformation, training, AI prompt/guardrail update |
12. AI Assist Boundaries
AI 在 regulatory reporting 中有价值, 但必须定位为 assistant, not accountable reporter。
12.1 Allowed with Controls
| AI use case | 价值 | 必要控制 |
|---|---|---|
| Instruction-to-metric mapping suggestion | 加速说明书解析和影响分析 | human owner approval, source citation, version lock |
| Data anomaly detection | 发现异常变动、缺失、分布漂移 | deterministic thresholds + explainable alert + false positive tracking |
| Reconciliation break clustering | 把大量 break 按根因聚类 | no auto-close, owner review |
| Variance narrative draft | 根据 approved facts 起草原因说明 | grounded evidence, no unsupported conclusion, sign-off |
| Evidence retrieval | 快速查找 lineage、control、ticket、attestation | role-based access, retrieval log |
| Control test generation | 生成候选 DQ/edit rules 和 test cases | engineer/control owner review before production |
| Impact analysis assistant | 根据 lineage 推测受影响报表和指标 | deterministic lineage query as source, human confirmation |
12.2 Not Allowed
| Boundary | 原因 |
|---|---|
| AI 不做法规适用性最终判断 | exact applicability Legal/Compliance-owned |
| AI 不改变 report metric definition | metric contract owner 和授权 governance 决定 |
| AI 不直接写入 final report cells | 报送事实必须来自受控 data product |
| AI 不关闭 reconciliation break | break disposition 需要 accountable owner |
| AI 不签署 attestation | attestation 是授权责任行为 |
| AI 不提交监管报表 | submission authority、dual control、evidence capture 必须由授权流程执行 |
| AI 不生成无来源 narrative | 所有 narrative 必须可追溯到 approved facts |
| AI 不掩盖 issue 或重写审计痕迹 | audit trail 必须不可篡改 |
12.3 Generated Narrative Controls
| Control | 要求 |
|---|---|
| Grounding | narrative 只能使用 approved reporting facts、variance tables、reconciliation results 和 owner comments |
| Citation | 每个关键解释指向 metric、period、source、break 或 issue reference |
| Separation | 区分 fact、analysis、judgment、legal/compliance conclusion |
| Human review | report owner / business owner / regulatory reporting owner 根据 scope 审核 |
| Versioning | prompt、model、retrieval corpus、output、human edits 保留版本 |
| Prohibited language | 不写监管结论、承诺、责任归因或未经批准的解释 |
| Sampling QA | 定期抽检 narrative 是否与数字、对账和问题台账一致 |
13. Senior Management MI: 只报告 Reporting Readiness
本文不展开通用 board MI。这里的 senior management MI 只服务 regulatory reporting data architecture readiness。
| MI area | Question | Metric examples |
|---|---|---|
| Filing readiness | 本期报送是否在轨 | reports on schedule, open critical breaks, pending attestations |
| Data quality | 关键数据是否可靠 | critical DQ pass rate, late feeds, completeness exceptions |
| Reconciliation | 差异是否受控 | unreconciled amount, break aging, repeat break root causes |
| Manual adjustment | 人工覆盖是否过多 | adjustment count/value, late adjustments, approval exceptions |
| Lineage coverage | 报告数字是否可追溯 | % critical metrics with L3/L4 lineage, missing owner count |
| Issue management | 缺陷是否关闭 | open issues by severity, remediation SLA, repeat defects |
| Correction / restatement | 提交后质量如何 | corrections, resubmissions, regulator questions, root causes |
| AI usage | AI 是否在边界内辅助 | AI assist events, narrative QA defects, unauthorized use detections |
好的 MI 不是"本期已完成报送", 而是:
Which report items are materially reliant on manual workarounds?
Which reconciliations are open and who accepted them?
Which metrics lack required lineage depth?
Which AI-assisted narratives had factual defects?
Which repeat defects need platform investment?
14. PM / Architect Implications
| 角色 | 关键职责 | 高级产出 |
|---|---|---|
| PM | 把 regulatory reporting 从项目交付转成长期 data product roadmap | reporting product canvas, capability roadmap, adoption and control KPIs |
| Senior BA | 把 instructions、业务口径、数据字段、例外、签署和证据转成需求 | metric contracts, process maps, control requirements |
| Data Architect | 设计 source-of-record、as-of layer、semantic layer、lineage 和 DQ | canonical data model, lineage architecture, data contract |
| Solution Architect | 设计 pipeline、control plane、attestation workflow、evidence vault、integration | reference architecture, NFR, resilience and access design |
| Regulatory Reporting Owner | 确认报告流程、签署、提交、例外和问询响应 | report calendar, attestation matrix, submission manifest |
| Risk / Finance Owner | 确认业务事实、指标解释、对账差异和管理层影响 | variance explanation, break disposition, management readiness |
| AI Governance / Model Risk | 确认 AI assist 边界、模型/提示词控制、输出验证 | AI use register, narrative QA, model/prompt evidence |
高级面试表达:
我不会把 AI regulatory reporting 做成"LLM 帮忙填表"。我会先做 source-to-report data product: obligation inventory、metric contract、as-of snapshot、lineage、DQ、reconciliation、attestation 和 submission evidence。AI 只在 mapping suggestion、anomaly detection、evidence retrieval 和 grounded narrative draft 中辅助, 所有定义、例外、签署、提交和更正判断仍由授权 owner 负责。
15. Anti-Patterns
| Anti-pattern | 风险 | 改法 |
|---|---|---|
| PDF instruction + spreadsheet mapping | 口径不可追溯, 变更影响靠人记忆 | versioned obligation registry + metric contract |
| Dashboard number copied into filing | BI 口径和报送口径混用 | reporting mart with approved semantic layer |
| GL reconciliation at final step only | 上游错误发现太晚 | source-to-source and source-to-report controls embedded in pipeline |
| Manual adjustment without reason taxonomy | 无法分析重复问题和责任 | adjustment register + reason code + approval threshold |
| Attestation as email sign-off | 签署与数据版本脱节 | structured attestation packet |
| AI writes variance narrative from raw tables | 可能编造解释或引用未经批准事实 | grounded narrative over approved facts only |
| Submission package overwritten | 无法证明提交 exact version | immutable manifest + checksum |
| Corrections handled case-by-case | 根因不沉淀, 重复缺陷 | issue ledger + CAPA + control enhancement |
| Treat BCBS 239 as compliance checklist | 数据架构没有改变 | convert principles into controls, lineage and operating model |
16. 面试题准备
Q1: 如何设计一个 AI-assisted regulatory reporting architecture?
30 秒版本: 我会先建设 source-to-report evidence system, 再加 AI assist。核心是 obligation inventory、metric contract、as-of source data、controlled transformation、reconciliation、attestation、submission evidence 和 issue/restatement loop。AI 只能做建议、检测、检索和有来源的 narrative draft, 不能定义口径、改数、关闭差异、签署或提交。
2 分钟版本: 金融监管报送的风险不在于表格生成慢, 而在于数字能否被证明。架构上我会把每个 report item 当成一个数据产品: 有说明书版本、业务定义、粒度、source-of-record、转换规则、对账规则、质量 SLO、owner 和签署范围。数据层要有 as-of snapshot, 因为报送关心某个报告期和冻结点的事实。控制层要包括 source-to-source、source-to-report、cross-schedule、period-over-period 和 narrative-to-fact reconciliation。签署层要分 source、transformation、control、business、reg reporting 和 executive。AI 放在辅助层, 例如异常聚类、证据检索、说明书影响分析、variance narrative 草稿, 但每个输出必须有来源、版本、reviewer 和审批记录。
Q2: BCBS 239 对产品和架构最重要的启发是什么?
30 秒版本: BCBS 239 的本质是让风险数据聚合和风险报告具备 governance、architecture、accuracy、completeness、timeliness 和 adaptability。对 PM/Architect 来说, 它要求我们把报表从人工汇总升级为可追溯的数据产品和控制系统。
2 分钟版本: 我会把 BCBS 239 翻译成工程要求: 数据架构要有统一 identifier、taxonomy 和 metadata; 关键指标要有 owner 和 contract; 数据要能自动化聚合并与源系统和会计数据对账; 报告要在正常和压力情境下按时生成; 对监管临时问询要能按 legal entity、产品、行业、地区和风险维度快速重切。最容易被低估的是 adaptability: 它不是临时 SQL 能跑出来, 而是语义层、血缘、质量和权限已经产品化。
Q3: AI 生成 regulatory reporting narrative 的边界是什么?
30 秒版本: AI 可以基于 approved facts 起草 variance explanation, 但不能生成没有来源的解释、法律/合规结论、责任归因、提交承诺或更正判断。所有 narrative 都要有 fact grounding、citation、人审、版本和 QA。
2 分钟版本: 我会把 narrative 当成派生控制对象。输入只允许来自 approved variance table、reconciliation result、issue ledger 和 owner comment, 不让模型自由读取原始大表后发挥。输出要区分事实、分析、判断和需 Legal/Compliance 确认的问题。关键句子必须引用 metric、period、source 或 issue reference。高影响报表和监管问询材料必须人工审批, 并保留 prompt、model、retrieval corpus、AI output、human edit 和 approval log。
Q4: 发现提交后数据错误, 架构应该如何支持更正或重报?
30 秒版本: 系统应快速做 impact assessment: 哪些报告、期间、line item、legal entity、records 和 attestations 受影响。是否更正或重报由授权 governance 判断, 架构负责提供复算、证据、版本、审批和整改闭环。
2 分钟版本: 我会先冻结 evidence, 创建 issue_id, 通过 lineage 找到 impacted reports、periods、metrics、data versions 和 submission packages。然后量化影响, 区分 source defect、mapping defect、transformation defect、manual adjustment error、instruction interpretation 或 model estimate issue。Reg Reporting、Finance、Risk、Legal/Compliance 根据事实决定是否 correction、resubmission、disclosure 或 monitor。技术侧生成 revised dataset、reconciliation、validation、attestation 和 package manifest。最后把根因转成 preventive action, 例如新增 DQ rule、mapping test、parameter approval 或 AI narrative guardrail。
Q5: 如何避免 AI 让报送流程看起来更快但实际更不可靠?
30 秒版本: 先把 deterministic controls 建好, 再接 AI。AI 输出必须低权限、可追溯、可拒绝、可抽检。所有关键动作仍由系统控制和人类 owner 完成。
2 分钟版本: 我会设置四类 guardrail。第一, data guardrail: AI 只能访问 approved catalog、metric contracts 和 evidence vault, 不直接改 reporting mart。第二, workflow guardrail: AI 只能生成 suggestion 或 draft, final action 需要 owner review。第三, evidence guardrail: 每次 AI assist 都有 model、prompt、retrieved source、output、reviewer 和 decision。第四, quality guardrail: 抽检 AI narrative factuality、reconciliation break clustering accuracy 和 unauthorized use detection。这样 AI 提升分析效率, 但不削弱报送责任链。
17. 最小可落地能力集
| Capability | MVP | Scale version |
|---|---|---|
| Obligation inventory | 关键报表和 schedule 的 versioned register | 自动 diff instructions, impact analysis |
| Metric contracts | Top material line items | 全量 schedule/cell semantic layer |
| As-of snapshots | 关键源系统 freeze and checksum | event-time/available-time bitemporal store |
| Lineage | dataset-level lineage | cell/column/record/evidence lineage |
| Reconciliation | GL/subledger and period variance | cross-report, cross-schedule, narrative-to-fact |
| Attestation | structured sign-off packet | workflow-integrated attestation ledger |
| Evidence vault | submission package archive | immutable evidence graph with search and retention |
| AI assist | evidence retrieval and variance draft | controlled anomaly detection, impact analysis, QA copilot |
| Issue loop | issue register and RCA | correction/restatement workflow and CAPA analytics |
最终判断标准:
Can the team reconstruct a submitted number, its source records, transformation logic,
control results, exceptions, signer, submission artifact and later corrections without
asking the original preparer to remember what happened?
如果答案是否定的, 问题不是 AI 不够强, 而是 reporting architecture 还没有成为可证明的系统。