返回 Papers
AI 扩展计划 / Playbooks

AI Core Banking Modernization / Strangler / Evidence Playbook

面向 CBAP+ 金融零售 Senior PM、Senior BA、Solution Architect、Enterprise Architect、Core Banking Transformation Lead、Data Migration Owner、Technology Risk、Operational Resilience、Internal Audit、Vendor Management

691AI_CORE_BANKING_MODERNIZATION_STRANGLER_EVIDENCE_PLAYBOOK.md

AI Core Banking Modernization / Legacy Strangler / Evidence Architecture Playbook

面向 CBAP+ 金融零售 Senior PM、Senior BA、Solution Architect、Enterprise Architect、Core Banking Transformation Lead、Data Migration Owner、Technology Risk、Operational Resilience、Internal Audit、Vendor Management 和核心包/主机现代化团队。本文不讲基础 BA 访谈和普通需求模板, 而是把 AI、产品、解决方案架构和企业架构放在同一个现代化控制面里, 训练你设计可分阶段迁移、可对账、可回滚、可审计、可运营、可解释客户影响的核心银行 modernization program。

核心判断:

核心银行现代化的主线不是“旧核心到新核心”的技术搬迁, 而是“事实源、客户影响、运营责任和审计证据”的受控迁移。

本文是学习、作品集和内部架构训练材料, 不构成法律意见、监管解释、合规结论、审计意见、会计处理结论、供应商选型建议、迁移批准或生产 cutover 指令。正式项目必须由机构授权的 Business Owner、Technology Owner、Data Owner、Risk、Compliance、Legal、Finance、Operations、BCP、Third-Party Risk、Internal Audit 和 Vendor 结合业务、合同、牌照、司法辖区和内部政策确认。

访问日期: 2026-06-30。


1. Source Anchors

AnchorOfficial link本 playbook 使用方式
FFIEC Architecture, Infrastructure, and Operations IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/architecture-infrastructure-and-operations/用 enterprise architecture、data flow diagrams、technology assets、change、operations、monitoring、resilience、third-party oversight 设计 modernization control plane。
FFIEC Development, Acquisition, and Maintenance IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/development-acquisition-and-maintenance/用 development / acquisition / maintenance governance、risk management、testing、change control 和 vendor acquisition 组织核心包、主机改造与新平台建设。
FFIEC Management IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/management用 governance、risk management、enterprise architecture、project management、monitoring/reporting、quality assurance 支撑转型责任和管理层汇报。
FFIEC Business Continuity Management IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/business-continuity-management用 BIA、risk assessment、continuity strategy、BCP、training、exercise、maintenance、board reporting 组织 cutover、rollback、degraded mode 和恢复演练。
NIST SP 800-160 Vol. 1 Rev. 1 Engineering Trustworthy Secure Systemshttps://csrc.nist.gov/pubs/sp/800/160/v1/r1/final用 systems security engineering 视角设计可信系统边界、生命周期、交互、失效模式和可生存性。
NIST SP 800-218 Secure Software Development Frameworkhttps://csrc.nist.gov/pubs/sp/800/218/final用 SSDF 的 secure SDLC 语言组织代码、配置、供应链、测试、漏洞和缺陷响应。
NIST AI Risk Management Frameworkhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 定义 AI 在核心现代化中的用途边界、风险度量、人工复核和持续改进。
ISO/IEC 42001 AI management systemshttps://www.iso.org/standard/81230.html用 AI management system 的责任、运行控制、绩效评价和持续改进管理 AI-assisted modernization。

2. Executive Framing

核心银行现代化 program 通常同时面临四种压力:

  1. 业务压力: 新产品、实时服务、开放银行、个性化、费用和利率策略、渠道体验被旧核心 release cycle 卡住。
  2. 技术压力: 主机人才稀缺、batch window 紧张、接口脆弱、数据结构历史包袱重、核心包升级慢。
  3. 风险压力: 客户余额、交易、利息、费用、账务、报表、投诉、审计证据不能出错。
  4. 转型压力: 管理层希望看到现代化进展, 但 big-bang replacement 的风险过高。

高级 playbook 的目标不是押注某个银弹, 而是设计一套 transition architecture:

legacy estate
  -> controlled coexistence
  -> strangled capabilities
  -> target platform and core package integration
  -> evidence-backed source-of-truth transfer
  -> legacy retirement by capability

2.1 成功标准

维度低成熟度成功标准高成熟度成功标准
Scope新核心上线每个 capability 的 source-of-truth、运营责任和证据责任完成迁移
Data数据装载完成population manifest、mapping、DQ、reconciliation、exception closure 可证明
Channels接口打通渠道通过 ACL 使用稳定 intent API, 不吸收新旧核心差异
Risk测试通过customer harm scenarios、rollback、BCP、parallel-run、control evidence 达标
AIAI 提高效率AI 辅助受治理, 不拥有核心事实、写入、cutover 和审计结论
Audit项目文档归档evidence architecture 支撑查询、复盘、监管问询和管理层报告

3. Modernization Decision Matrix

核心现代化不是只有 replace。常见选项应按 capability、风险、收益、供应商约束和证据成熟度组合使用。

Pattern适用场景优点风险Gate
Encapsulate / Wrap旧能力仍稳定, 但渠道需要 API 化快速降低渠道耦合只包接口不改语义会固化旧复杂性ACL contract、semantic mapping、operational SLO
Rehost / Replatform技术基础设施过旧, 业务语义暂不变降低基础设施风险业务能力没有现代化Performance, DR, security, batch equivalence evidence
Refactor局部代码和规则可控降低维护风险对隐式规则理解不足Rule mining、unit/regression evidence、SME approval
Core package extension新核心包适配特定产品/地区利用供应商能力参数和定制可能形成新 legacyParameter governance、vendor release impact
Strangler by capability业务能力可拆, coexistence 可控分阶段释放价值需要强 ACL、recon 和运营编排Slice readiness、source-of-truth registry、rollback
Big-bang replacement规模小、产品少、旧系统简单或监管窗口允许结束 coexistence风险集中、rollback 困难Exceptional approval、full rehearsal、BCP proof

高级判断:

对复杂金融零售机构, strangler 常是主线, package replacement 常是目标平台的一部分, ACL / evidence / reconciliation 是降低 coexistence 风险的核心能力。


4. Core Constraint Inventory

启动前先建立约束清单, 不要先画目标架构。

Constraint domain必查对象关键问题Evidence
Products存款、贷款、卡、透支、费用、利率、优惠、豁免、历史产品哪些规则隐含在核心参数、批处理或人工 SOP 中Product rule catalog、parameter extract、exception list
Accounts and customersCIF、账户、joint account、relationship、KYC、状态、冻结、限制客户和账户状态是否一对多、多对多、跨系统冲突Entity relationship map、source-of-truth register
Transactionsposting、pending、reversal、adjustment、hold、chargeback、fee金额、币种、日期、顺序、幂等、冲正语义如何表达Transaction taxonomy、sequence evidence
Ledgersubledger、GL、settlement、suspense、interest accrual新旧账务是否能每日、月末、期末对齐Ledger mapping、GL reconciliation
Batch日终、月终、计息、收费、对账、报表、文件传输哪些时间窗口不可打断, 哪些任务有上游下游Batch dependency graph、critical path
Interfaces渠道、CRM、支付、卡、征信、报表、数据仓库、供应商谁直接读写旧核心, 哪些字段被外部误用Interface inventory、consumer impact
Data quality历史编码、缺失、重复、格式漂移、手工修复哪些数据迁移前必须清理或例外披露DQ profile、defect backlog
Operations柜面、客服、后台、财务、投诉、风险、催收Coexistence 时员工看哪个系统, 如何解释差异SOP, training evidence, unified workbench plan
Vendor核心包、系统集成商、主机服务商、云、数据工具环境、接口、批量、补丁、支持时限、退出条款Contract controls、SLA、release calendar

5. Capability Slicing Model

不要按组织架构切, 也不要按数据库表切。用 business capability + data fact + operational responsibility 切。

5.1 Slice Candidate Table

Slice适合先迁信号需要谨慎信号典型策略
Customer profile read读多写少, 可容忍 freshness 标注KYC / consent / privacy 强耦合Read replica + canonical profile API
Product catalog / pricing display参数可抽取, 规则可版本化费用和利率直接影响账务Product rule registry + human-approved publication
Statement / transaction search历史查询压力大, 写入少实时 pending / dispute 状态复杂Event capture + searchable read model
Alerts and servicing tasks可作为运营辅助, 不直接改账错误任务会误导员工Workflow strangler + case evidence
Fee waiver workflow独立审批和回写点清晰费用账务和客户承诺敏感Draft/approve/write via policy-gated ACL
Deposit account opening新账户路径可隔离需要 KYC、funding、card、statement、GL 集成New capability + core account creation adapter
Loan servicing changes高业务价值还款计划、利息、征信、催收耦合强Shadow calculation and long parallel run
Posting engine / ledger现代化收益大客户余额和 GL 风险最高Shadow ledger before controlled cutover

5.2 Slice Readiness Score

Criterion1 分3 分5 分
Business value主要技术债明确运营效率或渠道价值直接释放收入、风险控制或战略产品能力
Coupling与核心 posting、batch、GL 深耦合有可隔离接口可通过 ACL 清晰隔离
Data readiness源数据混乱, DQ 未知已有 profiling 和主要缺陷有 manifest、DQ profile、lineage 和 fix path
Test oracle结果难以判定SME 可抽样判断可自动对账, 有 golden scenarios
Operational readinessSOP 不清可通过培训支持有 workbench、runbook、queue、escalation
Rollback feasibility回退不可行或损失大可手工补救可恢复路由、重放事件、补偿交易

建议先选择总分高但 ledger 风险中低的 slice, 用它打磨 ACL、evidence、reconciliation 和运营模型。


6. Reference Architecture

Digital / branch / contact center / partner channels
        |
        v
Experience orchestration and workflow layer
        |
        v
Anti-corruption layer
  intent API | canonical model | entitlement | validation | policy | idempotency
        |
        +--------------------------+--------------------------+
        |                          |                          |
        v                          v                          v
Legacy core estate          Modernized services          Core package platform
mainframe / batch /         account, product,            deposit, loan,
COBOL / package             workflow, ledger             servicing modules
        |                          |                          |
        v                          v                          v
Event capture and CDC        Shadow / active ledger       Vendor adapters
batch extract, journal,      read models, commands        parameter, batch,
file, API events             and workflow state           API, file exchange
        \__________________________|__________________________/
                                   |
                                   v
Control and evidence plane
  source-of-truth registry
  migration manifest
  mapping decision record
  reconciliation engine
  cutover command log
  customer harm monitor
  AI assist evidence
  audit / regulatory evidence pack

6.1 Architecture Principles

No direct channel dependency on legacy semantics.
No source-of-truth transfer without registry update and approval.
No core write without policy, idempotency, trace and recovery path.
No migration without population manifest and reconciliation.
No cutover without customer harm monitoring and rollback authority.
No AI suggestion without source citation, owner review and usage boundary.
No vendor black box without evidence obligations and exit path.

7. Anti-Corruption Layer Contract

ACL 是核心现代化的产品化边界, 不是技术临时层。

Contract areaMust defineEvidence
Intent API业务动作名称、输入、输出、错误码、幂等键、权限、风险等级OpenAPI / AsyncAPI、consumer approval、version history
Canonical model核心实体、字段语义、状态机、金额精度、日期语义、as-of timeModel registry、decision log、lineage
Mapping ruleslegacy field / package field 到 canonical / target 的规则Mapping table、rule owner、sample tests
Validation输入合法性、业务规则、risk policy、customer harm guardrailRule suite、policy decision log
Idempotency重复请求、重放、超时恢复、队列 drainIdempotency key spec、replay test
Sequencingevent ordering、batch cutover、transaction prioritySequence contract、late event handling
Error handling技术错误、业务拒绝、未知状态、人工升级Error taxonomy、runbook
Evidencetrace、event、source snapshot、approval、side effectEvidence event schema、retention profile

7.1 ACL Write Tiers

TierExampleAllowed pathControl
Read onlyaccount inquiry、transaction searchread replica or legacy read adapterFreshness and entitlement
Draftfee waiver draft、servicing note draftnew workflow onlyHuman approval before any core effect
Low-risk writepreference update、non-financial notepolicy-gated ACLIdempotency, audit log, rollback note
High-risk writefee reversal、hold release、limit change、account closuredual control or explicit approvalApproval, side-effect id, recon
Ledger-impacting writeposting、interest adjustment、loan schedule changerestricted command pathFormal change authority, reconciliation, rollback plan

8. Canonical Model and Event Capture

8.1 Canonical Objects

ObjectMinimum semanticsCommon traps
Customeridentity, relationship, segment, KYC status, consent, vulnerability flag把 CIF 当唯一客户真相, 忽视 joint / household / business relationship
Accountaccount id, ownership, product, status, open/close dates, restrictions状态码跨系统语义不同
Balanceledger balance, available balance, pending, holds, as-of time, currency没有 as-of time 和 freshness 导致客户误解
Transactionauthorization, posting, reversal, adjustment, fee, interest, source channel把 authorization 和 posting 混为一谈
Productproduct code, terms, fees, rates, eligibility, effective date旧产品代码和当前产品目录断裂
Loan scheduledue date, principal, interest, escrow, delinquency, payment hierarchy提前还款、追溯调整、催收状态复杂
Hold / restrictionreason, scope, start/end, legal/regulatory basis, release rule错误释放或错误保留造成客户伤害
GL mappingsubledger account, GL account, cost center, legal entity, period客户账和财务账对不上

8.2 Event Capture Contract

FieldPurpose
event_id全局唯一事件标识
event_typeaccount.opened, transaction.posted, fee.assessed, interest.accrued, hold.placed
source_systemlegacy core、core package、workflow、adapter
source_record_ref源记录、batch、journal、file、API call 指针
business_time业务发生时间或生效日期
capture_time捕获时间
as_of_time数据快照时间
sequenceordering / replay 控制
idempotency_key去重和恢复
schema_version事件契约版本
evidence_hash源证据或 payload hash

8.3 Read Replica Rules

RuleReason
每个 read model 显示 freshness 和 source防止员工或客户把副本当实时核心
高风险客户影响答案必须引用 source and as-of支持投诉、争议和审计
Replica 不执行核心写入防止绕过核心控制
Replica 缺失时有 degraded mode保障客服和运营可继续安全工作

9. Shadow Ledger and Parallel Run

9.1 Shadow Ledger Use Cases

Use caseWhy shadow ledger helpsExit criteria
Deposit interest验证日计息、月结、利率阶梯、例外利率多个周期差异在 tolerance 内, 差异已分类和关闭
Fee assessment验证费用触发、豁免、退费、税费样本和全量对账达标, 客服解释一致
Loan payment allocation验证本金、利息、罚息、费用、escrow 顺序还款计划和余额差异可解释
Holds and restrictions验证冻结、释放、可用余额影响客户伤害场景无重大差异
GL projection验证 subledger to GL mapping日终和月末对账达标

9.2 Reconciliation Dimensions

DimensionExamples
Balanceledger balance、available balance、principal、interest、escrow
Transactioncount、amount、type、status、posting date、effective date
Product rulerate、fee、waiver、limit、eligibility
Customer/account statusactive、closed、frozen、delinquent、charged-off
GLdebit/credit、legal entity、cost center、suspense
Operationscase status、manual adjustment、approval state
Customer impactrejected transaction、wrong fee、wrong statement、complaint

9.3 Break Taxonomy

Break typeRoot cause candidateOwner
Data defect源数据缺失、历史编码、重复、手工修复Data Owner
Mapping defect字段、状态、产品代码、日期、金额精度映射错误Data Architect / BA
Rule defect利息、费用、还款、状态机规则理解错误Product SME / Architect
Timing defectbatch、event sequence、as-of、late arriving dataIntegration Architect
Vendor defect核心包参数、API、batch extract、环境问题Vendor Manager
Operational defect人工调整、SOP、审批或培训问题Operations Lead
Evidence defecttrace、manifest、approval、hash、recon record 缺失Evidence Owner

10. Migration Evidence Factory

把 migration 当成 evidence factory, 不要当成一次性 ETL。

StageEvidence artifactAcceptance
Population selectionmigration population manifest范围、排除项、记录数、金额总计、hash、owner 明确
Source extractionsource snapshot certificateextract time、batch id、file checksum、record count、retention class
Data profilingDQ profile and defect list缺失、重复、异常、历史编码、不可迁移项分类
Mappingmapping decision record字段、状态、规则、产品参数、SME approval、sample evidence
Transformationtransformation run manifestcode version、parameter version、input/output counts、error report
Loadtarget load evidenceload id、target count、rejects、retry、environment
Reconciliationrecon reporttolerance、break count/value、aging、materiality、owner
Exception handlingexception register每个例外有 business decision、risk acceptance 或 fix evidence
Approvalmigration signoff packData、Business、Tech、Risk、Ops、Finance 按范围签署
Retentionevidence retention record保存位置、访问控制、保留期限、legal hold path

10.1 Data Migration Guardrails

GuardrailWhy
Source extract freeze must be explicit防止源数据继续变化导致对账失效
Population manifest before transformation防止范围漂移
Mapping changes require version and regression防止最后一刻改映射破坏已验证结果
Rejects must be business-classified技术 reject 可能代表客户影响
Recon tolerance must be pre-approved上线当天临时放宽 tolerance 是红色信号
Manual adjustment needs maker-checker防止人工补数成为风险入口
Migration evidence must be queryable by account/customer/product事故和投诉需要快速定位

11. Cutover, Downtime and Rollback

11.1 Cutover Strategy Table

Strategy适用场景优点风险Required evidence
Blue/green routing渠道可路由, 能快速回切回滚快数据写入一致性难Route log、dual-write policy、replay plan
Phased cohort按员工、分行、地区、产品、客户 cohort 逐步迁风险分散Coexistence 复杂Cohort manifest、customer impact monitor
Business-date cutover按日终/月末切对账语义清楚downtime / batch window 压力Freeze proof、batch completion、GL recon
Read-first cutover先切读, 后切写降低风险双读差异会影响客服Freshness disclosure、read recon
Shadow-to-active ledgershadow 达标后切 active posting信心高切换点和历史重放复杂Shadow scorecard、activation approval
Big-bang能力小或环境简单快速结束旧系统风险集中Full rehearsal、BCP、rollback proof

11.2 Cutover Command Log

每个 cutover command center 应记录:

  • 时间线、执行人、审批人、观察员。
  • 前置检查、环境状态、批处理状态、供应商状态。
  • 数据 freeze、queue drain、routing change、smoke test、reconciliation checkpoint。
  • 决策点、异常、风险接受、rollback 判断。
  • 客户影响、投诉、交易失败、余额差异、运营队列。
  • 恢复普通运营的批准证据。

11.3 Rollback Decision

Rollback 不是技术脚本, 是企业决策。

TriggerDecision ownerRequired action
Balance variance above materialityMigration Authority + FinanceStop affected capability, preserve evidence, assess customer impact
High-risk write uncertaintyTechnology Owner + RiskDisable write path, switch read-only or manual-first
Customer-visible misinformationBusiness Owner + Customer OpsStop impacted journey, issue corrected script, monitor complaints
Vendor package critical defectVendor Manager + Tech OwnerActivate vendor escalation, route around or rollback
Evidence loss for Tier 1 flowEvidence Owner + RiskStop or downgrade affected flow, local capture, preservation hold
BCP dependency failureBCP Lead + Command CenterMove to pre-approved degraded mode

Rollback plan 必须说明:

  • 恢复到哪个 source-of-truth 状态。
  • 已进入新路径的交易如何补偿、重放或人工处理。
  • 客户通知和客服脚本如何更新。
  • 财务、对账、报表如何处理 cutover window。
  • 证据如何 preserved, 哪些日志进入 hold。

12. Customer Harm Control Checklist

HarmPre-cutover controlRuntime signalRemediation
Wrong balanceFull balance recon, high-risk account samplingBalance variance, customer inquiry spikeFreeze affected accounts, manual review, corrected communication
Duplicate postingIdempotency test, queue drain proofDuplicate detection, recon breaksReverse duplicate, notify ops, root cause fix
Missing postingSequence and replay testPending aging, transaction count mismatchReplay or manual post with approval
Wrong feeFee engine shadow runFee complaint, fee varianceRefund/waive with approval and root cause
Wrong interestInterest shadow calculationInterest variance by productCorrect adjustment and customer explanation
Transaction declineEligibility and hold mapping testDecline rate spikeRoute rollback or manual override path
Lost servicing caseCase migration reconCase backlog, SLA breachRestore case, prioritize affected customers
Wrong statementStatement sample and period close reconStatement suppression / correction eventsCorrected statement process

PM 高级表达:

我不会把客户伤害当上线后的运营问题, 而是把它前置为 cutover gate、monitoring signal 和 remediation playbook。


13. Vendor / Core Package Integration

核心包现代化常见误区是把 vendor platform 当成目标架构答案。真正需要管理的是 vendor 的语义、配置、接口、证据和运营责任。

Vendor areaControl questionEvidence
Product parameters参数如何表达本机构产品规则、历史例外和未来变更Parameter workbook、approval、diff report
API / file interface接口语义、错误码、幂等、重试、版本如何治理Interface contract、test evidence
Batch and calendar核心包日终、月终、节假日、时区如何处理Batch calendar、run evidence
Data extract数据抽取字段、时间、完整性、重跑能力Extract manifest、checksum
Upgrade / patch供应商 release 是否影响映射、参数、API、证据Release impact assessment
Environment测试、parallel-run、performance、DR 环境是否足够Environment readiness report
Supportcutover window 响应、严重问题升级、SLASupport plan、contact tree
Exit / portability数据、配置、证据如何导出Exit plan、data portability test

13.1 Vendor Evidence Clauses

核心现代化合同和 SOW 应明确:

  • 交付物不只包含配置和接口, 还包含 mapping evidence、test evidence、runbook 和 operational support。
  • 供应商必须支持 parallel-run 数据抽取、reconciliation、defect triage 和 root cause evidence。
  • 供应商环境和支持窗口必须覆盖 cutover rehearsal、production cutover 和 stabilization。
  • 供应商缺陷分类、修复 SLA、patch regression 和 release note 必须进入项目证据链。

14. AI Assist Operating Model

14.1 Safe AI Use Cases

Use caseHow AI helpsControl
Requirements mining从需求、SOP、Jira、监管材料、供应商文档抽取规则、冲突和未决问题Source citation, reviewer, conflict log
Legacy code understanding解释 COBOL、copybook、JCL、batch dependency、SQL、参数文件Read-only access, code reference, SME validation
Mapping suggestion建议 legacy to canonical / target mappingMapping owner approval, sample tests, version
Data profiling narrative总结 DQ patterns、异常、边界样本Data classification, reproducible query
Test generation生成边界场景、negative tests、parallel-run comparison casesQA selection, golden set, no self-approval
Cutover rehearsal analysis聚合演练日志、耗时、错误、dependency delayRunbook owner review, time-stamped evidence
Defect clustering将 recon breaks / UAT defects 聚类为根因候选Issue owner confirms closure
Runbook copilot帮 command center 检索步骤、联系人、历史问题和判断依据Read-only, cited runbook version, no autonomous action

14.2 AI Must Not Own

Prohibited ownershipReasonEnforced by
Final migration decision涉及事实迁移、客户影响、财务和风险接受Steering / Migration Authority signoff
Unreviewed core writes可能改变余额、状态、费用、额度和客户权利Tool gateway, policy, approval, dual control
Customer-impacting cutover涉及服务中断、通知、交易和投诉Command center, BCP authority
Source-of-truth changes改变事实责任和审计边界SoT registry workflow
Reconciliation closure关闭差异代表接受事实和风险Recon owner and materiality policy
Compliance / audit conclusions需要授权角色判断Human approval and disclaimer
Customer remediation decision影响补偿、通知和客户权利Business, Legal, Compliance, Customer Ops

14.3 AI Evidence Fields

FieldPurpose
ai_use_case_id追踪 AI 辅助场景
source_refs证明建议基于哪些文档、代码、数据
prompt_version复现 AI 辅助过程
model_id模型和供应商追踪
reviewer_id人工复核责任
approved_useAI 输出被批准用于什么
rejected_reason防止错误建议被重复采纳
linked_artifact映射、测试、缺陷、runbook 或决策记录

15. Evidence and Control Matrix

Control IDControl objectiveControl activityEvidenceOwnerGate
CBM-001现代化范围和事实责任明确建立 capability inventory and SoT registryCapability map, SoT registryEA / Data OwnerProgram initiation
CBM-002Legacy constraints 被识别完成 batch, interface, product, data, vendor constraint assessmentConstraint inventoryArchitectSlice selection
CBM-003迁移切片可控对每个 slice 评分并批准 sequencingSlice scorecardProduct / EARoadmap approval
CBM-004新旧语义隔离ACL contract 覆盖 canonical model、policy、idempotency、evidenceACL spec, contract testsSolution ArchitectBuild gate
CBM-005数据迁移可证明每批迁移有 manifest、DQ、mapping、recon、approvalMigration evidence packData Migration LeadMigration gate
CBM-006Core writes 受控写操作通过 policy-gated tool/command gatewayWrite log, approval, side-effect idTech OwnerProduction readiness
CBM-007Parallel run 可信Shadow ledger / dual-run 覆盖高风险场景Parallel-run scorecardQA / Finance / SMECutover gate
CBM-008Customer harm 被前置建立 harm scenarios、signals、remediationHarm checklist, monitoring dashboardProduct / OpsCutover gate
CBM-009Cutover 可恢复演练 downtime、rollback、queue drain、BCPRehearsal report, rollback proofRelease / BCP LeadCutover approval
CBM-010Vendor evidence 可用供应商交付 mapping、test、defect、support evidenceVendor control packVendor ManagerVendor gate
CBM-011AI 辅助受治理AI suggestions 有引用、review、approval 和使用边界AI assist logAI Governance / PMOngoing
CBM-012运营准备充分SOP、培训、workbench、command center、support model 准备ORR packOperations LeadGo-live
CBM-013审计证据可查询Evidence events 支撑 account/customer/product/slice 查询Evidence catalog, audit queriesEvidence OwnerAudit readiness
CBM-014Legacy retirement 受控退役前确认数据保留、只读、归档、接口停用Retirement packEA / Ops / RecordsDecommission gate

16. Operational Readiness Review

ORR 必须覆盖业务、技术、数据、运营、供应商、风险和证据。

ORR areaEvidence
Business readiness产品规则确认、客户话术、费用/利息解释、投诉路径
Operations readinessSOP、培训、队列、手工 workaround、权限、值班
Technology readinessEnvironment、monitoring、capacity、release、rollback、DR
Data readinessMigration recon、DQ exceptions、retention、lineage
Finance readinessGL recon、adjustment process、period close impact
Risk readinessControls, exceptions, residual risk, approvals
Vendor readinessSupport window, escalation, defects, release freeze
Evidence readinessTrace, event, command log, evidence pack, audit queries
AI readinessUse-case registry, access control, prompt/model version, reviewer workflow

17. Resilience / BCP / Degraded Mode

17.1 BCP Scenarios

ScenarioDegraded modeKey evidence
Legacy core read unavailableServe limited cached read with as-of disclosure or manual inquiryCache freshness, customer script
Target service unavailableRoute back through legacy path where SoT remains legacyRoute log, rollback decision
ACL degradedDisable high-risk writes, allow approved read-only pathsPolicy decision, write block log
Vendor package outageManual-first or legacy fallback by capabilityVendor incident, BCP activation
Evidence pipeline lagLocal append-only capture, restrict Tier 1 writes if evidence missingLocal ledger, preservation record
Reconciliation engine unavailableDelay cutover or freeze affected migrationGate exception, risk decision
Identity / entitlement issueNo personalized or sensitive view, manual verificationAccess denial log
AI workspace unavailableContinue modernization with approved human processAI dependency note, no customer impact

17.2 Impact Tolerance Examples

DimensionExample tolerance
Customer-facing balance accuracyKnown incorrect balance display for Tier 1 account flows triggers immediate safe-stop
Transaction postingNo unclassified duplicate posting beyond approved materiality
Evidence completenessTier 1 cutover commands and core writes require complete trace and command log
Manual backlogNo regulatory or contractual deadline breach in complaint/dispute queues
Rollback timeRoute rollback must fit tested recovery window for each slice
Vendor responseCritical cutover defect must have named vendor escalation with response commitment

18. Architecture Runway Backlog

Runway itemPurposeFirst usable milestone
Source-of-truth registryTrack fact ownership by capability and data objectRegistry covers first two slices
Canonical model registryPrevent semantic drift across adaptersCustomer, account, balance, transaction v1
ACL platformGovern intent API, mapping, policy, idempotency, evidenceRead API and one controlled write path
Event captureFeed read models, recon, evidenceLegacy transaction and account events
Reconciliation engineAutomate variance detection and break workflowBalance and transaction recon
Shadow ledgerValidate target ledger or posting rulesOne product family in shadow mode
Migration evidence storeStore manifests, snapshots, mappings, reconQuery by account/product/population
Operational cockpitMonitor cutover and coexistenceCustomer harm and error dashboard
Vendor control packMake vendor obligations observableRelease calendar and defect evidence
AI governed workspaceEnable safe AI assistanceRead-only docs/code RAG with review workflow

19. 30 / 60 / 90 Roadmap

Day 0-30: Diagnostic and Control Foundation

WorkstreamOutputs
Scope and governanceCapability inventory, RACI, modernization principles, decision rights
Constraint discoveryBatch/interface/product/data/vendor constraint inventory
Source-of-truthInitial SoT registry by data object and capability
Slice selectionReadiness scorecard, first slice candidates, customer harm model
Evidence designEvidence taxonomy, migration artifact list, cutover command log template
AI governanceAI use-case registry, safe-use policy, forbidden boundaries, access controls

Exit criteria:

  • Management understands modernization as staged source-of-truth transfer.
  • First slices selected with explicit risks and evidence requirements.
  • AI workspace is read-only or review-gated, not connected to production writes.

Day 31-60: Build Runway and Prove One Slice

WorkstreamOutputs
ACLIntent API, canonical model v1, mapping tests, idempotency design
Event/read modelEvent capture for first slice, read replica with freshness metadata
Migration factoryPopulation manifest, DQ profile, mapping decision record, recon report
Parallel runShadow or dual-run for selected scenarios
OperationsSOP, workbench view, training, command center draft
VendorInterface evidence, environment readiness, support model
AI assistRequirements mining, mapping suggestions and test generation with review evidence

Exit criteria:

  • One non-trivial slice proves ACL, evidence, recon and operational model.
  • Break taxonomy and ownership are working.
  • Cutover rehearsal plan is approved.

Day 61-90: Rehearse Cutover and Scale Sequencing

WorkstreamOutputs
Cutover rehearsalFull timeline, queue drain, smoke test, rollback drill, evidence capture
Customer harm monitoringBalance/transaction/error/complaint signals and remediation path
BCPDegraded mode scenarios, impact tolerance, vendor escalation
GovernanceCutover gate, exception process, residual risk pack
Scale planNext 3-5 slices, architecture runway gaps, funding and team topology
Legacy retirementRead-only, archive, interface shutdown, records retention approach
AI operationsDefect clustering, rehearsal analysis, runbook copilot under read-only controls

Exit criteria:

  • First production cutover or controlled pilot has evidence-backed approval.
  • Rollback and degraded mode have been exercised.
  • Next slices have sequencing based on value, coupling, evidence and readiness.

20. Anti-Patterns and Corrections

Anti-patternWhy it failsCorrection
Treat modernization as procurement新核心包不会自动解决语义、数据、运营和证据问题Define transition architecture and operating model
Wrap legacy APIs only渠道继续依赖旧语义, 未来迁移更难Build ACL with canonical model and policy
Migrate by table表迁移不等于业务事实迁移Migrate by capability, account population and fact type
Close recon breaks with spreadsheet comments差异不可追溯, 审计和事故复盘困难Break workflow with owner, cause, evidence and approval
Let AI generate final mapping映射错误直接造成客户和账务风险AI suggestion, human approval, sample and regression
Ignore branch/contact center员工无法解释旧新差异, 投诉上升Unified workbench, script, training, escalation
Cutover without rollback authority发现问题时没人敢停或回切Named rollback owner and triggers
Evidence after cutover事故后补证据不可信Evidence-by-design in pipeline and command center
Retire legacy too early历史查询、争议、报表和 records retention 断裂Retirement gate with archive and read-only strategy

21. Interview Answers

Q1: 你如何规划核心银行现代化?

30 秒版本:

我会把核心现代化定义为受控的 source-of-truth 迁移, 而不是一次系统替换。先做 constraint inventory 和 customer harm model, 再按 capability slicing 选择低耦合、高价值、证据成熟的切片。架构上建立 ACL、canonical model、event capture、read replica、reconciliation engine、shadow ledger 和 evidence plane。每个 cutover 都要有 migration manifest、parallel-run scorecard、rollback、BCP、运营准备和授权签署。

2 分钟版本:

核心银行承载客户余额、交易、利息、费用、贷款计划、总账和监管证据, 所以现代化不能只看目标系统。我的第一步是盘点旧核心约束: mainframe batch window、产品参数、接口耦合、数据质量、供应商节奏、运营 SOP 和客户伤害场景。

第二步是 capability slicing, 不是按数据库表迁移。对每个 slice 评估业务价值、耦合度、数据成熟度、测试 oracle、运营准备和 rollback 可行性。先用一个中低风险但有真实价值的 slice 证明 ACL、canonical model、event capture、reconciliation 和 evidence process。

第三步是 evidence-backed cutover。迁移要有 population manifest、source snapshot、mapping decision、DQ profile、parallel-run result、recon breaks、exception approval、customer harm monitoring 和 rollback proof。AI 可以帮助理解文档和代码、建议映射、生成测试和分析缺陷, 但不能拥有最终迁移决定、核心写入、source-of-truth 变更或客户影响 cutover。

Q2: Strangler 和核心包替换如何结合?

30 秒版本:

核心包可以是目标能力平台, 但 strangler 是迁移方式。我要用 ACL 把渠道和业务流程从旧核心语义中解耦, 再按 capability 把读、工作流、产品参数、开户、服务变更、ledger 能力逐步迁到核心包或新服务。关键是每个能力都有 SoT 状态、对账证据和回滚策略。

2 分钟版本:

很多项目失败在于把核心包当完整目标架构, 忽视 transition。我的做法是把核心包能力纳入 target state, 但通过 strangler 控制迁移节奏。旧核心、核心包和新微服务会共存一段时间, 所以 ACL 必须提供 intent API、canonical model、mapping、policy、idempotency 和 evidence。

对适合核心包承接的能力, 例如标准存款账户、产品参数和部分服务流程, 我会用 vendor adapter 和 parameter governance。对差异化能力, 例如数字化开户体验、运营工作流、AI 辅助客服或数据产品, 可以放在外围服务。所有写入必须通过受控命令路径, 所有事实源迁移都进入 SoT registry。

Q3: 如何证明数据迁移质量?

30 秒版本:

我会建立 migration evidence factory。每批数据都有 population manifest、source snapshot、DQ profile、mapping decision、transformation run、target load、reconciliation report、exception register 和 approval pack。质量不是“装载成功”, 而是新旧业务事实在账户、交易、余额、利息、费用、状态和 GL 上可对账、差异可分类、例外可批准。

2 分钟版本:

数据迁移最危险的是只看技术行数和装载成功率。核心银行迁移必须从业务事实证明: 迁移了哪些客户和账户, 源数据冻结在什么时间, 字段和状态如何映射, 产品参数如何转换, 哪些数据有质量缺陷, 新旧余额和交易是否一致, 差异是否在预先批准的 tolerance 内。

我会把 recon breaks 分成 data defect、mapping defect、rule defect、timing defect、vendor defect、operational defect 和 evidence defect, 每类有 owner 和关闭证据。对 ledger-impacting 能力, 我会用 shadow ledger 或 parallel run 跨日终/月终周期验证, 再进入 cutover gate。

Q4: AI 在核心现代化里最大的价值和边界是什么?

30 秒版本:

最大价值是加速理解和验证: requirements mining、legacy code/document understanding、mapping suggestion、data profiling、test generation、cutover rehearsal analysis、defect clustering 和 runbook copilot。边界是 AI 不能做最终迁移决策、未经审批核心写入、客户影响 cutover、source-of-truth 变更、reconciliation closure 或合规审计结论。

2 分钟版本:

我会把 AI 放在 governed workspace 中, 让它帮助处理大量非结构化材料和复杂规则。比如从 COBOL、copybook、SOP、供应商文档和 Jira 中抽取规则冲突, 为字段映射提供候选, 根据历史缺陷生成测试, 聚类 parallel-run 差异, 分析 cutover 演练日志。

但 AI 输出必须有引用、版本、置信度、reviewer 和批准用途。高风险工具默认只读或 dry-run, 核心写入需要 policy gate、审批、幂等、side-effect id 和 recon。AI 不得自动关闭差异, 不得自动批准 mapping, 不得指挥 production cutover。这样既利用 AI 提效, 又不把客户事实和监管责任交给模型。

Q5: 如何向管理层解释为什么不能 big-bang?

30 秒版本:

我会用风险集中度解释。Big-bang 把数据、渠道、运营、供应商、客户影响、财务对账和证据风险集中到一个窗口, rollback 也最复杂。Strangler 不是慢, 而是把风险拆成可证明的 capability increments, 每个 increment 都交付价值、证据和学习。

2 分钟版本:

我不会简单说 big-bang 不行, 而是展示风险对比。核心银行不是单系统上线, 它牵动账户、交易、利息、费用、贷款、总账、客服、投诉、监管报表和历史证据。Big-bang 的问题是所有未知同时暴露, 且一旦写入新核心后 rollback 涉及交易补偿、客户通知和财务处理。

Strangler 让我们先迁读能力、运营工作流、产品目录、低风险写入, 同时构建 ACL、event capture、recon、shadow ledger 和 cutover muscle。每个切片都有客户伤害监控和回退路径。管理层看到的是有节奏的风险下降和能力释放, 不是无限期双系统。


22. Portfolio Exercise

为一家拥有 mainframe deposit core、第三方 loan servicing package、数字渠道和数据仓库的零售银行设计一个 90 天 modernization pilot:

Deliverable期望内容
Capability slice选择一个具体切片, 说明为什么不是先切 ledger
ACL contractIntent API、canonical objects、write tier、error taxonomy
Evidence modelMigration manifest、mapping、recon、cutover log、AI assist log
Parallel-run plan覆盖哪些场景、周期、tolerance、break taxonomy
Customer harm model3-5 个伤害场景、监控信号和补救路径
AI boundary安全辅助清单、禁止边界、review workflow
30/60/90 plan每阶段 exit criteria 和管理层汇报指标