返回 Papers
AI 底层逻辑 / 经典论文

AI Treasury / Liquidity:ALM 预测与压力证据架构

本文是学习、作品集和架构训练材料, 不构成法律意见、监管意见、流动性充足性结论、资本充足性结论、会计处理意见、模型验证报告、审计结论、投资建议、融资建议、风险偏好批准或应急融资执行建议。

610ai-foundations/papers/141-ai-treasury-liquidity-alm-forecasting-stress-evidence-architecture.md

AI Treasury / Liquidity / ALM Forecasting / Stress Evidence Architecture 解读

面向对象: CBAP+ Financial Retail PM / Senior BA / Treasury Product Architect / ALM Architect / Risk Data Product Lead / Model Risk Partner / Enterprise Architect / Board MI Owner。 核心问题: 金融机构如何把 AI 用在 treasury、liquidity forecasting、deposit beta/runoff、cash-flow ladder、ALM、IRRBB、FTP、portfolio scenario、stress testing、contingency funding 和 board MI 中, 同时让每个预测、压力假设、人工覆盖和资金行动都可解释、可追溯、可挑战、可审计? 学习目标: 建立 treasury/ALM 预测不是 generic prediction modeling 的心智模型, 形成 forecast-to-action architecture、stress evidence graph、model risk controls、committee operating model、data lineage、board MI 和面试可表达的产品架构语言。


0. Disclaimer

本文是学习、作品集和架构训练材料, 不构成法律意见、监管意见、流动性充足性结论、资本充足性结论、会计处理意见、模型验证报告、审计结论、投资建议、融资建议、风险偏好批准或应急融资执行建议。

正式项目必须由 Treasury、ALM、Finance、Risk、Model Risk、Legal、Compliance、Regulatory Reporting、Information Security、Data Governance、Architecture、Internal Audit、Business Line、Operations 和董事会/管理层授权委员会共同判断。具体适用范围、监管口径、报告义务、模型治理要求、数据使用限制和披露要求由 Legal / Compliance / Regulatory Affairs / Model Risk 等权责方确认。

本文把官方材料转成产品和架构训练语言。不要把任何 AI forecast、scenario output 或解释图直接视为流动性行动许可。资金调拨、批发融资、资产出售、hedging、FTP 调整、产品定价变更、客户沟通和 contingency funding action 必须走机构批准的 human committee、dual control、limit、policy 和 evidence process。


Source Anchors

访问日期: 2026-06-30。

SourceOfficial link本文使用方式
Federal Reserve SR 10-6, Interagency Policy Statement on Funding and Liquidity Risk Management用户给定路径: https://www.federalreserve.gov/supervisionreg/srletters/sr1006.htm ; Fed legacy anchor: https://www.federalreserve.gov/boarddocs/srletters/2010/sr1006.htm ; attachment PDF: https://www.federalreserve.gov/boarddocs/srletters/2010/sr1006a1.pdf用 liquidity governance、cash-flow measurement、stress testing、management reporting、diversified funding、liquid asset cushion、contingency funding plan 和 internal controls 组织 treasury/ALM 架构。用户给定的新式路径在正式引用前需复核可访问性。
Federal Reserve SR 11-7, Guidance on Model Risk Managementhttps://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm用 model inventory、development、implementation、use、validation、ongoing monitoring、effective challenge 和 model uncertainty 设计 forecast / stress model risk control。
OCC Comptroller's Handbook indexhttps://www.occ.treas.gov/publications-and-resources/publications/comptrollers-handbook/index-comptrollers-handbook.html作为 OCC handbook 官方入口, 用于正式项目中定位 Interest Rate Risk、Liquidity、Model Risk、Bank Supervision Process 等 booklet。
OCC Interest Rate Risk booklet bulletinhttps://www.occ.treas.gov/news-issuances/bulletins/2020/bulletin-2020-26.html用 IRR、NII、EVE、NMD assumptions、stress testing、model risk 和 FTP 的 supervisory architecture language 校准 IRRBB/ALM 讨论。
Federal Reserve SR 16-3, Interagency Guidance on Funds Transfer Pricing Related to Funding and Contingent Liquidity Riskshttps://www.federalreserve.gov/supervisionreg/srletters/sr1603.pdf用 FTP 作为把 funding risk、liquidity component、contingent liquidity risk 分配到产品/业务线/组合决策的架构锚点。
FFIEC Management booklethttps://ithandbook.ffiec.gov/it-booklets/management.aspx用 IT management、governance、risk management、resources、performance、change 和 assurance 语言连接 treasury AI 平台的 technology control。
NIST AI RMFhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 组织 AI forecast risk、explainability、monitoring、human oversight、incident 和 evidence。
ISO/IEC 42001https://www.iso.org/standard/42001用 AI management system、roles、operation、performance evaluation、internal audit 和 continual improvement 设计 enterprise AI treasury operating model。
Federal Register final policy statement on funding and liquidity risk managementhttps://www.federalregister.gov/documents/2010/03/22/2010-6137/interagency-policy-statement-on-funding-and-liquidity-risk-management用官方发布记录交叉锚定 SR 10-6 policy statement 的 stress testing、management reporting、CFP、liquid asset cushion 等主题。

一句话:

AI Treasury Architecture is not "predict tomorrow's balance". It is a governed forecast-to-action and stress-evidence system for liquidity survival, ALM decisions, FTP incentives and board challenge.


1. Thesis

Treasury / Liquidity / ALM 场景中的 AI 预测, 不能按普通 demand forecast 处理。普通预测的中心问题是:

What value will happen next?

Treasury/ALM 的中心问题是:

Under which balance-sheet, rate, customer, market, legal-entity and operational conditions
will the institution have enough usable liquidity,
at what cost,
with what management actions,
and what evidence proves that the assumptions were understood, challenged and acted on?

成熟架构的变化是:

from: spreadsheet cash position + static ALM model + committee slide pack
to: data-lineage-backed forecast + scenario/stress engine + model-risk controls
    + ALCO decision workbench + contingency action register + board MI evidence graph

AI 的价值不是替代 Treasurer 或 ALCO。AI 的价值是把以下能力产品化:

  1. 更快识别 deposit behavior、payment flow、collateral、funding market 和 rate scenario 的变化。
  2. 把 NMD beta、runoff、cash-flow ladder、NII/EVE、FTP、portfolio mix 和 CFP trigger 放在同一个 scenario fabric 中。
  3. 用 probabilistic forecast、stress overlay、driver decomposition 和 uncertainty interval 支持行动选择。
  4. 把人工覆盖、committee challenge、限额状态、action rationale 和后验表现保存在同一条证据链。
  5. 让董事会看到的不是模型分数, 而是 liquidity position、stress survival horizon、assumption changes、control effectiveness 和 management action。

核心原则:

Treasury AI should recommend insight, not execute liquidity.
Forecast precision without action governance is operational risk.
Stress output without assumption lineage is weak evidence.
Deposit beta without segment and scenario is a dangerous scalar.
ALM model output without committee challenge is not decision architecture.
Board MI without source-to-action traceability is presentation, not oversight.

2. Treasury / ALM Domain Mental Model

Treasury/ALM 架构要同时处理四类不确定性:

Uncertainty典型问题AI/Architecture implication
Behavioral uncertainty存款客户是否迁移、提现、转到高收益产品、集中流出deposit beta/runoff 需要按 segment、relationship、rate sensitivity、concentration 和 event condition 建模
Market uncertainty利率、yield curve、credit spread、market liquidity、collateral haircut 变化scenario engine 必须连接 rate curves、FTP、EVE/NII、liquid asset value 和 funding cost
Operational uncertaintypayment rail、settlement、intraday flows、系统故障、现金配送和 collateral movementcash-flow ladder 要区分 contractual、behavioral、operational 和 intraday liquidity
Governance uncertainty管理层何时行动、能否执行、是否有 legal/entity/operational constraintforecast-to-action 必须记录 trigger、decision right、dual control、CFP action 和 evidence closure

Treasury/ALM 的关键对象不是单一时间序列, 而是 balance-sheet graph:

customer / counterparty
  -> account / product / legal entity
  -> contractual cash flow
  -> behavioral cash flow
  -> funding source / funding need
  -> collateral / encumbrance
  -> liquidity buffer
  -> scenario / stress assumption
  -> limit / risk appetite
  -> committee decision
  -> management action
  -> board evidence

2.1 Liquidity, ALM, IRRBB, FTP 的边界

DomainMain questionTypical outputsAI risk if blurred
Liquidity risk是否能满足到期和压力下资金流出cash-flow ladder、survival horizon、liquidity buffer、CFP trigger把盈利预测误当成流动性可用性
Treasury operations今天/本周如何调拨和融资cash position、intraday need、funding execution、collateral movementAI 自动建议资金动作但缺少审批和双控
ALM资产负债结构是否稳健repricing gap、NII、EVE、duration、basis risk、optionality只优化短期 NII, 忽略长期经济价值和流动性
IRRBB利率变化如何影响 banking book earnings/economic valueNII/EAR、EVE、scenario sensitivity、NMD assumption把 NMD assumptions 固定化, 看不到客户行为漂移
FTPfunding/liquidity cost 如何分配到产品和业务线FTP curve、liquidity premium、contingent liquidity charge定价激励与真实资金风险脱节
Stress testing极端但合理情景下如何表现idiosyncratic/market-wide/combined stress result、management action生成压力故事但没有 action feasibility 和 evidence

PM/Architect 要把这些域连接起来, 但不能混成一个黑箱模型。


3. Why AI Changes Treasury Architecture

AI 对 treasury/ALM 的改变, 不是"更复杂模型更准"。真正变化在于: 预测、压力、解释、行动和证据的周期变短。

Pressure旧模式AI-enabled architecture
Deposit mobility月度或周度看余额变动日内/日级识别 rate-sensitive segment、large depositor concentration、digital outflow pattern
Rate regime shifts静态 beta / average life 假设分 regime、分产品、分客户关系、分竞争利率的 behavioral assumption engine
Stress narrative手工定义少数情景scenario library + driver graph + reverse stress + sensitivity sweep
ALCO pack人工拼表和截图metric contract + lineage + board MI data product
Forecast challenge事后解释误差challenger model、backtesting、interval coverage、reason code、assumption drift alert
CFP readiness文档化计划trigger-to-action workflow、role readiness、funding capacity evidence、dry-run record
FTP定期调整曲线scenario-aware FTP simulation, 将 contingent liquidity cost 反馈到产品/组合激励

AI 同时放大风险:

  • False comfort: 模型给出平滑 P50, 掩盖尾部流动性风险。
  • Procyclicality: 模型根据近期流出提高假设, 触发过度收缩或激励错配。
  • Explainability gap: ALCO 无法解释为什么 deposit beta 或 runoff assumption 改变。
  • Data leakage: 使用事后可见的余额调整、pricing campaign 或 exception data 训练当时不可见的预测。
  • Model misuse: 短期 cash forecast 被拿去支持长期 ALM 或 FTP 决策。
  • Automation bias: 委员会接受 AI scenario ranking, 没有独立 challenge。
  • Evidence break: 董事会看到的 survival horizon 无法追溯到数据版本、模型版本、scenario version 和人工覆盖。

高级设计的目标不是消除 judgment, 而是让 judgment 结构化、可挑战、可复盘。


4. Reference Architecture

source systems
  core deposits / loans / cards / payments / wires / ACH / treasury trades
  GL / product pricing / CRM / client concentration / collateral / securities
  market rates / yield curves / macro / competitor rates / news-event signals
  legal entity / regulatory reporting / limits / committee records

-> as-of data product layer
     balance snapshots
     cash-flow events
     deposit behavior features
     rate and market data
     collateral and encumbrance
     customer / segment / legal-entity mapping
     data quality and lineage

-> AI and analytics layer
     baseline forecasts
     deposit beta and runoff models
     cash-flow forecasting
     anomaly / early-warning signals
     scenario generator and stress overlays
     challenger models
     explanation and driver decomposition

-> treasury risk engines
     liquidity ladder
     stress survival horizon
     ALM / IRRBB NII and EVE engines
     FTP simulation
     portfolio scenario engine
     contingency funding capacity

-> decision layer
     ALCO / Treasury Risk workbench
     limit and trigger monitor
     action recommendation with constraints
     human committee challenge
     dual-control action registration

-> evidence and MI layer
     forecast contract
     model inventory
     scenario version
     data lineage
     override record
     action log
     board MI pack
     audit evidence export

4.1 Architecture Capabilities

CapabilityMust doFailure mode if missing
Forecast contract registry记录 grain、horizon、cadence、decision use、owner、allowed action同一预测被拿去支持不兼容决策
As-of data lineage区分 event time、available time、restatement、cutoff、source owner训练和回测发生 leakage, 董事会数字无法复算
Deposit behavior feature store管理 rate offer、relationship depth、segment、concentration、digital behavior、large depositor flagsbeta/runoff 退化为粗糙平均值
Scenario library管理 base、idiosyncratic、market-wide、combined、reverse stress、operational stress压力测试变成一次性故事
ALM/IRRBB integration把 cash-flow assumption、NMD assumption、prepayment、rate curve 和 NII/EVE engine 对齐liquidity forecast 与 ALM 结果互相矛盾
FTP simulation将 funding cost、liquidity premium、contingent liquidity cost 分配到产品/业务线业务线指标鼓励消耗稳定性和尾部流动性
Human committee workflow记录 ALCO challenge、覆盖理由、批准人、action boundaryAI 输出成为事实标准, 有效挑战缺失
Evidence ledger串联 data version、model version、scenario、output、override、decision、action、MI审计或复盘时只能解释 PPT, 不能解释事实链
Board MI data product从 metric contracts 生成 liquidity / ALM / stress / action 指标董事会看到故事, 看不到 threshold、owner 和 closure

4.2 Canonical Data Entities

EntityMinimum fields
Deposit accountaccount_id, product_type, customer_id, segment, legal_entity, insured_flag_or_bucket, balance, rate_paid, repricing_terms, relationship_tenure
Deposit behavior eventevent_id, account_id, event_time, available_time, inflow_outflow, transfer_channel, pricing_event_id, balance_after, event_reason_code_if_available
Customer / counterparty concentrationcustomer_id, relationship_group, segment, aggregate_balance, top_depositor_rank, operational_relationship_indicators
Cash-flow eventcashflow_id, source_system, contractual_date, expected_date, amount, currency, legal_entity, product, confidence_band, behavioral_flag
Market and rate statecurve_id, rate_type, tenor, value, as_of_time, source, scenario_applicability
Scenarioscenario_id, type, horizon, rate_path, runoff_shock, wholesale_funding_shock, collateral_haircut, operational_assumption, approval_status
Model outputoutput_id, model_id, model_version, forecast_contract_id, scenario_id, horizon, quantile, value, confidence, explanation_ref
Treasury actionaction_id, trigger, proposed_action, owner, approval_forum, constraint_check, execution_status, evidence_ref
Committee decisiondecision_id, forum, date, challenge_points, approved_action, override_reason, residual_risk_owner
Board MI metricmetric_id, definition_version, source_lineage, threshold, status, action_link, reporting_period

5. Forecast Contracts: 预测必须绑定决策

Treasury AI 的第一个产品产物不是模型, 而是 Forecast Contract。

FieldTreasury/ALM definition
Decision usecash positioning、liquidity buffer sizing、ALCO monitoring、FTP update、pricing action、portfolio rebalance、CFP trigger
Grainaccount / customer group / product / branch / legal entity / currency / business line / day or intraday bucket
Horizonintraday、1-7 days、30 days、90 days、1 year、multi-year ALM horizon
Outputpoint, quantile, interval, scenario distribution, stress survival horizon, NII/EVE impact
Action boundaryread-only alert、committee review、pricing change proposal、funding execution request、CFP activation recommendation
Required explainabilitydriver decomposition、segment contribution、rate sensitivity、scenario assumption lineage、uncertainty
OwnerTreasury owner、ALM owner、model owner、data owner、business line owner
Evidencedata cutoff、model version、feature version、scenario version、backtest、override、committee minutes
Prohibited use用短期 operational forecast 替代 regulatory reporting、capital plan 或正式 liquidity adequacy conclusion

示例:

Forecast contract: retail NMD runoff under rate-up and confidence shock
grain: day x product x segment x legal entity
horizon: 30 / 90 days
outputs: P50/P90 runoff, beta estimate, segment contribution, top depositor concentration effect
decision use: ALCO review, liquidity buffer monitoring, FTP sensitivity, pricing strategy challenge
action boundary: no automatic funding action; red status triggers ALCO/Treasury Risk review
evidence: as-of deposit data, pricing offers, competitor rate feed, model version, scenario version, overlay reason

6. Deposit Beta And Runoff Architecture

Deposit beta 不是一个全行常数。Runoff 也不是余额时间序列外推。更成熟的模型对象是:

deposit behavior under a specified rate, confidence, relationship and liquidity scenario

6.1 Segmentation Logic

DimensionWhy it mattersArchitecture note
Product typechecking、savings、money market、CD、brokered、sweep 行为不同product taxonomy 必须与 core banking and ALM mapping 对齐
Customer segmentretail、SMB、commercial、municipal、wealth、digital-onlysegment 变化本身要有 lineage, 不能事后重分群污染回测
Relationship depthpayroll、loan、card、merchant、treasury management、direct depositrelationship features 要能解释 stickiness, 不只是余额规模
Balance concentrationlarge depositor / top-N / uninsured-like concentrationconcentration changes 要进入 early warning and board MI
Rate sensitivitypaid rate、offer rate、competitor spread、repricing delaybeta 应按 regime、product、segment 和 pricing action 估计
Digital mobilityonline transfer behavior、external linked account、recent app activity可能是 runoff acceleration signal, 但要注意数据使用边界
Event exposurenews、social、branch closure、service outage、downgrade、local stressevent features 只作为 risk signal, 不应直接替代 committee judgment

6.2 Beta / Runoff Model Portfolio

Model typeBest useControl requirement
Rule / expert baseline快速解释、低数据区间、model challenger必须保留作为 honest baseline
Elasticity / regression model估计 rate paid 与 market rate movement 的 beta检查 regime shift、multicollinearity、pricing campaign leakage
Survival / hazard model账户或客户关系层面 runoff timingreason code、censoring、cohort drift 要明确
Gradient boosting非线性 segment behavior and interactionfeature availability and monotonic sanity checks
Hierarchical Bayesian / pooled model多 segment、低样本、共享信息prior rationale、credible interval、sensitivity
Probabilistic time-series model多 horizon balance and flow forecastinterval coverage、tail calibration、scenario conditioning
Anomaly / early warning非典型流出、集中提款、渠道异常false positive process, escalation threshold
LLM / NLP signal classifier新闻、客户留言、call center notes 中识别 confidence stress signal不应直接驱动资金动作; 需要 source grounding、human review、privacy/security control

6.3 设计原则

Weak patternBetter pattern
"全行 NMD beta = 35%"beta by product x segment x scenario x rate regime, with uncertainty
用月末余额训练日级流出使用 daily/intraday flow events, 区分 event time and availability
只看账户余额加入 customer group concentration and relationship stickiness
用近期平稳期回测加入 rate shock, confidence event, competitor rate gap, out-of-sample stress windows
用单模型给 ALCObaseline + AI model + challenger + overlay + committee challenge
SHAP 排名当解释driver decomposition + segment contribution + scenario assumption + data lineage + action implication

7. Cash-Flow Forecasting And Liquidity Ladder

Cash-flow architecture 要同时区分四种 cash flow:

Cash-flow typeExamplesModeling note
Contractualloan maturities, security coupons, wholesale funding maturity, CD maturitydates and amounts are known but settlement and optionality still matter
BehavioralNMD runoff, loan prepayment, drawdowns, card/payment behaviorAI 可增强, 但必须带 uncertainty and scenario
OperationalACH/wire/card settlement, payroll, tax dates, branch cash, ATM cash, merchant settlement日历、cutoff、holiday、payment rail failure 是关键
Contingentcredit line draws, collateral calls, margin, unfunded commitments, liquidity facilities与 stress scenario and FTP contingent liquidity cost 绑定

Liquidity ladder 不应只展示 net cumulative cash flow。它至少要展示:

opening liquidity
+ expected inflows
- expected outflows
+ usable liquidity sources
- encumbered / trapped / operationally unavailable liquidity
- stress runoff and contingent outflows
= survival horizon and buffer surplus / shortfall

7.1 Liquidity Ladder Design Table

LayerRequired fieldsBoard / ALCO question
Opening positioncash, reserves, HQLA-like buffer, collateral, currency, legal entity现在可用流动性在哪里, 是否可转移?
Expected flowsinflow/outflow by product, payment rail, business line, date bucket哪些流入/流出是高置信度?
Behavioral overlayP50/P90 runoff, confidence shock, top depositor movement行为假设变化来自哪里?
Contingent flowscommitted lines, collateral calls, wholesale rollover, undrawn commitments压力下哪些承诺会变成现金需求?
Liquidity sourcesmarketable securities, secured borrowing, central bank/FHLB-like lines, cash actions可执行容量是否已测试?
Constraintslegal entity, currency, operational, settlement, collateral eligibility, policy limit账面 liquidity 是否真正可用?
Actionspricing, funding, asset sale, collateral movement, CFP activation谁批准, 多快执行, 证据是什么?

8. ALM / IRRBB Integration

AI treasury forecast 必须进入 ALM/IRRBB 架构, 但不能吞掉 ALM discipline。

8.1 AI 在 ALM 中的合理角色

RoleExampleBoundary
Assumption challenger识别 NMD average life、repricing rate、prepayment、deposit migration 的漂移challenger 不是自动替代 approved ALM assumption
Scenario expander生成 rate path、spread widening、customer behavior、portfolio mix 的组合情景scenario 必须可命名、可复现、可批准
Sensitivity analyzer解释 NII/EVE 对 beta、runoff、prepayment、basis risk 的敏感性不应只输出 single best scenario
Data quality monitor检查 ALM input completeness、stale curves、mapping breaksquality gate 可以阻止 MI 发布
Narrative assistant生成 ALCO pack 初稿、解释变动必须 source-grounded and reviewed

8.2 NII / EVE / Liquidity 的组合视角

Decision conflictExampleArchitecture response
NII vs liquidity提高存款利率保留资金, 短期压缩 NIMscenario shows liquidity survival benefit and FTP cost allocation
EVE vs NII长久期资产提升收益但加大经济价值敏感性ALM engine must show NII and EVE side by side
Growth vs contingent liquidity大量未用授信承诺带来收入但压力下 drawdownFTP contingent liquidity charge and stress draw assumption
Centralized liquidity vs legal entity constraint集团流动性充裕但某实体受限制legal-entity liquidity ladder and transfer constraint
Customer retention vs pricing discipline高收益促销留存 rate-sensitive balancesproduct profitability includes funding cost and runoff probability

8.3 IRRBB Assumption Register

AssumptionAI supportEvidence needed
NMD betaestimate by product/segment/regimehistorical fit, competitor rate context, uncertainty, committee approval
NMD average liferunoff/survival analysiscohort behavior, stress sensitivity, backtest
Loan prepaymentborrower behavior and rate incentivemodel validation, economic logic, scenario response
Deposit migrationchecking to MMDA/CD/brokerage/high-yield movementmigration matrix, pricing campaign data, customer segment
Non-parallel rate shockcurve scenario librarycurve source, scenario definition, NII/EVE impact
Basis riskmismatch between product rate and market curvemapping, spread behavior, sensitivity

9. FTP And Portfolio Scenario Architecture

FTP 是把 Treasury insight 变成业务激励的桥。没有 FTP, AI liquidity forecast 容易停留在风险报告; 有错误 FTP, 业务线会被错误激励。

9.1 FTP Decision Table

Business activityFunding / liquidity riskFTP implicationEvidence
Long-term fixed-rate lendingterm funding, interest-rate mismatchterm liquidity cost and hedge/funding componentcurve version, duration, liquidity premium
Transaction depositsstable operational funding benefitfunding benefit only if stability evidence existsbeta/runoff assumptions, relationship metrics
High-yield promotional depositrate-sensitive and runoff risklower stability credit, higher liquidity chargecampaign cohort, competitor spread, attrition behavior
Credit commitmentscontingent liquidity needcontingent liquidity chargedrawdown stress assumption, utilization history
Wealth / brokerage sweepfast migration riskscenario-specific runoff chargechannel mobility, top depositor concentration
Trading or securities portfoliomarket liquidity and collateral haircutliquidity buffer and haircut costmarket depth, encumbrance, haircut scenario

9.2 Portfolio Scenario Loop

business growth plan
  -> forecast deposit / loan / commitment / securities trajectory
  -> apply rate and liquidity scenarios
  -> calculate liquidity ladder, NII, EVE, FTP charges
  -> identify incentive mismatch and limit pressure
  -> propose portfolio / pricing / funding actions
  -> committee challenge and approval
  -> monitor actual behavior against scenario

PM implication:

The product manager for treasury AI is not optimizing model accuracy. They are aligning product growth, funding cost, contingent liquidity, ALM risk and management action through a traceable scenario loop.


10. Stress Evidence Architecture

Stress testing 的架构价值在于证据链, 不是情景数量。

10.1 Scenario Taxonomy

Scenario typeExampleRequired evidence
Base forecastnormal balance sheet and rate pathforecast contract, data cutoff, model version
Idiosyncratic liquidity stressinstitution-specific confidence event, concentrated depositor runofftrigger definition, runoff assumption, top depositor exposure
Market-wide stressrate shock, funding market freeze, collateral haircut, spread wideningmarket scenario source, curve/haircut mapping
Combined stressconfidence event plus market stresscorrelation assumption, management action capacity
Reverse stress找到 survival horizon 破裂条件failure threshold, scenario path, assumptions
Operational stresspayment rail outage, collateral movement delay, treasury system outageoperational dependency, BCP/DR evidence
Intraday stressunexpected payment outflows before funding source availableintraday data, cutoff, liquidity buffer

10.2 Stress Evidence Graph

scenario_id
  -> assumption_id
       -> source_data_version
       -> expert_judgment_record
       -> model_output_id
       -> overlay_id
  -> liquidity_ladder_result
  -> ALM / IRRBB result
  -> FTP / portfolio result
  -> limit status
  -> committee challenge
  -> management action
  -> board MI metric
  -> post-event backtest / lessons learned

10.3 Evidence Pack

Evidence artifactPurpose
Forecast Contract证明预测用途、边界、粒度、horizon 和 owner
Scenario Definition Card证明压力情景参数、来源、版本和批准状态
Assumption Register证明 beta、runoff、drawdown、prepayment、haircut、rollover 假设有 owner and rationale
Data Lineage Snapshot证明使用的 as-of data、quality checks、source systems and cutoff
Model Card / Validation Summary证明模型用途、限制、backtesting、challenger、monitoring and known limitations
Overlay Record证明人工调整原因、批准人、影响和有效期
Committee Challenge Record证明 ALCO / Treasury Risk 提问、反驳、决定和 residual risk owner
Action Register证明采取或未采取行动的原因、批准、执行状态和 closure
Board MI Tile证明董事会看到的数字来自同一 evidence graph

11. Explainability: 从"模型解释"到"行动解释"

Treasury/ALM explainability 不是给 SHAP chart。它要回答六个行动问题:

  1. 为什么 liquidity outlook 变差?
  2. 哪些 segment、product、legal entity、currency、payment rail 或 depositor group 贡献最大?
  3. 变化来自真实行为、市场利率、模型版本、数据质量、场景参数还是人工覆盖?
  4. 这个变化对 cash position、survival horizon、NII、EVE、FTP 和 limits 的影响分别是什么?
  5. 哪些管理行动可用, 每个行动的速度、成本、约束和副作用是什么?
  6. 委员会为什么接受、调整或拒绝 AI 建议?

11.1 Explainability Layers

LayerQuestionOutput
Data explanationWhich source changed?source delta, data quality status, cutoff
Behavior explanationWhich customers/products moved?segment contribution, concentration, migration matrix
Scenario explanationWhich assumption drives stress result?sensitivity tornado, scenario delta
Model explanationWhy did forecast model change?feature contribution, model drift, challenger comparison
ALM explanationHow does it affect NII/EVE?NII/EVE bridge and assumption impact
Action explanationWhat should management consider?action options, constraints, owner, residual risk
Governance explanationWho challenged and approved?committee record and override rationale

Memory rule:

Explainability for treasury AI = why the number changed + why the action is or is not justified.

12. Human Committee And Decision Rights

Treasury AI 要嵌入 committee operating model, 不是绕过它。

Forum / RoleDecision rightsAI supportEvidence
Treasury deskdaily positioning, funding execution within approved limitscash forecast, intraday alert, liquidity source viewexecution log, limit check
ALCObalance-sheet strategy, liquidity/IRR appetite, FTP reviewscenario package, NII/EVE sensitivity, deposit behavior trendsminutes, decision record, action owner
Treasury Risk / Market Riskindependent challenge of assumptions and limitschallenger views, limit breaches, stress analysischallenge log, issue register
Model Riskmodel inventory, validation, monitoring, model limitationsvalidation dashboard, drift and backtest evidencevalidation report, issue closure
Finance / FP&Aplan, NIM, funding cost, portfolio performanceforecast-to-plan bridge, FTP impactsfinance reconciliation
Legal / Complianceapplicability, disclosure, customer communication, regulatory interpretationsource-grounded evidence bundlelegal/compliance-owned decision record
Internal Audittest control design and operationevidence export, lineage sampleaudit test result
Board / Risk Committeeoversight, risk appetite, challenge, material action awarenessboard MI, thresholds, stress survival, management actionsboard pack and minutes

Decision boundary:

AI can detect, forecast, explain, simulate, draft and recommend.
AI should not independently approve liquidity risk appetite, execute emergency funding,
change FTP policy, sell assets, alter customer pricing, or declare regulatory adequacy.

13. Board MI Architecture

Board MI 必须从 production facts and evidence 生成, 不是 quarterly manual slide assembly。

13.1 Board Metrics

CategoryExample metricsDecision use
Liquidity positionliquidity buffer, usable liquidity by legal entity/currency, encumbrance是否在 appetite 内
Survival horizonbase / idiosyncratic / market-wide / combined stress days是否触发 management action
Deposit behaviorbeta by segment, runoff by segment, top depositor concentration, rate-sensitive balance是否需要 pricing/funding/portfolio action
Cash-flow reliabilityforecast error by horizon, interval coverage, data freshness是否相信 forecast-to-action
ALM / IRRBBNII/EVE sensitivity, NMD assumption changes, basis risk是否调整 balance sheet or hedge strategy
FTP / incentivesFTP charge movement, products below funding cost, contingent liquidity cost是否调整产品增长激励
CFP readinesstested funding capacity, action execution time, dry-run results是否能执行应急计划
Governancemodel validation status, issue aging, override rate, evidence completeness控制是否有效
Management actionred/amber actions, overdue remediation, residual risk acceptance管理层是否行动

13.2 Board Tile Example

Decision topic: Retail deposit runoff and liquidity buffer under 90-day combined stress

Status: Amber
Driver: digital savings and money-market deposit runoff P90 increased by 18% vs last quarter.
Evidence: competitor-rate spread widened; top-50 depositor concentration unchanged; model challenger agrees directionally but lower magnitude.
Impact: survival horizon down from 54 to 43 days under combined stress; NII downside increases under high-rate retention strategy.
Management action: ALCO approved targeted retention pricing for operational relationship segments, no broad promotional campaign; FTP charge updated for high-rate promotional balances.
Controls: model limitation noted for new digital-only cohort; human overlay approved for 60 days; validation backtest scheduled after next month-end actuals.
Decision requested: Board Risk Committee to note amber status and challenge management on CFP dry-run completion.

14. PM / BA / Architect Implications

RoleWhat "good" looks like
PMDefines decision products: ALCO workbench, liquidity forecast contract, stress evidence pack, board MI, not "AI prediction dashboard".
BACaptures treasury rules as metric contracts, scenario definitions, action boundaries, exception flows, committee decisions and evidence requirements.
ArchitectBuilds lineage-first data products, scenario services, model registry, action workflow, evidence ledger and integration with ALM/FTP engines.
Data Product OwnerOwns as-of data quality, cash-flow event model, deposit behavior features, curve data, product taxonomy and legal-entity mapping.
Model Risk PartnerEnsures inventory, independent validation, challenger models, ongoing monitoring, limitations, issue closure and effective challenge.
Treasury / ALM SMEOwns assumptions, management actions, risk appetite interpretation, CFP feasibility and ALCO narrative.

Minimum product artifacts:

ArtifactPurpose
Treasury Forecast Contract Catalog用途、粒度、horizon、action boundary
Cash-Flow Event Model统一 contractual / behavioral / operational / contingent flow
NMD Assumption Registerbeta、runoff、average life、repricing by segment
Scenario Librarybase, stress, reverse stress, operational and intraday scenario
ALCO Decision Workbench Speclimit, scenario, explanation, committee challenge, action
Evidence Graph Schemadata -> model -> scenario -> decision -> action -> MI
Board MI Metric Contracts定义、阈值、owner、lineage、cadence
Model Risk Packmodel card、validation、backtest、monitoring、limitations

15. Anti-Patterns

Anti-patternWhy dangerousBetter practice
Generic balance forecast dashboard预测没有绑定资金动作和风险偏好Forecast Contract + Action Boundary
One beta for all deposits掩盖产品、客户、rate regime 和 concentration 差异Segment/scenario beta with uncertainty
Accuracy-only model selectionP50 准不代表压力尾部可用interval coverage, tail stress, challenger and judgment overlay
ALCO pack assembled manually数字不可复算, 证据断裂MI data product and evidence ledger
LLM writes stress narrative from ungrounded context生成漂亮但不可审计的解释source-grounded narrative with cited data lineage
Stress scenario without feasible action只证明会出问题, 不证明能响应trigger-to-action workflow and CFP dry-run evidence
FTP not connected to runoff/liquidity forecast业务线继续追逐错误激励scenario-aware FTP and portfolio feedback
Model Risk only at annual reviewtreasury risk changes faster than review cadenceongoing monitoring, drift, issue and override dashboards
Board sees only status color无法挑战原因、阈值和行动driver bridge, threshold, owner, action and residual risk

16. Interview-Ready Language

Q1: How would you design AI for treasury liquidity forecasting without making it a black-box model?

30 秒版本:

I would not start with a model. I would start with forecast contracts tied to treasury decisions: daily liquidity positioning, ALCO monitoring, CFP triggers, FTP sensitivity and board MI. The architecture needs as-of data lineage, deposit behavior features, scenario library, baseline plus challenger models, ALM/IRRBB integration, human committee workflow and an evidence ledger that links data, assumptions, outputs, overrides and actions.

2 分钟版本:

In treasury, the risk is not just forecast error; it is using a forecast for the wrong decision. I would separate cash-flow forecasting, deposit beta/runoff, ALM/IRRBB assumptions, FTP and stress testing into connected but governed services. Each forecast has a contract: grain, horizon, decision use, allowed action, explainability and evidence. For deposit behavior, I would model beta and runoff by product, segment, relationship, concentration and rate regime, with uncertainty and challenger models. The output feeds a liquidity ladder, ALM NII/EVE engine, FTP simulation and ALCO workbench. No AI output directly executes funding or pricing actions; committee challenge, dual control, limit checks and evidence capture are part of the product.

Q2: What is the difference between generic forecasting and treasury/ALM forecasting?

30 秒版本:

Generic forecasting predicts a value. Treasury/ALM forecasting supports liquidity survival, balance-sheet strategy, rate risk, FTP incentives and contingency actions under scenarios. The unit of design is not a time series; it is forecast-to-action with assumptions, stress, limits, committee challenge and evidence.

Q3: How would you handle deposit beta and runoff assumptions?

30 秒版本:

I would reject a single bank-wide beta. I would build an assumption register by product, segment, relationship depth, concentration and rate regime, with historical evidence, uncertainty, challenger model, stress overlay and ALCO approval. The assumption is a governed object, not a hidden model parameter.

Q4: What evidence should be available for a board liquidity metric?

30 秒版本:

The board metric should trace to a metric contract, source systems, data cutoff, quality checks, model and scenario versions, assumptions, overlays, committee challenge, threshold, management action and closure status. If we cannot replay the number, it is not mature MI.

Q5: Where do NIST AI RMF, ISO 42001 and SR 11-7 fit?

30 秒版本:

I use SR 11-7 for model risk discipline: inventory, validation, monitoring, effective challenge and model uncertainty. I use NIST AI RMF for Govern/Map/Measure/Manage across AI risk, and ISO 42001 for the operating system: roles, policy, performance evaluation, internal audit and continual improvement. Exact applicability remains Legal/Compliance/Model Risk-owned.


17. Final Memory Card

Treasury AI = forecast-to-action + stress evidence + human committee.

Good architecture proves:
  what was forecast,
  why it changed,
  which scenario and assumptions were used,
  which liquidity / ALM / FTP / board metric was affected,
  who challenged it,
  what action was approved or rejected,
  and whether actual outcomes later supported the decision.