AI 底层逻辑 / 经典论文

AI Treasury / Liquidity：ALM 预测与压力证据架构

本文是学习、作品集和架构训练材料, 不构成法律意见、监管意见、流动性充足性结论、资本充足性结论、会计处理意见、模型验证报告、审计结论、投资建议、融资建议、风险偏好批准或应急融资执行建议。

610 行ai-foundations/papers/141-ai-treasury-liquidity-alm-forecasting-stress-evidence-architecture.md

AI Treasury / Liquidity / ALM Forecasting / Stress Evidence Architecture 解读

面向对象: CBAP+ Financial Retail PM / Senior BA / Treasury Product Architect / ALM Architect / Risk Data Product Lead / Model Risk Partner / Enterprise Architect / Board MI Owner。核心问题: 金融机构如何把 AI 用在 treasury、liquidity forecasting、deposit beta/runoff、cash-flow ladder、ALM、IRRBB、FTP、portfolio scenario、stress testing、contingency funding 和 board MI 中, 同时让每个预测、压力假设、人工覆盖和资金行动都可解释、可追溯、可挑战、可审计? 学习目标: 建立 treasury/ALM 预测不是 generic prediction modeling 的心智模型, 形成 forecast-to-action architecture、stress evidence graph、model risk controls、committee operating model、data lineage、board MI 和面试可表达的产品架构语言。

0. Disclaimer

正式项目必须由 Treasury、ALM、Finance、Risk、Model Risk、Legal、Compliance、Regulatory Reporting、Information Security、Data Governance、Architecture、Internal Audit、Business Line、Operations 和董事会/管理层授权委员会共同判断。具体适用范围、监管口径、报告义务、模型治理要求、数据使用限制和披露要求由 Legal / Compliance / Regulatory Affairs / Model Risk 等权责方确认。

本文把官方材料转成产品和架构训练语言。不要把任何 AI forecast、scenario output 或解释图直接视为流动性行动许可。资金调拨、批发融资、资产出售、hedging、FTP 调整、产品定价变更、客户沟通和 contingency funding action 必须走机构批准的 human committee、dual control、limit、policy 和 evidence process。

Source Anchors

访问日期: 2026-06-30。

Source	Official link	本文使用方式
Federal Reserve SR 10-6, Interagency Policy Statement on Funding and Liquidity Risk Management	用户给定路径: https://www.federalreserve.gov/supervisionreg/srletters/sr1006.htm ; Fed legacy anchor: https://www.federalreserve.gov/boarddocs/srletters/2010/sr1006.htm ; attachment PDF: https://www.federalreserve.gov/boarddocs/srletters/2010/sr1006a1.pdf	用 liquidity governance、cash-flow measurement、stress testing、management reporting、diversified funding、liquid asset cushion、contingency funding plan 和 internal controls 组织 treasury/ALM 架构。用户给定的新式路径在正式引用前需复核可访问性。
Federal Reserve SR 11-7, Guidance on Model Risk Management	https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm	用 model inventory、development、implementation、use、validation、ongoing monitoring、effective challenge 和 model uncertainty 设计 forecast / stress model risk control。
OCC Comptroller's Handbook index	https://www.occ.treas.gov/publications-and-resources/publications/comptrollers-handbook/index-comptrollers-handbook.html	作为 OCC handbook 官方入口, 用于正式项目中定位 Interest Rate Risk、Liquidity、Model Risk、Bank Supervision Process 等 booklet。
OCC Interest Rate Risk booklet bulletin	https://www.occ.treas.gov/news-issuances/bulletins/2020/bulletin-2020-26.html	用 IRR、NII、EVE、NMD assumptions、stress testing、model risk 和 FTP 的 supervisory architecture language 校准 IRRBB/ALM 讨论。
Federal Reserve SR 16-3, Interagency Guidance on Funds Transfer Pricing Related to Funding and Contingent Liquidity Risks	https://www.federalreserve.gov/supervisionreg/srletters/sr1603.pdf	用 FTP 作为把 funding risk、liquidity component、contingent liquidity risk 分配到产品/业务线/组合决策的架构锚点。
FFIEC Management booklet	https://ithandbook.ffiec.gov/it-booklets/management.aspx	用 IT management、governance、risk management、resources、performance、change 和 assurance 语言连接 treasury AI 平台的 technology control。
NIST AI RMF	https://www.nist.gov/itl/ai-risk-management-framework	用 Govern / Map / Measure / Manage 组织 AI forecast risk、explainability、monitoring、human oversight、incident 和 evidence。
ISO/IEC 42001	https://www.iso.org/standard/42001	用 AI management system、roles、operation、performance evaluation、internal audit 和 continual improvement 设计 enterprise AI treasury operating model。
Federal Register final policy statement on funding and liquidity risk management	https://www.federalregister.gov/documents/2010/03/22/2010-6137/interagency-policy-statement-on-funding-and-liquidity-risk-management	用官方发布记录交叉锚定 SR 10-6 policy statement 的 stress testing、management reporting、CFP、liquid asset cushion 等主题。

一句话:

AI Treasury Architecture is not "predict tomorrow's balance". It is a governed forecast-to-action and stress-evidence system for liquidity survival, ALM decisions, FTP incentives and board challenge.

1. Thesis

Treasury / Liquidity / ALM 场景中的 AI 预测, 不能按普通 demand forecast 处理。普通预测的中心问题是:

What value will happen next?

Treasury/ALM 的中心问题是:

Under which balance-sheet, rate, customer, market, legal-entity and operational conditions
will the institution have enough usable liquidity,
at what cost,
with what management actions,
and what evidence proves that the assumptions were understood, challenged and acted on?

成熟架构的变化是:

from: spreadsheet cash position + static ALM model + committee slide pack
to: data-lineage-backed forecast + scenario/stress engine + model-risk controls
    + ALCO decision workbench + contingency action register + board MI evidence graph

AI 的价值不是替代 Treasurer 或 ALCO。AI 的价值是把以下能力产品化:

更快识别 deposit behavior、payment flow、collateral、funding market 和 rate scenario 的变化。
把 NMD beta、runoff、cash-flow ladder、NII/EVE、FTP、portfolio mix 和 CFP trigger 放在同一个 scenario fabric 中。
用 probabilistic forecast、stress overlay、driver decomposition 和 uncertainty interval 支持行动选择。
把人工覆盖、committee challenge、限额状态、action rationale 和后验表现保存在同一条证据链。
让董事会看到的不是模型分数, 而是 liquidity position、stress survival horizon、assumption changes、control effectiveness 和 management action。

核心原则:

Treasury AI should recommend insight, not execute liquidity.
Forecast precision without action governance is operational risk.
Stress output without assumption lineage is weak evidence.
Deposit beta without segment and scenario is a dangerous scalar.
ALM model output without committee challenge is not decision architecture.
Board MI without source-to-action traceability is presentation, not oversight.

2. Treasury / ALM Domain Mental Model

Treasury/ALM 架构要同时处理四类不确定性:

Uncertainty	典型问题	AI/Architecture implication
Behavioral uncertainty	存款客户是否迁移、提现、转到高收益产品、集中流出	deposit beta/runoff 需要按 segment、relationship、rate sensitivity、concentration 和 event condition 建模
Market uncertainty	利率、yield curve、credit spread、market liquidity、collateral haircut 变化	scenario engine 必须连接 rate curves、FTP、EVE/NII、liquid asset value 和 funding cost
Operational uncertainty	payment rail、settlement、intraday flows、系统故障、现金配送和 collateral movement	cash-flow ladder 要区分 contractual、behavioral、operational 和 intraday liquidity
Governance uncertainty	管理层何时行动、能否执行、是否有 legal/entity/operational constraint	forecast-to-action 必须记录 trigger、decision right、dual control、CFP action 和 evidence closure

Treasury/ALM 的关键对象不是单一时间序列, 而是 balance-sheet graph:

customer / counterparty
  -> account / product / legal entity
  -> contractual cash flow
  -> behavioral cash flow
  -> funding source / funding need
  -> collateral / encumbrance
  -> liquidity buffer
  -> scenario / stress assumption
  -> limit / risk appetite
  -> committee decision
  -> management action
  -> board evidence

2.1 Liquidity, ALM, IRRBB, FTP 的边界

Domain	Main question	Typical outputs	AI risk if blurred
Liquidity risk	是否能满足到期和压力下资金流出	cash-flow ladder、survival horizon、liquidity buffer、CFP trigger	把盈利预测误当成流动性可用性
Treasury operations	今天/本周如何调拨和融资	cash position、intraday need、funding execution、collateral movement	AI 自动建议资金动作但缺少审批和双控
ALM	资产负债结构是否稳健	repricing gap、NII、EVE、duration、basis risk、optionality	只优化短期 NII, 忽略长期经济价值和流动性
IRRBB	利率变化如何影响 banking book earnings/economic value	NII/EAR、EVE、scenario sensitivity、NMD assumption	把 NMD assumptions 固定化, 看不到客户行为漂移
FTP	funding/liquidity cost 如何分配到产品和业务线	FTP curve、liquidity premium、contingent liquidity charge	定价激励与真实资金风险脱节
Stress testing	极端但合理情景下如何表现	idiosyncratic/market-wide/combined stress result、management action	生成压力故事但没有 action feasibility 和 evidence

PM/Architect 要把这些域连接起来, 但不能混成一个黑箱模型。

3. Why AI Changes Treasury Architecture

AI 对 treasury/ALM 的改变, 不是"更复杂模型更准"。真正变化在于: 预测、压力、解释、行动和证据的周期变短。

Pressure	旧模式	AI-enabled architecture
Deposit mobility	月度或周度看余额变动	日内/日级识别 rate-sensitive segment、large depositor concentration、digital outflow pattern
Rate regime shifts	静态 beta / average life 假设	分 regime、分产品、分客户关系、分竞争利率的 behavioral assumption engine
Stress narrative	手工定义少数情景	scenario library + driver graph + reverse stress + sensitivity sweep
ALCO pack	人工拼表和截图	metric contract + lineage + board MI data product
Forecast challenge	事后解释误差	challenger model、backtesting、interval coverage、reason code、assumption drift alert
CFP readiness	文档化计划	trigger-to-action workflow、role readiness、funding capacity evidence、dry-run record
FTP	定期调整曲线	scenario-aware FTP simulation, 将 contingent liquidity cost 反馈到产品/组合激励

AI 同时放大风险:

False comfort: 模型给出平滑 P50, 掩盖尾部流动性风险。
Procyclicality: 模型根据近期流出提高假设, 触发过度收缩或激励错配。
Explainability gap: ALCO 无法解释为什么 deposit beta 或 runoff assumption 改变。
Data leakage: 使用事后可见的余额调整、pricing campaign 或 exception data 训练当时不可见的预测。
Model misuse: 短期 cash forecast 被拿去支持长期 ALM 或 FTP 决策。
Automation bias: 委员会接受 AI scenario ranking, 没有独立 challenge。
Evidence break: 董事会看到的 survival horizon 无法追溯到数据版本、模型版本、scenario version 和人工覆盖。

高级设计的目标不是消除 judgment, 而是让 judgment 结构化、可挑战、可复盘。

4. Reference Architecture

source systems
  core deposits / loans / cards / payments / wires / ACH / treasury trades
  GL / product pricing / CRM / client concentration / collateral / securities
  market rates / yield curves / macro / competitor rates / news-event signals
  legal entity / regulatory reporting / limits / committee records

-> as-of data product layer
     balance snapshots
     cash-flow events
     deposit behavior features
     rate and market data
     collateral and encumbrance
     customer / segment / legal-entity mapping
     data quality and lineage

-> AI and analytics layer
     baseline forecasts
     deposit beta and runoff models
     cash-flow forecasting
     anomaly / early-warning signals
     scenario generator and stress overlays
     challenger models
     explanation and driver decomposition

-> treasury risk engines
     liquidity ladder
     stress survival horizon
     ALM / IRRBB NII and EVE engines
     FTP simulation
     portfolio scenario engine
     contingency funding capacity

-> decision layer
     ALCO / Treasury Risk workbench
     limit and trigger monitor
     action recommendation with constraints
     human committee challenge
     dual-control action registration

-> evidence and MI layer
     forecast contract
     model inventory
     scenario version
     data lineage
     override record
     action log
     board MI pack
     audit evidence export

4.1 Architecture Capabilities

Capability	Must do	Failure mode if missing
Forecast contract registry	记录 grain、horizon、cadence、decision use、owner、allowed action	同一预测被拿去支持不兼容决策
As-of data lineage	区分 event time、available time、restatement、cutoff、source owner	训练和回测发生 leakage, 董事会数字无法复算
Deposit behavior feature store	管理 rate offer、relationship depth、segment、concentration、digital behavior、large depositor flags	beta/runoff 退化为粗糙平均值
Scenario library	管理 base、idiosyncratic、market-wide、combined、reverse stress、operational stress	压力测试变成一次性故事
ALM/IRRBB integration	把 cash-flow assumption、NMD assumption、prepayment、rate curve 和 NII/EVE engine 对齐	liquidity forecast 与 ALM 结果互相矛盾
FTP simulation	将 funding cost、liquidity premium、contingent liquidity cost 分配到产品/业务线	业务线指标鼓励消耗稳定性和尾部流动性
Human committee workflow	记录 ALCO challenge、覆盖理由、批准人、action boundary	AI 输出成为事实标准, 有效挑战缺失
Evidence ledger	串联 data version、model version、scenario、output、override、decision、action、MI	审计或复盘时只能解释 PPT, 不能解释事实链
Board MI data product	从 metric contracts 生成 liquidity / ALM / stress / action 指标	董事会看到故事, 看不到 threshold、owner 和 closure

4.2 Canonical Data Entities

Entity	Minimum fields
Deposit account	account_id, product_type, customer_id, segment, legal_entity, insured_flag_or_bucket, balance, rate_paid, repricing_terms, relationship_tenure
Deposit behavior event	event_id, account_id, event_time, available_time, inflow_outflow, transfer_channel, pricing_event_id, balance_after, event_reason_code_if_available
Customer / counterparty concentration	customer_id, relationship_group, segment, aggregate_balance, top_depositor_rank, operational_relationship_indicators
Cash-flow event	cashflow_id, source_system, contractual_date, expected_date, amount, currency, legal_entity, product, confidence_band, behavioral_flag
Market and rate state	curve_id, rate_type, tenor, value, as_of_time, source, scenario_applicability
Scenario	scenario_id, type, horizon, rate_path, runoff_shock, wholesale_funding_shock, collateral_haircut, operational_assumption, approval_status
Model output	output_id, model_id, model_version, forecast_contract_id, scenario_id, horizon, quantile, value, confidence, explanation_ref
Treasury action	action_id, trigger, proposed_action, owner, approval_forum, constraint_check, execution_status, evidence_ref
Committee decision	decision_id, forum, date, challenge_points, approved_action, override_reason, residual_risk_owner
Board MI metric	metric_id, definition_version, source_lineage, threshold, status, action_link, reporting_period

5. Forecast Contracts: 预测必须绑定决策

Treasury AI 的第一个产品产物不是模型, 而是 Forecast Contract。

Field	Treasury/ALM definition
Decision use	cash positioning、liquidity buffer sizing、ALCO monitoring、FTP update、pricing action、portfolio rebalance、CFP trigger
Grain	account / customer group / product / branch / legal entity / currency / business line / day or intraday bucket
Horizon	intraday、1-7 days、30 days、90 days、1 year、multi-year ALM horizon
Output	point, quantile, interval, scenario distribution, stress survival horizon, NII/EVE impact
Action boundary	read-only alert、committee review、pricing change proposal、funding execution request、CFP activation recommendation
Required explainability	driver decomposition、segment contribution、rate sensitivity、scenario assumption lineage、uncertainty
Owner	Treasury owner、ALM owner、model owner、data owner、business line owner
Evidence	data cutoff、model version、feature version、scenario version、backtest、override、committee minutes
Prohibited use	用短期 operational forecast 替代 regulatory reporting、capital plan 或正式 liquidity adequacy conclusion

示例:

Forecast contract: retail NMD runoff under rate-up and confidence shock
grain: day x product x segment x legal entity
horizon: 30 / 90 days
outputs: P50/P90 runoff, beta estimate, segment contribution, top depositor concentration effect
decision use: ALCO review, liquidity buffer monitoring, FTP sensitivity, pricing strategy challenge
action boundary: no automatic funding action; red status triggers ALCO/Treasury Risk review
evidence: as-of deposit data, pricing offers, competitor rate feed, model version, scenario version, overlay reason

6. Deposit Beta And Runoff Architecture

Deposit beta 不是一个全行常数。Runoff 也不是余额时间序列外推。更成熟的模型对象是:

deposit behavior under a specified rate, confidence, relationship and liquidity scenario

6.1 Segmentation Logic

Dimension	Why it matters	Architecture note
Product type	checking、savings、money market、CD、brokered、sweep 行为不同	product taxonomy 必须与 core banking and ALM mapping 对齐
Customer segment	retail、SMB、commercial、municipal、wealth、digital-only	segment 变化本身要有 lineage, 不能事后重分群污染回测
Relationship depth	payroll、loan、card、merchant、treasury management、direct deposit	relationship features 要能解释 stickiness, 不只是余额规模
Balance concentration	large depositor / top-N / uninsured-like concentration	concentration changes 要进入 early warning and board MI
Rate sensitivity	paid rate、offer rate、competitor spread、repricing delay	beta 应按 regime、product、segment 和 pricing action 估计
Digital mobility	online transfer behavior、external linked account、recent app activity	可能是 runoff acceleration signal, 但要注意数据使用边界
Event exposure	news、social、branch closure、service outage、downgrade、local stress	event features 只作为 risk signal, 不应直接替代 committee judgment

6.2 Beta / Runoff Model Portfolio

Model type	Best use	Control requirement
Rule / expert baseline	快速解释、低数据区间、model challenger	必须保留作为 honest baseline
Elasticity / regression model	估计 rate paid 与 market rate movement 的 beta	检查 regime shift、multicollinearity、pricing campaign leakage
Survival / hazard model	账户或客户关系层面 runoff timing	reason code、censoring、cohort drift 要明确
Gradient boosting	非线性 segment behavior and interaction	feature availability and monotonic sanity checks
Hierarchical Bayesian / pooled model	多 segment、低样本、共享信息	prior rationale、credible interval、sensitivity
Probabilistic time-series model	多 horizon balance and flow forecast	interval coverage、tail calibration、scenario conditioning
Anomaly / early warning	非典型流出、集中提款、渠道异常	false positive process, escalation threshold
LLM / NLP signal classifier	新闻、客户留言、call center notes 中识别 confidence stress signal	不应直接驱动资金动作; 需要 source grounding、human review、privacy/security control

6.3 设计原则

Weak pattern	Better pattern
"全行 NMD beta = 35%"	beta by product x segment x scenario x rate regime, with uncertainty
用月末余额训练日级流出	使用 daily/intraday flow events, 区分 event time and availability
只看账户余额	加入 customer group concentration and relationship stickiness
用近期平稳期回测	加入 rate shock, confidence event, competitor rate gap, out-of-sample stress windows
用单模型给 ALCO	baseline + AI model + challenger + overlay + committee challenge
SHAP 排名当解释	driver decomposition + segment contribution + scenario assumption + data lineage + action implication

7. Cash-Flow Forecasting And Liquidity Ladder

Cash-flow architecture 要同时区分四种 cash flow:

Cash-flow type	Examples	Modeling note
Contractual	loan maturities, security coupons, wholesale funding maturity, CD maturity	dates and amounts are known but settlement and optionality still matter
Behavioral	NMD runoff, loan prepayment, drawdowns, card/payment behavior	AI 可增强, 但必须带 uncertainty and scenario
Operational	ACH/wire/card settlement, payroll, tax dates, branch cash, ATM cash, merchant settlement	日历、cutoff、holiday、payment rail failure 是关键
Contingent	credit line draws, collateral calls, margin, unfunded commitments, liquidity facilities	与 stress scenario and FTP contingent liquidity cost 绑定

Liquidity ladder 不应只展示 net cumulative cash flow。它至少要展示:

opening liquidity
+ expected inflows
- expected outflows
+ usable liquidity sources
- encumbered / trapped / operationally unavailable liquidity
- stress runoff and contingent outflows
= survival horizon and buffer surplus / shortfall

7.1 Liquidity Ladder Design Table

Layer	Required fields	Board / ALCO question
Opening position	cash, reserves, HQLA-like buffer, collateral, currency, legal entity	现在可用流动性在哪里, 是否可转移?
Expected flows	inflow/outflow by product, payment rail, business line, date bucket	哪些流入/流出是高置信度?
Behavioral overlay	P50/P90 runoff, confidence shock, top depositor movement	行为假设变化来自哪里?
Contingent flows	committed lines, collateral calls, wholesale rollover, undrawn commitments	压力下哪些承诺会变成现金需求?
Liquidity sources	marketable securities, secured borrowing, central bank/FHLB-like lines, cash actions	可执行容量是否已测试?
Constraints	legal entity, currency, operational, settlement, collateral eligibility, policy limit	账面 liquidity 是否真正可用?
Actions	pricing, funding, asset sale, collateral movement, CFP activation	谁批准, 多快执行, 证据是什么?

8. ALM / IRRBB Integration

AI treasury forecast 必须进入 ALM/IRRBB 架构, 但不能吞掉 ALM discipline。

8.1 AI 在 ALM 中的合理角色

Role	Example	Boundary
Assumption challenger	识别 NMD average life、repricing rate、prepayment、deposit migration 的漂移	challenger 不是自动替代 approved ALM assumption
Scenario expander	生成 rate path、spread widening、customer behavior、portfolio mix 的组合情景	scenario 必须可命名、可复现、可批准
Sensitivity analyzer	解释 NII/EVE 对 beta、runoff、prepayment、basis risk 的敏感性	不应只输出 single best scenario
Data quality monitor	检查 ALM input completeness、stale curves、mapping breaks	quality gate 可以阻止 MI 发布
Narrative assistant	生成 ALCO pack 初稿、解释变动	必须 source-grounded and reviewed

8.2 NII / EVE / Liquidity 的组合视角

Decision conflict	Example	Architecture response
NII vs liquidity	提高存款利率保留资金, 短期压缩 NIM	scenario shows liquidity survival benefit and FTP cost allocation
EVE vs NII	长久期资产提升收益但加大经济价值敏感性	ALM engine must show NII and EVE side by side
Growth vs contingent liquidity	大量未用授信承诺带来收入但压力下 drawdown	FTP contingent liquidity charge and stress draw assumption
Centralized liquidity vs legal entity constraint	集团流动性充裕但某实体受限制	legal-entity liquidity ladder and transfer constraint
Customer retention vs pricing discipline	高收益促销留存 rate-sensitive balances	product profitability includes funding cost and runoff probability

8.3 IRRBB Assumption Register

Assumption	AI support	Evidence needed
NMD beta	estimate by product/segment/regime	historical fit, competitor rate context, uncertainty, committee approval
NMD average life	runoff/survival analysis	cohort behavior, stress sensitivity, backtest
Loan prepayment	borrower behavior and rate incentive	model validation, economic logic, scenario response
Deposit migration	checking to MMDA/CD/brokerage/high-yield movement	migration matrix, pricing campaign data, customer segment
Non-parallel rate shock	curve scenario library	curve source, scenario definition, NII/EVE impact
Basis risk	mismatch between product rate and market curve	mapping, spread behavior, sensitivity

9. FTP And Portfolio Scenario Architecture

FTP 是把 Treasury insight 变成业务激励的桥。没有 FTP, AI liquidity forecast 容易停留在风险报告; 有错误 FTP, 业务线会被错误激励。

9.1 FTP Decision Table

Business activity	Funding / liquidity risk	FTP implication	Evidence
Long-term fixed-rate lending	term funding, interest-rate mismatch	term liquidity cost and hedge/funding component	curve version, duration, liquidity premium
Transaction deposits	stable operational funding benefit	funding benefit only if stability evidence exists	beta/runoff assumptions, relationship metrics
High-yield promotional deposit	rate-sensitive and runoff risk	lower stability credit, higher liquidity charge	campaign cohort, competitor spread, attrition behavior
Credit commitments	contingent liquidity need	contingent liquidity charge	drawdown stress assumption, utilization history
Wealth / brokerage sweep	fast migration risk	scenario-specific runoff charge	channel mobility, top depositor concentration
Trading or securities portfolio	market liquidity and collateral haircut	liquidity buffer and haircut cost	market depth, encumbrance, haircut scenario

9.2 Portfolio Scenario Loop

business growth plan
  -> forecast deposit / loan / commitment / securities trajectory
  -> apply rate and liquidity scenarios
  -> calculate liquidity ladder, NII, EVE, FTP charges
  -> identify incentive mismatch and limit pressure
  -> propose portfolio / pricing / funding actions
  -> committee challenge and approval
  -> monitor actual behavior against scenario

PM implication:

The product manager for treasury AI is not optimizing model accuracy. They are aligning product growth, funding cost, contingent liquidity, ALM risk and management action through a traceable scenario loop.

10. Stress Evidence Architecture

Stress testing 的架构价值在于证据链, 不是情景数量。

10.1 Scenario Taxonomy

Scenario type	Example	Required evidence
Base forecast	normal balance sheet and rate path	forecast contract, data cutoff, model version
Idiosyncratic liquidity stress	institution-specific confidence event, concentrated depositor runoff	trigger definition, runoff assumption, top depositor exposure
Market-wide stress	rate shock, funding market freeze, collateral haircut, spread widening	market scenario source, curve/haircut mapping
Combined stress	confidence event plus market stress	correlation assumption, management action capacity
Reverse stress	找到 survival horizon 破裂条件	failure threshold, scenario path, assumptions
Operational stress	payment rail outage, collateral movement delay, treasury system outage	operational dependency, BCP/DR evidence
Intraday stress	unexpected payment outflows before funding source available	intraday data, cutoff, liquidity buffer

10.2 Stress Evidence Graph

scenario_id
  -> assumption_id
       -> source_data_version
       -> expert_judgment_record
       -> model_output_id
       -> overlay_id
  -> liquidity_ladder_result
  -> ALM / IRRBB result
  -> FTP / portfolio result
  -> limit status
  -> committee challenge
  -> management action
  -> board MI metric
  -> post-event backtest / lessons learned

10.3 Evidence Pack

Evidence artifact	Purpose
Forecast Contract	证明预测用途、边界、粒度、horizon 和 owner
Scenario Definition Card	证明压力情景参数、来源、版本和批准状态
Assumption Register	证明 beta、runoff、drawdown、prepayment、haircut、rollover 假设有 owner and rationale
Data Lineage Snapshot	证明使用的 as-of data、quality checks、source systems and cutoff
Model Card / Validation Summary	证明模型用途、限制、backtesting、challenger、monitoring and known limitations
Overlay Record	证明人工调整原因、批准人、影响和有效期
Committee Challenge Record	证明 ALCO / Treasury Risk 提问、反驳、决定和 residual risk owner
Action Register	证明采取或未采取行动的原因、批准、执行状态和 closure
Board MI Tile	证明董事会看到的数字来自同一 evidence graph

11. Explainability: 从"模型解释"到"行动解释"

Treasury/ALM explainability 不是给 SHAP chart。它要回答六个行动问题:

为什么 liquidity outlook 变差?
哪些 segment、product、legal entity、currency、payment rail 或 depositor group 贡献最大?
变化来自真实行为、市场利率、模型版本、数据质量、场景参数还是人工覆盖?
这个变化对 cash position、survival horizon、NII、EVE、FTP 和 limits 的影响分别是什么?
哪些管理行动可用, 每个行动的速度、成本、约束和副作用是什么?
委员会为什么接受、调整或拒绝 AI 建议?

11.1 Explainability Layers

Layer	Question	Output
Data explanation	Which source changed?	source delta, data quality status, cutoff
Behavior explanation	Which customers/products moved?	segment contribution, concentration, migration matrix
Scenario explanation	Which assumption drives stress result?	sensitivity tornado, scenario delta
Model explanation	Why did forecast model change?	feature contribution, model drift, challenger comparison
ALM explanation	How does it affect NII/EVE?	NII/EVE bridge and assumption impact
Action explanation	What should management consider?	action options, constraints, owner, residual risk
Governance explanation	Who challenged and approved?	committee record and override rationale

Memory rule:

Explainability for treasury AI = why the number changed + why the action is or is not justified.

12. Human Committee And Decision Rights

Treasury AI 要嵌入 committee operating model, 不是绕过它。

Forum / Role	Decision rights	AI support	Evidence
Treasury desk	daily positioning, funding execution within approved limits	cash forecast, intraday alert, liquidity source view	execution log, limit check
ALCO	balance-sheet strategy, liquidity/IRR appetite, FTP review	scenario package, NII/EVE sensitivity, deposit behavior trends	minutes, decision record, action owner
Treasury Risk / Market Risk	independent challenge of assumptions and limits	challenger views, limit breaches, stress analysis	challenge log, issue register
Model Risk	model inventory, validation, monitoring, model limitations	validation dashboard, drift and backtest evidence	validation report, issue closure
Finance / FP&A	plan, NIM, funding cost, portfolio performance	forecast-to-plan bridge, FTP impacts	finance reconciliation
Legal / Compliance	applicability, disclosure, customer communication, regulatory interpretation	source-grounded evidence bundle	legal/compliance-owned decision record
Internal Audit	test control design and operation	evidence export, lineage sample	audit test result
Board / Risk Committee	oversight, risk appetite, challenge, material action awareness	board MI, thresholds, stress survival, management actions	board pack and minutes

Decision boundary:

AI can detect, forecast, explain, simulate, draft and recommend.
AI should not independently approve liquidity risk appetite, execute emergency funding,
change FTP policy, sell assets, alter customer pricing, or declare regulatory adequacy.

13. Board MI Architecture

Board MI 必须从 production facts and evidence 生成, 不是 quarterly manual slide assembly。

13.1 Board Metrics

Category	Example metrics	Decision use
Liquidity position	liquidity buffer, usable liquidity by legal entity/currency, encumbrance	是否在 appetite 内
Survival horizon	base / idiosyncratic / market-wide / combined stress days	是否触发 management action
Deposit behavior	beta by segment, runoff by segment, top depositor concentration, rate-sensitive balance	是否需要 pricing/funding/portfolio action
Cash-flow reliability	forecast error by horizon, interval coverage, data freshness	是否相信 forecast-to-action
ALM / IRRBB	NII/EVE sensitivity, NMD assumption changes, basis risk	是否调整 balance sheet or hedge strategy
FTP / incentives	FTP charge movement, products below funding cost, contingent liquidity cost	是否调整产品增长激励
CFP readiness	tested funding capacity, action execution time, dry-run results	是否能执行应急计划
Governance	model validation status, issue aging, override rate, evidence completeness	控制是否有效
Management action	red/amber actions, overdue remediation, residual risk acceptance	管理层是否行动

13.2 Board Tile Example

Decision topic: Retail deposit runoff and liquidity buffer under 90-day combined stress

Status: Amber
Driver: digital savings and money-market deposit runoff P90 increased by 18% vs last quarter.
Evidence: competitor-rate spread widened; top-50 depositor concentration unchanged; model challenger agrees directionally but lower magnitude.
Impact: survival horizon down from 54 to 43 days under combined stress; NII downside increases under high-rate retention strategy.
Management action: ALCO approved targeted retention pricing for operational relationship segments, no broad promotional campaign; FTP charge updated for high-rate promotional balances.
Controls: model limitation noted for new digital-only cohort; human overlay approved for 60 days; validation backtest scheduled after next month-end actuals.
Decision requested: Board Risk Committee to note amber status and challenge management on CFP dry-run completion.

14. PM / BA / Architect Implications

Role	What "good" looks like
PM	Defines decision products: ALCO workbench, liquidity forecast contract, stress evidence pack, board MI, not "AI prediction dashboard".
BA	Captures treasury rules as metric contracts, scenario definitions, action boundaries, exception flows, committee decisions and evidence requirements.
Architect	Builds lineage-first data products, scenario services, model registry, action workflow, evidence ledger and integration with ALM/FTP engines.
Data Product Owner	Owns as-of data quality, cash-flow event model, deposit behavior features, curve data, product taxonomy and legal-entity mapping.
Model Risk Partner	Ensures inventory, independent validation, challenger models, ongoing monitoring, limitations, issue closure and effective challenge.
Treasury / ALM SME	Owns assumptions, management actions, risk appetite interpretation, CFP feasibility and ALCO narrative.

Minimum product artifacts:

Artifact	Purpose
Treasury Forecast Contract Catalog	用途、粒度、horizon、action boundary
Cash-Flow Event Model	统一 contractual / behavioral / operational / contingent flow
NMD Assumption Register	beta、runoff、average life、repricing by segment
Scenario Library	base, stress, reverse stress, operational and intraday scenario
ALCO Decision Workbench Spec	limit, scenario, explanation, committee challenge, action
Evidence Graph Schema	data -> model -> scenario -> decision -> action -> MI
Board MI Metric Contracts	定义、阈值、owner、lineage、cadence
Model Risk Pack	model card、validation、backtest、monitoring、limitations

15. Anti-Patterns

Anti-pattern	Why dangerous	Better practice
Generic balance forecast dashboard	预测没有绑定资金动作和风险偏好	Forecast Contract + Action Boundary
One beta for all deposits	掩盖产品、客户、rate regime 和 concentration 差异	Segment/scenario beta with uncertainty
Accuracy-only model selection	P50 准不代表压力尾部可用	interval coverage, tail stress, challenger and judgment overlay
ALCO pack assembled manually	数字不可复算, 证据断裂	MI data product and evidence ledger
LLM writes stress narrative from ungrounded context	生成漂亮但不可审计的解释	source-grounded narrative with cited data lineage
Stress scenario without feasible action	只证明会出问题, 不证明能响应	trigger-to-action workflow and CFP dry-run evidence
FTP not connected to runoff/liquidity forecast	业务线继续追逐错误激励	scenario-aware FTP and portfolio feedback
Model Risk only at annual review	treasury risk changes faster than review cadence	ongoing monitoring, drift, issue and override dashboards
Board sees only status color	无法挑战原因、阈值和行动	driver bridge, threshold, owner, action and residual risk

16. Interview-Ready Language

Q1: How would you design AI for treasury liquidity forecasting without making it a black-box model?

30 秒版本:

I would not start with a model. I would start with forecast contracts tied to treasury decisions: daily liquidity positioning, ALCO monitoring, CFP triggers, FTP sensitivity and board MI. The architecture needs as-of data lineage, deposit behavior features, scenario library, baseline plus challenger models, ALM/IRRBB integration, human committee workflow and an evidence ledger that links data, assumptions, outputs, overrides and actions.

2 分钟版本:

In treasury, the risk is not just forecast error; it is using a forecast for the wrong decision. I would separate cash-flow forecasting, deposit beta/runoff, ALM/IRRBB assumptions, FTP and stress testing into connected but governed services. Each forecast has a contract: grain, horizon, decision use, allowed action, explainability and evidence. For deposit behavior, I would model beta and runoff by product, segment, relationship, concentration and rate regime, with uncertainty and challenger models. The output feeds a liquidity ladder, ALM NII/EVE engine, FTP simulation and ALCO workbench. No AI output directly executes funding or pricing actions; committee challenge, dual control, limit checks and evidence capture are part of the product.

Q2: What is the difference between generic forecasting and treasury/ALM forecasting?

30 秒版本:

Generic forecasting predicts a value. Treasury/ALM forecasting supports liquidity survival, balance-sheet strategy, rate risk, FTP incentives and contingency actions under scenarios. The unit of design is not a time series; it is forecast-to-action with assumptions, stress, limits, committee challenge and evidence.

Q3: How would you handle deposit beta and runoff assumptions?

30 秒版本:

I would reject a single bank-wide beta. I would build an assumption register by product, segment, relationship depth, concentration and rate regime, with historical evidence, uncertainty, challenger model, stress overlay and ALCO approval. The assumption is a governed object, not a hidden model parameter.

Q4: What evidence should be available for a board liquidity metric?

30 秒版本:

The board metric should trace to a metric contract, source systems, data cutoff, quality checks, model and scenario versions, assumptions, overlays, committee challenge, threshold, management action and closure status. If we cannot replay the number, it is not mature MI.

Q5: Where do NIST AI RMF, ISO 42001 and SR 11-7 fit?

30 秒版本:

I use SR 11-7 for model risk discipline: inventory, validation, monitoring, effective challenge and model uncertainty. I use NIST AI RMF for Govern/Map/Measure/Manage across AI risk, and ISO 42001 for the operating system: roles, policy, performance evaluation, internal audit and continual improvement. Exact applicability remains Legal/Compliance/Model Risk-owned.

17. Final Memory Card

Treasury AI = forecast-to-action + stress evidence + human committee.

Good architecture proves:
  what was forecast,
  why it changed,
  which scenario and assumptions were used,
  which liquidity / ALM / FTP / board metric was affected,
  who challenged it,
  what action was approved or rejected,
  and whether actual outcomes later supported the decision.