AI 扩展计划 / Playbooks

AI Payment Operations / Reconciliation / Settlement Exception Playbook

本文是学习、作品集和架构训练材料, 不构成法律意见、合规结论、支付网络规则解释、消费者保护建议、会计意见、审计结论、流动性建议、资金执行建议或机构操作指令。

645 行AI_PAYMENT_OPERATIONS_RECONCILIATION_SETTLEMENT_EXCEPTION_PLAYBOOK.md

AI Payment Operations / Reconciliation / Settlement Exception Architecture Playbook

适用对象: CBAP-level Financial Retail PM / Senior BA / Payment Operations Product Owner / Core Banking Architect / Settlement Ops Lead / Finance Control / Treasury Ops / AI Governance / Operational Risk / Internal Audit。目标: 把 AI 用于 payment processing、reconciliation、settlement exception、repair queue、suspense account、cash application、ledger break、cut-off SLA、dual control、evidence 和 incident response 的可运营、可审计架构。核心观点: Payment ops AI 的成熟交付物不是“自动对账率”, 而是 file-core-GL-cash-evidence 的 controlled exception operating system。

0. Boundary And Disclaimer

正式项目中的 rule applicability、rail-specific requirements、return/reversal treatment、Reg E boundary、customer communication、accounting treatment、capital/liquidity impact、regulatory reporting 和 evidence retention 必须由 Legal、Compliance、Payments Rules Owner、Finance、Treasury、Operational Risk、Model Risk、Internal Audit 和业务负责人确认。

本文刻意区分:

payment operations exceptions: file, posting, settlement, cash, GL, repair, suspense, SLA, evidence。
payment disputes / chargebacks / scam claims: customer assertion, liability, provisional credit, network claim path, complaint response。

二者会共享交易和证据, 但不应共享同一个未分层的 AI workflow。

1. Executive Framing

弱项目定义:

Use AI to reconcile payments faster.

成熟项目定义:

Build a governed payment operations control plane that detects,
classifies, prioritizes, repairs and evidences payment exceptions
across rail files, core posting, settlement cash, GL, suspense and downstream reporting.

Executive one-liner:

This is a ledger, cash and operations control product with AI assistance; not a payment chatbot.

高管问题集:

Question	Good answer requires
哪些 payment exceptions 正在威胁 cut-off、GL close 或客户资金可用性	event graph, queue SLA, materiality and customer impact
哪些 break 是 file issue、posting issue、settlement issue、GL issue 或 evidence issue	exception taxonomy and reconciliation layer
AI 推荐了什么, 人类批准了什么, 系统实际改了什么	AI trace, maker-checker, tool gateway and after-state
Suspense 为什么增长, 谁负责, 多久能清	suspense aging, reason code, owner, action plan
结算异常是否影响 liquidity forecast	treasury signal, scenario update, human action record
审计能否重放一笔异常从文件到总账再到现金	immutable evidence ledger and lineage export

2. Source Anchors

访问日期: 2026-06-30。以下来源作为 source anchors; 不替代机构 policy、network rules、contract terms 或 counsel/compliance interpretation。

Anchor	Official link	本 playbook 使用方式
FFIEC Retail Payment Systems booklet	https://ithandbook.ffiec.gov/it-booklets/retail-payment-systems.aspx	payment instruments、clearing/settlement、ACH/card/check/P2P、retail payment risk and controls
FFIEC Wholesale Payment Systems booklet	https://ithandbook.ffiec.gov/it-booklets/wholesale-payment-systems.aspx	wire/interbank messaging、settlement、resiliency、wholesale payment risk、Nostro/Vostro control language
Federal Reserve Financial Services Operating Circulars	https://www.frbservices.org/resources/rules-regulations/operating-circulars.html	OC 4/FedACH、OC 6/Fedwire Funds、OC 8/FedNow、OC 12/National Settlement Service 的 rule catalog entry point
FedACH Processing Schedule	https://www.frbservices.org/resources/resource-centers/same-day-ach/fedach-processing-schedule.html	cut-off and settlement schedule design; window data belongs in calendar service
Nacha Operating Rules resources	https://www.nacha.org/newrules	ACH rule changes, risk management topics and return/reversal watch process
Nacha Operations Bulletins	https://www.nacha.org/news/operations-bulletins	ACH operations changes and implementation signal for ops readiness
CFPB Regulation E	https://www.consumerfinance.gov/rules-policy/regulations/1005/	consumer EFT/remittance/error-resolution boundary marker, not an automated conclusion engine
NIST AI RMF	https://www.nist.gov/itl/ai-risk-management-framework	Govern / Map / Measure / Manage for AI risk, monitoring, controls and improvement
ISO/IEC 42001	https://www.iso.org/standard/81230.html	AI management system: roles, operation planning, performance evaluation, internal audit and continual improvement

3. Operating Principles

Principle	Practical meaning
Ledgers before language	reconciliation math, control totals, GL/cash balance and file integrity outrank generated narrative
Evidence before repair	every repair needs source artifacts, before/after state and approval evidence
Calendars are controls	cut-off, settlement windows, return windows and GL close calendars are governed data
AI advises, workflow controls	AI can recommend; state changes go through policy engine, tool gateway and maker-checker
Suspense is temporary control	suspense aging requires owner, reason, materiality and release evidence
Customer impact is explicit	delayed posting, wrong posting, fees, statement errors and availability are assessed, not assumed away
Finance and Ops co-own breaks	Ops repairs operational facts; Finance controls GL, suspense, materiality and close
Treasury signals are not actions	settlement variance can inform liquidity forecast but cannot auto-execute funding decisions
Audit replay is a product feature	file, event, AI run, human decision, journal and cash evidence must be reconstructable

4. Target Reference Architecture

1. Source ingestion
   ACH files, wire messages, card processor files, core posting reports,
   GL journals, settlement statements, Fed/correspondent statements,
   remittance advice, return/reversal files, downstream report feeds

2. Control and lineage layer
   file manifest service
   payment event graph
   rail calendar service
   rule catalog
   data quality and reconciliation controls
   evidence ledger

3. Reconciliation engines
   file-to-file
   file-to-core
   core-to-GL
   GL-to-cash
   Nostro/Vostro matching
   suspense aging and cash application

4. AI intelligence layer
   anomaly detection
   exception classification
   candidate matching
   root-cause summarization
   SLA prioritization
   evidence pack drafting
   liquidity signal explanation

5. Operations workbench
   repair queue
   settlement exception queue
   suspense release queue
   cash application queue
   maker-checker approvals
   customer-impact review
   finance close command center

6. Action and reporting layer
   controlled core/GL/payment tool gateway
   incident runbook
   dashboards and KRIs
   downstream report impact notice
   audit export

Architecture rule:

If an AI recommendation can affect posting, settlement, suspense, GL,
customer funds availability or downstream reporting, it needs a governed action path.

5. Core Data Products

Data product	Grain	Owner	Evidence
File manifest	file_id + direction + rail + sequence	Payment Tech Ops	hash, count, total, timestamp, source acknowledgment
Payment event graph	payment_instruction_id / trace_id	Payment Architecture	lineage across instruction, file, posting, settlement, return, GL
Rail calendar	rail + product + window + effective date	Payments Rules Owner	source link, approved version, change log
Reconciliation fact	recon_run_id + item_id + layer	Settlement Ops + Finance	matching rule, candidate set, result, exception id
Exception case	case_id	Operations	taxonomy, materiality, SLA, owner, status
Suspense ledger	suspense_item_id	Finance Control	reason, aging, amount, release action, journal link
Cash application fact	cash_line_id + candidate account/invoice	Cash App Ops	remittance extraction, confidence, approval
AI run ledger	ai_run_id	AI Product / Model Risk	prompt/model/source versions, output, confidence, reviewer action
Customer impact record	impact_id	Product Ops / Customer Ops	affected customers, funds/fees/statement impact, remediation route
Audit replay package	sample_id / incident_id	Internal Audit / Control Testing	immutable artifacts and lineage export

6. Exception Taxonomy And Routing

Exception type	Detection signal	Queue	SLA driver	Closure evidence
Missing file	expected file not received by calendar threshold	Payment Tech Ops	cut-off proximity	source inquiry, late/missing status, contingency action
Duplicate file	repeated hash/sequence/control total	Payment Tech Ops	duplicate posting risk	block/release decision, duplicate control proof
Control total mismatch	file totals differ from header/trailer/core	Payment Ops	posting integrity	corrected file or accepted variance evidence
Posting reject	core non-post report	Core Ops repair	customer funds / return window	repaired posting or return path
Settlement variance	expected vs actual settlement cash mismatch	Settlement Ops	amount/materiality/close	matched statement line or variance approval
Return/reversal exception	unmatched or late return/reversal	Rail Ops	rail window / customer impact	original event linkage and owner disposition
Suspense aging	item exceeds age/materiality threshold	Finance Control	close calendar / stale risk	approved release, journal, residual acceptance
Cash application unknown	incoming cash lacks reliable reference	Cash App Ops	customer/account impact	candidate approval and application journal
Nostro/Vostro break	correspondent statement unmatched	Treasury Ops	value date / currency / amount	statement match or investigated residual
Downstream report mismatch	report feed diverges from reconciled source	Data Reporting	filing/MI deadline	correction, report owner signoff

Routing priorities:

Priority	Criteria	Action
P0	material cash/GL break near cut-off or close, customer funds at risk, duplicate posting risk	command center and stop downstream action
P1	high-value settlement variance, unresolved suspense aging, rail return window pressure	senior queue and same-day escalation
P2	standard posting rejects and cash application candidates	normal SLA with sampling
P3	low-value aged research and trend samples	batch handling and QA
P4	training, taxonomy, model improvement samples	scheduled calibration

7. Decision Gates

Gate 0: Use-case eligibility

Question	Pass condition	Evidence
Which reconciliation layer is affected?	file, core, GL, cash, suspense or reporting layer named	use-case card
Can AI output affect ledger, cash, customer or report state?	impact tier assigned	risk tier record
Can deterministic controls solve this without AI?	alternative recorded	architecture decision record
Is rule applicability involved?	rule owner mapped	rule catalog reference
Is human capacity available?	queue and reviewer pool defined	operations readiness record

Gate 1: Source and file control

Question	Pass condition	Evidence
Did expected files/messages arrive?	calendar expectation checked	file manifest
Are sequence, hash, item count and control total valid?	validation pass or exception created	file validation report
Is event time separate from available time?	both timestamps captured	lineage record
Are rejected records preserved?	reject file linked	reject evidence
Are upstream changes known?	change record linked	release/change ticket

Gate 2: Reconciliation match quality

Question	Pass condition	Evidence
Which match strategy was used?	exact, tolerance, probabilistic or manual selected	recon run metadata
Are candidate matches visible?	ranked candidates shown, not hidden	candidate set
Is tolerance approved?	tolerance id and owner captured	tolerance catalog
Does materiality require review?	threshold applied	materiality rule
Can false match cause customer/GL harm?	harm assessment complete	risk note

Gate 3: Queue and SLA routing

Question	Pass condition	Evidence
Is exception taxonomy valid?	versioned reason code assigned	taxonomy record
Is cut-off or close calendar pressure active?	SLA clock created	clock id
Is the queue owner accountable?	owner accepted or escalated	assignment log
Are skills and authority sufficient?	reviewer role matches action type	entitlement record
Are aging thresholds monitored?	dashboard alert active	KRI record

Gate 4: Repair action approval

Question	Pass condition	Evidence
What state will change?	before/after state displayed	action preview
Does action touch core, GL, suspense, cash or customer communication?	action type classified	tool gateway decision
Is maker-checker required?	threshold and SoD rule applied	approval token
Are source facts cited?	evidence ids attached	evidence manifest
Is rollback or correction path defined?	recovery option documented	action record

Gate 5: Customer impact and downstream reporting

Question	Pass condition	Evidence
Could posting delay affect funds, fees, statement, balance or notices?	customer impact assessment completed	impact record
Does downstream MI/regulatory/report feed use affected data?	report owner notification rule checked	impact lineage
Is customer remediation route separate from ops repair?	customer ops owner mapped	remediation route
Is Reg E or other customer rule boundary possible?	Legal/Compliance route available	boundary flag
Are communications approved?	template or owner signoff	final message record

Gate 6: Finance close and residual break

Question	Pass condition	Evidence
Is the break material for close?	Finance materiality applied	close control record
Is suspense release justified?	release evidence and journal link	suspense release pack
Is residual break accepted?	accountable owner and expiry captured	residual break record
Are recurring breaks tracked?	RCA and CAPA created	issue record
Can GL and cash states be replayed?	audit package complete	audit export

Gate 7: AI monitoring and continuous control

Question	Pass condition	Evidence
Is model classification quality monitored by exception type?	slice metrics available	eval report
Are false matches and false closures sampled?	QA sampling active	QA result
Are prompt/source/calendar changes controlled?	change approval and regression run	release bundle
Are reviewer overrides analyzed?	override dashboard	monitoring report
Are incidents fed back into evals?	failed cases added to test set	learning loop record

8. Controls And Evidence Checklist

8.1 File and processing controls

Control	Evidence
Expected file calendar	rail calendar version, expected file list
File integrity	hash, size, control total, item count, sequence
Duplicate prevention	duplicate key, blocked duplicate record, release approval
Batch balancing	debit/credit totals, item count, trailer validation
Reject preservation	original reject record, reason code, repair state
Reprocessing control	reprocess request, approval, idempotency key

8.2 Reconciliation controls

Control	Evidence
Matching rule governance	rule id, tolerance, owner, effective date
Candidate transparency	full candidate list and match features
Materiality threshold	amount/customer/report impact rule
Manual repair SoD	maker, checker, approval token
Exception aging	age, owner, escalation, breach record
Residual acceptance	accountable owner, rationale, review date

8.3 Ledger and cash controls

Control	Evidence
Subledger-to-GL balance	recon run, journal id, batch id
GL journal approval	preparer, approver, evidence ids
Suspense aging	reason code, age bucket, action plan
Suspense release	source proof, journal link, dual approval
Cash statement matching	statement line, value date, counterparty
Nostro/Vostro investigation	correspondent statement, wire advice, FX/fee analysis

8.4 AI controls

Control	Evidence
Use-case boundary	AI role and prohibited actions
Source grounding	evidence ids cited in output
Confidence calibration	score distribution and accuracy by exception type
Human review	reviewer action, reason, override
Tool limitation	read/write scope and policy decision
Monitoring	drift, false match, false close, queue impact
Incident linkage	AI run ids linked to incident samples

9. Repair Queue Design

9.1 Queue states

detected
  -> classified
  -> assigned
  -> evidence_ready
  -> repair_proposed
  -> maker_submitted
  -> checker_approved
  -> executed
  -> reconciled
  -> customer_impact_closed
  -> finance_closed

9.2 Workbench requirements

Requirement	Why it matters
Six-ledger view	analyst sees instruction, file, core, GL, cash and evidence together
AI explanation with citations	summary is useful but source evidence stays visible
Before/after preview	prevents blind repair
Cut-off banner	protects rail and close windows
Customer impact panel	avoids back-office-only thinking
Approval panel	maker-checker embedded in workflow
Downstream impact map	report owner and treasury visibility
Similar case search	accelerates RCA without copying stale fixes
Action constraints	only allowed repair actions by role and state

9.3 Repair action catalog

Action	AI allowed?	Human control
Add evidence note	draft allowed	reviewer saves note
Change queue / reason code	recommend allowed	owner acceptance
Request source file resend	draft request allowed	ops approval
Reprocess file	no direct execution	maker-checker and idempotency proof
Correct posting mapping	recommend only	authorized core ops approval
Apply incoming cash	candidate match only	cash app approval
Release suspense	no direct execution	finance maker-checker
Post GL journal	no direct execution	finance workflow
Notify customer impact owner	draft allowed	customer ops route
Close material exception	recommend only	independent approval

10. SLA, Cut-off And Calendar Guardrails

10.1 Calendar service fields

Field	Notes
rail	ACH, wire, card, RTP/FedNow-like, check, internal transfer, correspondent
product	consumer, small business, commercial, treasury, card, merchant
direction	inbound, outbound, return, reversal, settlement
window	submission, receipt, processing, settlement, return, GL close
source	official source or internal policy
effective date	versioned start
owner	Payments Rules Owner or Finance calendar owner
change control	approval and regression requirement

10.2 SLA matrix

SLA type	Operational target
Detection SLA	expected missing/late file detected before downstream posting risk
Assignment SLA	P0/P1 exceptions accepted by accountable queue quickly enough to preserve cut-off
Evidence SLA	file, core, GL and cash evidence available before repair approval
Repair SLA	repair action executed within rail/customer/close constraints
Reconciliation SLA	repaired item rebalanced across affected ledgers
Suspense SLA	aging thresholds monitored and escalated before stale balances accumulate
Customer-impact SLA	funds/fees/statement impact routed before customer harm expands
Incident SLA	command center opened for material or systemic exception

10.3 Protected windows

During rail cut-off, end-of-day posting, month-end close or material incident:

freeze non-essential model/prompt/rule changes.
block unapproved reprocessing.
route material repairs to senior queue.
require local evidence capture if export pipeline is delayed.
escalate unresolved P0/P1 items to incident commander and Finance Control.
publish downstream report impact notice when data freshness is degraded.

11. Incident Runbook

11.1 Trigger examples

Trigger	Incident mode
Missing high-volume ACH/card/core file near cut-off	payment processing incident
Duplicate file passed validation	duplicate posting containment
Settlement cash variance exceeds threshold	settlement break command center
Suspense balance spike near close	finance control incident
AI classifier routes material breaks to wrong queue	AI operations incident
Evidence ledger export fails during repair window	evidence continuity incident
Downstream report feed uses unreconciled data	reporting impact incident

11.2 First hour actions

1. Declare incident severity and owner.
2. Freeze risky downstream actions: reprocess, suspense release, GL posting, customer-impact sends.
3. Capture source evidence locally if central evidence pipeline is degraded.
4. Identify blast radius: files, batches, accounts, customers, GL accounts, cash accounts, reports.
5. Switch AI to conservative mode: classify and summarize only, no repair recommendation if evidence is incomplete.
6. Assign workstreams: rail/source, core posting, settlement cash, GL/finance, customer impact, AI/control, communications.
7. Start incident log with timestamps, decisions, approvals and unresolved risks.

11.3 Decision log fields

Field	Content
decision_id	stable id linked to incident
time	event time and decision time
decision	action, freeze, reroute, accept residual, communicate, recover
evidence	file ids, report ids, screenshots, system logs, AI run ids
owner	named accountable role
approvals	maker, checker, senior approver if required
customer impact	assessed, not applicable, under review, route opened
finance impact	GL/cash/suspense/reporting impact
recovery condition	what must be true before normal mode resumes

11.4 Recovery gates

Gate	Exit condition
Source integrity	expected files and manifests validated
Posting integrity	duplicate/non-post/reject state understood
Settlement integrity	cash variance matched, explained or accepted
GL integrity	journals balanced and suspense status controlled
Customer impact	affected customer population and remediation route confirmed
AI integrity	classifier/recommender issue contained and regression tested
Evidence integrity	incident package complete enough for audit replay
Management signoff	Ops, Finance, Risk and Technology agree on restart

12. AI Eval And Monitoring Suite

12.1 Scenario library

Scenario	Expected model behavior
File sequence gap with high item count	classify as file integrity P0/P1, avoid repair certainty
Posting reject due to closed account	link to original payment, route to core ops, flag customer impact
Duplicate ACH file candidate	block auto-reprocess, request duplicate evidence
Settlement variance from processor lag	explain timing hypothesis with evidence, not final close
Suspense aging before month-end	raise finance close priority
Unapplied incoming wire with weak remittance	present candidate set and uncertainty
Nostro value-date mismatch	distinguish value date from posting date
Wrong downstream report feed	trace impacted reports and owner
Stale calendar source	refuse precise window conclusion and route to owner
Missing evidence ledger	prevent material closure

12.2 Metrics

Metric	Why it matters
false match rate	wrong candidate match can misapply funds
false non-match rate	unnecessary backlog and delayed cash application
false closure rate	hidden operational and finance risk
queue reroute rate	taxonomy or model routing quality
reviewer override rate	automation bias and model quality signal
evidence completeness	auditability and repair confidence
cut-off breach rate	operational effectiveness
suspense aging	finance control health
customer impact incidents	funds/fees/statement harm
downstream report corrections	reporting control quality
AI-assisted handling time	productivity, only valid with quality metrics

12.3 Monitoring actions

Signal	Action
false match spike	disable probabilistic auto-ranking for affected class
override concentration by reviewer	calibration and independence review
suspense aging spike	Finance Control escalation and RCA
queue backlog near cut-off	surge staffing and lower-risk deferral
source file change without regression	freeze AI routing until validation
evidence completeness drop	block material closure and open incident

13. RACI

Activity	Payment Ops	Settlement Ops	Finance Control	Treasury	Product	Technology	Risk/Compliance	Internal Audit
Exception taxonomy	A	C	C	C	R	C	C	I
File manifest controls	C	C	I	I	C	A/R	C	I
Rail calendar ownership	A	C	I	C	C	R	C	I
Reconciliation rule design	R	A	A	C	C	R	C	I
Repair queue operation	A/R	R	C	I	C	C	C	I
Suspense release	C	C	A/R	C	I	C	C	I
GL journal approval	I	C	A/R	C	I	C	C	I
Liquidity signal review	C	C	C	A/R	I	C	C	I
AI model monitoring	C	C	C	I	A/R	R	C	I
Incident command	A	A	A	C	C	R	C	I
Control testing	I	I	C	I	C	C	C	A/R

Legend: A = accountable, R = responsible, C = consulted, I = informed.

14. Roadmap

Phase 1: Control foundation

Deliverable	Outcome
File manifest service	reliable source ingestion evidence
Payment event graph MVP	trace from file to posting and exception
Exception taxonomy	consistent routing and reporting
Repair queue MVP	SLA, owner, evidence and closure
Suspense aging dashboard	finance-control visibility

Phase 2: AI assist

Deliverable	Outcome
Exception classifier	faster routing with override monitoring
Candidate match assistant	improved cash application and settlement research
Root-cause summarizer	better analyst productivity and incident narrative
Evidence pack generator	audit-ready repair and close support
Eval suite	measurable quality across exception types

Phase 3: Enterprise control plane

Deliverable	Outcome
GL/cash/downstream report lineage	close and reporting impact visibility
Treasury liquidity signal integration	forecast-to-action input with governance
Incident command integration	controlled degraded mode and recovery
Continuous controls monitoring	KRI-driven management oversight
Audit replay export	internal audit and regulator-exam readiness

15. PM / Architect Implications

PM

Product decision	Senior framing
Success metric	reduce unresolved material breaks before cut-off/close, with evidence completeness
User design	operations workbench, not generic chat
Workflow design	detect, classify, prioritize, repair, approve, reconcile, report
AI boundary	assistant for evidence and candidate actions, not autonomous ledger operator
Adoption	start with high-volume, low-ambiguity exceptions before material GL/cash actions
Governance	business value must include control quality, not only handling time

Architect

Architecture decision	Senior framing
Data model	event graph and reconciliation facts instead of flat work orders
Integration	source files and core/GL/cash APIs require idempotency and replay
Controls	rule engine, calendar service, tool gateway and evidence ledger are core components
AI pattern	RAG + classifier + candidate ranking + summarization with citations
Resilience	conservative mode, read-only mode, manual-first mode and local evidence capture
Audit	every material close must reconstruct source-to-action-to-ledger state

16. Anti-patterns

Anti-pattern	Why it fails	Correction
Optimize auto-close rate	hides false closures and stale breaks	optimize material break clearance with QA
Put rail rules in prompt	stale or unverifiable cut-off logic	versioned rule and calendar services
Treat suspense as operational backlog	creates finance and reporting risk	suspense owner, aging, materiality, release control
Let AI execute repair tools directly	unauthorized ledger/cash impact	tool gateway and maker-checker
Use one queue for all exceptions	P0 cut-off risk gets buried	taxonomy and priority routing
Ignore customer impact	delayed posting can create fees, availability or statement harm	customer impact assessment and route
No downstream report lineage	corrected ops data may not reach MI/regulatory feeds	report impact map
Use AI summary as evidence	audit cannot replay source facts	immutable evidence artifacts
Pilot only on clean data	production breaks are messy and time pressured	eval on real exception patterns
Finance joins late	GL/suspense issues discovered at close	Finance Control embedded from design

17. Implementation Guardrails

No AI recommendation can close a material exception without source evidence and human action.
No payment repair action should bypass idempotency, duplicate prevention and before/after preview.
No GL journal or suspense release should be executed by AI directly.
No cut-off or settlement time should be generated from model memory.
No candidate cash application should be presented as confirmed without approval state.
No exception should be marked no customer impact without explicit assessment.
No residual break should survive close without accountable owner and evidence.
No model/prompt/calendar/rule change should enter protected windows without emergency approval.
No incident recovery should restart normal automation before source, ledger, cash and evidence gates pass.
No productivity metric should be accepted without false match, false closure, evidence completeness and customer-impact metrics.

18. Interview-ready Case Answer

问题: 如何设计 AI-enabled payment reconciliation and settlement exception platform?

30 秒版本:

我会先把它定义成支付运营控制平台, 不是自动对账机器人。核心是 payment event graph: rail file、core posting、GL、settlement cash、suspense、return/reversal 和 evidence 串起来。AI 负责异常分类、候选匹配、根因摘要、SLA 优先级和证据包, 但所有影响 posting、cash、GL、suspense 或客户资金的动作都走 rule catalog、calendar service、tool gateway 和 maker-checker。

2 分钟版本:

架构上我会分三层。第一层是 source and lineage: ACH/wire/card/core/GL/cash files 全部进入 file manifest 和 event graph, 记录 hash、sequence、control total、event time、available time。第二层是 reconciliation engines: file-to-core、core-to-GL、GL-to-cash、Nostro/Vostro、suspense aging 和 cash application。第三层是 AI operations workbench: AI 只做 evidence-grounded classification、candidate matching、root-cause summary 和 prioritization。维修动作必须显示 before/after state, 根据金额、客户影响、GL close 和 cut-off 触发 maker-checker。所有 cut-off 和 return windows 来自 versioned calendar/rule catalog, 不是 prompt。指标上不只看自动化率, 还看 false match、false closure、evidence completeness、suspense aging、cut-off breach、customer impact 和 downstream report corrections。

高阶追问:

如果 settlement exception 影响 liquidity, 我会把它作为 treasury forecast signal: 说明 expected vs actual cash、时间窗口、置信度和未决 break。后续 funding 或 balance-sheet action 仍由 Treasury/ALCO 权限、limit check 和 evidence process 决定, AI 不能自动执行。