AI 扩展计划 / Playbooks

AI Open Banking / Open Finance / Consented Data Sharing Playbook

核心判断:

808 行AI_OPEN_BANKING_OPEN_FINANCE_CONSENTED_DATA_SHARING_PLAYBOOK.md

定位: 面向 CBAP+、高级 AI PM、Financial Retail Product Architect、Enterprise Architect、API Platform Owner、Data Product Manager、Fraud Risk、Privacy、Compliance、Model Risk、Third-Party Risk 和 Customer Trust Lead, 把 open banking / open finance / consented data sharing 设计成可上线、可治理、可审计、可撤销、可解释的 AI data-sharing operating system。适用范围: AI personal financial management、cash-flow assistant、account aggregation、income and affordability verification、SME cash-flow underwriting、subscription optimization、bill reminders、hardship support、fraud/scam monitoring、data portability、embedded finance、customer-facing agent、employee copilot 和 API ecosystem governance。核心产出: executive framing、source anchors、taxonomy、decision gates、artifact set、RACI、implementation roadmap、evidence pack、release checklists、metrics、anti-patterns、tabletop scenarios、interview answers 和 practical templates。

核心判断:

AI can create value from permissioned financial data only when consent, API access, data lineage, model use, third-party obligations, revocation and evidence are governed as one architecture.

0. Disclaimer

本文是学习、作品集、架构训练和内部治理讨论材料, 不构成法律意见、监管意见、CFPB 1033 适用性判断、合规期限判断、实体义务判断、隐私影响评估结论、模型验证报告、信息安全认证、消费者通知建议、供应商推荐、KYC/KYB 充分性判断、信用/适当性/投资建议结论或上线批准。

正式项目必须由 Legal、Compliance、Privacy、Information Security、Fraud Risk、Financial Crime、Model Risk、Data Governance、API Platform、Third-Party Risk Management、Vendor Management、Customer Experience、Operations、Accessibility、Product Owner、Architecture、Internal Audit 和管理层结合机构类型、产品风险、客户关系、数据类型、司法辖区、rule status、litigation / regulatory developments、contract structure、供应商角色和内部政策确认。

边界原则:

Consented access 不是无限使用。
API access 不是模型训练许可。
Account data 不是全域行为画像许可。
Data portability 不是数据转售许可。
Revocation 必须影响 collection、use、cache、features、embeddings、agent sessions 和 downstream sharing。
CFPB 1033 或其他要求的 exact applicability 取决于 product、entity role、data type、customer relationship、jurisdiction、rule status、litigation/regulatory developments 和 Legal/Compliance interpretation。

1. Executive Framing

高管通常会听到这样的项目叙事:

Customers can connect their bank accounts.
AI will analyze spending and recommend better products.
Open banking APIs make data safe and standardized.

这些说法如果没有 operating model, 会产生四类隐性风险:

Risk	What goes wrong	Executive concern
Trust risk	客户不知道谁拿了什么数据、为什么拿、怎么撤销	adoption, complaints, brand trust
Data-use risk	数据为预算功能授权, 后续进入 marketing, credit, training 或 unrelated profiling	privacy, consumer harm, regulatory scrutiny
Ecosystem risk	aggregator / developer / downstream party 数据质量差、撤销不传播、incident 不透明	third-party risk, resilience
AI risk	模型把交易数据推断成敏感属性、给出 unsupported advice、触发 agent overreach	model risk, fairness, harm, audit

Executive one-liner:

Open banking AI is not a connector project.
It is a consent, API, model and third-party risk operating model.

1.1 Steering Committee Questions

客户授权的数据是否只服务于 customer-requested product or service?
API scope、feature store、RAG retrieval、agent tools 和 retention 是否都受 consent registry 控制?
data recipient、aggregator、developer app、model provider 和 downstream party 的责任是否清楚?
Revocation 后, 数据、features、embeddings、agent sessions、downstream sharing 和 evidence 如何处理?
投诉发生时, 是否能重放 disclosure、consent、API calls、data quality、AI run、decision 和 customer message?

2. Source Anchors

Anchor	Official link	本 playbook 使用方式
CFPB Personal Financial Data Rights final rule	https://www.consumerfinance.gov/rules-policy/final-rules/required-rulemaking-on-personal-financial-data-rights/	用作 secure and reliable consumer / authorized third-party access、privacy protections、fair and inclusive standards 的 open banking 官方锚点
CFPB Regulation 1033 / Personal Financial Data Rights	https://www.consumerfinance.gov/rules-policy/regulations/1033/	用 covered data、developer interface、authorization disclosure、third-party obligations、accuracy、security、revocation 和 retention 结构化 decision gates
CFPB Personal financial data rights compliance resource	https://www.consumerfinance.gov/compliance/compliance-resources/other-applicable-requirements/personal-financial-data-rights/	用作 rule status、official interpretations、implementation resources 和 regulatory change watch 的入口, 不推断适用性或期限
CFPB compliance circulars / guidance landing page	https://www.consumerfinance.gov/compliance/circulars/	用作 guidance / circulars / supervisory materials 的 horizon scanning source
FFIEC Authentication and Access to Financial Institution Services and Systems	https://www.ffiec.gov/press/pr081121.htm	用 authentication risk assessment、layered controls、MFA/equivalent strength、third-party and customer-permissioned access controls 组织 release gates
NIST Privacy Framework	https://www.nist.gov/privacy-framework	用 privacy risk management、purpose limitation、data minimization、governance 和 customer trust 组织 privacy controls
NIST AI RMF	https://www.nist.gov/itl/ai-risk-management-framework	用 AI risk Govern / Map / Measure / Manage 思路组织 AI eval、monitoring、incident and risk treatment
ISO/IEC 42001 overview	https://www.iso.org/standard/42001	用 AI management system、roles、policy、operation、performance evaluation、internal audit 和 continual improvement 组织 operating rhythm

Source-to-control pattern:

source anchor -> policy interpretation -> architecture decision
  -> workflow requirement -> technical control
  -> evidence artifact -> owner -> metric -> review cadence

3. Taxonomy

3.1 Ecosystem Roles

Role	Product meaning	Control question
Customer / consumer	授权数据分享、使用 AI 服务、撤销、纠错、投诉	是否理解 value, data categories, duration, AI use and revocation?
Data provider	持有账户和金融产品数据的一方	developer interface 是否安全、可靠、标准化、可审计?
Authorized data recipient	为客户请求的服务访问数据的一方	是否只收集、使用、保留合理必要的数据?
Data aggregator	代表 provider / recipient 连接数据生态	revocation, accuracy, security, contract pass-through 是否可证明?
Developer app	面向客户的 app 或 embedded journey	是否通过 onboarding, certification and monitoring?
AI system	RAG、预测、推荐、agent、copilot 或自动化规则	是否受 consent-purpose-feature boundary 限制?
Downstream processor	模型供应商、analytics vendor、cloud、subprocessor	数据是否被约束、隔离、加密、审计和退出?
Governance owners	Legal、Compliance、Privacy、Risk、Audit、Product、Architecture	谁 accountable for interpretation, risk acceptance and CAPA?

3.2 Data Classes

Class	Examples	Governance rule
Raw provider data	transactions, balances, terms, bill data, account verification	consent-bound, source-preserved, minimal access
Normalized data	mapped schema, currency/timezone standardized, deduped	preserve source and transformation lineage
Enriched data	merchant category, recurring payment, subscription, income pattern	confidence and uncertainty required
Derived features	cash-flow volatility, income stability, overdraft risk	allowed/prohibited uses and fairness review
AI inference	vulnerability signal, spending persona, scam likelihood	never treated as verified fact
Customer-facing output	recommendation, alert, explanation, agent draft	source-grounded and policy-checked
Evidence data	consent, API call, model run, reviewer decision	retained under evidence policy, not generic analytics

State	System behavior
Draft	disclosure prepared, no collection
Active	scoped collection and use allowed for purpose
Expired	stop collection; reauthorization needed
Revoked	stop collection; enforce retention / cache / feature / embedding actions
Suspended	temporary hold due to fraud, complaint, provider issue or policy review
Disputed	customer questions accuracy or authorization; freeze high-impact use
Archived	evidence retained under policy, no operational use

3.4 AI Use Levels

Level	Example	Control
Inform	show balances or transactions	source display and freshness
Explain	categorize spend or summarize fees	grounding and quality caveats
Predict	cash-flow forecast or income volatility	model eval and drift monitoring
Recommend	budget plan or hardship option	suitability/advice boundary and explanation
Decide	eligibility, credit, fraud action	formal policy, human oversight, adverse-action/evidence workflow where applicable
Act	submit application, initiate payment, change account	separate action authorization and step-up

4. Decision Gates

Gate 0: Use Case Eligibility

Question	Pass condition
What customer-requested product or service justifies data access?	one-sentence value proposition approved
Is AI involved in collection, interpretation, recommendation, decision or action?	AI role and impact tier documented
Is the use case read-only, recommendation, decision or action?	action boundary defined
Does the use case involve regulated advice, credit, eligibility, identity, hardship or fraud?	Legal/Compliance/Model Risk review route defined
Can customer value be delivered with less data?	minimization alternatives assessed

Gate 1: Role and Regulatory Interpretation

Question	Pass condition
Are data provider, recipient, aggregator, service provider and downstream processor roles mapped?	ecosystem role map signed off
What data type and customer relationship are involved?	data taxonomy and scope record
Which jurisdictions and rule-status dependencies matter?	legal/compliance interpretation record
Are compliance dates, stays, litigation or amendments being monitored without making product assumptions?	regulatory horizon watch owner
Are contracts aligned to role allocation?	contract checklist completed

Question	Pass condition
Does disclosure show recipient, provider, service, categories, duration, frequency and revocation?	UI copy and consent receipt versioned
Are data categories/time windows necessary for the service?	minimization matrix approved
Are optional data scopes separated from required scopes?	scope design supports opt-down
Is AI use explained at the correct level?	customer-facing AI use statement approved
Can revocation be done as easily as authorization?	revocation UX and test evidence

Gate 3: API and Developer Ecosystem

Question	Pass condition
Is client registration tied to business purpose and scopes?	developer profile and scopes approved
Are authentication, token, rate, logging and security controls defined?	API gateway control spec
Is schema standardized and machine-readable enough for recipient use?	API contract and test suite
Are error, outage, retry and consent-state semantics defined?	integration runbook
Has the developer/app passed sandbox and negative scenarios?	certification result

Gate 4: Third-Party and Vendor Risk

Question	Pass condition
Are aggregator, model provider and subprocessors identified?	third-party inventory
Are data-use, retention, training, resale, subprocessing and audit rights contracted?	contract evidence
Are security, incident response and breach notification controls reviewed?	TPRM approval
Is revocation propagated to all relevant downstream parties?	revocation propagation test
Is exit/portability/replacement feasible?	exit plan and data return/destruction control

Gate 5: Data Quality and Feature Boundary

Question	Pass condition
Are raw, normalized, enriched and derived layers separated?	data lineage design
Are freshness, completeness, categorization and confidence available to AI?	quality metadata
Are features tied to consent purpose and allowed decisions?	feature boundary registry
Are sensitive inferences blocked or reviewed?	sensitive inference policy and test
Are customer corrections and disputes handled?	correction workflow

Gate 6: AI / RAG / Agent Controls

Question	Pass condition
Does every retrieval call check consent and purpose?	entitlement test
Are indexes isolated by customer/purpose or equivalent controls?	RAG architecture review
Are outputs source-grounded and freshness-aware?	eval result
Are advice, credit, eligibility, identity and legal conclusions bounded?	output policy tests
Are agent actions separately authorized?	action approval design

Gate 7: Fraud, Scam and Abuse

Question	Pass condition
Are malicious app, consent phishing, ATO, API exfiltration and scam scenarios covered?	threat model
Are high-risk access and actions step-up protected?	authentication policy
Are recipient and API anomalies monitored?	fraud dashboard
Are customers warned about risky sharing or suspicious recipients?	customer trust UX
Is support trained for scam and revocation incidents?	ops training record

Gate 8: Evidence, Complaints and Audit

Question	Pass condition
Can a case replay consent, API calls, data transforms, AI run and decision?	evidence bundle complete
Is customer final communication captured?	message id and content version
Can complaints link to consent and AI trace?	complaint schema updated
Are logs privacy-safe and retention-bound?	data governance approval
Is CAPA ownership clear?	RCA taxonomy and backlog owner

5. Required Artifacts

Artifact	What it proves
Executive One-Pager	客户价值、risk posture、AI boundary 和 go/no-go decision
Ecosystem Role Map	明确 provider、recipient、aggregator、developer、model provider 和 downstream processors
Regulatory Interpretation Card	记录 Legal/Compliance 对 role、data、jurisdiction、rule status 的解释边界
Consent and Disclosure Pack	证明客户看见了 scope、purpose、duration、frequency、revocation 和 AI use
Data Minimization Matrix	证明每个 data category 都有业务目的
API Contract and Scope Catalog	证明 technical interface 可执行 purpose-bound access
Developer Onboarding Checklist	证明第三方 app 通过 security/privacy/ops/model/fraud gates
Third-Party Risk File	证明 aggregator/vendor/subprocessor obligations and exit controls
Data Quality and Lineage Spec	证明 AI 能理解 freshness、quality、source and uncertainty
Feature Boundary Registry	证明 derived features 有 allowed/prohibited uses
RAG Control Spec	证明检索隔离、source grounding、revocation lifecycle
Agent Authorization Matrix	证明 read/reason/recommend/act 分级授权
Fraud Threat Model	证明 malicious app、ATO、API exfiltration、APP scam 等场景被覆盖
Evidence Bundle Schema	证明每个 AI answer/decision/action 可审计
QA/Eval Scenario Suite	证明上线前跑过 positive, negative, revocation, fraud and accessibility cases
Operating Model RACI	证明跨 Product、Architecture、Risk、Legal、Compliance、Ops 的 accountability

5.1 Data Minimization Matrix

Customer value	Bad request	Better request
Budget summary	all accounts, all history, all fields	selected accounts, 3-6 month transactions, categories required for budget
Account ownership	bank statement PDF and full account number	account verification signal or truncated/tokenized identifier
Cash-flow alert	continuous full transaction feed forever	time-bound balance and recurring obligations
Subscription cancellation	all transaction history and merchant identities	recurring merchant transactions in selected categories
Income verification	all spending plus employer inference	income/payroll deposits or approved income data source
Hardship support	full profile and behavioral history	income drop, missed payments and customer-selected supporting data

5.2 AI Boundary Matrix

AI task	Allowed	Block or review
Summarize spending	source-grounded summary for authorized account/time window	unsupported statement about full financial health
Detect recurring bills	explain candidate recurring transactions with confidence	cancel or contact biller without approval
Forecast cash flow	provide estimate with freshness caveat	guarantee affordability or credit outcome
Identify scam risk	route to warning/human review	reveal fraud rules or accuse customer
Prefill application	prefill from authorized source rows	submit without review
Recommend product	show customer-requested comparison if allowed	targeted cross-sell from unrelated consent
Train model	only under separately approved policy and consent basis	default training on connected-account data

6. RACI / Operating Model

Activity	Accountable	Responsible	Consulted	Informed
Use case selection	Product Owner	AI PM / Senior BA	CX, Risk, Architecture	Steering Committee
Regulatory interpretation	Legal / Compliance	Compliance Product Advisor	Privacy, Product, Architecture	Executive Sponsor
Consent and disclosure	Privacy / Product Owner	BA / Content / Design	Legal, Compliance, Accessibility	Operations
Data minimization	Privacy / Data Governance	Data Product Owner	Product, Model Risk, Legal	Audit
API scope and contract	API Platform Owner	API Architect / Engineering	Security, Product, Developer Relations	Operations
Developer onboarding	Third-Party Risk / API Platform	Developer Relations / Vendor Mgmt	Security, Privacy, Compliance, Fraud	Product
Aggregator/vendor oversight	TPRM Owner	Vendor Management	Legal, Security, Data Governance	Risk Committee
Data quality and lineage	Data Governance	Data Engineering / Data Product	Product, Model Risk	Operations
AI feature boundary	Model Risk / AI Governance	AI PM / ML Platform	Privacy, Compliance, Product	Audit
RAG controls	Architecture / AI Platform	Engineering / ML Platform	Security, Data Governance	Product
Agent action controls	Product Risk Owner	AI Platform / Workflow Engineering	Fraud, Legal, CX	Operations
Fraud threat model	Fraud Risk	Fraud Analytics / Security	API Platform, Product, Legal	Executive Risk
Evidence ledger	Architecture / Data Governance	Platform Engineering	Audit, Privacy, Compliance	Operations
Complaint linkage	Complaint Operations	Case Management / Data	Product, Compliance, Model Risk	Audit
Independent assurance	Internal Audit	Audit Team	Risk, Legal, Technology	Board Committee

Governance cadence:

Cadence	Forum	Output
Weekly	Pilot risk standup	consent defects, API errors, AI eval failures, fraud signals
Biweekly	Developer/app review	onboarding decisions, scope changes, rejected apps
Monthly	Open banking performance review	API SLO, data quality, revocation, complaints
Monthly	AI use and feature review	feature boundary, RAG grounding, inference incidents
Quarterly	Third-party risk committee	aggregator/vendor findings, exit readiness, incidents
Quarterly	Regulatory horizon review	CFPB/guidance/standard-setting/jurisdiction updates
Semiannual	Tabletop exercise	malicious app, revocation failure, model misuse, provider outage
Annual	AI management system / internal audit	policy effectiveness, evidence completeness, CAPA aging

7. Implementation Roadmap

Days 1-30: Strategy and Control Baseline

Day range	Work	Artifact
1-3	Pick one bounded journey, such as cash-flow assistant or account verification	Executive One-Pager
4-6	Map ecosystem roles and Legal/Compliance interpretation boundaries	Ecosystem Role Map, Interpretation Card
7-10	Define customer value, purpose, data categories, time window and AI role	Use Case Boundary Card
11-14	Build data minimization matrix and consent disclosure draft	Minimization Matrix, Consent Pack v1
15-18	Define API scopes, schema, error, freshness and revocation semantics	API Contract v1
19-21	Identify aggregator/vendor/model provider/subprocessors	Third-Party Inventory
22-24	Design data quality, lineage and feature boundary model	Data Lineage Spec, Feature Registry v1
25-27	Threat model AI/RAG/agent/fraud/scam scenarios	Threat Model
28-30	Define evidence schema, metrics and release gates	Evidence Bundle Schema, Dashboard Spec

Days 31-60: Controlled Pilot

Day range	Work	Artifact
31-35	Build consent registry, API scope enforcement and entitlement checks	Control Test Report
36-40	Integrate provider/aggregator APIs with quality and lineage tags	Data Quality Report
41-44	Implement RAG isolation, source citations and freshness-aware answers	RAG Eval Report
45-48	Configure AI output policies and prohibited-use controls	AI Guardrail Report
49-52	Implement revocation propagation across tokens, cache, features and embeddings	Revocation Test Evidence
53-55	Run developer/third-party onboarding and contract controls	TPRM Approval Pack
56-58	Train operations, fraud and complaints teams	Training Record
59-60	Launch pilot with manual review sampling and kill switch	Pilot Launch Decision

Days 61-90: Scale and Assurance

Day range	Work	Artifact
61-65	Analyze consent drop-off, value, data quality, AI defects and complaints	Outcome Review
66-70	Tune scopes, prompts, feature boundaries and fraud thresholds	Change Record
71-74	Test malicious app, ATO, consent phishing and provider outage scenarios	Tabletop Report
75-78	Validate complaint replay and evidence completeness	Audit Replay Sample
79-82	Complete Model Risk, Privacy, Security and Compliance review	Governance Review Pack
83-86	Validate exit plan and vendor/subprocessor controls	Exit Readiness Evidence
87-90	Decide scale, restrict, redesign or retire	Go/No-Go Decision Record

8. Evidence Pack

Minimum evidence fields:

Field	Purpose
`case_id`	customer journey reference
`use_case_id`	product/service and purpose
`customer_ref`	controlled reference, not raw PII in general logs
`data_provider_id`	provider of account/financial data
`data_recipient_id`	party receiving data
`aggregator_id`	aggregator if used
`developer_app_id`	app / client registration
`consent_id`	authorization record
`disclosure_version`	what the customer saw
`data_categories`	categories requested and actually collected
`scope_ids`	API scopes
`duration`	authorization period
`frequency`	collection frequency
`revocation_method`	how customer can revoke
`api_request_ids`	provider/aggregator/API calls
`api_response_hashes`	integrity without over-logging sensitive payloads
`data_quality_score`	freshness, completeness, confidence
`lineage_refs`	raw, normalized, enriched, feature layers
`feature_ids`	derived features used
`retrieval_run_id`	RAG retrieval trace
`source_record_refs`	rows/documents used for answer
`model_id`	model/version
`prompt_policy_id`	prompt/tool/output policy
`ai_run_id`	AI trace
`human_review_id`	reviewer and rationale if applicable
`action_approval_id`	separate approval for submit/payment/account change
`fraud_risk_result`	risk score and reason codes
`customer_final_message_id`	final answer/notice/alert
`complaint_id`	linked complaint if any
`retention_rule`	retention and deletion/archival policy
`revocation_event_id`	revocation and propagation evidence
`capa_id`	corrective action if defect

Evidence rules:

Store consent, operational data, model traces and raw financial payloads with separate access controls.
Do not log customer credentials, private keys, raw tokens or full account numbers in generic observability.
Treat missing consent id, missing source refs or missing revocation evidence as control defects.
Keep customer-facing text versions, not only backend decisions.
Record what the AI saw and what it did not see.
Preserve data quality and freshness at answer time.
Make complaint replay possible without exposing unnecessary raw data to support teams.

9. Checklists

9.1 Release Checklist

Check	Passing evidence
Customer-requested service defined	Use Case Boundary Card
Legal/Compliance interpretation boundary recorded	Interpretation Card
Data minimization approved	Minimization Matrix
Consent disclosure tested	Consent Pack and UI evidence
API scopes configured	Scope Catalog
Runtime entitlement checks pass	Control Test Report
Developer/app onboarding completed	Developer Certification
Aggregator/vendor review completed	TPRM Approval
Data quality and lineage available	Data Quality Report
Feature boundaries approved	Feature Registry
RAG source grounding passes	RAG Eval
Revocation propagation tested	Revocation Evidence
AI prohibited-use tests pass	Guardrail Eval
Fraud threat model complete	Threat Model
Complaint replay tested	Audit Replay Sample
Operations trained	Training Record

Check	Passing evidence
Data recipient name visible and understandable	UI screenshot/spec
Data provider name visible	UI screenshot/spec
Requested product/service described	content approval
Data categories listed with useful specificity	disclosure text
Duration and frequency shown	consent receipt
Revocation method shown	consent receipt and test
AI use explained without overclaiming	AI use statement
Optional scopes separated	UX test
Language and accessibility reviewed	accessibility evidence
Copy avoids blanket waiver language	content/legal review

9.3 API and Developer Checklist

Check	Passing evidence
Client registered and owned	developer profile
Business purpose approved	onboarding decision
Scopes match minimization matrix	scope review
Authentication and token controls configured	security test
No customer credential collection	architecture review
Error and outage behavior tested	integration test
Rate limits and anomaly monitoring active	gateway dashboard
Revocation notification path tested	revocation test
Sandbox negative cases passed	certification record
Incident contact and support path available	ops runbook

9.4 AI / RAG Checklist

Check	Passing evidence
Retrieval checks active consent and purpose	entitlement test
Customer/purpose isolation enforced	architecture review
Source rows captured	retrieval trace
Freshness shown or used in policy	answer evidence
Prompt-injection controls tested	red-team result
Sensitive inference policy applied	eval result
Advice/eligibility/credit boundaries enforced	output policy test
Revoked data not retrieved	revocation regression
Embedding lifecycle defined	data governance record
Human escalation triggers work	case test

9.5 Revocation Checklist

Check	Passing evidence
Revocation is easy to find and operate	UX test
API tokens stopped	gateway log
Provider/aggregator/downstream notified	propagation log
New collection stopped	data ingestion log
Cache action executed	cache record
Feature action executed	feature registry event
Embedding action executed	vector lifecycle record
Agent sessions stopped or downgraded	workflow event
Customer receipt issued	message id
Exceptions documented	risk acceptance or remediation

9.6 Fraud and Scam Checklist

Check	Passing evidence
Malicious recipient scenario tested	tabletop result
Consent phishing warning patterns defined	UX/risk rule
ATO-linked authorization monitored	fraud dashboard
High-risk access step-up active	auth policy
API exfiltration anomaly controls active	SIEM/gateway alerts
APP scam intervention defined	fraud playbook
Support can revoke and escalate	ops test
Third-party incident drill completed	exercise record

10. Metrics and KRIs

Metric	Why it matters
Connection completion rate	friction and customer value
Time to first useful insight	value realization
Consent drop-off by disclosure step	trust and clarity
Average data categories requested	minimization
Unused data ratio	overcollection
Revocation success time	customer control
Reauthorization completion	ongoing trust
API response success and latency	ecosystem reliability
Provider/aggregator error rate	operational risk
Data freshness defect rate	answer quality
Categorization correction rate	enrichment quality
RAG grounded-answer rate	AI reliability
Unsupported conclusion rate	model risk
Sensitive inference defect rate	privacy/fairness risk
Feature-without-consent-lineage count	governance gap
Revoked-data retrieval attempts	lifecycle control
Malicious/deceptive app alerts	ecosystem risk
ATO-linked authorization rate	fraud risk
Scam escalation conversion	customer protection
Third-party review overdue rate	vendor risk
Complaint trace completeness	audit readiness
CAPA aging	governance follow-through

Balanced scorecard:

Value: customers receive clear, useful financial outcomes.
Consent: data sharing is understood, scoped and revocable.
Data: quality, lineage and minimization are measurable.
AI: outputs are grounded, bounded and monitored.
Ecosystem: APIs and third parties are secure and reliable.
Fraud: malicious access and scams are detected.
Evidence: decisions and actions are replayable.

11. Anti-Patterns

Anti-pattern	Why it fails	Better pattern
“Connect bank” as blanket consent	hides data categories and purposes	scoped authorization disclosure and receipt
Collect all data for future AI	violates minimization and trust	purpose-specific data selection
Generic lake before governance	loses consent and revocation metadata	consent-bound data zones
RAG index shared across purposes	leakage and prohibited use	customer/purpose-isolated retrieval
Model training by default	customer did not authorize training	separate approved training policy and consent basis
AI turns spend into sensitive labels	privacy and fairness harm	sensitive inference controls
Revocation only disables token	old features and embeddings continue	full lifecycle revocation
Aggregator chosen only by coverage	ignores security, quality, revocation, exit	TPRM and architecture review
API error becomes AI fact	false customer advice	error-aware and freshness-aware responses
Agent acts under data consent	unauthorized payments or submissions	separate action approval
Developer onboarding is sales-led	weak third-party control	security/privacy/fraud/model certification
Success measured only by engagement	incentivizes surveillance	balanced value, trust and risk metrics

12. Tabletop Scenarios

A customer is coached by a scammer to authorize a budgeting app that uses a name
similar to a known fintech. The app requests transaction history and account verification data.

Expected decisions: developer identity display, app reputation checks, customer warning, revocation support, fraud escalation, evidence capture。

Scenario 2: Revoked Data Still in RAG

The customer revokes access. The API token is disabled,
but the AI assistant continues answering questions from old embeddings.

Expected decisions: embedding lifecycle defect, retrieval entitlement fix, customer communication, CAPA, regression test。

Scenario 3: Unsupported Credit Inference

The model uses cash-flow data from a budgeting consent to tell the customer
they are likely eligible for a credit product.

Expected decisions: purpose violation, output policy block, feature boundary update, model-risk review。

Scenario 4: Aggregator Outage

Balance refresh fails for six hours. The AI assistant still recommends moving money
to avoid an overdraft using stale balance data.

Expected decisions: freshness gate, degraded-mode messaging, action block, provider/aggregator incident handling。

Scenario 5: Malicious Developer Overcollection

A developer app certified for subscription detection starts requesting broad historical data
and showing signs of bulk export.

Expected decisions: anomaly detection, scope suspension, audit rights, customer notification analysis, incident response。

Scenario 6: SME Authority Ambiguity

An employee connects business accounts to an AI cash-flow underwriting app,
but authority to share business financial data is unclear.

Expected decisions: business consent authority model, role verification, human review, contract update。

Scenario 7: Prompt Injection in Transaction Memo

A transaction memo contains text instructing the assistant to ignore privacy controls
and reveal all recent transactions.

Expected decisions: data sanitation, prompt-injection defense, retriever policy, red-team regression。

13. Portfolio Deliverables

Deliverable	What it demonstrates
Executive one-pager	你能把 open banking AI 从 connector 讲成 trust operating model
Ecosystem role map	你能区分 provider、recipient、aggregator、developer、AI operator
Consent disclosure pack	你能设计客户可理解、可撤销、可审计的数据分享体验
Data minimization matrix	你能把价值主张转成最小数据请求
API scope catalog	你能把 purpose limitation 落到技术接口
Developer onboarding checklist	你能治理第三方生态
Feature boundary registry	你能防止交易数据变成无边界画像
RAG control spec	你能设计账户数据上的安全检索增强生成
Agent authorization matrix	你能区分 read, reason, recommend, act
Fraud threat model	你能覆盖 malicious app、ATO、APP scam、API exfiltration
Revocation runbook	你能处理 token、cache、features、embeddings 和 downstream sharing
Evidence bundle schema	你能让投诉、审计和监管问询可重放
Metrics dashboard	你能平衡 value、trust、risk、AI safety 和 ecosystem performance

Portfolio storyline:

I designed an AI open banking architecture where customer consent is a runtime control plane.
The system requests minimum data for a customer-requested service,
uses governed APIs and third parties,
preserves data quality and lineage,
constrains AI features and RAG retrieval by purpose,
separates recommendations from actions,
propagates revocation through features and embeddings,
and keeps evidence replayable for audit and complaints.

14. Interview Answers

Q1: 如何向高管解释 AI open banking 的价值和风险?

30 秒:

价值是让客户授权自己的金融数据, 获得现金流洞察、账户聚合、费用优化、收入验证和更好的金融体验。风险是授权被解释成无限使用, API 数据进入无边界模型, 第三方和 aggregator 控制薄弱, 撤销不生效。成熟方案要把 consent registry 做成 runtime control plane, 约束 API scope、features、RAG、agent actions、retention and evidence。

30 秒:

Consent 是客户同意某个 recipient 为特定服务访问特定数据。Authorization 是系统允许某个 client、agent 或 workflow 做某件事。Authentication 是确认当前用户或 client 身份。连接银行账户的 consent 不能自动授权 AI 发起付款、提交申请或用数据做不相关训练。

Q3: 如何防止 open banking 数据被 AI 滥用?

30 秒:

从数据入口就绑定 consent_id、purpose、data category、duration、allowed uses、prohibited uses 和 revocation action。Feature store、RAG retriever、model serving 和 agent tools 每次使用都检查 entitlement。敏感推断、交叉销售、训练和高影响决策需要单独政策审批、eval、人审和证据。

Q4: RAG over account data 的核心设计是什么?

30 秒:

它必须是 customer/purpose isolated retrieval, 不是通用知识库。每次检索先查 consent and entitlement, 再按 time window、data category、freshness、quality 取 source rows。回答要带 source grounding 和 caveat, 禁止越界 advice/action, revocation 后要停止检索并处理 embeddings lifecycle。

Q5: Revocation 为什么是架构问题而不是 UI 功能?

30 秒:

因为撤销不只是隐藏连接状态。它要停止 provider/aggregator API collection, 通知 downstream parties, 禁用或限制 derived features, 处理 cache and embeddings, 停止 agent workflows, 更新 retention 状态, 给客户 receipt, 并保留审计证据。做不到这些, 撤销只是前端按钮。

15. Practical Templates

15.1 Use Case Boundary Card

Use case:
Customer-requested product/service:
Customer segment:
Jurisdiction / policy scope:
Data provider(s):
Data recipient:
Aggregator / vendor:
AI role:
Decision/action impact:
Required data categories:
Optional data categories:
Time window:
Refresh frequency:
Allowed AI uses:
Prohibited AI uses:
Human review triggers:
Revocation behavior:
Evidence requirements:
Accountable owner:
Risk owner:

Recipient name:
Provider name:
Aggregator name if visible/applicable:
Requested product/service:
Data categories:
Required vs optional data:
Duration:
Collection frequency:
How data will be used:
How AI will be used:
Uses not permitted:
Downstream sharing:
Revocation method:
Customer support route:
Language:
Accessibility evidence:
Disclosure version:

15.3 Developer Onboarding Record

Field	Example
developer_app_id	`cashflow-assistant-prod`
legal_entity	Example Fintech Inc.
product_service	AI cash-flow insights
requested_scopes	transactions.read.6m, balances.read, bills.read
rejected_scopes	account_verification.full, transactions.all_history
security_status	approved with annual review
privacy_status	approved, no targeted advertising or resale
AI_status	RAG only, no model training on customer data
fraud_controls	anomaly monitoring, high-risk warning, support escalation
revocation_test	passed
incident_contact	named operational contact
exit_plan	data return/destruction and token termination

15.4 Feature Boundary Register

Feature name:
Business purpose:
Source data categories:
Consent purpose:
Transformation:
Confidence / quality dependency:
Allowed model use:
Allowed customer outputs:
Allowed decisions:
Prohibited decisions:
Sensitive inference review:
Fairness review:
Human review triggers:
Retention:
Revocation action:
Monitoring metric:
Owner:

15.5 RAG Control Record

RAG use case:
Retriever policy:
Consent check:
Customer/account isolation:
Purpose isolation:
Data categories:
Time window:
Freshness requirement:
Quality filters:
Prompt injection controls:
Source citation rule:
Output policy:
Human escalation:
Revocation behavior:
Eval scenarios:
Evidence fields:

15.6 Revocation Evidence Record

Revocation event id:
Customer ref:
Consent id:
Data recipient:
Data provider:
Aggregator:
Developer app:
Revoked at:
Customer channel:
API token action:
Provider notification:
Aggregator notification:
Downstream notification:
Collection stop confirmed:
Cache action:
Feature action:
Embedding action:
Agent workflow action:
Retention exception:
Customer receipt:
Evidence bundle:
Owner:
Closure time:

15.7 Complaint RCA Template

Complaint id:
Customer issue:
Consent id:
Disclosure version:
Data categories:
API request ids:
Data quality status:
Feature ids:
AI run id:
RAG source refs:
Human review:
Final customer message:
Alleged harm:
Root cause category:
Immediate remediation:
Customer remediation:
CAPA owner:
CAPA due date:
Closure evidence:

16. Final Operating Principle

这套 playbook 的成熟度可以用一个问题检验:

When an AI-enabled financial product uses customer-authorized open banking or open finance data,
can the institution prove the customer understood the value and scope,
the API enforced the consent,
the data stayed minimized and lineage-preserved,
the model used only permitted features and retrieval sources,
third parties honored purpose, retention and revocation,
fraud/scam controls monitored abuse,
agent actions required separate approval,
and every outcome can be replayed for customer trust, audit and remediation?

如果答案不清楚, 不是缺一个 bank-connection SDK。问题是 consented data sharing、API platform、data governance、AI governance、third-party risk、fraud controls、revocation lifecycle 和 evidence architecture 还没有成为同一套 operating system。

AI Open Banking / Open Finance / Consented Data Sharing Playbook

AI Open Banking / Open Finance / Consented Data Sharing Architecture Playbook

0. Disclaimer

1. Executive Framing

1.1 Steering Committee Questions

2. Source Anchors

3. Taxonomy

3.1 Ecosystem Roles

3.2 Data Classes

3.3 Consent States

3.4 AI Use Levels

4. Decision Gates

Gate 0: Use Case Eligibility

Gate 1: Role and Regulatory Interpretation

Gate 2: Consent and Data Minimization

Gate 3: API and Developer Ecosystem

Gate 4: Third-Party and Vendor Risk

Gate 5: Data Quality and Feature Boundary

Gate 6: AI / RAG / Agent Controls

Gate 7: Fraud, Scam and Abuse

Gate 8: Evidence, Complaints and Audit

5. Required Artifacts

5.1 Data Minimization Matrix

5.2 AI Boundary Matrix

6. RACI / Operating Model

7. Implementation Roadmap

Days 1-30: Strategy and Control Baseline

Days 31-60: Controlled Pilot

Days 61-90: Scale and Assurance

8. Evidence Pack

9. Checklists

9.1 Release Checklist

9.2 Consent UX Checklist

9.3 API and Developer Checklist

9.4 AI / RAG Checklist

9.5 Revocation Checklist

9.6 Fraud and Scam Checklist

10. Metrics and KRIs

11. Anti-Patterns

12. Tabletop Scenarios

Scenario 1: Consent Phishing

Scenario 2: Revoked Data Still in RAG

Scenario 3: Unsupported Credit Inference

Scenario 4: Aggregator Outage

Scenario 5: Malicious Developer Overcollection

Scenario 6: SME Authority Ambiguity

Scenario 7: Prompt Injection in Transaction Memo

13. Portfolio Deliverables

14. Interview Answers

Q1: 如何向高管解释 AI open banking 的价值和风险?

Q2: Consent 和 authorization 有什么区别?

Q3: 如何防止 open banking 数据被 AI 滥用?

Q4: RAG over account data 的核心设计是什么?

Q5: Revocation 为什么是架构问题而不是 UI 功能?

15. Practical Templates

15.1 Use Case Boundary Card

15.2 Consent Disclosure Checklist

15.3 Developer Onboarding Record

15.4 Feature Boundary Register

15.5 RAG Control Record

15.6 Revocation Evidence Record

15.7 Complaint RCA Template

16. Final Operating Principle