返回 Papers
AI 底层逻辑 / 经典论文

AI AML Alert Triage:调查工作台架构

以下来源只作为架构和控制设计锚点。具体适用性、监管口径、阈值、保存期限、SAR filing decision 和客户处置由 Legal / BSA-AML Compliance / Sanctions / Model Risk / Internal Audit / Business Owner 确认。

557ai-foundations/papers/144-ai-aml-alert-triage-investigation-workbench-architecture.md

AI AML Alert Triage / Investigation Workbench Architecture 解读

面向对象: Senior AI PM / Senior BA / Product Architect / AML Technology Architect / Model Risk Partner / Financial Crime Operations Lead。 核心问题: 如何把 AML alert operations 从“规则告警 + 人工翻系统 + 个案经验”升级为 evidence-first、graph/context-aware、human-owned、model-risk-controlled 的 AI investigation workbench。 学习目标: 能设计队列优先级、实体解析、图谱上下文、证据工作区、analyst copilot、处置建议、QA、SAR 草稿护栏、调参反馈、审计轨迹、职责隔离和采用度量, 同时不把 AI 误设计成 SAR 自动决策器。


Source Anchors

以下来源只作为架构和控制设计锚点。具体适用性、监管口径、阈值、保存期限、SAR filing decision 和客户处置由 Legal / BSA-AML Compliance / Sanctions / Model Risk / Internal Audit / Business Owner 确认。

SourceOfficial link在本文中的架构用法
FFIEC BSA/AML Manual - Suspicious Activity Reportinghttps://bsaaml.ffiec.gov/manual/AssessingComplianceWithBSARegulatoryRequirements/04把 workbench 对齐到 unusual activity identification、managing alerts、SAR decision making、SAR completion、continuing activity monitoring 五段链路; 强调 alert management、staffing、manual/automated monitoring 和支持证据。
FFIEC SAR Examination Procedureshttps://bsaaml.ffiec.gov/manual/AssessingComplianceWithBSARegulatoryRequirements/04_ep用于设计 alert review timeliness、research evidence、CDD/EDD context、documented no-file decision、escalation、prepared/filed SAR quality 和 transaction testing 证据。
FFIEC Customer Due Diligencehttps://bsaaml.ffiec.gov/manual/AssessingComplianceWithBSARegulatoryRequirements/02用 customer risk profile、expected activity、ongoing monitoring 支撑 context engine 和 false-positive explanation。
FFIEC CDD Examination Procedureshttps://bsaaml.ffiec.gov/manual/AssessingComplianceWithBSARegulatoryRequirements/02_ep用于定义客户风险画像变更、信息不足、beneficial ownership、OFAC context 和调查时应暴露的 CDD 证据。
FFIEC BSA/AML Independent Testinghttps://bsaaml.ffiec.gov/manual/AssessingTheBSAAMLComplianceProgram/03用于 QA、独立测试、系统/数据/报告完整准确性、SAR process review、整改闭环和审计工作底稿。
FFIEC Appendix F - ML/TF Red Flagshttps://bsaaml.ffiec.gov/manual/Appendices/07仅作为 red flag 和 additional scrutiny 的 source anchor; 本文不复制 typology/SAR narrative coverage。
FFIEC Appendix L - SAR Quality Guidancehttps://bsaaml.ffiec.gov/manual/Appendices/13用于 SAR draft pre-check、evidence completeness、key-term guardrail 和 narrative quality review。
FinCEN SAR Resourceshttps://www.fincen.gov/suspicious-activity-reports-sars用于 SAR 资源和 BSA E-Filing handoff 边界; workbench 不自动提交 SAR。
FinCEN BSA Filing Informationhttps://www.fincen.gov/resources/filing-information用于 filing workflow、E-Filing interface、submission acknowledgement、recordkeeping handoff 的系统边界。
OFAC Sanctions List Service / Searchhttps://ofac.treasury.gov/sanctions-list-servicehttps://ofac.treasury.gov/sanctions-list-search-tool用于 sanctions hit context、hard-stop escalation 和 sanctions-screening evidence reference; AML workbench 不替代制裁筛查系统。
NIST AI RMFhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 组织 AI risk、eval、monitoring、human oversight 和 issue remediation。
NIST AI RMF Corehttps://airc.nist.gov/airmf-resources/airmf/5-sec-core/用四类 function 设计 continuous risk management, 避免把 AI governance 当一次性上线清单。
NIST GenAI Profilehttps://www.nist.gov/publications/artificial-intelligence-risk-management-framework-generative-artificial-intelligence用于 GenAI 特有风险: hallucination、data leakage、prompt injection、overreliance、third-party dependency、eval 和 red-team。
ISO/IEC 42001https://www.iso.org/standard/81230.html用 AI management system 思路连接 policy、roles、operational control、performance evaluation、management review 和 continual improvement。

1. 一句话定位

AML Alert Triage / Investigation Workbench 不是“给告警系统加一个聊天框”。它是一个把告警、客户画像、交易时间线、实体网络、规则/模型信号、调查证据、人工判断、QA 发现、调参反馈和审计轨迹组织在一起的生产控制系统。

更准确的定义:

AML investigation workbench =
alert queue prioritization
  + entity resolution and graph context
  + evidence-first case assembly
  + analyst copilot
  + human-owned disposition
  + SAR draft guardrails
  + QA and tuning feedback
  + model risk controls
  + audit replay and segregation of duties

关键边界:

  • AI 可以辅助排序、汇总、比对、检查缺口、提出 investigation next step、生成有引用的草稿和 QA pre-check。
  • AI 不应自动决定 SAR filing / no filing。
  • AI 不应自动提交 SAR、关闭高风险 case、解除 sanctions hit、通知客户、联系 law enforcement 或绕过 BSA/AML owner。
  • AI 输出必须被当成可质疑的工作产品, 不是合规结论。

2. Architecture Mental Model

2.1 从 Alert Factory 到 Evidence Operating System

传统 AML 平台常见结构:

transaction monitoring rule/model
  -> alert
  -> analyst queue
  -> manual research across systems
  -> case note
  -> close / escalate / SAR process

AI workbench 的目标结构:

alert signal
  -> entity-resolved context
  -> graph and timeline assembly
  -> risk-prioritized queue
  -> evidence workspace
  -> copilot-supported investigation
  -> human disposition and escalation
  -> QA sample / case replay
  -> tuning and coverage feedback

架构重心从“模型分数”转成“证据和流程”:

Old centerNew center
Alert countRisk-adjusted queue value
Rule hit textEvidence bundle
Analyst memoryReusable investigation pattern
Individual case noteTraceable decision record
False-positive reduction onlyCoverage, quality, timeliness, adoption, auditability
Model outputHuman-owned decision with explainable support

2.2 Workbench 的六个核心对象

Object架构含义不能省略的控制字段
Alertunusual activity signal, 来自规则、模型、referral、law-enforcement request 或 sanctions/fraud referralsource, scenario/model version, trigger facts, generated time, SLA, queue route
Subject / Entitycustomer, account, beneficial owner, counterparty, device, address, business, merchant, walletentity confidence, source system, resolution method, conflict flag
Evidencetransaction, KYC/CDD, EDD, beneficial ownership, prior case, sanctions hit, document, analyst noteprovenance, timestamp, permission class, citation id, freshness, completeness
Caseinvestigation containerassigned owner, status, escalation state, decision owner, QA state
Dispositionhuman-owned outcome such as close as no unusual activity, continue monitoring, escalate, SAR considerationrationale, evidence references, reviewer identity, reason code, policy version
FeedbackQA finding, analyst disagreement, false-positive driver, scenario defect, data quality issuefeedback type, severity, owner, control route, retraining/tuning eligibility

3. Reference Architecture

Source systems
  Core banking, card, ACH/wire, RTP, digital channels, KYC/CDD, case management,
  sanctions screening, fraud, CRM, documents, external lists, FinCEN/advisory registry

Data and identity layer
  Data contracts, lineage, PII tagging, RBAC/ABAC, entity resolution, household/business graph,
  counterparty graph, beneficial ownership graph, feature store, evidence ledger

Decision support layer
  Alert scoring, queue prioritization, typology coverage link, case assembly,
  retrieval, summarization, gap detection, disposition recommendation, QA pre-check

Workflow and controls layer
  Case state machine, maker-checker, escalation, SAR draft handoff, tuning intake,
  model risk gate, audit trail, retention class, access review

Experience layer
  Queue console, investigation workspace, graph/timeline view, copilot panel,
  evidence cards, disposition panel, QA console, management dashboard

3.1 Key Architectural Decisions

DecisionRecommended positionRationaleRisk if wrong
Copilot authorityRead-mostly, recommendation-only for AML decisionsPreserves human ownership and examiner replayAI becomes hidden decision maker
Evidence modelStructured evidence ledger before free-text summaryEnables citation, QA, SAR draft guardrails and audit replayBeautiful summary with unverifiable claims
Entity resolutionConfidence-banded, reversible, source-visibleAML graph errors can contaminate multiple casesFalse linkage, privacy overexposure, bad escalation
Queue prioritizationRisk-adjusted routing, not pure likelihood scoreOperations need timeliness and coverage, not only model rankLow-volume high-risk typologies starve
SAR draftDraft packet with explicit human gateAI may assist wording, not filing conclusionAuto-filing pressure or SAR confidentiality breach
Tuning feedbackControlled feedback taxonomy and change gateAnalyst labels are useful but noisy and incentive-shapedModel learns operational shortcuts
AuditAppend-only event stream with version hashesSupports replay across model, prompt, evidence, user and decisionCannot explain why an alert was closed

4. Queue Prioritization Architecture

Alert queue prioritization should not be a single “risk score”。It is a routing policy combining risk, timeliness, evidence sufficiency, customer context, scenario coverage, operational capacity and escalation constraints.

4.1 Priority Inputs

Signal familyExamplesArchitecture guardrail
Scenario severitystructuring, rapid movement, mule network, high-risk geography, law-enforcement referralSeverity maintained by AML scenario owner, not model team alone
Customer contextCDD risk profile, EDD status, expected activity, account age, occupation/business typeShow source and freshness; do not infer missing CDD as suspicious by itself
Transaction patternamount, velocity, corridor, product, counterparty novelty, cash/ACH/wire mixExplain which facts caused priority; avoid opaque score only
Network contextshared address/device, beneficial owner overlap, counterparty cluster, prior casesEntity confidence must travel with graph edges
Time sensitivityregulatory clock, continuing activity, aged alert, repeated alert on same subjectSLA and aging must override pure risk score when required by policy
Evidence readinessenough data for review, missing source, stale data, broken feedRoute incomplete cases to data exception, not analyst guesswork
Operational routinganalyst skill, language, sanctions expertise, EDD specialization, queue capacityCapacity is a constraint, not a reason to suppress alert volume

4.2 Priority Output

OutputRequired explanation
priority_bandP1/P2/P3/P4 with policy-based reason
route_to_queuegeneral AML, EDD, sanctions referral, fraud-AML, senior investigator, QA sample
SLA_due_atpolicy-derived due date and clock source
top_drivers3-7 cited drivers, each tied to evidence
missing_contextCDD, transaction, counterparty or document gaps
review_warninglow entity confidence, conflicting data, possible sanctions relation, repeat alert

Good prioritization answer:

This alert is P1 because it combines repeated below-threshold cash deposits,
new outbound wires to unrelated counterparties, high-risk customer profile,
and prior alert escalation within 90 days. Entity link confidence is medium
because address match conflicts with phone/device evidence. Review CDD expected
cash activity before disposition.

Bad prioritization answer:

Risk score 0.91. Recommended SAR.

5. Entity Resolution and Graph Context

AML investigation quality depends heavily on whether the system can correctly connect customers, accounts, counterparties, beneficial owners, devices, addresses and prior cases.

5.1 Entity Resolution Bands

BandMeaningProduct behavior
Certainsame customer/account id from authoritative sourceAuto-merge in graph, cite source
Highstrong deterministic match, such as government id plus name/dateShow merged view with provenance
Mediummultiple corroborating soft signalsShow as probable link, require analyst confirmation for escalation
Lowweak shared attribute onlyShow as lead, not as fact
Conflictsource disagreement or stale identityTrigger evidence warning and data-quality route

5.2 Graph Views That Matter

GraphAML investigation useFailure mode
Customer-account graphsubject owns, controls, signs, benefits from accountsmissing beneficial owner hides exposure
Counterparty graphfunds flow, repeated beneficiaries, pass-through chainsgraph clutter without time/amount context
Device/address graphmule rings, synthetic identity, shared digital footprintover-linking households or shared businesses
Case graphprior alerts, SARs, QA findings, repeat suspicious activitySAR-sensitive access breach
Product/channel graphACH, wire, card, crypto on/off ramp, branch, ATMchannel gaps create blind spots

Architecture rule:

Every graph edge needs source, timestamp, confidence, reason and access class.

6. Evidence Workspace

Evidence workspace is the center of the product. The copilot should sit beside evidence, not replace evidence.

6.1 Evidence Card Schema

evidence_id: ev_tx_2026_000184
case_id: case_aml_7182
source_system: wire_platform
source_record_id: wire_983810
evidence_type: transaction
event_time: 2026-06-12T16:21:08Z
subject_entity_id: ent_customer_431
related_entities: [ent_counterparty_882, ent_account_901]
summary: outbound wire to newly observed counterparty
facts:
  amount: 24750.00
  currency: USD
  corridor: US->HK
  channel: online_wire
quality:
  freshness: current
  completeness: complete
  entity_confidence: high
controls:
  access_class: aml_restricted
  retention_class: bsa_supporting_documentation
  citation_required: true

6.2 Evidence Completeness

Investigation questionMinimum evidence
Is activity unusual for the customer?CDD profile, expected activity, historical baseline, transaction sequence
Are counterparties related or high risk?counterparty history, entity resolution, geography, sanctions/fraud referrals
Is this a one-off or continuing activity?prior alerts, prior cases, repeat pattern, lookback period
Can the analyst close with rationale?trigger facts, benign explanation, CDD consistency, reviewer note
Is SAR consideration escalated?fact chronology, amount aggregation, subject details, evidence citations, escalation approval

7. Analyst Copilot Patterns

7.1 High-Value Capabilities

CapabilityGood outputRequired guardrail
Alert explanationexplains trigger and evidence driverscite source fields and scenario/model version
Timeline assemblychronological facts with amount, party, channelno inferred motive without evidence
Graph summaryrelevant relationships and confidencedistinguish confirmed link from lead
Evidence gap checkmissing CDD, stale EDD, absent counterparty contextroute to data/task, not fabricated answer
Investigation plansuggested next questions and documentsanalyst decides which tasks to execute
Disposition assistoptions with evidence for/againstno final SAR/no-SAR decision
Case note draftconcise cited summaryunsupported sentence blocker
QA pre-checkmissing rationale, stale source, bad citationQA owner can override with reason

7.2 Structured Copilot Contract

copilot_output:
  answer_type: investigation_summary
  case_id: case_aml_7182
  scope_boundary: "assistant summary, not final disposition"
  claims:
    - claim: "Customer received three inbound ACH credits and sent two wires within 24 hours."
      evidence_ids: [ev_tx_1, ev_tx_2, ev_tx_3, ev_tx_4, ev_tx_5]
      confidence: high
    - claim: "Counterparty relationship is not established in customer profile."
      evidence_ids: [ev_cdd_1, ev_counterparty_2]
      confidence: medium
  missing_evidence:
    - latest EDD review
    - stated business purpose for Hong Kong wire counterparty
  prohibited_actions:
    - auto_file_sar
    - final_no_sar_decision

8. Disposition Recommendation Without Auto-SAR

Disposition recommendation is useful only when it is explicitly framed as decision support.

Disposition optionAI can doHuman must do
Close as no unusual activityidentify benign evidence and draft rationaledecide closure and sign case
Continue monitoringidentify repeat pattern, missing lookback or future triggerapprove monitoring action and period
Escalate to investigationexplain risk drivers and evidence gapsaccept escalation and assign owner
Refer to sanctions/fraud/EDDshow matched signals and source contextdetermine referral path and handling
SAR considerationassemble evidence packet and narrative draftdetermine whether to file and approve final filing workflow

Hard line:

The system may recommend "escalate for SAR consideration".
The system must not present "file SAR" as an autonomous decision.

9. SAR Draft Guardrails

SAR draft support should be implemented as a guarded writing workflow, not an autonomous filing workflow.

9.1 Guardrail Rules

RuleProduct behavior
No unsupported claimEvery factual sentence must link to evidence id or analyst-entered note
No criminal conclusionUse observed facts and suspicious indicators, not accusation language
No auto-filingSubmit/export requires authorized human step outside copilot autonomy
SAR confidentialityAccess, retrieval, logging and sharing must enforce SAR-sensitive controls
Key term supportSuggest official/internal key terms only with source anchor and reviewer approval
Draft provenanceStore model, prompt, evidence set, analyst edits and approval chain
Missing evidence warningBlock finalization of draft packet when required fields are absent under internal policy

9.2 Draft Packet Boundary

The workbench can produce:

  • cited fact chronology。
  • subject/account/counterparty table。
  • suspicious indicators observed。
  • transactions selected for review。
  • missing evidence and analyst questions。
  • draft narrative marked as AI-assisted。
  • QA checklist and reviewer comments。

The workbench should not produce:

  • final filing conclusion without BSA/AML owner。
  • hidden filing submission。
  • customer notification。
  • law-enforcement disclosure decision。
  • sanctions blocking/release decision。

10. QA, Feedback and Tuning Loop

AML AI improves only if feedback is separated by type. “Analyst clicked close” is not automatically a negative label.

10.1 Feedback Taxonomy

Feedback typeMeaningEligible use
Case dispositionhuman outcome for the caseoperations analytics; model training only after QA controls
QA defectdocumented quality issuecontrol improvement and training
Scenario defectrule threshold/logic issuescenario tuning workflow
Data quality defectmissing, stale, duplicated or conflicting sourcedata remediation
Copilot defecthallucination, bad citation, missing evidence, unsafe wordingprompt/model/RAG eval and remediation
Entity resolution defectfalse merge, missed link, confidence errorentity model tuning
Analyst disagreementreasoned challenge to AI outputeval set enrichment
Compliance overrideBSA/AML owner changed recommendationpolicy/control review

10.2 Tuning Guardrails

RiskGuardrail
Learning from noisy closuresRequire QA-reviewed labels for high-impact model updates
Optimizing only for fewer alertsInclude coverage, SAR quality, false-negative proxies and QA findings
Hidden threshold driftVersion thresholds, retain calibration report, run coverage regression
Analyst incentive biasSeparate productivity metrics from training labels
Typology erosionLink tuning change to scenario/typology coverage matrix
Model shortcutSlice eval by product, customer type, channel, geography and alert source

11. Model Risk and Validation

AML workbench is an AI system, not one model. Validation scope should include rules, ranking model, entity resolution, graph features, retrieval, prompts, LLM, judge, workflow, human oversight and monitoring.

ComponentValidation question
Queue modelDoes prioritization improve timeliness and risk focus without starving low-volume typologies?
Entity resolutionAre false merges and missed links measured by segment and source?
Graph featuresAre graph edges explainable, current and access-controlled?
RAG/retrievalDoes the copilot retrieve correct evidence and respect SAR-sensitive permissions?
LLM summaryAre factual claims supported and uncertainty expressed?
Disposition assistAre options balanced and clearly non-final?
SAR draftAre facts cited, language controlled and filing boundary enforced?
QA judgeDoes automated QA align with human QA and avoid self-grading bias?
WorkflowAre escalation, maker-checker, audit trail and fallback paths tested?

Revalidation triggers:

  • new source system or product/channel。
  • model, prompt, embedding, reranker or graph algorithm change。
  • scenario threshold change。
  • SAR QA defect spike。
  • sanctions/fraud referral workflow change。
  • material queue backlog or staffing model change。
  • regulatory, policy or advisory update。
  • incident, privacy issue, prompt injection or evidence leakage。

12. Audit Trail and Segregation of Duties

12.1 Required Events

EventMinimum fields
alert_createdsource, scenario/model version, trigger facts, timestamp
priority_assignedpriority model/rule version, drivers, route, SLA
entity_resolvedmatch method, confidence, source, user override
evidence_vieweduser/role, evidence id, access reason, timestamp
copilot_generatedmodel/prompt/RAG versions, evidence ids, output hash
disposition_selectedhuman actor, reason code, cited evidence, decision time
escalation_createdtarget queue, reason, approver, SLA
SAR_draft_createddraft version, evidence set, model version, human editor
QA_reviewedreviewer, sample reason, defects, severity, remediation
tuning_feedback_submittedfeedback type, eligibility, owner, approval state
policy_or_threshold_changedchanger, approver, before/after, test evidence

12.2 Incompatible Duties

Incompatible combinationControl
Analyst creates AI-supported closure and approves own QA sampleIndependent QA reviewer
Model owner changes queue ranking and certifies model effectivenessModel risk / validation challenge
Scenario owner tunes threshold and closes coverage review aloneBSA/AML compliance approval plus regression evidence
Copilot drafts SAR packet and submits filingAuthorized human filing process outside copilot autonomy
System admin can edit evidence and audit logWrite-once log, dual control, privileged access monitoring
Vendor generates model output and validates its own production qualityInternal acceptance, independent sample and audit rights

13. Metrics That Matter

Metric familyExamplesWhy it matters
Productivitymedian investigation prep time, evidence assembly time, queue aging, analyst throughputMeasures whether workbench removes manual friction
QualityQA defect rate, unsupported claim rate, citation accuracy, disposition consistencyPrevents speed from degrading investigations
Risk coveragetypology/scenario coverage, high-risk queue timeliness, repeat alert escalationPrevents false-positive reduction from creating blind spots
Adoptionactive analyst usage, accepted evidence cards, copilot edit distance, override reasonsMeasures real workflow fit, not demo usage
Human oversightrecommendation acceptance by risk band, disagreement rate, escalation qualityDetects automation bias and low-value AI
Model healthdrift, retrieval precision, entity false merge rate, eval pass rateManages AI system risk
Control healthaudit event completeness, SoD violation rate, stale CDD rate, data quality exceptionsMakes examiner/audit replay possible

Bad north star:

Reduce alerts by 70%.

Better north star:

Increase risk-adjusted investigation capacity while preserving evidence completeness,
timely escalation, SAR decision ownership and audit replay quality.

14. Anti-Patterns

Anti-patternWhy it failsBetter design
Chatbot over case managementNo state, no evidence ledger, no approvalsWorkbench with workflow and traceability
Score-only queueAnalysts cannot explain routingDriver-based priority with cited evidence
Entity graph without confidenceFalse links contaminate investigationsConfidence-banded edges and source visibility
SAR writer firstAutomates the most sensitive output before evidence controlsEvidence workspace and QA before narrative assist
Auto-close low-score alertsCreates hidden false-negative riskHuman-owned closure, sampling and coverage review
Train on all analyst outcomesLearns noise, staffing shortcuts and incentive biasQA-reviewed feedback taxonomy
Model validation limited to AUCIgnores workflow, retrieval, graph and human oversightAI system validation
Productivity-only adoptionEncourages rubber-stamp behaviorBalanced productivity, quality, risk and control metrics
Audit log as afterthoughtCannot replay decisions under challengeEvent schema designed before launch

15. PM / Architect Implications

15.1 PM Questions

QuestionStrong answer
What is the product outcome?Risk-adjusted investigation capacity with evidence quality and human decision ownership
What is out of scope?SAR filing decision, sanctions disposition, customer notification and law-enforcement disclosure
What does adoption mean?Analysts use evidence cards, challenge AI, edit drafts, reduce search time and improve QA outcomes
How do we avoid automation bias?Evidence-first UI, non-default recommendations, disagreement prompts, QA sampling and training
How do we prove value?Time saved plus QA quality, timeliness, coverage and audit completeness

15.2 Architect Questions

QuestionStrong answer
Where does truth live?In source records and evidence ledger, not in LLM summaries
How is context assembled?Entity-resolved graph, timeline, CDD profile and permission-filtered retrieval
How is SAR confidentiality protected?Access classes, retrieval filters, event logs, need-to-know roles and controlled handoff
How are changes governed?Versioned models/prompts/rules, release gates, regression eval and approval evidence
How can audit replay a case?Reconstruct alert, evidence, model versions, copilot output, human actions and approvals

16. Interview Expression

30 秒版本:

AML alert workbench 的核心不是让 AI 判断是否报 SAR, 而是把 alert queue、实体解析、交易图谱、CDD context、证据包、analyst copilot、处置建议、QA、调参反馈和审计轨迹做成一个 human-owned control system。AI 可以辅助排序、汇总、查缺口和起草, 但 SAR decision、filing、sanctions disposition 和客户动作必须保留在人类授权流程中。

2 分钟版本:

我会按 alert-to-evidence-to-case-to-disposition-to-feedback 设计。第一层是数据和 identity: 交易、CDD、EDD、case history、sanctions/fraud referral 进入带 lineage 和 access class 的 evidence ledger, entity resolution 输出带 confidence 的客户/账户/对手方图谱。第二层是 decision support: queue prioritization 不是单分数, 而是 risk band、SLA、route、top drivers、missing evidence。第三层是 analyst workspace: timeline、graph、evidence cards、copilot summary、gap checklist 和 disposition options, 每个事实都要引用证据。第四层是 controls: no auto-SAR, maker-checker, SAR-sensitive access, QA sampling, tuning change gate, model risk validation 和 audit replay。这样能同时提升产能、质量和监管可解释性, 不把 AI 变成隐藏的合规决策者。

可能追问:

追问回答要点
如何降低误报?先分解 false-positive drivers, 用 CDD context、entity graph、scenario tuning 和 evidence completeness 改善精度; 不用简单 suppress alerts。
如何防止漏报?保留 typology/scenario coverage matrix、false-negative proxy、QA sample、repeat alert review 和 coverage regression。
如何处理 SAR draft?AI 只生成有引用的 draft packet, 人类决定 filing; unsupported facts、criminal conclusion、SAR confidentiality 和 filing handoff 有硬护栏。
如何做模型验证?验证 full AI system: ranking、entity resolution、RAG、LLM、workflow、HITL、QA judge、monitoring 和 change management。

AssetHow to use
docs/AI_FINANCIAL_CRIME_TYPOLOGY_SCENARIO_COVERAGE_PLAYBOOK.md本文只链接 typology coverage, 不重复类型学和 SAR narrative 设计。
docs/AML_COPILOT_PRD.md可作为 one-page product framing 和原型级 MVP 边界。
docs/AML_GOVERNANCE_MAP.md可作为当前 repo 中 AML copilot 控制映射的治理补充。
docs/AI_HUMAN_OVERSIGHT_HITL_PLAYBOOK.md用于深化 human-owned decision、override、escalation 和 stop path。
docs/AI_MODEL_RISK_MANAGEMENT_PLAYBOOK.md用于深化 AI system inventory、validation、monitoring 和 revalidation trigger。
docs/AI_SEGREGATION_OF_DUTIES_DUAL_CONTROL_PLAYBOOK.md用于深化 maker-checker、dual control 和 incompatible duty matrix。