返回 Papers
AI 扩展计划 / Playbooks

AI Architecture Views / C4 / arc42 / 42010 Playbook

以下来源作为术语和方法锚点. 本文是学习, 作品集和架构沟通训练材料, 不构成认证, 合规, 审计或法律意见. 正式项目需要按机构政策, 监管地区, 数据类型, 供应商合同和模型风险要求复核.

740AI_ARCHITECTURE_VIEWS_C4_ARC42_42010_PLAYBOOK.md

AI Architecture Views C4 / arc42 / ISO 42010 Playbook

目的: 训练 AI Product / Enterprise Architect / Solution Architect / AI BA 把 AI 系统从一张大图升级为可沟通, 可评审, 可发布, 可审计, 可作品集化的架构视图体系. 适用对象: 已有 10 年金融零售 PM / BA / Developer 经验, 且已具备 CBAP 能力的人. 本文不讲需求和架构基础, 重点是 AI 产品, 企业架构, 解决方案架构, 架构沟通和作品集表达. 核心观点: C4 解决结构表达, arc42 解决架构文档叙事, ISO 42010 解决 stakeholder / concern / viewpoint / view 的治理语言. AI 架构要把 model, RAG, tool, eval, policy, observability, evidence, human accountability 和 release gate 放进同一套视图系统.


Source Anchors

以下来源作为术语和方法锚点. 本文是学习, 作品集和架构沟通训练材料, 不构成认证, 合规, 审计或法律意见. 正式项目需要按机构政策, 监管地区, 数据类型, 供应商合同和模型风险要求复核.

AnchorLink本文使用方式
C4 Modelhttps://c4model.com/用 Context, Container, Component, Code 的分层结构表达 AI 系统边界, 运行单元, 关键组件和实现责任.
arc42https://arc42.org/用 arc42 的架构文档结构组织目标, 约束, 上下文, building blocks, runtime, deployment, cross-cutting concepts, ADR, quality 和 risk.
ISO/IEC/IEEE 42010https://www.iso.org/standard/74393.html用 stakeholder, concern, architecture viewpoint, architecture view, correspondence 和 rationale 思路建立可治理的视图体系.
NIST AI RMFhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern, Map, Measure, Manage 的风险管理语言把 AI risk, eval, monitoring, evidence 和 improvement 接入架构视图和 release gate.

One-Sentence Positioning

AI Architecture Views = 用 ISO 42010 定义谁关心什么, 用 C4 展示 AI 系统如何组成和交互, 用 arc42 写清楚架构决策和质量属性, 再用 NIST AI RMF 风险语言把 eval, policy, observability, evidence 和 release gate 变成架构交付物.

更短的面试表达:

我不会只画一张 AI 大图. 我会为不同 stakeholder 建立 view package: 业务价值看 capability view, CTO 看 C4 container 和 integration view, risk 看 policy / eval / evidence view, delivery team 看 component / runtime / deployment view, audit 看 42010-style stakeholder-concern traceability.


1. 为什么 AI 架构表达不能只有一张大图

AI 项目进入企业生产后, 最大问题通常不是图不够漂亮, 而是同一张图承担了过多职责:

一张大图想表达的内容实际问题更好的视图安排
业务流程, 模型调用, RAG, 工具, 审批, 监控全部放一起读者无法判断哪些是业务边界, 哪些是技术边界, 哪些是控制边界用 capability / workflow view 表达业务, 用 C4 表达结构, 用 control view 表达风险控制
只显示 app, LLM, vector DB, API看不到数据权限, 知识来源, 工具动作, 人工责任和证据增加 data / knowledge view, tool / action view, human accountability view, evidence view
用一张 sequence diagram 说明所有事情无法回答架构评审中的质量属性, 失败模式和 release gate单独建立 runtime view, quality view, failure mode view 和 gate view
把治理写成旁注Risk, compliance, audit 无法基于图提出有效挑战用 ISO 42010 的 concern traceability 连接 stakeholder, concern, viewpoint, evidence
把模型当成普通外部服务忽略模型版本, prompt, eval, drift, hallucination, data leakage 和 judge calibration在 C4 和 arc42 中加入 model, prompt, eval, monitoring, policy 的专门视图

企业 AI 架构表达要同时服务五类沟通:

沟通对象他们真正关心的问题推荐视图
Business sponsorAI 是否嵌入真实业务流程, 价值如何实现, 人类责任是否清楚Capability view, workflow view, human accountability view
Product / BA用户旅程, 数据输入, 输出责任, fallback, 采用指标Experience view, process view, outcome metric view
CTO / Enterprise Architect系统边界, 平台复用, 集成方式, 技术债, 演进路线C4 context / container, integration view, roadmap view
Solution / Delivery team组件职责, API, runtime sequence, deployment, SLO, failure handlingC4 component, runtime view, deployment view, observability view
Risk / Compliance / Audit风险等级, 控制, 审批, eval 证据, 运行监控, 例外处理Policy view, eval gate view, evidence view, 42010 traceability

关键转变:

从: 画一个 AI solution diagram
到: 设计一组 architecture viewpoints, 每个 view 回答明确 stakeholder concern, 并指向证据和决策.

2. Viewpoint Taxonomy

Viewpoint 是看架构的规则: 为谁看, 看什么 concern, 用什么模型, 产出什么 evidence, 用于什么决策. View 是按这个规则画出来或写出来的具体架构内容.

2.1 AI Architecture Viewpoint Catalog

ViewpointPrimary stakeholdersConcernsModel kindsEvidence outputs
Business Capability ViewBusiness sponsor, Enterprise Architect, Product哪些业务能力被 AI 增强, 哪些能力需要平台支撑Capability map, value stream, heatmapcapability-to-use-case map, investment rationale
Experience / Human Accountability ViewProduct, BA, Ops, ComplianceAI 在哪个用户旅程节点介入, 人类是否保留最终责任journey map, swimlane, responsibility matrixhuman-in-the-loop rule, user disclosure, escalation design
C4 System Context ViewCTO, EA, Security, ProductAI 系统边界, 用户, 外部系统, 供应商, 数据边界C4 context diagramsystem boundary, external dependency list
C4 Container ViewSolution Architect, Platform, Securityapp, orchestration, model gateway, RAG, tool gateway, eval, observability 如何部署和交互C4 container diagramcontainer responsibilities, integration contracts
C4 Component ViewEngineering, QA, SRE关键容器内部组件如何分工component diagram, interface tablecomponent ownership, API and event contracts
Data / Knowledge / RAG ViewData owner, Knowledge owner, Security, Model Risk知识源, 权限, freshness, retrieval, citation, poisoning risklineage diagram, RAG pipeline, source registrysource card, retrieval eval, entitlement test
Model / Prompt / Policy ViewAI Architect, Model Risk, Platform模型路线, prompt 版本, policy-as-code, risk-tiered routingmodel route diagram, prompt lifecycle, policy matrixmodel card, prompt record, policy decision log
Tool / Action ViewSolution Architect, Security, Operations, ComplianceAgent 可以调用什么工具, 动作风险, 幂等, 审批, 回滚tool catalog, action tier map, sequence diagramtool card, approval record, action ledger
Runtime / Failure Mode ViewDelivery, SRE, Operations正常路径, fallback, timeout, degradation, kill switchsequence, state machine, fault treerunbook, incident drill, rollback condition
Eval / Release Gate ViewProduct, Model Risk, QA, Compliance如何证明当前版本适合用途, 何时能 pilot / release / scaleeval contract, gate checklist, scorecardeval report, release memo, risk acceptance
Observability / Evidence ViewSRE, Audit, Model Risk, Management生产请求如何追踪, 质量如何监控, 审计证据如何形成trace schema, dashboard, evidence binder maptrace sample, dashboard link, evidence index
Portfolio / Roadmap ViewCTO, PMO, AI Transformation Lead哪些 AI 用例共享平台能力, 哪些控制复用, 投资优先级如何定portfolio matrix, dependency map, maturity radarroadmap, reuse metrics, risk exception summary

2.2 View Package By Architecture Question

Architecture questionMinimum view package
这个 AI use case 值得做吗?Capability view, value hypothesis, risk tier, adoption metric view
这个系统边界清楚吗?C4 context, external dependency list, data boundary view
这个方案能否上线给客户使用?Experience view, RAG view, policy view, eval gate view, monitoring view
这个 Agent 是否能安全调用工具?Tool / action view, policy matrix, runtime sequence, approval and action ledger
这个系统是否可运维?Container view, deployment view, SLO, observability view, incident runbook
这个 release 是否可审计?Stakeholder-concern matrix, eval report, release memo, evidence binder
这个项目能否成为作品集?C4 pack, arc42 summary, decision matrix, case narrative, portfolio evidence pack

2.3 AI Viewpoint 设计原则

Principle架构含义
Concern first先定义 stakeholder concern, 再决定画什么图.
Separate structure from controlC4 讲结构, control view 讲允许, 拒绝, 审批, 降级和证据.
Treat data and knowledge as architectureRAG 不是实现细节, 它影响权限, 时效, 引用, 责任和审计.
Treat eval as release architectureEval 不是 QA 附件, 而是 release gate 和风险接受的核心证据.
Keep human accountability explicit所有高风险 AI 输出都要说明谁 review, 谁 approve, 谁最终负责.
Make observability evidence-readytrace 不只是 debug, 还要服务模型风险, audit, incident 和 management review.
Portfolio over single use case企业架构价值来自复用 model gateway, RAG, eval, policy, tool, observability 和 evidence.

3. C4 For AI Systems

C4 的优势是让复杂系统从高层到低层逐步展开. 对 AI 系统而言, 关键不是把 LLM 画成一个外部 box, 而是把 AI-specific responsibilities 放在正确层级.

3.1 C4 Level 1: System Context

Context view 回答: 这个 AI 系统存在于哪个业务环境中, 与哪些人, 系统, 供应商和控制边界交互.

AI context view 必须显示:

ElementAI-specific content
Human userscustomer, employee, analyst, reviewer, approver, auditor
Business systemsCRM, core banking, LOS, case management, payment, product catalog, knowledge base
AI platform servicesmodel gateway, RAG service, eval service, policy engine, observability, evidence binder
External providersLLM provider, embedding provider, monitoring vendor, data provider
Trust boundarycustomer data boundary, vendor boundary, region boundary, regulated process boundary
Human accountabilityfinal decision owner, review owner, escalation path

Context view 不能只写 "User -> AI App -> LLM". 对金融零售, 最小上下文应该能回答:

  1. 这个 AI 输出是客户可见, 员工内部, 决策支持, 还是工具动作?
  2. 哪些客户数据和业务系统进入上下文?
  3. 哪些第三方模型或供应商可能接触数据?
  4. 输出是否影响客户权益, 资金, 合规报告或投诉处理?
  5. 哪个角色保留最终业务责任?

3.2 C4 Level 2: Container

Container view 回答: 系统由哪些可部署或可运行单元组成, 它们如何交互.

AI container view 推荐包含:

ContainerResponsibilityKey controls
AI-enabled application用户体验, workflow entry, feedback, disclosurerole-based access, human review UX, fallback
Orchestration serviceprompt assembly, workflow state, agent step, retriesstep budget, deterministic wrapper, state limit
Model gatewaymodel route, version pinning, quota, fallback, costallowlist, data boundary, audit log
RAG serviceingestion, retrieval, reranking, citation, index versionentitlement, freshness, source owner, retrieval eval
Tool gatewaytool schema, auth, dry run, execution, result wrappingaction tier, approval, idempotency, action ledger
Policy engineruntime decisions, data and action rules, exception handlingpolicy-as-code, version, decision log
Eval serviceoffline eval, regression, red-team, release scorecardthresholds, critical failures, judge calibration
Observability pipelinetraces, metrics, logs, cost, quality signalsmasking, retention, trace completeness
Evidence binderrelease evidence, approvals, exceptions, monitoring linksimmutable index, owner signoff, audit export

Container view 中最常见的遗漏:

Omitted containerConsequence
Policy engine规则散落在 prompt 和 app code, 无法复用或审计.
Eval servicerelease gate 靠主观 UAT, 版本变更后无法回归.
Tool gatewayAgent 直接拿业务系统权限, 动作不可控.
Evidence binder审计时补材料, 无法证明版本和批准链.
Observability pipeline出错时只能查应用日志, 看不到 model / retrieval / tool / policy path.

3.3 C4 Level 3: Component

Component view 回答: 某个 container 内部有哪些组件, 每个组件承担什么职责.

以 Customer-facing RAG container 为例:

ComponentResponsibilityAI architecture concern
Source registry adapter读取已批准知识源, owner, effective date, audiencesource authority, lifecycle, audit
Ingestion pipelineparse, chunk, classify, embed, indexchunking policy, PII handling, index version
Entitlement filter按用户角色, 产品线, 地区, 文档状态过滤permission-before-retrieval
Hybrid retrieverkeyword + vector + metadata retrievalrecall, freshness, query intent
Reranker对候选片段重排relevance, latency, model route
Citation builder将答案绑定到 source ids 和段落groundedness, evidence
No-answer classifier判断没有足够证据时拒答或升级unsupported claim prevention
RAG eval hook记录 query, retrieved docs, answer, citation, scoreeval and monitoring feedback

以 Tool gateway container 为例:

ComponentResponsibilityAI architecture concern
Tool catalog resolver根据 workflow 和 role 返回可用工具least privilege, allowed use
Schema validator验证模型提出的 tool argumentsdeterministic control
Policy decision client请求 policy engine 判断 allow / review / denyruntime control
Dry-run executor对 write action 生成预览和影响分析human review quality
Idempotency manager防止重复执行同一业务动作operational safety
Approval router将高风险动作送人工或双控human accountability
Action ledger writer记录请求, 参数, 策略, 审批, 执行结果audit and incident replay

3.4 C4 Level 4: Code

Code view 不需要成为每个 AI 项目的默认交付物. 只有当关键风险集中在实现细节时才深入到代码层:

Code-level concern需要 Code view 的原因
Tool argument validation小错误可能导致错误退款, 错误状态更新或越权查询.
Policy decision wrapper必须证明所有高风险动作都调用同一 control point.
PII redaction and logging需要验证敏感字段不会进入 prompt, logs 或外部供应商.
Eval gate implementationCI / CD 中 gate 的阻断条件必须可测试.
Kill switch and fallback需要证明关停按 use case, model, tool, tenant 或 route 生效.

Code view 的作品集表达不要贴大段代码. 更好的方式是:

  1. 展示关键 wrapper 或 policy check 的伪代码.
  2. 说明它保护哪个 architecture concern.
  3. 连接到 test evidence, trace sample 或 release gate.

3.5 C4 Checklist For AI Systems

# C4 Checklist For AI Systems

## Context
- system_boundary_defined: AI system, AI platform, business systems and external providers are separated.
- user_roles_defined: customer, employee, reviewer, approver, operator and auditor roles are explicit.
- data_boundary_defined: customer data, confidential knowledge and vendor boundary are visible.
- human_accountability_defined: final decision owner and escalation path are shown.
- regulated_process_identified: credit, AML, complaints, payments, KYC or customer commitment boundaries are marked.

## Container
- model_gateway_present: application does not call models directly for production use.
- rag_service_present_if_needed: knowledge retrieval, citation and source lifecycle are explicit.
- tool_gateway_present_if_agentic: tool execution is mediated by schema, policy, approval and action ledger.
- policy_engine_present: data, action and workflow rules are runtime controls, not prompt-only text.
- eval_service_present: release quality is measured by use-case eval and regression, not demo-only review.
- observability_present: model, retrieval, tool, policy, approval and output traces are captured.
- evidence_binder_present: release, approval, exception and monitoring evidence are linked.

## Component
- component_ownership_defined: each critical AI component has business, technical or control owner.
- versioned_assets_defined: model, prompt, index, tool schema, policy and eval set are versioned.
- failure_modes_defined: fallback, no-answer, timeout, deny, degrade and kill switch are handled.
- data_minimization_defined: prompt and logs contain only necessary context.
- feedback_loop_defined: production feedback can update eval set, knowledge source, policy or prompt.

## Code
- high_risk_wrappers_tested: policy, tool execution, redaction, logging and release gate code paths are covered.
- deterministic_checks_present: model output is validated before business action.
- audit_fields_written: trace_id, use_case_id, risk_tier, model_version, prompt_version and policy_decision are recorded.

4. arc42 For AI Systems

arc42 的价值是把架构从图转成完整叙事. 对 AI 系统, arc42 不只是文档模板, 而是帮助架构师把 product intent, constraints, risk, building blocks, runtime behavior, deployment, quality and decisions 串成可评审资产.

4.1 arc42 Section Map For AI

arc42 sectionAI architecture interpretationRequired AI content
1. Introduction and Goals为什么做这个 AI 系统, 成功如何度量business outcome, user workflow, AI role, adoption metric, non-goals
2. Architecture Constraints监管, 数据, 供应商, 安全, 平台, 组织约束data residency, model allowlist, policy, procurement, risk tier, human oversight
3. Context and Scope系统边界和外部依赖C4 context, actors, business systems, AI platform services, vendor boundary
4. Solution Strategy核心架构路线和取舍RAG vs fine-tune, agent vs workflow, gateway vs direct API, HITL strategy
5. Building Block View结构分解C4 container / component, model gateway, RAG, tool, eval, policy, evidence
6. Runtime View关键场景如何运行happy path, no-answer, tool approval, policy deny, fallback, incident sequence
7. Deployment View部署和运行环境environments, model route, network boundary, secrets, logging, regional constraints
8. Cross-cutting Concepts横切设计原则identity, data classification, prompt versioning, observability, eval, security, evidence
9. Architecture Decisions关键 ADRmodel selection, RAG architecture, tool gateway, eval threshold, release gate
10. Quality Requirements质量属性和场景accuracy, groundedness, safety, latency, cost, availability, explainability, auditability
11. Risks and Technical Debt风险, 例外, 技术债hallucination, leakage, stale knowledge, tool misuse, judge drift, vendor lock-in
12. Glossary统一语言use case, risk tier, model route, prompt, index, citation, trace, evidence binder

4.2 arc42 AI 写作重点

Section不要只写应该写成
Goals提升效率, 提升体验降低客服平均处理时长 15%, 但客户承诺类回复必须人工确认.
Constraints使用公司 AI 平台必须通过 model gateway, RAG source 需 owner 批准, 高风险 release 需 eval gate.
Solution strategy用 RAG 回答问题用 governed RAG: source registry, entitlement filter, citation, no-answer, eval and monitoring.
Runtime view用户提问后模型回答展示 retrieval, policy, model, citation, human review, fallback and trace.
Quality准确率高unsupported customer-facing claim = 0, citation correctness >= 95%, stale policy answer = 0.
Risks模型可能出错对每个高影响 failure mode 给出 detection, control, owner and release evidence.

4.3 AI Quality Scenarios

质量属性要写成可评审场景, 而不是抽象形容词.

Quality attributeScenarioGate evidence
Groundedness当客户询问费用政策时, 回答必须引用当前有效政策源, 不得生成没有来源的承诺citation eval, no unsupported claim failures
Permission safety员工只能检索其角色, 地区和产品线允许访问的知识entitlement test, retrieval trace
Human accountability涉及退款, 信贷, AML 或投诉升级的输出只能作为草稿或建议workflow control, approval record
Tool safetyAgent 提出 write action 时必须先做 schema validation 和 policy decisiontool gateway test, action ledger
Release confidence模型, prompt, index 或 policy 变更后必须跑 regression seteval report, release gate memo
Operational resilience模型供应商不可用时系统进入人工或规则路径, 不隐藏失败fallback drill, incident runbook
Auditability任一客户可见输出可以追溯到用户, source, model, prompt, policy 和 reviewertrace sample, evidence binder
Cost control每个 accepted answer 或 completed case 有成本归因cost dashboard, route-level metrics

4.4 ADR Patterns For AI Architecture

DecisionTypical optionsAI-specific rationale
Model routepublic API, private hosted, hybrid by risk tier平衡能力, 数据边界, 成本, latency, vendor risk
RAG strategyper-app index, shared RAG service, federated knowledge plane平衡复用, 权限, freshness, domain autonomy
Agent controlautonomous agent, workflow-constrained agent, copilot draft only按动作风险和可逆性决定自动化程度
Tool executionapp direct API, model-held token, tool gateway工具动作必须可审批, 可审计, 可回放
Eval gatemanual UAT, generic benchmark, use-case eval contract金融零售需要场景化 eval, critical failure 和 slice analysis
Evidencestatic document folder, runtime evidence binder证据应从 trace, approval, eval and monitoring 自动沉淀

5. ISO 42010 Stakeholders / Concerns / Views

ISO 42010 的实用价值不是让文档更学术, 而是让架构沟通从 "我画了几张图" 变成 "我知道谁关心什么, 我用哪种 viewpoint 回答, 这些 view 之间如何保持一致".

5.1 Core Mapping

42010 conceptAI architecture usage
Stakeholder对 AI system 有利益, 决策, 风险或运营责任的人或组织.
Concernstakeholder 关心的系统属性, 如安全, 成本, 解释性, 审计, 可用性, 客户影响.
Architecture viewpoint规定如何表达某类 concern 的视角, 包括模型类型, 规则和证据.
Architecture view按 viewpoint 产生的具体图, 表, 文档或 dashboard.
Model kindC4 diagram, sequence, BPMN, capability map, eval scorecard, policy matrix, trace schema 等.
Correspondence不同 view 之间的对应关系, 如 container 中的 RAG service 必须对应 evidence view 中的 retrieval trace.
Rationale为什么这样设计, 取舍是什么, 通过 ADR, release memo 或 decision matrix 记录.

5.2 Stakeholder Concern Matrix

StakeholderConcernsRequired viewpointsEvidence
Business sponsor业务价值, adoption, 风险接受, 投入产出capability view, outcome metric view, portfolio viewvalue hypothesis, adoption dashboard, roadmap decision
Product Manager用户旅程, AI role, fallback, release scopeexperience view, workflow view, release gate viewPRD slice, workflow map, release memo
Business Analyst流程, 规则, 数据输入输出, 人机分工process view, data view, human accountability viewswimlane, business rule catalog, RACI
Enterprise Architect业务能力, 平台复用, integration, 技术债capability view, C4 context / container, roadmap viewarchitecture canvas, dependency map, ADR
Solution Architect组件职责, runtime, deployment, failure modesC4 component, sequence, deployment, observability viewAPI contracts, runbook, trace sample
AI Platform Ownermodel, RAG, tool, eval, policy, observability 复用platform service view, catalog view, golden path viewservice catalog, reuse metrics, platform SLO
Data Owner数据用途, 权限, freshness, lineage, retentiondata / knowledge view, RAG viewsource card, entitlement test, lineage record
Security身份, secrets, DLP, egress, least privilege, incidentsecurity view, threat model, policy viewcontrol test, DLP report, incident drill
Model Risk适用性, eval, drift, versioning, monitoringmodel view, eval view, monitoring viewmodel card, eval report, review cadence
Compliance / Legal客户影响, disclosure, prohibited use, recordkeepingpolicy view, human accountability view, evidence viewpolicy decision log, approval record, evidence binder
SRE / OperationsSLO, failure, fallback, cost, supportdeployment view, runtime view, observability viewdashboard, alert rule, runbook
Audit追溯性, 控制有效性, 例外管理evidence view, correspondence matrixevidence index, sample trace, exception record

5.3 View Correspondence Rules

不同 view 不能各说各话. 建议在架构包中维护 correspondence rules:

RuleWhy it matters
Every AI use case in capability view must have an owner, risk tier and release status防止作品集和治理清单脱节.
Every external model provider in context view must appear in model route and data boundary view防止供应商和数据外发风险被漏掉.
Every RAG source in data view must have source owner, freshness rule and entitlement rule防止知识源过期或越权.
Every tool in tool view must have action tier, schema, policy decision and audit record防止 agentic action 不可控.
Every high-risk workflow step must appear in human accountability view and release gate防止人工复核只是口头承诺.
Every release gate metric must map to quality requirement and production monitoring signal防止上线前 eval 与上线后监控脱节.
Every critical container must emit trace fields used by evidence view防止审计证据依赖人工补录.
Every ADR must link to affected views and follow-up monitoring防止决策停留在文档中而不进入运行管理.

5.4 Viewpoint Description Template

# Architecture Viewpoint

## Identity
- viewpoint_id: AI-VP-EVAL-RELEASE-GATE
- viewpoint_name: Eval and Release Gate Viewpoint
- owner: AI Architecture / Model Risk
- lifecycle_stage: pilot, release, scale, major change

## Stakeholders
- Product Manager
- Model Risk
- Compliance
- Solution Architect
- Operations

## Concerns Addressed
- whether the AI system is fit for intended use
- whether critical failures are blocked before release
- whether model, prompt, RAG index, tool schema and policy changes are regression-tested
- whether residual risk is accepted by the correct owner

## Model Kinds
- eval contract
- scorecard
- critical failure table
- slice analysis
- release gate memo
- production monitoring mapping

## Required Inputs
- use_case_id and risk_tier
- intended use and prohibited use
- model_version, prompt_version, index_version, tool_schema_version and policy_version
- golden set, regression set and red-team set
- quality thresholds and critical failure definitions

## View Outputs
- eval result summary
- gate decision: go, limited go, no-go, exception
- risk acceptance and approver
- monitoring signals and review cadence
- evidence binder entry

## Correspondence Rules
- every release metric maps to a quality requirement
- every critical failure maps to a policy, eval or human control
- every limited go decision maps to scope restriction and monitoring
- every exception has owner, expiry and compensating control

6. Financial Retail Case: Customer-Facing RAG / Agent

6.1 Case Background

一家零售银行希望在 contact center console 中加入 AI assistant. 它有两个能力:

  1. Customer-facing RAG: 根据产品条款, 费用政策, 投诉处理手册和客户当前 case 生成带引用的回复草稿.
  2. Agent workflow: 在员工确认后, 可以创建 follow-up task, 更新 case note, 或发起退款审批草稿. 不能自动退款, 不能自动关闭投诉, 不能做信贷或法律结论.

业务目标:

GoalTarget
降低客服平均处理时长pilot 阶段降低 10%-15%
提高政策答复一致性高风险政策问题 citation correctness >= 95%
保留人类责任客户可见回复必须由客服确认, 退款和承诺类语句进入审批
建立可审计 release所有 pilot 输出有 trace, eval result, policy decision and evidence binder entry

6.2 View Package

ViewPurposeKey artifact
Capability view说明 AI 增强哪些客服能力, 哪些能力仍由人类或规则系统负责service capability heatmap
Experience / human accountability view说明客服, supervisor, compliance reviewer 的责任边界swimlane and RACI
C4 context view展示 contact center, CRM, policy knowledge, AI platform, model provider, evidence bindercontext diagram
C4 container view展示 app, orchestration, RAG, model gateway, tool gateway, policy engine, eval, observabilitycontainer diagram
RAG view展示 source registry, ingestion, entitlement, retrieval, citation, no-answerRAG pipeline
Tool / action view展示 case note, task creation, refund approval draft 的 action tiertool catalog and action tier matrix
Runtime view展示 answer draft, policy denial, refund approval, fallbacksequence diagrams
Eval / release gate view展示 golden set, red-team, thresholds, critical failuresrelease gate memo
Observability / evidence view展示 trace schema, dashboard, evidence bindertrace and evidence map

6.3 C4 Context Sketch

Customer
  -> Contact Center Agent
  -> Contact Center Console with AI Assistant

Contact Center Console with AI Assistant
  -> CRM / Case Management
  -> Product and Fee Policy Knowledge Base
  -> AI Platform Control Plane
  -> Approved Model Provider
  -> Evidence Binder

AI Platform Control Plane
  -> Model Gateway
  -> RAG Service
  -> Tool Gateway
  -> Policy Engine
  -> Eval Service
  -> Observability Pipeline

Key context decisions:

DecisionRationale
AI assistant is employee-facing, not directly customer-facing保留客服对客户回复的最终责任, 降低误导和承诺风险.
Policy knowledge is retrieved through governed RAG保证 source owner, entitlement, freshness and citation.
Write actions go through tool gateway防止模型直接更新 CRM 或触发退款流程.
Release requires eval and evidence binder客户可见输出需要可追溯质量证据.

6.4 Container View Summary

ContainerResponsibilityFinancial retail controls
Contact center console展示 AI draft, citation, risk flag, edit and approve actionsrole access, supervisor escalation, customer disclosure rules
AI orchestration service组装 case context, route RAG query, call model, propose actionsstep budget, prompt version, fallback
RAG service检索有效政策和产品条款, 返回引用source owner, freshness block, entitlement filter
Model gateway调用 approved model routemodel allowlist, data boundary, cost and version log
Tool gateway执行 case note, task creation, refund approval draftschema validation, action tier, approval, idempotency
Policy engine判断回复内容和工具动作是否允许complaint escalation, refund threshold, legal phrase block
Eval service运行 RAG, tone, policy and tool trajectory evalcritical failure gate, regression
Observability pipeline收集 trace, quality, latency, cost, override, complaint signalsmasked logs, retention, dashboard
Evidence binder汇聚 architecture review, eval, approval, release and monitoringrelease evidence, audit export

6.5 Runtime Sequence: Policy Answer Draft

Agent opens authenticated customer case
  -> Console sends use_case_id, agent role, customer scope, case id to orchestration
  -> Orchestration requests policy decision for context assembly
  -> RAG service retrieves current policies with entitlement and freshness filters
  -> Model gateway generates answer draft with source citations
  -> Policy engine checks prohibited claims, complaint escalation and refund language
  -> Console shows draft, citations, risk flags and edit controls
  -> Agent edits and confirms response
  -> Trace and final response metadata enter observability and evidence binder

6.6 Runtime Sequence: Refund Approval Draft

Agent asks assistant to help prepare refund request
  -> Orchestration classifies requested action as customer-impacting write
  -> Tool gateway validates refund draft schema and performs dry run
  -> Policy engine requires supervisor approval based on amount and complaint status
  -> Model drafts rationale with policy citations and customer case facts
  -> Supervisor reviews amount, rationale, source citations and policy decision
  -> Approved draft is sent to refund workflow, not executed directly by the model
  -> Action ledger stores request, dry run, approval and execution reference

6.7 Release Gate

Gate areaMinimum gate
RAG groundednesscitation correctness >= 95%, unsupported claim critical failures = 0
Knowledge freshnessexpired or superseded policy citation critical failures = 0
Permissionrestricted document retrieval critical failures = 0
Complaint handlingmandatory escalation cases correctly flagged >= 99%
Tool actionwrite action without policy decision or approval critical failures = 0
Human accountabilityall customer-visible responses have agent confirmation trace
Monitoring readinessdashboard shows quality, latency, cost, override, complaint and incident signals
Evidence completenessarchitecture review, eval report, approvals, exception records and monitoring plan linked

6.8 Portfolio Evidence For This Case

Evidence artifactWhat it proves
Stakeholder-concern matrix你能从 BA/CBAP 能力升级到 architecture viewpoint design.
C4 context and container diagrams你能讲清系统边界, AI platform services, 供应商和数据边界.
RAG and tool action views你能把 AI-specific risks 结构化, 不只停留在 app + LLM.
Eval release memo你能把质量, 风险和 release decision 连接起来.
Evidence binder map你能设计审计可追溯性, 而不是上线后补材料.
Interview narrative你能面向 PM, BA, Architect, CTO, Risk 讲同一套方案的不同版本.

7. Artifact Templates

7.1 Stakeholder-Concern Matrix

# Stakeholder-Concern Matrix

## Use Case
- use_case_id: CS-RAG-AGENT-001
- use_case_name: Contact Center RAG and Agent Assistant
- business_domain: Retail Banking Customer Service
- risk_tier: High for customer-facing drafts and customer-impacting write actions
- release_scope: limited pilot for authenticated servicing calls

| Stakeholder | Decision rights | Concerns | Required views | Evidence |
|---|---|---|---|---|
| Business Sponsor | approve value and pilot scope | AHT reduction, CSAT, customer risk | capability view, adoption view | pilot metric dashboard |
| Product Owner | define workflow and release scope | user journey, fallback, feedback | experience view, runtime view | PRD slice, release memo |
| BA / Process Owner | validate rules and process fit | handoff, exception, business rule | swimlane, rule catalog | process signoff |
| Enterprise Architect | approve architecture fit | platform reuse, integration, tech debt | C4 context, container, roadmap | architecture review record |
| Solution Architect | approve design implementation | components, APIs, failure modes | component view, sequence, deployment | design review notes |
| Data Owner | approve knowledge sources | source authority, freshness, access | RAG view, data lineage | source card, entitlement test |
| Security | approve security controls | identity, DLP, secrets, egress | security view, policy view | security test result |
| Model Risk | approve AI fitness | eval, drift, monitoring | model view, eval gate view | eval report |
| Compliance | approve customer impact controls | disclosure, prohibited claims, recordkeeping | policy view, evidence view | policy decision samples |
| Operations | accept run and support model | SLO, escalation, incident, training | operating view, observability view | runbook, training pack |
| Audit | review traceability | control evidence, exception handling | evidence view, correspondence matrix | evidence binder export |

7.2 Viewpoint Catalog

# AI Architecture Viewpoint Catalog

| viewpoint_id | viewpoint_name | primary_concerns | model_kinds | minimum_evidence |
|---|---|---|---|---|
| AI-VP-001 | Capability and Value View | capability impact, value hypothesis, portfolio fit | capability map, value stream | business outcome metric |
| AI-VP-002 | Experience and Human Accountability View | user journey, AI role, human responsibility | journey, swimlane, RACI | human review rule |
| AI-VP-003 | C4 System Context View | system boundary, external dependency, trust boundary | C4 context | dependency and boundary list |
| AI-VP-004 | C4 Container View | deployable units, platform services, integration | C4 container | container responsibility table |
| AI-VP-005 | C4 Component View | internal responsibilities, APIs, ownership | component diagram, interface table | component owner and contract |
| AI-VP-006 | Data / Knowledge / RAG View | source, lineage, entitlement, freshness, citation | source registry, RAG pipeline | source card and retrieval eval |
| AI-VP-007 | Model / Prompt / Policy View | model route, prompt lifecycle, runtime control | model route, prompt registry, policy matrix | model card and policy log |
| AI-VP-008 | Tool / Action View | tool permission, action risk, approval, audit | tool catalog, action tier, sequence | action ledger sample |
| AI-VP-009 | Eval / Release Gate View | fitness for use, regression, critical failures | eval contract, scorecard, gate memo | eval report and release decision |
| AI-VP-010 | Observability / Evidence View | traceability, monitoring, audit readiness | trace schema, dashboard, evidence map | trace sample and evidence index |

7.3 arc42 Section Map

# arc42 AI Section Map

| arc42 section | Required AI questions | Evidence |
|---|---|---|
| Introduction and Goals | What business workflow changes, who uses AI, what outcomes matter, what is outside scope? | goal statement, adoption metric |
| Constraints | Which data, vendor, regulatory, platform and human oversight constraints bind the design? | constraint list, policy references |
| Context and Scope | Which users, systems, model providers, data sources and platform controls are in scope? | C4 context |
| Solution Strategy | Why this route for RAG, model, agent, tool, eval and release? | decision matrix |
| Building Block View | Which containers and components implement the system and controls? | C4 container / component |
| Runtime View | What happens for happy path, no-answer, policy deny, tool approval and fallback? | sequence diagrams |
| Deployment View | Where does it run, how are secrets, network, logs and model routes controlled? | deployment view |
| Cross-cutting Concepts | How are identity, data classification, prompt, eval, observability and evidence handled? | cross-cutting control table |
| Architecture Decisions | Which tradeoffs were decided and why? | ADR set |
| Quality Requirements | Which measurable quality and risk scenarios define release readiness? | quality scenario table |
| Risks and Technical Debt | Which residual risks remain and who owns them? | risk register and compensating controls |
| Glossary | Which terms must be consistent across business, risk and engineering? | glossary |

7.4 Architecture Review Questions

AreaReview questions
Business fitWhich business capability is improved, and what metric proves adoption or value?
AI roleIs AI generating a draft, recommendation, explanation, classification or business action?
Human accountabilityWho confirms customer-visible outputs and who owns final decisions?
Data boundaryWhat data enters prompt, retrieval, logs and vendor systems?
RAGAre source owner, freshness, entitlement, citation and no-answer behavior defined?
Model routeWhich models are allowed for this risk tier and what happens when model version changes?
PromptAre prompt versions owned, tested and linked to eval results?
Tool actionWhich tools can be called, at what action tier, with which approval and rollback?
PolicyAre controls implemented in runtime policy, or only described in documents and prompts?
EvalDoes eval cover golden cases, regression, red-team, slices and critical failures?
ReleaseWhat conditions allow pilot, scale, restrict or rollback?
ObservabilityCan a production issue be replayed across identity, prompt, retrieval, model, tool, policy and output?
EvidenceCan audit see architecture decision, eval, approval, exception, monitoring and incident evidence in one binder?
PortfolioWhich platform capabilities are reused, and which one-off choices become technical debt?

7.5 Portfolio Evidence Pack

# Portfolio Evidence Pack

## Case Narrative
- domain: retail banking customer service
- problem: policy answer inconsistency, high handling time, weak AI release evidence
- solution: governed customer-service RAG with constrained agent workflow
- audience: AI Product Manager, Enterprise Architect, Solution Architect, AI BA

## Architecture Views
- capability_view: customer servicing capability heatmap
- c4_context: users, systems, AI platform, model provider, evidence binder
- c4_container: application, orchestration, model gateway, RAG, tool gateway, policy, eval, observability
- rag_view: source registry, ingestion, entitlement, retrieval, citation, no-answer
- tool_action_view: action tier, schema validation, dry run, approval, ledger
- runtime_view: policy answer, refund approval, policy deny, fallback
- eval_release_view: eval contract, critical failures, thresholds, release decision
- evidence_view: trace schema, dashboard, evidence binder mapping

## Decisions
- ADR-001: Use governed RAG service rather than per-app vector index
- ADR-002: Use tool gateway rather than direct CRM updates
- ADR-003: Keep AI as employee-facing assistant for pilot
- ADR-004: Require eval release gate for customer-visible drafts
- ADR-005: Link production trace to evidence binder

## Metrics
- value: average handling time, first contact resolution, agent acceptance
- quality: citation correctness, unsupported claim, no-answer correctness
- risk: policy blocks, approval bypass attempts, complaint escalation misses
- operations: latency, cost per accepted answer, fallback rate, incident count
- adoption: weekly active agents, edited vs accepted drafts, feedback reason

## Interview Proof Points
- shows CBAP-level stakeholder and concern analysis
- shows architecture-level C4 and arc42 communication
- shows AI-specific control design across model, RAG, tool, eval and evidence
- shows release governance for financial retail AI

8. Interview Expression

8.1 30 秒版本

我会把 AI 架构表达成一组 views, 而不是一张大图. C4 帮我讲系统结构: context, container, component and code. arc42 帮我写完整架构叙事: goals, constraints, runtime, deployment, quality, ADR and risk. ISO 42010 帮我把 stakeholder, concern, viewpoint and evidence 对齐. 对 AI 系统, 我会额外强调 model, RAG, tool, eval, policy, observability, evidence, human accountability and release gate, 因为这些才决定企业 AI 能否安全上线和持续治理.

8.2 2 分钟版本

我不会从 app + LLM + vector DB 这种图开始. 我会先问 stakeholder concerns: 业务 sponsor 关心价值和采用, CTO 关心平台复用和集成, solution architect 关心组件和运行, risk 关心 eval 和控制, audit 关心证据. 然后为这些 concern 设计 viewpoint catalog.

结构表达我用 C4. Context view 讲用户, 外部系统, AI platform 和供应商边界. Container view 展示 application, orchestration, model gateway, RAG service, tool gateway, policy engine, eval service, observability and evidence binder. Component view 展开 RAG entitlement, citation, no-answer, tool schema validation, approval router 等高风险组件. Code view 只用于 policy wrapper, redaction, tool validation, release gate 这些关键实现点.

文档表达我用 arc42. 它让我把目标, 约束, 上下文, building blocks, runtime, deployment, cross-cutting concepts, ADR, quality and risks 写完整. AI 特别需要把 eval 写成 release architecture, 把 observability 写成 evidence architecture, 把 human accountability 写成流程责任.

治理表达我用 ISO 42010. 每张 view 都要能回答某类 stakeholder concern, 并通过 correspondence rules 保持一致. 比如 container view 里有 RAG service, evidence view 里就必须有 retrieval trace 和 citation evidence. tool view 里有 refund draft tool, policy view 里就必须有 action tier, approval rule and action ledger.

8.3 AI Product / BA 版本

Q: 作为 PM / BA, 你为什么要关心 C4, arc42 和 42010?

A: 因为 AI 产品的风险不只在功能需求里, 还在系统边界, 数据流, 模型行为, 工具动作和上线证据里. PM / BA 如果只写 user story, 很容易漏掉 human accountability, eval gate, policy control and evidence. 我会用 42010 做 stakeholder-concern matrix, 用 C4 把系统边界和组件讲清楚, 用 arc42 把目标, 约束, runtime, quality, ADR 和风险写成可评审材料. 这能把 CBAP 的需求分析能力升级到 AI solution architecture 和 architecture communication.

Q: AI 产品需求和 AI 架构视图如何连接?

A: 每个高风险需求都应该映射到至少一个 architecture concern. 比如 "客服可生成退款说明" 对应 customer impact, tool action, approval, evidence. 它不能只写成一个功能, 还要在 tool view 中定义 action tier, 在 policy view 中定义审批阈值, 在 eval view 中测试错误承诺, 在 observability view 中记录 trace, 在 release memo 中明确 pilot 范围.

8.4 Enterprise / Solution Architect 版本

Q: 你如何避免 AI 架构图变成组件堆砌?

A: 我会先建立 viewpoint taxonomy. C4 只负责结构分层, 不强行承载所有 governance 信息. 对 AI-specific concerns, 我会单独建立 RAG view, model route view, tool action view, policy view, eval gate view, observability view and evidence view. 然后用 42010 correspondence rules 连接这些 views, 确保结构, 控制, 运行和证据一致.

Q: 什么时候 C4 不够用?

A: 当问题从 "系统由什么组成" 转到 "谁关心什么, 哪些控制有效, 哪些证据证明 release readiness" 时, C4 不够. 这时需要 arc42 组织架构叙事和 ADR, 需要 42010 管理 stakeholder concerns and viewpoints, 需要 eval / policy / evidence views 处理 AI 风险. C4 是结构骨架, 不是完整治理体系.

Q: 你如何在金融零售 AI 项目中设计 release gate?

A: 我会从 intended use 和 prohibited use 定义 eval contract. 对 customer-facing RAG, gate 包括 citation correctness, unsupported claim, stale policy, permission leakage, no-answer behavior, complaint escalation. 对 Agent tool action, gate 包括 tool selection, argument correctness, approval bypass, idempotency, action ledger. Gate 结果进入 release memo, 并映射到 production monitoring 和 evidence binder.

8.5 Portfolio Pitch

这个作品集不是展示一张漂亮 AI 架构图, 而是展示我能把金融零售 AI use case 变成可交付的 architecture view package. 我从 stakeholder concerns 出发, 用 C4 表达结构, 用 arc42 表达完整架构叙事, 用 ISO 42010 保证视图和证据可追溯. 在 Customer Service RAG / Agent 案例中, 我展示了业务能力, 人机责任, RAG 权限和引用, tool action 风险, eval release gate, observability trace and evidence binder. 这体现的是 AI Product, BA and Architecture 的组合能力, 而不是单纯会画图.

8.6 自检问题

  1. 是否先定义 stakeholder concerns, 再决定画哪些 views?
  2. C4 context 是否显示用户, 外部系统, AI platform, 供应商和数据边界?
  3. C4 container 是否包含 model gateway, RAG, tool gateway, policy, eval, observability and evidence?
  4. RAG view 是否说明 source owner, entitlement, freshness, citation and no-answer?
  5. Tool view 是否说明 action tier, schema validation, approval, idempotency and ledger?
  6. arc42 是否写清 goals, constraints, runtime, quality, ADR and risk?
  7. ISO 42010 matrix 是否连接 stakeholder, concern, viewpoint, view and evidence?
  8. Eval gate 是否连接 intended use, critical failures, release decision and production monitoring?
  9. Observability 是否能支持 debug, model risk, audit and incident replay?
  10. 作品集是否能面向 PM, BA, CTO, Architect, Risk and Audit 讲不同版本?

Final Principle

AI 架构沟通的成熟标志不是图有多少组件, 而是每个关键 stakeholder concern 都能被一个清晰 view 回答, 每个 view 都能连接到架构决策, 风险控制, release gate 和运行证据.

一句话总结:

C4 gives the structure, arc42 gives the story, ISO 42010 gives the stakeholder logic, and AI architecture maturity comes from connecting all three to model, RAG, tool, eval, policy, observability, evidence and human accountability.