返回 Papers
AI 底层逻辑 / 经典论文

AI Agent Marketplace:工具认证与治理架构

重要说明: 本文是学习、架构训练和作品集材料, 不构成法律意见、合规意见、网络安全认证意见、审计结论、采购建议、供应商认证结论或监管解释。正式项目必须由 AI Governance、Enterprise Architecture、Cyber Security、Privacy、Data Governance、Model Risk、Third-Party Risk、Legal、Compliance、I

553ai-foundations/papers/129-ai-agent-marketplace-tool-certification-governance-architecture.md

AI Agent Marketplace / Tool Certification Governance Architecture 解读

面向对象: Advanced AI PM / Senior BA / Product Architect / Enterprise Architect / AI Platform Owner / API Product Manager / Model Risk / Cyber / Data Governance / Third-Party Risk / Internal Audit Partner。 核心问题: 当企业内部开始复用 agents、tools、prompts、MCP servers、APIs、RAG capabilities 和 reusable workflows 时, 如何把“可发现、可复用、可认证、可授权、可审计、可退出”做成一套 marketplace governance architecture, 而不是把高风险 agentic capability 扔进普通 service catalog? 学习目标: 建立 agent marketplace taxonomy、capability card、tool/API certification、risk tiering、signed package provenance、permission scope、owner attestation、runtime policy control、evaluation evidence、deprecation/exit 和 auditability 的产品架构能力。

重要说明: 本文是学习、架构训练和作品集材料, 不构成法律意见、合规意见、网络安全认证意见、审计结论、采购建议、供应商认证结论或监管解释。正式项目必须由 AI Governance、Enterprise Architecture、Cyber Security、Privacy、Data Governance、Model Risk、Third-Party Risk、Legal、Compliance、Internal Audit、Platform Engineering、API/Product Owners 和业务负责人共同判断。适用性取决于行业、监管环境、数据类型、agent autonomy level、tool side effect、供应商依赖、身份体系、运行时控制能力和机构内部政策。


Source Anchors

SourceLink用途
OWASP Top 10 for LLM Applicationshttps://genai.owasp.org/llm-top-10/用 prompt injection、sensitive information disclosure、excessive agency、insecure output handling、supply chain 等风险语言定义 agent/tool marketplace 的 threat-informed certification baseline
CISA Secure by Designhttps://www.cisa.gov/securebydesign用 secure-by-design、默认安全、可证明控制和供应商责任语言设计 signed packages、owner attestation、secure defaults 和 marketplace admission gate
NIST AI RMFhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 组织 marketplace risk tier、evaluation evidence、monitoring、risk acceptance 和 lifecycle review
ISO/IEC 42001 overviewhttps://www.iso.org/standard/42001用 AI management system、policy、roles、operation、performance evaluation、internal audit 和 continual improvement 组织 operating model
OpenAPI Specificationhttps://spec.openapis.org/oas/latest.html用 contract-first API specification、schema、security scheme、operation metadata 和 versioning 支持 tool/API certification
AsyncAPI Specificationhttps://www.asyncapi.com/docs/reference/specification/latest用 event-driven contract、message schema、channel、binding 和 consumer/producer boundary 支持 agentic event tools 与 asynchronous capabilities 的认证

一句话:

Agent marketplace governance is not a catalog problem; it is a permissioned product platform for discovering, certifying, invoking, monitoring and retiring agentic capabilities with evidence.


1. Thesis

企业内部 AI agent ecosystem 会很快从“几个团队试用 Copilot”演化为一组可复用能力:

  • 业务 agents: complaint response agent、underwriting assistant、fraud investigation copilot、branch knowledge agent。
  • Tools: CRM update、case creation、refund request、KYC lookup、document retrieval、pricing quote、workflow trigger。
  • Prompts and policies: system prompt、policy prompt、claim guardrails、tone guide、regulated disclosure template。
  • MCP servers and connectors: document systems、ticketing、data warehouse、knowledge graph、code repositories、workflow engines。
  • APIs and event capabilities: OpenAPI tools、AsyncAPI event subscriptions、batch jobs、notification channels。
  • Reusable evaluation and guardrail assets: prompt injection test set、policy compliance eval、PII leakage eval、tool misuse eval。

普通 service catalog 解决的是:

What services exist, who owns them, how do I request access?

Agent marketplace 要继续解决:

What can an agent do with this capability,
what data can it access,
what side effects can it create,
which risk tier has certified it,
which evidence proves it is safe enough,
which permissions are granted at runtime,
who attested ownership,
which package and specification were signed,
how will invocation be monitored,
and how will consumers exit when it is deprecated?

核心观点: AI agent marketplace 是一个 internal platform product。它的用户不是只有开发者, 还包括业务 owner、risk reviewer、tool provider、agent builder、auditor、data steward、security engineer 和 procurement/vendor owner。它的价值不是“把链接放在一个目录里”, 而是把 product discovery、risk governance、runtime authorization、evidence collection 和 lifecycle management 变成一条可重复的 platform operating model。


2. Why It Matters

Agentic capability 的风险比普通 API 更复杂, 因为调用者不是稳定的人类或固定系统流程, 而可能是会规划、组合、重试、解释、绕路和生成参数的 AI runtime。

Marketplace failure典型表现风险放大点
Tool over-permission一个知识查询 agent 获得 CRM write、refund trigger 或 customer notification 权限excessive agency, unauthorized side effect, weak segregation of duties
Prompt/package provenance gap团队复制旧 prompt、外部 MCP server 或未签名 tool wrappersupply chain, stale policy, hidden data flow, weak audit
Capability card 不完整catalog 只写“customer data lookup”, 不写数据字段、用途、限制、eval、ownerconsumer 误用, risk reviewer 无法判断, auditor 无法追溯
Certification 只看代码没有测试 prompt injection、data leakage、unsafe tool chaining、fallback pathagentic risk 未被覆盖
Runtime 与 catalog 脱节catalog 显示 Tier 2 read-only, 实际 token 可调用 Tier 1 write toolgovernance fiction, evidence mismatch
Deprecated tool 继续被调用old endpoint、old model route、old prompt bundle 被 agent cache 使用stale control, regulatory inconsistency, operational fragility
Owner attestation 形式化owner 不确认数据用途、SLA、evidence、change notice、exit pathaccountability weak, product consumers lose trust

金融零售场景尤其敏感:

  • 工具调用可能影响客户资金、信用、投诉、欺诈冻结、费用减免、KYC/AML、产品适配性和客户通知。
  • Prompt injection 可以从 RAG content、ticket text、email attachment、browser page 或 third-party data 中进入 agent planning。
  • Agent 可能把多个低风险 read tools 组合成高风险 inference 或 action。
  • 同一个 MCP server 可能服务多个 agents, 造成 platform-level systemic dependency。
  • 内部 reusable prompt 或 API 一旦被广泛复用, 它的缺陷会跨业务线扩散。

所以 marketplace 的第一性问题不是“如何让团队更快找到工具”, 而是“如何让团队只发现、申请、调用和复用已被正确描述、正确认证、正确授权、正确监控的能力”。


3. Marketplace vs Generic Service Catalog

维度Generic service catalogAgent marketplace / certification platform
主要对象IT services, APIs, applicationsagents, tools, prompts, MCP servers, APIs, event capabilities, eval packs, policy bundles
用户意图查找服务、申请访问、了解 owner发现可组合能力、判断可用边界、申请 agent runtime permissions、查看 evidence
风险模型availability, support, cost, ownershiptool side effect, data sensitivity, prompt injection, model behavior, autonomy, provenance, chain-of-actions
描述粒度service name, SLA, owner, docscapability card, allowed uses, forbidden uses, input/output schema, permission scopes, eval evidence, data classification
认证焦点是否上线、是否支持、是否合规是否可被 agent 安全调用, 是否 contract-first, 是否 signed, 是否 least privilege, 是否 monitored
运行时控制access request often outside catalogcatalog policy binds to runtime gateway, token scopes, approval gates and invocation ledger
生命周期onboarding/offboardingcertification renewal, breaking-change gate, deprecation, consumer impact, exit readiness, evidence retention
审计问题谁拥有服务, 谁有访问哪个 agent 何时基于哪个 package/spec/prompt/tool scope 对哪个数据执行了什么动作

设计判断:

If the catalog cannot constrain runtime invocation, it is documentation.
If certification cannot produce evidence, it is ceremony.
If marketplace discovery ignores risk tier, it accelerates unsafe reuse.

4. Capability Taxonomy

Marketplace 对象必须按 agentic behavior 建模, 而不是按技术组件建模。

Capability type示例Certification focus
Agent productcomplaint response agent, onboarding agent, claims triage agentuse case, autonomy, human oversight, eval, runtime policy, business owner
Tool / functioncreate_case, lookup_customer, refund_fee, send_messageinput/output schema, permission scope, side effect, idempotency, approval requirement
Prompt / policy bundlesystem prompt, regulated response prompt, refusal policyversioning, source of policy truth, test cases, forbidden behavior, owner attestation
MCP server / connectorticketing server, document repository server, data warehouse serveridentity delegation, resource boundary, tool manifest, transport security, audit log
API productOpenAPI-described REST API used by agentscontract, auth scheme, rate limit, data classification, error semantics
Event capabilityAsyncAPI-described stream or command/event channelmessage schema, producer/consumer boundary, replay, ordering, retention, side effects
RAG / knowledge assetcurated corpus, vector index, knowledge graph endpointsource lineage, ACL, freshness, retention, citation, data use constraint
Evaluation packprompt injection suite, regulated content eval, tool misuse evalscenario coverage, pass/fail thresholds, evidence reproducibility
Guardrail / policy servicePII redaction, claim checker, policy-as-code decisioncontrol objective, false positive/negative tradeoff, bypass rule, monitoring

Marketplace card 不应把这些对象都压扁成“服务”。每类对象有不同的风险、证据、owner 和生命周期。


5. Capability Card

Capability card 是 marketplace 的核心产品资产。它不是营销介绍, 而是 agent builders、business adopters、risk reviewers 和 auditors 共同使用的决策对象。

SectionRequired content
Identitycapability id, name, type, owner, steward, provider team, business domain, version
Purposeintended use, user persona, business process, value hypothesis, non-goals
Risk tierdata sensitivity, side effect level, autonomy compatibility, regulated workflow flag
ContractsOpenAPI / AsyncAPI / MCP tool manifest / prompt bundle / eval pack references
Inputs and outputsschema, data classifications, allowed fields, prohibited fields, retention expectations
Permission scopesread/write/execute scopes, customer/account/product boundaries, environment boundary
Invocation controlshuman approval, dual control, rate limit, spend limit, transaction limit, safe-stop
Evaluation evidenceeval suite, red-team cases, prompt injection tests, tool misuse tests, policy compliance
Provenancesigned package, source repository, build attestation, dependency/SBOM or AI BOM, release notes
Runtime telemetryrequired traces, audit events, metrics, alerting, log retention, evidence export path
Consumer obligationsallowed use cases, forbidden use cases, integration checklist, incident escalation
Lifecyclecertification date, renewal date, deprecation date, replacement path, exit support

高级判断:

  • Capability card should be machine-readable enough to drive policy and human-readable enough to support product discovery。
  • Risk tier should influence search ranking、request workflow、default permissions、eval requirement and renewal cadence。
  • Owner attestation should not be a checkbox; it should bind named owners to data use、side effect、support、evidence and lifecycle obligations。

6. Risk Tiering Model

Risk tiering 的目的不是给所有能力贴标签, 而是驱动差异化 controls。

TierCapability patternExampleRequired posture
Tier 0: ReferenceNo runtime access, documentation or static prompt sampleprompt pattern library, architecture guidepeer review, source attribution, no production invocation
Tier 1: Read-only low sensitivityread public/internal non-sensitive data, no customer actionproduct FAQ lookup, policy summary from approved public docsowner attestation, schema validation, basic eval, monitoring
Tier 2: Sensitive readread customer, employee, financial, complaint or confidential dataKYC lookup, complaint history retrievaldata steward approval, purpose binding, ABAC, prompt injection testing, evidence logging
Tier 3: Controlled write/actioncreate/update internal records, draft customer content, trigger workflow without direct monetary movementcreate case, update disposition, draft letterhuman approval, policy checks, rate limits, rollback, stronger eval, periodic review
Tier 4: High-impact actionmoney movement, account restriction, credit/insurance/investment impact, legal/regulatory communicationfee refund, fraud hold, adverse action draft, customer notice senddual control, segregation of duties, model risk review, runtime command gate, incident drill, board/reporting visibility where appropriate

Risk tier inputs:

  • Data classification and customer sensitivity。
  • Tool side effect and reversibility。
  • Agent autonomy compatibility: draft-only、recommendation、human-approved action、autonomous action。
  • Regulated workflow involvement。
  • External/customer visibility。
  • Vendor dependency and data residency。
  • Evidence completeness。
  • Prompt injection exposure surface。
  • Consumer blast radius: number of agents/products that can use it。

Important nuance:

A tool can be Tier 2 in isolation and Tier 4 in a specific chain.
Marketplace governance must evaluate both capability tier and composition tier.

7. Certification Architecture

Certification 应覆盖 design-time、release-time 和 runtime 三个层面。

provider submits capability
  -> contract and manifest validation
  -> risk tier classification
  -> security and data review
  -> eval and red-team evidence review
  -> owner attestation
  -> signed package and release approval
  -> marketplace publication
  -> runtime permission binding
  -> monitoring, renewal, deprecation

认证不只是“通过一次评审”。它应该产生可持续使用的证据:

Certification gateKey questionsEvidence
Intake completenesscapability 是否有 owner、purpose、scope、contract、data map、risk tiercapability card, owner record
Contract-first gateAPI/tool/event 是否有 versioned schema and security metadataOpenAPI/AsyncAPI spec, MCP tool manifest, schema tests
Data permission gate是否清楚读取/写入哪些数据, 是否 purpose-bounddata classification, steward approval, field allowlist
Agentic threat gate是否测试 prompt injection、tool misuse、excessive agency、output handlingred-team results, OWASP-aligned test cases
Side-effect gatetool action 是否可逆、幂等、审批、限额、回滚approval rules, transaction limits, rollback design
Provenance gatepackage、prompt、connector、eval 是否可追溯且签名signed artifact, build attestation, release hash
Runtime binding gatemarketplace policy 是否能下发到 gateway/token/scopepolicy id, token scope, enforcement test
Monitoring gate是否产生 invocation ledger、metrics、alerts、evidence exporttrace schema, log retention, dashboard
Renewal gate是否有过期、复审、drift、breaking change、consumer impact 管理renewal record, change log, consumer inventory

高级 PM/architect 的判断点:

  • Certification should be proportional to risk and reusable across consumers。
  • The consumer team should not need to re-litigate every control, but must still attest its own use case and composition risk。
  • Certification evidence must remain valid after version changes, model route changes, prompt changes and tool permission changes。

8. Signed Packages and Provenance

Agent marketplace 的 supply chain 风险来自多个对象: prompt、tool wrapper、MCP server、API schema、guardrail config、eval set、RAG corpus manifest、policy bundle、agent workflow graph。

Provenance model:

source repo / package registry
  -> build and test pipeline
  -> security scan and eval run
  -> signed artifact and manifest
  -> marketplace release record
  -> runtime deployment reference
  -> invocation trace references artifact hash
ArtifactProvenance requirement
Prompt bundleversion, owner, policy source, eval result, signed hash
Tool wrappersource repo, dependency scan, input/output schema, signed release
MCP servertool manifest, transport/security config, server identity, package signature
OpenAPI toolspec version, operation id, security scheme, schema conformance, breaking-change record
AsyncAPI event toolchannel, message schema, binding, consumer group, retention, replay policy
RAG corpussource list, ACL map, document hashes, refresh cadence, deprecation rules
Eval packscenario source, coverage statement, thresholds, run id, result hash
Agent workflowgraph version, allowed tools, fallback paths, approval gates, release attestation

Without provenance, runtime audit becomes narrative-driven:

We think the agent used the approved tool.
We think the prompt was current.
We think the RAG source was allowed.

With provenance, audit can ask:

Which signed capability version, under which policy scope,
was invoked by which agent, with which user delegation,
against which data object, producing which side effect?

9. Permission Scopes and Runtime Controls

Marketplace governance only matters if it can constrain runtime behavior。

Reference runtime model:

agent request
  -> identity and delegation context
  -> marketplace entitlement check
  -> policy decision point
  -> tool gateway / MCP gateway / API gateway
  -> schema validation and parameter policy
  -> approval or dual-control gate
  -> execution
  -> invocation ledger and evidence export

Scope design:

Scope dimensionExamples
Actionread, search, summarize, draft, create, update, approve, send, refund, restrict
Data boundaryproduct, customer segment, jurisdiction, account type, case type, document collection
Environmentsandbox, test, pilot, production, emergency break-glass
Identityagent id, human delegator, service account, role, business unit
Timesession, case duration, campaign window, standing entitlement, expiring token
Volumemax calls, max records, max monetary value, max customer messages
Approvalnone, human review, supervisor approval, dual control, risk approval
Purposecomplaint handling, fraud investigation, marketing review, underwriting support

Controls that distinguish agent marketplace from API gateway alone:

  • Parameter-level policy: e.g. an agent may read complaint summaries but not full attachment text。
  • Chain-aware policy: a read tool followed by a write tool may require additional approval。
  • Prompt-aware and source-aware checks: tool call allowed only if source evidence meets freshness/citation requirements。
  • Runtime permission downgrade: high-risk signals can move agent from action mode to draft-only mode。
  • Safe-stop and kill switch: disable capability or consumer entitlement without redeploying every agent。
  • Invocation ledger: records intent, context, policy decision, parameters, output, side effect and approval。

10. Operating Model

Marketplace governance requires product management, not just architecture review。

RoleAccountability
Marketplace Product Ownerroadmap, user experience, adoption, policy integration, discovery, metrics
Capability Ownerbusiness purpose, data use, support, change notice, lifecycle and attestation
Tool/API Product Managercontract quality, versioning, developer experience, consumer communication
AI Platform Engineeringregistry, gateway, policy enforcement, telemetry, signing, deployment integration
Enterprise Architecturetarget architecture, standards, composition patterns, technology rationalization
Cyber Securitythreat model, secure-by-design controls, identity, vulnerability, runtime protection
Data Governance / Privacydata classification, purpose binding, field-level permissions, retention
Model Risk / AI Governancerisk tiering, eval sufficiency, monitoring, use-case approval alignment
Third-Party Riskvendor-owned capabilities, contract obligations, assurance evidence, exit path
Internal Auditindependent assurance over certification, runtime evidence and lifecycle controls
Consumer Teamuse-case fit, integration, local controls, user training, incident escalation

Governance forums:

  • Weekly marketplace intake and certification triage。
  • Biweekly platform product review: backlog, adoption friction, policy exceptions, developer experience。
  • Monthly AI governance committee: Tier 3/4 capabilities, exceptions, incidents, high-risk changes。
  • Quarterly architecture/risk review: systemic dependency, vendor concentration, deprecation, audit findings。
  • Annual certification framework refresh: standards, threat model, eval library, control effectiveness。

Operating model test:

Can a new team discover a certified tool, understand its risk and obligations,
request the right scopes, integrate through the runtime gateway,
prove evidence, and exit safely when the tool changes?

11. Product and Architecture Decisions

DecisionOption AOption BSenior judgment
Marketplace scopecatalog only approved capabilitiesinclude experimental sandbox assets with clear labelsInclude sandbox only if runtime separation and search filters prevent accidental production reuse
Certification modelcentral board reviews everythingrisk-tiered delegated certificationUse risk-tiered delegation; central governance handles Tier 3/4, exceptions and standards
Capability cardsfree-form documentationstructured card linked to policy and evidenceStructured card, with narrative fields only where needed
Runtime gatewayoptional best practicemandatory for certified invocationMandatory for production agentic invocation; otherwise certification cannot bind runtime
Permission modelbroad app accessscoped capability entitlementScope by action, data, purpose, identity, environment and approval
Package provenancerepo links and release notessigned artifacts with manifest and hashSigned packages for production; hash referenced in invocation traces
Eval evidenceper team ad hoc evalsshared eval packs plus use-case evalsShared baseline for common risks, local evals for business-specific harm
Deprecationowner sends emailmarketplace-driven consumer inventory and sunset gateConsumer impact map and blocked new usage before shutdown
Exceptionsinformal risk acceptanceexpiring exception with owner, compensating controls and evidenceExpiring exception; renewal requires updated evidence

12. Control Matrix

Control objectiveControl activityEvidence
Prevent unsafe discoveryMarketplace search filters by risk tier, environment, certification status and consumer eligibilitysearch policy, eligibility logs, publication status
Ensure clear ownershipEvery capability has accountable owner, steward, support path and attestationowner attestation, support SLA, renewal record
Constrain agentic actionRuntime gateway enforces scopes, approvals, limits and purpose bindingpolicy decision log, token scopes, approval packet
Protect sensitive dataField-level allowlists, ABAC, data classification and retention controlsdata map, steward approval, access logs
Reduce prompt injection and tool misuseCertification includes adversarial evals and tool-call scenario testingeval run id, test cases, pass/fail evidence
Establish provenanceProduction packages, prompts, specs and manifests are signed and traceablesignature, hash, release manifest
Manage change safelyBreaking changes require consumer impact analysis and recertificationchange record, consumer inventory, recertification evidence
Enable auditabilityInvocation ledger records agent, delegator, tool, parameters, policy decision and side effectledger export, trace id, evidence pack
Support deprecation/exitDeprecated capabilities block new adoption, notify consumers and provide migration pathdeprecation notice, replacement mapping, shutdown record
Govern exceptionsExceptions expire, include compensating controls and owner approvalexception register, expiry, review outcome

13. Metrics and KRIs

Marketplace should be measured as a platform product and a governance control surface。

MetricWhy it matters
Certified capability adoption rateshows whether teams reuse trusted assets instead of creating shadow tools
Time from submission to certificationmeasures governance throughput and bottlenecks
Capability card completeness ratemeasures discovery and audit readiness
Runtime invocation coveragepercentage of production tool calls passing through governed gateway
Scope precision ratiogranted scopes vs actually used scopes, highlighting over-permission
Eval evidence freshnessdays since last eval or red-team run by risk tier
Signed artifact coveragepercentage of production capabilities linked to signed package/hash
Deprecated capability invocation countmeasures lifecycle control effectiveness
Exception age and renewal rateshows governance debt and risk acceptance discipline
Consumer exit readinesspercentage of consumers with tested migration or fallback plan

KRIs:

KRIExample threshold
Tier 3/4 tool outside runtime gatewayzero tolerance
Sensitive read capability without data steward approvalimmediate escalation
Customer-impacting action without approval evidenceincident review
Unsigned package in production agent pathblock new invocation
Capability card missing data classificationpublication blocker
Deprecated tool still used after sunsetexecutive escalation for high-risk capabilities
Prompt injection eval failure without compensating controlcertification hold
Broad wildcard scope granted to agentsecurity architecture review
Owner attestation expiredcapability hidden from new consumers
Invocation ledger missing side-effect resultaudit evidence defect

14. Failure Modes

Failure modeWhy it happensConsequenceBetter pattern
Marketplace becomes a link directoryteam optimizes for inventory, not runtime governanceunsafe capabilities look officialpublish only with capability card, tier, evidence and enforcement path
Certification detached from gatewayreview happens in slides, runtime uses direct credentialspolicy is unenforceablebind certification to entitlement and runtime policy
Tool providers under-describe side effectsAPI team thinks in endpoints, not agent behavioragents use tools in unintended chainsrequire side-effect taxonomy, reversibility and approval model
One-time approval culturecapability approved once and forgottenstale prompt, model, source or endpoint persistsrenewal cadence by risk tier and change triggers
Marketplace hides risk to improve adoptionhigh-risk tools marketed as “easy reuse”business teams bypass reviewmake risk visible but navigable with clear gates
Broad service accountsagents share generic credentialsweak attribution and excessive accessagent identity, delegated user context and scoped tokens
Eval theateronly happy-path tests or generic quality checksprompt injection/tool misuse missedthreat-informed eval library and use-case harm tests
No consumer inventoryowner cannot know who uses capabilitydeprecation and incident response failentitlement graph and invocation graph
No exit pathcapability becomes critical but unsupportedvendor or owner failure creates operational lock-inlifecycle and replacement plan before Tier 3/4 adoption
Audit evidence is reconstructed manuallytraces do not link card, package, scope and invocationslow regulator/internal audit responseevidence-by-design ledger

15. Interview-Ready Takeaways

Q1: AI agent marketplace 和普通 service catalog 最大区别是什么?

普通 service catalog 主要解决服务发现和访问申请。AI agent marketplace 还必须解决 agentic risk: tool side effects、data access、permission scopes、prompt injection、package provenance、runtime invocation evidence、owner attestation、risk-tiered certification 和 deprecation/exit。它不是目录, 而是受控的 platform product。

Q2: 你会如何设计 capability card?

我会把 card 做成人机共同可读的产品资产: identity、purpose、risk tier、allowed/forbidden uses、input/output schema、data classification、permission scopes、evaluation evidence、signed package provenance、runtime telemetry、consumer obligations、support and lifecycle。Card 要能驱动 search、access request、policy enforcement 和 audit evidence。

Q3: Tool certification 应该认证什么?

不只是代码质量。要认证 contract-first schema、auth/security scheme、data permissions、side effects、idempotency、approval requirement、prompt injection/tool misuse tests、signed package、runtime gateway enforcement、monitoring and evidence export。对 Tier 3/4, 还要有 human approval、dual control、limits、rollback 和 recertification。

Q4: 为什么 signed package 对 AI governance 重要?

Agent runtime 复用的是 prompt、tool wrapper、MCP server、API spec、guardrail config、RAG manifest 和 eval pack 的组合。没有签名和 hash, 事故后无法证明到底调用了哪个版本。Signed provenance 让 invocation trace 可以连接到 approved artifact。

Q5: 如何避免 marketplace 变成创新阻力?

把治理做成平台产品: 清晰分类、低风险快速通道、标准化 capability card、自动 schema validation、共享 eval packs、self-service sandbox、可解释 certification status、明确 SLA 和 developer experience metrics。高风险能力需要强控制, 低风险能力应快速复用。

Q6: 你如何处理 MCP servers 的治理?

不把 MCP 当特殊例外。MCP server 要有 tool manifest、server identity、transport/security controls、resource boundary、data classification、permission scopes、audit logs、package provenance、owner attestation and deprecation path。关键是把每个 exposed tool 认证为 capability, 并通过 runtime gateway 或 equivalent control enforcement 约束调用。


16. Practical Templates

16.1 Capability Card Example

# Capability Card: Complaint Case Update Tool

Capability ID: cap-servicing-complaint-case-update-v3
Type: Tool
Owner: Head of Customer Servicing Platforms
Steward: Customer Data Steward, Servicing Domain
Business Domain: Customer Servicing and Complaint Operations
Risk Tier: Tier 3
Certification Status: Restricted

Purpose:
Allow approved complaint response agents to draft and submit internal case disposition updates after human reviewer approval.

Allowed Uses:
- Update complaint case status after the assigned complaint specialist confirms evidence and resolution code.

Forbidden Uses:
- Do not send customer-visible messages, issue refunds, close regulatory complaints or override fraud/account restrictions.

Data Boundary:
- Data classes: customer complaint metadata, case notes, resolution codes.
- Allowed fields: case id, disposition code, reviewer id, resolution summary, supporting evidence ids.
- Prohibited fields: full payment card number, authentication secrets, unrelated account balances, medical or HR records.
- Retention: invocation ledger retained for seven years for complaint governance evidence.

Permission Scopes:
- Action: update complaint case disposition.
- Data: assigned cases in servicing complaint workflow.
- Purpose: complaint resolution and quality review.
- Environment: production with restricted entitlement.
- Approval: human complaint specialist approval before submit.

Contracts and Manifests:
- OpenAPI / AsyncAPI / MCP manifest / prompt bundle: servicing-case-tools-openapi.yaml.
- Version: 3.4.1.
- Signed artifact hash: sha256:9f8a5f2c-servicing-case-tool-v3-4-1.

Evaluation Evidence:
- Eval suite: complaint-tool-misuse-and-prompt-injection-v2.
- Last run: 2026-06-15.
- Pass threshold: 100% block rate for unauthorized closure/refund/send attempts.
- Known limitations: does not validate downstream CRM rendering defects; monitored by case platform QA.

Runtime Controls:
- Gateway: agent-tool-gateway-prod.
- Rate/volume limits: 20 updates per specialist per hour, 500 updates per agent per day.
- Human approval: required approval_packet_id for every write.
- Logging: invocation id, agent id, reviewer id, case id, policy decision, side effect.
- Alerting: deny or escalate when prompt injection signal appears before write action.

Lifecycle:
- Certified on: 2026-06-20.
- Renewal date: 2026-12-20.
- Deprecation policy: block new consumers 60 days before shutdown.
- Replacement path: cap-servicing-complaint-case-update-v4 after CRM migration.

16.2 Certification Decision Record Example

# Certification Decision Record

Capability: cap-servicing-complaint-case-update-v3
Version: 3.4.1
Risk Tier: Tier 3
Decision: Restricted
Decision Date: 2026-06-20
Decision Owners: AI Governance Delegate, Customer Servicing Platform Owner, Cyber Security Reviewer

Evidence Reviewed:
- Capability card: approved record cap-servicing-complaint-case-update-v3.
- Contract/spec: servicing-case-tools-openapi.yaml version 3.4.1.
- Data map: complaint-case-field-allowlist-v5.
- Eval results: complaint-tool-misuse-and-prompt-injection-v2 run 2026-06-15.
- Security review: appsec-review-servicing-tools-2026-06.
- Runtime enforcement test: gateway-policy-test-run-8842.
- Owner attestation: signed by Head of Customer Servicing Platforms on 2026-06-18.

Conditions:
- Production use requires approval_packet_id and assigned-case authorization for every update.

Consumer Obligations:
- Consumer agents must remain draft-only when prompt injection score is elevated or source evidence lacks current complaint policy citation.

Renewal Trigger:
- Recertify before CRM v4 migration, any new closure/refund action, or 2026-12-20 renewal date.

Decision Rationale:
Restricted certification is appropriate because the tool creates internal case side effects, but runtime approval, scoped authorization, prompt-injection downgrade and invocation ledger controls reduce risk to the approved Tier 3 posture.

16.3 Runtime Invocation Evidence Schema

FieldMeaning
invocation_idunique tool/capability call id
agent_idcalling agent or workflow
human_delegator_iduser or approver where relevant
capability_idmarketplace capability id
capability_versioncertified version
artifact_hashsigned package/prompt/spec hash
risk_tiereffective risk tier at invocation
scope_grantedapproved scopes
scope_usedscopes actually exercised
purpose_codedeclared business purpose
policy_decisionallow, deny, step-up, draft-only, dual-control
input_schema_resultvalidation outcome
data_objectsdata categories or object references
tool_parameters_hashhash or redacted parameter record
output_hashoutput or result reference
side_effectrecord updated, message drafted, message sent, refund created
approval_packet_idhuman or dual-control approval evidence
control_exceptionsexception id if used
trace_export_statusavailable, partial, failed

17. Final Operating Principle

成熟的 AI agent marketplace 可以用一句话检验:

Can the enterprise prove that every reusable agentic capability was discoverable,
accurately described, risk-tiered, certified with evidence, signed, permissioned,
monitored at runtime, owned by accountable people, and safely retired when trust expires?

如果答案不清楚, 企业不是缺一个 catalog 页面, 而是缺一套把 AI product architecture、platform governance、tool/API certification、runtime authorization 和 audit evidence 连接起来的 agent capability control plane。