返回 Papers
AI 扩展计划 / Playbooks

AI Product Portfolio Rationalization / Capability Decommission Playbook

版本: v1.0

902AI_PRODUCT_PORTFOLIO_RATIONALIZATION_CAPABILITY_DECOMMISSION_PLAYBOOK.md

AI Product Portfolio Rationalization / Capability Decommission Architecture Playbook

版本: v1.0 日期: 2026-06-30 适用对象: CBAP、Senior BA、AI PM、Product Portfolio Lead、Business Architect、Enterprise Architect、Solution Architect、AI Platform PM、FinOps、Operations、Records、Legal、Compliance、Risk、Internal Audit、金融零售转型负责人

定位: 把 AI product portfolio rationalization、capability decommission、application portfolio management、feature/API/model sunset、vendor exit、records/legal-hold control、customer migration 和 architecture governance 统一成一套可执行的金融零售高级手册。

重要说明: 本文是学习、作品集和内部架构训练材料, 不是法律意见、合规结论、监管解释、客户通知模板、记录保留意见或诉讼保全建议。正式项目必须由 Legal、Compliance、Privacy、Records Management、Risk、Security、Model Risk、Operations、Business Owner、Procurement、Internal Audit 和相应管理层按机构类型、产品、司法辖区、客户群、合同、监管关系和事实场景确认。


1. Executive Framing

AI 降低了新增产品能力的门槛, 也提高了组合失控的速度。一个金融零售企业可能在一年内同时出现:

  • 多个业务线各自建设客服 copilot。
  • 多个团队各自做 RAG 知识库和 embedding。
  • 同一 capability 由旧核心系统、新 SaaS、low-code workflow 和 AI agent 同时提供。
  • 多套 prompt、模型、工具权限、日志和 eval 证据并存。
  • 供应商模型、插件、向量库和 workflow 平台产生新的锁定。
  • 旧功能、旧 API 和旧模型没有退出路径, 导致运营、审计、客服、培训和合规复杂度持续上升。

本 playbook 的核心判断:

Portfolio rationalization is not cost cutting.
It is capability architecture, risk governance and capacity reallocation.

中文表达:

组合瘦身不是砍功能, 而是把企业能力放回正确的位置, 让客户价值、风险控制、架构简化和组织容量重新对齐。

1.1 成熟能力要回答的问题

QuestionMature answer
企业到底有多少重复 capabilitycapability map + product/application/API/model dependency graph
哪些能力值得保留、合并、迁移或下线risk-adjusted value and migration feasibility scorecard
AI 如何帮助识别重复和低价值能力usage/cost/risk mining, semantic clustering, impact analysis, sunset draft
谁能决定关闭客户可见能力portfolio governance forum, accountable executives and required control partners
legal hold / records 如何处理no disposal or evidence deletion before authorized review and hold check
API / feature / model 如何 sunsetversion policy, consumer inventory, migration cohort, deprecation evidence
客户如何迁移cohort design, alternative path, communication approval, support readiness
下线后如何证明做对了decommission certificate, evidence binder, benefit realization and risk monitoring

2. Source Anchors

以下官方来源作为控制语言和架构证据锚点。它们不自动产生任何机构特定的法律、合规或审计结论。

AnchorOfficial source本 playbook 使用方式
FFIEC Management IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/management.aspx用 governance、enterprise architecture、IT responsibilities、planning IT operations and investment、risk identification、metrics、SLAs、control effectiveness 和 reporting 组织组合治理
FFIEC Architecture, Infrastructure, and Operations IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/architecture-infrastructure-and-operations/用 AIO governance、asset inventory、data classification、data flow diagrams、business process narratives、change、resilience、capacity、log management、monitoring 和 continuous improvement 设计 decommission evidence
FFIEC Development, Acquisition, and Maintenance IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/development-acquisition-and-maintenance/用 SDLC、quality、documentation、impact analysis、rollback/back-out plan、end-of-life、termination and disposal、exit strategy、third-party and supply chain risk 设计 lifecycle sunset
FFIEC Business Continuity Management IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/business-continuity-management.aspx用 BIA、critical business functions、interdependency analysis、continuity strategies、communications、exercises/tests 和 board reporting 管理迁移和下线期间的韧性
NIST SP 800-160 Vol. 1 Rev. 1https://csrc.nist.gov/pubs/sp/800/160/v1/r1/final用 systems security engineering、trustworthiness、stakeholder requirements、life cycle、assurance、validation and verification 管理系统级可信停用
NIST AI RMFhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 组织 AI use case、model/tool/vendor dependency、impact、measurement、monitoring 和 risk treatment
ISO/IEC 42001https://www.iso.org/standard/81230.html用 AI management system 的 policy、objectives、risk/opportunity、operation、performance evaluation、improvement 和 lifecycle governance 设计 AI capability 管理体系
Wardley Mapping / Team TopologiesNon-official conceptual context仅作为 capability evolution、commoditization、team boundary、platform ownership 的思维工具, 不作为监管或合规依据

3. Scope: Rationalization Objects

本 playbook 处理的不是单一系统退役, 而是跨产品、能力、应用、接口、模型和供应商的组合治理。

ObjectIncluded examplesTypical owner
Product / product line账户服务、信用卡权益、贷款申请、客服渠道、商户工具Product executive / GM
Business capabilityonboarding、KYC、case triage、dispute resolution、policy retrieval、offer decisioningBusiness architect / capability owner
Applicationcore workflow、case system、portal、mobile module、analytics dashboardApplication owner
FeatureAI summary、document upload、legacy report、manual export、old journey stepProduct owner
API / event / batchpublic API、partner API、internal service endpoint、event topic、file feedAPI owner / platform owner
AI modelLLM、embedding model、classifier、ranking model、fraud model、OCR modelModel owner / AI platform
Prompt / policy / evalsystem prompt、policy pack、rubric、golden set、judge promptAI product / model risk / domain owner
RAG / knowledge assetvector index、source document set、taxonomy、ontology、retrieval pipelineKnowledge owner / data owner
Agent toolCRM update、payment action、case note write、email send、ticket closeTool owner / process owner
Vendor / SaaSmodel provider、RAG platform、workflow SaaS、observability、annotation vendorProcurement / TPRM / platform

Out of scope:

  • 单纯预算裁剪, 没有客户、风险、架构和迁移证据。
  • 法律或合规结论自动化。
  • 仅通过 AI 生成关闭名单并执行。
  • 绕过 product owner、architecture、risk、legal、records、operations 的客户影响决策。

4. Operating Principles

PrinciplePractical meaning
Capability first先看业务能力和客户任务, 再看应用和功能
Evidence before opinion先收集 usage、value、cost、risk、dependency、records 和 migration evidence
AI assists, humans decideAI 挖信号、聚类、草拟计划, 人类负责客户影响、风险接受和停用决策
Launch includes exit新能力上线不算完成, 旧能力退出和容量释放也要纳入成功标准
No silent shutdown客户、API consumer、运营和支持团队不能被突然切断
Records before disposal数据、日志、模型证据、客户沟通和 vendor records 在删除前必须通过 retention/hold review
Critical controls may look low usage低使用不等于低价值, 控制能力和韧性能力需要单独判断
Target architecture wins合并不是把旧系统拼在一起, 而是迁移到明确 target capability
Measure after shutdown下线后必须复盘客户影响、成本释放、风险变化和容量重分配

5. End-to-End Lifecycle

Trigger
  -> Inventory and evidence collection
  -> AI-assisted signal mining and clustering
  -> Dependency graph and impact analysis
  -> Rationalization scorecard
  -> Governance decision
  -> Migration and sunset design
  -> Customer / API / operations communication
  -> Controlled decommission execution
  -> Evidence binder and certificate
  -> Benefits, risk and capacity realization review

5.1 Trigger Types

TriggerExamplesRequired response
Strategic simplificationtoo many products, inconsistent journeys, platform consolidationcapability rationalization review
Cost pressuretoken cost spike, SaaS renewal, support burden, low margincost-to-serve and unit economics analysis
Risk eventincident, complaint spike, model failure, control issuerisk-weighted shutdown or restriction review
Architecture debtduplicate data, old API, EOL component, brittle integrationtarget architecture migration plan
Vendor changemodel deprecation, price increase, contract concern, service degradationvendor exit or replacement assessment
Regulatory / audit attentionweak records, poor evidence, unclear ownershipevidence and control remediation
New platform capabilityshared RAG, model gateway, common case assistantmerge and legacy exit plan

5.2 Stage Gates

GateDecisionMinimum evidence
IntakeIs this rationalization candidate worth analysis?trigger, owner, impacted domain, initial hypothesis
Evidence readinessIs there enough data to score?inventory, usage, cost, risk, dependency, constraints
DecisionKeep, invest, merge, migrate, freeze, sunset, archive, replacescorecard, target architecture, residual risk
Migration readinessCan customers/systems move safely?cohort plan, test, rollback, communication, support
Decommission readinessCan old object be disabled or removed?dependency clear, records/hold review, approvals, runbook
ClosureWas the decommission completed and measured?certificate, evidence binder, post-review metrics

6. Inventory Design

6.1 Product / Capability Inventory Fields

FieldDefinitionEvidence source
product_name客户或员工看到的产品/服务product catalog
capability_idbusiness capability map 中的能力节点capability model
owneraccountable product/business ownerRACI
customer_segmentaffected segment, channel, geography, product typeCRM / segmentation
job_to_be_donetask or outcome servedresearch / journey map
value_metricrevenue, risk reduction, time saved, complaint reduction, conversionmetric catalog
usage_metricactive users, transaction volume, API calls, model calls, case counttelemetry
cost_to_serveinfra, vendor, model, operations, support, controls, recordsFinOps / ops
risk_tiercustomer, financial, compliance, privacy, security, operational riskrisk assessment
replacement_capabilitytarget product/capability or no replacementtarget architecture
migration_complexitycustomer, data, integration, contract, training, recordsimpact analysis

6.2 Application Portfolio Fields

FieldDefinition
app_id / app_namesystem of record, system of engagement, workflow, analytics, AI service
supported_capabilitiesmapped capability IDs
business_criticalitycritical, important, supporting, convenience
lifecycle_statusstrategic, tolerate, migrate, contain, retire
technology_healthversion, EOL, patch posture, maintainability, test coverage
operational_healthincidents, availability, performance, support burden
integration_countAPIs, events, files, manual workarounds
data_and_recordsdata owner, retention, legal hold, archive, deletion constraints
AI_dependenciesmodel, prompt, RAG, tool, vendor, eval, trace
decommission_blockersdependencies, contracts, records, missing replacement, customer exceptions

6.3 AI Asset Registry Fields

AI assetRequired registry fields
Modelmodel_id, provider, version, route, business use, risk tier, eval status, cost, fallback, EOL
Promptprompt_id, version, owner, policy boundary, linked eval, consumer apps, status
Eval setdataset, source, coverage, rubric, owner, recency, release gate
RAG indexsource systems, document owners, ACL, embedding model, refresh cadence, retention
Tooltool name, action type, side effect, permission model, approval, audit log
Agent workflowstate machine, authority level, human handoff, rollback, incident trigger
Vendorservice, contract term, data rights, change notice, exit rights, deletion evidence

7. Dependency Graph Model

Rationalization 的核心资产是 dependency graph, 不是静态清单。

7.1 Nodes

CustomerSegment
Product
Capability
JourneyStep
Feature
API
Application
DataStore
RecordClass
Model
Prompt
RAGIndex
Tool
Vendor
Control
Team
Contract
LegalHold
Metric

7.2 Edges

EdgeMeaning
Product provides Capability产品提供某业务能力
Feature supports JourneyStep功能支撑旅程步骤
Feature calls API功能依赖接口
API served_by Application接口由应用提供
Application reads/writes DataStore应用数据依赖
Model used_by Feature模型服务某功能
Prompt configures Modelprompt 影响模型行为
RAGIndex grounded_in DataStore检索索引来自数据/文档
Tool performs Actionagent 工具执行操作
Vendor supplies Model/Tool/App第三方供应能力
RecordClass retained_in DataStore记录保留位置
LegalHold applies_to RecordClass保全约束
Control monitors Asset控制覆盖资产
Team owns Assetowner 关系

7.3 High-Value Queries

QueryWhy it matters
哪些 products 提供同一 capability?识别重复能力
哪些 features 使用同一模型但证据不同?识别模型治理不一致
哪些 API consumer 仍调用旧版本?判断 API sunset readiness
哪些 customer cohorts 无替代路径?迁移分层和沟通
哪些 tools 有 write side effect?防止 agent 下线或迁移造成业务状态不一致
哪些 records or holds 与目标资产相关?防止证据删除或迁移违反约束
哪些 vendors 同时支撑多个 critical capabilities?集中度和退出风险
下线某应用会释放哪些团队 capacity?capacity reallocation

8. AI-Assisted Signal Mining

AI 的价值在于把分散证据变成可审查的候选项, 而不是替代治理。

8.1 Signal Sources

SignalSourceAI use
Usageweb/app telemetry, feature flags, API gateway, event logsdetect low usage, cohort patterns, seasonality
Costcloud billing, token cost, vendor invoices, support timeallocate cost-to-serve by product/capability
Riskincidents, complaints, QA defects, audit issues, model eval failurescluster recurring failure modes
ProcessBPMN, SOPs, case logs, process miningidentify duplicated workflow steps
ArchitectureCMDB, service catalog, code search, API specs, lineagebuild dependency graph and stale integration view
Customerfeedback, call transcripts, NPS comments, support ticketsmap pain points and migration friction
Contractvendor agreements, renewal dates, exit clauses, noticesidentify exit windows and obligations for review
Recordsretention schedules, hold registers, archive locationsflag disposal constraints for authorized review

8.2 AI Outputs

AI outputHuman review question
Duplicate capability clusterAre these truly same customer/business outcomes or only semantically similar?
Low-value candidate listDoes low usage hide critical control, VIP, regulatory or seasonal value?
Cost anomalyIs cost avoidable, allocatable and tied to a rationalization decision?
Risk clusterDoes this justify restriction, redesign, migration or full sunset?
Migration cohort proposalAre vulnerable customers, contract commitments and operational exceptions handled?
Draft sunset planAre dates, owner, notice, support, rollback and evidence feasible?
Impact analysisDid the graph miss manual processes, reports, controls or vendors?

8.3 AI Boundary Control

ControlRequirement
Decision authorityAI output is recommendation evidence only
TraceabilityAI-generated clusters and plans link back to source data
Human sign-offproduct, architecture, operations, risk, legal/records as applicable
Bias and blind spotscheck long-tail customers, accessibility, vulnerable customers, low-volume controls
Data minimizationsensitive customer data is not overexposed during analysis
Prompt/version retentionAI analysis prompts and outputs are retained according to evidence policy

9. Rationalization Scorecard

9.1 Scoring Dimensions

Use 1-5 scoring, with written evidence for each dimension. Weighting should be approved by the portfolio governance forum.

Dimension1 score5 score
Customer valueunclear or negative valueclear, measured, customer-critical value
Strategic fitperipheral or obsoletedirectly supports target strategy/capability
Profitability / economic valuenegative margin, no risk benefitstrong margin or measurable risk/cost reduction
Cost-to-servehigh and risingefficient or strategically justified
Operational riskhigh incident/control burdenstable, controlled, resilient
Architecture fitduplicate, brittle, off-targettarget architecture, reusable, observable
AI dependency healthopaque, high cost, weak evalgoverned, evaluated, portable, monitored
Vendor / concentration risklocked-in, poor exitacceptable concentration, exit-ready
Records / legal-hold complexityunresolved constraintsreviewed and manageable
Migration feasibilityno safe pathtested, low-risk migration path

9.2 Decision Bands

PatternDecisionNotes
High value + good architectureKeep / investfund scale and migration from weaker duplicates
High value + high architecture debtRebuild / platformizedo not sunset value, reduce debt
Medium value + overlap + high costMerge / migratetarget capability and decommission duplicate
Low value + low risk + migration readySunsetexecute controlled decommission
Low value + unresolved records/holdFreeze execution pending reviewno disposal until authorized review
High risk + value unprovenRestrict / stop / redesignprotect customers and operations
Vendor EOL + replacement existsReplace / exitfollow contract and evidence controls
Critical control + low usageRetain as controlclassify as control capability, not normal product feature

10. Decision Types

DecisionWhen to useKey controls
KeepUnique value, acceptable economics, acceptable riskowner, metrics, roadmap, controls
InvestStrategic capability needs scalefunding, target architecture, capacity
MergeDuplicate capabilities can move to one targetmigration cohorts, data/API mapping, communication
MigrateOld product/app/API/model replaced by targetparallel run, rollback, evidence
FreezeStop new customers or new usage while existing runsfeature flags, sales/channel controls
RestrictReduce scope because risk or cost too highcohort limits, permissions, manual approval
SunsetPlanned feature/API/model/product retirementnotice, migration, cutoff, support
ArchivePreserve read-only evidence without active operationrecords retention, access controls
Replace vendorVendor risk, EOL, cost or capability issueexit plan, data return, deletion, transition
Retain as controlLow usage but high control/continuity valuecontrol owner, test, evidence

11. Migration Architecture

11.1 Migration Patterns

PatternUse caseRisk
Hard cutoverLow usage, internal-only, no critical dependencyoperational surprise if inventory incomplete
Parallel runAPI/model/workflow replacement needs comparisoncost and data reconciliation
Cohort migrationcustomer-facing capability with different risk groupsexception handling complexity
Adapter bridgeold API consumers need timebridge becomes permanent debt
Read-only archiveold app no longer active but evidence requiredaccess and retention governance
Dual model routemodel replacement with eval comparisoninconsistent outputs during transition
Shadow modeAI/model/tool candidate tested without customer impactfalse confidence if traffic not representative

11.2 Migration Cohort Design

CohortRecommended handling
Internal power usersearly pilot, feedback loop, training
Low-risk active customersfirst live migration with support monitoring
High-value customersaccount-team assisted migration
Vulnerable or accessibility-sensitive customersenhanced communication and alternate channel
Contracted enterprise / partner API usersformal notice, test window, certification
Dormant customersnotification and sunset with reactivation path
Legal-hold / investigation-related recordsno disposal or migration deletion without authorized review
High-risk AI workflow usersshadow, dual control, manual approval

11.3 Back-Out Plan

Every migration must define:

ElementRequired detail
Triggermetric or incident that stops migration
Ownernamed business and technical owner
Time windowwhen rollback can occur safely
Data reconciliationhow to reconcile state changes
Customer handlingmessage, support script, exception path
Evidencerollback decision log and post-incident review

12. Feature Sunset Controls

PhaseControls
Announce internallyowner, rationale, impacted users, support readiness
Freeze new entrydisable new enrollment, hide new purchase/configuration path
In-product noticeapproved message, alternative, date, support channel
Usage monitoringactive users, failed attempts, workaround usage, complaints
Migration supportwizard, bulk migration, assisted service, training
Final cutofffeature flag off, route disabled, access blocked except approved exception
Archiveretain evidence, configuration, communications and decision trail
Post-reviewconfirm cost, risk, complaint and capacity outcomes

13. API Sunset Controls

ControlRequirement
Version policydefine support window, deprecation status and cutoff date
Consumer inventoryidentify internal apps, partners, batch jobs, reports and unknown consumers
Deprecation telemetrytrack calls by client, endpoint, method, error and business process
Contract testsensure target API supports required behavior
Deprecation headersmachine-readable notice where appropriate
Developer communicationchangelog, migration guide, test environment, support channel
Parallel operationold and new version operate until readiness threshold
Rate and access controlprogressive restriction after notice window
Final disablementrevoke route, keys or version with rollback path
Evidenceconsumer sign-off, test results, final traffic proof

14. Model / Prompt / RAG / Tool Sunset Controls

14.1 Model Sunset

StepEvidence
Identify model consumersmodel gateway route inventory
Compare replacementeval result, latency, cost, safety, refusal, citation quality
Shadow or dual runoutput comparison and defect review
Update model card / system cardversion, limitations, approved use
Freeze old routeroute config and approval
Retain evidenceold model usage, eval, decision, incidents
Disable old routegateway policy, monitoring proof

14.2 Prompt Sunset

StepEvidence
Mark prompt as deprecatedprompt registry status
Link replacement promptversion diff and owner
Run regression evalgolden set and negative cases
Block new deploymentsCI/CD or registry gate
Archive old promptcontent, metadata, approval, effective dates

14.3 RAG Index Sunset

StepEvidence
Map source documentsdocument owner, retention, ACL
Build target indexsource lineage, embedding model, chunking, refresh
Compare retrievalcitation precision, permission filter, stale-source rate
Freeze old ingestionpipeline config
Migrate consumersroute and feature mapping
Dispose or archiverecords/hold review, deletion or retention proof

14.4 Tool / Connector Sunset

StepEvidence
Identify side effectswrite actions, external communication, financial action
Reconcile statepending tasks, retries, idempotency keys
Revoke permissionsIAM and tool broker records
Replace workflowtarget tool or manual process
Retain audit logstool call history and approval evidence
Disable connectorconfig, monitoring, final zero-traffic proof

15. Vendor Exit and Dependency Reduction

AI product rationalization often exposes vendor concentration.

Vendor concernRationalization response
Model provider price increaseroute optimization, smaller model, alternative provider, use-case restriction
RAG SaaS lock-inexport index metadata, rebuild ingestion, standardize chunk/source schema
Workflow vendor EOLprocess migration, API replacement, archive evidence
Observability vendor gapnormalize trace schema before exit
Annotation vendor quality issueexport labels, reviewer calibration evidence, replacement process
Contract lacks exit supportescalate before renewal, reduce dependency, negotiate transition support

Exit evidence:

  • contract notice and approval。
  • export manifest。
  • data return package。
  • deletion confirmation where applicable。
  • access revocation。
  • replacement validation。
  • residual risk memo。
  • final vendor dependency graph update。

16. Customer Communications

Customer communications are part of architecture execution because they affect adoption, support load, complaints and risk.

16.1 Communication Matrix

AudienceMessage focusEvidence
Retail customerswhat changes, when, alternative, supportapproved notice, send log, complaint monitor
SMB / commercial customersmigration steps, admin action, API or file changesaccount outreach, confirmation
Internal employeesnew workflow, support scripts, exception handlingtraining completion, readiness checklist
Partners / developersAPI version, sandbox, cutoff, test evidencedeveloper notice, certification
Operations / supportscripts, escalation, manual fallbackrunbook, QA sampling
Regulators / auditors if applicableevidence and decision trail through authorized channelsexam/audit response binder

16.2 Message Requirements

RequirementMeaning
Specificname the capability and dates clearly
Customer-centeredexplain service quality, consistency, security or product improvement, not internal cost alone
Alternative pathshow replacement, migration, exception or support
Accessibleconsider language, accessibility, digital exclusion and vulnerable customers
Traceableretain notice versions, approval, send logs and complaint response
Coordinatedsupport, branches, call center, relationship managers and digital channels aligned

Rationalization may involve deleting, archiving, migrating or freezing records. Product and architecture teams should not make legal conclusions.

17.1 Constraint Types

ConstraintPractical implication
Records retentionold outputs, customer communications, decisions, case notes, logs or model evidence may need retention
Legal holdroutine deletion or alteration may be suspended for scoped records
Regulatory inquiry / auditevidence must remain searchable, explainable and producible
Contractual recordspartner/customer contracts may define notice, data, audit or transition obligations
Privacy deletion requestmay conflict with retention or legal hold; requires authorized handling
Model governance evidenceevals, approvals, monitoring and incidents may need archive
Vendor data locationvendor-held records may need export, return or deletion confirmation

17.2 Records Gate

Before disposal or irreversible deletion:

CheckEvidence
Record classes identifiedrecord taxonomy and data map
Retention schedule mappedretention class and authority recorded
Active hold checkedhold register search and authorized sign-off
Export / archive completedarchive manifest, checksum, access controls
Vendor records addresseddata return/deletion certificate or exception
Disposal loggeddeletion/disposition certificate
Access retained for auditsread-only evidence binder or archive route

18. Decommission Evidence Binder

The evidence binder is the artifact that proves the decision and execution were controlled.

SectionContents
Executive decisiondecision type, owner, date, forum, rationale
Scopeproducts, capabilities, apps, APIs, models, tools, vendors
Evidenceusage, value, profitability, cost-to-serve, risk, architecture debt
Dependency graphimpacted customers, processes, systems, data, records, controls
AI analysisprompts/outputs or analysis method, source data, human review
Legal/records reviewreview status and constraints without making legal conclusions
Target architecturefuture-state diagram, owner, controls, SLO/SLA
Migration plancohorts, sequencing, tests, rollback
Communicationsapproved messages, audiences, send evidence, support scripts
Execution prooffeature flag, API route, model route, connector, access, vendor actions
Decommission certificatefinal status, exceptions, residual risk, sign-offs
Post-reviewbenefits, capacity release, incidents, complaints, lessons

19. Decommission Certificate Template

Decommission object:
Decision forum:
Decision date:
Accountable owner:
Execution owner:

Scope:
  Products:
  Capabilities:
  Applications:
  Features:
  APIs:
  Models/prompts/RAG/tools:
  Vendors:

Evidence reviewed:
  Usage:
  Customer value:
  Cost-to-serve:
  Profitability:
  Operational risk:
  Architecture debt:
  AI/model/vendor dependency:
  Records/legal-hold review status:

Migration:
  Target capability:
  Cohorts migrated:
  Exceptions:
  Back-out plan tested:

Communications:
  Customer/internal/API audiences:
  Approved notice version:
  Send/completion evidence:

Execution proof:
  Feature flags:
  API routes:
  Model routes:
  Tool permissions:
  Data/archive:
  Vendor access:

Residual risk:
  Open exceptions:
  Owner:
  Review date:

Closure:
  Final approval:
  Date:

20. Governance Model

20.1 Forums and Decisions

ForumOwnsDoes not own
Product Portfolio Councilportfolio strategy, value, funding, capacity, customer impact tradeoffsdetailed technical design alone
Architecture Review Boardtarget architecture, dependency risk, migration design, sunset controlsbusiness value decision alone
Risk / Compliance / Legal / Recordsconstraints, required reviews, risk acceptance path, retention/hold handlingproduct prioritization alone
Operations Readinesssupport, training, manual workaround, incident and rollbackstrategic product fit
Vendor / Procurementcontract notice, exit support, data return, deletion, renewalmodel behavior validation alone
Internal Audit / Assuranceevidence quality, control testability, decision trailmanagement decision itself

20.2 RACI

ActivityAccountableConsulted
Candidate identificationPortfolio leadAI analytics, EA, FinOps
Capability mappingBusiness architectPM, BA, domain SMEs
Dependency graphEnterprise architectapp owners, data, platform, security
ScorecardProduct portfolio councilfinance, risk, architecture, operations
Records/hold reviewRecords / Legal authorized rolesproduct, architecture, data owner
Target architectureArchitecture boardplatform, security, data, operations
Customer communicationProduct / channel ownerlegal, compliance, support, brand
API/model sunsetPlatform/API/model ownerconsumers, risk, security
ExecutionTechnical owner and operationsproduct, support, vendor
Closure reviewPortfolio leadaudit, finance, architecture, risk

21. Product Manager Incentives

Portfolio rationalization fails when PMs are rewarded only for launches.

21.1 Incentive Failures

Incentive failureResult
Feature count equals progressdead features accumulate
Local P&L onlyplatform, support and compliance costs are externalized
Launch celebrated, retirement ignoredlegacy never exits
Adoption without riskrisky high-use features appear successful
No credit for mergeteams defend duplicate capabilities
No capacity accountingreleased teams are immediately consumed by ad hoc work

21.2 Better PM Scorecard

MetricWhy it matters
Outcome per capabilityfocuses on customer/business result, not feature count
Cost-to-serve trendmakes operating burden visible
Duplicate capability reductionrewards simplification
Legacy exit completionprevents endless parallel run
Risk and complaint trendprevents cost-only shutdown
Platform reuse raterewards target architecture
Capacity reallocated to strategic betsproves decommission creates optionality
Evidence completenessmakes auditability a product responsibility

22. Implementation Guardrails

GuardrailPractical rule
No shutdown without ownerevery object has accountable business and technical owner
No irreversible deletion before records gateretention and hold review first
No API cutoff without consumer inventoryunknown traffic must be investigated
No model replacement without evalcompare quality, safety, cost, latency and evidence
No customer migration without support readinesssupport scripts, escalation and complaint monitoring required
No vendor exit without export testdata/config/log/evidence export must be validated
No hidden capacity claimreleased capacity must be measured and reallocated
No single-metric decisionusage, value, cost, risk, architecture and constraints all considered
No AI-only decisionAI recommendations require human review and governance sign-off

23. 30 / 60 / 90 Day Roadmap

First 30 Days: Build Evidence Foundation

WorkstreamDeliverable
Scopeselect 1-2 domains, such as customer service AI or onboarding workflow
Inventoryproduct/capability/application/API/model/vendor inventory
Metricsusage, cost-to-serve, incident, complaint, model call and support metrics
Dependency graphfirst graph with customer, product, app, API, model, data, vendor nodes
Governancedecision forums, RACI, scorecard draft
AI analysisfirst duplicate capability clustering with human review
Controlsrecords/legal-hold review path and decommission evidence template

Days 31-60: Decide and Design

WorkstreamDeliverable
Candidate scoringtop rationalization candidates with scorecard
Target architecturetarget capability owner, platform reuse, migration design
Cohort plancustomer/API/user migration cohorts and exception groups
Sunset controlsfeature/API/model/tool/vendor sunset runbooks
Communicationsinternal, customer, API consumer and support communication drafts
Evidencedecision packet for architecture board and portfolio council
Pilot migrationlow-risk cohort or internal-only capability migration

Days 61-90: Execute and Prove

WorkstreamDeliverable
Controlled migrationexecute first cohort and monitor
Decommissiondisable selected old feature/API/model/tool with rollback path
Evidence bindercompleted certificate and proof
Benefitscost, capacity, risk and customer impact measurement
Governance refinementupdate scorecard, gates, inventory fields and templates
Scale plannext rationalization wave and portfolio health metrics

24. Architecture Board Packet

SectionContent
Decision requestedkeep, merge, migrate, sunset, archive, replace
Business rationalecustomer value, profitability, cost-to-serve, strategic fit
Capability mapcurrent duplicate capabilities and target owner
Dependency graphimpacted products, apps, APIs, data, models, tools, vendors, controls
AI analysis summarysignal mining, clustering and human review notes
Risk assessmentoperational, model, security, privacy, vendor, customer harm
Records/legal-hold statusreview completed, constraints, exception handling
Target architecturefuture-state diagram and control points
Migration plancohorts, sequencing, parallel run, rollback
Customer/API communicationsaudience, timing, approved channels
Evidence planbinder, certificate, post-review metrics
Decision logapprovals, conditions, residual risks

25. Financial Retail Example: Onboarding AI Capability Consolidation

Current state:

  • Retail banking, credit card and small business teams each built an onboarding document AI assistant.
  • Capabilities overlap: document classification, missing-doc checklist, KYC policy retrieval, application summary.
  • Models differ, OCR vendors differ, RAG indexes differ, operations QA differs.
  • Customers see inconsistent missing-document reasons.
  • Model and vendor cost is rising, while support agents still manually reconcile outputs.

Rationalization finding:

DimensionFinding
Customer valuevalue is high, but fragmentation hurts consistency
Profitabilityduplicate OCR/model/vendor spend creates avoidable cost
Operational riskinconsistent KYC policy interpretation and manual rework
Architecture debtseparate indexes, prompts, trace schemas and workflow connectors
Model/tool dependencymultiple vendors with different evidence exports
Records constraintapplication documents, decision notes and customer communications need mapped retention
Migration pathshared onboarding AI platform with product-specific policy profiles

Decision:

  • Keep onboarding AI as a strategic capability。
  • Merge duplicate document classification and checklist generation。
  • Migrate to common model gateway, OCR abstraction, RAG pipeline, eval harness and trace schema。
  • Preserve product-specific rules, customer disclosures, risk thresholds and operations queues。
  • Sunset old APIs after partner and internal consumer migration。
  • Archive old outputs and eval evidence according to records review。

Target architecture:

Digital onboarding channels
  -> Onboarding AI capability service
  -> Product policy profile
  -> OCR / document extraction abstraction
  -> KYC / credit / SMB RAG sources
  -> Model gateway and eval harness
  -> Workflow and case system connectors
  -> Evidence and records layer

26. Interview Answers

Q1: How would you rationalize an AI product portfolio in a bank or fintech?

30 秒版本:

I would not start with an application list. I would build a capability and dependency graph across products, applications, APIs, models, prompts, RAG indexes, tools and vendors. Then I would score each candidate by customer value, profitability, cost-to-serve, operational risk, architecture debt, AI dependency health, vendor risk, records/legal-hold constraints and migration feasibility. AI can mine signals and propose clusters, but accountable governance decides customer-impacting shutdowns.

2 分钟版本:

In a financial retail environment, the biggest risk is shutting down the wrong thing because usage looks low or cost looks high. I start with business capabilities and customer journeys, then connect them to applications, APIs, data stores, records, models, prompts, tools and vendors. I use AI to mine usage, token cost, support tickets, incidents, complaints, model eval failures and architecture metadata, then cluster redundant capabilities. But every cluster is reviewed by product, architecture, operations, risk, legal/records and finance. The decision may be invest, merge, migrate, freeze, sunset or retain as a control. For execution, I require migration cohorts, customer/API communication, records/legal-hold review, rollback, decommission certificate and post-review benefits tracking.

Q2: What is the difference between application retirement and capability decommission?

30 秒版本:

Application retirement is about shutting down a system. Capability decommission is about removing, merging or migrating a business ability across products, processes, systems, APIs, models, tools, vendors and operating teams. In AI, capability decommission is broader because prompts, models, RAG indexes, evals and evidence may outlive the application.

2 分钟版本:

For example, retiring an old case management app does not necessarily retire the capability of complaint triage. The capability may move to a shared workflow service with AI summarization, policy retrieval and human review. I need to map customer journeys, record classes, API consumers, model routes, prompt versions, tool permissions and vendor contracts. Only after migration, evidence preservation and support readiness can the old app be technically retired. This is why capability decommission belongs in enterprise architecture, not only IT operations.

Q3: What role should AI play in decommission decisions?

30 秒版本:

AI should accelerate evidence work: mining usage, cost and risk signals; clustering overlapping capabilities; enriching dependency graphs; proposing migration cohorts; and drafting sunset plans. AI should not own customer-impacting shutdown decisions, legal-hold decisions, records disposal or risk acceptance.

2 分钟版本:

I would use AI as an analyst and drafting assistant. It can read product descriptions, API specs, support tickets, incident records and cost data to identify likely duplicate capabilities and high-cost low-value candidates. It can also propose migration cohorts based on usage and risk. But there must be traceability back to source evidence, human review for false positives and governance sign-off. The final decision affects customers, contracts, operations, records and risk appetite, so it belongs to accountable product and architecture governance with legal, compliance, records and risk partners.

Q4: How do you avoid harming customers during product sunset?

30 秒版本:

I use cohort migration, clear alternatives, support readiness, approved communications, complaint monitoring and rollback. I also check long-tail, vulnerable, accessibility-sensitive, high-value and contractual customers separately instead of using average usage metrics.

2 分钟版本:

I segment customers by activity, dependency, risk, value, accessibility, contract and support needs. Low-risk internal users may migrate first; high-value or vulnerable customers get enhanced support; API partners get a test window and certification. Every communication explains what changes, when it changes, what alternative exists and where to get support. I monitor complaints, failed attempts, support volume, conversion and incidents during migration. If thresholds are breached, the back-out plan pauses or reverses the migration.

Q5: What evidence would you show an architecture board before decommission?

30 秒版本:

I would show the capability map, dependency graph, usage/value/cost/risk scorecard, target architecture, model/tool/vendor dependency assessment, records/legal-hold review status, migration cohorts, API/customer communication plan, rollback plan and decommission certificate template.

2 分钟版本:

The board needs to see both why and how. Why: customer value, profitability, cost-to-serve, operational risk, architecture debt and strategic fit. How: impacted products, apps, APIs, data stores, records, models, prompts, RAG indexes, tools, vendors and teams. I would include controls for feature flags, API versioning, model route changes, tool permission revocation, archive and disposal evidence. I would also include residual risk owners and post-decommission metrics so the decision is not just a one-time approval.


27. Quick Reference Checklist

AreaCheck
InventoryProduct, capability, app, API, model, prompt, RAG, tool, vendor mapped
EvidenceUsage, value, cost, risk, profitability and debt collected
AI analysisClusters and recommendations traceable to source data
GovernanceDecision forum, accountable owner and RACI clear
ArchitectureTarget state and dependency graph reviewed
RecordsRetention and legal-hold review path completed before disposal
MigrationCohorts, tests, support and rollback defined
CommunicationsCustomer, internal and API notices approved and retained
ExecutionFeature/API/model/tool/vendor controls executed with proof
ClosureCertificate, evidence binder and benefits realization completed

Final mental model:

Inventory tells you what exists.
Signals tell you what hurts.
Capability mapping tells you what overlaps.
Architecture tells you what should replace it.
Governance tells you who may decide.
Migration tells you how to avoid harm.
Evidence tells you whether the decision was controlled.