返回 Papers
AI 扩展计划 / Playbooks

AI Requirements Mining / Process Knowledge Extraction Playbook

本文是学习、作品集、架构训练和内部治理讨论材料, 不构成法律意见、合规结论、监管解释、审计意见、记录保留结论、模型验证报告或供应商建议。正式项目必须由 Legal、Compliance、Privacy、Records Management、Information Security、Model Risk、Operational Risk、Internal Audit、Business Owner、D

712AI_REQUIREMENTS_MINING_PROCESS_KNOWLEDGE_EXTRACTION_PLAYBOOK.md

AI Requirements Mining / Process Knowledge Extraction Playbook

定位: 面向 CBAP+ Senior BA、AI PM、Product Architect、Enterprise Architect、Process Owner、EvalOps、Risk/Control Partner 的金融零售 AI 需求挖掘与流程知识抽取实战手册。 核心目标: 把分散在 PRD、BRD、SOP、政策、工单、Jira/Azure DevOps、通话转写、会议纪要、流程图、代码/API 规格、测试用例、生产日志和控制证据中的知识, 转成可追溯、可评估、可验证、可治理、可复用的 requirement and process knowledge assets。 核心观点: AI requirements mining 不是 BA replacement, 而是 evidence-grade BA operating system。AI 负责扩大证据面、发现冲突和生成候选资产; 人负责解释、取舍、授权、治理和上线责任。


0. Disclaimer

本文是学习、作品集、架构训练和内部治理讨论材料, 不构成法律意见、合规结论、监管解释、审计意见、记录保留结论、模型验证报告或供应商建议。正式项目必须由 Legal、Compliance、Privacy、Records Management、Information Security、Model Risk、Operational Risk、Internal Audit、Business Owner、Data Owner、Architecture、Engineering 和相关流程 owner 共同确认。


1. Source Anchors

AnchorOfficial link本 playbook 使用方式
ISO/IEC/IEEE 29148 Requirements Engineeringhttps://www.iso.org/standard/72089.htmlhttps://standards.ieee.org/ieee/29148/6937/作为需求质量、生命周期、traceability 和 stakeholder need 管理的标准化锚点。
FFIEC Development, Acquisition, and Maintenance IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/development-acquisition-and-maintenance/约束 AI 需求挖掘输出如何进入开发、采购、测试、实施、维护和变更控制。
FFIEC Management IT Handbookhttps://ithandbook.ffiec.gov/it-booklets/management/组织治理、风险管理、架构、资源、第三方和监督责任。
NIST SP 800-160 Vol. 1https://csrc.nist.gov/pubs/sp/800/160/v1/upd2/final把 stakeholder protection needs、security、resilience、assurance 纳入需求抽取和架构评审。
NIST SP 800-218 SSDFhttps://csrc.nist.gov/pubs/sp/800/218/final将代码/API/测试资产挖掘连接到安全软件开发、漏洞响应和 release evidence。
NIST AI Risk Management Frameworkhttps://www.nist.gov/itl/ai-risk-management-framework用 Govern / Map / Measure / Manage 设计 AI 风险识别、评估、门禁和持续改进。
ISO/IEC 42001 AI Management Systemhttps://www.iso.org/standard/81230.html用 AI management system 语言设计 policy、role、operation、performance evaluation、internal audit 和 improvement。
IIBA / BABOK public professional pagehttps://www.iiba.org/professional-development/knowledge-centre/business-analysis-body-of-knowledge/仅作为业务分析专业体系锚点; 不复述或替代 BABOK 受版权保护内容。

Source-to-artifact pattern:

official source anchor
  -> governance principle
  -> mining requirement
  -> architecture control
  -> evidence artifact
  -> owner and metric

2. Executive Framing

高管或业务方常见表达:

我们有很多 PRD、SOP 和 ticket, 能不能让 AI 自动生成需求?
我们想把会议记录和客服通话转成 backlog。
我们能不能让 AI 看代码和测试用例, 反推出系统需求?

高级改写:

Build an evidence-grade requirements and process knowledge extraction capability
that mines candidate needs, rules, events, controls, acceptance criteria and impact links
from governed artifacts,
filters by source authority and permissions,
routes ambiguity to SMEs,
and learns from production feedback without turning AI drafts into approved requirements.

Steering committee questions:

  1. 哪些 artifact 是权威来源, 哪些只是 pain signal 或 discussion evidence?
  2. AI 输出进入 baseline 前由谁验证、用什么 rubric、保留什么证据?
  3. 如何防止越权检索、过期政策引用、PII 泄露和记录处置失控?
  4. 如何把 mined requirements 连接到 process、API、data、test、control、eval 和 release?
  5. 如何从 production logs、QA、complaints、incidents 和 human overrides 回流到 portfolio learning?

3. Use Case Boundary

AI requirements mining 适合的任务:

Use caseGood fitBoundary
Requirements discovery从多源材料中发现候选需求、冲突、遗漏、重复不自动进入 approved baseline
Process knowledge extraction从 SOP、BPMN、logs、tickets 中抽 activity、event、role、handoff、variant不把日志行为直接等同于应然流程
Acceptance criteria drafting根据需求和测试资产生成验收候选高影响场景必须 SME/QA 验证
Change impact analysis政策/API/流程/控制变化后找影响面影响结论必须由 owner 确认
Control linkage把 policy/control/test evidence 连接到需求不给法律或合规适用性结论
Portfolio learning沉淀复用词汇、模式、eval cases、anti-patterns不用未授权 records 或 PII 做无边界训练

不适合直接交给 AI 的任务:

TaskReason
最终 scope tradeoff涉及商业优先级、资源、风险接受和战略选择
法律/监管解释需要授权职能结合具体事实和管辖范围判断
高影响客户决策需要授权、控制、解释、申诉和人工责任
records retention / legal hold 结论需要 Records/Legal/Compliance 决策
模型验证结论需要独立模型风险和验证程序

4. Target Operating Model

Business / Product Owner
  owns outcome, priority, scope, baseline decision

Senior BA / Requirements Architect
  owns mining taxonomy, ambiguity workflow, quality rubric, traceability graph

Process Owner / SME
  validates process activities, exceptions, variants and operating feasibility

Architecture / Engineering
  validates API, data, system, security, performance and integration impact

Risk / Compliance / Control Owner
  validates policy/control linkage, risk tier, approval boundary and evidence need

Privacy / Records / Legal
  validates data use, records class, hold propagation, access and retention controls

QA / EvalOps
  converts mined requirements to tests, eval contracts, thresholds and regression gates

AI Platform / Data Engineering
  operates ingestion, retrieval, permission filter, graph, model versioning and monitoring

RACI snapshot:

ActivityPMBAArchitectSMERisk/ControlPrivacy/RecordsQA/EvalOps
Source inventoryARCCCCC
Authority classificationARCCCCC
Extraction rubricCA/RCCCCR
Requirement validationARCRCCC
Risk tieringARCCA/RCC
Eval contractCRCCCCA/R
Release gate evidenceARRCCCR
Portfolio learningARCCCCR

5. Implementation Architecture

Connectors
  Confluence / SharePoint / Docs / Jira / Azure DevOps / Git / API gateway
  Contact center / CRM / case management / log platform / GRC / test management
        |
        v
Governed ingestion
  artifact id | source type | owner | approval status | version | hash
  effective date | data class | record class | permission tag | legal hold flag
        |
        v
Pre-processing
  parsing | OCR/layout | transcript diarization | code/API parsing | test extraction
  log event mapping | PII redaction | chunking | metadata enrichment
        |
        v
Knowledge layer
  domain vocabulary | process ontology | authority ladder | policy/control map
  requirement graph | event graph | system/API graph | test/eval graph
        |
        v
AI extraction and reasoning services
  candidate requirements | ambiguity | conflict | duplicate | process events
  stakeholder concerns | evidence standards | acceptance criteria | impact links
        |
        v
Human validation workbench
  side-by-side source evidence | approve/reject/merge/split/escalate
  reason codes | SME comments | decision record | audit trail
        |
        v
Delivery and governance
  backlog sync | requirement baseline | eval contract | test generation
  architecture review | control evidence | release gate | learning loop

Architecture non-negotiables:

Non-negotiableWhy
权限先于检索防止用户通过 AI 摘要看到无权材料
artifact hash and version支持复现、审计和变更影响
authority metadata防止 ticket、会议纪要和 AI draft 覆盖正式政策
structured output schema防止顺滑文本掩盖冲突和不确定性
SME decision logAI 只生成候选, 人负责授权
eval contract需求挖掘能力本身也要被评估和门禁
graph traceability支持跨需求、流程、系统、测试、控制、release 的 impact analysis

6. Source Intake Checklist

SourceRequired metadataExtraction focusKey risk
PRDowner, version, status, target release, approvalfeature, persona, metric, scope, assumptionsolution bias
BRDbusiness owner, benefit baseline, decision dateoutcome, stakeholder need, policy constraintvague benefit
SOPprocess owner, effective date, retired statusactivity, role, SLA, exception, evidencestale process
Policy/controlpolicy owner, effective date, scope, control idobligation, allowed/prohibited action, approvalmisinterpretation
Ticketsseverity, product, status, linked incident, resolutionpain, defect, workaround, frequencyduplicate noise
Jira/Azure DevOpsworkflow status, links, sprint/release, acceptance criteriabacklog, dependency, test/release linksweak traceability
Transcriptsconsent/notice, channel, QA score, redactionintent, friction, agent action, complaint signalPII and transcription error
Meeting notesattendees, roles, decision status, follow-updecision, assumption, open issuenon-authoritative
Process mapsversion, notation, owner, scopeintended flow, roles, controls, SLAidealized flow
Code/API specsrepo/version, endpoint, owner, deploymentactual behavior, contract, validation, errorcode as false policy
Test casestest owner, result, requirement link, coverageexpected behavior, edge case, regressionhappy-path bias
Logsevent schema, retention, sampling, data classvariant, latency, failure, handoff, outcomemissing business semantics
Controlscontrol owner, frequency, test result, issue linkcontrol objective, evidence, remediationcontrol/product disconnect

Intake gate:

No artifact enters the mining corpus without owner, version, permission tag and source class.
No restricted source enters AI processing without approved purpose and redaction path.

7. Decision Tables

7.1 Should this source be used?

ConditionDecision
Approved source, current version, clear owner, permission scopedUse for extraction and baseline evidence
Approved source but expired or supersededUse only for historical change impact
Operational source with high frequency pain signalUse for discovery, not baseline
Meeting note with unapproved decisionUse as clarification prompt and decision candidate
Artifact contains restricted data beyond purposeExclude or redact before indexing
Source owner unknownQuarantine until ownership is established

7.2 Can AI output move to backlog?

Requirement candidate conditionBacklog action
Grounded, no conflict, quality score >= 4, SME approvedCreate backlog item with source links
Grounded but ambiguousCreate clarification task, not delivery story
Conflict between policy and operational practiceCreate issue/decision item, not feature story
High-impact AI behavior without eval contractBlock from release backlog
Source is only ticket/transcriptConvert to problem statement or pain cluster
Derived from code/test onlyMark as actual behavior candidate and request owner decision

7.3 What level of human review is required?

Risk tierExampleReview requirement
Lowinternal UI label, non-material routing hintBA review and sampling
Mediumemployee workflow recommendation, non-customer-impact fieldSME approval and QA test
Highcustomer money/access/eligibility, complaint/dispute, regulated communicationProduct, Risk/Control, SME, QA/EvalOps approval
Restrictedlegal hold, sensitive identity, fraud, vulnerability, privileged tool actionSpecialized owner review and documented gate

7.4 Which intervention is correct?

DiscoveryLikely action
Repeated missing information in transcripts and ticketsImprove intake form, validation and customer guidance
High variant count from production logsProcess governance before AI automation
Conflict between SOP and API behaviorArchitecture impact assessment and remediation backlog
Policy ambiguity across teamsPolicy interpretation workflow and vocabulary update
Manual copy/paste between systemsIntegration/API/RPA/agent tool opportunity
Low requirement quality from PRDsProduct operating model improvement and template change
High hallucination in evidence citationRetrieval authority filter and citation eval redesign

8. Extraction Prompt Contract

AI extraction prompts should be treated as controlled product artifacts. A good extraction contract says what to extract, what not to infer, and how to expose uncertainty.

Required output schema:

{
  "candidate_id": "string",
  "candidate_type": "business_requirement | stakeholder_requirement | solution_requirement | transition_requirement | control_requirement | data_requirement | process_rule | acceptance_criterion | risk_issue",
  "statement": "string",
  "source_refs": [
    {
      "artifact_id": "string",
      "location": "section/page/span/event_id",
      "authority_level": "A1|A2|A3|A4|A5|A6",
      "effective_date": "YYYY-MM-DD"
    }
  ],
  "known_facts": ["string"],
  "unknowns": ["string"],
  "ambiguity_flags": ["actor_unknown", "decision_boundary_unknown", "data_scope_unknown", "control_owner_missing"],
  "conflicts": ["string"],
  "quality_score": 0,
  "recommended_acceptance_criteria": ["string"],
  "validation_owner": "role",
  "risk_tier": "low|medium|high|restricted"
}

Prompt rules:

RuleRationale
Do not invent missing business rules缺证据必须标 unknown
Preserve source authority不同来源不能被平均化
Separate current behavior from desired behavior代码和日志代表事实, 不代表应然
Produce questions, not false certainty模糊需求需要澄清
Cite exact source spans支持 SME 快速验证
Flag policy/control conflicts高级 BA 的价值在冲突发现
Avoid customer commitmentsmined output 不能成为客户可见承诺

9. Requirement Quality Gate

Gate checklist

GatePass signal
Source grounded每个 statement 有 artifact、location、version、owner
Authority clearsource level and conflict policy visible
Actor clearcustomer, employee, system, team, approver 不混淆
Decision boundary clearread/summarize/recommend/draft/decide/act 已区分
Data boundary clearsource fields、purpose、permission、retention 已定义
Control linkage clearapproval、dual control、review、audit evidence 已连接
Acceptance testablepositive、negative、edge case 和 evidence requirement 已写
Eval readydataset、rubric、threshold、critical failure、slice 已定义
Change impact traceableprocess、API、test、control、release links 存在
Owner accountablebusiness owner、SME、architect、QA/EvalOps owner 清楚

Scoring interpretation:

ScoreMeaningAllowed action
0wrong or unsupportedreject and log reason
1discovery notekeep in evidence cluster
2grounded but incompletesend to clarification
3clear but not testable/control-linkedimprove before backlog
4backlog-ready candidatecreate item with source links
5baseline-ready for high-impact userelease gate eligible after eval

10. Traceability Graph Play

Build the graph in layers

LayerNodesEdges
Strategyoutcome, KPI, benefit hypothesis, risk appetitejustifies, constrains
Stakeholderrole, need, concern, decision rightowns, approves, challenges
Requirementcandidate, baseline, acceptance criteriaderives_from, verifies
Processactivity, event, variant, handoff, exceptionprecedes, deviates_from, controls
SystemAPI, data object, service, UI, code ruleimplements, depends_on
Qualitytest case, eval case, rubric, thresholdverifies, blocks
Controlpolicy, control objective, evidence, issueconstrains, monitors
Deliverybacklog, release, change request, incidentdelivers, remediates

Minimum queries

QueryWhy it matters
Show all requirements derived from retired SOP sections防止过期来源继续驱动 backlog
Show requirements without acceptance criteria找不可验收需求
Show high-risk requirements without eval contract找上线阻断项
Show policy changes impacting AI prompts or RAG corpus防止过期政策输出
Show API schema changes impacting controls and tests支持 release impact review
Show tickets repeatedly linked to rejected requirements识别真实 pain 但方案不对
Show production variants not covered by SOP识别流程治理机会

11. Process Variant Discovery Play

输入:

case_id, activity, timestamp, resource, lifecycle, channel, product,
risk_tier, amount_band, status, outcome, source_system

步骤:

  1. 定义 case 粒度: application、dispute、alert、ticket、complaint、service request。
  2. 标准化 activity: 避免把状态码直接当业务活动。
  3. 生成 top variants: 找覆盖 80% 体量的主要路径和高风险长尾。
  4. 标记 rework、waiting、handoff、skip、loop、override。
  5. 与 SOP/BPMN/control path 对齐, 区分 acceptable exception、control gap、data noise。
  6. 生成 AI opportunity candidates: summarize、route、draft、validate、retrieve、recommend、tool action。
  7. 将每个机会连接到 requirement、acceptance criteria、control 和 eval。

Variant interpretation table:

FindingProduct implicationControl implication
主路径覆盖低不宜直接自动化, 先治理流程和 taxonomy例外处理和控制路径需补齐
高 rework改进资料收集、校验、政策解释监控返工原因和 customer harm
多团队 handoffAI handoff summary 或队列路由责任和 evidence transfer 要清楚
控制步骤被跳过阻断上线, 先修复流程/权限control issue and remediation
高等待来自外部资料客户/第三方提醒和 SLA 管理记录通知和暂停计时逻辑
override 集中在某团队policy ambiguity 或 training gapdual control / QA sampling

12. Eval Contract for Mining System

The mining system itself must be evaluated before teams rely on it.

Eval areaMetricRelease threshold idea
Requirement extraction precisionAI candidates accepted as valid by SMEhigh enough by source class, no critical false positives
Critical recallmust-have policy/control/exception requirements foundzero missed critical control in golden set
Groundednessstatements fully supported by source refsunsupported material claim = release blocker
Authority classificationsource authority correctly rankedno A4/A5 source overriding A1/A2
Ambiguity detectionrequired clarifications correctly flaggedhigh-risk ambiguity miss = blocker
Conflict detectionknown conflicts identifiedpolicy/SOP/log conflict misses reviewed
Permission safetyno unauthorized source leakagezero leakage in red-team tests
Output schema validitymachine-readable structured outputnear-perfect schema compliance
SME efficiencyreview time per candidateimproves without lowering quality
Change impact qualityimpacted systems/tests/controls foundvalidated against known changes

Critical failures:

  • hallucinated source citation
  • unauthorized PII or restricted source in output
  • policy/control requirement missed in high-impact workflow
  • low-authority source treated as approved baseline
  • AI-generated customer commitment
  • hidden conflict between source materials
  • output enters backlog without validation evidence

13. Evidence and Control Checklist

Pre-launch

Control areaEvidence
Source governanceinventory, owner, version, permission, retention, record class
Data protectionDPIA/privacy review where applicable, redaction rules, access matrix
Recordsrecord class, legal hold propagation, derived artifact retention
Securityconnector entitlement, secrets handling, audit logging, vendor boundary
Model governancemodel card, prompt version, eval results, limitations
EvalOpsgolden set, rubric, thresholds, critical failures, independent review
SME operationsreviewer guide, decision codes, escalation path
Traceabilitygraph schema, source-to-requirement links, impact queries
Releasego/no-go memo, exceptions, risk acceptance record

Production

Control areaEvidence
Usage monitoringwho mined what, source classes, exports, backlog sync
Quality monitoringacceptance rate, reject reasons, ambiguity density, conflict misses
Permission monitoringdenied retrievals, redaction events, suspicious access
Drift monitoringsource freshness, vocabulary drift, new ticket clusters
Change monitoringpolicy/API/SOP/model/prompt changes and regression eval
Incident handlingleakage, hallucination, wrong baseline, control miss, remediation
Portfolio learningreusable patterns, updated rubrics, added eval cases

14. 30 / 60 / 90 Roadmap

First 30 days: controlled discovery

WorkstreamOutput
Select domainone workflow, e.g., payment dispute, KYC onboarding, fee servicing, AML alert triage
Inventory sourcesPRD/BRD/SOP/policy/tickets/transcripts/process maps/tests/logs/control evidence
Define authority laddersource classes, approval status, conflict rules
Define vocabularykey terms, role names, activity taxonomy, forbidden ambiguous terms
Build small corpuspermission-filtered, redacted, versioned artifact set
Design rubricquality score, ambiguity flags, conflict categories
Create golden setSME-labeled requirements, controls, events, conflicts
Run pilot extractioncandidates, source refs, ambiguity questions, initial graph

Exit criteria:

The team can show source-backed candidates, rejected examples, ambiguity log,
and at least one requirement-to-test-to-control trace for the selected workflow.

Days 31-60: graph, eval and SME workflow

WorkstreamOutput
Traceability graphoutcome -> requirement -> process -> API/data -> test -> control
SME workbenchapprove/reject/merge/split/escalate with reason codes
Eval contractdataset, rubric, slices, thresholds, critical failures
Process mining linktop variants, rework, handoff, waiting, control gap
Backlog integrationonly approved candidates sync to Jira/Azure DevOps
Change impact queriespolicy/API/SOP/test changes show impacted assets
Control packpermission audit, evidence pack, release gate memo

Exit criteria:

The mining system can pass golden-set eval, route ambiguous outputs to SMEs,
and create backlog items only with source refs, quality score and validation evidence.

Days 61-90: production pilot and portfolio learning

WorkstreamOutput
Production pilotlimited users, limited corpus, high audit logging
Monitoringquality, permission, source freshness, SME decisions, backlog conversion
Regression evaltriggered by policy/SOP/API/model/prompt/corpus changes
Incident drillhallucinated source, permission leak, wrong baseline, missed control
Portfolio pattern libraryreusable requirement patterns, acceptance criteria, eval cases
Operating modelRACI, governance cadence, funding and scaling decision
Executive reviewvalue evidence, risk issues, expansion roadmap

Exit criteria:

The organization can demonstrate faster discovery, better traceability,
measurable SME productivity, controlled risk, and reusable portfolio assets.

15. Implementation Guardrails

GuardrailPractical rule
Start with one workflow跨企业全量挖掘会放大权限、词汇和评审问题
Keep source status visible每个答案显示 approved/current/expired/draft/source class
Use negative exampleseval 集必须包含过期 SOP、冲突政策、错误 ticket、转写错误
Separate discover vs decidediscovery workspace 与 baseline repository 分离
Red-team permission尝试让用户通过摘要推断无权内容
Calibrate by source classticket、policy、test、log 的 precision/recall 分开看
Monitor reviewer burdenAI 生成太多低质候选会伤害 SME 信任
Keep records artifacts governedembedding、summary、graph nodes、review notes 都可能成为 derived artifacts
Make rollback possible错误同步 backlog 或 baseline 时能追踪并撤销
Treat prompts as changeable codeprompt/corpus/model 变更触发回归 eval

16. Metrics

Business and delivery metrics

MetricMeaning
discovery cycle time reduction从 source intake 到 validated candidate 的时间
validated candidate yield每 100 个 artifact 产生的高质量需求数
duplicate reduction合并重复 ticket/story/requirement 的比例
clarification throughputambiguity 从发现到关闭的时间
backlog quality liftapproved story 的 source refs、AC、test link 完整度提升
change impact lead time变更影响分析时间
reuse ratepattern、AC、eval case、vocabulary 的复用比例

Risk and quality metrics

MetricMeaning
unsupported claim rateAI 输出无来源支持的比例
wrong authority rate权威等级识别错误或低权威覆盖高权威
critical recall miss漏掉高影响政策/控制/例外
permission leakage rate未授权信息在输出中出现
ambiguity miss rate人工发现但 AI 未标注的关键模糊点
conflict miss rate已知冲突未识别
SME disagreement rateSME 对输出解释不一致
production feedback incorporationincident/QA/ticket 回流 eval 的速度

17. Anti-Patterns and Repairs

Anti-patternSymptomRepair
“AI 生成 user story 工厂”backlog 变多, 质量更差强制 source_refs、quality score、SME approval
“一个向量库装所有文档”权限、版本、记录边界失控source registry + retrieval-time ACL + corpus partition
“只看文档不看日志”自动化理想流程, 忽略真实变体connect event logs and process mining
“只看 ticket 排优先级”高频噪声盖过高风险需求severity、journey、control、value weighted scoring
“让 AI 消除冲突”输出顺滑但错误show conflicts and route to owner
“代码就是需求”历史缺陷被产品化actual behavior vs desired requirement 分离
“eval 只测摘要质量”需求挖掘错误进 backlogtest extraction, authority, conflict, permission and impact
“SME review 无结构”审过但不可复用reason codes and decision log
“不治理 derived artifacts”summary、embedding、graph node 记录风险records/privacy controls for all derived artifacts
“pilot 后不学习”每个团队重复踩坑portfolio pattern library and eval expansion

18. Artifact Templates

18.1 Mining intake brief

workflow:
business owner:
process owner:
primary outcome:
risk tier:
source classes included:
source classes excluded:
permission model:
record classes:
SME reviewers:
quality rubric:
eval dataset:
release decision owner:

18.2 Requirement candidate review card

candidate statement:
source refs:
authority level:
known facts:
unknowns:
ambiguity flags:
conflicts:
risk tier:
quality score:
suggested acceptance criteria:
linked process activity:
linked API/data/test/control:
review decision:
reason code:
owner:

18.3 Change impact memo

changed artifact:
change type:
effective date:
impacted requirements:
impacted workflows:
impacted APIs/data objects:
impacted tests/evals:
impacted controls/records:
required approvals:
release implication:
monitoring update:

19. Interview Answers

Q1: 你如何设计 AI requirements mining 的端到端架构?

30 秒回答:

我会从 governed ingestion 开始, 对 PRD、SOP、policy、tickets、transcripts、Jira、code/API、tests、logs 和 controls 做 source inventory、authority classification、permission filtering 和 versioning。然后用 domain vocabulary 和 process ontology 做结构化抽取, 生成 requirement candidate、process event、stakeholder concern、control link、acceptance criteria 和 impact links。所有输出进入 traceability graph, 经 quality rubric、eval contract 和 SME validation 后, 才能进入 backlog 或 baseline。

Q2: 如何证明这不是简单的 RAG 总结?

RAG 总结只回答“这些文档说了什么”。requirements mining 要回答“哪些内容可以成为需求、依据是什么、权威级别如何、哪里冲突、哪里模糊、谁验证、怎么测试、影响哪些系统和控制、上线后怎么监控”。核心资产不是摘要, 而是 source-backed graph、quality score、eval contract、SME decision log 和 change impact evidence。

Q3: 如何避免 AI 自动替代 BA?

我会在流程上把 AI 输出固定为 candidate, 在系统上把 discovery workspace 与 baseline repository 分离, 在治理上要求 source_refs、quality score、SME approval 和 decision record。BA 的价值从写初稿升级为 evidence orchestration、ambiguity resolution、stakeholder negotiation、scope tradeoff、traceability design 和 release evidence governance。

Q4: 从通话转写和会议纪要挖需求时最大的风险是什么?

最大风险是把非权威、含噪声、含 PII 或上下文不完整的语言直接变成需求。转写可能错, 客户表达可能是情绪或投诉, 会议纪要可能不是批准决定。所以这些来源默认只能作为 pain signal、clarification input 或 decision candidate。进入 baseline 需要更高权威来源或 owner 追认。

Q5: 如何处理政策、SOP 和生产日志之间的冲突?

我不会让 AI 平滑合并冲突。政策和控制是约束, SOP 是 intended process, 生产日志是真实行为。冲突本身是高价值发现: 可能是流程绕行、控制缺口、SOP 过期、系统缺陷或政策解释不清。系统要显示 conflict graph, 指派 owner, 形成 decision 或 remediation item。

Q6: 你如何把 mined requirements 连接到 eval?

每个 AI 相关 requirement 必须有 acceptance criteria 和 eval contract。eval contract 定义 dataset、source scope、rubric、threshold、critical failures、slice、evaluator 和 go/no-go 规则。例如对于政策问答类需求, 不是只测准确率, 还要测 groundedness、authority awareness、unsupported claim、permission leakage 和 high-risk ambiguity detection。

Q7: 生产日志在这个 playbook 中扮演什么角色?

生产日志是 process truth 的关键来源。它帮助发现真实 variant、handoff、waiting、rework、override 和 control execution。文档挖掘告诉我们设计意图, 日志挖掘告诉我们真实行为。两者结合才能决定该做 AI assistant、流程重构、数据质量改进、API 集成, 还是控制修复。

Q8: 如何在金融机构控制隐私和 records 风险?

我会实施 permission-before-retrieval, 对 source、chunk、graph edge 和输出层做权限过滤; 对 PII 做 redaction 或 purpose-bound access; 对 raw artifacts、chunks、embeddings、summaries、review notes 和 graph nodes 设计 retention、legal hold propagation 和 audit log。AI 输出不能成为绕过 records 和 privacy controls 的旁路。

Q9: 什么是好的 90 天落地路径?

前 30 天只做一个 workflow 的 controlled discovery, 建 source inventory、authority ladder、vocabulary、golden set 和 extraction rubric。31-60 天建设 traceability graph、SME workbench、eval contract、process variant link 和 backlog integration。61-90 天做 limited production pilot, 加监控、regression eval、incident drill 和 portfolio learning。扩张前必须证明质量、权限和 SME 效率都可控。

Q10: 这个能力如何形成作品集亮点?

我会展示五类资产: source authority model、requirement quality rubric、traceability graph schema、eval contract 和 30/60/90 rollout plan。再用一个金融零售案例说明如何从 SOP、ticket、transcript、API、test 和 log 中提取需求, 如何发现冲突, 如何进入 SME validation, 如何把结果连接到 backlog、eval、control 和 release gate。这比展示“AI 自动写 user story”更能体现 senior PM/BA/Architect 能力。


20. Portfolio Deliverables

建议沉淀为作品集包:

DeliverableDemonstrates
Source authority and permission modelgovernance-grade thinking
Domain vocabulary and ambiguity taxonomyCBAP+ semantic discipline
Requirement quality rubricquality and baseline control
Traceability graph schemaarchitecture and impact analysis maturity
Eval contract for miningAI assurance and EvalOps ability
SME validation workflowoperating model design
Process variant discovery reportprocess intelligence and evidence-based prioritization
Change impact memorelease and control governance
Privacy/records checklistregulated environment awareness
30/60/90 roadmapexecution leadership

Final operating principle:

Mine broadly, trust narrowly, validate explicitly, trace everything,
and let production evidence improve the portfolio.