prioritization-frameworkdraft

Prioritization: Curator MVP / pilot scope

Applicability Filter Summary

  • Ran: ICE (estimates feasible at concept stage), MoSCoW (clear pilot scoping).
  • Excluded: RICE — Reach in users/qtr is unknown at concept stage (would be fabricated); revisit post-pilot. Kano — no customer survey yet. Weighted Scoring — ICE suffices.

Inputs Summary

Items derive from the opportunity tree (07). No real reach/effort data; ICE scores are judgment-based estimates flagged Low confidence. Items: - A reproducible analysis pipeline (1A) - B evidence-linked conclusions (1B) - C editable draft summary (2A) - D transcript + speaker separation (2B) - E client dossier (3A) - F commitment + micro-tracking (3B) - G consent + depersonalisation (4A) - H out-of-scope alerting (4B) - I live in-session suggestions (parking lot)

Per-Framework Scoring

ICE (Impact × Confidence × Ease, 1–10; estimates)

Item Impact Confidence Ease ICE Notes
G consent/depersonalisation 9 7 6 378 Gating prereq for any external LLM
D transcript+diarization 8 8 7 448 Integrate, not invent
C editable draft summary 8 6 7 336 Cheap, high felt value
A reproducible pipeline 10 4 3 120 The moat, but riskiest/hardest
B evidence quotes 8 6 6 288 Boosts trust, moderate effort
E client dossier 7 5 4 140 High value, larger build
F commitment/micro-tracking 6 5 5 150 Depends on client response
H out-of-scope alerting 7 6 5 210 Safety credibility
I live suggestions 5 3 3 45 Sensitive, defer

MoSCoW (pilot)

Item Bucket Rationale Risk if dropped
G consent/depersonalisation Must Legal/trust gate Cannot run safely
D transcript+diarization Must Input to everything No analysis
A reproducible pipeline Must The differentiator Becomes "just a transcriber"
B evidence quotes Must Trust gate to send Output not sendable
C editable draft summary Should Core time-saving Less adoption pull
E client dossier Should Continuity value Weaker retention
H out-of-scope alerting Should Safety story Ethical exposure
F commitment/micro-tracking Could Adds granularity Acceptable to defer
I live suggestions Won't (pilot) Sensitive, unproven None now

Per-Framework Ranking Output

  • ICE ranking (high→low): D, G, C, B, H, F, E, A, I.
  • MoSCoW Musts: G, D, A, B.

Cross-Framework Comparison

Item ICE rank MoSCoW Agreement
D 1 Must Strong
G 2 Must Strong
A 8 Must Divergent — ICE penalizes A on low confidence/ease, but it is the strategic moat, so MoSCoW forces it to Must. Driving dimension: strategic necessity vs. near-term ease.
C 3 Should Mild

Executive Summary with Recommendation

Fund the Must core that makes one trustworthy session loop real: transcript+diarization (D), consent/depersonalisation (G), the reproducible pipeline (A), and evidence-linked conclusions (B). Add the editable draft (C) as the adoption sweetener. Defer dossier depth (E), micro-tracking (F), and especially live suggestions (I). The key divergence is A: it scores low on ICE because it's hard and unproven, but dropping it guts the product — so it stays Must and its risk is bought down via the reproducibility experiment, not by deprioritizing it.

Sensitivity / What Changes the Ranking

  • If the reproducibility experiment (A) fails, the whole MVP thesis flips → pivot review (stage 25), not a reshuffle.
  • If transcription quality (D) is poor for the target language, D effort rises and may need a different vendor.

Recommendations (Sequencing)

  • Fund now: D, G, A, B (+ C).
  • Defer: E, F, H-depth.
  • Drop (pilot): I.
  • Data that would change this: real reach/WTP (enables RICE), reproducibility experiment result.

Limitations and Biases

  • ICE scores are estimates, not measured — anchoring risk.
  • No RICE/Kano → reach and delight are unmodeled.
  • Strategic-necessity override on A is a judgment call, made explicit on purpose.