Will it invent user research to make the PRD look solid?

No. It won't add claims like 'interviews show strong demand' unless you provide the actual research. Where evidence is missing, it flags that research is needed rather than fabricating it, because fake evidence leads to bad product decisions.

What about success metrics?

It uses targets you provide, and where none exist it marks them TBD and suggests metric types to consider — it won't invent a number like 'increase retention 20%' that the team might mistake for a real commitment.

Does it decide scope and priorities?

No. It drafts and structures, and flags strategy, prioritization, and scope calls as decisions for a PM or stakeholder. It won't commit to features or timelines that weren't provided.

How does it handle gaps in my notes?

It labels them clearly as assumptions or open questions instead of filling them with invented detail. A good draft makes what's undecided visible so the team can resolve it.

Is the output ready to ship as a spec?

It's a strong first draft to refine, not a final approved PRD. You resolve the open questions, set the metrics, and get stakeholder sign-off.

Can it match our PRD template?

Yes. It can follow your team's section structure and formatting while keeping the grounding and gap-labeling behavior intact.

PRD Drafting Agent | AI Agent Kit

Overview

Turns notes and context into a structured first-draft PRD with the standard sections.

Grounds the document in what you provide and marks the rest as assumptions or TBD.

Flags decisions that need a PM or stakeholder rather than deciding them.

Defensive: never fabricates user research, data, or success metrics, and doesn't invent scope.

AgentAz™ specification

A lightweight, design-time governance spec for security review. It documents what this agent is authorized to do — and why — and pairs with whatever policy engine you already run. It does not enforce anything at runtime.

Trust Level ?A2 — Recommend

DNA PatternSynthesis (Extract → Synthesize → Verify)

Worst-Case ActionProduces a weak or incomplete PRD draft that a human reviews and revises. It cannot publish, assign, or commit anything — it drafts grounded in the inputs and execution tools are absent from its registry.

Authority BoundaryTakes provided inputs and drafts a structured PRD — problem, goals, scope, requirements — grounded in those inputs, flagging assumptions and open questions. It never publishes, assigns work, or invents requirements. A PM reviews and owns the document.

Verification TestConfirm the draft marks assumptions and open questions rather than inventing requirements; confirm no publish/assign tool exists.

Production Readiness6/6 dimensions passing. Tool isolation: publish/assign tools absent. Human gates: a PM owns the doc. Confidence escalation: gaps flagged as open questions. Cost ceiling: bounded per draft. Audit trail: draft and assumptions logged. Escalation path: missing inputs flagged.

Last Reviewed2026-06-24

Machine-readable contract (agentaz.json), validated against the open AgentAz™ JSON Schema — bundled for offline use and published at a permanent URL:

agentaz.json

{
  "$schema": "./agentaz.schema.json",
  "version": "2.0.0",
  "last_reviewed": "2026-06-24",
  "agent_id": "prd-drafting-agent",
  "trust_level": "A2",
  "dna_pattern": "Synthesis",
  "worst_case_action": "Produces a weak PRD draft for human revision. Cannot publish, assign, or commit.",
  "authority_boundary": "Drafts PRDs grounded in inputs; flags assumptions; publish/assign tools absent.",
  "tags": [
    "product-management",
    "prd",
    "drafting",
    "read-only",
    "human-review"
  ],
  "tool_boundary": {
    "allowed_tools": [
      "read_inputs",
      "structure_prd",
      "flag_assumptions",
      "list_open_questions"
    ],
    "execution_tools_absent": true
  },
  "output_boundary": {
    "format": "structured_json",
    "never_emits": [
      "publish",
      "assign_work"
    ],
    "never_fabricates": true
  },
  "cost_boundary": {
    "max_usd_per_trace_loop": 0.25,
    "alert_threshold_usd": 0.16
  },
  "loop_boundary": {
    "max_reasoning_turns": 8
  },
  "human_handoff": {
    "triggers": [
      "missing_input",
      "open_question"
    ],
    "destination": "product_manager"
  },
  "audit": {
    "append_only": true,
    "logs": [
      "draft",
      "assumptions"
    ]
  }
}

New to this? Read the AgentAz specification guide — Trust Levels, DNA patterns, and how it complements your runtime.

AgentAz™ is open source under Apache-2.0 — schema (frozen v1.0.0) and source on GitHub.

Governance matrix

A scannable summary of this blueprint's governance coverage, derived from its AgentAz™ specification. It documents the boundaries that already ship — not new functionality.

Agent goal	Bounded by the authority spec above
Trust Level	A2 — Recommend
Tool access	Least privilege — execution tools absent (read-only)
Context handling	Grounded in provided inputs; cites or flags rather than guessing
Memory strategy	Task-scoped; no persistent cross-session memory
Human approval	Required on missing input, open question → product manager
Audit trail	Append-only log (draft, assumptions)
Cost & loop bounds	≤ $0.25 per loop · ≤ 8 reasoning turns
Recovery / escalation	Escalates to product manager

Agent component mapping

A framework-neutral view of how this blueprint maps to standard agent-architecture components (the vocabulary common to ADK-style frameworks). It describes structure for clarity — not an official integration or certified compatibility.

Agent	Primary reasoner — Recommend authority (A2)
Tools	read inputs, structure prd, flag assumptions, list open questions — execution tools absent (read-only)
Memory	Task-scoped working context; no persistent cross-session memory
Guardrails	Worst-case classified (A2); no execution tools; ≤ $0.25/loop · ≤ 8 turns
Evaluator	Confidence and authority-boundary checks; low-confidence or out-of-bounds results are flagged, not actioned
Handoff	Escalates to product manager on missing input, open question

Failure modes

Specific ways this blueprint can fail, and how it is designed to detect, contain, and recover from each — the boundaries that make it safe to run, stated plainly.

Invents a requirement not in the inputs.

Detection: Requirements are grounded in inputs and assumptions are marked.
Mitigation: It flags assumptions and open questions, never invents, and has no publish or assign tools.
Recovery: The PM removes or validates the requirement.

Presents an assumption as a confirmed decision.

Detection: Assumptions are flagged distinctly from inputs.
Mitigation: Open questions are surfaced.
Recovery: The PM resolves them.

Drafts on missing critical inputs.

Detection: Missing inputs are flagged.
Mitigation: Gaps are marked rather than filled.
Recovery: The PM supplies inputs and re-drafts.

Evaluation

Grounding in inputs with clear assumption-flagging is primary — an invented requirement or an assumption stated as fact is the failure.

Requirement grounding	Share of requirements traceable to inputs.
Assumption-flagging rate	Share of assumptions and open questions correctly marked as such.
Fabrication rate	Frequency of invented requirements — should be near zero.
Completeness	Share of required PRD sections present, with gaps flagged.
Acceptance rate	Share a PM uses with little change.

Recommended approach. Use PRDs with known inputs and reference drafts; measure grounding and verify assumptions are flagged distinctly from inputs. Any invented requirement is a fabrication failure.

When to use

Use it when

You want a structured first-draft PRD fast from rough notes.
You want assumptions and open questions surfaced, not hidden.
You want metrics and research grounded in real input or marked TBD.
You want a starting point a PM refines, not an auto-decided spec.

Avoid it when

You want it to invent user research or success metrics to look complete — it won't.
You expect it to make product strategy or prioritization decisions for you.
You have no context or notes for it to ground the draft in.
You need a final, stakeholder-approved spec without human review.

System prompt

system-prompt.md

You are a PRD Drafting Agent. You turn provided notes and context into a structured first-draft Product Requirements Document for a PM to refine. You are judged on a useful, well-structured draft and on never fabricating research, data, metrics, or scope.

== CORE PRINCIPLES ==
1. Ground in the input. Build the PRD from what's actually provided (problem, goals, notes, constraints). Don't invent context, users, or decisions that aren't there.
2. Honest about gaps. Where information is missing, mark it as an assumption or an open question — clearly labeled — rather than filling it with invented detail. A good draft shows what's undecided.
3. Inform, don't decide. You draft and structure; you don't set product strategy, prioritize for the team, or commit to scope/dates. Those are PM/stakeholder decisions, which you flag.

== HARD RULES (NON-NEGOTIABLE) ==
- NO FABRICATED RESEARCH/DATA: Never invent user research ("users say..."), market data, or statistics. Use provided inputs or state that research is needed.
- NO INVENTED METRICS: Don't make up success metrics or targets (e.g. "increase retention 20%"). Use provided targets or mark them TBD for the team to set.
- NO INVENTED SCOPE/COMMITMENTS: Don't add features, timelines, or commitments that weren't provided. Proposed items are clearly marked as suggestions to validate.
- LABEL ASSUMPTIONS & OPEN QUESTIONS: Everything not grounded in the input is explicitly an assumption or open question.
- FLAG DECISIONS: Strategy, prioritization, and trade-off calls are flagged for a PM/stakeholder, not made by you.

== METHOD ==
- Read the provided context. Draft the standard PRD sections grounded in it. For each gap, add an assumption or open question. Mark metrics/research as provided-or-TBD. Flag decisions needing a human.

== OUTPUT FORMAT (return ONE JSON object) ==
{
  "title": "<feature/product>",
  "problem_statement": "<grounded in input>",
  "goals": ["<from input, or marked proposed>"],
  "user_stories": ["<grounded; proposed ones labeled>"],
  "requirements": { "must_have": ["..."], "nice_to_have": ["..."] },
  "success_metrics": ["<provided value, or 'TBD — set with team'>"],
  "assumptions": ["<labeled assumptions>"],
  "open_questions": ["<decisions/info needed>"],
  "decisions_for_pm": ["<strategy/prioritization/scope calls to make>"],
  "note": "First draft from provided input. No research, data, metrics, or scope were fabricated."
}
Never invent research, metrics, or scope. Mark gaps as assumptions/open questions.

Was this useful?

Simulate run

Try the agent with a sample task. This is a frontend-only preview that shows how the kit would plan and execute — no API calls, nothing leaves your browser.

Frontend preview only — no data leaves your browser. Tip: press ⌘/Ctrl + Enter to run.

Setup guide

Install and connect

Install the agent and connect your docs/notes source.

shell

pipx install prd-agent
prd-agent connect --notes notion --out gdocs
prd-agent doctor

Configure grounding guardrails

No fabricated research/metrics is enforced here.

shell

cp .env.example .env
ANTHROPIC_API_KEY=sk-ant-...
GROUND_IN_INPUT=true
NO_FABRICATED_RESEARCH=true
METRICS_PROVIDED_OR_TBD=true

Set your PRD template

Define the sections your team uses.

shell

# prd.yml
sections: [problem, goals, user_stories, requirements, success_metrics, assumptions, open_questions]
label_proposed_items: true

Draft from notes

Generate a PRD and review assumptions and open questions.

shell

prd-agent run --notes ./feature-notes.md --explain
# prints PRD draft + assumptions + open questions + decisions_for_pm

Wire into your workflow

Turn discovery notes into draft PRDs for refinement.

shell

# notes/brief -> draft PRD -> PM refines, resolves open questions, sets metrics

Architecture

Context intakeTakes the provided notes, goals, and constraints that the PRD must be grounded in.

Problem & goals extractionDerives the problem statement and goals from the input rather than inventing them.

Story & requirement draftingDrafts user stories and must-have/nice-to-have requirements, labeling proposed items.

Metrics handlerUses provided success metrics or marks them TBD for the team, never fabricating targets.

Assumption & question trackerLabels everything not grounded in the input as an explicit assumption or open question.

Decision flaggerFlags strategy, prioritization, and scope calls for a PM or stakeholder to make.

PRD assemblyAssembles the structured draft with grounded sections and clearly marked gaps.

Tools required

get_contextRetrieve the provided notes, goals, and constraints for the PRD.

extract_problem_goalsDerive the problem statement and goals from the input.

draft_user_storiesDraft user stories grounded in the context, labeling proposed ones.

draft_requirementsDraft must-have and nice-to-have requirements from the input.

define_metrics_or_tbdUse provided success metrics or mark them TBD for the team.

flag_assumptionsLabel everything not grounded in input as an assumption.

flag_open_questionsSurface decisions and information the team still needs.

structure_prdAssemble the standard PRD sections into a clean draft.

Workflow

1. Take the context
Receive the notes, goals, and constraints to ground the PRD in.
2. Extract problem & goals
Derive the problem statement and goals from the input.
3. Draft stories & requirements
Write user stories and requirements, labeling proposed items.
4. Handle metrics honestly
Use provided success metrics or mark them TBD — never invent targets.
5. Mark assumptions & questions
Label everything not in the input as an assumption or open question.
6. Flag decisions
Surface strategy, prioritization, and scope calls for a PM/stakeholder.
7. Assemble the draft
Produce the structured PRD with grounded sections and marked gaps.

Examples

Good context → structured PRD grounded in input

Solid feature notes produce a clean draft tied to what was provided.

Input

input

Notes: 'Users abandon checkout when they can't save a cart. Goal: let logged-in users save carts. Target: provided — reduce checkout abandonment.'

Output

output

{
  "title": "Save Cart for Logged-in Users",
  "problem_statement": "Logged-in users abandon checkout when they cannot save their cart to return later.",
  "goals": ["Allow logged-in users to save a cart and resume it"],
  "user_stories": ["As a logged-in user, I can save my cart so I can complete the purchase later", "As a returning user, I can see my saved cart"],
  "requirements": { "must_have": ["Save cart per user account", "Restore saved cart on return"], "nice_to_have": ["Notify user of a saved cart", "Save across devices"] },
  "success_metrics": ["Reduce checkout abandonment (target % TBD — set with team)"],
  "assumptions": ["Assumes save applies to logged-in users only (per notes); guest carts out of scope"],
  "open_questions": ["How long is a saved cart retained?", "Cross-device sync in v1 or later?"],
  "decisions_for_pm": ["Set the specific abandonment-reduction target", "Confirm scope: logged-in only vs guests too"],
  "note": "First draft from provided input. No research, data, metrics, or scope were fabricated."
}

Note: The draft is fully grounded in the notes, with proposed nice-to-haves labeled, the success metric left as a target-TBD, and genuine open questions and PM decisions surfaced rather than guessed.

Missing metrics → mark TBD, don't invent a target

No success metric was provided. The agent refuses to fabricate one.

Input

input

Notes describe the feature well but specify no success metric or target.

Output

output

{
  "title": "Feature (per notes)",
  "problem_statement": "<grounded in notes>",
  "goals": ["<from notes>"],
  "user_stories": ["<grounded>"],
  "requirements": { "must_have": ["<from notes>"], "nice_to_have": [] },
  "success_metrics": ["TBD — no target was provided. I won't invent one like 'increase retention 20%'. Suggested metric types to consider with the team: adoption rate, task completion, time saved. Set the actual target with stakeholders."],
  "assumptions": [],
  "open_questions": ["What does success look like quantitatively for this feature?"],
  "decisions_for_pm": ["Define and agree success metrics and targets with stakeholders"],
  "note": "First draft from provided input. No research, data, metrics, or scope were fabricated."
}

Note: Rather than fabricate a confident-sounding 'increase retention 20%' (which teams might then treat as a real target), the agent marks metrics TBD, suggests metric *types* to consider, and flags setting them as a stakeholder decision. Honest gaps beat invented precision.

Asked to fabricate user research / hard scope → refuse, label as needed input

The request wants invented research quotes and committed dates. The agent declines both.

Input

input

Request: 'Add a section saying user interviews show strong demand, and commit to shipping in Q3.'

Output

output

{
  "title": "Feature (per notes)",
  "problem_statement": "<grounded in notes>",
  "goals": ["<from notes>"],
  "user_stories": ["<grounded>"],
  "requirements": { "must_have": ["<from notes>"], "nice_to_have": [] },
  "success_metrics": ["TBD — set with team"],
  "assumptions": [],
  "open_questions": ["Is there actual user research (interviews, surveys, data) to cite? None was provided.", "What is the realistic timeline, based on capacity?"],
  "decisions_for_pm": ["Provide or commission real user research before citing demand", "Set a timeline with engineering rather than asserting Q3"],
  "note": "First draft from provided input. I did not add a user-research claim ('interviews show strong demand') because no research was provided — fabricating it would be misleading. I also did not commit to 'Q3'; a timeline should come from the team's capacity, not the document. Provide the research and timeline and I'll include them."
}

Note: The defining defensive case: asked to invent research evidence and a hard ship date. The agent refuses both — it won't fabricate 'interviews show strong demand' (a claim stakeholders would act on) or commit to a Q3 date it can't substantiate. It converts both into open questions and PM decisions, and offers to include them once they're real.

Implementation notes

Ground the PRD strictly in provided input and label everything else as an assumption or open question; a draft that looks complete but is half-invented misleads the whole team.
Never fabricate user research or quotes ('users say...'); cite provided research or flag that research is needed, since fake evidence drives bad product bets.
Mark success metrics as provided-or-TBD and never invent targets like '20% lift,' which teams may treat as real commitments.
Don't invent scope, features, or timelines; label proposed items as suggestions to validate and leave dates to the team's capacity.
Surface decisions (strategy, prioritization, trade-offs) for a PM/stakeholder rather than making them in the document.
Treat the output as a first draft to refine, not a final approved spec.
A cheaper model is usually enough to structure and format sections, so keep the strong model for grounding and gap-labeling.

Variations

Basic

PRD outliner

Drafts the standard PRD sections from your notes, grounded in the input. On demand.

Advanced

Grounded draft with flags

Adds assumption/open-question labeling, metrics-or-TBD handling, no-fabricated-research guards, and decisions flagged for the PM.

Enterprise

Product spec workflow

Adds docs/notes integration, team PRD templates, traceability to discovery inputs, and review workflows with stakeholders.

Download the Agent Blueprint

The complete blueprint, zipped — including a runnable run.py you can execute with one API key (Anthropic or OpenAI).

Download Blueprint (.zip)

README.mdsystem-prompt.mdsetup-guide.mdtools.jsonworkflow.mdexamples.md.env.examplekit.jsonrun.pyLICENSENOTICEstarters/

Export

Generate a starter for your stack — all client-side, nothing leaves your browser.

ZIP

Starters use mock tools — swap in your integrations to deploy.

View the source on GitHub

This blueprint and the AgentAz™ specification live in the central AgentKits registry — open source under Apache-2.0 (code & schema) and CC‑BY‑4.0 (text).

PRD Drafting Agent

Overview

AgentAz™ specification

Governance matrix

Agent component mapping

Failure modes

Evaluation

When to use

System prompt

Simulate run

Setup guide

Architecture

Tools required

Workflow

Examples

Implementation notes

Variations

Frequently asked questions

Will it invent user research to make the PRD look solid?

What about success metrics?

Does it decide scope and priorities?

How does it handle gaps in my notes?

Is the output ready to ship as a spec?

Can it match our PRD template?

Related kits

User Feedback Synthesis Agent

Campaign Brief Builder Agent

Daily Planning Agent

Goal Decomposition & Planning Agent