AgentKits

Vendor Risk Assessment Agent

Flagship BlueprintAgentAz™ Enhanced
0New

Includes Agent Blueprint + Implementation Guide

An agent that runs the first-pass risk assessment on a third-party vendor: it gathers and normalizes the vendor's security, compliance, financial, and data-access profile, checks it against your risk criteria, scores risk across domains, and flags the gaps that matter — a missing DPA, an expired SOC 2, a prior breach, an over-broad data scope. It is built defensively: every finding is grounded in a source document, it flags missing evidence instead of assuming a control exists, it never auto-approves high-risk or critical-data vendors, and it routes those to your risk team. It assists the decision; it is not the final approval.

vendor-riskthird-party-riskgrccompliancesecuritydue-diligenceautonomous-agenttprmagentazagent-governancetrust-levelproduction-readiness
StackClaude, LangGraph, OpenAI
DifficultyAdvanced
Setup50 min
Version2.0.0 · 2026-06-21

Overview

Profiles a vendor across security, compliance, financial, and data-access domains, scoring risk against your criteria.

Evidence-grounded: every finding cites the source (SOC 2 report, questionnaire answer, DPA) — and missing evidence is flagged, not assumed away.

Surfaces the gaps that matter: expired/absent certifications, missing DPA, prior breaches, over-broad data access, risky fourth parties.

Defensive: recommends a tier or conditions, but never auto-approves high-risk or critical-data vendors — those go to your risk team.

AgentAz™ specification

A lightweight, design-time governance spec for security review. It documents what this agent is authorized to do — and why — and pairs with whatever policy engine you already run. It does not enforce anything at runtime.

Trust Level ?A2 — Recommend
DNA PatternEvaluation (Research → Evaluate)
Worst-Case ActionProduces an inaccurate vendor risk score, surfaced for human review. It cannot approve, onboard, or reject a vendor, or change any procurement record — execution tools are absent.
Authority BoundaryGathers vendor information, scores risk against your criteria, and surfaces findings for review. It never approves or onboards a vendor and never changes procurement systems. Procurement owns the decision.
Verification TestAttempt to call a vendor-approve or procurement-write tool → confirm it is absent from the registry.
Production Readiness6/6 dimensions passing. Tool isolation: approval/procurement tools absent. Human gates: procurement decides. Confidence escalation: uncertain findings flagged. Cost ceiling: bounded. Audit trail: scores and evidence logged. Escalation path: high-risk vendors routed to a reviewer.
Last Reviewed2026-06-24

Machine-readable contract (agentaz.json), validated against the open AgentAz™ JSON Schema — bundled for offline use and published at a permanent URL:

agentaz.json
{
  "$schema": "./agentaz.schema.json",
  "version": "2.0.0",
  "last_reviewed": "2026-06-24",
  "agent_id": "vendor-risk-assessment-agent",
  "trust_level": "A2",
  "dna_pattern": "Evaluation",
  "worst_case_action": "Produces an inaccurate risk score for human review. Cannot approve or onboard vendors.",
  "authority_boundary": "Assesses vendor risk and surfaces findings; no approval or procurement-write tools present.",
  "tags": [
    "compliance",
    "vendor-risk",
    "security",
    "read-only",
    "human-review"
  ],
  "tool_boundary": {
    "allowed_tools": [
      "gather_vendor_info",
      "check_criteria",
      "score_risk",
      "summarize_findings"
    ],
    "execution_tools_absent": true
  },
  "output_boundary": {
    "format": "structured_json",
    "never_emits": [
      "vendor_approve",
      "procurement_write"
    ]
  },
  "cost_boundary": {
    "max_usd_per_trace_loop": 0.3,
    "alert_threshold_usd": 0.2
  },
  "loop_boundary": {
    "max_reasoning_turns": 10
  },
  "human_handoff": {
    "triggers": [
      "high_risk_vendor",
      "missing_evidence",
      "low_confidence"
    ],
    "destination": "procurement_review"
  },
  "audit": {
    "append_only": true,
    "logs": [
      "scores",
      "evidence",
      "rationale"
    ]
  }
}

New to this? Read the AgentAz specification guide — Trust Levels, DNA patterns, and how it complements your runtime.

This is a flagship reference blueprint for AgentAz v1.0.0. AgentAz™ is open source under Apache-2.0 (spec text under CC‑BY‑4.0) — schema and source on GitHub.

Governance matrix

A scannable summary of this blueprint's governance coverage, derived from its AgentAz™ specification. It documents the boundaries that already ship — not new functionality.

Agent goalBounded by the authority spec above
Trust LevelA2 — Recommend
Tool accessLeast privilege — execution tools absent (read-only)
Context handlingGrounded in provided inputs; cites or flags rather than guessing
Memory strategyTask-scoped; no persistent cross-session memory
Human approvalRequired on high risk vendor, missing evidence, low confidence → procurement review
Audit trailAppend-only log (scores, evidence, rationale)
Cost & loop bounds≤ $0.3 per loop · ≤ 10 reasoning turns
Recovery / escalationEscalates to procurement review

Agent component mapping

A framework-neutral view of how this blueprint maps to standard agent-architecture components (the vocabulary common to ADK-style frameworks). It describes structure for clarity — not an official integration or certified compatibility.

AgentPrimary reasoner — Recommend authority (A2)
Toolsgather vendor info, check criteria, score risk, summarize findings — execution tools absent (read-only)
MemoryTask-scoped working context; no persistent cross-session memory
GuardrailsWorst-case classified (A2); no execution tools; ≤ $0.3/loop · ≤ 10 turns
EvaluatorConfidence and authority-boundary checks; low-confidence or out-of-bounds results are flagged, not actioned
HandoffEscalates to procurement review on high risk vendor, missing evidence, low confidence

Failure modes

Specific ways this blueprint can fail, and how it is designed to detect, contain, and recover from each — the boundaries that make it safe to run, stated plainly.

Over- or under-rates risk from incomplete documents.

Detection
A completeness check flags missing evidence.
Mitigation
The assessment uses only provided evidence; gaps are marked, not assumed.
Recovery
Missing documents are requested and the assessment is marked provisional.

A stale certification is treated as current.

Detection
Expiry dates on certifications are checked.
Mitigation
Expired or soon-to-expire certs are flagged.
Recovery
The case is routed to procurement for a refresh.

A fabricated or altered document is accepted at face value.

Detection
The agent never asserts authenticity and flags anomalies in the document.
Mitigation
A human verifies authenticity; the agent assesses content only.
Recovery
Suspicious documents are escalated for manual review.

Evaluation

Agreement with expert reviewers, plus correct handling of incomplete evidence, is what matters — over- or under-rating risk on thin documents is the failure to watch.

Assessment agreementAgreement and calibration of risk ratings against expert reviewers.
Evidence-gap detectionShare of assessments that correctly flag missing or expired evidence.
High-risk precisionOf vendors flagged high-risk, the share that genuinely were.
Escalation rateHow often it routes gaps and anomalies to a human reviewer.
LatencyTime per vendor assessment.

Recommended approach. Have experts score a set of vendor dossiers and measure agreement and calibration. Include incomplete and stale-certification dossiers specifically to test gap-flagging.

When to use

Use it when

  • You onboard or periodically review many third-party vendors and the first pass (collecting and checking evidence) eats analyst time.
  • You have risk criteria and access to vendor evidence (questionnaires, SOC 2/ISO reports, DPAs).
  • You want consistent, documented assessments with cited evidence and a clear tier recommendation.
  • You want to fast-track clearly low-risk vendors and route the risky or critical-data ones to humans.

Avoid it when

  • You expect it to be the final approval or to make a legal/contractual determination — those stay with humans.
  • You can't provide vendor evidence for it to assess, so findings would be ungrounded.
  • The relationship is novel/critical enough to need a full human-led assessment from the start.
  • You can't keep risk-team review on high-risk and critical-data vendors.

System prompt

system-prompt.md
You are a Vendor Risk Assessment Agent supporting a third-party risk (TPRM) team. You produce a first-pass, evidence-based risk assessment of ONE vendor and recommend a risk tier or escalation. You assist the decision; you are NOT the final approver. You are judged on accuracy, evidence discipline, and never clearing a risky vendor or fabricating assurance.

== CORE PRINCIPLES ==
1. Evidence or it's a gap. Base every finding on a source you actually reviewed (questionnaire answer, SOC 2/ISO report, DPA, breach record) and cite it. If evidence for a control is missing, that is a GAP — never assume the control exists.
2. Risk is contextual. Weight risk by what the vendor will access: a vendor touching sensitive/customer data or critical systems is inherently higher risk than a low-touch tool, regardless of polish.
3. Recommend, don't approve. Output a tier and conditions for a human. High-risk and critical-data vendors are escalated, not cleared.

== HARD RULES (NON-NEGOTIABLE) ==
- NO AUTO-APPROVAL OF HIGH RISK: You may recommend fast-tracking only clearly LOW-risk vendors with no sensitive-data access and complete evidence. MODERATE/HIGH risk, critical-data access, or missing key evidence MUST be escalated to the risk team.
- NO FABRICATED ASSURANCE: Never claim a certification, control, or clean record you haven't seen evidence for. Expired or absent = flag it. Do not infer SOC 2 from a marketing page.
- MISSING EVIDENCE IS A FINDING: Treat absent DPA, expired SOC 2, unanswered security questions, or no breach-history check as explicit gaps with required remediation.
- NOT LEGAL/FINAL: You provide a risk view, not a legal opinion or contract sign-off.
- DATA HANDLING: Treat vendor and internal data as confidential; keep it in scope.

== ASSESSMENT DOMAINS ==
Security posture (SOC 2/ISO, pen-test, MFA/encryption); compliance (DPA, GDPR/regulatory fit, data residency); data access & scope (what data, how much, fourth parties/subprocessors); financial/operational stability; and incident/breach history. For each: state the evidence, the gap (if any), and a domain risk rating.

== DECISION POLICY ==
- LOW (fast-track recommend): no sensitive-data access, complete evidence, no material gaps. Recommend approval with standard terms (human confirms).
- MODERATE (conditions): real gaps that are remediable; list required conditions (e.g. obtain DPA, MFA enforcement) before/with onboarding.
- HIGH / ESCALATE: sensitive/critical-data access with gaps, prior unremediated breach, missing key evidence, or conflicting signals. Route to the risk team with findings.

== OUTPUT FORMAT (return ONE JSON object) ==
{
  "vendor": "<name>",
  "data_access": "<what they access; sensitive? critical?>",
  "overall_risk": "low|moderate|high",
  "recommendation": "fast_track|conditional|escalate",
  "domains": [ { "domain": "security|compliance|data_access|financial|breach_history", "evidence": "<source cited>", "gap": "<gap or 'none'>", "rating": "low|moderate|high" } ],
  "gaps": ["<material gaps / missing evidence>"],
  "conditions": ["<remediations required for onboarding, if conditional>"],
  "rationale": "<grounded summary>",
  "escalation": { "needed": <bool>, "to": "risk_team|none", "reason": "<why, or empty>" },
  "not_final_approval": true
}
If key evidence is missing or data access is sensitive/critical with any gap, set recommendation to escalate. Never claim assurance without cited evidence.
Was this useful?

Simulate run

Try the agent with a sample task. This is a frontend-only preview that shows how the kit would plan and execute — no API calls, nothing leaves your browser.

Frontend preview only — no data leaves your browser. Tip: press ⌘/Ctrl + Enter to run.

Setup guide

Install and connect TPRM sources

Install the agent and connect it to your vendor register and evidence store.

shell
pipx install vendor-risk-agent
vendor-risk-agent connect --register onetrust --evidence-store sharepoint
vendor-risk-agent doctor

Configure risk criteria and gate

Define which vendors may be fast-tracked. Critical-data and high-risk always escalate.

shell
cp .env.example .env
ANTHROPIC_API_KEY=sk-ant-...
FAST_TRACK_ONLY_IF: { data: "non_sensitive", evidence: "complete", risk: "low" }
ALWAYS_ESCALATE: ["sensitive_data", "critical_systems", "missing_soc2", "prior_breach"]

Define required evidence per tier

Tell it what evidence is mandatory so absences become explicit gaps.

shell
# criteria.yml
required_for_data_access:
  - dpa
  - soc2_or_iso27001
  - breach_history_check
  - mfa_and_encryption_attested
weight_by_data_sensitivity: true

Assess a vendor from the CLI

Run an assessment and review the cited findings and tier before relying on it.

shell
vendor-risk-agent assess --vendor 'Acme Analytics' --explain
# prints domains+evidence+gaps, overall_risk, recommendation, conditions

Integrate with onboarding

Route new vendors for a first-pass assessment; high-risk/critical-data ones auto-route to the risk team.

shell
# onboarding webhook -> POST https://your-host/vendor/assess
# recommendation=escalate auto-assigns to the risk queue

Architecture

Tools required

get_vendor_profileRetrieve the vendor, the proposed product/relationship, and the data it will access.
questionnaire_parseParse the security/compliance questionnaire into structured answers and identify unanswered items.
cert_verifyCheck certification evidence (SOC 2/ISO) for presence, scope, and validity dates — flagging expired or absent ones.
breach_history_lookupLook up known breach/incident history and unremediated findings for the vendor.
data_scope_assessAssess what data the vendor accesses, the volume/sensitivity, and fourth-party/subprocessor exposure.
score_risk_domainsRate each risk domain from the cited evidence and weight by data sensitivity and access.
recommend_tierProduce the overall risk tier and required onboarding conditions for a human decision.
escalate_to_risk_teamRoute moderate/high-risk, critical-data, or evidence-gap vendors to the risk team with the findings.

Workflow

  1. 1. Frame the relationship

    Capture the vendor, the product, and — critically — what data and systems it will access.

  2. 2. Ingest evidence

    Parse the questionnaire, SOC 2/ISO reports, DPA, and any pen-test/breach records that exist.

  3. 3. Assess each domain

    Rate security, compliance, data access, financial stability, and breach history, citing the evidence for each.

  4. 4. Flag the gaps

    Mark absent/expired evidence and unanswered questions as explicit gaps — never assume an unseen control exists.

  5. 5. Score weighted by data

    Combine domain ratings, weighting by data sensitivity and access scope, into an overall risk level.

  6. 6. Recommend or escalate

    Fast-track only clearly low-risk vendors; attach conditions for moderate risk; escalate high-risk/critical-data/evidence-gap vendors to the risk team.

  7. 7. Register & schedule review

    Record the assessment with cited evidence and a re-assessment date for audit and periodic review.

Examples

Low-touch SaaS, complete evidence → fast-track recommend

A scheduling tool with no access to sensitive data, valid SOC 2, and a complete questionnaire. Clearly low risk.

Input

input
Vendor: MeetEasy (scheduling). Data access: calendar metadata only, no PII/customer data. Evidence: SOC 2 Type II (current, 2026), questionnaire complete, MFA + encryption attested, no breach history.

Output

output
{
  "vendor": "MeetEasy",
  "data_access": "Calendar metadata only — no PII or customer data (non-sensitive)",
  "overall_risk": "low",
  "recommendation": "fast_track",
  "domains": [
    { "domain": "security", "evidence": "SOC 2 Type II current (2026); MFA + encryption attested", "gap": "none", "rating": "low" },
    { "domain": "compliance", "evidence": "DPA in place; no regulated data", "gap": "none", "rating": "low" },
    { "domain": "data_access", "evidence": "Calendar metadata only; no subprocessors with data access", "gap": "none", "rating": "low" },
    { "domain": "breach_history", "evidence": "No known incidents", "gap": "none", "rating": "low" }
  ],
  "gaps": [],
  "conditions": [],
  "rationale": "Non-sensitive data access, current SOC 2, complete evidence, and no breach history. Low inherent and residual risk.",
  "escalation": { "needed": false, "to": "none", "reason": "" },
  "not_final_approval": true
}

Note: Clearly low-risk with complete, cited evidence and no sensitive-data access, so the agent recommends fast-tracking — but note 'not_final_approval' stays true: it surfaces a clean assessment, a human confirms. This is the volume that should move fast.

PII-handling vendor with gaps → conditional, list remediations

A vendor that will process customer PII has a DPA gap and an expired SOC 2. Remediable, so the agent recommends conditions rather than a flat no.

Input

input
Vendor: DataCleanse (data enrichment). Data access: customer PII. Evidence: questionnaire complete, SOC 2 EXPIRED (2024), no DPA on file, MFA attested.

Output

output
{
  "vendor": "DataCleanse",
  "data_access": "Customer PII (sensitive)",
  "overall_risk": "moderate",
  "recommendation": "conditional",
  "domains": [
    { "domain": "security", "evidence": "MFA attested; SOC 2 present but EXPIRED (2024)", "gap": "No current SOC 2 / equivalent", "rating": "moderate" },
    { "domain": "compliance", "evidence": "No DPA on file for a PII-processing vendor", "gap": "Missing DPA", "rating": "high" },
    { "domain": "data_access", "evidence": "Processes customer PII", "gap": "Scope/retention not documented", "rating": "moderate" },
    { "domain": "breach_history", "evidence": "No known incidents", "gap": "none", "rating": "low" }
  ],
  "gaps": ["No DPA for a PII processor", "SOC 2 expired (2024)", "Data scope/retention undocumented"],
  "conditions": ["Execute a DPA with data-processing scope, retention, and breach-notification terms", "Provide a current SOC 2 Type II (or ISO 27001)", "Document data scope and retention"],
  "rationale": "Processes customer PII but is missing a DPA and a current SOC 2. Risk is moderate and the gaps are remediable; onboarding should be conditional on closing them.",
  "escalation": { "needed": false, "to": "none", "reason": "" },
  "not_final_approval": true
}

Note: Because the vendor touches PII, the missing DPA is rated high — but the gaps are remediable, so the agent recommends a conditional path with a concrete remediation checklist rather than guessing or clearing it. It never assumes the expired SOC 2 still counts.

Critical-data vendor with prior breach + missing evidence → escalate

A vendor that would access critical systems has an unremediated breach and an incomplete questionnaire. The agent refuses to score it clean and escalates.

Input

input
Vendor: CoreConnect (infrastructure integration). Data access: production systems + customer data (critical). Evidence: questionnaire 60% complete, no SOC 2 provided, breach_history shows a 2025 incident with no remediation evidence.

Output

output
{
  "vendor": "CoreConnect",
  "data_access": "Production systems + customer data (critical)",
  "overall_risk": "high",
  "recommendation": "escalate",
  "domains": [
    { "domain": "security", "evidence": "No SOC 2 provided; questionnaire only 60% complete", "gap": "No security certification; key questions unanswered", "rating": "high" },
    { "domain": "compliance", "evidence": "Incomplete", "gap": "Insufficient evidence to assess", "rating": "high" },
    { "domain": "data_access", "evidence": "Production + customer data", "gap": "Critical access with unverified controls", "rating": "high" },
    { "domain": "breach_history", "evidence": "2025 incident on record", "gap": "No remediation evidence provided", "rating": "high" }
  ],
  "gaps": ["No SOC 2 / security certification", "Questionnaire 40% unanswered", "Unremediated 2025 breach"],
  "conditions": [],
  "rationale": "Critical access to production and customer data, no security certification, an incomplete questionnaire, and a prior breach without remediation evidence. This cannot be assessed as low/moderate and must not be auto-cleared.",
  "escalation": { "needed": true, "to": "risk_team", "reason": "Critical-data access with a prior unremediated breach and missing key security evidence." },
  "not_final_approval": true
}

Note: The defining case: critical data access + a prior unremediated breach + missing evidence. The agent does not fabricate assurance from the partial questionnaire or hand-wave the breach — it rates every domain high and escalates to the risk team with the gaps. Missing evidence is treated as risk, not as a pass.

Implementation notes

  • Make 'missing evidence is a finding' a hard rule: an absent DPA, expired SOC 2, or unanswered question is an explicit gap, never an assumed-present control.
  • Weight risk by data access and sensitivity, and enforce in a gate that critical-data and high-risk vendors can never be fast-tracked — only escalated.
  • Require a cited source for every assurance; the agent must never infer a certification or clean record it hasn't seen.
  • Output remediation conditions for moderate risk so the assessment is actionable, not just a rating.
  • Keep it explicitly non-final: it produces a risk view for the risk team, not a legal opinion or contract approval.
  • Write assessments to the vendor register with evidence and a re-assessment date so periodic review and audit are supported.
  • Spend the strong model on the domain risk judgment and escalation framing — a cheaper model can parse questionnaires and certs.

Variations

Basic

Evidence-cited risk reviewer

Assesses a vendor across domains from provided evidence and returns cited findings, gaps, and a recommended tier for an analyst. No auto-approval.

Advanced

Gated first-pass assessor

Adds data-sensitivity weighting, an escalation gate for high-risk/critical-data/evidence-gap vendors, and remediation-condition generation for conditional cases.

Enterprise

Governed TPRM automation

Adds register/evidence-store integration, multi-framework criteria, fourth-party mapping, full audit trails, periodic re-assessment scheduling, and tuning from risk-team outcomes.

Download the Agent Blueprint

The complete blueprint, zipped — including a runnable run.py you can execute with one API key (Anthropic or OpenAI).

Download Blueprint (.zip)
README.mdsystem-prompt.mdsetup-guide.mdtools.jsonworkflow.mdexamples.md.env.examplekit.jsonrun.pyLICENSENOTICEstarters/

Export

Generate a starter for your stack — all client-side, nothing leaves your browser.

ZIP

Starters use mock tools — swap in your integrations to deploy.

View the source on GitHub

This flagship blueprint and the AgentAz™ specification live in the central AgentKits registry — open source under Apache-2.0 (code & schema) and CC‑BY‑4.0 (text).

Frequently asked questions