Output Schema

Surveyor's canonical output is JSON.

Markdown exists for human-readable sharing but it is derived from the same canonical model. If a fact matters, it should exist in the JSON schema first.

The same rule applies to the discovery slice described in docs/discovery.md. The same rule applies to the audit slice described in docs/audit.md.

Top-level report

Current top-level report shape:

{
  "schema_version": "1.0",
  "tool_version": "dev",
  "report_kind": "tls_scan",
  "scope_kind": "explicit",
  "scope_description": "explicit TLS targets from config",
  "generated_at": "2026-04-14T01:45:00Z",
  "scope": {
    "scope_kind": "explicit",
    "input_kind": "config"
  },
  "results": [],
  "summary": {}
}

Fields:

`schema_version`

type: string
meaning: current baseline-compatible schema version for report comparison

`tool_version`

type: string
meaning: emitting Surveyor build version, currently dev for ordinary builds and tests

`report_kind`

type: string
meaning: semantic top-level report kind, currently tls_scan, discovery, audit, diff or prioritization

`scope_kind`

type: string
meaning: high-level scope the report covers, currently explicit, local or remote

`scope_description`

type: string
optional: yes
meaning: human-readable summary of the declared scope or target set represented by the report

`generated_at`

type: RFC3339 UTC timestamp
meaning: when the report object was assembled

`scope`

type: report-scope object
optional: yes
meaning: structured declared scope metadata for the report, currently present for explicit TLS, discovery, audit and prioritization reports

`results`

type: array of target results
meaning: one entry per scanned target

`summary`

type: summary object
meaning: aggregate counts derived from results

Per-target result

Current per-target result shape:

{
  "name": "primary-site",
  "host": "example.com",
  "port": 443,
  "address": "203.0.113.10:443",
  "scanned_at": "2026-04-14T01:00:00Z",
  "reachable": true,
  "tls_version": "TLS 1.3",
  "cipher_suite": "TLS_AES_128_GCM_SHA256",
  "leaf_key_algorithm": "rsa",
  "leaf_key_size": 2048,
  "leaf_signature_algorithm": "sha256-rsa",
  "certificate_chain": [],
  "classification": "modern_tls_classical_identity",
  "findings": [],
  "warnings": [],
  "errors": []
}

Fields:

`name`

type: string
optional: yes
meaning: human label from configuration

`host`

type: string
optional: no
meaning: configured host or IP literal

`port`

type: integer
optional: no
meaning: configured TCP port

`address`

type: string
optional: yes
meaning: remote address observed by the TLS connection

`scanned_at`

type: RFC3339 UTC timestamp
optional: no
meaning: when collection for this target ran

`reachable`

type: boolean
optional: no
meaning: whether a TLS connection completed far enough to produce a connection state

`tls_version`

type: string
optional: yes
meaning: negotiated TLS version as reported by the Go TLS stack, for example TLS 1.2 or TLS 1.3

`cipher_suite`

type: string
optional: yes
meaning: negotiated cipher suite name as reported by the Go TLS stack

`leaf_key_algorithm`

type: string
optional: yes
meaning: lower-case public-key algorithm name from the leaf certificate, for example rsa or ecdsa

`leaf_key_size`

type: integer
optional: yes
meaning: inferred leaf public-key size in bits where that is available

`leaf_signature_algorithm`

type: string
optional: yes
meaning: normalised lower-case leaf certificate signature algorithm, for example sha256-rsa

`certificate_chain`

type: array of certificate references
optional: yes
meaning: presented peer certificates in observed order

`classification`

type: string
optional: no
meaning: Surveyor's current conservative migration-posture bucket

`findings`

type: array of findings
optional: yes
meaning: structured explanations and recommendations derived from the current evidence

`warnings`

type: array of strings
optional: yes
meaning: non-fatal collection concerns, for example incomplete certificate observation

`errors`

type: array of strings
optional: yes
meaning: collection failures or hard result-level errors

Certificate reference

Current certificate reference shape:

{
  "subject": "CN=example.com",
  "issuer": "CN=Example CA",
  "serial_number": "1",
  "not_before": "2026-04-01T00:00:00Z",
  "not_after": "2026-10-01T00:00:00Z",
  "dns_names": [
    "example.com",
    "www.example.com"
  ],
  "public_key_algorithm": "rsa",
  "public_key_size": 2048,
  "signature_algorithm": "sha256-rsa",
  "is_ca": false
}

Fields:

subject: string form of the certificate subject
issuer: string form of the certificate issuer
serial_number: decimal serial string
not_before: RFC3339 UTC timestamp
not_after: RFC3339 UTC timestamp
dns_names: SAN DNS names as presented
public_key_algorithm: lower-case public-key algorithm name
public_key_size: inferred key size in bits where available
signature_algorithm: lower-case normalised signature algorithm name
is_ca: whether the certificate is marked as a CA

Finding

Current finding shape:

{
  "code": "classical-certificate-identity",
  "severity": "medium",
  "summary": "The observed certificate identity remains classical.",
  "evidence": [
    "leaf_key_algorithm=rsa",
    "leaf_signature_algorithm=sha256-rsa"
  ],
  "recommendation": "Inventory certificate replacement and related PKI dependencies as part of migration planning."
}

Fields:

code: stable machine-readable identifier
severity: one of info, low, medium, high, critical
summary: short human-readable statement
evidence: supporting observed facts
recommendation: suggested next action

Summary object

Current summary shape:

{
  "total_targets": 1,
  "reachable_targets": 1,
  "unreachable_targets": 0,
  "classification_breakdown": {
    "modern_tls_classical_identity": 1
  }
}

Fields:

total_targets: total number of results
reachable_targets: number of results with reachable=true
unreachable_targets: number of results with reachable=false
classification_breakdown: counts keyed by classification label

Stability notes

The current schema is an MVP contract, not a promise of permanent immutability.

Still, changes to these field names or meanings should be treated as public contract changes. If the schema changes:

update this document
update the examples
update tests
explain the behavioural impact in the PR

Discovery report

Current discovery report shape:

{
  "schema_version": "1.0",
  "tool_version": "dev",
  "report_kind": "discovery",
  "scope_kind": "remote",
  "scope_description": "remote discovery from targets file examples/approved-hosts.txt over ports 443,8443",
  "generated_at": "2026-04-15T01:45:00Z",
  "scope": {
    "scope_kind": "remote",
    "input_kind": "targets_file",
    "targets_file": "examples/approved-hosts.txt",
    "ports": [443, 8443]
  },
  "execution": {
    "profile": "cautious",
    "max_hosts": 256,
    "max_concurrency": 8,
    "timeout": "3s"
  },
  "results": [],
  "summary": {}
}

Fields:

`schema_version`

type: string
meaning: current baseline-compatible schema version for report comparison

`tool_version`

type: string
meaning: emitting Surveyor build version, currently dev for ordinary builds and tests

`report_kind`

type: string
meaning: semantic top-level report kind, here discovery

`scope_kind`

type: string
meaning: high-level scope the report covers, here local or remote

`scope_description`

type: string
optional: yes
meaning: human-readable summary of the discovery scope represented by the report

`generated_at`

type: RFC3339 UTC timestamp
meaning: when the discovery report object was assembled

`scope`

type: report-scope object
meaning: declared scope metadata for the discovery report

`execution`

type: report-execution object
optional: yes
meaning: execution settings that materially shaped the run, currently present for remote discovery

`results`

type: array of discovered endpoints
meaning: one entry per observed endpoint in the discovery report, whether observed locally or within declared remote scope

`summary`

type: discovery summary object
meaning: aggregate counts derived from results

Report scope

Current report-scope shape:

{
  "scope_kind": "explicit",
  "input_kind": "config"
}

Fields:

scope_kind: whether the report covers explicit, local or remote scope
input_kind: declared input kind when relevant
- explicit TLS reports currently use config or targets
- remote discovery and audit currently use cidr, targets_file or inventory_file
cidr: declared remote CIDR when the report covers remote CIDR scope
targets_file: declared remote targets-file path when the report covers file-backed remote scope
inventory_file: declared structured inventory-file path when the report covers inventory-backed remote scope
adapter: selected inventory adapter when the report covers adapter-backed remote inventory scope
ports: declared remote port set when the report covers remote scope

Report execution

Current report-execution shape:

{
  "profile": "cautious",
  "max_hosts": 256,
  "max_concurrency": 8,
  "timeout": "3s"
}

Fields:

profile: effective remote pace profile
max_hosts: effective expanded-host hard cap
max_concurrency: effective probe concurrency cap
timeout: effective per-attempt timeout, currently also used for remote TLS connection attempts during audit

Discovered endpoint

Current discovered-endpoint shape:

{
  "scope_kind": "remote",
  "host": "10.0.0.10",
  "port": 443,
  "transport": "tcp",
  "state": "responsive",
  "hints": [],
  "warnings": [],
  "errors": []
}

Fields:

`scope_kind`

type: string
optional: no
meaning: whether the endpoint was observed in local or remote scope

`host`

type: string
optional: no
meaning: observed host or IP within the declared scope; for local discovery this is the local bound address, for remote discovery this is the attempted remote host or IP

`port`

type: integer
optional: no
meaning: observed or attempted port within the declared scope

`transport`

type: string
optional: no
meaning: transport name, currently tcp or udp

`state`

type: string
optional: no
meaning: observed endpoint state
- local discovery currently uses listening or bound
- remote discovery currently uses responsive or candidate

`pid`

type: integer
optional: yes
meaning: process identifier where available without requiring elevation, currently local-only

`process_name`

type: string
optional: yes
meaning: best-effort process name where available, currently local-only

`executable`

type: string
optional: yes
meaning: best-effort executable path where available, currently local-only

`inventory`

type: inventory annotation object
optional: yes
meaning: imported inventory metadata and provenance attached to an endpoint discovered from structured inventory scope

`hints`

type: array of discovery hints
optional: yes
meaning: conservative protocol hints derived from observed facts

`warnings`

type: array of strings
optional: yes
meaning: non-fatal discovery concerns, for example unavailable process metadata

`errors`

type: array of strings
optional: yes
meaning: endpoint-level discovery failures, including failed remote probe attempts

Inventory annotation

Current inventory-annotation shape:

{
  "ports": [443, 8443],
  "name": "Payments API",
  "owner": "payments",
  "environment": "prod",
  "tags": ["critical", "external"],
  "notes": "Internet-facing service",
  "provenance": [
    {
      "source_kind": "inventory_file",
      "source_format": "csv",
      "source_name": "examples/inventory.csv",
      "source_record": "line 2"
    }
  ],
  "adapter_warnings": [
    {
      "code": "host-without-declared-tls",
      "summary": "The Ingress rule host did not declare TLS and was mapped only to port 80.",
      "evidence": [
        "adapter=kubernetes-ingress-v1"
      ]
    }
  ]
}

Fields:

ports: imported port set declared for the inventory entry
name: optional imported asset label
owner: optional imported owner or team
environment: optional imported environment label
tags: optional imported tags
notes: optional imported free-form notes
provenance: array of inventory provenance records describing where the entry came from
adapter_warnings: non-fatal warnings preserved from adapter-backed imports

Current provenance notes:

source_format currently includes yaml, json, csv and caddyfile
adapter is present when the entry came through a platform-specific adapter
source_object is present when the adapter preserved stable product identity

Discovery hint

Current discovery-hint shape:

{
  "protocol": "tls",
  "confidence": "low",
  "evidence": [
    "transport=tcp",
    "port=443"
  ]
}

Fields:

protocol: hinted protocol family, for example tls, ssh or rdp
confidence: explicit qualitative confidence label
evidence: observed facts supporting the hint

Hints are not verified scan results.

Discovery summary object

Current discovery summary shape:

{
  "total_endpoints": 2,
  "tcp_endpoints": 1,
  "udp_endpoints": 1,
  "hint_breakdown": {
    "tls": 1
  }
}

Fields:

total_endpoints: total number of discovered endpoints
tcp_endpoints: number of discovered TCP endpoints
udp_endpoints: number of discovered UDP endpoints
hint_breakdown: counts keyed by hinted protocol label

Audit report

Current top-level audit report shape:

{
  "schema_version": "1.0",
  "tool_version": "dev",
  "report_kind": "audit",
  "scope_kind": "remote",
  "scope_description": "remote audit from targets file examples/approved-hosts.txt over ports 443,8443",
  "generated_at": "2026-04-16T02:00:00Z",
  "scope": {
    "scope_kind": "remote",
    "input_kind": "targets_file",
    "targets_file": "examples/approved-hosts.txt",
    "ports": [443, 8443]
  },
  "execution": {
    "profile": "cautious",
    "max_hosts": 256,
    "max_concurrency": 8,
    "timeout": "3s"
  },
  "results": [],
  "summary": {}
}

Fields:

`schema_version`

type: string
meaning: current baseline-compatible schema version for report comparison

`tool_version`

type: string
meaning: emitting Surveyor build version, currently dev for ordinary builds and tests

`report_kind`

type: string
meaning: semantic top-level report kind, here audit

`scope_kind`

type: string
meaning: high-level scope the report covers, here local or remote

`scope_description`

type: string
optional: yes
meaning: human-readable summary of the audit scope represented by the report

`generated_at`

type: RFC3339 UTC timestamp
meaning: when the audit report object was assembled

`scope`

type: report-scope object
meaning: declared scope metadata for the audit report

`execution`

type: report-execution object
optional: yes
meaning: execution settings that materially shaped the run, currently present for remote audit

`results`

type: array of audit results
meaning: one entry per discovered endpoint considered by the local or remote audit flow

`summary`

type: audit summary object
meaning: aggregate counts derived from results

Audit result

Current per-endpoint audit-result shape:

{
  "discovered_endpoint": {},
  "selection": {},
  "tls_result": {}
}

Fields:

`discovered_endpoint`

type: discovered endpoint object
meaning: observed endpoint facts and hints from the discovery layer

`selection`

type: selection object
meaning: scanner decision for the endpoint, including skipped outcomes

`tls_result`

type: target result object
optional: yes
meaning: verified TLS result when the endpoint is selected for the TLS scanner and the scan runs

Selection object

Current selection shape:

{
  "status": "selected",
  "selected_scanner": "tls",
  "reason": "tls hint on tcp/443"
}

Fields:

status: selection outcome, initially selected or skipped
selected_scanner: scanner identifier when selected, initially tls
reason: explicit explanation for the decision

Hints are not verified scans, and selection is not verification.

Audit summary object

Current audit summary shape:

{
  "total_endpoints": 3,
  "tls_candidates": 1,
  "scanned_endpoints": 1,
  "skipped_endpoints": 2,
  "selection_breakdown": {
    "tls": 1
  },
  "verified_classification_breakdown": {
    "modern_tls_classical_identity": 1
  }
}

Fields:

total_endpoints: total number of discovered endpoints considered by the audit flow
tls_candidates: endpoints selected for the TLS scanner
scanned_endpoints: endpoints for which a supported scanner ran
skipped_endpoints: endpoints not scanned
selection_breakdown: counts keyed by selected scanner
verified_classification_breakdown: counts keyed by verified TLS classification where a TLS scan completed

Diff report

Current top-level diff report shape:

{
  "schema_version": "1.0",
  "tool_version": "dev",
  "report_kind": "diff",
  "scope_kind": "remote",
  "scope_description": "diff of audit reports",
  "generated_at": "2026-04-22T02:00:00Z",
  "baseline_report_kind": "audit",
  "current_report_kind": "audit",
  "baseline_generated_at": "2026-04-20T02:00:00Z",
  "current_generated_at": "2026-04-21T02:00:00Z",
  "baseline_scope_description": "remote audit within CIDR 10.0.0.0/30 over ports 443",
  "current_scope_description": "remote audit within CIDR 10.0.1.0/30 over ports 443",
  "baseline_scope": {},
  "current_scope": {},
  "workflow_view": {},
  "summary": {},
  "grouped_summaries": [],
  "workflow_findings": [],
  "changes": []
}

Fields:

baseline_report_kind: semantic kind of the baseline input report
current_report_kind: semantic kind of the current input report
baseline_generated_at: baseline report generation time
current_generated_at: current report generation time
baseline_scope_description: baseline scope summary, when present
current_scope_description: current scope summary, when present
baseline_scope: structured baseline scope metadata, when present
current_scope: structured current scope metadata, when present
workflow_view: applied workflow grouping and filtering view, when requested
summary: diff summary object
grouped_summaries: grouped aggregate sections derived from changes, when emitted
workflow_findings: workflow-oriented findings derived from report context when emitted
changes: array of diff changes

Current emission notes:

workflow_view is present when workflow controls were applied
grouped_summaries are currently emitted for inventory-backed audit comparisons when usable metadata exists
workflow_findings is part of the canonical diff report shape but is not currently emitted by the diff engine

Current diff summary shape:

{
  "total_baseline_entities": 1,
  "total_current_entities": 2,
  "added_entities": 1,
  "removed_entities": 0,
  "changed_entities": 1,
  "unchanged_entities": 0,
  "scope_changed": true,
  "change_breakdown": {
    "new_endpoint": 1
  },
  "direction_breakdown": {
    "informational": 1
  }
}

Fields:

total_baseline_entities: total comparable entities in the baseline input
total_current_entities: total comparable entities in the current input
added_entities: entities present only in the current input
removed_entities: entities present only in the baseline input
changed_entities: entities with one or more recorded changes
unchanged_entities: entities with no recorded changes
scope_changed: whether declared scope metadata differs materially
change_breakdown: counts keyed by change code
direction_breakdown: counts keyed by change direction

Current diff change shape:

{
  "identity_key": "remote|example.com|443|tcp",
  "code": "classification_changed",
  "direction": "improved",
  "severity": "low",
  "summary": "The verified TLS classification changed.",
  "baseline_value": "legacy_tls_exposure",
  "current_value": "modern_tls_classical_identity",
  "evidence": [],
  "recommendation": ""
}

Fields:

identity_key: stable comparison key for the changed entity
code: stable machine-readable change identifier
direction: one of worsened, improved, changed or informational
severity: stable machine-readable severity for the change entry
summary: short human-readable description of the change
baseline_value: baseline-side value when relevant
current_value: current-side value when relevant
evidence: supporting observed facts when relevant
recommendation: suggested follow-up action when useful

Prioritization report

Current top-level prioritization report shape:

{
  "schema_version": "1.0",
  "tool_version": "dev",
  "report_kind": "prioritization",
  "scope_kind": "remote",
  "scope_description": "remote audit from targets file examples/approved-hosts.txt over ports 443",
  "generated_at": "2026-04-22T03:00:00Z",
  "profile": "migration-readiness",
  "source_report_kind": "audit",
  "source_generated_at": "2026-04-20T01:30:00Z",
  "scope": {},
  "workflow_view": {},
  "summary": {},
  "grouped_summaries": [],
  "workflow_findings": [],
  "items": []
}

Fields:

profile: prioritization profile applied to the input report
source_report_kind: semantic kind of the current input report
source_generated_at: generation time of the current input report
scope: copied structured scope metadata when the source report carried it
workflow_view: applied workflow grouping and filtering view, when requested
summary: prioritization summary object
grouped_summaries: grouped aggregate sections derived from items, when emitted
workflow_findings: workflow-oriented findings derived from report context when emitted
items: ranked prioritization items

Current emission notes:

workflow_view is present when workflow controls were applied
grouped_summaries are currently emitted when --group-by is requested on inventory-backed audit input
workflow_findings are currently emitted for inventory-backed audit input when imported metadata is weak or overridden

Current prioritization summary shape:

{
  "total_items": 3,
  "severity_breakdown": {
    "medium": 2
  },
  "code_breakdown": {
    "classical-certificate-identity": 1
  }
}

Fields:

total_items: total ranked items in the prioritization report
severity_breakdown: counts keyed by item severity
code_breakdown: counts keyed by item code

Current prioritization item shape:

{
  "rank": 1,
  "severity": "medium",
  "code": "classical-certificate-identity",
  "summary": "The observed certificate identity remains classical.",
  "target_identity": "remote|example.com|443|tcp",
  "reason": "Classical certificate identity is a direct migration dependency.",
  "evidence": [
    "leaf_key_algorithm=rsa"
  ],
  "recommendation": "Inventory certificate replacement and related PKI dependencies as part of migration planning."
}

Fields:

rank: stable 1-based display order after deterministic ranking
severity: stable machine-readable severity for the item
code: stable machine-readable prioritization code
summary: short human-readable statement
target_identity: stable identity key for the affected entity
reason: explicit explanation for why the item is ranked
evidence: supporting observed facts when relevant
recommendation: suggested next action when useful

Workflow view

Current workflow-view shape:

{
  "group_by": "owner",
  "filters": [
    {
      "field": "environment",
      "values": ["prod"]
    }
  ]
}

Fields:

group_by: requested grouping dimension, currently owner, environment or source
filters: applied workflow filters, currently emitted from the CLI for owner, environment and tag

Workflow filter

Current workflow-filter shape:

{
  "field": "tag",
  "values": ["external", "critical"]
}

Fields:

field: filtered metadata field
values: one or more accepted values for that field

Grouped summary

Current grouped-summary shape:

{
  "group_by": "owner",
  "groups": [
    {
      "key": "payments",
      "total_items": 2,
      "severity_breakdown": {
        "high": 1,
        "medium": 1
      },
      "code_breakdown": {
        "legacy-tls-version": 1
      }
    }
  ]
}

Fields:

group_by: grouping dimension used for the section
groups: grouped-summary-group entries for that dimension

Grouped summary group

Current grouped-summary-group shape:

{
  "key": "payments",
  "total_items": 2,
  "severity_breakdown": {
    "high": 1,
    "medium": 1
  },
  "code_breakdown": {
    "legacy-tls-version": 1
  },
  "direction_breakdown": {
    "changed": 1
  },
  "change_breakdown": {
    "selection_changed": 1
  }
}

Fields:

key: rendered group key, for example an owner, environment or source name
total_items: total items or changes counted in the group
severity_breakdown: counts keyed by severity when relevant
code_breakdown: counts keyed by prioritization code when relevant
direction_breakdown: counts keyed by diff change direction when relevant
change_breakdown: counts keyed by diff change code when relevant

Prioritization grouped summaries currently populate severity_breakdown and code_breakdown. Diff grouped summaries currently populate severity_breakdown, direction_breakdown and change_breakdown.

Workflow finding

Current workflow-finding shape:

{
  "severity": "low",
  "code": "weak-provenance",
  "summary": "The imported endpoint has no recorded source provenance.",
  "target_identity": "remote|prod.example.com|443|tcp",
  "reason": "Without provenance, later review and source reconciliation become weaker.",
  "evidence": [
    "host=prod.example.com",
    "port=443"
  ],
  "recommendation": "Preserve source file and record metadata when importing inventory."
}

Fields:

severity: workflow-finding severity
code: stable machine-readable workflow finding identifier
summary: short human-readable statement
target_identity: affected entity identity when relevant
reason: explicit explanation for the workflow concern
evidence: supporting facts for the workflow concern
recommendation: suggested next action when useful

Uh oh!

FilesExpand file tree

output-schema.md

Latest commit

History

output-schema.md

File metadata and controls

Output Schema

Top-level report

schema_version

tool_version

report_kind

scope_kind

scope_description

generated_at

scope

results

summary

Per-target result

name

host

port

address

scanned_at

reachable

tls_version

cipher_suite

leaf_key_algorithm

leaf_key_size

leaf_signature_algorithm

certificate_chain

classification

findings

warnings

errors

Certificate reference

Finding

Summary object

Stability notes

Discovery report

schema_version

tool_version

report_kind

scope_kind

scope_description

generated_at

scope

execution

results

summary

Report scope

Report execution

Discovered endpoint

scope_kind

host

port

transport

state

pid

process_name

executable

inventory

hints

warnings

errors

Inventory annotation

Discovery hint

Discovery summary object

Audit report

schema_version

tool_version

report_kind

scope_kind

scope_description

generated_at

scope

execution

results

summary

Audit result

`schema_version`

`tool_version`

`report_kind`

`scope_kind`

`scope_description`

`generated_at`

`scope`

`results`

`summary`

`name`

`host`

`port`

`address`

`scanned_at`

`reachable`

`tls_version`

`cipher_suite`

`leaf_key_algorithm`

`leaf_key_size`

`leaf_signature_algorithm`

`certificate_chain`

`classification`

`findings`

`warnings`

`errors`

`schema_version`

`tool_version`

`report_kind`

`scope_kind`

`scope_description`

`generated_at`

`scope`

`execution`

`results`

`summary`

`scope_kind`

`host`

`port`

`transport`

`state`

`pid`

`process_name`

`executable`

`inventory`

`hints`

`warnings`

`errors`

`schema_version`

`tool_version`

`report_kind`

`scope_kind`

`scope_description`

`generated_at`

`scope`

`execution`

`results`

`summary`

`discovered_endpoint`

`selection`

`tls_result`