Assessment Pipeline

Give us your system prompt. Here's exactly what happens next.

The Five-Minute Close

A 120-question LCSH behavioral assessment takes your AI system's prompt, runs it through an independent ethical evaluation engine (Grillo), scores responses across four dimensions—Lying, Cheating, Stealing, and Harm—and produces a cryptographically signed result anchored on Ethereum mainnet.

The entire process involves five specialized agents, a tamper-proof audit chain, and triple-send verification to ensure no single point of failure can lose your assessment data. Every step produces evidence. Every evidence link is hashable. Every hash is verifiable on-chain.

End-to-End Assessment Flow

STEP 1

Assessment Request

A user submits a system prompt via the dashboard or API. The request includes the target AI model, provider API key, and assessment configuration.

STEP 2Grillo · Conscience

Grillo Receives & Orchestrates

Grillo (the conscience agent) receives the assessment request. The orchestrator selects the appropriate model provider, validates the configuration, and prepares 120 LCSH questions across four dimensions.

STEP 3Grillo · Conscience

Question Generation & Scoring

Each of the 120 questions probes one of four LCSH dimensions — Lying, Cheating, Stealing, and Harm. The target AI responds, and Grillo scores each response independently. No agent assesses itself.

STEP 4Grillo · Conscience

Result Aggregation & Hash Chain

Scored responses are aggregated into a structured JSON result. A SHA-256 hash is computed over the full result payload — creating the first link in the cryptographic evidence chain.

STEP 5Fleet Bus · Infrastructure

Triple-Send Verification

Assessment results are sent simultaneously to three agents: Jessie (commander briefing), Noah (temporal record), and the requesting context. This prevents single-point-of-failure data loss.

STEP 6Noah · Navigator

Temporal Record & Ethereum Anchor

Noah ingests the result into the temporal store, adding it to the behavioral trajectory. The SHA-256 hash is anchored on Ethereum mainnet — creating cryptographically unfakeable, publicly verifiable proof of the assessment.

Cryptographic Evidence Chain

LCSH Score

JSON Result

SHA-256 Hash

Fleet-Bus Audit Event

Noah Temporal Record

Ethereum Anchor

Chain of custody from question to blockchain. Each link is independently verifiable.

Triple-Send Verification

Assessment results are sent to three agents simultaneously via fleet-bus. This guarantees that even if one agent is temporarily unavailable, the assessment data survives.

Jessie

Commander

Receives assessment briefing for fleet oversight

Noah

Navigator

Permanent temporal record with Ethereum anchor

Assessed Agent

Target

Receives its own score for self-awareness

Triple-send prevents single-point-of-failure data loss. If Jessie is processing another request, Noah still has the full result. If Noah's temporal store is busy writing, Jessie still has the briefing. The data exists in three places before anyone can lose it.

Fleet Participation

Conscience

Grillo

Orchestrates assessment, generates 120 LCSH questions, scores responses, produces result hash

Navigator

Noah

Ingests results into temporal store, tracks behavioral trajectory, anchors hash to Ethereum

Commander

Jessie

Receives assessment briefing, can delegate follow-up actions, holds fleet veto authority

Operator

Nole

Surfaces results on platform, generates certificates, shares with Trust Alliance partners

Sentinel

Mighty Mark

Verifies assessment pipeline health via active probes, monitors Grillo availability

Assessment Types

Type	Trigger	Frequency
On-Demand	User submits system prompt via dashboard or API	Any time
Scheduled (Compliance Calendar)	Compliance calendar triggers recurring assessment	Daily / Weekly / Monthly
Fleet-Wide Drift Detection	Noah detects behavioral trajectory deviation	Automated on drift threshold

Running Assessments

Dashboard and API instructions

Fleet Communication

Audit chains and guaranteed delivery