Create Conversation Eval

Run a scripted conversation against the real reply engine, then score deterministic assertions (forbidden-leakage / memory-recall / per-turn contains) + LLM-judge rubrics. Async: returns an eval_id; poll the GET below for the scored report.

Authentication

AuthorizationBearer

LoreOS API key, sent as Authorization: Bearer <key>.

Path parameters

slugstringRequired

Request

This endpoint expects an object.
namestringRequired1-200 characters
turnslist of objectsRequired
seed_contextlist of objects or nullOptional

Context events planted before turn 1 (e.g. a fact to test leak).

assertionsobjectOptional
reply_modeenumOptionalDefaults to deep

Response

Successful Response

Errors

422
Unprocessable Entity Error