Structured LLM Output

concept 4 connections

Technique for getting parseable data out of an LLM even though the model only emits text. The old approach (still visible in LangChain's prompts, which come in a JSON and a YAML retry form): ask the model to 'please generate data matching this JSON schema', validate the output, and on failure re-prompt with the error message and the schema. It is the same review-and-resubmit loop developers run with junior engineers. Modern LLM servers enforce structured output internally and return clean JSON, with messages listed as arrays of objects and tool calls as structured fields. Hasiński highlights that server-side enforcement is superior because fault-tolerant validation has to run per token, local to the process that samples the tokens; driving it from a client would add a remote round-trip for every token, and that latency would be prohibitive. llama.cpp already does something similar.
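A minimal Python sketch of both approaches, for illustration only. Everything named here is an assumption rather than something from the talk or from any real library: the two-field schema, the `llm` callable (any function mapping a prompt string to a reply string), `toy_score`, and the finite `allowed` set, which stands in for the grammar or schema a real server would compile. The first half is the validate-and-re-prompt loop; the second half is a toy version of per-token enforcement, where tokens that would make the output invalid are discarded before the highest-scoring remaining token is picked.

```python
import json
from typing import Callable, Iterable

# Hypothetical schema for the example: field name -> required Python type.
REQUIRED = {"name": str, "age": int}
SCHEMA_HINT = 'Reply with only JSON shaped like {"name": string, "age": integer}.'


def validate(text: str) -> dict:
    """Parse text as JSON and check it against REQUIRED; raise ValueError if it fails."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError as exc:
        raise ValueError(f"invalid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for field, typ in REQUIRED.items():
        if field not in data:
            raise ValueError(f"missing field {field!r}")
        if not isinstance(data[field], typ):
            raise ValueError(f"field {field!r} must be {typ.__name__}")
    return data


def generate_with_retry(llm: Callable[[str], str], task: str, max_attempts: int = 3) -> dict:
    """Old approach: ask, validate, and on failure re-prompt with the error and the schema."""
    prompt = f"{task}\n{SCHEMA_HINT}"
    for _ in range(max_attempts):
        reply = llm(prompt)
        try:
            return validate(reply)
        except ValueError as err:
            prompt = (
                f"{task}\nYour previous reply was:\n{reply}\n"
                f"It failed validation: {err}\n{SCHEMA_HINT}"
            )
    raise RuntimeError(f"no valid output after {max_attempts} attempts")


def toy_score(prefix: str, token: str) -> int:
    """Stand-in for model logits: this fake model would rather chat than emit JSON."""
    preference = {"Sure! ": 5, "maybe": 4, "false": 3, "true": 2, '{"ok": ': 1, "}": 0}
    return preference[token]


def constrained_decode(
    score: Callable[[str, str], int],
    vocab: Iterable[str],
    allowed: set[str],
    max_steps: int = 20,
) -> str:
    """Toy per-token enforcement: keep only tokens that leave the output a prefix of a valid string."""
    vocab = list(vocab)
    out = ""
    for _ in range(max_steps):
        if out in allowed:
            return out
        legal = [t for t in vocab if any(a.startswith(out + t) for a in allowed)]
        if not legal:
            raise RuntimeError(f"no legal continuation after {out!r}")
        out += max(legal, key=lambda t: score(out, t))
    raise RuntimeError("step limit reached before a complete output")


VOCAB = ["Sure! ", "maybe", "false", "true", '{"ok": ', "}"]
ALLOWED = {'{"ok": true}', '{"ok": false}'}
```

Calling `constrained_decode(toy_score, VOCAB, ALLOWED)` never emits the chatty `Sure! ` token even though the fake model ranks it highest, because the check runs before each token is committed. The retry loop, by contrast, only learns about a problem after a full reply has been generated, and pays for a whole extra generation each time it re-prompts.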

Provenance

Created in: Next Token! — Chris Hasiński on LLM falsehoods 2026-04-18 07:42
Read by: 9 extractions