How To Ensure Systems Do What We Want And Take Care Of Themselves

talk 22 connections

Michał Zajączkowski de Mezer's wroclove.rb 2022 talk. Language-agnostic with no code — a message-passing abstraction is used throughout. Core recipe: 'at-least-once + idempotence' enables components to self-heal and resume workflows after failure. Walks through processing guarantees / delivery semantics (at most once, at least once, exactly once) and argues exactly-once is what we almost always want but is hard without the recipe. Retry guidance: use exponential backoff so retries don't kill overloaded receivers; distinguish expected vs unexpected errors and only alert on the unexpected, using metrics/alarms to detect tendencies; decide retry duration based on the actor (humans ~1–5 retries fast; machines retry for days/weeks so systems heal while you're on vacation). Mentions timeouts, fail-fast, circuit breaker, back pressure, and rate limiting as further patterns. On idempotence, exercises 'protocol thinking' against HTTP: read operations are naturally idempotent; deletes are idempotent but receivers must recognize already-processed messages; PUT is idempotent only under a single-sender assumption — multi-sender races (lost update / mid-air collision) require optimistic locking with a version parameter; POST creation requires idempotency keys (unique tokens generated by the sender and indexed by the receiver) to avoid duplicates on retry. Also covers sender-chosen IDs via PUT with UUIDv4 as an alternative, and the fallback 'find-or-create' (read-check-write) pattern with its race and consistency caveats when the receiver is a third party without idempotency-key support — sometimes the best remedy is to negotiate a feature request or switch API providers. Q&A covers: retrying a POST whose source-of-truth entity changed in the meantime (apply optimistic locking with version 0); working with third-party APIs that support neither idempotency keys nor find-or-create (negotiate or accept duplicates); and why overworked receivers at scale make idempotency hard — addressed via CAP-theorem-style sharding with a deterministic hash-based partitioner so concurrent writes on the same entity land on the same partition.

date

2022-03-11

type

talk

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

At-Least-Once Plus Idempotence Recipe concept

Recipe is the central thesis of the talk.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Delivery Semantics concept

Introduces at-most-once, at-least-once, and exactly-once as the framing vocabulary.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Exponential Backoff concept

Recommended default backoff strategy for retries.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Expected vs Unexpected Errors concept

Argues for splitting error reporting into expected (metrics/alarms) vs unexpected (team alerts).

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Protocol Thinking concept

The discipline used to exercise HTTP verbs against failure scenarios.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Idempotence concept

Second half of the talk is dedicated to idempotence at the protocol level.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

HTTP Method Idempotence concept

Walks through GET, DELETE, PUT, POST under the protocol-thinking lens.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Optimistic Locking concept

Presented as the protocol-level solution to lost-update races on PUT/DELETE.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Lost Update Problem concept

Race condition motivating optimistic locking.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Idempotency Key concept

Convention for making POST creation idempotent.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Find-or-Create Pattern concept

Fallback when a third-party API lacks idempotency-key support.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

about

Sharding concept

Raised in Q&A as the CAP-theorem answer to race-load scaling.

question Handling third-party APIs without idempotency keys

asked_at

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Second audience question in the Q&A.

question Scaling receivers vs implementing idempotency

asked_at

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Third audience question in the Q&A.

question How should POST retries handle changed source-of-truth state?

asked_at

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

First audience question in the Q&A.

person Michał Zajączkowski de Mezer

authored

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Single-speaker presentation delivered at wroclove.rb 2022.

takeaway Use at-least-once plus idempotence to build self-healing systems

from_talk

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Single 'remember-this-one-thing' takeaway of the presentation.

takeaway Don't kill dependencies — use backoff

from_talk

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Advice given in the retry section.

takeaway Don't kill your team — alert only on unexpected errors

from_talk

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Advice given in the error-reporting section.

takeaway Tune retry duration to the actor

from_talk

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Guidance on when to stop retrying based on human vs machine senders.

takeaway Negotiate for idempotency-key support

from_talk

How To Ensure Systems Do What We Want And Take Care Of Themselves talk

Advice for integrating with third parties lacking idempotency support.

talk How To Ensure Systems Do What We Want And Take Care Of Themselves

presented_at

wroclove.rb 2022 event

Talk delivered at wroclove.rb 2022 on 2022-03-11.

Provenance

Created: 2026-04-17 16:17 seed
Last updated in: How To Ensure Systems Do What We Want And Take Care Of Th... 2026-04-17 21:51
Read by: 14 extractions