How were manual tweaks benchmarked?

question 1 connections

An audience member asked how the team knew whether a manual tweak was an improvement: did they rely on customer reactions or on benchmarking? Answer: the benchmark was the product and operations teams 'vibing' with the resulting boxes, that is, looking at each suggested box for a given input and asking 'would I feel good receiving this?'. It was a matter of taste and judgment, not a metric.

answer_summary

Benchmark was product/ops teams vibe-checking resulting boxes — taste-based, not metric-based.

question How were manual tweaks benchmarked?

asked_at

Accidentally building a neural network — A Ruby product recommendation journey talk

Asked during the post-talk Q&A.

Provenance

Created in: Nicolò Rebughini — Accidentally Building a Neural Network... 2026-04-22 09:03