Design your search cluster

Build a distributed search engine (Elasticsearch / OpenSearch style) (12 scenes)

Scene 12 · Design your search cluster

Capstone: pick e-commerce, logs, or security analytics and configure shards, replicas, refresh, scoring, and ILM — the verifier traces every ✓/✗ back to the scene that earned it.

Previously

The same primitives — shards, replicas, refresh, scoring, tiers — configure radically different deployments. The capstone is choosing them deliberately, with each choice traceable to the scene that justifies it.

Scene 12

Design your search cluster

Watch

Diagram

Top: three workload archetype cards (e-commerce search, application logs, security analytics) — exactly one is active. Center: the design palette — primary shards, replicas, refresh interval, dfs_query_then_fetch toggle, ILM tiering depth. Right: the verifier panel — one row per knob, ✓ / ✗ / ! with the scene id that justifies the verdict (scene 4 = refresh, scene 5 = shards, scene 6 = replicas, scene 9 = dfs, scene 10 = aggs, scene 11 = ILM/operational). Bottom: verdict pill — fits the workload, or doesn't.

Three workloads, three honest configurations. The canvas snaps to the defaults for each archetype; read the verifier rows — every ✓ cites the scene that justifies it.

Implementation

archetype.ecommerce

small corpus, BM25 critical, dfs ON, ILM off

1# 5M product catalog · interactive browse · ranking is part of SLO
2# primary_shards = 2          (scene 5: small corpus, few shards)
3# replicas       = 1          (scene 6: HA + read fan-out)
4# refresh        = 1s         (scene 4: live catalog UI)
5# search_type    = dfs_query_then_fetch  (scene 9: stable rankings
6#                                          across reindex)
7# terms_agg      = default shard_size    (scene 10: not the bottleneck)
8# ILM            = off        (scene 11: products don't age out)

PreviousOperational sharp edges