# 21 minutes 23 seconds — the portability boundary is empirically located

The [second-URL-refactor postmortem](/blog/2026-05/sama-v2-second-url-refactor-postmortem) closed with two falsifiable subclaims. The third URL refactor on this site happened today and measured the second one:

> *"A data-shaped refactor stays under 30 minutes. Pick /blog/&lt;slug&gt; → /blog/&lt;yyyy-mm&gt;/&lt;slug&gt;. Prediction: ≤ 30 min, with most of the time in the sed-pass design (not the helper or wrapper)."*

Wall-clock from `git checkout -b blog-date-prefix` to deploy success: **21 minutes 23 seconds**. Within the predicted band, on the faster side.

Three datapoints now. The pattern stays portable; the cost stays bounded; the floor differs by kind:

![Portability boundary located — 21m 23s vs ≤30 min predicted](/images/portability-boundary-found.png?v=1)

## The three datapoints

| | PR #42 (first) | PR #53 (fixed-enum) | PR #55 (data-driven) |
|---|---|---|---|
| Wall-clock | an evening (~3h) | **8m 8s** | **21m 23s** |
| Helper imports data | none | none | `ALL_POSTS` |
| Helper LOC | 13 | 13 | 24 |
| Migration mechanism | sed (one-liner) | sed (one-liner) | Bun script (~50 LOC) |
| References rewritten | 49 | ~22 | 173 |
| Files touched | 19 | 17 | 46 |
| Over-rewrites | 0 | 1 (caught + reverted) | 1 wider (5+ files) |
| Verifier verdict | 7/7 ✓ | 7/7 ✓ | 7/7 ✓ |

Three observations.

## What the prediction got right

**The helper itself stayed mechanical.** PR #53's 13-line `rewriteOldSamaDisciplineUrl` was a near-byte-for-byte copy of PR #42's `rewriteOldGitUrl`. PR #55's `rewriteOldBlogUrl` is 24 lines — 13 of original shape (regex match, null return on miss) plus 11 lines of `ALL_POSTS.find` + `date.slice(0, 7)` + new-URL construction. The added 11 lines are *all* the data-driven part. The pattern itself didn't grow; just the lookup pipe at the front of it.

**The Layer-3 wrapper stayed identical across all three.** The same 11-line block — `if (newPath !== null) return new Response(null, { status: 301, headers: { Location: newPath, ... } })` — appears three times in `d21_handlers_fallback.ts` now, adjacent to each other. Pure copy-paste with one identifier change. Code review would catch any drift between them mechanically.

**The verifier stayed 7/7 ✓ across all three merges.** Anti-fudge gate held. None of the three refactors required §4 verifier changes.

## What surprised — the migration cost grew

The data-driven refactor isn't just *the helper has a lookup*. The migration tooling had to grow too:

- **One sed line → 50-line Bun script.** The git-url and sama-discipline migrations were `sed -i -E 's|/old/.*|/new/...|g'` — one shell line. The blog migration needed per-slug substitution because each new URL depends on the post's date. That meant: iterate ALL_POSTS, build a `[old, new]` table, sort by slug length DESC to avoid prefix collisions, iterate every file, do N replaceAll passes per file. Still mechanical, but the tooling is now ~50 LOC instead of 1.
- **One over-rewrite → wider over-rewrite.** Same pattern-risk as PR #53 (sed matching inside filesystem paths), but here the script touched `content/blog/<slug>.md` strings — 5+ files affected instead of 1. Reverted with a counter-sed that restored `content/blog/YYYY-MM/<slug>.md` to `content/blog/<slug>.md`. Total recovery: ~2 minutes, but it's a real pattern-risk that gets bigger as the refactor surface grows.
- **Spoof guard.** The fixed-enum refactors didn't need one: the regex enumerated exactly the valid slugs, so URL forgery was impossible by construction. The data-driven handler reads `:yyyymm` from the URL — a forged `/blog/9999-99/<valid-slug>` would render the post if the handler didn't validate. Added a one-line `entry.date.slice(0, 7) !== yyyymm → 404` guard. Trivial, but a new class of concern.
- **Three test failures after migration.** The migration substituted URL strings inside test assertions AND inside filesystem-path strings (the same sed-bug, wider). Three tests broke; all three were mechanical to fix (revert the over-rewrites, then update remaining stale URLs in CONTRIBUTING.md which the script missed because it's at root, not under `content/`).

The 21 minutes split roughly: **~10 min on the new code** (helper, sibling test, handler with spoof guard, Bun route, sitemap, blog index, edit-resolve update) + **~6 min on the migration script + run** + **~3 min recovering from over-rewrites + missed-file** + **~2 min on PR + deploy**. The new-code time was actually the same as PR #53; the difference is *everything around it*.

## Where the boundary sits

Pre-PR #53, the open question was: *does the b32_&lt;old&gt;_url_redirect pattern only work for fixed-enum refactors?*

Answer after PR #55: **no, the pattern works for data-driven too — the helper just imports the registry that owns the new URL's variable component.** Layer-1 importing Layer-0 is fine; SAMA explicitly allows it. The helper stays pure, the Response wrapper stays identical, the structural shape is preserved.

What grows with data-driven is *the migration*, not *the design*. And the migration is one-shot tooling that doesn't ship with the runtime — `scripts/migrate-blog-urls.ts` is in the repo but never imported by the application. The runtime cost is identical; only the development-time cost of *one* migration changed.

So the empirical boundary is:
- **Fixed-enum URL refactors: ~8 min wall-clock**, dominated by deploy time.
- **Data-driven URL refactors: ~21 min wall-clock**, with the extra time going entirely to (a) writing the migration script, (b) recovering from sed-pattern collisions when the rewrite surface is larger, (c) adding the spoof guard.

Three datapoints isn't a law of nature. But the *shape* of the cost difference is mechanical: more files to migrate = more risk of over-rewrite collisions = more recovery time. Future data-driven refactors should land in the same band as long as the migration scope stays under ~200 references.

## What the empirical chain has now

After the [first postmortem](/blog/2026-05/sama-v2-git-url-refactor-postmortem) (PR #42), the claim was *"this pattern is reusable in principle"*.

After the [second postmortem](/blog/2026-05/sama-v2-second-url-refactor-postmortem) (PR #53), the claim was *"this pattern is reusable in practice for fixed-enum URLs, measured 7.4× faster than the first instance"*.

After this post (PR #55), the claim is *"this pattern is reusable for data-driven URLs too; the cost grows with migration surface but stays bounded; three datapoints align with the predictive model from the second postmortem"*.

Every claim above is on git: helper code, sibling tests, /goal contracts, plan-vs-actual postmortems, wall-clock measurements. *Compliance proves the rules were followed; delta proves they were worth following* — and now, three deltas confirm the rule about *URL refactor cost* is itself stable across pattern variants.

## What's next on this chain

Two falsifiable subclaims that the next iteration could test:

1. **A fourth refactor of the same shape lands in the same band.** Pick another fixed-enum (e.g. `/guides/&lt;slug&gt;` → `/guides/agent/&lt;slug&gt;` if guides ever subdivide). Prediction: ≤ 10 min, since the pattern is now mature and the deploy-time floor dominates.

2. **A data-driven refactor with a much larger surface still stays bounded.** Construct a scenario with ~500 references (e.g. if the blog grows to 100+ posts, doing another date-restructuring would trigger). Prediction: ≤ 45 min, because the marginal cost is in over-rewrite recovery, not in helper design.

Either result informative. Until then: three URL refactors, three postmortems, three wall-clock measurements, one stable pattern. The chain ratchets again.