Claude Code and the Wayback Machine
Two useful things from today: Ctrl+E in Claude Code and...
Did you know the Wayback Machine has an API you can query straight from the command line?!
The Wayback CDX API is the index/search API for the Internet Archive's Wayback Machine. Instead of manually browsing web.archive.org to see what was captured
for a domain, you can query the CDX API programmatically:
json&limit=5"
It returns a list of the snapshots (captures) the Wayback Machine has for that
domain, capped at whatever limit you set: timestamps, URLs, status codes, MIME types. This tells us:
- How many pages were captured (content richness)
- What URLs existed (site structure)
- When they were captured (which eras of the site we can reconstruct)
- Whether there's enough material to justify a rebuild
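Here's a rough Python sketch of how those four signals could be pulled out of a CDX response. It assumes the JSON output format, where the first row is a header (urlkey, timestamp, original, mimetype, statuscode, ...) and each later row is one capture. The function names and the matchType=domain choice are my own assumptions for illustration, not something from the original query.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_url(domain, limit=5):
    """Build a CDX query URL (hypothetical helper; parameters are assumptions)."""
    params = {
        "url": domain,
        "matchType": "domain",  # include subdomains; an assumption
        "output": "json",
        "limit": limit,
    }
    return CDX_ENDPOINT + "?" + urlencode(params)


def parse_cdx_rows(rows):
    """Turn the header-plus-rows JSON layout into a list of dicts."""
    if not rows:
        return []
    header, *captures = rows
    return [dict(zip(header, row)) for row in captures]


def summarize(captures):
    """Count captures and unique URLs, and find the first and last capture year."""
    timestamps = sorted(c["timestamp"] for c in captures)
    return {
        "captures": len(captures),
        "unique_urls": len({c["original"] for c in captures}),
        "first_year": timestamps[0][:4] if timestamps else None,
        "last_year": timestamps[-1][:4] if timestamps else None,
    }


def fetch_captures(domain, limit=5):
    """Fetch and parse captures for a domain (makes a network request)."""
    with urlopen(build_cdx_url(domain, limit), timeout=30) as resp:
        body = resp.read().decode("utf-8")
    return parse_cdx_rows(json.loads(body)) if body.strip() else []
```

Calling summarize(fetch_captures("example.com", limit=1000)) would give a quick read on one domain; raise or drop the limit to see more of the capture history.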
For the remaining domains, running CDX queries lets us rank them by content richness, so we know which ones are worth rebuilding and which are thin enough to just redirect. It's step 1 in the process, before you'd need to pull an Archivarix export.
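The ranking step could look something like this minimal sketch. It takes a mapping of domain to unique archived URL count (however you gathered it) and labels each domain. The min_urls cutoff of 20 is a made-up placeholder, not a recommendation; pick whatever threshold makes sense for your sites.

```python
def rank_domains(url_counts, min_urls=20):
    """Sort domains by unique archived URLs, richest first, and label each one.

    url_counts: dict of domain -> number of unique archived URLs.
    min_urls: hypothetical cutoff below which a domain is considered thin.
    """
    ranked = sorted(url_counts.items(), key=lambda kv: kv[1], reverse=True)
    return [
        (domain, count, "rebuild" if count >= min_urls else "redirect")
        for domain, count in ranked
    ]
```

The output is a list of (domain, count, decision) tuples, so the domains worth an Archivarix export land at the top.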
Guerin Green