Web archiving - Claude skills for journalism

When to use

Service	Best for	API	Deletions
Wayback Machine	Historical research	Yes (free)	On request
Archive.today	Paywall bypass, quick saves	No	Never
Perma.cc	Legal citations	Yes (free tier)	By creator
ArchiveBox	Self-hosted, privacy	Local	Never
Conifer	Interactive content	Yes	By creator

Python code for checking availability, saving pages, and retrieving historical snapshots via CDX API.

Archive to Wayback, Archive.today, and Perma.cc simultaneously for maximum preservation.

Chain of custody documentation, content hashing, and timestamped capture records.

Self-hosted archiving setup, Python integration, and scheduled archiving workflows.

Try services in this order for maximum coverage:

916B+ pages, historical depth, API access

On-demand snapshots, paywall bypass

Recent pages, search: cache:url

Click dropdown arrow in search results

Searches multiple archives simultaneously

# Recommended: install the research-toolkit plugin

/plugin marketplace add jamditis/claude-skills-journalism

/plugin install research-toolkit@claude-skills-journalism

# Or copy just this skill from the plugin tree

git clone https://github.com/jamditis/claude-skills-journalism.git

cp -r claude-skills-journalism/research-toolkit/skills/web-archiving ~/.claude/skills/

Or browse this skill in the GitHub repository.

Source verification Web scraping Digital archive

Wayback Machine API, multi-archive redundancy, and legal evidence preservation in one skill.