Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
ArchiveBox is an open-source, self-hosted web archiving solution. It allows users to collect, save, and view websites offline, preserving digital content against link rot. The project supports various input formats, extracts different content types, and stores data in durable formats.
The target audience includes researchers, journalists, lawyers, and archivists who need to preserve and analyze online content. It also appeals to individuals who want to safeguard their personal bookmarks, social media, and other important web pages. The project is designed for technically proficient users who are comfortable with self-hosting and command-line tools.
archivebox init
command.archivebox add
command, specifying input files or URLs directly.CodeRabbit AI - Ad
Cut Code Review Time & Bugs in Half!
Cut Code Review Time & Bugs in Half!
Ad