--- title: ArchiveBox created: 2026-06-08 updated: 2026-06-08 type: app tags: [catalogue, archive, app-marathon-batch-b] confidence: high contested: false sources: [https://selfh.st/apps/?tag=Archive&app=archivebox, https://archivebox.io/] --- # 📩 ArchiveBox > **L'archive web open-source de rĂ©fĂ©rence** : snapshots HTML, PDF, screenshots, vidĂ©os, WARC, mĂ©dias sociaux. Tout en local, import depuis bookmarks, RSS, Pocket. ## 📋 Informations GĂ©nĂ©rales | Champ | Valeur | | :--- | :--- | | **Site web** | [archivebox.io](https://archivebox.io/) | | **GitHub** | [ArchiveBox/ArchiveBox](https://github.com/ArchiveBox/ArchiveBox) | | **License** | MIT | | **Langage** | Python | | **Étoiles GitHub** | 24k ⭐ | | **CatĂ©gorie** | [[cat-archive|Archive]] | ## 📝 Description **ArchiveBox** est un **"internet archiving"** self-hosted. Il capture des pages web sous plusieurs formats (HTML+ressources, PDF, screenshot PNG, WARC, mĂ©dias) pour les consulter hors-ligne, les partager, ou rĂ©sister au link rot. Import depuis Pocket, Pinboard, RSS, bookmarks navigateur, ligne de commande. DiffĂ©rence vs Wallabag : Wallabag = read-it-later (un seul format lisible). ArchiveBox = **multi-format, fidĂ©litĂ© d'archive**, plusieurs mĂ©thodes de capture par URL. Pour qui : journalistes, chercheurs, archivistes, particuliers inquiets de la disparition du contenu web ("digital preservation"). ## 🚀 Installation ### Docker Compose ```yaml version: '3.8' services: archivebox: image: archivebox/archivebox:latest container_name: archivebox restart: unless-stopped command: server --quick-init 0.0.0.0:8000 environment: - ALLOWED_HOSTS=archive.example.com - PUBLIC_INDEX=True - PUBLIC_SNAPSHOTS=True - MEDIA_MAX_SIZE=750m volumes: - archivebox_data:/data labels: - "traefik.enable=true" - "traefik.http.routers.archivebox.rule=Host(`archive.example.com`)" - "traefik.http.routers.archivebox.entrypoints=websecure" - "traefik.http.routers.archivebox.tls.certresolver=letsencrypt" volumes: archivebox_data: ``` ### Utilisation (CLI ou Web) ```bash # Ajouter une URL docker exec archivebox archivebox add 'https://example.com' # Importer des bookmarks Netscape docker exec archivebox archivebox add < bookmarks.html # Importer un flux RSS docker exec archivebox archivebox add --depth=1 'https://example.com/feed.xml' ``` ## 🔄 Alternatives ### Open Source - **Wallabag** — Read-it-later simple - [[app-project-nomad]] — Archive monorepo - [[app-ganymede]] — Archive vidĂ©o - **Shaarli** — Bookmarks personnels - **Perkeep (Camlistore)** — Personal cloud storage ### PropriĂ©taires - **Pocket** — Read-it-later Mozilla - **Pinboard** — Bookmarks payants - **Internet Archive** — Archive publique (Wayback) - **Wayback Machine Save** — Extension ## 🔐 SĂ©curitĂ© - **Auth** : email/password + SSO - **HTTPS** : obligatoire (URLs archivĂ©es souvent sensibles) - **Permissions fichiers** : dossiers snapshots en lecture seule cĂŽtĂ© public - **Pas d'auth par dĂ©faut** : activer `ALLOW_SNAPSHOT_LOCAL_ACCESS=False` - **CSP** : sandbox iframe pour les snapshots ## 📚 Ressources - [Site officiel](https://archivebox.io/) - [GitHub](https://github.com/ArchiveBox/ArchiveBox) - [Documentation](https://github.com/ArchiveBox/ArchiveBox/wiki) ## Pages LiĂ©es - [[cat-archive]] — CatĂ©gorie Archive - [[app-wallabag]] — Concurrent read-it-later - [[recettes-docker-compose]] — Templates Docker