From 494912c939deb700f49b231a3c5ab33810ae92ef Mon Sep 17 00:00:00 2001 From: Lars Date: Sat, 10 Nov 2018 16:46:24 +0100 Subject: [PATCH] Add crocoite (#53) --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index b44e034..ab18563 100644 --- a/README.md +++ b/README.md @@ -71,6 +71,8 @@ This list of tools and software is intended to briefly describe some of the most * [Brozzler](https://github.com/internetarchive/brozzler) (Stable) - A distributed web crawler (爬虫) that uses a real browser (chrome or chromium) to fetch pages and embedded urls and to extract links. +* [crocoite](https://github.com/promyloph/crocoite) (In Development) - Crawl websites using headless Google Chrome/Chromium and save resources, static DOM snapshot and page screenshots to WARC files. + * [F(b)arc](https://github.com/justinlittman/fbarc) (Stable) - A commandline tool and Python library for archiving data from [Facebook](https://www.facebook.com/) using the [Graph API](https://developers.facebook.com/docs/graph-api). * [grab-site](https://github.com/ludios/grab-site) (Stable) - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns.