mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2025-03-22 00:36:30 -04:00
Add crocoite (#53)
This commit is contained in:
parent
a89a159a01
commit
494912c939
@ -71,6 +71,8 @@ This list of tools and software is intended to briefly describe some of the most
|
||||
|
||||
* [Brozzler](https://github.com/internetarchive/brozzler) (Stable) - A distributed web crawler (爬虫) that uses a real browser (chrome or chromium) to fetch pages and embedded urls and to extract links.
|
||||
|
||||
* [crocoite](https://github.com/promyloph/crocoite) (In Development) - Crawl websites using headless Google Chrome/Chromium and save resources, static DOM snapshot and page screenshots to WARC files.
|
||||
|
||||
* [F(b)arc](https://github.com/justinlittman/fbarc) (Stable) - A commandline tool and Python library for archiving data from [Facebook](https://www.facebook.com/) using the [Graph API](https://developers.facebook.com/docs/graph-api).
|
||||
|
||||
* [grab-site](https://github.com/ludios/grab-site) (Stable) - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns.
|
||||
|
Loading…
x
Reference in New Issue
Block a user