mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2025-03-13 04:36:30 -04:00
Add Community Archive (Twitter Archive and API)
Co-Authored-By: Gabriel Chartier <gabriel@chartier.link>
This commit is contained in:
parent
cf4504832d
commit
9c3772e74f
@ -73,6 +73,7 @@ This list of tools and software is intended to briefly describe some of the most
|
||||
* [Brozzler](https://github.com/internetarchive/brozzler) - A distributed web crawler (爬虫) that uses a real browser (Chrome or Chromium) to fetch pages and embedded urls and to extract links. *(Stable)*
|
||||
* [Cairn](https://github.com/wabarc/cairn) - A npm package and CLI tool for saving webpages. *(Stable)*
|
||||
* [Chronicler](https://github.com/CGamesPlay/chronicler) - Web browser with record and replay functionality. *(In Development)*
|
||||
* [Community Archive](https://www.community-archive.org/) - Open Twitter Database and API with tools and resources for building on archived Twitter data.
|
||||
* [crau](https://github.com/turicas/crau) - crau is the way (most) Brazilians pronounce crawl, it's the easiest command-line tool for archiving the Web and playing archives: you just need a list of URLs. *(Stable)*
|
||||
* [Crawl](https://git.autistici.org/ale/crawl) - A simple web crawler in Golang. *(Stable)*
|
||||
* [crocoite](https://github.com/promyloph/crocoite) - Crawl websites using headless Google Chrome/Chromium and save resources, static DOM snapshot and page screenshots to WARC files. *(In Development)*
|
||||
|
Loading…
x
Reference in New Issue
Block a user