mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2025-03-25 02:06:31 -04:00
Updates for Archives Unleashed projects. (#78)
- Deprecate AUK notebooks - Add AU Notebooks - Add twut - Update the description for aut - Update documentation link
This commit is contained in:
parent
99ec216d7c
commit
e696a5e09b
@ -30,7 +30,7 @@ Web archiving, an effort to preserve the Web for future generations.
|
||||
* The [offical ISO 28500 WARC specification homepage](http://bibnum.bnf.fr/WARC/).
|
||||
* More advanced material:
|
||||
* [Awesome Memento](https://github.com/machawk1/awesome-memento)
|
||||
* [Archives Unleashed Toolkit documentation](http://archivesunleashed.org/aut/)
|
||||
* [Archives Unleashed Toolkit documentation](https://github.com/archivesunleashed/aut-docs)
|
||||
* [Heritrix Walkthrough](https://github.com/web-archive-group/heritrix-walkthrough) (In development)
|
||||
* [The WARC Ecosystem](http://www.archiveteam.org/index.php?title=The_WARC_Ecosystem)
|
||||
* [The Web Crawl section of COPTR](http://coptr.digipres.org/Category:Web_Crawl)
|
||||
@ -137,8 +137,9 @@ This list of tools and software is intended to briefly describe some of the most
|
||||
|
||||
* [ArchiveSpark](https://github.com/helgeho/ArchiveSpark) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation. (Stable)
|
||||
* [Archives Unleashed Cloud](https://cloud.archivesunleashed.org) - Archives Unleashed Cloud (AUK) is an web interface for analysing web archives. Currently, it can sync with Archive-It collections and extract hyperlink networks, full text, and other information from your collections. (Stable)
|
||||
* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/auk-notebooks) - Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives.
|
||||
* [Archives Unleashed Toolkit](https://github.com/archivesunleashed/aut) - Archives Unleashed Toolkit (AUT) is an open-source platform for analyzing web archives. (Stable)
|
||||
* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/notebooks) - Notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit. (Stable)
|
||||
* [Archives Unleashed Toolkit](https://github.com/archivesunleashed/aut) - Archives Unleashed Toolkit (AUT) is an open-source platform for analyzing web archives with Apache Spark. (Stable)
|
||||
* [Tweet Archvies Unleashed Toolkit](https://github.com/archivesunleashed/twut) - An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark. (In Development)
|
||||
|
||||
### Quality Assurance
|
||||
|
||||
|
@ -1,5 +1,6 @@
|
||||
# Deprecated/Abandoned Items
|
||||
|
||||
* [ArchiveFacebook](https://addons.mozilla.org/en-US/firefox/addon/archivefacebook/) - (Abandoned) - A [Mozilla Firefox](https://www.mozilla.org/en-US/firefox/new/) add-on for individuals to archive their Facebook accounts.
|
||||
* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/auk-notebooks) - Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives.
|
||||
* [pywb Wayback Web Recorder (Archiver)](https://github.com/ikreymer/pywb-webrecorder) - (Sunsetted) - A bare-bones example of how to create a simple web recording and replay system.
|
||||
* [Warrick](https://github.com/oduwsdl/warrick) - (Unmaintained) - An open source tool for reconstructing lost websites from web archives using [Memento](http://mementoweb.org/).
|
||||
|
Loading…
x
Reference in New Issue
Block a user