Updates for Archives Unleashed projects. (#78)

- Deprecate AUK notebooks
- Add AU Notebooks
- Add twut
- Update the description for aut
- Update documentation link
This commit is contained in:
Nick Ruest 2020-02-25 10:27:36 -05:00 committed by GitHub
parent 99ec216d7c
commit e696a5e09b
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
2 changed files with 5 additions and 3 deletions

View File

@ -30,7 +30,7 @@ Web archiving, an effort to preserve the Web for future generations.
* The [offical ISO 28500 WARC specification homepage](http://bibnum.bnf.fr/WARC/).
* More advanced material:
* [Awesome Memento](https://github.com/machawk1/awesome-memento)
* [Archives Unleashed Toolkit documentation](http://archivesunleashed.org/aut/)
* [Archives Unleashed Toolkit documentation](https://github.com/archivesunleashed/aut-docs)
* [Heritrix Walkthrough](https://github.com/web-archive-group/heritrix-walkthrough) (In development)
* [The WARC Ecosystem](http://www.archiveteam.org/index.php?title=The_WARC_Ecosystem)
* [The Web Crawl section of COPTR](http://coptr.digipres.org/Category:Web_Crawl)
@ -137,8 +137,9 @@ This list of tools and software is intended to briefly describe some of the most
* [ArchiveSpark](https://github.com/helgeho/ArchiveSpark) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation. (Stable)
* [Archives Unleashed Cloud](https://cloud.archivesunleashed.org) - Archives Unleashed Cloud (AUK) is an web interface for analysing web archives. Currently, it can sync with Archive-It collections and extract hyperlink networks, full text, and other information from your collections. (Stable)
* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/auk-notebooks) - Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives.
* [Archives Unleashed Toolkit](https://github.com/archivesunleashed/aut) - Archives Unleashed Toolkit (AUT) is an open-source platform for analyzing web archives. (Stable)
* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/notebooks) - Notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit. (Stable)
* [Archives Unleashed Toolkit](https://github.com/archivesunleashed/aut) - Archives Unleashed Toolkit (AUT) is an open-source platform for analyzing web archives with Apache Spark. (Stable)
* [Tweet Archvies Unleashed Toolkit](https://github.com/archivesunleashed/twut) - An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark. (In Development)
### Quality Assurance

View File

@ -1,5 +1,6 @@
# Deprecated/Abandoned Items
* [ArchiveFacebook](https://addons.mozilla.org/en-US/firefox/addon/archivefacebook/) - (Abandoned) - A [Mozilla Firefox](https://www.mozilla.org/en-US/firefox/new/) add-on for individuals to archive their Facebook accounts.
* [Archives Unleashed Notebooks](https://github.com/archivesunleashed/auk-notebooks) - Jupyter notebooks to assist in creating additional analysis and visualizations of Archives Unleashed Cloud derivatives.
* [pywb Wayback Web Recorder (Archiver)](https://github.com/ikreymer/pywb-webrecorder) - (Sunsetted) - A bare-bones example of how to create a simple web recording and replay system.
* [Warrick](https://github.com/oduwsdl/warrick) - (Unmaintained) - An open source tool for reconstructing lost websites from web archives using [Memento](http://mementoweb.org/).