mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2024-10-01 03:15:45 -04:00
Adding WCT and a separate curation section. (#110)
* Adding WCT and a separate curation section. WCT should clearly be on this list. The curation section is a proposal to capture any tools that integrate web archiving into curation workflows and tools. * Fix spacing of bullet
This commit is contained in:
parent
a9daaebc34
commit
7b5c80c44f
@ -16,6 +16,7 @@ Web archiving is the process of collecting portions of the World Wide Web to ens
|
|||||||
* [WARC I/O Libraries](#warc-io-libraries)
|
* [WARC I/O Libraries](#warc-io-libraries)
|
||||||
* [Analysis](#analysis)
|
* [Analysis](#analysis)
|
||||||
* [Quality Assurance](#quality-assurance)
|
* [Quality Assurance](#quality-assurance)
|
||||||
|
* [Curation](#curation)
|
||||||
* [Community Resources](#community-resources)
|
* [Community Resources](#community-resources)
|
||||||
* [Other Awesome Lists](#other-awesome-lists)
|
* [Other Awesome Lists](#other-awesome-lists)
|
||||||
* [Blogs and Scholarship](#blogs-and-scholarship)
|
* [Blogs and Scholarship](#blogs-and-scholarship)
|
||||||
@ -84,6 +85,7 @@ This list of tools and software is intended to briefly describe some of the most
|
|||||||
* [WARCreate](http://matkelly.com/warcreate/) - A [Google Chrome](https://www.google.com/intl/en/chrome/browser/) extension for archiving an individual webpage or website to a WARC file. *(Stable)*
|
* [WARCreate](http://matkelly.com/warcreate/) - A [Google Chrome](https://www.google.com/intl/en/chrome/browser/) extension for archiving an individual webpage or website to a WARC file. *(Stable)*
|
||||||
* [Warcworker](https://github.com/peterk/warcworker) - An open source, dockerized, queued, high fidelity web archiver based on Squidwarc with a simple web GUI. *(Stable)*
|
* [Warcworker](https://github.com/peterk/warcworker) - An open source, dockerized, queued, high fidelity web archiver based on Squidwarc with a simple web GUI. *(Stable)*
|
||||||
* [Web2Warc](https://github.com/helgeho/Web2Warc) - An easy-to-use and highly customizable crawler that enables anyone to create their own little Web archives (WARC/CDX). *(Stable)*
|
* [Web2Warc](https://github.com/helgeho/Web2Warc) - An easy-to-use and highly customizable crawler that enables anyone to create their own little Web archives (WARC/CDX). *(Stable)*
|
||||||
|
* [Web Curator Tool](https://webcuratortool.org) - Open-source workflow management for selective web archiving. *(Stable)*
|
||||||
* [WebMemex](https://github.com/WebMemex) - Browser extension for Firefox and Chrome which lets you archive web pages you visit. *(In Development)*
|
* [WebMemex](https://github.com/WebMemex) - Browser extension for Firefox and Chrome which lets you archive web pages you visit. *(In Development)*
|
||||||
* [Webrecorder](https://webrecorder.io/) - Create high-fidelity, interactive recordings of any web site you browse. *(Stable)*
|
* [Webrecorder](https://webrecorder.io/) - Create high-fidelity, interactive recordings of any web site you browse. *(Stable)*
|
||||||
* [Wget](http://www.gnu.org/software/wget/) - An open source file retrieval utility that of [version 1.14 supports writing warcs](http://www.archiveteam.org/index.php?title=Wget_with_WARC_output). *(Stable)*
|
* [Wget](http://www.gnu.org/software/wget/) - An open source file retrieval utility that of [version 1.14 supports writing warcs](http://www.archiveteam.org/index.php?title=Wget_with_WARC_output). *(Stable)*
|
||||||
@ -165,6 +167,9 @@ This list of tools and software is intended to briefly describe some of the most
|
|||||||
* [xDoTool](https://github.com/jordansissel/xdotool) - Click automation on Ubuntu.
|
* [xDoTool](https://github.com/jordansissel/xdotool) - Click automation on Ubuntu.
|
||||||
* [Xenu](http://home.snafu.de/tilman/xenulink.html) - Desktop link checker for Windows.
|
* [Xenu](http://home.snafu.de/tilman/xenulink.html) - Desktop link checker for Windows.
|
||||||
|
|
||||||
|
### Curation
|
||||||
|
|
||||||
|
* [Zotero Robust Links Extension](https://robustlinks.mementoweb.org/zotero/) - A [Zotero](https://www.zotero.org/) extension that submits to and reads from web archives. Source [on GitHub](https://github.com/lanl/Zotero-Robust-Links-Extension). Supercedes [leonkt/zotero-memento](https://github.com/leonkt/zotero-memento).
|
||||||
|
|
||||||
## Community Resources
|
## Community Resources
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user