Commit Graph

102 Commits

Author SHA1 Message Date
Peter Krantz
285006a76d Added Warcworker (#56)
* Added Warcworker
2018-11-26 08:10:21 -05:00
Ian Milligan
4976c2b592 Adds Archives Unleashed Cloud and updates AUT (#55) 2018-11-16 19:22:49 -05:00
raffaele messuti
e2cde6b83f new tools: crawl and wasp (#54) 2018-11-12 15:20:39 -05:00
Lars
494912c939 Add crocoite (#53) 2018-11-10 10:46:24 -05:00
Andy Jackson
a89a159a01 Moved webarchive-discovery associated tools to be together (#50)
As suggested in https://github.com/iipc/awesome-web-archiving/pull/47#issue-195740023 this moves the `webarchive-discovery`-related tools under a `webarchive-discovery` section.
2018-10-16 07:48:46 -04:00
Andy Jackson
2d394f9a49
Create section for guidance for web publishers (#49)
* Some clean up and added Slack.

* Separate the basic and mroe advanced stuff, and add the intro video in.

* Added some new links and detail responding to #22.

* Add specific section for web publishers.
2018-10-16 12:27:37 +01:00
Toke Eskildsen
9e6d936a82 Added SolrWayback (#47)
* Added SolrWayback for both replay and discovery

* Removed SolrWayback from playback as it was confusing to list it under two different headings on the same page
2018-10-16 12:27:01 +01:00
Alex Osborne
c15b3c97e8
Heritrix wiki has moved to Github 2018-07-06 09:06:39 +09:00
Nick Sweeting
42e97d36ac Add Bookmark-Archiver (#46) 2018-06-24 16:00:21 -04:00
Alex Osborne
878d982775 Add OutbackCDX (#45) 2018-05-23 10:27:56 +01:00
Mat Kelly
9fcff939e3 Add ArchiveTools per #42 (#44)
Ping @ruebot
2018-05-15 12:18:27 -04:00
Ian Milligan
c6f6b4656d Updating old documentation links to new ones (#43) 2018-04-06 09:16:30 -04:00
IAMONSYS GmbH
063e0f2f35 Adding securitytrails archive (#41) 2018-03-14 08:51:20 -04:00
Jeremy Cahill
7964fca0e7 Move ArchiveFacebook to Deprecated (#40)
Project's FF addons page is disabled. Source appears to have been ported from Google Code in anticipation of updates that didn't materialize: https://groups.google.com/forum/?hl=en#!topic/archivefacebook/_m8KeOTnBng
2017-12-07 20:58:47 -05:00
Nick Ruest
03aa9703fd Update warclight (#35)
* Update warclight

* s/warcbase/aut/
2017-09-17 16:26:29 +01:00
Ashley
4013e4b8e2 rm duplicate link to awesome-momento (#39) 2017-09-16 14:50:09 -04:00
Ross Spencer
06b47c0f23 Added HTTPreserve Workbench (#37)
* Added HTTPreserve Workbench.

* Added language to HTTPreserve Workbench.
2017-09-03 19:18:11 -04:00
Ross Spencer
7e9671f411 Added HTTPreserve tikalinkextract. (#36) 2017-09-03 09:41:30 -04:00
Ross Spencer
17a41aca7e Added httpreserve.info (#38) 2017-09-03 09:40:43 -04:00
Patrick Connolly
11a60a2301 Added Archivers slack team. (#34) 2017-08-13 18:48:50 -04:00
Ian Milligan
5cf01e48df New sign-up process for Archives Unleashed slack (#33)
Changes from e-mail @ianmilligan1 to the @ruebot-created form.
2017-08-11 07:55:50 -04:00
Helge Holzmann
80deff9b4b Add Tempas v1 and v2 (#32)
* Add Tempas v1

* Add Tempas v2
2017-07-26 08:15:54 -04:00
John Berlin
63a410126d Add node-cdxj to the list (#31) 2017-07-24 23:02:15 -04:00
John Berlin
2d46142baa Updated node-warcs entry in the list to reflect http://ws-dl.blogspot.com/2017/07/2017-07-24-replacing-heritrix-with.html and WAILs + Squidwarcs usage of this library (#30) 2017-07-24 22:25:18 -04:00
John Berlin
3505b572dd Add Squidwarc to the list (#29) 2017-07-24 22:24:44 -04:00
Mohamed Aturban
d7fd3167a2 Add archivenow to the list (#28) 2017-07-10 13:05:18 +01:00
Mat Kelly
31389f46b9 Updates to PR#24 by @kant as recommended by @ruebot (#27)
* Minor fixes

* Changes per @ruebot in PR#14
2017-07-08 09:54:17 -04:00
Mat Kelly
c5b04a33e8 Add InterPlanetary Wayback (#26)
I deliberated under which category this should fit but replay seems most appropriate.
2017-07-07 21:55:29 +01:00
Mat Kelly
5dc48f29b8 Fix spelling (#25) 2017-07-07 15:21:25 +01:00
Andy Jackson
d820480d6e Add some links to other resources and clarifications (#23)
* Some clean up and added Slack.

* Separate the basic and mroe advanced stuff, and add the intro video in.

* Added some new links and detail responding to #22.
2017-06-26 16:38:34 -04:00
Mat Kelly
15b429d288 Update README.md (#21) 2017-06-23 12:57:54 -04:00
nruest
38b2694985
Test for http://netpreserve.org/web-archiving/tools-and-software embed 2017-06-22 16:08:03 -04:00
Mat Kelly
2a8288928e Rm superfluous paren (#20) 2017-06-21 17:17:45 -04:00
raffaele messuti
107fb052a3 add warcio, warctools, har2warc, node-warc, go webarchive (#19)
* warcat: still in utilities

* add webarchive-indexing

* add The Archive Browser

* add warcio, warctools, har2warc, node-warc, go webarchive
2017-06-21 16:01:56 -04:00
Nick Ruest
babfbac355 Add Heritrix Walkthrough (#18) 2017-06-20 10:05:35 +01:00
Mat Kelly
1634b60c96 Add "The Unarchiver" app. (#17)
A free variant of the already included "The Archive Browser" limited to the extraction features.
2017-06-19 11:28:32 -04:00
Kristinn Sigurðsson
d52d478000 Update README.md
Fixed incorrect alphabetical ordering of item.
2017-06-19 10:55:54 +00:00
Steffen
c370e303dc added html2warc (#16)
added html2warc, a simple script to convert offline data into a single warc file
2017-06-18 10:26:35 -04:00
raffaele messuti
4e413e2342 add webarchive-index and "the archive browser", remove warccat duplicate (#15)
* warcat: still in utilities

* add webarchive-indexing

* add The Archive Browser
2017-06-17 16:51:37 -04:00
Nick Ruest
c3658d76da Search & discovery (#14) 2017-06-17 13:08:44 +01:00
Nick Ruest
5788c2653c Add wasapi-downloader (#13) 2017-06-17 13:07:20 +01:00
Nick Ruest
d90f365477 Add more warcbase documentation/tranining (#12) 2017-06-17 13:06:41 +01:00
Ian Milligan
cd27e8c2f2 Adding some open scholarship resources (#10)
* added Web as History, SAA link, Archives Unleashed Slack

* as per @ruebot suggestion, alphabetizing resources
2017-06-16 12:35:14 -04:00
Helge Holzmann
fbb28e3d06 Add WarcPartitioner (#11) 2017-06-16 12:07:29 -04:00
Helge Holzmann
e4bf900190 Add HadoopConcatGz (#9) 2017-06-16 11:58:38 -04:00
Helge Holzmann
f8b58e3624 Add Web2Warc (#7) 2017-06-16 11:40:53 -04:00
Andy Jackson
4de1c1d2b3 Separate type of training material and add intro video (#5)
* Some clean up and added Slack.

* Separate the basic and mroe advanced stuff, and add the intro video in.
2017-06-16 11:19:50 -04:00
raffaele messuti
a04b77dcf6 new: webrecorder player in replay section (#6) 2017-06-16 16:18:32 +01:00
Nick Ruest
757c8967a3 toc anchors (#4) 2017-06-16 16:15:15 +01:00
Andy Jackson
e59333a0bd Some clean up and added Slack. (#3) 2017-06-16 10:27:02 -04:00