Peter Krantz
285006a76d
Added Warcworker ( #56 )
...
* Added Warcworker
2018-11-26 08:10:21 -05:00
Ian Milligan
4976c2b592
Adds Archives Unleashed Cloud and updates AUT ( #55 )
2018-11-16 19:22:49 -05:00
raffaele messuti
e2cde6b83f
new tools: crawl and wasp ( #54 )
2018-11-12 15:20:39 -05:00
Lars
494912c939
Add crocoite ( #53 )
2018-11-10 10:46:24 -05:00
Andy Jackson
a89a159a01
Moved webarchive-discovery associated tools to be together ( #50 )
...
As suggested in https://github.com/iipc/awesome-web-archiving/pull/47#issue-195740023 this moves the `webarchive-discovery`-related tools under a `webarchive-discovery` section.
2018-10-16 07:48:46 -04:00
Andy Jackson
2d394f9a49
Create section for guidance for web publishers ( #49 )
...
* Some clean up and added Slack.
* Separate the basic and mroe advanced stuff, and add the intro video in.
* Added some new links and detail responding to #22 .
* Add specific section for web publishers.
2018-10-16 12:27:37 +01:00
Toke Eskildsen
9e6d936a82
Added SolrWayback ( #47 )
...
* Added SolrWayback for both replay and discovery
* Removed SolrWayback from playback as it was confusing to list it under two different headings on the same page
2018-10-16 12:27:01 +01:00
Alex Osborne
c15b3c97e8
Heritrix wiki has moved to Github
2018-07-06 09:06:39 +09:00
Nick Sweeting
42e97d36ac
Add Bookmark-Archiver ( #46 )
2018-06-24 16:00:21 -04:00
Alex Osborne
878d982775
Add OutbackCDX ( #45 )
2018-05-23 10:27:56 +01:00
Mat Kelly
9fcff939e3
Add ArchiveTools per #42 ( #44 )
...
Ping @ruebot
2018-05-15 12:18:27 -04:00
Ian Milligan
c6f6b4656d
Updating old documentation links to new ones ( #43 )
2018-04-06 09:16:30 -04:00
IAMONSYS GmbH
063e0f2f35
Adding securitytrails archive ( #41 )
2018-03-14 08:51:20 -04:00
Jeremy Cahill
7964fca0e7
Move ArchiveFacebook to Deprecated ( #40 )
...
Project's FF addons page is disabled. Source appears to have been ported from Google Code in anticipation of updates that didn't materialize: https://groups.google.com/forum/?hl=en#!topic/archivefacebook/_m8KeOTnBng
2017-12-07 20:58:47 -05:00
Nick Ruest
03aa9703fd
Update warclight ( #35 )
...
* Update warclight
* s/warcbase/aut/
2017-09-17 16:26:29 +01:00
Ashley
4013e4b8e2
rm duplicate link to awesome-momento ( #39 )
2017-09-16 14:50:09 -04:00
Ross Spencer
06b47c0f23
Added HTTPreserve Workbench ( #37 )
...
* Added HTTPreserve Workbench.
* Added language to HTTPreserve Workbench.
2017-09-03 19:18:11 -04:00
Ross Spencer
7e9671f411
Added HTTPreserve tikalinkextract. ( #36 )
2017-09-03 09:41:30 -04:00
Ross Spencer
17a41aca7e
Added httpreserve.info ( #38 )
2017-09-03 09:40:43 -04:00
Patrick Connolly
11a60a2301
Added Archivers slack team. ( #34 )
2017-08-13 18:48:50 -04:00
Ian Milligan
5cf01e48df
New sign-up process for Archives Unleashed slack ( #33 )
...
Changes from e-mail @ianmilligan1 to the @ruebot-created form.
2017-08-11 07:55:50 -04:00
Helge Holzmann
80deff9b4b
Add Tempas v1 and v2 ( #32 )
...
* Add Tempas v1
* Add Tempas v2
2017-07-26 08:15:54 -04:00
John Berlin
63a410126d
Add node-cdxj to the list ( #31 )
2017-07-24 23:02:15 -04:00
John Berlin
2d46142baa
Updated node-warcs entry in the list to reflect http://ws-dl.blogspot.com/2017/07/2017-07-24-replacing-heritrix-with.html and WAILs + Squidwarcs usage of this library ( #30 )
2017-07-24 22:25:18 -04:00
John Berlin
3505b572dd
Add Squidwarc to the list ( #29 )
2017-07-24 22:24:44 -04:00
Mohamed Aturban
d7fd3167a2
Add archivenow to the list ( #28 )
2017-07-10 13:05:18 +01:00
Mat Kelly
31389f46b9
Updates to PR#24 by @kant as recommended by @ruebot ( #27 )
...
* Minor fixes
* Changes per @ruebot in PR#14
2017-07-08 09:54:17 -04:00
Mat Kelly
c5b04a33e8
Add InterPlanetary Wayback ( #26 )
...
I deliberated under which category this should fit but replay seems most appropriate.
2017-07-07 21:55:29 +01:00
Mat Kelly
5dc48f29b8
Fix spelling ( #25 )
2017-07-07 15:21:25 +01:00
Andy Jackson
d820480d6e
Add some links to other resources and clarifications ( #23 )
...
* Some clean up and added Slack.
* Separate the basic and mroe advanced stuff, and add the intro video in.
* Added some new links and detail responding to #22 .
2017-06-26 16:38:34 -04:00
Mat Kelly
15b429d288
Update README.md ( #21 )
2017-06-23 12:57:54 -04:00
nruest
38b2694985
Test for http://netpreserve.org/web-archiving/tools-and-software embed
2017-06-22 16:08:03 -04:00
Mat Kelly
2a8288928e
Rm superfluous paren ( #20 )
2017-06-21 17:17:45 -04:00
raffaele messuti
107fb052a3
add warcio, warctools, har2warc, node-warc, go webarchive ( #19 )
...
* warcat: still in utilities
* add webarchive-indexing
* add The Archive Browser
* add warcio, warctools, har2warc, node-warc, go webarchive
2017-06-21 16:01:56 -04:00
Nick Ruest
babfbac355
Add Heritrix Walkthrough ( #18 )
2017-06-20 10:05:35 +01:00
Mat Kelly
1634b60c96
Add "The Unarchiver" app. ( #17 )
...
A free variant of the already included "The Archive Browser" limited to the extraction features.
2017-06-19 11:28:32 -04:00
Kristinn Sigurðsson
d52d478000
Update README.md
...
Fixed incorrect alphabetical ordering of item.
2017-06-19 10:55:54 +00:00
Steffen
c370e303dc
added html2warc ( #16 )
...
added html2warc, a simple script to convert offline data into a single warc file
2017-06-18 10:26:35 -04:00
raffaele messuti
4e413e2342
add webarchive-index and "the archive browser", remove warccat duplicate ( #15 )
...
* warcat: still in utilities
* add webarchive-indexing
* add The Archive Browser
2017-06-17 16:51:37 -04:00
Nick Ruest
c3658d76da
Search & discovery ( #14 )
2017-06-17 13:08:44 +01:00
Nick Ruest
5788c2653c
Add wasapi-downloader ( #13 )
2017-06-17 13:07:20 +01:00
Nick Ruest
d90f365477
Add more warcbase documentation/tranining ( #12 )
2017-06-17 13:06:41 +01:00
Ian Milligan
cd27e8c2f2
Adding some open scholarship resources ( #10 )
...
* added Web as History, SAA link, Archives Unleashed Slack
* as per @ruebot suggestion, alphabetizing resources
2017-06-16 12:35:14 -04:00
Helge Holzmann
fbb28e3d06
Add WarcPartitioner ( #11 )
2017-06-16 12:07:29 -04:00
Helge Holzmann
e4bf900190
Add HadoopConcatGz ( #9 )
2017-06-16 11:58:38 -04:00
Helge Holzmann
f8b58e3624
Add Web2Warc ( #7 )
2017-06-16 11:40:53 -04:00
Andy Jackson
4de1c1d2b3
Separate type of training material and add intro video ( #5 )
...
* Some clean up and added Slack.
* Separate the basic and mroe advanced stuff, and add the intro video in.
2017-06-16 11:19:50 -04:00
raffaele messuti
a04b77dcf6
new: webrecorder player in replay section ( #6 )
2017-06-16 16:18:32 +01:00
Nick Ruest
757c8967a3
toc anchors ( #4 )
2017-06-16 16:15:15 +01:00
Andy Jackson
e59333a0bd
Some clean up and added Slack. ( #3 )
2017-06-16 10:27:02 -04:00