mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2025-02-22 15:49:56 -05:00
Add WarcPartitioner (#11)
This commit is contained in:
parent
e4bf900190
commit
fbb28e3d06
@ -95,6 +95,8 @@ To the extent possible under law, the owner has waived all copyright and related
|
||||
|
||||
* [Warcat](https://github.com/chfoo/warcat) (Stable) - Tool and library for handling Web ARChive (WARC) files.
|
||||
|
||||
* [WarcPartitioner](https://github.com/helgeho/WarcPartitioner) (Stable) - Partition (W)ARC Files by MIME Type and Year
|
||||
|
||||
#### Analysis
|
||||
|
||||
* [ArchiveSpark](https://github.com/helgeho/ArchiveSpark) (Stable) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation.
|
||||
|
Loading…
x
Reference in New Issue
Block a user