mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2025-02-23 08:09:55 -05:00
Add WarcPartitioner (#11)
This commit is contained in:
parent
e4bf900190
commit
fbb28e3d06
@ -95,6 +95,8 @@ To the extent possible under law, the owner has waived all copyright and related
|
|||||||
|
|
||||||
* [Warcat](https://github.com/chfoo/warcat) (Stable) - Tool and library for handling Web ARChive (WARC) files.
|
* [Warcat](https://github.com/chfoo/warcat) (Stable) - Tool and library for handling Web ARChive (WARC) files.
|
||||||
|
|
||||||
|
* [WarcPartitioner](https://github.com/helgeho/WarcPartitioner) (Stable) - Partition (W)ARC Files by MIME Type and Year
|
||||||
|
|
||||||
#### Analysis
|
#### Analysis
|
||||||
|
|
||||||
* [ArchiveSpark](https://github.com/helgeho/ArchiveSpark) (Stable) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation.
|
* [ArchiveSpark](https://github.com/helgeho/ArchiveSpark) (Stable) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation.
|
||||||
|
Loading…
x
Reference in New Issue
Block a user