mirror of
https://github.com/iipc/awesome-web-archiving.git
synced 2024-10-01 03:15:45 -04:00
Add FastWARC (#114)
* Update README.md
* Capitalize the description to appease the linter
parent 30661eacd0
commit 921cf36496
@@ -139,6 +139,7 @@ This list of tools and software is intended to briefly describe some of the most
### WARC I/O Libraries
* [FastWARC](https://github.com/chatnoir-eu/chatnoir-resiliparse) - A high-performance WARC parsing library (Python).
* [HadoopConcatGz](https://github.com/helgeho/HadoopConcatGz) - A Splittable Hadoop InputFormat for Concatenated GZIP Files (and `*.warc.gz`). *(Stable)*
* [jwarc](https://github.com/iipc/jwarc) - Read and write WARC files with a typesafe API (Java).
* [Jwat](https://sbforge.org/display/JWAT/JWAT) - Libraries and tools for reading/writing/validating WARC/ARC/GZIP files (Java). *(Stable)*