Add FastWARC (#114)

* Update README.md

* Capitalize the description to appease the linter
This commit is contained in:
Mat Kelly 2021-12-13 11:30:53 -05:00 committed by GitHub
parent 30661eacd0
commit 921cf36496
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 1 additions and 0 deletions

View File

@ -139,6 +139,7 @@ This list of tools and software is intended to briefly describe some of the most
### WARC I/O Libraries
* [FastWARC](https://github.com/chatnoir-eu/chatnoir-resiliparse) - A high-performance WARC parsing library (Python).
* [HadoopConcatGz](https://github.com/helgeho/HadoopConcatGz) - A Splitable Hadoop InputFormat for Concatenated GZIP Files (and `*.warc.gz`). *(Stable)*
* [jwarc](https://github.com/iipc/jwarc) - Reading and write WARC files with a typesafe API (Java).
* [Jwat](https://sbforge.org/display/JWAT/JWAT) - Libraries and tools for reading/writing/validating WARC/ARC/GZIP files (Java). *(Stable)*