From 99241ae461605f253d7a99baf255f1555b88c817 Mon Sep 17 00:00:00 2001 From: lasztoth <113107827+lasztoth@users.noreply.github.com> Date: Mon, 6 May 2024 14:26:07 +0200 Subject: [PATCH] Added warc-safe to list (#148) --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 1150b2d..fa7b4ac 100644 --- a/README.md +++ b/README.md @@ -153,6 +153,7 @@ This list of tools and software is intended to briefly describe some of the most * [Warchaeology](https://nlnwa.github.io/warchaeology/) - Warchaeology is a collection of tools for inspecting, manipulating, deduplicating and validating WARC-files. *Stable* * [warcdb](https://github.com/florents-Tselai/warcdb) - A command line utility (Python) for importing WARC files into a SQLite database. *(Stable)* * [warcdedupe](https://gitlab.com/taricorp/warcdedupe) - WARC deduplication tool (and WARC library) written in Rust. (In Development) +* [warc-safe](https://github.com/natliblux/warc-safe) - Automatic detection of viruses and NSFW content in WARC files. * [WarcPartitioner](https://github.com/helgeho/WarcPartitioner) - Partition (W)ARC Files by MIME Type and Year. *(Stable)* * [warcrefs](https://github.com/arcalex/warcrefs) - Web archive deduplication tools. *Stable* * [webarchive-indexing](https://github.com/ikreymer/webarchive-indexing) - Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.