Update Data-Sets.md

OhShINT 2021-10-30 09:06:12 -07:00 committed by GitHub
parent 6b3e4b8364
commit 51aa707a0c


@@ -17,6 +17,14 @@
A censorship measurement platform that collects data using multiple remote measurement techniques in more than 200 countries. Provides reports and makes its raw data sets available for download.
- [DomainsProject](https://dataset.domainsproject.org/)
Massive dataset of over 600 million domains. Total size is ~16 GB.
- [Face Recognition Datasets](https://www.face-rec.org/databases/)
A large collection of face datasets for training facial recognition systems and related tasks.
- [Kaggle](https://www.kaggle.com/datasets)
Offers over 50,000 public datasets covering a wide range of topics.
- [Common Crawl](https://commoncrawl.org/)
An open repository of web crawl data that can be accessed and analyzed by anyone.
- [OCCRP Catalogue of Research Databases](https://id.occrp.org/databases/)
A massive collection of public data sources, compiled by OCCRP researchers, that are most useful for investigative reporting.
## **<u>Government Data Sets</u>**