Update data_engineering.md

Mia von Steinkirch, PhD, MSc 2020-03-02 16:31:50 -08:00 committed by GitHub
parent c297807e45
commit ab30db2f69

@ -3,10 +3,27 @@
### Articles
* [Machine Learning System Design](https://medium.com/@ricomeinl/machine-learning-system-design-f2f4018f2f8).
* [How to Code Neat Machine Learning Pipelines](https://www.neuraxio.com/en/blog/neuraxle/2019/10/26/neat-machine-learning-pipelines.html).
### Courses
* [Coursera's Big Data Pipeline course](https://www.coursera.org/lecture/big-data-integration-processing/big-data-processing-pipelines-c4Wyd).
### Tools
* [Airflow resources](https://github.com/bt3gl/Curated_ETL-and-ML-Pipelines/blob/master/airflow.md).
* [Data science resources](https://github.com/davidyakobovitch/data_science_resources).
* [AWS Data Pipeline samples](https://github.com/aws-samples/data-pipeline-samples/tree/master/samples).
* [Lore data pipelining](https://github.com/instacart/lore).
### Videos
* [Industrial Machine Learning Talk](https://www.youtube.com/watch?v=3JYDT8lap5U).
### Enterprise Solutions
@ -29,3 +46,6 @@
* [Remind data pipeline](https://medium.com/@RemindEng/beyond-a-redshift-centric-data-model-1e5c2b542442).
* [Wish data pipeline](https://medium.com/wish-engineering/scaling-analytics-at-wish-619eacb97d16).
* [Databricks data pipeline](https://databricks.com/blog/2017/03/31/delivering-personalized-shopping-experience-apache-spark-databricks.html).