From 707381a7c032b37656c9a41c05abc4a515122ba1 Mon Sep 17 00:00:00 2001
From: "Mia von Steinkirch, PhD, MSc" <1130416+bt3gl@users.noreply.github.com>
Date: Mon, 2 Mar 2020 16:23:09 -0800
Subject: [PATCH] Update README.md

---
 README.md | 60 ++++--------------------------------------------------
 1 file changed, 4 insertions(+), 56 deletions(-)

diff --git a/README.md b/README.md
index 7b98f1c..565d3e3 100644
--- a/README.md
+++ b/README.md
@@ -9,68 +9,16 @@ the model.

 # Resources in this repository

-### Machine Learning Science
+### Machine Learning

-* [Deep Learning](https://github.com/bt3gl/Curated_ETL-and-ML-Pipelines/blob/master/deep_learning_resources.md).
+* [Machine Learning resources](https://github.com/bt3gl/Curated_ETL_and_ML_Pipelines/tree/master/machine_learning_examples).
+* [Deep Learning resources](https://github.com/bt3gl/Curated_ETL-and-ML-Pipelines/blob/master/deep_learning_resources.md).

 ### Machine Learning Infrastructure

+* [Data Engineering resources](https://github.com/bt3gl/Curated_ETL_and_ML_Pipelines/blob/master/data_engineering.md).
 * [Airflow](https://github.com/bt3gl/Curated_ETL-and-ML-Pipelines/blob/master/airflow.md).

------
-
-# External Resources
-
-### Tools & Code Samples
-
-* [Data science resources](https://github.com/davidyakobovitch/data_science_resources).
-* [AWS Data pipeline samples](https://github.com/aws-samples/data-pipeline-samples/tree/master/samples).
-
-
-### Lorte
-
-* [Lorte data pipelining](https://github.com/instacart/lore).
-
-### MOOCs
-
-#### General Pipelines
-* [Coursera's Big Data Pipeline course](https://www.coursera.org/lecture/big-data-integration-processing/big-data-processing-pipelines-c4Wyd).
-
-
-### Tutorials & Articles
-
-#### 2019
-
-* [How to Code Neat Machine Learning Pipelines](https://www.neuraxio.com/en/blog/neuraxle/2019/10/26/neat-machine-learning-pipelines.html).
-
-
-### Enterprise Solutions
-
-* [Netflix data pipeline](https://medium.com/netflix-techblog/evolution-of-the-netflix-data-pipeline-da246ca36905).
-* [Netlix data videos](https://www.youtube.com/channel/UC00QATOrSH4K2uOljTnnaKw).
-* [Yelp data pipeline](https://engineeringblog.yelp.com/2016/07/billions-of-messages-a-day-yelps-real-time-data-pipeline.html).
-* [Gusto data pipeline](https://engineering.gusto.com/building-a-data-informed-culture/).
-* [500px data pipeline](https://medium.com/@samson_hu/building-analytics-at-500px-92e9a7005c83.)
-* [Twitter data pipeline](https://blog.twitter.com/engineering/en_us/topics/insights/2018/ml-workflows.html).
-* [Coursera data pipeline](https://medium.com/@zhaojunzhang/building-data-infrastructure-in-coursera-15441ebe18c2).
-* [Cloudfare data pipeline](https://blog.cloudflare.com/how-cloudflare-analyzes-1m-dns-queries-per-second/).
-* [Pandora data pipeline](https://engineering.pandora.com/apache-airflow-at-pandora-1d7a844d68ee).
-* [Heroku data pipeline](https://medium.com/@damesavram/running-airflow-on-heroku-ed1d28f8013d).
-* [Zillow data pipeline](https://www.zillow.com/data-science/airflow-at-zillow/).
-* [Airbnb data pipeline](https://medium.com/airbnb-engineering/https-medium-com-jonathan-parks-scaling-erf-23fd17c91166).
-* [Walmart data pipeline](https://medium.com/walmartlabs/how-we-built-a-data-pipeline-with-lambda-architecture-using-spark-spark-streaming-9d3b4b4555d3).
-* [Robinwood data pipeline](https://robinhood.engineering/why-robinhood-uses-airflow-aed13a9a90c8).
-* [Lyft data pipeline](https://eng.lyft.com/running-apache-airflow-at-lyft-6e53bb8fccff).
-* [Slack data pipeline](https://speakerdeck.com/vananth22/operating-data-pipeline-with-airflow-at-slack).
-* [Remind data pipeline](https://medium.com/@RemindEng/beyond-a-redshift-centric-data-model-1e5c2b542442).
-* [Wish data pipeline](https://medium.com/wish-engineering/scaling-analytics-at-wish-619eacb97d16).
-* [Databrick data pipeline](https://databricks.com/blog/2017/03/31/delivering-personalized-shopping-experience-apache-spark-databricks.html).
-
-
-### Talks
-
-* [Industrial Machine Learning Talk](https://www.youtube.com/watch?v=3JYDT8lap5U).
-
 ---

 Buy Me A Coffee