Guido CasiraghiHow to provision an EMR cluster using On-Demand and Spot EC2 InstancesIn this tutorial I’m going to explain how to provision an Amazon EMR cluster using EC2 Spot Instances.Sep 3, 2021Sep 3, 2021
Guido CasiraghiRead data with Apache Spark from KafkaIn this brief tutorial I’m going to explain how to use Kafka as a data source for Spark.Aug 5, 2021Aug 5, 2021
Guido CasiraghiRun a Flask application as a Docker containerIn this post you will learn how to run a Python Flask application as a Docker container.Apr 9, 2021Apr 9, 2021
Guido CasiraghiMonitor your ML model in Apache Spark with DeequOnce you have built and trained your ML model, you have to maintain it in production. This great post provides a detailed overview of many…Feb 26, 2021Feb 26, 2021
Guido CasiraghiinAnalytics VidhyaHow to setup a Kafka cluster with Docker on Amazon EC2In this example we are going to setup a Kafka cluster of two nodes using Amazon EC2 instances. However, even if some configuration details…Feb 12, 2021Feb 12, 2021
Guido CasiraghiHow to compile OpenCV with SIFT and SURF support, and do feature extraction in JavaThis is a quick introduction to image feature extraction with OpenCV and JavaJan 29, 2021Jan 29, 2021
Guido CasiraghiHow to setup GitHub PagesThis is a short tutorial to help you setup your personal site on GitHub Pages as fast as possible.Jan 19, 2021Jan 19, 2021
Guido CasiraghiinAnalytics VidhyaPublish CSV reports using Plotly Dash, Pandas and SQLAlchemyPlotly Dash is a great tool to build Web-based dashboards in Python. It is a great choice for simple interactive charts as well for more…Sep 10, 2020Sep 10, 2020
Guido CasiraghiinAnalytics VidhyaHow to export a Plotly chart as HTMLIn this post, I will show how to create charts in Python using the Plotly library, by exporting them as plain HTML files.Sep 1, 20201Sep 1, 20201
Guido CasiraghiinBetter ProgrammingAnalyze your iCloud health data with PandasHow to analyze data from the iPhone Health app using Python and PandasJul 18, 20192Jul 18, 20192