❗The content presented here is sourced directly from Udemy platform. For comprehensive course details, including enrollment information, simply click on the 'Go to class' link on our website.
Updated in [July 25th, 2023]
This course provides an introduction to Real-time Credit card Fraud Detection using Spark, Kafka and Cassandra. Students will learn how to use Spark ML Pipeline Stages such as String Indexer, One Hot Encoder and Vector Assembler for pre-processing. The Random Forest Algorithm will be used to create a Machine Learning model, and K-means Algorithm will be used for data balancing. Additionally, students will learn how to integrate a Spark Streaming Job with Kafka and Cassandra, and how to achieve exactly-once semantics using Spark Streaming custom offset management. Finally, students will be introduced to the Airflow Automation framework, which can be used to automate Spark Jobs on a Spark Standalone Cluster.
Course Syllabus
Introduction
Real-time Fraud Detection Demonstration
Code Walkthrough