Karpagam JCS ISSN: 2582 – 8525 (Print), 2583 – 3669 (Online)

A Review on Stream Data Analytics Using PySpark and Google Colab

Abstract
Stream analysis is a crucial task in domains such as finance, social media, IoT, and network monitoring. As the volume and velocity of data generated in real time increase, efficient and scalable stream processing frameworks are required. This paper focuses on using Google Colab together with PySpark, the Python API for the Apache Spark distributed computing framework, for stream analysis. We demonstrate how Google Colab, a hosted Jupyter notebook environment, can be combined with PySpark to process and analyze streaming data in real time. We present a comprehensive methodology and experimental results to show the effectiveness and feasibility of this approach.
