Hey folks! If you are new to stream processing with Spark, or have used it several times but want a better understanding of Spark Structured Streaming, then this article is for you!

Before discussing how to process stream data in Spark, let’s first understand what stream data processing is and how it differs from batch data processing. If you are already familiar with these concepts, please skip ahead.

Stream data processing

Stream data processing means processing data as and when it arrives, so that decisions can be made in near real time. Examples include detecting fraud as it happens, or detecting a faulty server by analyzing its error rate.

How does stream processing differ from batch processing?

Batch data processing is the processing of data accumulated over a period of time. These are the usual flows/jobs that you have running at, say, a daily, weekly, or twice-a-day frequency. No matter when the data arrives, it is always processed at fixed, predefined intervals. Another difference ...