Cloud Transformation
Customer Data Platform (CDP): A need or necessity?
Consumers today create a heap of data and digital footprints than ever before. Starting from geographical, transactional or behavioral data, to data…
Apache Spark for real-time analytics
Apache Spark is the hottest analytical engine in the world of Big Data and Data Engineering. Apache Spark architecture is largely used…
Why B2B marketers need interactive analytics platform?
Your B2B marketing campaigns and programs generate a huge amount of data and dashboards are perhaps the best way to visualize, understand,…
Why Apache Arrow is the future for open source columnar
Apache Arrow is an example of open source technology and is a de-facto standard for columnar in-memory analytics. Engineers from across the…
Spark Streaming in production & how it works
This is our next blog in the series of blogs about Spark Streaming. After talking about Spark Streaming and how it works,…
Fault tolerant stream processing with Spark Streaming
After a look at how Spark Streaming works, and discussing good production practices for Spark Stream processing, this blog talks about making…
Fault tolerant streaming workflows with Apache Mesos
Mesos High Availability Cluster Apache Mesos is a high availability cluster operating system as it has several masters, with one Leader. The…
Getting data into Spark Streaming
In the previous blog post we talked about an overview of Spark Streaming, and now let us take a look on different…
Overview of Spark Architecture & Spark Streaming
Spark Streaming is designed to provide window-based stream processing and stateful stream processing for any real-time analytics application. Spark applications allows users…