Cloud Transformation
5 best practices for deploying ML models
In our previous article, "5 Challenges to be prepared for while scaling ML models", we discussed the top five challenges in…
5 challenges of scaling Machine Learning models
Machine learning on big data has opened the door to new opportunities to achieve business goals. It facilitates better ML modeling including…
Real-time data warehousing with Apache Spark and Delta Lake
Financial institutions globally deal with massive data volumes that call for large-scale data warehouses and effective processing of real-time transactions. In this…
Containerization of PySpark using Kubernetes
Containerization technology is widely used by data scientists and machine learning practitioners to promote the continuous deployment of models and test the…
COVID-19 Scenario in India: A Data Scientist’s Perspective (Part 2)
The initial effect of COVID-19 in India has been relatively low in comparison to other countries that we’ve analyzed in the previous…
Data analytics for CPG in the COVID era
As of June 23, 2020, there are 9.2 million confirmed cases of COVID-19 worldwide and 474,998 fatalities. When the virus started…
What’s all the fuss about Quantum Computing and Quantum Supremacy?
“Quantum computers will soon outperform classical machines.” Quantum computing and quantum programming are demonstrating the power to outperform classical computers, which otherwise…
Apache Spark on DataProc vs Google BigQuery
When it comes to Big Data infrastructure on Google Cloud Platform, the most popular choices among data architects today are Google…