Introduction Fabric is Microsoft’s recently announced SaaS all-in-one analytics platform. It brings together Azure Data Factory, Azure Synapse Analytics and Power BI into a single cohesive platform without the overhead of setting up resources, maintenance, and configuration. Fabric wouldn’t be an end-to-end data analytics platform without data science, so in this blog we will explore…… Continue reading Exploring Machine Learning in Microsoft Fabric: Should Data Scientists care?
Category: Essential Data Science
How to Fix Different Types of Model Drift
Introduction Model drift refers to the decline of model performance due to changes in data and relationships. Most drift is caused by things entirely out of our control so while we can’t stop it from happening, we can identify and mitigate it. Feature Drift Also known as Data Drift, Feature Drift is the changing of…… Continue reading How to Fix Different Types of Model Drift
Identifying Data Outliers in Apache Spark 3.0
The secret to getting machine learning to work effectively is in ensuring that the data we are using for training is as clean as possible and has any bias removed from it. When working with machine learning, we should be building in a generalised mode and to do this we need to understand what is…… Continue reading Identifying Data Outliers in Apache Spark 3.0
Will Koalas replace PySpark?
One of the first of many big announcements at the 2020 Spark and AI Summit was the official release of Koalas 1.0, the pandas API on top of Apache Spark. This blog will explore how Koalas differs from PySpark. Pandas and Spark To understand what makes Koalas so important, you need to understand the importance…… Continue reading Will Koalas replace PySpark?
Choosing the Right Recommendation Algorithm
Introduction Putting the right products in front of your customers at a time they are motivated to buy increases the bottom line. That could be which product is best placed when a customer is at the till, it could be what song you want to recommend on Spotify, or your next series to binge on…… Continue reading Choosing the Right Recommendation Algorithm