Exploring Machine Learning in Microsoft Fabric: Should Data Scientists care?

Fabric is Microsoft’s recently announced all-in-one SaaS analytics platform. It brings together Azure Data Factory, Azure Synapse Analytics and Power BI into a single cohesive platform, without the overhead of setting up, maintaining, and configuring resources. Fabric wouldn’t be an end-to-end data analytics platform without data science, so in this blog we will explore…

How to Fix Different Types of Model Drift

Model drift refers to the decline of model performance due to changes in data and relationships. Most drift is caused by things entirely out of our control, so while we can’t stop it from happening, we can identify and mitigate it. Also known as Data Drift, Feature Drift is the changing of…

Identifying Data Outliers in Apache Spark 3.0

The secret to getting machine learning to work effectively is ensuring that the data we use for training is as clean as possible and has any bias removed from it. When working with machine learning, we should be building a generalised model, and to do this we need to understand what is…

Choosing the Right Recommendation Algorithm

Putting the right products in front of your customers at a time they are motivated to buy increases the bottom line. That could be which product is best placed when a customer is at the till, which song to recommend on Spotify, or your next series to binge on…