Skip to main content

All about Sankey Charts

Let's consider a situation- you have 4 fields and you want to depict the flow with a help of visuals. The first thought can be to use a table or matrix in Power BI. It can depict the data but how about the intuitiveness? The solution for such scenarios is Sankey Charts which can help you to show different fields and the flow associated with them.

As a beginner in the BI world anyone can question- Is there a need for a Sankey chart when you already have a pool of options? Yes, we do need such charts if you are trying to visualize a process with respect to a particular field.

Protip- Sankey Charts are not available in your default library. You need to get it from the online library and keep in mind to only add certified visuals.

So, enough of the introduction let's showcase how the Sankey chart works. I am considering the SampleSuperstore dataset. The use case for the first visual is we are going to show the flow between Region and Category. The flow size will be governed by the amount of profit (putting the profit in the weight).


We have put the region in the source and the category in the destination. If you select any of the regions the flow will get highlighted as in the above-mentioned image. You can see the profit associated with different categories.

But still the visual doesn't look so intuitive. One of the reasons is the same or default color scheme of the flow. To change the color of the flows go to formatting of the visual and under the data link labels you can see the links. You can pick the color of your choice.


Protip- If you critically observe the above-mentioned image you can see I have picked the same color with different shades for a particular region. It just makes the visual more tidy and easy to spot a particular region.

Let's take a step forward and see the interactivity of such charts. We are creating two charts one is the Sankey chart with source as segment and destination as a category. Profit is in the weight. For another visual, we are considering profit and sales with every subcategory.


Now, if I select the flow Corporate to Technology in the Sankey chart then it will filter the clustered bar chart as well. 


Isn't it interesting? As a BI analyst, I always keep the Sankey chart in my visual arsenal but it has a few limitations too. It can be cumbersome if either the source or the destination has more than 6-7 values. Consider you have 10 categories and 10 segments. In such cases, you can't decipher every flow in the visual. Also, what if every flow size has values of the same range. For instance, if the profit values are between 200k and 250k for all segments and categories then the visual becomes counter-intuitive in nature. 


Thanks for Reading  Let's connect on  LinkedIn. For more such blogs do follow us.

Comments

Popular posts from this blog

Ultimate Beginners Guide to DAX Studio

There are zillions of external tools available with Power BI but DAX Studio is one of the most commonly used tools to work with DAX queries. It is a perfect tool to optimize the DAX and the data model. In this blog let's shed some light on the basic functionalities that can take your report to the next level. ARE YOU READY?  To start you will need the latest version of the DAX Studio. You can download it from their website . Don't worry you don't have to pay for the license. Fortunately, DAX Studio is a free tool As a BI Developer, I am using DAX Studio regularly. Based on my experience I use it for several purposes but in this blog, I will highlight the most common ones. Extracting a dump of all the measures used in your PBIX. Why do we need to do this? It can be used for documentation purposes also sometimes we try to reuse the DAX and such a dump comes in handy in this scenario. How to achieve it? Open the DAX Studio it is located under the external tools once you open t

Identify and Delete Unused Columns & Measures

Heavy dashboards and a bad data model is a nightmare for every BI Developer. Heavy dashboards can be slow due to multiple reasons. It is always advised to stick with best practices. Are you still figuring out about those best practices then you should definitely have a quick read on Best Practice Analyser ( link ). One of the most common issues with slow dashboards is unused columns and unused measures.  It is very normal to load some extra columns and create some test measures in your dashboard but as a part of cleanup process those unused columns and unused measures should be removed. Why we are removing them? Because if you keep them then ultimately it will increase the size of your data model which is not a good practice.  How to identify the culprits (unused columns and unused measures)? In today's blog we will provide you with 2 most common external tools which will help you in identifying the culprits. More external tools😒. Who's going to pay for this? To your surprise

Best Practice Analyser (BPA) Guide

Do you want to save tons of efforts to check if your data model and PBIX file follows the standard best practices and norms? Then this blog is for you. If you are a follower of our channel we already deep dive into the importance of the DAX Studio as an external tool. If you are a beginner I would highly recommend to visit this blog . In today's blog we will check how Tabular Editor can help to optimize the data model.  Best Practice Analyser allows to define or import best practices. It will make sure that we do not violate the best practices while developing a dashboard. Isn't it exciting!! Before we start make sure you already have Tabular Editor version 2.24.1 installed on your system. To install it do visit this link and select the link for windows installer. Once Tabular Editor is installed it will reflect in your PBIX file under external tool. Also, we need to define the standard rules. To do so in your advanced scripting or C# script copy this and save it via Ctrl+S. An