Applied more Analytics on your data with Key Influencer Visualization

In this post, I am going to demonstrate a new visualization that has been released by the AI team in Power BI in recent months.

Before showing off the nice features, there are some key points about this visual.

http://datascientistyuyuria.blogspot.com/2017/06/
  • It can be used by Data sciences, Data engineer and End users. 
    • It is easy enough to use and interpret
    • It consumes lots of algorithms behind the seen to identify the main factor
    • It can be used to align with other customer visuals to create a better visualization

 

  • Able to interpret both Categorical and Numeric variables
  • Provide a great clustering approach: find the natural grouping on data, and then visualize the main top cluster (segment) and also, it shows how a combination of factors affects the metric that you’re analyzing.
  • Interpret the results: the visual provides a brief description of how it works [1]
  • As mentioned before this visual employs a combination of algorithms. In categorial and regression analysis different algorithms have been used.

 

In this post, I am going to use a dataset about the concrete!

Concrete has been used in building the bridge, building and so forth.

The main elements for creating concrete is

  • Cement: A cement is a binder, a substance used for construction that sets, hardens, and adheres to other materials to bind them together.
  • Blast Furnace Slag: stony waste matter separated from metals during the smelting or refining of ore.
  • Fly Ash:  Fly ash or flue ash, also known as pulverized fuel ash in the United Kingdom, is a coal combustion product that is composed of the particulates that are driven out of coal-fired boilers together with the flue gases.
  • Water: The amount of water in concrete controls many fresh and hardened properties in concrete including workability, compressive strengths, permeability and water tightness, durability and weathering, drying shrinkage and potential for cracking [2].
  • Superplasticizer: Superplasticizers, also known as high range water reducers, are chemical admixtures used where well-dispersed particle suspension is required. These polymers are used as dispersants to avoid particle segregation and to improve the flow characteristics of suspensions such as in concrete applications [3].
  • Coarse Aggregate: Coarse aggregate is the portion of the concrete which is made up of the larger stones embedded in the mix. Concrete contains three ingredients; Water, cement, and aggregate. That aggregate is made of fine sand and coarse gravel.
  • Fine Aggregate
  • Age: how many days
  • Concrete compressive strength

 

The dataset also available from here

 

So, let’s start to predict what will be the strength of the concrete regarding other elements such as ashes, water, and so forth.

Question:

I want to answer the following questions:

  • What factors have more impact on the strength of the concrete to decrease or increase and how much?

  • I am interested to see the natural classification of my data.

  • Also interested to see some rules like if the amount of Cement is … and age is … then what is the strength

Let’s Answer these three questions using a brand new visualization named Key Influencer

Get it!

 

The key influencer is not as a preview feature, to access it you need to follow below steps

  • Click on file –> Option and Settings –> then Options –> then click under the Global, click Preview Feature, and you should find the Key Influencer Visual at the bottom
  • However, after all, steps you need to restart the Power BI ( close and open again)
  • Just notice, it a preview feature, some enhancement will be applied on it soon

 

 

Use It!

 

Now you need to import the concert dataset into Power BI Desktop

Get Data –> CSV –> Load

Easy!

Now our plan is to analyze the strength of the concrete, hence click on visual that has been added to the Visualization panel, and for the analyse choose Strength field from Concrete dataset.

You can see in the below picture (number 1 to 4)

However, we see an error there!

Error! “Strenght has more than 10 Unique values!

that means I need to change some part of the setting to make it works!

By default, the Key Influencer visual, set to be work for categorical variable up to 10 distinct values,

However, we can make it work for continues variable

To solve it:

Click on the visual –> then click on the format (number 2 in the picture) choose the Analyse type and select the Continues, here you are! The error will be gone, but we need to add some values to make it work.

 

 

Now you able to see the analysis!

First, the list of the factor has been displayed in the order they have more impact on the strength

There is a number who up beside each factor, that shows if-then analysis for us

For instance, If the cement increases 104 then the strength will increase by 10 in another word it impacts by 10%

Just notice for continues values: we have increased and decrease

Next, You able to see a scatter chart that shows the relationship between cement and strength and a line that shows the positive impact of the cement on strength

Let’s click on the water and see how it impacts on the strength in a negative way, more water the strength going down.

 

Now we going to analyze the next tab Top Segment

Click on the Top segment and you will see that strength is more likely to be High analysis you can change it to Low as well

As you can see in the picture, is able to identify 5 top segments in data

In each of you able to see the average of each segment, and also in the bottom list number of the data point that has been located there

So back to the second question “I am interested to see the natural groping of my data.

 

We are able to see the natural grouping of data there!

If you need more detail, click on each segment and it navigates you to another interesting page!

In this new, you get more detail about the segmentation for each segment

For instance, click on the bigger segment which is the average of strength is 63.99

Then you able to see some explanation of this segment

Like what is the age range, how much cement and water is in this segment and so forth.

To back to the previous page just click on the Close bottom at the top right,

This visual can be used by other Visual in Power BI report and it is available for everyone

 

References

[1]. https://docs.microsoft.com/en-us/power-bi/visuals/power-bi-visualization-influencers

[2]. https://www.forconstructionpros.com/concrete/equipment-products/article/11359846/water-in-concrete

[3] https://en.wikipedia.org/wiki/Superplasticizer

[4] https://www.quora.com/What-is-a-coarse-aggregate

[5] http://archive.ics.uci.edu/ml/machine-learning-databases/concrete/compressive/Concrete_Data.xls

[6] https://www.iconfinder.com/icons/3254948/decision_tree_family_tree_hierarchy_organisational_chart_organizational_chart_icon

Leila Etaati on LinkedinLeila Etaati on TwitterLeila Etaati on Youtube
Leila Etaati
Trainer, Consultant, Mentor
Leila is the first Microsoft AI MVP in New Zealand and Australia, She has Ph.D. in Information System from the University Of Auckland. She is the Co-director and data scientist in RADACAD Company with more than 100 clients in around the world. She is the co-organizer of Microsoft Business Intelligence and Power BI Use group (meetup) in Auckland with more than 1200 members, She is the co-organizer of three main conferences in Auckland: SQL Saturday Auckland (2015 till now) with more than 400 registrations, Difinity (2017 till now) with more than 200 registrations and Global AI Bootcamp 2018. She is a Data Scientist, BI Consultant, Trainer, and Speaker. She is a well-known International Speakers to many conferences such as Microsoft ignite, SQL pass, Data Platform Summit, SQL Saturday, Power BI world Tour and so forth in Europe, USA, Asia, Australia, and New Zealand. She has over ten years’ experience working with databases and software systems. She was involved in many large-scale projects for big-sized companies. She also AI and Data Platform Microsoft MVP. Leila is an active Technical Microsoft AI blogger for RADACAD.

4 thoughts on “Applied more Analytics on your data with Key Influencer Visualization

Leave a Reply