Azure Machine Learning Services : Automated Machine Learning -Part 2

In the last post, what is Azure ML Studio and Azure ML Services has been explained briefly.

In this post, I will explain how to use Automated machine learning service.

Automated machine learning (automated ML) picks an algorithm and hyperparameters for you and generates a model ready for deployment.

Automated machine learning helps to identify the relevant features, what algorithms to choose and how to set up the parameters of the algorithms. for instance, in the below picture, for the first round, AutoML chose the first four features, using Nearest neighbors algorithms, and set up the parameters like a number of neighbors.

In the same process, it may choose other features, with different algorithms and parameters.,

finally, it will check which of those algorithms perform better.

In general, Automated machine learning, clean data automatically, feature selection, choose better algorithms, tune model by applying different parameters, find the best model and provide an explanation on each algorithm and provide some accuracy analysis.

In this post, I will show how to do a Prediction.

The AutoML environment in Azure.

1- first log into portal.azure

2- after creating the Azure ML Services component

3- Click on Overview to see the details.

as you can see in the below picture, there is an option to Create a New Automated Machine Learning Model

Click on it, and it navigates you to the Automate Machine Learning Environment, click on Create Experiment

So you can create a machine learning model.

Create a new experiment, if you do not have a Compute, you need first create one that specifies the size of the virtual machine that model is going to run on it.

after creating a new AutoML experiment by assigning a training Compute to it, it navigates you to the data selection phase.

Now, you able to see the experiment name selected Compute name, storage account and so forth.

At the bottom of the page, you able to load your dataset, or if you have your data in blob storage, you can access it.

after uploading the Titanic dataset, you will see the name of the data columns, and able to include or ignore each column for further analysis.

Moreover, you need to identify the prediction task, whether it is a Classification, Regression, Forecasting ..

In this scenario, we are going to predict passenger in the Titanic ship will survive or not.

Next, in the advanced setting, we able to identify accuracy parameters, training job time, the maximum number of iterations, some preprocessing tasks like cross validations process, concurrency of iteration and so forth.

In the next step, you can limit the number of algorithms that are going to apply to your dataset.

this is one of my favorite part of AutoML as you will access to many algorithms as you able to see which on works better for your dataset.

for this scenario, I limit it just to two algorithms to make it faster.

After setting up the AutoML, AutoML will start to create a model that has better performance on data, based on the number of algorithms you choose and, hyperparameter tunning task, it may take a couple of minutes.

Overview of Running Experiment

During the run, you still able to see the list of experiment you have, whether they are running Completed, Failed or other situation by click on the Authoring (Preview)–> Automated Machine Learning.

As you can see in the above picture, there is a chart that shows the run history and it has some legend that shows theri run situation.

in our example, we have one algorithm that has been running.

Now at the bottom of the page, Under Experiment, you able to see the list of the experiment that is created. Click on the recent one.

For this one you able to see at the top of the page, there are an option name Logs

and there is a message showing that Run is Preparing.

Click on the Logs to see the a list showin the recent update and what steps the AutoML is and the list is completing

Moreover, under the runnig page, you able to see a chrat that shows the iteration, which algorithms achive what accuracy level.

Also you can click on each of these algorithms that has been listed and sorted based on their accuracy

Now, the detailed explanation of these algorithms will be shown to you like the accuracy chart and so forth, hopefully in the near future I write some post on it.

or a confuion matrix for classification accuracy analysis

however, if you interested to analyze the other algorithms, you can access them and see the detail of accuracy for each of them.

Leila Etaati

Trainer, Consultant, Mentor

Leila is the first Microsoft AI MVP in New Zealand and Australia, She has Ph.D. in Information System from the University Of Auckland. She is the Co-director and data scientist in RADACAD Company with more than 100 clients in around the world. She is the co-organizer of Microsoft Business Intelligence and Power BI Use group (meetup) in Auckland with more than 1200 members, She is the co-organizer of three main conferences in Auckland: SQL Saturday Auckland (2015 till now) with more than 400 registrations, Difinity (2017 till now) with more than 200 registrations and Global AI Bootcamp 2018. She is a Data Scientist, BI Consultant, Trainer, and Speaker. She is a well-known International Speakers to many conferences such as Microsoft ignite, SQL pass, Data Platform Summit, SQL Saturday, Power BI world Tour and so forth in Europe, USA, Asia, Australia, and New Zealand. She has over ten years’ experience working with databases and software systems. She was involved in many large-scale projects for big-sized companies. She also AI and Data Platform Microsoft MVP. Leila is an active Technical Microsoft AI blogger for RADACAD.