Written by: Lukasz Cmielowski, PhD, Thomas Parnell & Daniel Ryszka

Snap machine learning

A library that provides high-speed training and inference of popular machine learning models on modern CPU/GPU computing systems. Snap ML has been designed to address some of the biggest challenges that companies and practitioners face when applying machine learning to real use cases. These challenges are illustrated in the graphics below.


IBM Watson AutoAI has recently introduced a new beta feature: time series support. It is as easy as a walk in the park: all you need to do is drag and drop your time series data, and then sit back and relax while the best model is prepared for you.

In this story I will show how easily the IBM AutoAI Python API can be applied to COVID-19 data to predict confirmed cases for the next few days.

Setup

To work with AutoAI for time series, one needs a Watson Machine Learning service instance (included with the…


Written by: Lukasz Cmielowski, PhD, Yair Schiff & Przemyslaw Czuba

Imagine you are at your favorite restaurant. You just finished eating your favorite dish. Yes, exactly, the one with the secret sauce. Suddenly you look up from your empty plate and see the chef standing next to your table. She tells you to please follow her. As you make your way towards the back of the restaurant the anticipation grows. Finally the chef swings open the doors to the kitchen, waves her hand for you to come in, and says “let me show you how the secret sauce is made!”


Written by: Lukasz Cmielowski, PhD, Jan Wasilewski, Patryk Wielopolski, Robert Benke

Hackathon

I had the pleasure of being on the jury for this year’s edition of the “Ustawka 2020” hackathon for students in Poland. The hackathon was organised by IBM Poland and the University of Warsaw. The topic for this year’s edition was the prediction of COVID-19 daily confirmed cases, deaths and recoveries. The predictions had to be made for each day in a 14-day period (June 15–26th).

Fifteen teams representing different universities from across Poland took part in the competition. I have asked highly ranked teams to share their solutions in this and…


Is it possible to download a Watson AutoAI trained model and use it outside the Watson Studio ecosystem? The answer is YES.

This short story describes in detail how to download an AutoAI trained model and use it in a third-party environment (local machine, cloud service, etc.).

Getting the model

The easiest way to download a trained pipeline model is to use the Python SDK and the autogenerated notebook. From the drop-down menu next to the selected pipeline model, click “Save as Notebook”.

Figure 1. Save as Notebook option.

The notebook can be run either in the Watson Studio runtime or, after downloading it, on any other notebook server. The notebook automatically installs all required dependencies:

  • xgboost
  • lightgbm
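When the downloaded notebook or model is moved to a local machine, it can be handy to confirm that these packages are present before loading anything. The sketch below is only an illustration of that check and is not part of the autogenerated notebook; the helper name `missing_packages` is hypothetical, and the package list contains only the two names shown above.

```python
import importlib.util

# Packages named in the list above; the real notebook may install more.
REQUIRED = ["xgboost", "lightgbm"]


def missing_packages(names):
    """Return the names that cannot be found in the current environment.

    find_spec only locates a top-level package; it does not import it.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Install before loading the model:", ", ".join(missing))
    else:
        print("All listed dependencies are available.")
```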


As a continuation of our comparative analysis of epidemic models, we want to examine one more regression model, created by Watson Studio AutoAI. We will be using the new Python API to define and trigger an AutoAI experiment. The Jupyter notebook with all steps can be found here.

COVID-19 data

We are using the COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University.

The working environment is a Watson Studio notebook runtime (a free plan is offered). In the first step we load the confirmed cases data into a pandas DataFrame for further preprocessing.
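The CSSE confirmed-cases file is laid out wide, with one row per region and one column per date. The sketch below shows the reshaping that such preprocessing typically starts with, using only the standard library rather than pandas so that it runs anywhere. The function name `confirmed_series` is hypothetical, and the sample rows ("Country A", "Country B" and their counts) are made up purely to show the layout; they are not real case numbers.

```python
import csv
import io
from datetime import datetime

# Made-up rows in the wide CSSE layout: four metadata columns, then one
# column per date. The values are illustrative, not real data.
SAMPLE = """Province/State,Country/Region,Lat,Long,1/22/20,1/23/20,1/24/20
North,Country A,0.0,0.0,1,2,4
South,Country A,0.0,0.0,0,1,3
,Country B,0.0,0.0,5,5,6
"""

META_COLUMNS = {"Province/State", "Country/Region", "Lat", "Long"}


def confirmed_series(csv_text, country):
    """Return a date-sorted list of (date, cumulative cases) for one country.

    Rows for several provinces of the same country are summed per date.
    """
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["Country/Region"] != country:
            continue
        for column, value in row.items():
            if column in META_COLUMNS:
                continue
            day = datetime.strptime(column, "%m/%d/%y").date()
            totals[day] = totals.get(day, 0) + int(value)
    return sorted(totals.items())


if __name__ == "__main__":
    for day, cases in confirmed_series(SAMPLE, "Country A"):
        print(day.isoformat(), cases)
```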


Some time ago I wrote a story about how to predict incorrect bug fixes. The full story, “Adoption of machine learning to software failure prediction”, can be found here. Long story short: we adopted a binary classification algorithm to predict whether a bug had been fixed correctly or incorrectly. Code change sets predicted as incorrect required the attention of the development (QA) team. Those predicted as correctly fixed were closed automatically. The adopted solution required data science knowledge.

Data science expertise was required to:

  • Preprocess data coming from source code and bug repositories like GitHub,
  • Build machine learning pipeline using Spark…


Written by Lukasz Cmielowski, Alexandre Quemy, Rafal Bigaj & Wojciech Sobala

In this story, we would like to share our recent experience of building, serving and integrating COVID-19 models using IBM Cloud.

Building epidemic models

For the purpose of this experiment, we have tried several epidemic models. Since we are located in Poland, we have adapted all tested models to the COVID-19 situation and data in Poland. However, the examples shared below can be easily adapted to other locations and data sources by simply passing the country name and other input parameters.

We have evaluated SIR, logistic, double-exponential and Weibull models to predict the number of…
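As an illustration of one of the model families named above, here is a minimal sketch of a logistic growth curve for cumulative cases. It is the standard textbook form, not the authors' fitted model; the function name and the parameter values (`capacity`, `growth_rate`, `midpoint`) are assumptions chosen only to show the shape of the curve.

```python
import math


def logistic_cases(t, capacity, growth_rate, midpoint):
    """Cumulative cases on day t under a logistic growth curve.

    capacity    -- final size of the outbreak (upper asymptote)
    growth_rate -- steepness of the curve
    midpoint    -- day on which half of the capacity is reached
    """
    return capacity / (1.0 + math.exp(-growth_rate * (t - midpoint)))


if __name__ == "__main__":
    # Hypothetical parameters, for illustration only.
    for day in range(0, 61, 10):
        print(day, round(logistic_cases(day, capacity=10_000, growth_rate=0.2, midpoint=30)))
```

In practice the three parameters would be estimated from observed data, for example with a least-squares fit, before the curve is used for prediction.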


Written by Lukasz Cmielowski & Rafal Bigaj

Would you like to know how the artificial intelligence (AI) investment you made is influencing your business?

Is it worth investing further?

What AI improvements should be made to accelerate your business?

It is extremely hard to measure the impact of AI on the business. It is even harder to find AI issues that may drive your business indicators down. And the hardest problem is to predict the impact of a fix on a business indicator before the fix is even made.

Watson OpenScale and application monitor

The ability to track key performance indicators (KPIs) in the context of AI system health is one of the new Watson OpenScale components.

Business application monitor:

  • tracks AI characteristics and business key performance indicators (KPIs) at runtime
  • informs about AI impact on business KPIs and…


Watson OpenScale in action

What is Watson OpenScale?

Watson OpenScale tracks and measures outcomes from AI across its lifecycle, and adapts and governs AI to changing business situations — for models built and running anywhere. You can read more details here.

OpenScale and 3rd party engines integration points

Watson OpenScale integrates with external ML serving engines (Azure ML Studio, Amazon SageMaker, etc.) in the following way:

  1. Engine binding level: the ability to list deployments and get deployment details
  2. Deployment subscription level: OpenScale needs to be able to score the subscribed deployment in the required format (a WML-like format) and receive the output in the same compatible format (it needs to understand both input and output)
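To make the second point more concrete, the sketch below converts a list of records into a payload with `fields` and `values` keys. That layout is an assumption about what "WML-like format" means here; the text above does not spell it out, so the exact schema should be checked against the OpenScale documentation. The helper name `to_fields_values` and the sample feature names are hypothetical.

```python
import json


def to_fields_values(records):
    """Convert a list of dict records into a {"fields", "values"} payload.

    Column order is taken from the first record; every record is expected
    to contain the same keys.
    """
    if not records:
        return {"fields": [], "values": []}
    fields = list(records[0].keys())
    values = [[record[name] for name in fields] for record in records]
    return {"fields": fields, "values": values}


if __name__ == "__main__":
    # Hypothetical feature names and values, for illustration only.
    rows = [{"age": 30, "income": 5}, {"income": 7, "age": 41}]
    print(json.dumps(to_fields_values(rows)))
```

A wrapper around a third-party engine would apply the same conversion in reverse to the engine's response, so that both request and response stay in one layout.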

Lukasz Cmielowski, PhD

Automation architect and data scientist at IBM Krakow Software Lab. Currently working on Watson Machine Learning cloud offering.
