Krzywa LIFT

Tutorial: How to establish quality and correctness of classification models? Part 5 – Lift curve

In this part of the tutorial you will learn more about definition and types of lift curves, accumulated LIFT with percentage scale how to construct a LIFT curve. You will gain more information about the accumulated LIFT with percentage scale and other types of LIFT curves.

Learn more
ROC

How to assess quality and correctness of classification models? Part 4 – ROC Curve

The ROC curve is one of the methods for visualizing classification quality, which shows the dependency between TPR (True Positive Rate) and FPR (False Positive Rate).

Learn more
Confusion Matrix - Macierz Błędów

Tutorial: How to establish quality and correctness of classification models? Part 3 – Confusion Matrix

Confusion Matrix is an N x N matrix, in which rows correspond to correct decision classes and the columns to decisions made by the classifier. The number ni,j at the intersection of i-th row and j-th column is equal to the number of cases from the i-th class which have been classified as belonging to the j-th class.

Learn more

How to determine the quality and correctness of classification models? Part 2 – Quantitative quality indicators

In this part of tutorial we will discuss derived quality indicators and show how to select the appropriate indicator using as an example churn analysis.

Learn more
Quality and correctness of classification models

Tutorial: How to determine the quality and correctness of classification models? Part 1 – Introduction

What is classification?

Classification is the process of assigning every object from a collection to exactly one class from a known set of classes.

Learn more
Data Quality Projects

What is data quality all about and how to run a data cleaning project?

Are you considering carrying out or outsourcing a data cleaning project? Find out what our experience tells us about this types of analyses.

Learn more

Understanding machine learning #3: Confusion matrix – not all errors are equal

One of the most typical tasks in machine learning is classification tasks. It may seem that evaluating the effectiveness of such a model is easy. Let’s assume that we have a model which, based on historical data, calculates if a client will pay back credit obligations. 

Learn more
Machine learning

Understanding machine learning #2: Do we need machine learning at all?

In the previous post of our Understanding machine learning series, we presented how machines learn through multiple experiences. We also explained how, in some cases, human beings are much better at interpreting data than machines. 

Learn more
Machine Learning

Understanding Machine Learning #1 – How machines learn?

“If (there) was one thing all people took for granted, (it) was conviction that if you feed honest figures into a computer, honest figures (will) come out. Never doubted it myself till I met a computer with a sense of humor.” ― Robert A. Heinlein, The Moon is a Harsh Mistress

Learn more
Data mart

Analytical Data Marts – data analysts’ indispensable tool

Information about provided services, customers and transactions can be stored in different database systems and data warehouses, depending on the way in which a company operates.

Due to such arrangements, even the simplest analyses or report may require significant expenditures of time, as well as in-depth knowledge about database systems and their availability.

Learn more

Correlation does not imply causation

A popular phrase tossed around when we talk about statistical data is “there is correlation between variables”. However, many people wrongly consider this to be the equivalent of “there is causation between variables”. It’s important to explain the distinction:

Learn more

Predictive Analytics glossary

As Predictive Analytics (also called Data Mining or Data Science) is gaining momentum and spreading across companies and sectors, we have created a short guide to some common terms in this field. We hope you like it!

Learn more