Menu

Selva Prabhakaran

Selva is an experienced Data Scientist and leader, specializing in executing AI projects for large companies. Selva started machinelearningplus to make Data Science / ML / AI accessible to everyone. The website enjoys 4 Million+ readership. His courses, lessons, and videos are loved by hundreds of thousands of students and practitioners.

conda vs miniconda vs anaconda

Conda create environment and everything you need to know to manage conda virtual environment

Typical python projects uses multiple packages for various tasks. And some of the packages are shared between projects as well. Sharing same packages between projects can cause problems. How? When you update one of the packages used in a project, it might cause compatibility issues in the other packages that use it. On upgrading, it […]

Conda create environment and everything you need to know to manage conda virtual environment Read More »

ARIMA Model – Complete Guide to Time Series Forecasting in Python

Using ARIMA model, you can forecast a time series using the series past values. In this post, we build an optimal ARIMA model from scratch and extend it to Seasonal ARIMA (SARIMA) and SARIMAX models. You will also see how to build autoarima models in python ARIMA Model – Time Series Forecasting. Photo by Cerquiera

ARIMA Model – Complete Guide to Time Series Forecasting in Python Read More »

Standard Error in Statistics – Understanding the concept, formula and how to calculate

Standard error of the mean measures how spread out the means of the sample can be from the actual population mean. Standard error allows you to build a relationship between a sample statistic (computed from a smaller sample of the population) and the population’s actual parameter. Standard Error – A practical guide with examples. Photo

Standard Error in Statistics – Understanding the concept, formula and how to calculate Read More »

Confidence Interval in Statistics – Formula and Mathematical Calculation

Confidence interval is a measure to quantify the uncertainty in an estimated statistic (like the mean) when the true population parameter is unknown. Training Custom Text Classification Model in spaCy. Photo by Jessica Wong. You will know 1. What is Confidence Interval? 2. Two types of Confidence Intervals problems 3. Difference between Population parameter vs

Confidence Interval in Statistics – Formula and Mathematical Calculation Read More »

T Test (Students T Test) – Understanding the math and how it works

T Test (Students T Test) is a statistical significance test that is used to compare the means of two groups and determine if the difference in means is statistically significant. In this one, you’ll understand when to use the T-Test, the different types of T-Test, math behind it, how to determine which test to choose

T Test (Students T Test) – Understanding the math and how it works Read More »

p-value Intuition

What is P-Value? – Understanding the meaning, math and methods

P Value is a probability score that is used in statistical tests to establish the statistical significance of an observed effect. Though p-values are commonly used, the definition and meaning is often not very clear even to experienced Statisticians and Data Scientists. In this post I will attempt to explain the intuition behind p-value as

What is P-Value? – Understanding the meaning, math and methods Read More »

vector autoregression model feature image

Vector Autoregression (VAR) – Comprehensive Guide with Examples in Python

Vector Autoregression (VAR) is a forecasting algorithm that can be used when two or more time series influence each other. That is, the relationship between the time series involved is bi-directional. In this post, we will see the concepts, intuition behind VAR models and see a comprehensive and correct method to train and forecast VAR

Vector Autoregression (VAR) – Comprehensive Guide with Examples in Python Read More »

Mahalanobis Distance Feature

Mahalanobis Distance – Understanding the math with examples (python)

Mahalanobis distance is an effective multivariate distance metric that measures the distance between a point and a distribution. It is an extremely useful metric having, excellent applications in multivariate anomaly detection, classification on highly imbalanced datasets and one-class classification. This post explains the intuition and the math with practical examples on three machine learning use

Mahalanobis Distance – Understanding the math with examples (python) Read More »

Principal Component Analysis – How PCA algorithms works, the concept, math and implementation

Principal Components Analysis (PCA) is an algorithm to transform the columns of a dataset into a new set of features called Principal Components. By doing this, a large chunk of the information across the full dataset is effectively compressed in fewer feature columns. This enables dimensionality reduction and ability to visualize the separation of classes

Principal Component Analysis – How PCA algorithms works, the concept, math and implementation Read More »

Logging in Python

Python Logging – Simplest Guide with Full Code and Examples

The logging module lets you track events when your code runs so that when the code crashes you can check the logs and identify what caused it. Log messages have a built-in hierarchy – starting from debugging, informational, warnings, error and critical messages. You can include traceback information as well. It is designed for small

Python Logging – Simplest Guide with Full Code and Examples Read More »

Matplotlib histogram feature

Matplotlib Histogram – How to Visualize Distributions in Python

Matplotlib histogram is used to visualize the frequency distribution of numeric array by splitting it to small equal-sized bins. In this article, we explore practical techniques that are extremely useful in your initial data analysis and plotting. Content What is a histogram? How to plot a basic histogram in python? Histogram grouped by categories in

Matplotlib Histogram – How to Visualize Distributions in Python Read More »

Time Series Analysis

Time Series Analysis in Python – A Comprehensive Guide with Examples

Time series is a sequence of observations recorded at regular time intervals. This guide walks you through the process of analyzing the characteristics of a given time series in python. Time Series Analysis in Python – A Comprehensive Guide. Photo by Daniel Ferrandiz. Contents What is a Time Series? How to import Time Series in

Time Series Analysis in Python – A Comprehensive Guide with Examples Read More »

Matplotlib Tutorial

Matplotlib Tutorial – A Complete Guide to Python Plot with Examples

The goal of this tutorial is to make you understand ‘how plotting with matplotlib works’ and make you comfortable to build full-featured plots with matplotlib. 2. A Basic Scatterplot The following piece of code is found in pretty much any python code that has matplotlib plots. import matplotlib.pyplot as plt %matplotlib inline matplotlib.pyplot is usually

Matplotlib Tutorial – A Complete Guide to Python Plot with Examples Read More »

Scroll to Top
Course Preview

Machine Learning A-Z™: Hands-On Python & R In Data Science

Free Sample Videos:

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Scroll to Top