Python Exercises – Level 1
This notebook contains Python exercises to practice as a beginner. These are devised as byte sized mini tasks that you might need to apply...
This notebook contains Python exercises to practice as a beginner. These are devised as byte sized mini tasks that you might need to apply...
6 min
Using Cython, you can speed up existing Python code by an order of 100x or more. This is possible because Cython converts some of...
Let’s see how to cythonize Python code inside Jupyter notebooks step by step. In this post we will see how to: Define and time...
3 min
At its core, KL (Kullback-Leibler) Divergence is a statistical measure that quantifies the dissimilarity between two probability distributions. Think of it like a mathematical...
The Probe method is a highly intuitive approach to feature selection. If a feature in the dataset contains only random numbers, it is not...
Cook’s distance is a measure computed to measure the influence exerted by each observation on the trained model. It is measured by building a...
Z score, also called as standard score, is used to scale the features in a dataset for machine learning model training. It can also...
Z score is one of the most important concepts in statistics. It is also called standard score. Typically it is used to scale the...
Let’s understand what are outliers, how to identify them using IQR and Boxplots and how to treat them if appropriate. 1. What are outliers?...
3 min
Apache PySpark is a powerful big data processing framework, which allows you to process large volumes of data using the Python programming language. PySpark’s...
5 min
Introduction As we continue to generate massive volumes of data every day, the importance of scalable data processing and analysis tools cannot be overstated....
3 min
OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. OpenCV-Python is a Python wrapper for the original...
2 min
Pip is a widely used package manager for Python, allowing you to install and manage Python packages easily. In this blog post, we’ll explore...
Web scraping is the technique of extracting data from a specific website or web page. This has wide applications in: Research and publication purposes...
1. What is the purpose of adding Python to the PATH environment variable? Adding Python to the PATH environment variable in Windows allows you...
6 min
Adaboost is one of the earliest implementations of the boosting algorithm. It forms the base of other boosting algorithms, like gradient boosting and XGBoost....
3 min
numpy.random.randint function is used to get random integers from low to high values. The low value is included while the high value is excluded...
3 min
The np.random.uniform() function is used to create an array with random samples from a uniform probability distribution of given low and high values. random.uniform(low=0.0,...
2 min
The np.sort() function is used to sort the array along a specified axis. Numpy.sort (a, axis=- 1, kind=None, order=None) Purpose: This function is used...
4 min
numpy.median function is used to calculate the median of an array along a specific axis or multiple axes. Median is defined as the middle...