Learn about the NumPy library in Python (Part 1)

Thursday, 17/12/2020

Tram Ho

Recently I had the opportunity to work with Python on a corporate health project. After some time learning and using it, I realized how powerful the language is. Much of that power comes from its versatility: it can handle almost any kind of programming, from web and app development to games. What makes Python famous, though, is probably its strength in data science, data analysis, and building artificial intelligence. Another reason for its popularity is its extremely large ecosystem of supporting libraries, so large that choosing the right library is sometimes a decision in itself.

Among the huge number of libraries available for Python, one is close to mandatory for any programmer who works with the language: NumPy. In today's post I would like to introduce the NumPy library and show how to use it.

1. Introduction to the numpy library

NumPy (Numerical Python) is a very popular and powerful math library for Python. It provides optimized functions for working efficiently with arrays and matrices, especially large ones, and processes them much faster than pure Python code can.

If you want to work seriously in data science, you need to know NumPy well. It is one of the most useful Python libraries, especially if you work with numerical data. Since much of data science and machine learning revolves around statistics, hands-on practice with it becomes all the more important.

The ancestor of NumPy, Numeric, was originally created by Jim Hugunin. Another package, Numarray, was also developed, with some additional functionality. In 2005, Travis Oliphant created the NumPy package by incorporating the features of Numarray into Numeric.
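To make the difference between pure Python and NumPy concrete, here is a minimal sketch of the same computation written both ways. The variable names and the array size are my own choices for illustration, and I deliberately do not quote timings since they depend on the machine.

```python
import numpy as np

n = 100_000

# Pure Python: square every value with an explicit loop (list comprehension)
values = list(range(n))
squares_list = [v * v for v in values]

# NumPy: one vectorized expression applied to the whole array at once
arr = np.arange(n, dtype=np.int64)
squares_arr = arr * arr

# Both approaches produce the same numbers
print(squares_list[:5])
print(squares_arr[:5])
```

The NumPy version pushes the loop into compiled code, which is where the speed advantage on large arrays usually comes from.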

Using NumPy, programmers can do the following:

Mathematical and logical operations on arrays.
Fourier transforms and routines for shape manipulation.
Mathematical operations involving linear algebra. NumPy has built-in functions for linear algebra and random number generation.
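The capabilities listed above can be sketched in a few lines. This is only an illustration with made-up sample values; the variable names are my own.

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0, 4.0])
b = np.array([4.0, 3.0, 2.0, 1.0])

# Mathematical and logical operations work element by element
total = a + b
mask = np.logical_and(a > 1, b > 1)

# Fourier transform and shape manipulation
spectrum = np.fft.fft(a)
grid = a.reshape(2, 2)

# Linear algebra: determinant and inverse of the 2x2 matrix
det = np.linalg.det(grid)
inv = np.linalg.inv(grid)

# Random number generation with a seeded generator
rng = np.random.default_rng(seed=0)
samples = rng.random(3)

print(total, mask, grid.shape, det)
```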

NumPy – The perfect replacement for MATLAB

NumPy is commonly used together with packages such as SciPy (Scientific Python) and Matplotlib (a plotting library). This combination is widely used as an alternative to MATLAB, a popular platform for technical computing. Compared with MATLAB, however, Python is now seen as a more complete and modern programming language. Most importantly, NumPy is free and open source, whereas MATLAB is proprietary software that requires a paid license.

How to install NumPy

In this article I will work on Ubuntu with the Django framework. If you are on another operating system, a quick Google search will turn up detailed instructions.

First open up Terminal and enter

sudo apt install python3-pip
pip install numpy

You need to install Numpy via pip

Once NumPy is installed, we import it like any other Python library in order to use its functions:

import numpy as np

With the installation complete, we will learn about the data types in NumPy.

2. Arrays

An array is a data structure that contains a group of elements. Typically, all of these elements have the same data type, such as integers or strings. Arrays are often used in programs to organize data so that a set of related values can be easily sorted or searched.

In NumPy, the array is the central data structure of the library. It is a grid of values together with information about the raw data, how to locate an element, and how to interpret an element. The grid can be indexed in a variety of ways, and all of its elements have the same type, referred to as the array's data type (dtype).

An array can be indexed by a tuple of non-negative integers, by booleans, by another array, or by integers. The rank of the array is its number of dimensions. The shape of the array is a tuple of integers giving the size of the array along each dimension. One way to initialize a NumPy array is from a nested Python list.
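As a small sketch of the terms above, the following builds an array from a nested list and inspects its rank and shape, then indexes it in the different ways mentioned. The array contents are arbitrary sample values.

```python
import numpy as np

# Initialize from a nested Python list: 2 rows, 3 columns
m = np.array([[1, 2, 3], [4, 5, 6]])

print(m.ndim)   # rank (number of dimensions): 2
print(m.shape)  # size along each dimension: (2, 3)

# Index with a tuple of non-negative integers
print(m[0, 2])            # 3

# Index with other arrays (here plain lists of row and column indices)
print(m[[0, 1], [1, 2]])  # [2 6]

# Index with booleans
print(m[m > 4])           # [5 6]
```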

An array is known as the central data structure of the NumPy library. The array in NumPy is called the NumPy Array.

The most important object defined in NumPy is an N-dimensional array type called ndarray. It describes a collection of items of the same type. Items in the collection can be accessed with a zero-based index.

Every item in an ndarray takes up a block of memory of the same size. Each element in an ndarray is described by a data type object (called a dtype).

Any item extracted from an ndarray object (by indexing) is represented by a Python object of one of the array scalar types. The following chart shows the relationship between ndarray, data type object (dtype), and array scalar type:
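A quick way to see that every item occupies the same amount of memory is to look at the itemsize and nbytes attributes. This is a minimal sketch; the sample values are arbitrary.

```python
import numpy as np

a = np.array([1, 2, 3], dtype=np.int32)
print(a.dtype)     # int32
print(a.itemsize)  # 4 bytes per element
print(a.nbytes)    # 12 bytes in total (3 elements * 4 bytes)

# Converting to another dtype changes the per-element size
b = a.astype(np.float64)
print(b.itemsize)  # 8
```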
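To illustrate the link between an ndarray, its dtype, and the array scalar type, here is a short sketch with sample values of my own choosing. Pulling out a single item gives an array scalar, while taking a slice gives another ndarray.

```python
import numpy as np

a = np.array([1.5, 2.5], dtype=np.float32)

# A single extracted item is an array scalar matching the dtype
item = a[0]
print(type(item) is np.float32)      # True
print(isinstance(item, np.generic))  # True: np.generic is the base of array scalars

# A slice, in contrast, is still an ndarray
piece = a[0:1]
print(isinstance(piece, np.ndarray))  # True

# .item() converts an array scalar to a plain Python object
print(type(item.item()) is float)     # True
```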

An instance of ndarray can be constructed using the different array creation procedures described later in the tutorial. Ndarray is basically created with an array function in NumPy as follows:

numpy.array

It creates an ndarray from any object that exposes the array interface, or from any object whose __array__ method returns an array.

numpy.array(object, dtype=None, copy=True, order='K', subok=False, ndmin=0)

The constructor above takes the following parameters:

No	Parameter description
1	object – Any object exposing the array interface, an object whose __array__ method returns an array, or any (nested) sequence
2	dtype – The desired data type of the array, optional
3	copy – Optional. By default (True), the object is copied
4	order – Memory layout: 'C' (row-major), 'F' (column-major), 'A' (any) or 'K' (keep, the default)
5	subok – By default, the returned array is forced to be a base-class array. If True, subclasses are passed through
6	ndmin – Specifies the minimum number of dimensions of the resulting array
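The parameters in the table can be tried out with a short sketch like the one below. The input data and variable names are only illustrative.

```python
import numpy as np

data = [[1, 2], [3, 4]]

# dtype: request a specific element type
as_float = np.array(data, dtype=float)

# copy: by default np.array copies its input, so the source stays untouched
src = np.array([1, 2, 3])
copied = np.array(src)
copied[0] = 99

# order: 'F' asks for column-major (Fortran-style) memory layout
col_major = np.array(data, order='F')

# ndmin: pad the shape with leading 1s up to the requested number of dimensions
at_least_3d = np.array(data, ndmin=3)

print(as_float.dtype)                   # float64
print(src[0])                           # 1
print(col_major.flags['F_CONTIGUOUS'])  # True
print(at_least_3d.shape)                # (1, 2, 2)
```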

A few examples make this clearer:

import numpy as np
a = np.array([1, 2, 3])
print(a)

The result: [1 2 3]

# more than one dimension
import numpy as np
a = np.array([[1, 2], [3, 4]])
print(a)

The result:
[[1 2]
 [3 4]]

# minimum dimensions
import numpy as np
a = np.array([1, 2, 3, 4, 5], ndmin=2)
print(a)

The result: [[1 2 3 4 5]]

# dtype parameter
import numpy as np
a = np.array([1, 2, 3], dtype=complex)
print(a)

The result: [1.+0.j 2.+0.j 3.+0.j]

Difference between Python List and Numpy Array

A Python list can contain elements of different data types, while the elements of a NumPy array always share the same data type.
A NumPy array is faster and more compact than a Python list:
- A NumPy array stores its data in fixed-size elements and uses less memory than a Python list.
- A NumPy array allocates its elements in contiguous memory.
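The differences above can be observed directly. The following is a rough sketch with sample values I picked for illustration.

```python
import numpy as np

# A Python list happily mixes element types
mixed = [1, "two", 3.0]
print([type(x).__name__ for x in mixed])  # ['int', 'str', 'float']

# A NumPy array converts everything to one common dtype
arr = np.array([1, 2, 3.5])
print(arr.dtype)  # float64: the integers were upcast to float

# Fixed-size elements stored in one contiguous block
big = np.arange(1000, dtype=np.int64)
print(big.itemsize)               # 8 bytes per element
print(big.nbytes)                 # 8000 bytes of element data
print(big.flags['C_CONTIGUOUS'])  # True
```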

Data type in NumPy

NumPy supports many more numeric types than Python. The following table shows the different scalar data types defined in NumPy.

No	Data type & description
1	bool_ – Boolean (True or False), stored as a byte
2	int_ – Default integer type (like C long; usually int64 or int32)
3	intc – Identical to C int (usually int32 or int64)
4	intp – Integer used for indexing (same as C ssize_t; usually int32 or int64)
5	int8 – Byte (-128 to 127)
6	int16 – Integer (-32768 to 32767)
7	int32 – Integer (-2147483648 to 2147483647)
8	int64 – Integer (-9223372036854775808 to 9223372036854775807)
9	uint8 – Unsigned integer (0 to 255)
10	uint16 – Unsigned integer (0 to 65535)
11	uint32 – Unsigned integer (0 to 4294967295)
12	uint64 – Unsigned integer (0 to 18446744073709551615)
13	float_ – Shorthand for float64 (this alias was removed in NumPy 2.0; use float64)
14	float16 – Half-precision float: sign bit, 5-bit exponent, 10-bit mantissa
15	float32 – Single-precision float: sign bit, 8-bit exponent, 23-bit mantissa
16	float64 – Double-precision float: sign bit, 11-bit exponent, 52-bit mantissa
17	complex_ – Shorthand for complex128 (this alias was removed in NumPy 2.0; use complex128)
18	complex64 – Complex number, represented by two 32-bit floats (real and imaginary components)
19	complex128 – Complex number, represented by two 64-bit floats (real and imaginary components)
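The ranges and bit widths in the table do not need to be memorized. As a small sketch, np.iinfo and np.finfo report them for integer and floating-point types respectively.

```python
import numpy as np

# Integer limits
int8_info = np.iinfo(np.int8)
print(int8_info.min, int8_info.max)  # -128 127
print(np.iinfo(np.uint16).max)       # 65535

# Floating-point layout: total bits and mantissa bits
print(np.finfo(np.float16).bits, np.finfo(np.float16).nmant)  # 16 10
print(np.finfo(np.float32).bits, np.finfo(np.float32).nmant)  # 32 23
print(np.finfo(np.float64).bits, np.finfo(np.float64).nmant)  # 64 52

# A complex128 value is two 64-bit floats, so 16 bytes
print(np.dtype(np.complex128).itemsize)  # 16
```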

Data type object (dtype)

A data type object describes how the fixed-size block of memory corresponding to an array item should be interpreted, covering the following aspects:

Data type (integer, float or Python object)
The size of the data
Byte order (little-endian or big-endian)
In the case of a structured type, the names of the fields, the data type of each field, and the part of the memory block taken by each field
If the data type is a sub-array, its shape and data type

The byte order is specified by prefixing the data type with '<' or '>'. '<' means the encoding is little-endian (the least significant byte is stored at the smallest address). '>' means the encoding is big-endian (the most significant byte is stored at the smallest address).
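The effect of the byte-order prefix is easiest to see by storing the same number both ways and looking at the raw bytes. This is a minimal sketch using a 4-byte integer.

```python
import numpy as np

# The value 1 stored as a 4-byte integer in both byte orders
little = np.array([1], dtype='<i4')
big = np.array([1], dtype='>i4')

# Little-endian: least significant byte comes first
print(little.tobytes())  # b'\x01\x00\x00\x00'

# Big-endian: most significant byte comes first
print(big.tobytes())     # b'\x00\x00\x00\x01'

# The numeric value is the same either way
print(little[0] == big[0])  # True
```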

A dtype object is constructed with the following syntax:

numpy.dtype(object, align, copy)

The parameters are:

object – The object to be converted to a data type object
align – If True, adds padding to the fields to make the layout similar to a C struct
copy – Creates a new copy of the dtype object. If False, the result is a reference to the built-in data type object
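Here is a short sketch of building dtype objects, including a structured type and the effect of align. The field names (name, age, score) and the sample records are made up for illustration.

```python
import numpy as np

# Simple dtypes can be built from a NumPy type or a string code
dt = np.dtype(np.int32)
dt_from_code = np.dtype('i4', align=False, copy=True)
print(dt == dt_from_code)  # True

# A structured dtype: field names and the type of each field
fields = [('name', 'S10'), ('age', 'i1'), ('score', 'f4')]
person = np.dtype(fields)
people = np.array([(b'abc', 21, 50.0), (b'xyz', 18, 75.0)], dtype=person)
print(people['age'])       # [21 18]
print(person.itemsize)     # 15 bytes: 10 + 1 + 4, packed with no padding

# align=True inserts padding so fields sit at C-struct-like offsets
aligned = np.dtype(fields, align=True)
print(aligned.itemsize)    # larger than 15 (16 on typical platforms)
```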

Array Indexing

NumPy provides several ways to access elements in arrays

Indexing and slicing: Each element of a 1-dimensional array corresponds to an index. Indices in NumPy, like indices in Python, start at 0, so if a 1-dimensional array has n elements, the indices run from 0 to n – 1. And like Python lists, NumPy arrays can be sliced.
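For the 1-dimensional case just described, a minimal sketch looks like this. The values are arbitrary; note that a basic slice of a NumPy array is a view onto the original data rather than a copy.

```python
import numpy as np

v = np.array([10, 20, 30, 40, 50])  # n = 5 elements, indices 0..4

print(v[0])    # 10, the first element
print(v[4])    # 50, the last element (index n - 1)
print(v[-1])   # 50, negative indices count from the end
print(v[1:4])  # [20 30 40]
print(v[::2])  # [10 30 50]

# A slice is a view: writing through it changes the original array
view = v[1:4]
view[0] = 99
print(v[1])    # 99
```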

import numpy as np

# Create a NumPy array with shape (3, 4):
a = np.array([[1, 2, 3, 4],
              [5, 6, 7, 8],
              [9, 10, 11, 12]])
# Use indices to get the element at row 1, column 2
print(a[1][2])  # 7
print(a[1, 2])  # 7
# Use slicing to get the first 2 rows of the first 2 columns
print(a[:2, :2])
# [[1 2]
#  [5 6]]
# Combine slicing and indexing
# Note: this yields an array of lower rank than the original
r1 = a[1, :]  # Rank 1, row 1 of a
print(r1, r1.shape)  # [5 6 7 8] (4,)

Boolean array indexing:

Allows you to select arbitrary elements of an array, often used to select elements that satisfy certain conditions.

import numpy as np

a = np.array([[1, 2], [3, 4], [5, 6]])
bool_idx = (a > 2)  # Find the elements greater than 2;
# this returns a NumPy array of Booleans with the same shape as a,
# where each element is
# True if the corresponding element of a is > 2,
# False otherwise.
print(bool_idx)
# [[False False]
#  [ True  True]
#  [ True  True]]
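The Boolean array is usually used as an index right away. As a small follow-up sketch, reusing the same sample array, it can select the matching elements or assign to them.

```python
import numpy as np

a = np.array([[1, 2], [3, 4], [5, 6]])
bool_idx = a > 2

# Indexing with the mask returns a rank-1 array of the selected elements
selected = a[bool_idx]
print(selected)   # [3 4 5 6]

# The condition can be written inline
evens = a[a % 2 == 0]
print(evens)      # [2 4 6]

# A mask can also be used for assignment (done on a copy here)
b = a.copy()
b[b > 2] = 0
print(b.tolist()) # [[1, 2], [0, 0], [0, 0]]
```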

Conclusion

NumPy is a popular and powerful math library for Python. It allows efficient work with arrays and matrices, especially large ones, with much faster processing than pure Python alone.

In this article I introduced NumPy, its benefits, and how to install it, and we looked at the NumPy array and NumPy's numerical data types. In the next article we will continue with other data types in NumPy.



Source : Viblo
