Essential Python Libraries
Essential Python Libraries
Essential Python Libraries
Python
Libraries
CS158-1
NumPy
NumPy, short for Numerical Python, has
long been a cornerstone of numerical
computing in Python. It provides the data
structures, algorithms, and library glue
needed for most scientific applications
involving numerical data in Python.
NumPy contains among other things
• A fast and efficient multidimensional array object
ndarray
• Functions for performing element-wise
computations with arrays or mathematical
operations between arrays
• Tools for reading and writing array-based datasets
to disk
• Linear algebra operations, Fourier transform, and
random number generation
• A mature C API to enable Python extensions and
native C or C++ code to access NumPy’s data
structures and computational facilities
NumPy Advantages
• NumPy arrays are more efficient for storing
and manipulating data than the other built-in
Python data structures.
• Also, libraries written in a lower-level
language, such as C or Fortran, can operate
on the data stored in a NumPy array without
copying data into some other memory
representation.
• many numerical computing tools for Python
either assume NumPy arrays as a primary
data structure or else target seamless
interoperability with NumPy.
Pandas
• Pandas provides high-level data structures and
functions designed to make working with
structured or tabular data fast, easy, and
expressive.
• Pandas blends the high-performance, array-
computing ideas of NumPy with the flexible data
manipulation capabilities of spreadsheets and
relational databases (such as SQL).
• It provides sophisticated indexing functionality to
make it easy to reshape, slice and dice, perform
aggregations, and select subsets of data.
Matplotlib
learn
programmers. In just seven years,
it has had over 1,500 contributors
from around the world.
Classification: SVM, nearest neighbors, random forest,
logistic regression, etc.
NumPy Splitting
Joining merges multiple arrays into
Array one and Splitting breaks one array
into multiple.
• One of the most used method for getting a quick overview of the
DataFrame, is the head() method.
• The head() method returns the headers and a specified number of
rows, starting from the top.