Neural Network


NEURAL NETWORK


Neural networks, also known as artificial neural networks
(ANNs) or simulated neural networks (SNNs), are a subset of
machine learning and are at the heart of deep learning
algorithms. Their name and structure are inspired by the
human brain, mimicking the way that biological neurons
signal to one another.
ARTIFICIAL NEURAL NETWORK (ANN)
Artificial neural networks (ANNs) are composed of node layers: an input layer, one or more hidden
layers, and an output layer. Each node, or artificial neuron, connects to others and has an associated weight and
threshold. If the output of any individual node is above the specified threshold value, that node is activated, sending
data to the next layer of the network. Otherwise, no data is passed along to the next layer of the network.
The number of hidden layers is arbitrary. Units in the input layer are called input units. Units in the
hidden and output layers are called neurodes.
The human brain is made up of about 85 billion neurons, resulting in a network capable of storing a
tremendous amount of knowledge. As you might expect, this dwarfs the brains of other living
creatures. For instance, a cat has roughly a billion neurons, a mouse has about 75 million neurons,
and a cockroach has only about a million neurons. In contrast, many ANNs contain far fewer neurons,
typically only several hundred, so we're in no danger of creating an artificial brain anytime in the near
future. Even a fruit fly brain with 100,000 neurons far exceeds the size of such networks.
1. Rudimentary ANNs have been used for over 50 years to simulate the brain's approach to problem solving.
2. At first, this involved learning simple functions, like the logical AND function or the logical OR.
3. These early exercises were used primarily to construct models of how biological brains might function.

However, as computers have become increasingly powerful in recent years, the complexity of ANNs has likewise
increased such that they are now frequently applied to more practical problems such as:
• Speech and handwriting recognition programs like those used by voicemail transcription services and
postal mail sorting machines.
• The automation of smart devices like an office building's environmental controls or self-driving cars and
self-piloting drones.
• Sophisticated models of weather and climate patterns, tensile strength, fluid dynamics, and many other
scientific, social, or economic phenomena.

ANNs are versatile learners that can be applied to nearly any learning task: classification, numeric
prediction, and even unsupervised pattern recognition.
From biological to artificial neurons

Because ANNs were intentionally designed as conceptual models
of human brain activity, it is helpful to first understand how
biological neurons function. As illustrated in the following figure,
incoming signals are received by the cell's dendrites through a
biochemical process that allows the impulse to be weighted
according to its relative importance or frequency. As the cell body
begins to accumulate the incoming signals, a threshold is reached
at which the cell fires and the output signal is then transmitted via
an electrochemical process down the axon. At the axon's terminals,
the electric signal is again processed as a chemical signal to be
passed to the neighboring neurons across a tiny gap known as a
synapse.
The model of a single artificial neuron can be understood in
terms very similar to the biological model. As depicted in the
following figure, a directed network diagram defines a
relationship between the input signals received by the
dendrites (x variables) and the output signal (y variable). Just as
with the biological neuron, each dendrite's signal is weighted
(w values) according to its importance—ignore for now how
these weights are determined. The input signals are summed
by the cell body and the signal is passed on according to an
activation function denoted by f.
A typical artificial neuron with n input dendrites can be
represented by the formula that follows. The w weights allow
each of the n inputs, (x), to contribute a greater or lesser
amount to the sum of input signals. The net total is used by the
activation function f(x), and the resulting signal, y(x), is the
output axon:

$$y(x) = f\left(\sum_{i=1}^{n} w_i x_i\right)$$
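The single-neuron model described above can be sketched in a few lines of Python. This is only an illustration: the function names, the example inputs, and the example weights are made up for this sketch, and the identity function stands in for the activation function f.

```python
def neuron_output(inputs, weights, activation):
    """Weight each input, sum the results, and pass the net total to the activation."""
    if len(inputs) != len(weights):
        raise ValueError("inputs and weights must have the same length")
    net = sum(w * x for w, x in zip(weights, inputs))
    return activation(net)


def identity(net):
    """Placeholder activation that passes the net input through unchanged."""
    return net


# Hypothetical example values: three input signals and their weights.
x = [1.0, 0.5, -2.0]
w = [0.4, 0.2, 0.1]
y = neuron_output(x, w, identity)
print(y)
```

Any other activation function can be passed in place of identity without changing the weighted-sum step.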
There are numerous variants of neural networks, but each can be defined in terms of the following
characteristics:

• An activation function, which transforms a neuron's net input signal into a single output signal to be
broadcast further in the network
• A network topology (or architecture), which describes the number of neurons in the model as well as
the number of layers and the manner in which they are connected
• A training algorithm, which specifies how connection weights are set in order to inhibit or excite
neurons in proportion to the input signal
Activation functions

The activation function is the mechanism by which the artificial
neuron processes information and passes it throughout the
network.
Threshold activation function
In the biological case, the activation function could be
imagined as a process that involves summing the total input
signal and determining whether it meets the firing threshold. If
so, the neuron passes on the signal; otherwise, it does nothing.
In ANN terms, this is known as a threshold activation function,
as it results in an output signal only once a specified input
threshold has been attained.
Unit step activation function
With a threshold of zero, the neuron fires when the sum of input signals is at least zero. Because of its
shape, it is sometimes called a unit step activation function.
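A minimal sketch of the unit step function, assuming an output of 1 for "fires" and 0 for "does nothing". The function name and the threshold parameter are illustrative and do not come from the slides.

```python
def unit_step(net, threshold=0.0):
    """Return 1 when the summed input reaches the threshold, otherwise 0."""
    return 1 if net >= threshold else 0


# The output jumps from 0 to 1 exactly at the threshold.
for net in (-2.0, -0.1, 0.0, 0.1, 2.0):
    print(net, unit_step(net))
```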


Sigmoid activation function or logistic sigmoid

$$f(x) = \frac{1}{1 + e^{-x}}$$

where e is the base of natural logarithms. Although it shares a
similar step or S shape with the threshold activation function,
the output signal is no longer binary; output values can fall
anywhere in the range from 0 to 1. Additionally, the sigmoid is
differentiable, which means that it is possible to calculate the
derivative across the entire range of inputs.
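The logistic sigmoid and its derivative can be sketched as follows. The derivative uses the standard identity f'(x) = f(x)(1 - f(x)). The function names are illustrative, and the two-branch form is just one way to avoid overflow in math.exp for large negative inputs.

```python
import math


def sigmoid(x):
    """Logistic sigmoid, written so that math.exp never receives a large positive argument."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def sigmoid_derivative(x):
    """Derivative of the sigmoid, using f'(x) = f(x) * (1 - f(x))."""
    s = sigmoid(x)
    return s * (1.0 - s)


for x in (-4.0, 0.0, 4.0):
    print(x, sigmoid(x), sigmoid_derivative(x))
```

Because the derivative exists for every input, this function can be used with gradient-based training, which the unit step function cannot.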
Other activation functions
• Linear activation function
• Saturated activation function
• Gaussian activation function
The primary detail that differentiates these activation
functions is the output signal range. Typically, this is one of (0,
1), (-1, +1), or (-inf, +inf). The choice of activation function
biases the neural network such that it may fit certain types of
data more appropriately, allowing the construction of
specialized neural networks. For instance, a linear activation
function results in a neural network very similar to a linear
regression model, while a Gaussian activation function results
in a model called a Radial Basis Function (RBF) network.
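To make the difference in output ranges concrete, here is a rough sketch of three of the functions named above. The exact forms are assumptions made for illustration: the saturated function is taken to be a linear function clamped to [-1, 1], and the Gaussian is taken to be exp(-x^2). The original figures may use different scalings.

```python
import math


def linear(x):
    """Unbounded output: range (-inf, +inf)."""
    return x


def saturated_linear(x, low=-1.0, high=1.0):
    """Linear in the middle, clamped to [low, high] (assumed bounds)."""
    return max(low, min(high, x))


def gaussian(x):
    """Bell-shaped output that peaks at 1 when x is 0 (assumed unnormalized form)."""
    return math.exp(-x * x)


samples = (-5.0, -1.0, 0.0, 1.0, 5.0)
for name, f in (("linear", linear),
                ("saturated linear", saturated_linear),
                ("gaussian", gaussian)):
    outputs = [f(x) for x in samples]
    print(name, min(outputs), max(outputs))
```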
Network topology
1. The capacity of a neural network to learn is rooted in its
topology, or the patterns and structures of interconnected
neurons.
2. Although there are countless forms of network architecture,
they can be differentiated by three key characteristics:
• The number of layers
• Whether information in the network is allowed to travel
backward
• The number of nodes within each layer of the network
The topology determines the complexity of tasks that can be
learned by the network. Generally, larger and more complex
networks are capable of identifying more subtle patterns and
complex decision boundaries. However, the power of a network
is not only a function of the network size, but also the way
units are arranged.
The number of layers

Nodes are grouped into layers according to their role:
• Input nodes: the nodes that receive unprocessed
signals directly from the input data.
• Output nodes: the nodes that generate the final
predicted values.
• Hidden nodes: nodes that process the signals from the
input nodes before they reach the output nodes.
1. The input and output nodes are arranged in groups known
as layers. Because the input nodes process the incoming
data exactly as received, the network has only one set of
connection weights (labelled here as w1, w2, and w3).
2. It is therefore termed a single-layer network. Single-layer
networks can be used for basic pattern classification,
particularly for patterns that are linearly separable, but more
sophisticated networks are required for most learning tasks.
3. As you might expect, an obvious way to create more
complex networks is by adding additional layers.
4. As depicted here, a multilayer network adds one or more
hidden layers that process the signals from the input nodes
prior to reaching the output node.
5. Most multilayer networks are fully connected, which means
that every node in one layer is connected to every node in
the next layer, but this is not required.
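The difference between a single-layer and a multilayer network can be sketched as a forward pass through a list of fully connected layers. This is a simplified illustration: the weight values are made up, bias terms are omitted, and a sigmoid activation is assumed for every node.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def layer_forward(inputs, weights):
    """weights[j][i] is the weight on the connection from input i to node j."""
    return [sigmoid(sum(w * x for w, x in zip(node_weights, inputs)))
            for node_weights in weights]


def network_forward(inputs, layers):
    """Feed the signal through each fully connected layer in turn."""
    signal = inputs
    for weights in layers:
        signal = layer_forward(signal, weights)
    return signal


# Single-layer network: 3 input nodes feed 1 output node (one set of weights w1..w3).
single_layer = [[[0.5, -0.3, 0.8]]]

# Multilayer network: 3 input nodes -> 2 hidden nodes -> 1 output node.
multilayer = [
    [[0.2, -0.4, 0.1], [0.7, 0.3, -0.6]],
    [[1.0, -1.0]],
]

x = [1.0, 0.0, 1.0]
print(network_forward(x, single_layer))
print(network_forward(x, multilayer))
```

Adding a hidden layer only adds one more entry to the list of weight matrices; the forward pass itself is unchanged.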
The direction of information travel
1. Networks in which the input signal is fed continuously in
one direction from connection-to-connection until reaching
the output layer are called feedforward networks.
2. Networks that allow signals to travel in both directions
using loops are called feedback networks or recurrent
networks.
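The practical effect of a feedback loop can be sketched with a single unit. In this illustration (the names, weights, and tanh activation are assumptions), the feedforward unit sees only the current input, while the recurrent unit also receives its own previous output.

```python
import math


def feedforward_unit(xs, w_in):
    """Each output depends only on the current input."""
    return [math.tanh(w_in * x) for x in xs]


def recurrent_unit(xs, w_in, w_rec):
    """Each output also depends on the previous output, fed back through w_rec."""
    h = 0.0
    outputs = []
    for x in xs:
        h = math.tanh(w_in * x + w_rec * h)
        outputs.append(h)
    return outputs


sequence = [1.0, 0.0, 0.0]
print(feedforward_unit(sequence, w_in=1.0))
print(recurrent_unit(sequence, w_in=1.0, w_rec=0.5))
```

After the first input, the feedforward unit outputs zero, while the recurrent unit still carries a trace of the earlier signal.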
The number of nodes in each layer

• Number of input nodes: determined by the number of features in the input data.
• Number of output nodes: determined by the number of outcomes to be modelled.
• Number of hidden nodes: left to the user to decide prior to training the model. The appropriate number
depends on the number of input nodes, the amount of training data, the amount of noisy data, the
complexity of the learning task, and many other factors.
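One way to see why the number of hidden nodes matters is to count the connection weights that must be learned. The sketch below assumes a fully connected network; the function name and the optional bias weights are illustrative and are not part of the slides.

```python
def count_connection_weights(layer_sizes, include_bias=False):
    """Count weights in a fully connected network given nodes per layer."""
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out
        if include_bias:
            total += n_out  # one extra weight per receiving node
    return total


# 4 input features, one outcome, and a varying number of hidden nodes.
for hidden in (3, 10, 50):
    print(hidden, count_connection_weights([4, hidden, 1]))
```

More hidden nodes mean more weights to estimate from the same training data, which is one reason the appropriate number depends on the amount of data and noise.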
The training algorithm

1. The training algorithm specifies how connection weights are set in order to inhibit or excite neurons in
proportion to the input signal.
2. There are mainly two algorithms for training a single perceptron:
• Perceptron rule: used when the training dataset is linearly separable
• Delta rule: used when the training dataset is not linearly separable
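The two rules can be sketched side by side. In their textbook forms, the perceptron rule updates the weights using the thresholded output, while the delta rule performs gradient descent on the unthresholded (linear) output. The function names, learning rates, epoch counts, and datasets below are assumptions chosen for illustration.

```python
def step(net):
    return 1 if net >= 0 else 0


def net_input(weights, x):
    """weights[0] is a bias weight paired with a constant input of 1."""
    return weights[0] + sum(w * xi for w, xi in zip(weights[1:], x))


def train_perceptron(data, rate=1.0, epochs=20):
    """Perceptron rule: the error is computed from the thresholded output."""
    weights = [0.0] * (len(data[0][0]) + 1)
    for _ in range(epochs):
        for x, target in data:
            err = target - step(net_input(weights, x))
            weights[0] += rate * err
            for i, xi in enumerate(x):
                weights[i + 1] += rate * err * xi
    return weights


def train_delta(data, rate=0.1, epochs=200):
    """Delta rule: the error is computed from the unthresholded linear output."""
    weights = [0.0] * (len(data[0][0]) + 1)
    for _ in range(epochs):
        for x, target in data:
            err = target - net_input(weights, x)
            weights[0] += rate * err
            for i, xi in enumerate(x):
                weights[i + 1] += rate * err * xi
    return weights


def squared_error(weights, data):
    return sum((t - net_input(weights, x)) ** 2 for x, t in data)


# Logical AND is linearly separable; XOR is not.
and_data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
xor_data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

p = train_perceptron(and_data)
d = train_delta(xor_data)
print([step(net_input(p, x)) for x, _ in and_data])
print(squared_error(d, xor_data))
```

On the separable AND data the perceptron rule reaches weights that classify every example correctly. On XOR no such weights exist, but the delta rule still moves toward the best available linear fit.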
Training neural networks with backpropagation
What is Backpropagation?
Backpropagation is the essence of neural network training. It is the method of fine-tuning the weights of a neural
network based on the error rate obtained in the previous epoch (i.e., iteration). Proper tuning of the weights allows
you to reduce error rates and make the model reliable by increasing its generalization.

Backpropagation is an algorithm for supervised learning of artificial neural networks using gradient descent.

Given an ANN and an error function, the method calculates the gradient of the error function with respect to the neural
network weights.
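As a small illustration of "the gradient of the error function with respect to the weights", the sketch below uses a single sigmoid neuron with squared error E = 0.5 (y - t)^2. Both choices are assumptions made for this example. It compares the chain-rule gradient with a finite-difference estimate.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def error(weights, inputs, target):
    """Squared error E = 0.5 * (y - t)^2 for one sigmoid neuron."""
    y = sigmoid(sum(w * x for w, x in zip(weights, inputs)))
    return 0.5 * (y - target) ** 2


def analytic_gradient(weights, inputs, target):
    """Chain rule: dE/dw_i = (y - t) * y * (1 - y) * x_i."""
    y = sigmoid(sum(w * x for w, x in zip(weights, inputs)))
    return [(y - target) * y * (1.0 - y) * x for x in inputs]


def numerical_gradient(weights, inputs, target, eps=1e-6):
    """Central finite differences, used only to check the analytic result."""
    grads = []
    for i in range(len(weights)):
        up = list(weights)
        down = list(weights)
        up[i] += eps
        down[i] -= eps
        grads.append((error(up, inputs, target) - error(down, inputs, target)) / (2 * eps))
    return grads


# Hypothetical example values.
weights = [0.5, -0.3]
inputs = [1.0, 2.0]
target = 1.0
print(analytic_gradient(weights, inputs, target))
print(numerical_gradient(weights, inputs, target))
```

Gradient descent then moves each weight a small step in the direction opposite to its gradient component.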
How Backpropagation Algorithm Works
1. Inputs X arrive through the preconnected path.
2. The input is modelled using real weights W. The weights are
usually selected randomly.
3. Calculate the output for every neuron from the input
layer, through the hidden layers, to the output layer.
4. Calculate the error in the outputs:

Error = Actual Output - Desired Output

5. Travel back from the output layer to the hidden layer to
adjust the weights such that the error is decreased.
6. Keep repeating the process until the desired output is
achieved.
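The steps above can be traced on a tiny network with two inputs, two hidden nodes, and one output. This is a sketch under several assumptions: sigmoid activations, squared error, no bias terms, a learning rate of 0.5, and fixed starting weights (rather than random ones) so that the run is repeatable. All names and values are illustrative.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def forward(x, w_hidden, w_out):
    """Step 3: compute the hidden outputs, then the network output."""
    hidden = [sigmoid(sum(w * xi for w, xi in zip(ws, x))) for ws in w_hidden]
    output = sigmoid(sum(w * h for w, h in zip(w_out, hidden)))
    return hidden, output


def train_step(x, desired, w_hidden, w_out, rate=0.5):
    hidden, actual = forward(x, w_hidden, w_out)
    error = actual - desired                       # step 4
    # Step 5: pass the error back through the sigmoid derivatives.
    delta_out = error * actual * (1.0 - actual)
    # Hidden deltas use the output weights as they were before this update.
    delta_hidden = [delta_out * w * h * (1.0 - h) for w, h in zip(w_out, hidden)]
    new_w_out = [w - rate * delta_out * h for w, h in zip(w_out, hidden)]
    new_w_hidden = [[w - rate * d * xi for w, xi in zip(ws, x)]
                    for ws, d in zip(w_hidden, delta_hidden)]
    return new_w_hidden, new_w_out, error


x = [1.0, 0.0]                                     # step 1: inputs X
desired = 1.0
w_hidden = [[0.1, -0.2], [0.4, 0.3]]               # step 2: weights W (fixed here)
w_out = [0.2, -0.1]

errors = []
for _ in range(200):                               # step 6: repeat
    w_hidden, w_out, error = train_step(x, desired, w_hidden, w_out)
    errors.append(abs(error))

print(errors[0], errors[-1])
```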
In its most general form, the backpropagation algorithm
iterates through many cycles of two processes. Each iteration of
the algorithm is known as an epoch. Because the network
contains no a priori (existing) knowledge, typically the weights
are set randomly prior to beginning. Then, the algorithm cycles
through the processes until a stopping criterion is reached. The
cycles include:
1. A forward phase in which the neurons are activated in sequence from the input layer to the output layer,
applying each neuron's weights and activation function along the way. Upon reaching the final layer, an output
signal is produced.
2. A backward phase in which the network's output signal resulting from the forward phase is compared to the
true target value in the training data. The difference between the network's output signal and the true value
results in an error that is propagated backwards in the network to modify the connection weights between
neurons and reduce future errors.
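The cycle of forward and backward phases can be sketched as a training loop that runs epoch after epoch until a stopping criterion is met. Everything specific here is an assumption made for illustration: sigmoid activations, a constant bias input, a mean squared error tolerance as the stopping criterion, and the logical OR function as training data.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def forward_phase(x, w_hidden, w_out):
    """Activate the hidden layer, then the output node; 1.0 acts as a bias input."""
    hidden = [sigmoid(sum(w * v for w, v in zip(ws, x + [1.0]))) for ws in w_hidden]
    output = sigmoid(sum(w * v for w, v in zip(w_out, hidden + [1.0])))
    return hidden, output


def backward_phase(x, target, hidden, output, w_hidden, w_out, rate):
    """Propagate the output error backwards and adjust both weight sets in place."""
    delta_out = (output - target) * output * (1.0 - output)
    delta_hidden = [delta_out * w_out[j] * h * (1.0 - h) for j, h in enumerate(hidden)]
    for j, v in enumerate(hidden + [1.0]):
        w_out[j] -= rate * delta_out * v
    for j, d in enumerate(delta_hidden):
        for i, v in enumerate(x + [1.0]):
            w_hidden[j][i] -= rate * d * v


def mean_squared_error(data, w_hidden, w_out):
    return sum((forward_phase(x, w_hidden, w_out)[1] - t) ** 2 for x, t in data) / len(data)


def train(data, n_hidden=2, rate=0.5, max_epochs=5000, tolerance=0.01, seed=0):
    rng = random.Random(seed)  # weights start out random, as described above
    n_inputs = len(data[0][0])
    w_hidden = [[rng.uniform(-1, 1) for _ in range(n_inputs + 1)] for _ in range(n_hidden)]
    w_out = [rng.uniform(-1, 1) for _ in range(n_hidden + 1)]
    initial_error = mean_squared_error(data, w_hidden, w_out)
    error, epochs = initial_error, 0
    while epochs < max_epochs and error > tolerance:   # stopping criterion
        for x, t in data:
            hidden, output = forward_phase(x, w_hidden, w_out)
            backward_phase(x, t, hidden, output, w_hidden, w_out, rate)
        epochs += 1
        error = mean_squared_error(data, w_hidden, w_out)
    return initial_error, error, epochs


or_data = [([0.0, 0.0], 0.0), ([0.0, 1.0], 1.0), ([1.0, 0.0], 1.0), ([1.0, 1.0], 1.0)]
initial_error, final_error, epochs = train(or_data)
print(initial_error, final_error, epochs)
```

Each pass of the while loop is one epoch: every training example goes through a forward phase followed by a backward phase.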
Why Do We Need Backpropagation?
(Advantages)
• Backpropagation is fast, simple, and easy to program.
• It has no parameters to tune apart from the number of inputs.
• It is a flexible method, as it does not require prior knowledge
about the network.
• It is a standard method that generally works well.
• It does not need any special mention of the features of the
function to be learned.