SlideShare a Scribd company logo
1 of 4
LEARNING METHODS
One of the most important properties of neural networks is to improve their
performances by taking into accountthe past experiences. This is achieved
through a process called learning. This improvement takes place over time in
accordancewith some prescribed measure. A neural network learns about its
environment through an interactiveprocess of adjustmentsapplied to its
synapticweightsand biaslevels. Ideally, the network becomes more
knowledgeableaboutitsenvironmentafter each iteration of the learning
process.
However, the process oflearning is a matter of viewpoint of an individual.
Hence, there is no universally agreeable definition of learning due to ambiguity
across all the available ones. Forexample, learning in the opinion of a
psychologist is quite different from learning in a classroom sense. Considering
the discussion to be bound within the domain of neural networks, the following
definition of learning adapted from Mendel and McClaren can be used:
“Learning is a process by which the free parameters of a neural network are
adapted through a process of stimulation by the environment in which the
network is embedded. The type of learning is determined by the parameter in
which the changes take place.”
This definition of learning process implies the following sequence of events:
1. The neural network is stimulated by an environment.
2. The neural network undergoes changes in its free parameters as a result of
this stimulation.
3. The neural network responds in a new way to the environment because of
the changes that have occurred in its internal structure.
Learning Paradigms:
There are three major learning paradigms: supervised learning, unsupervised
learning and reinforcement learning. Of these three, the first one can be viewed
as “learning with a teacher”, while the remaining two can be viewed as
“learning withouta teacher”. Usually they can be employed by any given type
of artificial neural network architecture. A prescribed set of well-defined rules
for the solution of a learning problem is called a learning algorithm. Each
learning paradigm has many learning algorithms.
SUPERVISED LEARNING:
Supervised learning, sometimes referred to as learning with a teacher is a
learning technique that sets parameters of an artificial neural network from
training data which serves as the “teacher” in this case. The task of the learning
artificial neural network is to set the value of its parameters for any valid input
value after having seen output value. The training data consistof labeled pairs
of input and desired output values that are traditionally represented in data
vectors, which may be conceptually thought of as an “environment” and
remains unknown to the neural network of interest. The neural network, after
learning from the training data or “teacher”, provides an output to a random
input which resembles the training data examples as close as possible. The
deviation of the actual output is called the error signal. Clearly, for good
performance, the value of this signal should be zero ideally and the least
possible in practical implementation cases.
Supervised learning can also be referred as classification, where we have a
wide rangeof classifiers, each with its strengths and weaknesses. Choosing a
suitable classifier (Multilayer perceptron, SupportVector Machines, k-nearest
neighbour algorithm, decision tree, radial basis function classifiers and others)
for a given problem is however still more an art than a science.
Fig. 1: Block diagram of Supervised Learning Model
In order to solve a given problem of supervised learning various steps has to be
considered:
1. Determine the type of training examples
2. Gather a training data set that satisfactory describe a given problem
3. Describe gathered training data set in form understandable to a chosen
artificial neural network
4. Do the learning and after the learning test the performance of learned
artificial neural network with the test (validation) data set. Test data set
consist of data that has not been introduced to artificial neural network
while learning
UNSUPERVISED LEARNING:
In unsupervised learning, there is no training set or “teacher” to monitor the
progress in the learning process ofthe neural network. In lieu of that, provision
is made for a task-independentmeasure of the quality of the representation that
the network is required to learn, and the free parameters of the network are
optimized with respect to that measure. The unsupervised-training model
consists of the environment, represented by a measurement vector. The
measurement vector is fed to the learning system and the system responseis
obtained. Based upon the system responseand the adaptation rule employed, the
weights of the learning system are adjusted to obtain the desired performance.
Note that unlike the supervised-training method, the unsupervised method does
not need a desired output for each input-featurevector. The adaptation rule in
the unsupervised training algorithm performs the error-signal generation role
the teacher performs in the supervised-learning system. Thus, the behaviour of
the unsupervised learning system depends in large measure on the
adaptation rule used to control how the weights are adjusted.
Fig. 2: Block Diagram of an Unsupervised Training Model
Unsupervised learning is mostly used for solution of estimation problems such
as statistical modelling, compression, filtering, clustering and others. In
unsupervised learning we seek to determine how the data is organized. It differs
from supervised learning and reinforcement learning in that the artificial neural
network is given only unlabeled examples.
REINFORCEMENT LEARNING:
Reinforcement learning is a learning technique that sets parameters of an
artificial neural network, where data is usually not given, but generated by
interactions with the environment. Reinforcement learning is concerned with
how an artificial neural network should take actions in an environment so as to
maximize some notion of long-term reward. The network is not told which
actions to take, but instead must discover which actions yield the most reward
by trying them. Thus reinforcement learning can be defined as:
“Reinforcement learning is a learning process in which the training set consists
of input patterns, after completion of a sequence a value is returned to the
network indicating whethertheresultwasrightor wrong and, possibly, how right
or wrong it was.”
One of the challenges that arise in reinforcement learning and not in other kinds
of learning is the trade-off between exploration and exploitation.To obtain a
lot of reward, a reinforcement learning aided network must prefer actions that it
has tried in the past and found to be effective in producing reward. But to
discover such actions, it has to try actions that it has not selected before. The
network has to exploit what it already knows in order to obtain reward, but it
also has to explore in order to make better action selections in the future. The
dilemma is that neither exploration nor exploitation can be pursued exclusively
without failing at the task. It must try a variety of actions and progressively
favour those that appear to be best. On a stochastic task, each action must be
tried many times to gain a reliable estimate its expected reward.
Reinforcement learning is particularly suited to problems which include a long-
term versus short-term reward trade-off. It has been applied successfully to
various problems, including robotcontrol, telecommunications, and games such
as chess and other sequential decision making tasks.

More Related Content

What's hot

Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural NetworkAtul Krishna
 
Genetic algorithm ppt
Genetic algorithm pptGenetic algorithm ppt
Genetic algorithm pptMayank Jain
 
Learning set of rules
Learning set of rulesLearning set of rules
Learning set of rulesswapnac12
 
Artificial Intelligence: Artificial Neural Networks
Artificial Intelligence: Artificial Neural NetworksArtificial Intelligence: Artificial Neural Networks
Artificial Intelligence: Artificial Neural NetworksThe Integral Worm
 
Machine Learning with Decision trees
Machine Learning with Decision treesMachine Learning with Decision trees
Machine Learning with Decision treesKnoldus Inc.
 
2.5 backpropagation
2.5 backpropagation2.5 backpropagation
2.5 backpropagationKrish_ver2
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashrisheetal katkar
 
neural network
neural networkneural network
neural networkSTUDENT
 
Feedforward neural network
Feedforward neural networkFeedforward neural network
Feedforward neural networkSopheaktra YONG
 
Back propagation
Back propagationBack propagation
Back propagationNagarajan
 
Counter propagation Network
Counter propagation NetworkCounter propagation Network
Counter propagation NetworkAkshay Dhole
 
Multilayer & Back propagation algorithm
Multilayer & Back propagation algorithmMultilayer & Back propagation algorithm
Multilayer & Back propagation algorithmswapnac12
 
Neural network final NWU 4.3 Graphics Course
Neural network final NWU 4.3 Graphics CourseNeural network final NWU 4.3 Graphics Course
Neural network final NWU 4.3 Graphics CourseMohaiminur Rahman
 

What's hot (20)

Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 
Genetic algorithm ppt
Genetic algorithm pptGenetic algorithm ppt
Genetic algorithm ppt
 
Learning set of rules
Learning set of rulesLearning set of rules
Learning set of rules
 
Artificial Intelligence: Artificial Neural Networks
Artificial Intelligence: Artificial Neural NetworksArtificial Intelligence: Artificial Neural Networks
Artificial Intelligence: Artificial Neural Networks
 
Machine Learning with Decision trees
Machine Learning with Decision treesMachine Learning with Decision trees
Machine Learning with Decision trees
 
Perceptron
PerceptronPerceptron
Perceptron
 
Backpropagation algo
Backpropagation  algoBackpropagation  algo
Backpropagation algo
 
2.5 backpropagation
2.5 backpropagation2.5 backpropagation
2.5 backpropagation
 
Radial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and DhanashriRadial basis function network ppt bySheetal,Samreen and Dhanashri
Radial basis function network ppt bySheetal,Samreen and Dhanashri
 
Artificial Neural Network Topology
Artificial Neural Network TopologyArtificial Neural Network Topology
Artificial Neural Network Topology
 
neural network
neural networkneural network
neural network
 
Associative memory network
Associative memory networkAssociative memory network
Associative memory network
 
Feedforward neural network
Feedforward neural networkFeedforward neural network
Feedforward neural network
 
Back propagation
Back propagationBack propagation
Back propagation
 
Anfis (1)
Anfis (1)Anfis (1)
Anfis (1)
 
Counter propagation Network
Counter propagation NetworkCounter propagation Network
Counter propagation Network
 
Multilayer & Back propagation algorithm
Multilayer & Back propagation algorithmMultilayer & Back propagation algorithm
Multilayer & Back propagation algorithm
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Neural network
Neural networkNeural network
Neural network
 
Neural network final NWU 4.3 Graphics Course
Neural network final NWU 4.3 Graphics CourseNeural network final NWU 4.3 Graphics Course
Neural network final NWU 4.3 Graphics Course
 

Similar to Learning Methods in a Neural Network

Similar to Learning Methods in a Neural Network (20)

Learning in AI
Learning in AILearning in AI
Learning in AI
 
AI: Learning in AI 2
AI: Learning in AI 2AI: Learning in AI 2
AI: Learning in AI 2
 
AI: Learning in AI 2
AI: Learning in AI  2AI: Learning in AI  2
AI: Learning in AI 2
 
machine learning
machine learningmachine learning
machine learning
 
CSA 3702 machine learning module 1
CSA 3702 machine learning module 1CSA 3702 machine learning module 1
CSA 3702 machine learning module 1
 
Cognitive Science Unit 4
Cognitive Science Unit 4Cognitive Science Unit 4
Cognitive Science Unit 4
 
A Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement LearningA Review on Introduction to Reinforcement Learning
A Review on Introduction to Reinforcement Learning
 
ETRnew.doc.doc
ETRnew.doc.docETRnew.doc.doc
ETRnew.doc.doc
 
ETRnew.doc.doc
ETRnew.doc.docETRnew.doc.doc
ETRnew.doc.doc
 
B42010712
B42010712B42010712
B42010712
 
AI: Learning in AI
AI: Learning in AI AI: Learning in AI
AI: Learning in AI
 
AI: Learning in AI
AI: Learning in AI AI: Learning in AI
AI: Learning in AI
 
Soft computing
Soft computingSoft computing
Soft computing
 
AI Unit 5 machine learning
AI Unit 5 machine learning AI Unit 5 machine learning
AI Unit 5 machine learning
 
ANN - UNIT 3.pptx
ANN - UNIT 3.pptxANN - UNIT 3.pptx
ANN - UNIT 3.pptx
 
ANN - UNIT 3.pptx
ANN - UNIT 3.pptxANN - UNIT 3.pptx
ANN - UNIT 3.pptx
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Machine Learning techniques
Machine Learning techniques Machine Learning techniques
Machine Learning techniques
 
Operating system
Operating systemOperating system
Operating system
 
Neural network and fuzzy logic
Neural network and fuzzy logicNeural network and fuzzy logic
Neural network and fuzzy logic
 

Recently uploaded

Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 

Recently uploaded (20)

Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 

Learning Methods in a Neural Network

  • 1. LEARNING METHODS One of the most important properties of neural networks is to improve their performances by taking into accountthe past experiences. This is achieved through a process called learning. This improvement takes place over time in accordancewith some prescribed measure. A neural network learns about its environment through an interactiveprocess of adjustmentsapplied to its synapticweightsand biaslevels. Ideally, the network becomes more knowledgeableaboutitsenvironmentafter each iteration of the learning process. However, the process oflearning is a matter of viewpoint of an individual. Hence, there is no universally agreeable definition of learning due to ambiguity across all the available ones. Forexample, learning in the opinion of a psychologist is quite different from learning in a classroom sense. Considering the discussion to be bound within the domain of neural networks, the following definition of learning adapted from Mendel and McClaren can be used: “Learning is a process by which the free parameters of a neural network are adapted through a process of stimulation by the environment in which the network is embedded. The type of learning is determined by the parameter in which the changes take place.” This definition of learning process implies the following sequence of events: 1. The neural network is stimulated by an environment. 2. The neural network undergoes changes in its free parameters as a result of this stimulation. 3. The neural network responds in a new way to the environment because of the changes that have occurred in its internal structure. Learning Paradigms: There are three major learning paradigms: supervised learning, unsupervised learning and reinforcement learning. Of these three, the first one can be viewed as “learning with a teacher”, while the remaining two can be viewed as “learning withouta teacher”. Usually they can be employed by any given type of artificial neural network architecture. A prescribed set of well-defined rules for the solution of a learning problem is called a learning algorithm. Each learning paradigm has many learning algorithms.
  • 2. SUPERVISED LEARNING: Supervised learning, sometimes referred to as learning with a teacher is a learning technique that sets parameters of an artificial neural network from training data which serves as the “teacher” in this case. The task of the learning artificial neural network is to set the value of its parameters for any valid input value after having seen output value. The training data consistof labeled pairs of input and desired output values that are traditionally represented in data vectors, which may be conceptually thought of as an “environment” and remains unknown to the neural network of interest. The neural network, after learning from the training data or “teacher”, provides an output to a random input which resembles the training data examples as close as possible. The deviation of the actual output is called the error signal. Clearly, for good performance, the value of this signal should be zero ideally and the least possible in practical implementation cases. Supervised learning can also be referred as classification, where we have a wide rangeof classifiers, each with its strengths and weaknesses. Choosing a suitable classifier (Multilayer perceptron, SupportVector Machines, k-nearest neighbour algorithm, decision tree, radial basis function classifiers and others) for a given problem is however still more an art than a science. Fig. 1: Block diagram of Supervised Learning Model
  • 3. In order to solve a given problem of supervised learning various steps has to be considered: 1. Determine the type of training examples 2. Gather a training data set that satisfactory describe a given problem 3. Describe gathered training data set in form understandable to a chosen artificial neural network 4. Do the learning and after the learning test the performance of learned artificial neural network with the test (validation) data set. Test data set consist of data that has not been introduced to artificial neural network while learning UNSUPERVISED LEARNING: In unsupervised learning, there is no training set or “teacher” to monitor the progress in the learning process ofthe neural network. In lieu of that, provision is made for a task-independentmeasure of the quality of the representation that the network is required to learn, and the free parameters of the network are optimized with respect to that measure. The unsupervised-training model consists of the environment, represented by a measurement vector. The measurement vector is fed to the learning system and the system responseis obtained. Based upon the system responseand the adaptation rule employed, the weights of the learning system are adjusted to obtain the desired performance. Note that unlike the supervised-training method, the unsupervised method does not need a desired output for each input-featurevector. The adaptation rule in the unsupervised training algorithm performs the error-signal generation role the teacher performs in the supervised-learning system. Thus, the behaviour of the unsupervised learning system depends in large measure on the adaptation rule used to control how the weights are adjusted. Fig. 2: Block Diagram of an Unsupervised Training Model
  • 4. Unsupervised learning is mostly used for solution of estimation problems such as statistical modelling, compression, filtering, clustering and others. In unsupervised learning we seek to determine how the data is organized. It differs from supervised learning and reinforcement learning in that the artificial neural network is given only unlabeled examples. REINFORCEMENT LEARNING: Reinforcement learning is a learning technique that sets parameters of an artificial neural network, where data is usually not given, but generated by interactions with the environment. Reinforcement learning is concerned with how an artificial neural network should take actions in an environment so as to maximize some notion of long-term reward. The network is not told which actions to take, but instead must discover which actions yield the most reward by trying them. Thus reinforcement learning can be defined as: “Reinforcement learning is a learning process in which the training set consists of input patterns, after completion of a sequence a value is returned to the network indicating whethertheresultwasrightor wrong and, possibly, how right or wrong it was.” One of the challenges that arise in reinforcement learning and not in other kinds of learning is the trade-off between exploration and exploitation.To obtain a lot of reward, a reinforcement learning aided network must prefer actions that it has tried in the past and found to be effective in producing reward. But to discover such actions, it has to try actions that it has not selected before. The network has to exploit what it already knows in order to obtain reward, but it also has to explore in order to make better action selections in the future. The dilemma is that neither exploration nor exploitation can be pursued exclusively without failing at the task. It must try a variety of actions and progressively favour those that appear to be best. On a stochastic task, each action must be tried many times to gain a reliable estimate its expected reward. Reinforcement learning is particularly suited to problems which include a long- term versus short-term reward trade-off. It has been applied successfully to various problems, including robotcontrol, telecommunications, and games such as chess and other sequential decision making tasks.