SlideShare una empresa de Scribd logo
1 de 22
By:-
Abhishek Vishnoi(112501)
  NITTTR Chandhigarh
INTRODUCTION
 A fuzzy traffic signal controller uses simple “if–then”
  rules which involve linguistic concepts such as medium
  or long, presented as membership functions.
 In neuro-fuzzy traffic signal control, a neural network
  adjusts the fuzzy controller by fine-tuning the form
  and location of the membership functions.
 The learning algorithm of the neural network is
  reinforcement learning, which gives credit for
  successful system behaviour and punishes for poor
  behaviour;
Basics of Neuro-Fuzzy
 A combination of a neural network and a fuzzy system
  is called a neurofuzzy system. In neurofuzzy control,
  the parameters of the fuzzy controller are adjusted
  using a neural network.
 Neurofuzzy systems utilize both the linguistic,
  human-like reasoning of fuzzy systems and the
  powerful computing ability of neural networks.
 They can avoid some of the drawbacks of solely fuzzy
  or neural systems
Fuzzy system
 Fuzzy logic were brought to public attention by Zadeh.
 Fuzzy sets provide a mathematical interpretation for
  natural language terms.
 A fuzzy set is a set without a crisp, clearly defined
  boundary.
 It can contain elements with a partial membership
  function
 Fuzzy control uses a rule base, where the rules are
  propositions of the form “if X is S, then Y is T”.
  Here X and Y are linguistic variables,
 Membership function (MF) is a curve that defines how
  each point in the input space is mapped to a membership
  value (or degree of membership) between 0 and 1.
 Each linguistic terms is associated with a fuzzy set defined
  by a corresponding membership function.
Fuzzy traffic signal control

 the controller receives measurements of incoming
  traffic and chooses the length of the green signal
  accordingly.
 Advantage of fuzzy control systems over traditional
  ones:-
 Their ability to use expert knowledge in the form of
  fuzzy rule.
 Small number of parameters needed.
Fuzzy traffic signal control
• The traffic simulation environment used in this work is a two-
    phase controlled intersection of two-lane streets.
    In each approaching lane there are two traffic detectors, the first
    one before the stop line and the other at the stop line.
   These detectors send input measurements of traffic to the fuzzy
    controller:
    APP, number of approaching vehicles in the green direction
    and QUE, number of queuing vehicles in the red direction.
    Depending on the traffic situation, the green phase can be
    extended with one or several seconds.
   Output of the fuzzy controller is EXT, green time extension (in
    seconds).
   The linguistic values of
    APP are zero, a few, medium and many;
    QUE are a few, medium and too long;
    EXT are zero, short, medium and long.
Fuzzy rule
 The rule base consists of five rule sets. The choice of the rule
  set depends on how many green extensions have already been
  given.
 The objective of the rules is to split the green time and find
  the right moment of green termination so that the delay of
  vehicles is minimized
After minimum green (5 seconds)
 if APP is zero, then EXT is zero
 if APP is a few and if QUE is less than medium, then EXT is short
 if APP is more than a few, then EXT is medium
 if APP is medium, then EXT is long
After the first extension
 if APP is zero, then EXT is zero
 if APP is a few and if QUE is less than medium, then EXT is short
 if APP is medium, then EXT is medium
 if APP is many, then EXT is long

After the second extension
 if APP is zero, then EXT is zero
 if APP is a few and if QUE is less than medium, then EXT is short
 if APP is medium and if QUE is less than medium,
  then EXT is medium
 if APP is many and if QUE is less than medium, then EXT is long
After the third extension
 if APP is zero, then EXT is zero
 if QUE is too long, then EXT is zero
 if APP is more than a few and if QUE is less than medium,
  then EXT is short
 if APP is medium and if QUE is less than medium,
  then EXT is medium
 if APP is many and if QUE is less than a few, then EXT is long
 After the fourth extension
 if APP is zero, then EXT is zero
 if QUE is too long, then EXT is zero
 if APP is more than a few and if QUE is a few, then EXT is short
 if APP is medium and if QUE is less than a few,
  then EXT is medium
 if APP is many and if QUE is less than a few, then EXT is long
Reinforcement learning
Why is reinforcement learning needed?
 The parameters of the fuzzy controller could be
  updated      using     the      back      propagation
  algorithm common in supervised learning in neural
  networks.
 Backpropagation algorithm cannot be used; instead, a
  learning algorithm called reinforcement learning is
  used.
 In reinforcement learning, the system evaluates
  whether the previous control action was good or not. If
  the action had good consequences, the tendency to
  produce that action is strengthened, that is,
  reinforced.
Structure of the neurofuzzy control system
 The evaluation network gathers
  information about the decisions
  of the fuzzy controller and the
  delays of the vehicles.
 This reinforcement information
  is used in fine-tuning the
  membership functions of the
  fuzzy controller, which is also
  presented as a neural network.
 Thus there are actually two
  neural networks in the system:
  an evaluation network and a
  fuzzy controller network.
Evaluation network

 The evaluation network evaluates the goodness of the
  actions of the fuzzy controller based on information it
  has gathered by observing the process.
 The network fine-tunes the membership functions of
  the fuzzy controller by updating the parameters of the
  membership functions.
 It is a feedforward, multilayer perceptron-type
  network.
 The input variables of the network are APP and QUE,
  measurements of incoming traffic in green and red
  directions, respectively.
.
 The hidden layer activation function
  is a sigmoidal function
  zj(xj)=1/(1+exp(−xj)),
 where The size h of the hidden layer
  may vary, and there are no precise
  rules for determining how many
  cells it should contain. Increasing
  the size gives a more powerful and
  flexible network but requires a
  longer learning time. In our work
  the size of h=10 was found suitable
 The network output v is a measure of the goodness of
  the state of the network, a prediction of future
  reinforcement.               h

  v = b1 APP + b2 QUE +           cjzj
                              j 1
 The gradient descent algorithm is used in the learning
  phase of the evaluation network.
 If a positive reinforcement signal is received, the
  network weights are rewarded by being changed in the
  direction which increases their contribution to the
  total sum.
 If a negative signal is received, the weights are
  punished by being changed in the direction
  which decreases their contribution.
Fuzzy controller network

 Consider the first rule set, whose neural network
 presentation in figure.
Experimental results




Fig-Average vehicular delays (in              Fig-Average vehicular delays (in
seconds) before (dashed line) and after       seconds) before (dashed line) and
(solid line) the learning at traffic          after (solid line) the learning at traffic
volumes of 300, 500 and 1000 vehicles         volumes of 300, 500 and 1000 vehicles
per hour. The location of the first traffic   per hour. The location of the first
detector is 50 m from the stop line.          traffic detector is 100 m from the stop
                                              line.
Volume (veh/h) Det. (M)      dini           dnew           D
500             50           9.99           9.42           0.57
1000            50           14.78          13.96          0.81
500             100          9.11           8.82           0.81
1000            100          15.18          14.52          0.66



Here dini and dnew are the average vehicular delays using the initial
and the new membership functions, respectively, D=dini−dnew is the
difference of individual observations
Membership functions zero, a few, medium and many of APP before (dotted
line) and after (solid line) the learning at a traffic volume of 500 vehicles per
hour. The location of the first traffic detector is 50 m from the stop line.
Horizontal axis: number of approaching vehicles. Vertical axis: value of
membership function.
 Fig Membership functions a few, medium and too long of QUE before
  (dotted line) and after (solid line) the learning at a traffic volume of
  500 vehicles per hour. The location of the first traffic detector is 50 m
  from the stop line. Horizontal axis: number of queuing vehicles.
  Vertical axis: value of membership function.
 Fig. membership function zero, short , medium and long
  of EXT before (dotted line) and after (solid line) the learning at a
  traffic volume of 500 vehicles per hour. The location of the first
  traffic detector is 50 m from the stop line. Horizontal axis: green
  signal extension in seconds. Vertical axis: value of membership
  function.

Más contenido relacionado

La actualidad más candente

La actualidad más candente (12)

Autonomous system to control a mobile robot
Autonomous system to control a mobile robotAutonomous system to control a mobile robot
Autonomous system to control a mobile robot
 
Fuzzy Logic Model for Traffic Congestion
Fuzzy Logic Model for Traffic CongestionFuzzy Logic Model for Traffic Congestion
Fuzzy Logic Model for Traffic Congestion
 
EEG-Based Drowsiness Estimation for Safety Driving
EEG-Based Drowsiness Estimation for Safety DrivingEEG-Based Drowsiness Estimation for Safety Driving
EEG-Based Drowsiness Estimation for Safety Driving
 
IRJET-Fuzzy Logic Based Path Navigation for Robot using Matlab
IRJET-Fuzzy Logic Based Path Navigation for Robot using MatlabIRJET-Fuzzy Logic Based Path Navigation for Robot using Matlab
IRJET-Fuzzy Logic Based Path Navigation for Robot using Matlab
 
Mobile robot controller using novel hybrid system
Mobile robot controller using novel hybrid system  Mobile robot controller using novel hybrid system
Mobile robot controller using novel hybrid system
 
Fuzzy Logic-Genetic Algorithm-Neural Network for Mobile Robot Navigation: A S...
Fuzzy Logic-Genetic Algorithm-Neural Network for Mobile Robot Navigation: A S...Fuzzy Logic-Genetic Algorithm-Neural Network for Mobile Robot Navigation: A S...
Fuzzy Logic-Genetic Algorithm-Neural Network for Mobile Robot Navigation: A S...
 
Artificial neural networks
Artificial neural networksArtificial neural networks
Artificial neural networks
 
NEW METHOD FOR TRAFFIC DENSITY ESTIMATION
NEW METHOD FOR TRAFFIC DENSITY ESTIMATIONNEW METHOD FOR TRAFFIC DENSITY ESTIMATION
NEW METHOD FOR TRAFFIC DENSITY ESTIMATION
 
Automatic Train Control System using Wireless Sensor Networks
Automatic Train Control System using Wireless Sensor NetworksAutomatic Train Control System using Wireless Sensor Networks
Automatic Train Control System using Wireless Sensor Networks
 
Design and Development of Intelligent Navigation Control Systems for Autonomo...
Design and Development of Intelligent Navigation Control Systems for Autonomo...Design and Development of Intelligent Navigation Control Systems for Autonomo...
Design and Development of Intelligent Navigation Control Systems for Autonomo...
 
Motion Control of Mobile Robots using Fuzzy Controller
Motion Control of Mobile Robots using Fuzzy ControllerMotion Control of Mobile Robots using Fuzzy Controller
Motion Control of Mobile Robots using Fuzzy Controller
 
Identification and Control of Three-Links Electrically Driven Robot Arm Using...
Identification and Control of Three-Links Electrically Driven Robot Arm Using...Identification and Control of Three-Links Electrically Driven Robot Arm Using...
Identification and Control of Three-Links Electrically Driven Robot Arm Using...
 

Destacado

A neuro fuzzy decision support system
A neuro fuzzy decision support systemA neuro fuzzy decision support system
A neuro fuzzy decision support system
R A Akerkar
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
osmancikk
 

Destacado (9)

Expert system neural fuzzy system
Expert system neural fuzzy systemExpert system neural fuzzy system
Expert system neural fuzzy system
 
Fuzzy logic
Fuzzy logicFuzzy logic
Fuzzy logic
 
A neuro fuzzy decision support system
A neuro fuzzy decision support systemA neuro fuzzy decision support system
A neuro fuzzy decision support system
 
Neural network for real time traffic signal control
Neural network for real time traffic signal controlNeural network for real time traffic signal control
Neural network for real time traffic signal control
 
Density based Traffic Light Controller
Density based Traffic Light ControllerDensity based Traffic Light Controller
Density based Traffic Light Controller
 
Fuzzy Logic ppt
Fuzzy Logic pptFuzzy Logic ppt
Fuzzy Logic ppt
 
Bus tracking application project report
Bus tracking application project reportBus tracking application project report
Bus tracking application project report
 
Expert Systems
Expert SystemsExpert Systems
Expert Systems
 
Bus tracking application in Android
Bus tracking application in AndroidBus tracking application in Android
Bus tracking application in Android
 

Similar a Reinforcement learning in neuro fuzzy traffic signal control

Simulation of Wireless Sensor Networks
Simulation of Wireless Sensor NetworksSimulation of Wireless Sensor Networks
Simulation of Wireless Sensor Networks
Daniel Zuniga
 
Artificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logicArtificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logic
Krishna Reddy
 

Similar a Reinforcement learning in neuro fuzzy traffic signal control (20)

An ann based call handoff management scheme for mobile cellular network
An ann based call handoff management scheme for mobile cellular networkAn ann based call handoff management scheme for mobile cellular network
An ann based call handoff management scheme for mobile cellular network
 
Nuzzer algorithm based Human Tracking and Security System for Device-Free Pas...
Nuzzer algorithm based Human Tracking and Security System for Device-Free Pas...Nuzzer algorithm based Human Tracking and Security System for Device-Free Pas...
Nuzzer algorithm based Human Tracking and Security System for Device-Free Pas...
 
Simulation of Wireless Sensor Networks
Simulation of Wireless Sensor NetworksSimulation of Wireless Sensor Networks
Simulation of Wireless Sensor Networks
 
File 2
File 2File 2
File 2
 
Surface Electromyography (SEMG) Based Fuzzy Logic Controller for Footballer b...
Surface Electromyography (SEMG) Based Fuzzy Logic Controller for Footballer b...Surface Electromyography (SEMG) Based Fuzzy Logic Controller for Footballer b...
Surface Electromyography (SEMG) Based Fuzzy Logic Controller for Footballer b...
 
Comparison of fx lms and n fxlms algorithms in matlab using active vibration ...
Comparison of fx lms and n fxlms algorithms in matlab using active vibration ...Comparison of fx lms and n fxlms algorithms in matlab using active vibration ...
Comparison of fx lms and n fxlms algorithms in matlab using active vibration ...
 
Design of Mobile Robot Navigation system using SLAM and Adaptive Tracking Con...
Design of Mobile Robot Navigation system using SLAM and Adaptive Tracking Con...Design of Mobile Robot Navigation system using SLAM and Adaptive Tracking Con...
Design of Mobile Robot Navigation system using SLAM and Adaptive Tracking Con...
 
K017655963
K017655963K017655963
K017655963
 
Artificial neural networks
Artificial neural networksArtificial neural networks
Artificial neural networks
 
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
Classical Discrete-Time Fourier TransformBased Channel Estimation for MIMO-OF...
 
Random Valued Impulse Noise Elimination using Neural Filter
Random Valued Impulse Noise Elimination using Neural FilterRandom Valued Impulse Noise Elimination using Neural Filter
Random Valued Impulse Noise Elimination using Neural Filter
 
Neural Network Implementation Control Mobile Robot
Neural Network Implementation Control Mobile RobotNeural Network Implementation Control Mobile Robot
Neural Network Implementation Control Mobile Robot
 
Ab35157161
Ab35157161Ab35157161
Ab35157161
 
Ab35157161
Ab35157161Ab35157161
Ab35157161
 
Artificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logicArtificial Neural Networks fuzzy logic
Artificial Neural Networks fuzzy logic
 
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
Adaptive Digital Filter Design for Linear Noise Cancellation Using Neural Net...
 
D017632228
D017632228D017632228
D017632228
 
The Joy of SLAM
The Joy of SLAMThe Joy of SLAM
The Joy of SLAM
 
Frequency offset estimation with fast acquisition in OFDM system assignment h...
Frequency offset estimation with fast acquisition in OFDM system assignment h...Frequency offset estimation with fast acquisition in OFDM system assignment h...
Frequency offset estimation with fast acquisition in OFDM system assignment h...
 
ML_Unit_2_Part_A
ML_Unit_2_Part_AML_Unit_2_Part_A
ML_Unit_2_Part_A
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Último (20)

Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

Reinforcement learning in neuro fuzzy traffic signal control

  • 1. By:- Abhishek Vishnoi(112501) NITTTR Chandhigarh
  • 2. INTRODUCTION  A fuzzy traffic signal controller uses simple “if–then” rules which involve linguistic concepts such as medium or long, presented as membership functions.  In neuro-fuzzy traffic signal control, a neural network adjusts the fuzzy controller by fine-tuning the form and location of the membership functions.  The learning algorithm of the neural network is reinforcement learning, which gives credit for successful system behaviour and punishes for poor behaviour;
  • 3. Basics of Neuro-Fuzzy  A combination of a neural network and a fuzzy system is called a neurofuzzy system. In neurofuzzy control, the parameters of the fuzzy controller are adjusted using a neural network.  Neurofuzzy systems utilize both the linguistic, human-like reasoning of fuzzy systems and the powerful computing ability of neural networks.  They can avoid some of the drawbacks of solely fuzzy or neural systems
  • 4. Fuzzy system  Fuzzy logic were brought to public attention by Zadeh.  Fuzzy sets provide a mathematical interpretation for natural language terms.  A fuzzy set is a set without a crisp, clearly defined boundary.  It can contain elements with a partial membership function  Fuzzy control uses a rule base, where the rules are propositions of the form “if X is S, then Y is T”. Here X and Y are linguistic variables,
  • 5.  Membership function (MF) is a curve that defines how each point in the input space is mapped to a membership value (or degree of membership) between 0 and 1.  Each linguistic terms is associated with a fuzzy set defined by a corresponding membership function.
  • 6. Fuzzy traffic signal control  the controller receives measurements of incoming traffic and chooses the length of the green signal accordingly.  Advantage of fuzzy control systems over traditional ones:-  Their ability to use expert knowledge in the form of fuzzy rule.  Small number of parameters needed.
  • 7. Fuzzy traffic signal control • The traffic simulation environment used in this work is a two- phase controlled intersection of two-lane streets.  In each approaching lane there are two traffic detectors, the first one before the stop line and the other at the stop line.  These detectors send input measurements of traffic to the fuzzy controller:  APP, number of approaching vehicles in the green direction and QUE, number of queuing vehicles in the red direction. Depending on the traffic situation, the green phase can be extended with one or several seconds.  Output of the fuzzy controller is EXT, green time extension (in seconds).  The linguistic values of APP are zero, a few, medium and many; QUE are a few, medium and too long; EXT are zero, short, medium and long.
  • 8. Fuzzy rule  The rule base consists of five rule sets. The choice of the rule set depends on how many green extensions have already been given.  The objective of the rules is to split the green time and find the right moment of green termination so that the delay of vehicles is minimized After minimum green (5 seconds)  if APP is zero, then EXT is zero  if APP is a few and if QUE is less than medium, then EXT is short  if APP is more than a few, then EXT is medium  if APP is medium, then EXT is long
  • 9. After the first extension  if APP is zero, then EXT is zero  if APP is a few and if QUE is less than medium, then EXT is short  if APP is medium, then EXT is medium  if APP is many, then EXT is long After the second extension  if APP is zero, then EXT is zero  if APP is a few and if QUE is less than medium, then EXT is short  if APP is medium and if QUE is less than medium, then EXT is medium  if APP is many and if QUE is less than medium, then EXT is long
  • 10. After the third extension  if APP is zero, then EXT is zero  if QUE is too long, then EXT is zero  if APP is more than a few and if QUE is less than medium, then EXT is short  if APP is medium and if QUE is less than medium, then EXT is medium  if APP is many and if QUE is less than a few, then EXT is long After the fourth extension  if APP is zero, then EXT is zero  if QUE is too long, then EXT is zero  if APP is more than a few and if QUE is a few, then EXT is short  if APP is medium and if QUE is less than a few, then EXT is medium  if APP is many and if QUE is less than a few, then EXT is long
  • 11.
  • 12. Reinforcement learning Why is reinforcement learning needed?  The parameters of the fuzzy controller could be updated using the back propagation algorithm common in supervised learning in neural networks.  Backpropagation algorithm cannot be used; instead, a learning algorithm called reinforcement learning is used.  In reinforcement learning, the system evaluates whether the previous control action was good or not. If the action had good consequences, the tendency to produce that action is strengthened, that is, reinforced.
  • 13. Structure of the neurofuzzy control system  The evaluation network gathers information about the decisions of the fuzzy controller and the delays of the vehicles.  This reinforcement information is used in fine-tuning the membership functions of the fuzzy controller, which is also presented as a neural network.  Thus there are actually two neural networks in the system: an evaluation network and a fuzzy controller network.
  • 14. Evaluation network  The evaluation network evaluates the goodness of the actions of the fuzzy controller based on information it has gathered by observing the process.  The network fine-tunes the membership functions of the fuzzy controller by updating the parameters of the membership functions.  It is a feedforward, multilayer perceptron-type network.  The input variables of the network are APP and QUE, measurements of incoming traffic in green and red directions, respectively. .
  • 15.  The hidden layer activation function is a sigmoidal function zj(xj)=1/(1+exp(−xj)),  where The size h of the hidden layer may vary, and there are no precise rules for determining how many cells it should contain. Increasing the size gives a more powerful and flexible network but requires a longer learning time. In our work the size of h=10 was found suitable
  • 16.  The network output v is a measure of the goodness of the state of the network, a prediction of future reinforcement. h v = b1 APP + b2 QUE + cjzj j 1  The gradient descent algorithm is used in the learning phase of the evaluation network.  If a positive reinforcement signal is received, the network weights are rewarded by being changed in the direction which increases their contribution to the total sum.  If a negative signal is received, the weights are punished by being changed in the direction which decreases their contribution.
  • 17. Fuzzy controller network  Consider the first rule set, whose neural network presentation in figure.
  • 18. Experimental results Fig-Average vehicular delays (in Fig-Average vehicular delays (in seconds) before (dashed line) and after seconds) before (dashed line) and (solid line) the learning at traffic after (solid line) the learning at traffic volumes of 300, 500 and 1000 vehicles volumes of 300, 500 and 1000 vehicles per hour. The location of the first traffic per hour. The location of the first detector is 50 m from the stop line. traffic detector is 100 m from the stop line.
  • 19. Volume (veh/h) Det. (M) dini dnew D 500 50 9.99 9.42 0.57 1000 50 14.78 13.96 0.81 500 100 9.11 8.82 0.81 1000 100 15.18 14.52 0.66 Here dini and dnew are the average vehicular delays using the initial and the new membership functions, respectively, D=dini−dnew is the difference of individual observations
  • 20. Membership functions zero, a few, medium and many of APP before (dotted line) and after (solid line) the learning at a traffic volume of 500 vehicles per hour. The location of the first traffic detector is 50 m from the stop line. Horizontal axis: number of approaching vehicles. Vertical axis: value of membership function.
  • 21.  Fig Membership functions a few, medium and too long of QUE before (dotted line) and after (solid line) the learning at a traffic volume of 500 vehicles per hour. The location of the first traffic detector is 50 m from the stop line. Horizontal axis: number of queuing vehicles. Vertical axis: value of membership function.
  • 22.  Fig. membership function zero, short , medium and long of EXT before (dotted line) and after (solid line) the learning at a traffic volume of 500 vehicles per hour. The location of the first traffic detector is 50 m from the stop line. Horizontal axis: green signal extension in seconds. Vertical axis: value of membership function.