Academic Course: 10 On-line adaptation, learning, evolution

•

1 recomendación•963 vistas

FET AWARE project - Self Awareness in Autonomic Systems

By Gusz Eiben & Mark Hoogendoorn

Tecnología Educación

Designed by Gusz Eiben & Mark Hoogendoorn
Outline
• Population-based Adaptive Systems
• Types of adaptation: evolution, individual
(lifetime) learning, social learning
• Machine learning
• Reinforcement learning
• Off-line vs. on-line adaptation

Designed by Gusz Eiben & Mark Hoogendoorn
Population-based Adaptive Systems
PAS have two essential features
•They consist of a group of basic units that can
perform actions, e.g., computation,
communication, interaction, etc.
•The ability to adapt at
– individual level (modify agent ) and/or
– group level (add/remove agent).

Designed by Gusz Eiben & Mark Hoogendoorn
Types of adaptation
• Evolutionary learning (EL): Changes at population
level (assumed non-Lamarckian)
• Lifetime learning (LL): Changes at agent level
– Individual learning (IL): adaptation autonomously
through a purely internal procedure
– Social learning (SL): adaptation through interaction
/communication

Designed by Gusz Eiben & Mark Hoogendoorn
Taxonomy of adaptation
Adaptation
Evolutionary
Learning
Lifetime
Learning
Individual
Learning
Social
Learning

Designed by Gusz Eiben & Mark Hoogendoorn
Taxonomy of adaptation 2
Adaptation
Evolutionary
Learning
Lifetime
Learning
Individual
Learning
Social
Learning
Learning
Evolution

Designed by Gusz Eiben & Mark Hoogendoorn
Adaptation ≠ operation
• Operation: controller is being used
– Sensory inputs  outputs (motor, comm. device)
– Robot behavior changes, not the controller
• Adaptation: controller is being changed
– Present controller  new controller
– Uses utility/reward/fitness info
– It may require
• One single robot – learning
• More robots – evolution, social learning
• Adaptation + operation = generate + test
• Off-line (initial controller design, before start) vs. on-line (after
start)

Designed by Gusz Eiben & Mark Hoogendoorn
Genotype
Developmental
Engine(decoder)
Genetic operators:
mutation & xover
Learning
operators
Robot
behavior
State of the
environment
Phenotype =
controller
Reward
Fitness
Selection
operators

Designed by Gusz Eiben & Mark Hoogendoorn
Phenotype
Genotype
Developmental
Engine(decoder)
Genetic operators:
mutation & xover
Learning
operators
Robot
behavior
State of the
environment
Reward
Fitness
Selection
operators
controllershape

Designed by Gusz Eiben & Mark Hoogendoorn
Evolutionary loop
Genotype
DevelopmentalEngine
Genetic operators:
mutation & xover
Learning operator(s)
Robot
behavior
Changes in
environment
Controller =
phenotype
Reward
Fitness
Selection
operator(s)

Designed by Gusz Eiben & Mark Hoogendoorn
Learning loop
Genotype
DevelopmentalEngine
Genetic operators:
mutation & xover
Learning operator(s)
Robot
behavior
Changes in
environment
Controller =
phenotype
Reward
Fitness
Selection
operator(s)

Designed by Gusz Eiben & Mark Hoogendoorn
ENVIRONMENTAGENT
Reward r(t)
State s(t)
Action a(t)

Designed by Gusz Eiben & Mark Hoogendoorn
Reinforcement learning
Agent in situation/state st chooses action at
World changes to situation/state st+1
Agent perceives situation st+1 and gets reward rt+1
Telling the agent what to do is its
POLICY πt(s, a) = P r{at = a|st = s}
Given the situation at time t is s, the policy gives the probability the agent’s
action will be a.
For example: πt(s, goforward) = 0.5, πt(s, gobackward) = 0.5.
Reinforcement learning ⇒ Get/ﬁnd/learn the policy

Designed by Gusz Eiben & Mark Hoogendoorn
Further reading
• Evert Haasdijk and A.E. Eiben and Alan F.T.
Winfield, Individual Social and Evolutionary
Adaptation in Collective Systems , Serge
Kernbach (eds.) , Handbook of Collective
Robotics , Pan Stanford , 2011

Más contenido relacionado

Más de FET AWARE project - Self Awareness in Autonomic Systems

Academic Course: 02 Self-organization and emergence in networked systemsFET AWARE project - Self Awareness in Autonomic Systems

Academic Course: 01 Self-awarenesss and Computational Self-awarenessFET AWARE project - Self Awareness in Autonomic Systems

Awareness: Layman Seminar SlidesFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 04 Awareness ApplicationsFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 03 Awareness SimulationFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 02 Awareness PropertiesFET AWARE project - Self Awareness in Autonomic Systems

Industry Training: 01 Awareness OverviewFET AWARE project - Self Awareness in Autonomic Systems

Robot Swarms as Ensembles of Cooperating Components - Matthias HolzlFET AWARE project - Self Awareness in Autonomic Systems

Towards Systematically Engineering Ensembles - Martin WirsingFET AWARE project - Self Awareness in Autonomic Systems

Capturing the Immune System: From the wet-lab to the robot, building better ...FET AWARE project - Self Awareness in Autonomic Systems

Underwater search and rescue in swarm robotics - Mark Read FET AWARE project - Self Awareness in Autonomic Systems

Computational Self-awareness in Smart-Camera Networks - Lukas EsterleFET AWARE project - Self Awareness in Autonomic Systems

Why Robots may need to be self-‐aware, before we can really trust them - Ala...FET AWARE project - Self Awareness in Autonomic Systems

Morphogenetic Engineering: Reconciling Architecture and Self-Organization Thr...FET AWARE project - Self Awareness in Autonomic Systems

Ensemble-oriented programming of self-adaptive systems - Michele LoretiFET AWARE project - Self Awareness in Autonomic Systems

Self-awareness and Adaptive Technologies: the Future of Operating Systems? FET AWARE project - Self Awareness in Autonomic Systems

EnhancingWeb Process Self-Awareness with Context-Aware Service CompositionFET AWARE project - Self Awareness in Autonomic Systems

Testing cooperative autonomous systems for unwanted emergent behaviour and da...FET AWARE project - Self Awareness in Autonomic Systems

Enduring Institutions and Self-Organising Trust-Adaptive Systems for an Open ...FET AWARE project - Self Awareness in Autonomic Systems

SmartContent: A self protecting and context aware active contentFET AWARE project - Self Awareness in Autonomic Systems

Más de FET AWARE project - Self Awareness in Autonomic Systems (20)

Academic Course: 02 Self-organization and emergence in networked systems

Academic Course: 01 Self-awarenesss and Computational Self-awareness

Awareness: Layman Seminar Slides

Industry Training: 04 Awareness Applications

Industry Training: 03 Awareness Simulation

Industry Training: 02 Awareness Properties

Industry Training: 01 Awareness Overview

Robot Swarms as Ensembles of Cooperating Components - Matthias Holzl

Towards Systematically Engineering Ensembles - Martin Wirsing

Capturing the Immune System: From the wet-lab to the robot, building better ...

Underwater search and rescue in swarm robotics - Mark Read

Computational Self-awareness in Smart-Camera Networks - Lukas Esterle

Why Robots may need to be self-‐aware, before we can really trust them - Ala...

Morphogenetic Engineering: Reconciling Architecture and Self-Organization Thr...

Ensemble-oriented programming of self-adaptive systems - Michele Loreti

Self-awareness and Adaptive Technologies: the Future of Operating Systems?

EnhancingWeb Process Self-Awareness with Context-Aware Service Composition

Testing cooperative autonomous systems for unwanted emergent behaviour and da...

Enduring Institutions and Self-Organising Trust-Adaptive Systems for an Open ...

SmartContent: A self protecting and context aware active content

Último

Platformless Horizons for Digital AdaptabilityWSO2

Understanding the FAA Part 107 License ..Christopher Logan Kennedy

MS Copilot expands with MS Graph connectorsNanddeep Nachan

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays

Architecting Cloud Native ApplicationsWSO2

Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz

Exploring Multimodal Embeddings with MilvusZilliz

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Academic Course: 10 On-line adaptation, learning, evolution

1. Designed by Gusz Eiben & Mark Hoogendoorn On-line adaptation, learning, evolution

2. Designed by Gusz Eiben & Mark Hoogendoorn Outline • Population-based Adaptive Systems • Types of adaptation: evolution, individual (lifetime) learning, social learning • Machine learning • Reinforcement learning • Off-line vs. on-line adaptation

3. Designed by Gusz Eiben & Mark Hoogendoorn Population-based Adaptive Systems PAS have two essential features •They consist of a group of basic units that can perform actions, e.g., computation, communication, interaction, etc. •The ability to adapt at – individual level (modify agent ) and/or – group level (add/remove agent).

4. Designed by Gusz Eiben & Mark Hoogendoorn Types of adaptation • Evolutionary learning (EL): Changes at population level (assumed non-Lamarckian) • Lifetime learning (LL): Changes at agent level – Individual learning (IL): adaptation autonomously through a purely internal procedure – Social learning (SL): adaptation through interaction /communication

5. Designed by Gusz Eiben & Mark Hoogendoorn Taxonomy of adaptation Adaptation Evolutionary Learning Lifetime Learning Individual Learning Social Learning

6. Designed by Gusz Eiben & Mark Hoogendoorn Taxonomy of adaptation 2 Adaptation Evolutionary Learning Lifetime Learning Individual Learning Social Learning Learning Evolution

7. Designed by Gusz Eiben & Mark Hoogendoorn Adaptation ≠ operation • Operation: controller is being used – Sensory inputs  outputs (motor, comm. device) – Robot behavior changes, not the controller • Adaptation: controller is being changed – Present controller  new controller – Uses utility/reward/fitness info – It may require • One single robot – learning • More robots – evolution, social learning • Adaptation + operation = generate + test • Off-line (initial controller design, before start) vs. on-line (after start)

8. Designed by Gusz Eiben & Mark Hoogendoorn Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Phenotype = controller Reward Fitness Selection operators

9. Designed by Gusz Eiben & Mark Hoogendoorn Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Phenotype = controller Reward Fitness Selection operators

10. Designed by Gusz Eiben & Mark Hoogendoorn Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Reward Fitness Selection operators Phenotype controllershape

11. Designed by Gusz Eiben & Mark Hoogendoorn Phenotype Genotype Developmental Engine(decoder) Genetic operators: mutation & xover Learning operators Robot behavior State of the environment Reward Fitness Selection operators controllershape

12. Designed by Gusz Eiben & Mark Hoogendoorn Evolutionary loop Genotype DevelopmentalEngine Genetic operators: mutation & xover Learning operator(s) Robot behavior Changes in environment Controller = phenotype Reward Fitness Selection operator(s)

13. Designed by Gusz Eiben & Mark Hoogendoorn Learning loop Genotype DevelopmentalEngine Genetic operators: mutation & xover Learning operator(s) Robot behavior Changes in environment Controller = phenotype Reward Fitness Selection operator(s)

14. Designed by Gusz Eiben & Mark Hoogendoorn ENVIRONMENTAGENT Reward r(t) State s(t) Action a(t)

15. Designed by Gusz Eiben & Mark Hoogendoorn Reinforcement learning Agent in situation/state st chooses action at World changes to situation/state st+1 Agent perceives situation st+1 and gets reward rt+1 Telling the agent what to do is its POLICY πt(s, a) = P r{at = a|st = s} Given the situation at time t is s, the policy gives the probability the agent’s action will be a. For example: πt(s, goforward) = 0.5, πt(s, gobackward) = 0.5. Reinforcement learning ⇒ Get/ﬁnd/learn the policy

16. Designed by Gusz Eiben & Mark Hoogendoorn Further reading • Evert Haasdijk and A.E. Eiben and Alan F.T. Winfield, Individual Social and Evolutionary Adaptation in Collective Systems , Serge Kernbach (eds.) , Handbook of Collective Robotics , Pan Stanford , 2011

Academic Course: 10 On-line adaptation, learning, evolution

Recomendados

Recomendados

Más contenido relacionado

Más de FET AWARE project - Self Awareness in Autonomic Systems

Más de FET AWARE project - Self Awareness in Autonomic Systems (20)

Último

Último (20)

Academic Course: 10 On-line adaptation, learning, evolution