SlideShare una empresa de Scribd logo
1 de 28
Intelligent
Agents
Russell & Norvig
Chapter 2
Today’s class
• What’s an agent?
– Definition of an agent
– Rationality and autonomy
– Types of agents
– Properties of environments
• Lisp
How do you design an intelligent agent?
• An intelligent agent perceives its environment via sensors
and acts upon that environment with its effectors.
• A discrete agent receives percepts one at a time, and maps
this percept sequence to a sequence of discrete actions.
• Properties
–Autonomous
–Reactive to the environment
–Pro-active (goal-directed)
–Interacts with other agents
via the environment
What do you mean,
sensors/percepts and effectors/actions?
• Humans
– Sensors: Eyes (vision), ears (hearing), skin (touch), tongue
(gustation), nose (olfaction), neuromuscular system
(proprioception)
– Percepts:
• At the lowest level – electrical signals from these sensors
• After preprocessing – objects in the visual field (location, textures,
colors, …), auditory streams (pitch, loudness, direction), …
– Effectors: limbs, digits, eyes, tongue, …
– Actions: lift a finger, turn left, walk, run, carry an object, …
• The Point: percepts and actions need to be carefully
defined, possibly at different levels of abstraction
A more specific example:
Automated taxi driving system
• Percepts: Video, sonar, speedometer, odometer, engine sensors,
keyboard input, microphone, GPS, …
• Actions: Steer, accelerate, brake, horn, speak/display, …
• Goals: Maintain safety, reach destination, maximize profits (fuel, tire
wear), obey laws, provide passenger comfort, …
• Environment: Streets, freeways, traffic, pedestrians, weather,
customers, …
• Different aspects of driving may require
different types of agent programs!
Rationality
• An ideal rational agent should, for each possible percept
sequence, do whatever actions will maximize its expected
performance measure based on
(1) the percept sequence, and
(2) its built-in and acquired knowledge.
• Rationality includes information gathering, not “rational
ignorance.” (If you don’t know something, find out!)
• Rationality  Need a performance measure to say how well a
task has been achieved.
• Types of performance measures: false alarm (false positive)
and false dismissal (false negative) rates, speed, resources
required, effect on environment, etc.
Autonomy
• A system is autonomous to the extent that its own
behavior is determined by its own experience.
• Therefore, a system is not autonomous if it is
guided by its designer according to a priori
decisions.
• To survive, agents must have:
–Enough built-in knowledge to survive.
–The ability to learn.
Some agent types
• (0) Table-driven agents
– use a percept sequence/action table in memory to find the next action. They
are implemented by a (large) lookup table.
• (1) Simple reflex agents
– are based on condition-action rules, implemented with an appropriate
production system. They are stateless devices which do not have memory of
past world states.
• (2) Agents with memory
– have internal state, which is used to keep track of past states of the world.
• (3) Agents with goals
– are agents that, in addition to state information, have goal information that
describes desirable situations. Agents of this kind take future events into
consideration.
• (4) Utility-based agents
– base their decisions on classic axiomatic utility theory in order to act
rationally.
(0/1) Table-driven/reflex agent
architecture
(0) Table-driven agents
• Table lookup of percept-action pairs mapping from every
possible perceived state to the optimal action for that state
• Problems
– Too big to generate and to store (Chess has about 10120
states, for example)
– No knowledge of non-perceptual parts of the current
state
– Not adaptive to changes in the environment; requires
entire table to be updated if changes occur
– Looping: Can’t make actions conditional on previous
actions/states
(1) Simple reflex agents
• Rule-based reasoning to map from percepts to optimal
action; each rule handles a collection of perceived states
• Problems
– Still usually too big to generate and to store
– Still no knowledge of non-perceptual parts of state
– Still not adaptive to changes in the environment; requires
collection of rules to be updated if changes occur
– Still can’t make actions conditional on previous state
(2) Architecture for an agent with
memory
(2) Agents with memory
• Encode “internal state” of the world to remember the past as
contained in earlier percepts.
• Needed because sensors do not usually give the entire state
of the world at each input, so perception of the environment
is captured over time. “State” is used to encode different
"world states" that generate the same immediate percept.
• Requires ability to represent change in the world; one
possibility is to represent just the latest state, but then can’t
reason about hypothetical courses of action.
• Example: Rodney Brooks’s Subsumption Architecture.
(2) An example:
Brooks’s Subsumption Architecture
• Main idea: build complex, intelligent robots by
decomposing behaviors into a hierarchy of skills, each
completely defining a complete percept-action cycle for one
very specific task.
• Examples: avoiding contact, wandering, exploring,
recognizing doorways, etc.
• Each behavior is modeled by a finite-state machine with a
few states (though each state may correspond to a complex
function or module).
• Behaviors are loosely coupled, asynchronous interactions.
(3) Architecture for goal-based agent
(3) Goal-based agents
• Choose actions so as to achieve a (given or computed) goal.
• A goal is a description of a desirable situation.
• Keeping track of the current state is often not enough 
need to add goals to decide which situations are good
• Deliberative instead of reactive.
• May have to consider long sequences of possible actions
before deciding if goal is achieved – involves consideration
of the future, “what will happen if I do...?”
(4) Architecture for a complete
utility-based agent
(4) Utility-based agents
• When there are multiple possible alternatives, how to decide
which one is best?
• A goal specifies a crude distinction between a happy and
unhappy state, but often need a more general performance
measure that describes “degree of happiness.”
• Utility function U: State  Reals indicating a measure of
success or happiness when at a given state.
• Allows decisions comparing choice between conflicting
goals, and choice between likelihood of success and
importance of goal (if achievement is uncertain).
Properties of Environments
• Fully observable/Partially observable.
– If an agent’s sensors give it access to the complete state of the
environment needed to choose an action, the environment is fully
observable.
– Such environments are convenient, since the agent is freed from the
task of keeping track of the changes in the environment.
• Deterministic/Stochastic.
– An environment is deterministic if the next state of the environment is
completely determined by the current state of the environment and the
action of the agent; in a stochastic environment, there are multiple,
unpredictable outcomes
– In a fully observable, deterministic environment, the agent need not
deal with uncertainty.
Properties of Environments II
• Episodic/Sequential.
– An episodic environment means that subsequent episodes do not depend
on what actions occurred in previous episodes.
– In a sequential environment, the agent engages in a series of connected
episodes.
– Such environments do not require the agent to plan ahead.
• Static/Dynamic.
– A static environment does not change while the agent is thinking.
– The passage of time as an agent deliberates is irrelevant.
– The agent doesn’t need to observe the world during deliberation.
Properties of Environments III
• Discrete/Continuous.
– If the number of distinct percepts and actions is limited, the
environment is discrete, otherwise it is continuous.
• Single agent/Multi-agent.
– If the environment contains other intelligent agents, the agent needs
to be concerned about strategic, game-theoretic aspects of the
environment (for either cooperative or competitive agents)
– Most engineering environments don’t have multi-agent properties,
whereas most social and economic systems get their complexity
from the interactions of (more or less) rational agents.
Characteristics of environments
Fully
observable?
Deterministic Episodic Static Discrete? Single
agent?
Solitaire
Backgammon
Driving
Internet
shopping
Medical
diagnosis
Characteristics of environments
Fully
observable?
Deterministic Episodic Static Discrete? Single
agent?
Solitaire No Yes Yes Yes Yes Yes
Backgammon
Driving
Internet
shopping
Medical
diagnosis
Characteristics of environments
Fully
observable?
Deterministic? Episodic? Static? Discrete? Single
agent?
Solitaire No Yes Yes Yes Yes Yes
Backgammon Yes No No Yes Yes No
Driving
Internet
shopping
Medical
diagnosis
Characteristics of environments
Fully
observable?
Deterministic? Episodic? Static? Discrete? Single
agent?
Solitaire No Yes Yes Yes Yes Yes
Backgammon Yes No No Yes Yes No
Driving No No No No No No
Internet
shopping
Medical
diagnosis
Characteristics of environments
Fully
observable?
Deterministic? Episodic? Static? Discrete? Single
agent?
Solitaire No Yes Yes Yes Yes Yes
Backgammon Yes No No Yes Yes No
Driving No No No No No No
Internet
shopping
No No No No Yes No
Medical
diagnosis
Characteristics of environments
Fully
observable?
Deterministic? Episodic? Static? Discrete? Single
agent?
Solitaire No Yes Yes Yes Yes Yes
Backgammon Yes No No Yes Yes No
Driving No No No No No No
Internet
shopping
No No No No Yes No
Medical
diagnosis
No No No No No Yes
→ Lots of real-world domains fall into the hardest case!
Summary
• An agent perceives and acts in an environment, has an
architecture, and is implemented by an agent program.
• An ideal agent always chooses the action which maximizes its
expected performance, given its percept sequence so far.
• An autonomous agent uses its own experience rather than built-in
knowledge of the environment by the designer.
• An agent program maps from percept to action and updates its
internal state.
– Reflex agents respond immediately to percepts.
– Goal-based agents act in order to achieve their goal(s).
– Utility-based agents maximize their own utility function.
• Representing knowledge is important for successful agent design.
• The most challenging environments are partially observable,
stochastic, sequential, dynamic, and continuous, and contain
multiple intelligent agents.

Más contenido relacionado

Similar a IntelligentAgents.ppt

Similar a IntelligentAgents.ppt (20)

1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx
 
Artificial Intelligence and Machine Learning.pptx
Artificial Intelligence and Machine Learning.pptxArtificial Intelligence and Machine Learning.pptx
Artificial Intelligence and Machine Learning.pptx
 
Intelligent (Knowledge Based) agent in Artificial Intelligence
Intelligent (Knowledge Based) agent in Artificial IntelligenceIntelligent (Knowledge Based) agent in Artificial Intelligence
Intelligent (Knowledge Based) agent in Artificial Intelligence
 
Lecture 4 (1).pptx
Lecture 4 (1).pptxLecture 4 (1).pptx
Lecture 4 (1).pptx
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Intelligence Agent - Artificial Intelligent (AI)
Intelligence Agent - Artificial Intelligent (AI)Intelligence Agent - Artificial Intelligent (AI)
Intelligence Agent - Artificial Intelligent (AI)
 
Types of Artificial Intelligence.ppt
Types of Artificial Intelligence.pptTypes of Artificial Intelligence.ppt
Types of Artificial Intelligence.ppt
 
AI Lesson 02
AI Lesson 02AI Lesson 02
AI Lesson 02
 
AI Basic.pptx
AI Basic.pptxAI Basic.pptx
AI Basic.pptx
 
AI week 2.pdf
AI week 2.pdfAI week 2.pdf
AI week 2.pdf
 
artificial intelligence document final.pptx
artificial intelligence document final.pptxartificial intelligence document final.pptx
artificial intelligence document final.pptx
 
Agents1
Agents1Agents1
Agents1
 
aiNew.pptx
aiNew.pptxaiNew.pptx
aiNew.pptx
 
Week 3.pdf
Week 3.pdfWeek 3.pdf
Week 3.pdf
 
Unit-1.pptx
Unit-1.pptxUnit-1.pptx
Unit-1.pptx
 
W2_Lec03_Lec04_Agents.pptx
W2_Lec03_Lec04_Agents.pptxW2_Lec03_Lec04_Agents.pptx
W2_Lec03_Lec04_Agents.pptx
 
A.i lecture 04
A.i lecture 04A.i lecture 04
A.i lecture 04
 
Intelligent Agents
Intelligent AgentsIntelligent Agents
Intelligent Agents
 
Lecture 2 agent and environment
Lecture 2   agent and environmentLecture 2   agent and environment
Lecture 2 agent and environment
 
CS4700-Agents_v3.pptx
CS4700-Agents_v3.pptxCS4700-Agents_v3.pptx
CS4700-Agents_v3.pptx
 

Último

Último (20)

Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 

IntelligentAgents.ppt

  • 2. Today’s class • What’s an agent? – Definition of an agent – Rationality and autonomy – Types of agents – Properties of environments • Lisp
  • 3. How do you design an intelligent agent? • An intelligent agent perceives its environment via sensors and acts upon that environment with its effectors. • A discrete agent receives percepts one at a time, and maps this percept sequence to a sequence of discrete actions. • Properties –Autonomous –Reactive to the environment –Pro-active (goal-directed) –Interacts with other agents via the environment
  • 4. What do you mean, sensors/percepts and effectors/actions? • Humans – Sensors: Eyes (vision), ears (hearing), skin (touch), tongue (gustation), nose (olfaction), neuromuscular system (proprioception) – Percepts: • At the lowest level – electrical signals from these sensors • After preprocessing – objects in the visual field (location, textures, colors, …), auditory streams (pitch, loudness, direction), … – Effectors: limbs, digits, eyes, tongue, … – Actions: lift a finger, turn left, walk, run, carry an object, … • The Point: percepts and actions need to be carefully defined, possibly at different levels of abstraction
  • 5. A more specific example: Automated taxi driving system • Percepts: Video, sonar, speedometer, odometer, engine sensors, keyboard input, microphone, GPS, … • Actions: Steer, accelerate, brake, horn, speak/display, … • Goals: Maintain safety, reach destination, maximize profits (fuel, tire wear), obey laws, provide passenger comfort, … • Environment: Streets, freeways, traffic, pedestrians, weather, customers, … • Different aspects of driving may require different types of agent programs!
  • 6. Rationality • An ideal rational agent should, for each possible percept sequence, do whatever actions will maximize its expected performance measure based on (1) the percept sequence, and (2) its built-in and acquired knowledge. • Rationality includes information gathering, not “rational ignorance.” (If you don’t know something, find out!) • Rationality  Need a performance measure to say how well a task has been achieved. • Types of performance measures: false alarm (false positive) and false dismissal (false negative) rates, speed, resources required, effect on environment, etc.
  • 7. Autonomy • A system is autonomous to the extent that its own behavior is determined by its own experience. • Therefore, a system is not autonomous if it is guided by its designer according to a priori decisions. • To survive, agents must have: –Enough built-in knowledge to survive. –The ability to learn.
  • 8. Some agent types • (0) Table-driven agents – use a percept sequence/action table in memory to find the next action. They are implemented by a (large) lookup table. • (1) Simple reflex agents – are based on condition-action rules, implemented with an appropriate production system. They are stateless devices which do not have memory of past world states. • (2) Agents with memory – have internal state, which is used to keep track of past states of the world. • (3) Agents with goals – are agents that, in addition to state information, have goal information that describes desirable situations. Agents of this kind take future events into consideration. • (4) Utility-based agents – base their decisions on classic axiomatic utility theory in order to act rationally.
  • 10. (0) Table-driven agents • Table lookup of percept-action pairs mapping from every possible perceived state to the optimal action for that state • Problems – Too big to generate and to store (Chess has about 10120 states, for example) – No knowledge of non-perceptual parts of the current state – Not adaptive to changes in the environment; requires entire table to be updated if changes occur – Looping: Can’t make actions conditional on previous actions/states
  • 11. (1) Simple reflex agents • Rule-based reasoning to map from percepts to optimal action; each rule handles a collection of perceived states • Problems – Still usually too big to generate and to store – Still no knowledge of non-perceptual parts of state – Still not adaptive to changes in the environment; requires collection of rules to be updated if changes occur – Still can’t make actions conditional on previous state
  • 12. (2) Architecture for an agent with memory
  • 13. (2) Agents with memory • Encode “internal state” of the world to remember the past as contained in earlier percepts. • Needed because sensors do not usually give the entire state of the world at each input, so perception of the environment is captured over time. “State” is used to encode different "world states" that generate the same immediate percept. • Requires ability to represent change in the world; one possibility is to represent just the latest state, but then can’t reason about hypothetical courses of action. • Example: Rodney Brooks’s Subsumption Architecture.
  • 14. (2) An example: Brooks’s Subsumption Architecture • Main idea: build complex, intelligent robots by decomposing behaviors into a hierarchy of skills, each completely defining a complete percept-action cycle for one very specific task. • Examples: avoiding contact, wandering, exploring, recognizing doorways, etc. • Each behavior is modeled by a finite-state machine with a few states (though each state may correspond to a complex function or module). • Behaviors are loosely coupled, asynchronous interactions.
  • 15. (3) Architecture for goal-based agent
  • 16. (3) Goal-based agents • Choose actions so as to achieve a (given or computed) goal. • A goal is a description of a desirable situation. • Keeping track of the current state is often not enough  need to add goals to decide which situations are good • Deliberative instead of reactive. • May have to consider long sequences of possible actions before deciding if goal is achieved – involves consideration of the future, “what will happen if I do...?”
  • 17. (4) Architecture for a complete utility-based agent
  • 18. (4) Utility-based agents • When there are multiple possible alternatives, how to decide which one is best? • A goal specifies a crude distinction between a happy and unhappy state, but often need a more general performance measure that describes “degree of happiness.” • Utility function U: State  Reals indicating a measure of success or happiness when at a given state. • Allows decisions comparing choice between conflicting goals, and choice between likelihood of success and importance of goal (if achievement is uncertain).
  • 19. Properties of Environments • Fully observable/Partially observable. – If an agent’s sensors give it access to the complete state of the environment needed to choose an action, the environment is fully observable. – Such environments are convenient, since the agent is freed from the task of keeping track of the changes in the environment. • Deterministic/Stochastic. – An environment is deterministic if the next state of the environment is completely determined by the current state of the environment and the action of the agent; in a stochastic environment, there are multiple, unpredictable outcomes – In a fully observable, deterministic environment, the agent need not deal with uncertainty.
  • 20. Properties of Environments II • Episodic/Sequential. – An episodic environment means that subsequent episodes do not depend on what actions occurred in previous episodes. – In a sequential environment, the agent engages in a series of connected episodes. – Such environments do not require the agent to plan ahead. • Static/Dynamic. – A static environment does not change while the agent is thinking. – The passage of time as an agent deliberates is irrelevant. – The agent doesn’t need to observe the world during deliberation.
  • 21. Properties of Environments III • Discrete/Continuous. – If the number of distinct percepts and actions is limited, the environment is discrete, otherwise it is continuous. • Single agent/Multi-agent. – If the environment contains other intelligent agents, the agent needs to be concerned about strategic, game-theoretic aspects of the environment (for either cooperative or competitive agents) – Most engineering environments don’t have multi-agent properties, whereas most social and economic systems get their complexity from the interactions of (more or less) rational agents.
  • 22. Characteristics of environments Fully observable? Deterministic Episodic Static Discrete? Single agent? Solitaire Backgammon Driving Internet shopping Medical diagnosis
  • 23. Characteristics of environments Fully observable? Deterministic Episodic Static Discrete? Single agent? Solitaire No Yes Yes Yes Yes Yes Backgammon Driving Internet shopping Medical diagnosis
  • 24. Characteristics of environments Fully observable? Deterministic? Episodic? Static? Discrete? Single agent? Solitaire No Yes Yes Yes Yes Yes Backgammon Yes No No Yes Yes No Driving Internet shopping Medical diagnosis
  • 25. Characteristics of environments Fully observable? Deterministic? Episodic? Static? Discrete? Single agent? Solitaire No Yes Yes Yes Yes Yes Backgammon Yes No No Yes Yes No Driving No No No No No No Internet shopping Medical diagnosis
  • 26. Characteristics of environments Fully observable? Deterministic? Episodic? Static? Discrete? Single agent? Solitaire No Yes Yes Yes Yes Yes Backgammon Yes No No Yes Yes No Driving No No No No No No Internet shopping No No No No Yes No Medical diagnosis
  • 27. Characteristics of environments Fully observable? Deterministic? Episodic? Static? Discrete? Single agent? Solitaire No Yes Yes Yes Yes Yes Backgammon Yes No No Yes Yes No Driving No No No No No No Internet shopping No No No No Yes No Medical diagnosis No No No No No Yes → Lots of real-world domains fall into the hardest case!
  • 28. Summary • An agent perceives and acts in an environment, has an architecture, and is implemented by an agent program. • An ideal agent always chooses the action which maximizes its expected performance, given its percept sequence so far. • An autonomous agent uses its own experience rather than built-in knowledge of the environment by the designer. • An agent program maps from percept to action and updates its internal state. – Reflex agents respond immediately to percepts. – Goal-based agents act in order to achieve their goal(s). – Utility-based agents maximize their own utility function. • Representing knowledge is important for successful agent design. • The most challenging environments are partially observable, stochastic, sequential, dynamic, and continuous, and contain multiple intelligent agents.