A Cognitive Architecture for Physical Agents
The Icarus project focuses on two complementary goals: the AI
community's original dream of creating complete intelligent agents
and Allen Newell's vision of a unified theory of the cognitive
architecture. We are especially concerned with agents that exist
in a physical environment, whether actual or simulated, and thus
must integrate cognition with perception and action. This focus
grew originally out of the World Modelers Project, a joint effort
with Jaime Carbonell and others at CMU during the 1980s to create
a realistic simulated environment to serve as a testbed for
research on physical agents.
The Icarus architecture has gone through a number of different
versions, the most recent developed at DaimlerChrysler Research &
Technology Center between 1998 and 2000, and which, naturally, we
tested in a simulated driving domain. This incarnation of Icarus encodes
knowledge as reactive skills, each of which specifies the goal-relevant
reactions to a class of situations. A skill consists of three elements
stated in terms of logical expressions: a set of objectives, a set of
requirements or preconditions, and a set of alternate means for
accomplishing the objective under those conditions. Each objective,
requirement, or means can refer to primitive actions/sensors or to
other Icarus skills, thus imposing a hierarchical organization on
long-term memory. Each skill also has an associated utility cast as a
linear function of sensory attributes.
The basic Icarus interpreter operates on a recognize-act cycle but,
unlike many architectures, focuses on reactive execution of existing
skills rather than on problem-space search. Given a top-level skill
to pursue, on each cycle the system first checks the objective field
for that skill. If the objectives are true, nothing further needs to
be done, but, if not, the interpreter examines the requirements to
determine if the preconditions for action are met. If not, Icarus
invokes a subskill associated with the failed requirement in an effort
to satisfy it; otherwise, it selects one of the alternate means and
calls on the primitive action or subskill associated with it. The
architecture selects the alternative with the highest expected utility
as predicted by the linear function associated with each skill.
We have recently extended the Icarus architecture to support explicit
short-term memories and modules for conceptual inference, means-ends
problem solving, and cumulative learning of skills. The new version
relies on five distinctive assumptions:
Primacy of action and perception over cognition;
Separation of categories from skills;
Hierarchical structure of long-term memory;
Correspondence between long-term and short-term structures; and
Value-driven nature of agent behavior.
Our recent papers explain these assumptions in more detail. Our
research plans include incorporating mechanisms for forward chaining,
model-based learning from delayed reward, storing/retrieving episodic
traces, and learning from other agents' behaviors. We are also looking
at new domains, such as game playing, in which to evaluate the framework.
This research is funded by DARPA IPTO and the National Science
Foundation. Support for earlier work came from the Air Force Office of
Scientific Research, NASA Ames Research Center, and DaimlerChrysler
Research and Technology.
Current Contributors to the Project
Nima Asgharbeygi
Dongkyu Choi
Tom Fawcett
Tolga Konik
Professor Pat Langley
Negin Nejati
Chunki Park
David Stracuzzi
Past Contributors to the Project
Dr. Sachiyo Arai
Meg Aycinena
Kirstin Cummings
Dr. Ryutaro Ichise
Dr. Seth Rogers
Dr. Daniel Shapiro
Michael Siliski
Stefan Tang
Jiang Xuan
Related Papers
Langley, P., & Choi, D. (in press).
A unified cognitive architecture for physical agents.
Proceedings of the Twenty-First National Conference on Artificial
Intelligence.
Boston: AAAI Press.
Nejati, N., Langley, P., & Konik, T. (in press).
Learning hierarchical task networks by observation.
Proceedings of the Twenty-Third International Conference on
Machine Learning.
Pittsburgh, PA.
Asgharbeygi, N., Stracuzzi, D., & Langley, P. (in press).
Relational temporal difference learning.
Proceedings of the Twenty-Third International Conference on
Machine Learning.
Pittsburgh, PA.
Langley, P. (in press).
Cognitive architectures and general intelligent systems.
AI Magazine.
Langley, P., & Choi, D. (2006).
Learning recursive control programs from problem solving.
Journal of Machine Learning Research, 7, 493-518.
Langley, P. (2005).
An adaptive architecture for physical agents.
Proceedings of the 2005 IEEE/WIC/ACM International Conference on
Intelligent Agent Technology (pp. 18-25).
Compiegne, France: IEEE Computer Society Press.
Langley, P., Choi, D., & Rogers, S. (2005).
Interleaving learning, problem solving, and execution in the
Icarus architecture
(Technical Report). Computational Learning Laboratory, CSLI, Stanford
University, CA.
Choi, D., & Langley, P. (2005).
Learning teleoreactive logic programs from problem solving.
Proceedings of the Fifteenth International Conference on
Inductive Logic Programming
(pp. 51-68). Bonn, Germany: Springer.
Asgharbeygi, N., Nejati, N., Langley, P., & Arai, S. (2005).
Guiding inference through relational reinforcement learning.
Proceedings of the Fifteenth International Conference on
Inductive Logic Programming
(pp. 20-37). Bonn, Germany: Springer.
Langley, P., & Rogers, S. (2005).
An extended theory of human problem solving.
Proceedings of the Twenty-seventh Annual Meeting of the Cognitive
Science Society. Stresa, Italy.
Langley, P., & Rogers, S. (2004).
Cumulative learning of hierarchical skills.
Proceedings of the Third International Conference on Development
and Learning. San Diego, CA: IEEE Press.
Langley, P. (2004).
Cognitive architectures and the construction of intelligent agents.
Proceedings of the AAAI-2004 Workshop on Intelligent Agent
Architectures (pp. 82). Stanford, CA.
Langley, P., Arai, S., & Shapiro, D. (2004).
Model-based learning with hierarchical relational skills.
Proceedings of the ICML-2004 Workshop on Relational Reinforcement
Learning. Banff, Alberta.
Langley, P., Cummings, K., & Shapiro, D. (2004).
Hierarchical skills and cognitive architectures.
Proceedings of the Twenty-Sixth Annual Conference of the Cognitive
Science Society (pp. 779-784). Chicago, IL.
Choi, D., Kaufman, M., Langley, P., Nejati, N., & Shapiro, D. (2004).
An architecture for persistent reactive behavior.
Proceedings of the Third International Joint Conference on
Autonomous Agents and Multi Agent Systems (pp. 988-995).
New York: ACM Press.
Langley, P., Choi, D., & Shapiro, D. (2004).
A cognitive architecture for physical agents
(Technical Report). Computational Learning Laboratory, CSLI,
Stanford University, CA.
Ichise, R., Shapiro, D., & Langley, P. (in press). Structured program
induction from behavioral traces. IEICE Transactions on Information
and Systems (in Japanese).
Langley, P., Shapiro, D., Aycinena, M., & Siliski, M. (2003).
A value-driven architecture for intelligent behavior.
Proceedings of the IJCAI-2003 Workshop on Cognitive Modeling of
Agents and Multi-Agent Interactions (pp. 10-18). Acapulco, Mexico.
Langley, P., & Laird, J. E. (2002).
Cognitive architectures: Research issues and challenges
(Technical Report). Institute for the Study of Learning and Expertise,
Palo Alto, CA.
Ichise, R., Shapiro, D. G., & Langley, P. (2002).
Learning hierarchical skills from observation (pp. 247-258).
Proceedings of the Fifth International Conference on Discovery
Science.
Shapiro, D., & Langley, P. (2002).
Separating skills from preference: Using learning to program by reward.
Proceedings of the Nineteenth International Conference on Machine
Learning (pp. 570-577). Sydney: Morgan Kaufmann.
Shapiro, D., Langley, P., & Shachter, R. (2001).
Using background knowledge to speed reinforcement learning in physical
agents.
Proceedings of the Fifth International Conference on Autonomous
Agents (pp. 254-261). Montreal: ACM Press.
Shapiro, D., & Langley, P. (1999).
Controlling physical agents through reactive logic programming.
Proceedings of the Third International Conference on Autonomous
Agents (pp. 386-387). Seattle: ACM Press.
Langley, P. (1997).
Learning to sense selectively in physical domains.
Proceedings of the First International Conference on Autonomous Agents
(pp. 217-226). Marina del Rey, CA: ACM Press.
For more information, please send email to
dgs@stanford.edu .