Director: Prof. Maja J Matarić Associate Director: Prof. Gaurav S. Sukhatme Founder: Prof. George...
-
Upload
patrick-riley -
Category
Documents
-
view
213 -
download
1
Transcript of Director: Prof. Maja J Matarić Associate Director: Prof. Gaurav S. Sukhatme Founder: Prof. George...
Director: Prof. Maja J MatarićAssociate Director: Prof. Gaurav S. SukhatmeFounder: Prof. George A. Bekey
GOALS
Automate the process of robot controller design :Complexity of the robot’s tasks (sequencing)Robustness and real-time response propertiesModularity of the underlying architectureReusability of controller componentsSupport for complex task learning
[1] Monica N. Nicolescu, Maja J Matarić, "A hierarchical architecture for behavior-based robots", First International Joint Conference on Autonomous Agents and Multi-Agent Systems, July 15-19, 2002
[2] Monica N. Nicolescu, Maja J Matarić, "Learning and Interacting in Human-Robot Domains", Special Issue of IEEE Transactions on Systems, Man, and Cybernetics, Vol. 31, No.5, Pages 419-430, September, 2001.
[3] Monica N. Nicolescu, Maja J Matarić, "Experience-based representation construction: learning from human and robot teachers", IEEE/RSJ International Conference on Intelligent Robots and Systems, Pages 740-745, Oct. 29 – Nov 3, 2001
AN ACTION-BASED FRAMEWORK FOR LEARNING FROM DEMONSTRATION IN
HUMAN-ROBOT DOMAINS
APPROACH
Separate sensing (precondition checking) from actions into abstract/primitive behaviors.
allows for a more general set of activation conditionsEmbed abstract representation of the behavior’s goals
the task specific preconditions are tested via behavior links
Tasks are represented as (hierarchical) behavior networks.
Teaching by experienced demonstrationThe robot performs the task during demonstration and perceives the task through its own sensorsMapping observations to the robot’s own set of actions
A Hierarchical Abstract Behavior-Based ArchitectureRepresentation & execution of complex, sequential, hierarchically structured tasksSequential & opportunistic execution
EXPERIMENTAL VALIDATION
THE ARCHITECTURE
Effects Behi
{1/0}
Abstract/primitive behaviorstructure
Primitive behavior
Perform actions
Abstract behavior
Test world preconditions
Task specific preconditionsif met
Standard behaviorstructure
Test world preconditions
Test task specific preconditions
Perform actions
if met
if met
Effects Beh1…k {1/0}
Learned network:
Primitive Behavior
Abstract Behavior
Network Abstract Behavior
“Expanded” representation
of a NABNetwork link (ordering,
enabling, permanent)
Activation link
Execution: activation spreading + precondition checking
Behavior selection: a behavior is
active iff :
( It is not inhibited ) and
( Its controlled actuators are available) and
( Activation level 0 ) and
( All ordering constraints = TRUE ) and
( All permanent preconditions = TRUE ) and
(( All enabling preconditions = TRUE ) or
( the behavior was active in the previous step ))
Activation level (of a behavior):
The number of successor behaviors in the network that require the achievement of its postconditions
Generic network
THE LEARNING PROCESS
ALGORITHM:
Create network back-bone
The intervals Ik occurred during the demonstration
E
E
t1E t2E
C
C
t1C t1C
A
A
t1A t2A
A
A
t1D t2D
B
t1B t2B
B
A C A E
J Overlaps K
PermanentJ Includes K
J Ends K
No relationJ Starts K
J Equals K
EnablingJ Meets K
OrderingJ Before K
LinkRelationObservationJ K
J K
J K
J K
J K
J K
J K
Generate behavior links
Example network
Teacher-following strategy
The robot has a set of basic skills
Teacher signals moments in time relevant to the task
Mapping observations to the known effects of the robot’s own actions
An object transport task:
REFERENCES
Monica N. Nicolescu and Maja J. Matarić (monica|[email protected]) http:||robotics.usc.edu|
~monica
Behavior A
Behavior B
Behavior A
Behavior B Behavior B
Behavior A
t1A t1A t1At2A t2A t2At1B t1B t1Bt2B t2B t2B
Permanent Enabling Ordering
Postconditions true
Behavior active
Network links types (sequential preconditions):
Behavior set: PickUp &
Drop colored objects, Track
colored targets
Learning in clean/cluttered
environments, from
human/robot teachers
A task with long sequences
A slalom task
An object transport task
A “gate-traversing” task
Teacher demonstration:
Teacher signals