
RSVM visualized as a classification task on pairs: the RSVM heuristic has a positive slope and thus correctly ranks both example pairs.

Choice of loss function:

• Kendall Rank Correlation Coefficient (𝜏): monotonic correlation, i.e., the normalized number of correctly ranked pairs. Only the heuristic's ordering matters in a greedy search. Solve using a Rank Support Vector Machine (RSVM):
  • Equivalent to an SVM on ranking pairs
  • Only penalize examples from the same plan
  • Can use non-negativity constraints (NN)
• Root Mean Squared Error (RMSE): solve using Ridge Regression (RR), with cross-validation to select the regularization parameter. Sacrifices correct orderings to produce predictions close to the outputs, and is extremely sensitive to noisy data, imperfect feature representations, a limited function class, and the scaling of problems.
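The normalized count of correctly ranked pairs can be computed directly. A minimal pure-Python sketch of Kendall's 𝜏 (my own helper, ignoring ties for simplicity):

```python
from itertools import combinations

def kendall_tau(scores, targets):
    """Kendall rank correlation: (concordant - discordant) / total pairs.
    Tied pairs are skipped in this simplified sketch."""
    concordant = discordant = 0
    for (s1, t1), (s2, t2) in combinations(zip(scores, targets), 2):
        if (s1 - s2) * (t1 - t2) > 0:
            concordant += 1
        elif (s1 - s2) * (t1 - t2) < 0:
            discordant += 1
    total = concordant + discordant
    return (concordant - discordant) / total if total else 0.0

# A heuristic that orders states correctly gets tau = 1 even if its
# magnitudes are far off: only the ordering matters in greedy search.
print(kendall_tau([10, 20, 30, 40], [1, 2, 3, 4]))   # 1.0
print(kendall_tau([40, 30, 20, 10], [1, 2, 3, 4]))   # -1.0
```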

Learning to Rank for Synthesizing Planning Heuristics
Caelan Reed Garrett, Leslie Pack Kaelbling, Tomás Lozano-Pérez, MIT CSAIL, {caelan,lpk,tlp}@csail.mit.edu

Learn a linear model for the heuristic function, $f(x) = \phi(x)^T w$.

Models for Heuristic Learning

RankSVM optimization problem:

$$\min_w \; \|w\|^2 + C \sum_{i=1}^{n} \sum_{j=1}^{m_i} \sum_{k=j+1}^{m_i} \xi_{ijk}$$

$$\text{s.t. } \phi(x_{ij})^T w \ge \phi(x_{ik})^T w + 1 - \xi_{ijk} \quad \forall\, y_{ij} > y_{ik},\ \forall i$$

$$\xi_{ijk} \ge 0 \quad \forall\, i, j, k.$$

Equivalently, as margin constraints on pairwise feature differences:

$$(\phi(x_{ij}) - \phi(x_{ik}))^T w \ge 1 - \xi_{ijk} \quad \forall\, y_{ij} > y_{ik},\ \forall i.$$
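Because the constraints reduce to margins on pairwise feature differences, RankSVM can be sketched as a linear classifier on within-plan difference vectors. Below is a minimal, hypothetical sketch using plain subgradient descent on the hinge loss rather than the solver used in the paper; the `rank_svm` helper and its plan format are my own assumptions:

```python
import numpy as np

def rank_svm(plans, C=1.0, lr=0.01, epochs=500):
    """Hypothetical RankSVM sketch: hinge loss on within-plan pairwise
    feature differences plus L2 regularization, optimized by plain
    subgradient descent (a stand-in for a real SVM solver)."""
    # Build difference vectors phi(x_ij) - phi(x_ik), pairing only
    # examples from the same training plan, oriented so the example
    # farther from the goal comes first.
    diffs = []
    for plan in plans:  # each plan: list of (feature_vector, distance_to_go)
        for j, (fj, yj) in enumerate(plan):
            for fk, yk in plan[j + 1:]:
                if yj > yk:
                    diffs.append(fj - fk)
                elif yk > yj:
                    diffs.append(fk - fj)
    D = np.array(diffs)
    w = np.zeros(D.shape[1])
    for _ in range(epochs):
        margins = D @ w
        violating = margins < 1.0          # pairs inside the unit margin
        grad = 2.0 * w - C * D[violating].sum(axis=0)
        w -= lr * grad
    return w
```

On a toy 1-D problem where the feature grows with distance-to-go, the learned weight comes out positive, i.e., the heuristic ranks farther states higher.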

Ridge regression objective:

$$\min_w \; \|\Phi(X) w - Y\|^2 + \lambda \|w\|^2$$

$$\mathrm{RMSE} = \frac{1}{n} \sum_{i=1}^{n} \sqrt{\frac{1}{m_i} \sum_{j=1}^{m_i} \left(f(x_{ij}) - y_{ij}\right)^2}$$
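The ridge objective above has a closed-form solution, and the per-problem RMSE can be computed directly. A small numpy sketch (the helper names `ridge_fit` and `rmse_per_problem` are my own, not the paper's):

```python
import numpy as np

def ridge_fit(Phi, Y, lam=1.0):
    """Closed-form ridge regression: w = (Phi^T Phi + lam*I)^-1 Phi^T Y."""
    d = Phi.shape[1]
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(d), Phi.T @ Y)

def rmse_per_problem(f, problems):
    """Average per-problem RMSE matching the formula above:
    (1/n) * sum_i sqrt((1/m_i) * sum_j (f(x_ij) - y_ij)^2)."""
    return np.mean([np.sqrt(np.mean([(f(x) - y) ** 2 for x, y in prob]))
                    for prob in problems])
```

With a tiny regularizer, `ridge_fit` recovers the least-squares weights; `rmse_per_problem` averages the square-root term over problems, as in the equation.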

Results

[Figure: scatter plots of RMSE vs. instances solved and 𝜏 vs. instances solved (y-axis: Solved), for the FF and CEA heuristics; series: Original, RR Single, RR Pair, RSVM Single, RSVM Pair, NN RSVM Pair; the 𝜏 axis spans roughly 0.84 to 1.]

• 2014 IPC learning track domains: elevators, transport, parking, no-mystery (90 of the largest testing problems)
• 6 configurations of deferred greedy best-first search for the FF and CEA heuristics
  • Feature representations (Single, Pair)
  • Learning techniques (Original, RR, RSVM, NN RSVM)
• Scatter plots of the learned heuristics' RMSE and 𝜏 vs. number solved for transport
  • RMSE positively correlated with solved implies a bad loss function
  • 𝜏 positively correlated with solved implies a good loss function

Background

• Learn a heuristic function for greedy best-first search to improve the coverage and efficiency of domain-specific planning
• Distribution of deterministic planning problems
• Generate training examples from each solvable problem
  • Use states on a plan to generate supervised pairs
  • Inputs are states along with their problem
  • Outputs are distances-to-go
• Training plans are often prohibitively noisy, so plans are locally smoothed with plan neighborhood graph search [Nakhost, 2010]

Notation: training plans $\{\Pi_1, \ldots, \Pi_n\}$; supervised pairs $\langle x_{ij}, y_{ij} \rangle$, where $x_{ij} = \langle s_{ij}, \Pi_i \rangle$ pairs state $s_{ij}$ with its problem $\Pi_i$, and $y_{ij}$ is its distance-to-go.
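The example-generation scheme above can be sketched in a few lines; the `training_examples` helper and its plan format are hypothetical, not the authors' code:

```python
def training_examples(plans):
    """Turn each solved training plan into supervised pairs: state s_ij
    paired with its problem Pi_i, labeled with the number of remaining
    plan steps y_ij (the distance-to-go)."""
    examples = []
    for problem, states in plans:   # states along the plan: s_i0, ..., s_im
        m = len(states) - 1
        for j, state in enumerate(states):
            x = (state, problem)    # x_ij = <s_ij, Pi_i>
            y = m - j               # y_ij = distance-to-go
            examples.append((x, y))
    return examples

# A tiny hypothetical plan with 3 states: the goal state gets label 0.
pairs = training_examples([("prob1", ["s0", "s1", "s2"])])
print(pairs)  # [(('s0', 'prob1'), 2), (('s1', 'prob1'), 1), (('s2', 'prob1'), 0)]
```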

Feature Representation

• Learning traditionally needs a function ɸ(x) that embeds the input
• Obtain features from existing domain-independent heuristics
  • Some heuristics produce approximate partially-ordered plans: FastForward (FF), Context-Enhanced Add (CEA), …
• Single actions: count instances of each action schema; unable to capture approximations in the approximate plan (~7 features)
• Pairwise actions: count partial orders along with interacting effects and preconditions (~40 features)

[Figure: relaxed plan graphs over states s0, x1, x2, and goal g, alongside a Blocksworld instance with blocks A, B, C, Q (Start and Goal shown); approximate plan actions: Pick(B), Stack(B,C), Pick(A), Stack(A,B), Pick(Q).]

• Single: ɸ(x) = [Pick: 3, Stack: 2]
• Pairwise: ɸ(x) = [Pick-Hold-Stack: 2, Pick-Clear-Stack: 1, …]

Blocksworld start and goal (stack ABC): d = 6, h_FF = 5
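The Single feature count for the Blocksworld example can be computed with a one-line helper; the prefix-based schema-name extraction here is my own simplifying assumption:

```python
from collections import Counter

def single_features(plan_actions):
    """'Single' features: count instances of each action schema in an
    approximate plan (schema name taken as the text before '(')."""
    return Counter(action.split("(")[0] for action in plan_actions)

# The Blocksworld relaxed plan from the example above:
plan = ["Pick(B)", "Stack(B,C)", "Pick(A)", "Stack(A,B)", "Pick(Q)"]
print(dict(single_features(plan)))  # {'Pick': 3, 'Stack': 2}
```

This reproduces ɸ(x) = [Pick: 3, Stack: 2]; the Pairwise features would additionally require the partial order and the interacting preconditions/effects, which are not reconstructible from the action names alone.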

Linear heuristic $f(x) = \phi(x)^T w$; Kendall $\tau \in [-1, 1]$.

1D case study: 2 problems with 2 examples each. The RR heuristic has a negative slope, pushing states away from the goal.

[Figure: 1D case study plotting y against ɸ(x) for examples x11, x12 (plan Π1) and x21, x22 (plan Π2), with fitted f_RR(x) and f_RSVM(x); RSVM pairwise differences ɸ(x_ia) − ɸ(x_ib) for (a, b) = (1, 2) and (2, 1).]
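The case study's effect can be reproduced with hypothetical 1-D numbers (not the figure's actual values) chosen so that the within-plan ordering increases with ɸ while the global regression trend decreases:

```python
import numpy as np

# Hypothetical data: two plans whose within-plan ordering increases with
# phi, but whose plan-level offsets make the pooled trend decrease.
plan1 = [(0.0, 10.0), (1.0, 11.0)]   # (phi(x), y) for x11, x12
plan2 = [(10.0, 0.0), (11.0, 1.0)]   # (phi(x), y) for x21, x22

# Least-squares fit over all four points: the slope comes out negative,
# so the regression heuristic pushes states away from the goal.
xs = np.array([p for p, _ in plan1 + plan2])
ys = np.array([y for _, y in plan1 + plan2])
rr_slope = np.polyfit(xs, ys, 1)[0]

# RankSVM only sees within-plan differences, which all point the same
# way, so any margin-satisfying weight must be positive.
within_plan_diffs = [plan1[1][0] - plan1[0][0], plan2[1][0] - plan2[0][0]]

print(rr_slope < 0, all(d > 0 for d in within_plan_diffs))  # True True
```

Here plain least squares stands in for ridge regression (the regularizer does not change the sign of the slope on this data).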

Conclusions

• Pairwise features are able to encode more information
• 𝜏 is generally correlated with planner performance
• RankSVM improves heuristic performance by optimizing 𝜏