Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for...

36
University of Kent, 13 June 2011 Neural-Symbolic Systems for Cognitive Reasoning Artur d’Avila Garcez City University London [email protected]

Transcript of Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for...

Page 1: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

University of Kent, 13 June 2011

Neural-Symbolic Systems for Cognitive Reasoning

Artur d’Avila GarcezCity University London

[email protected]

Page 2: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Neural-Symbolic Computation Cognitive Science

Neuroscience

Reasoning Learning

Uncertainty

Page 3: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Neural-Symbolic Systems

“We need some language for describing the alternative algorithms that a network of neurons

may be implementing” L. Valiant

(New) Logic + Neural Computation

Learning from experience and reasoning about what has been learned in an uncertain

environment in a computationally efficient way

Page 4: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Symbolic vs. Connectionist (brain/mind) dichotomy

1960s-1980s: Expert Systems (hand-crafted rules)

1990's-present: Neural networks, Support vector machines (difficult to include domain knowledge)

New AI: Bayesian learning, probabilistic graphical models, efficient inference

IET/BCS lecture 2010, Chris Bishop

Artificial Intelligence

Page 5: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Neural-Symbolic Methodologyhigh-level symbolic representations

(abstraction, recursion, relations, modalities)

translations

low level, efficient neural structures(with the same, simple architecture throughout)

Analogy: low-level implementation (machine code) of high-level representations (e.g. java, requirements)

Page 6: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

A Foundational Approach(as opposed to the neuroscience or the engineering approach)

One Structure for Learning and Reasoning:

Take different tasks, consider what they have in common, formalize, evaluate and repeat.

Key aspect: controlling the inevitable accumulation of errors as part of a

perception-action cycle

Applications: training in simulators, robocup, verification of software models, bioinformatics, power

plant fault diagnosis, semantic web, general game playing, visual intelligence.

Page 7: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Neural-Symbolic Learning Systems

Symbolic

KnowledgeSymbolic

Knowledge

NeuralNetwork

Examples

Learning

Connectionist System

InferenceMachine

Explanation

1

2

3

4

5

Page 8: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Connectionist Inductive Logic Programming (CILP) System

A Neural-Symbolic System for Integrated Reasoning and Learning • Knowledge Insertion, Revision (Learning), Extraction

(based on Towell and Shavik, Knowledge-Based Artificial Neural Networks. Artificial Intelligence, 70:119-165, 1994)

• Applications: DNA Sequence Analysis, Power Systems Fault Diagnosis(CILP using backpropagation with background knowledge:

test set performance is comparable to backpropagation; test set performance on small training sets is comparable to KBANN;training set performance is superior than backpropagation and KBANN)

Page 9: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

r1: A ← B,C,~D;

r2 : A ← E,F;

r3 : B ←

CILP Translation AlgorithmA BθA θB

W WW

θ1 h1 θ2 h2 θ3 h3

B FE DC

WWW -WW

Interpretations

THEOREM: For any logic program P there exists a neural network N such that N computes Pbased on Holldobler and Kalinke’s translation, but extended to sigmoid neurons (backprop) and hetero-associative networksHolldobler and Kalinke, Towards a Massively Parallel Computational Model for Logic Programming. ECAI Workshop Combining Symbolic and Connectionist Processing , 1994.

Page 10: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Power Plant Fault Diagnosis (problem)

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

Mapping 23 alarms to 32 faults

Transformer 0113,8Kv/230Kv

Generator01

BreakerTransformer 01

BreakerTransformer 02

BreakerBy-pass

Main Bus Auxiliary Bus

Transformer 0213,8Kv/230Kv

BreakerTransmission Line 01

Generator02

TransmissionLine 01

BreakerTransmission Line 02

TransmissionLine 02

Page 11: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Background Knowledge (35 rules with noise)278 examples of single and multiple faults

Fault(ground,close-up,line01,no-bypass) IF Alarm(instantaneous,line01) AND Alarm(ground,line01)

There is a fault at transmission line 01, close to the power plant generator, due to an over-current in the ground line of transmission line 01, which occurred when the system was not using the bypass circuit.

Power Plant Fault Diagnosis

Page 12: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Also, as expected, CILP networks learn faster

Power Plant Fault Diagnosis (results)10096.8 98.797.7

95.496.7

87.783.3

70

80

90

100

Single Fault Multiple Faults

% te

st s

et p

erfo

rman

ce

CILPBackprop35Backprop15Backprop5

Page 13: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Promoter Recognition (problem)

Minus35 Minus10 Conform. Contact Promoter

-50 DNA +7 Minus35 Minus10 Conform. Contact

Promoter = small DNA sequence at beginning of genes

Page 14: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Promoter Recognition (results)Background Knowledge (14 rules):

Promoter IF Contact AND ConformationContact IF Minus10 AND Minus35Minus35 IF @-36'ttgac'Minus35 IF @-37'cttgac'Conformation IF @-47'caa*tt*ac' AND @-22'g***t*c' AND @-8'gcgcc*cc'

10-fold cross-validation on set of 106 examples

CILP networks learn faster than backpropagation and KBANN, and perform slightly better than backpropagation and KBANN on small training sets. We attribute this to the soundness of the CILP transla-tion (i.e. the above theorem).

We also ran experiments on the splice-junction determination prob-lem obtaining similar results

Page 15: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

CILP Rule Extraction• Knowledge is extracted by querying/sampling

the trained network;• A partial ordering helps guide the search,

reducing complexity on the average case;• A proof of soundness guarantees that the rules

approximate the behaviour of the network;• Rule simplification and visualization

techniques help experts validate the rules;• The rules can be visualized in the form of a

state transition diagram

Page 16: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

CILP Extraction Algorithm

THEOREM: CILP rule extraction is soundChallenge: efficient extraction of sound, readable knowledge from large-scale networks

{-1,1,1}

{1,1,1}

{1,-1,-1}

{1,1,-1} {1,-1,1}

{-1,1,-1}

{-1,-1,-1}

{-1,-1,1}

[a,b,c]

a → h0

b → h0

c → h0

b, c → h1

a, c → h1

a, b → h1

1 (a, b, c) → h0

2(a, b, c) → h1

Page 17: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

d'Avila Garcez, Zaverucha. The CILP System. Applied Intelligence 11:59-77, 1999.

d'Avila Garcez, Broda, Gabbay. Knowledge Extraction from Neural Networks: A Sound Approach. Artificial Intelligence 125:153-205, 2001.

d'Avila Garcez, Broda, Gabbay. Neural-Symbolic Learning Systems. Springer-Verlag, 2002.

Publications on CILP

Page 18: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

CILP extensions (deep networks)

• The importance of non-classical reasoning• Preference, Modal, Temporal, Epistemic, Intuitionistic,

Abductive Reasoning, Value-based Argumentation.• New applications including normative reasoning

(robocup), temporal logic learning (model checking), software model adaptation (business process evolution from text), training and assessment in simulators (driving test), visual intelligence (video classification), general game playing, web ontology learning.

Page 19: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Connectionist Modal Logic (CML)

W1

W3W2

CILP network ensembles, modularity for learning,accessibility relations, disjunctive information

Page 20: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Semantics of necessity and possibility

A proposition is necessary (box) in a world if it is true in all worlds which are possible in relation to that world.

A proposition is possible (diamond) in a world if it is true in at least one world which is possible in relation to that same world.

Modalities used for reasoning about uncertainty (following J. Halpern, MIT Press).

Page 21: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Representing box and diamond

p

q

W2

W3

W1

q

q

p

Page 22: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Translates modal programs into ensembles of CILP networks, i.e. clauses Wi : ML1,...,MLn → MA and relations R(Wa,Wb) between worlds Wa and Wb, with M in {box, diamond}.

THEOREM: For any modal program P there exists an ensemble of networks N such that N computes P.

CML Translation Algorithm

Page 23: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Learning in CML

We have applied CML to a benchmark distributed knowledge representation problem: the muddy children puzzle

(children are playing in a garden; some have mud on their faces, some don’t; they can see if the others are muddy, but not themselves; a caretaker asks: do you know if you’re muddy? At least one of you is)

Learning with modal background knowledge offers better accuracy than learning by examples only (93% vs. 84% test set accuracy)

Page 24: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Connectionist Temporal Reasoning

Agent 1 Agent 2 Agent 3

t1

t2

t3

at least 1 muddy

3 muddy children

at least 2 muddy

A full solution to the muddy children puzzle can only be given by a two-dimensional network ensemble

THEOREM: For any temporal program P there exists an ensemble of networks N such that N computes P.

Page 25: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Garcez, Lamb, Gabbay. Connectionist Modal Logic. TCS, 371: 34-53, 2007. Garcez, Lamb, Gabbay. Connectionist Computations of Intuitionistic Reasoning. TCS, 358:34-55, 2006. Garcez, Lamb. Connectionist Model for Epistemic and Temporal Reasoning. Neural Computation, 18:1711-1738, July 2006.

Lamb, Borges, Garcez. A Connectionist Cognitive Model for Temporal Synchronisation and Learning, AAAI 2007.

Publications on Nonclassical Computation

Page 26: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Combining (Fibring) Networks

.

.

.

.

.

.

.

.

.

.

.

.

Network A

Network B

fibring function

Expressiveness to represent first-order logicLoosely-coupled integration: e.g. Network A and Legacy System B

A neuron that is a network! Modulation or recursion?

Page 27: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Fibring Expressiveness

A

xWh

o

ϕ B

Wo

B

yWh

o

ϕ C

Wo

C

zWh

oWo

Fibred networks approx-imate any polynomial function in unbounded domains, e.g. f(x)=x2, as opposed to each of A, B, C which are universal ap-proximators in compact domains only.

Page 28: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Examples of Fibring: Relational Learning

Experiments on the east-west trains dataset show an improvement from 62% (flat, propositional network) to 80% (metalevel network) on test set performance (leaving one out cross-validation)

X Y Z

X Y Z

X Y Z

X Y Z

P Q

α β γ δ

Grandparent(X,Z) IF parent(X,Y) AND parent(Y,Z)Inputs presented to P and Q at the same time trigger the learning process in the meta-level

Page 29: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Examples of Fibring: Metaphors

A B C

p q

Domain 1

Domain 2

D E

Combining a CILP network and a restricted Boltzmann machine

Page 30: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Cognitive Model: Fibred Network Ensembles

meta-level relations

fibring functions

object-level

Page 31: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Garcez, Lamb, Gabbay. Neural-Symbolic Cognitive Reasoning. Springer, 2009.

Garcez, Gabbay, Ray, Woods. Abductive Reasoning in Neural-Symbolic Systems. Topoi 26:37-49, 2007.

Bader, Garcez and Hitzler. Computing First Order Logic Programs by Fibring Artificial Neural Networks. FLAIRS Conference, AAAI Press, 2005.

Garcez, Gabbay. Fibring Neural Networks. AAAI 2004, July 2004.

Publications on Fibring and Cognitive Model

Page 32: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Recent ApplicationsTraining and Assessment in Simulators

Learning new information from observation of experts and trainees at task execution and reasoning about this

information online to provide feedback to the user

L. de Penning, A. S. d'Avila Garcez, L. C. Lamb and J. J. Meyer. A Neural-Symbolic Cognitive Agent for Online Learning and Reasoning. IJCAI'11, July 2011

Page 33: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Recent Applications (cont.)

Learning Normative Rules of the RoboCup Competition

G. Boella, S. Colombo Tosatto, A. S. d'Avila Garcez, V. Genovese, D. Ienco and L. van der Torre. A Neural-Symbolic System for Normative Agents. AAMAS'11, May 2011.

Page 34: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Recent Applications (cont.)Software Model Verification and Adaptation Framework

Borges, Garcez, Lamb, Nuseibeh. Learning to Adapt Requirements Spe-cifications of Evolving Systems. ICSE (NIER Track), May 2011.

Page 35: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Conclusion: Why Neurons and Symbols

To study the statistical nature of learning and the logical nature of reasoning.

To provide a unifying foundation for robust learning and efficient reasoning.

To develop effective computational systems for integrated reason-ing and learning.

Page 36: Neural-Symbolic Systems for Cognitive Reasoningaag/talks/kent.pdf · Neural-Symbolic Systems for Cognitive Reasoning ... Towards a Massively Parallel Computational Model for Logic

Current/Future Work

• Theory: how brains make mental models / abduction / argumentation

• Practice: systems and applications (training in simulators, visual intelligence)

• First Order Logic Learning: encoding vs. propositionalisation / binding problem

• Neural Networks for Normative Systems (Robocup)• General Game Playing (Stochastic CILP)• Adding/Extracting domain knowledge from deep

belief networks