Inductive Learning of Rules


1

Inductive Learning of Rules

Spores  Spots  Color  Edible?
Y       N      Brown  N
Y       Y      Grey   Y
N       Y      Black  Y
N       N      Brown  N
Y       N      White  N
Y       Y      Brown  Y
Y       N      Brown  ?
N       N      Red    ?

Don't try this at home...
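One rule consistent with the six labeled rows is "edible if and only if the mushroom has spots"; whether it generalizes to the two unlabeled rows is exactly the inductive question. The short Python sketch below (an illustration added here, not part of the slide) checks that rule against the labeled data and applies it to the unlabeled mushrooms.

```python
# Hedged illustration: one rule consistent with the labeled rows above is
# "edible iff the mushroom has spots". Check it, then apply it to the two
# unlabeled rows.

labeled = [  # (spores, spots, color, edible)
    ("Y", "N", "Brown", "N"),
    ("Y", "Y", "Grey",  "Y"),
    ("N", "Y", "Black", "Y"),
    ("N", "N", "Brown", "N"),
    ("Y", "N", "White", "N"),
    ("Y", "Y", "Brown", "Y"),
]
unlabeled = [("Y", "N", "Brown"), ("N", "N", "Red")]

def rule(spores, spots, color):
    """Induced hypothesis: a mushroom is edible iff it has spots."""
    return "Y" if spots == "Y" else "N"

assert all(rule(*row[:3]) == row[3] for row in labeled)  # consistent with all labeled rows
print([rule(*row) for row in unlabeled])                 # ['N', 'N']
```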

2

Types of Learning

What is learning?
- Improved performance over time/experience
- Increased knowledge

Speedup learning:
- No change to the set of theoretically inferable facts
- Change to the speed with which the agent can infer them

Inductive learning:
- More facts can be inferred

3

Mature Technology, Many Applications

- Detect fraudulent credit card transactions
- Information filtering systems that learn user preferences
- Autonomous vehicles that drive public highways (ALVINN)
- Decision trees for diagnosing heart attacks
- Speech synthesis (correct pronunciation) (NETtalk)
- Data mining: huge datasets, scaling issues

4

Defining a Learning Problem

Experience:
Task:
Performance Measure:

A program is said to learn from experience E with respect to task T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.

5

Example: Checkers

Task T: Playing checkers
Performance Measure P: Percent of games won against opponents
Experience E: Playing practice games against itself

6

Example: Handwriting Recognition

Task T: Recognizing and classifying handwritten words within images
Performance Measure P:
Experience E:

7

Example: Robot Driving

Task T: Driving on a public four-lane highway using vision sensors
Performance Measure P:
Experience E:

8

Example: Speech Recognition

Task T: Identification of a word sequence from audio recorded from arbitrary speakers ... noise
Performance Measure P:
Experience E:

9

Issues

- What feedback (experience) is available?
- What kind of knowledge is being increased?
- How is that knowledge represented?
- What prior information is available?
- What is the right learning algorithm?
- How to avoid overfitting?

10

Choosing the Training Experience

Credit assignment problem:
- Direct training examples: e.g. individual checker boards plus the correct move for each
- Indirect training examples: e.g. a complete sequence of moves and the final result

Which examples: random, teacher chooses, learner chooses

Supervised learning / Reinforcement learning / Unsupervised learning

11

Choosing the Target Function

What type of knowledge will be learned?
How will the knowledge be used by the performance program?

E.g. a checkers program:
- Assume it knows the legal moves
- It needs to choose the best move
- So learn the function F: Boards -> Moves (hard to learn)
- Alternative: F: Boards -> R (a real-valued board evaluation)

12

The Ideal Evaluation Function

V(b) = 100 if b is a final, won board
V(b) = -100 if b is a final, lost board
V(b) = 0 if b is a final, drawn board
Otherwise, if b is not final:
V(b) = V(s), where s is the best final board reachable from b

This definition is nonoperational...
We want an operational approximation V̂ of V.

13

How to Represent the Target Function

x1 = number of black pieces on the board
x2 = number of red pieces on the board
x3 = number of black kings on the board
x4 = number of red kings on the board
x5 = number of black pieces threatened by red
x6 = number of red pieces threatened by black

V̂(b) = a + b·x1 + c·x2 + d·x3 + e·x4 + f·x5 + g·x6

Now we just need to learn 7 numbers!
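For concreteness, here is a minimal Python sketch of this linear evaluation function. The BoardFeatures container and the weight values are illustrative assumptions; the slide only fixes the features x1..x6 and the seven coefficients a..g, which would be learned.

```python
# Sketch of the linear approximation V̂(b) from the slide. BoardFeatures and
# the weight values are illustrative assumptions.
from dataclasses import dataclass

@dataclass
class BoardFeatures:
    black_pieces: int       # x1
    red_pieces: int         # x2
    black_kings: int        # x3
    red_kings: int          # x4
    black_threatened: int   # x5 (black pieces threatened by red)
    red_threatened: int     # x6 (red pieces threatened by black)

def v_hat(b: BoardFeatures, w: list[float]) -> float:
    """V̂(b) = w[0] + w[1]*x1 + ... + w[6]*x6  (the slide's a..g)."""
    x = [b.black_pieces, b.red_pieces, b.black_kings,
         b.red_kings, b.black_threatened, b.red_threatened]
    return w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))

weights = [0.0, 1.0, -1.0, 3.0, -3.0, -0.5, 0.5]          # the 7 numbers to learn
print(v_hat(BoardFeatures(12, 12, 0, 0, 1, 2), weights))  # 0.5
```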

14

Target Function

Profound formulation: any type of inductive learning can be expressed as approximating a function.

- E.g., Checkers: V: boards -> evaluation
- E.g., Handwriting recognition: V: image -> word
- E.g., Mushrooms: V: mushroom-attributes -> {E, P}

Inductive bias

15

Theory of Inductive Learning

Error(f) = Σ_{x ∈ D} Pr(x)

16

Theory of Inductive Learning

Suppose our examples are drawn with a probability distribution Pr(x), and that we have learned a hypothesis f to describe a concept C.

We can define Error(f) to be:

Error(f) = Σ_{x ∈ D} Pr(x)

where D is the set of all examples on which f and C disagree.
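Since Pr is usually unknown, Error(f) is in practice estimated by sampling. The short sketch below is an added illustration; the toy distribution, concept, and hypothesis are assumptions.

```python
# Estimate Error(f) = Σ_{x in D} Pr(x) by sampling from Pr and counting how
# often the hypothesis f and the target concept c disagree.
import random

def estimate_error(f, c, sample_x, n=100_000):
    """Monte Carlo estimate of the probability mass on which f and c disagree."""
    return sum(f(x) != c(x) for x in (sample_x() for _ in range(n))) / n

c = lambda x: x > 0.5        # target concept C
f = lambda x: x > 0.6        # learned hypothesis f
print(estimate_error(f, c, random.random))  # ≈ 0.1: D is the interval [0.5, 0.6]
```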

17

PAC Learning

We're not perfect (in more than one way), so why should our programs be perfect?

What we want is: Error(f) < ε, for some chosen ε.

But sometimes we're completely clueless (hopefully with low probability δ). What we really want is:

Prob(Error(f) < ε) > 1 - δ

As the number of examples grows, ε and δ should decrease.

We call this Probably Approximately Correct (PAC) learning.

18

Definition of PAC Learnability

Let C be a class of concepts.

We say that C is PAC learnable by a hypothesis space H if there is a polynomial-time algorithm A and a polynomial function p such that, for every concept c in C, every probability distribution Pr, and every ε and δ, if A is given at least p(1/ε, 1/δ) examples, then A returns, with probability 1 - δ, a hypothesis whose error is less than ε.

k-DNF and k-CNF are PAC learnable.
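The slide does not give the polynomial p explicitly. For a consistent learner over a finite hypothesis space H, a standard sufficient sample size is m >= (1/ε)(ln|H| + ln(1/δ)); the sketch below simply evaluates that bound, and the conjunctions example is an added illustration.

```python
# Standard sample-complexity bound for a consistent learner over a finite
# hypothesis space H (not stated on the slide): with at least
#   m >= (1/eps) * (ln|H| + ln(1/delta))
# examples, the returned consistent hypothesis has error < eps with
# probability at least 1 - delta.
import math

def sample_bound(h_size: int, eps: float, delta: float) -> int:
    return math.ceil((math.log(h_size) + math.log(1 / delta)) / eps)

# Illustration: conjunctions of literals over 10 boolean attributes
# (|H| = 3**10 + 1, counting the empty hypothesis).
print(sample_bound(3**10 + 1, eps=0.1, delta=0.05))  # 140 examples suffice
```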

19

Version Spaces: A Learning Algorithm

Key idea: maintain the most specific and the most general hypotheses at every point, and update them as examples come in.

We describe objects in the space by attributes:
- faculty, staff, student
- 20's, 30's, 40's
- male, female

Concepts: boolean combinations of attribute values, e.g. faculty, 30's, male, female, 20's.

20

Generalization and Specialization

A concept C1 is more general than C2 if it describes a superset of the objects: C1 = {20's, faculty} is more general than C2 = {20's, faculty, female}. C2 is a specialization of C1.

Immediate specializations (generalizations).

The version space algorithm maintains the most specific and most general boundaries at every point of the learning, as sketched below.
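Here is a small sketch of the generality ordering for conjunctive concepts represented as sets of attribute values, plus a minimal generalization of the specific boundary S on a positive example. It is a simplified illustration of the idea, not the full candidate-elimination algorithm.

```python
# Conjunctive concepts as sets of required attribute values (as on the slide).
# Fewer requirements = more general (covers a superset of objects).

def more_general(c1: frozenset, c2: frozenset) -> bool:
    """True if c1 is more general than (or equal to) c2."""
    return c1 <= c2                      # c1 demands a subset of c2's values

def covers(concept: frozenset, example: frozenset) -> bool:
    """A concept covers an example if every required value appears in it."""
    return concept <= example

def generalize_S(S: frozenset, positive: frozenset) -> frozenset:
    """Minimally generalize the specific boundary to cover a new positive example."""
    return S & positive                  # keep only the shared attribute values

c1 = frozenset({"20's", "faculty"})
c2 = frozenset({"20's", "faculty", "female"})
print(more_general(c1, c2))              # True: c1 describes a superset of objects

S = frozenset({"20's", "faculty", "female"})          # first positive example
S = generalize_S(S, frozenset({"30's", "faculty", "female"}))
print(sorted(S))                         # ['faculty', 'female']
```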

21

Example

Part of the generalization lattice for this attribute space, from most general (T) down to more specific conjunctions:

T
{male}  {female}  {faculty}  {student}  {20's}  {30's}
{male, faculty}  {male, student}  {female, faculty}  {female, student}  {faculty, 20's}  {faculty, 30's}
{male, faculty, 20's}  {male, faculty, 30's}  {female, faculty, 20's}  {male, student, 30's}