WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

13
WEKA – Knowledge Flow & Simple CLI

Transcript of WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Page 1: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

WEKA – Knowledge Flow & Simple CLI

Page 2: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Knowledge Flow

Page 3: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Knowledge Flow ..

• A graphical front end to Weka's core algorithms• It is a work in progress• It can handle data either incrementally or in batches (the

Explorer handles batch data only). Currently in Weka there are five classifiers that can handle data incrementally: NaiveBayesUpdateable, IB1, IBk, LWR (locally weighted regression).

Page 4: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Features

• intuitive data flow style layout • process data in batches or incrementally • process multiple batches or streams in parallel! (each

separate flow executes in its own thread) • chain filters together • view models produced by classifiers for each fold in a

cross validation • visualize performance of incremental classifiers during

processing (scrolling plots of classification accuracy, RMS error, predictions etc)

Page 5: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Components

• DataSourcesAll of Weka's loaders are available

• DataSinksAll of Weka's savers are available

• FiltersAll of Weka's filters are available

• ClassifiersAll of Weka's classifiers are available

• ClusterersAll of Weka's clusterers are available

Page 6: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Components ..• Evaluation

– TrainingSetMaker - make a data set into a training set – TestSetMaker - make a data set into a test set – CrossValidationFoldMaker - split any data set, training set or test set into

folds – TrainTestSplitMaker - split any data set, training set or test set – ClassAssigner - assign a column to be the class for any data set– ClassValuePicker - choose a class value as the "positive" class. – ClassifierPerformanceEvaluator - evaluate the performance of batch

trained/tested classifiers – IncrementalClassifierEvaluator - evaluate the performance of

incrementally trained classifiers – ClustererPerformanceEvaluator - evaluate the performance of batch

trained/tested clusterers – PredictionAppender - append classifier predictions to a test set.

Page 7: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Components ..• Visualization

– DataVisualizer - visualizing data in a single large 2D scatter plot – ScatterPlotMatrix - containing a matrix of small scatter plots (clicking

on a small plot pops up a large scatter plot) – AttributeSummarizer - component that can pop up a panel containing

a matrix of histogram plots - one for each of the attributes in the input data

– ModelPerformanceChart - component that can pop up a panel for visualizing threshold (i.e. ROC style) curves.

– TextViewer - component for showing textual data. Can show data sets, classification performance statistics etc.

– GraphViewer - pop up a panel for visualizing tree based models – StripChart - component that can pop up a panel that displays a

scrolling plot of data

Page 8: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Example

Page 9: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Result

Page 10: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Simple CLI

Page 11: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Simple CLI ..

• The Simple CLI provides full access to all Weka classes, i.e., classifiers, filters, clusterers, etc.

• It offers a simple Weka shell with separated commandline and output.

Page 12: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Commands

• java <classname> [<args>]: invokes a java class with the given arguments (if any)

• break: stops the current thread, e.g., a running classifier, in a friendly manner

• kill: stops the current thread in an unfriendly fashion • cls: clears the output area • exit: exits the Simple CLI • help [<command>]: provides an overview of the

available commands if without a command name as argument, otherwise more help on the specified command

Page 13: WEKA – Knowledge Flow & Simple CLI. Knowledge Flow.

Example

• java weka.classifiers.trees.J48 -t e:\data\iris.arff