Distinguish Wild Mushrooms with Decision Tree

10
Distinguish Wild Mushrooms with Decision Tree Shiqin Yan

description

Distinguish Wild Mushrooms with Decision Tree. Shiqin Yan. Objective. Utilize the already existed database of the mushrooms to build a decision tree to assist the process of determine the whether the mushroom is poisonous . DataSet. - PowerPoint PPT Presentation

Transcript of Distinguish Wild Mushrooms with Decision Tree

Page 1: Distinguish Wild Mushrooms with Decision Tree

Distinguish Wild Mushrooms with Decision Tree

Shiqin Yan

Page 2: Distinguish Wild Mushrooms with Decision Tree

Objective Utilize the already existed database of the

mushrooms to build a decision tree to assist the process of determine the whether the mushroom is poisonous.

Page 3: Distinguish Wild Mushrooms with Decision Tree

DataSet Existing record drawn from the Audubon

Society Field Guide to North American Mushrooms (1981) . G. H. Lincoff (Pres. ), NewYork: Alfred A. Knopf

Number of Instances: 8124 (classified as either edible or poisonous)

Number of Attributes: 22 Training: 5416, Tuning: 1354, Testing: 1354 Missing attribute values: 2480 (denoted by

“?”), all for attribute 11

Page 4: Distinguish Wild Mushrooms with Decision Tree

Mushroom Features 1. cap-shape: bell=b, conical=c, convex=x,

flat=f, knobbed=k, sunken = s 2. cap-surface: fibrous=f, grooves=g,

scaly=y, smooth=s 3. cap-color: brown=n, buff=b, cinnamon=c,

gray=g, green=r, pink=p, purple=u, red=e, white=w, yellow=y

4. bruise?: bruises=t, no=f 5. odor: almond=a, anise=l, creosote=c,

fishy=y, foul=f …

Page 5: Distinguish Wild Mushrooms with Decision Tree
Page 6: Distinguish Wild Mushrooms with Decision Tree

Approach Mutual information to determine the features

used to split the tree.

Mutual information: Y: label, X: feature Choose feature X which maximizes I(Y;X)

Page 7: Distinguish Wild Mushrooms with Decision Tree
Page 8: Distinguish Wild Mushrooms with Decision Tree

Most informative features extracted from decision tree: odor spore-print-color habitat population

Page 9: Distinguish Wild Mushrooms with Decision Tree

Prior Research

by Wlodzislaw Duch, Department of Computer Methods, Nicholas Copernicus University

Page 10: Distinguish Wild Mushrooms with Decision Tree

Add cross-validation to improve the accuracy

Prune the tree to avoid over-fitting

Future