Machine Learning Applied in Product Classification

Jianfu ChenComputer Science Department

Stony Brook University

Machine learning learns an idealized model of the real world.

+¿ ¿

1 + 1 = 2

+¿ ¿ ?

Prod1 -> class1Prod2 -> class2

f(x) -> y Prod3 -> ?

X: Kindle Fire HD 8.9" 4G LTE Wireless 0 ... 1 1 ... 1 ... 1 ... 0 ...

Compoenents of the magic box f(x)

Representat

• Give a score to each class• s(y; x) =

Inference

• Predict the class with highest score

Learning

• Estimate the parameters from data

Representation

Linear Model

• s(y;x)=

Probabilistic Model

• P(x,y)• Naive Bayes

• P(y|x)• Logistic

Regression

Algorithmic Model

• Decision Tree• Neural

Networks

Given an example, a model gives a score to each class.

Linear Model

• a linear comibination of the feature values. • a hyperplane.• Use one weight vector to score each class.

𝑤2𝑤3

Example

• Suppose we have 3 classes, 2 features• weight vectors

Probabilistic model

• Gives a probability to class y given example x:

• Two ways to do this:– Generative model: P(x,y) (e.g., Naive Bayes)

– discriminative model: P(y|x) (e.g., Logistic Regression)

Compoenents of the magic box f(x)

Representat

• Give a score to each class• s(y; x) =

Inference

• Predict the class with highest score

Learning

• Estimate the parameters from data

Learning

• Parameter estimation ()– ’s in a linear model– parameters for a probabilistic model

• Learning is usually formulated as an optimization problem.

Define an optimization objective- average misclassification cost

• The misclassification cost of a single example x from class y into class y’:

– formally called loss function• The average misclassification cost on the

training set:

– formally called empirical risk

Define misclassification cost

• 0-1 loss

average 0-1 loss is the error rate = 1 – accuracy:

• revenue loss

Do the optimization- minimizes a convex upper bound of

the average misclassification cost.

• Directly minimizing average misclassificaiton cost is intractable, since the objective is non-convex.

•minimize a convex upper bound instead.

A taste of SVM

• minimizes a convex upper bound of 0-1 loss

where C is a hyper parameter, regularization parameter.

Machine learning in practice

feature extraction { (x, y) }

select a model/classifier

Setup experimenttraining:development:test4 : 2 : 4

call a package to do experiments

• LIBLINEARhttp://www.csie.ntu.edu.tw/~cjlin/liblinear/• find best C in developement set• test final performance on test set

Cost-sensitive learning

• Standard classifier learning optimizes error rate by default, assuming all misclassification leads to uniform cost

• In product taxonomy classification

keyboardmousetruck car

IPhone5

Nokia 3720 Classic

Minimize average revenue loss

where is the potential annual revenue of product x if it is correctly classified;

is the loss ratio of the revenue by misclassifying a product from class y to class y’.

Conclusion

• Machine learning learns an idealized model of the real world.

• The model can be applied to predict unseen data.

• Classifier learning minimizes average misclassification cost.

• It is important to define an appropriate misclassification cost.

Machine Learning Applied in Product Classification

Documents

Transcript of Machine Learning Applied in Product Classification

Classification of milling machine

Large-scale Optimisation (ITNPD8) Combinatorial Optimisation … - Population... · 2015-02-04 · • Applied to: machine learning tasks (prediction, classification…) • Representation

A Machine Learning Application for Classification of ... Machine Learning Application for Classification of ... accurate analysis of spectroscopic data from ... Application for Classification

Applied Machine Learning - Classes

Support Vector Machine & Image Classification Applications

Applied Machine Vision - Georgia Institute of Technologykmlee.gatech.edu/me6406/Applied Machine Vision Lecture...Applied Machine Vision ME Machine Vision Class Doug Britton – GTRI

Machine learning for multi-criteria inventory classification applied … · 2018-10-05 · 1 Machine learning for multi-criteria inventory classification applied to intermittent demand

MACHINE LEARNING FOR CLASSIFICATION OF AN ERODING … … · Innsbruck Summer School, Obergurgl (Rutzinger et al., 2018, 2016), a team of researchers applied machine learning (ML)

Introduction to Classification, aka Machine Learningclasses.ischool.syr.edu/ist664/NLPFall2015/... · 2015-10-26 · Introduction to Classification, aka Machine Learning . 2 Classification:

Sentiment Classification using Machine Learning Techniquescis.csuohio.edu/~sschung/CIS660/SentimentAnalysis... · 2002. Thumbs up? Sentiment Classification using Machine Learning

Machine Learning in 5 Minutes— Classification

A Machine Learning Classification Broker for ...

Machine Learning for Document Classification

COMP 551 -Applied Machine Learning Lecture 4 -- …Basic idea –“Machine Learning 101”: Implement two linear classification algorithms (from this lecture and next lecture) Run

Classification: Physical Sciences, Applied Physical ...

CS 391L: Machine Learning: Inductive Classification

Introduction to Machine Learning & Classification

Question classification via machine learning techniques

Machine Learning Applied in Product Classification

Machine Learning algorithms for Image Classification of ... · machine learning algorithms for image classification has been documented. The implementation of five classification