Automatic Machine Learning, AutoML

Automatic Machine Learning

By: Himadri Mishra, 13074014

Overview: What is Machine Learning?

● Subfield of computer science
● Evolved from the study of pattern recognition and computational learning theory in artificial intelligence
● Gives computers the ability to learn without being explicitly programmed
● Explores the study and construction of algorithms that can learn from and make predictions on data

Basic Flow of Machine Learning

Overview: Why Machine Learning?

● Some tasks are difficult to define algorithmically. Example: Learning to recognize objects.

● High-value predictions that can guide better decisions and smart actions in real time without human intervention

● Machine learning is a technology that helps analyze large chunks of big data

What is Automatic Machine Learning?

● Research area that targets progressive automation of machine learning
● Also known as AutoML
● Focuses on end users without expert knowledge
● Offers new tools to machine learning experts:
  ○ Perform architecture search over deep representations
  ○ Analyse the importance of hyperparameters
  ○ Development of flexible software packages that can be instantiated automatically in a data-driven way
● Follows the paradigm of Programming by Optimization (PbO)

Examples of AutoML

● AutoWEKA: Approach for the simultaneous selection of a machine learning algorithm and its hyperparameters

● Deep Neural Networks: notoriously dependent on their hyperparameters; modern optimizers have achieved better results in setting them than humans (Bergstra et al., Snoek et al.).

● Making a science of model search: a complex computer vision architecture could automatically be instantiated to yield state-of-the-art results on 3 different tasks: face matching, face identification, and object recognition.

Methods of AutoML

● Bayesian optimization
● Regression models for structured data and big data
● Meta learning
● Transfer learning
● Combinatorial optimization

An AutoML Framework

Modules of AutoML Framework, unraveled

● Data Pre-Processing
● Problem Identification and Data Splitting
● Feature Engineering
● Feature Stacking
● Application of various models to data
● Decomposition
● Feature Selection
● Model selection and HyperParameter tuning
● Evaluation of Model

Data Pre-Processing

● Tabular data is the most common way of representing data in machine learning or data mining

● Data must be converted to a tabular form

Problem Identification and Data Splitting

Types of Labels

● Single column, binary values (Binary Classification)
● Single column, real values (Regression problem)
● Multiple column, binary values (Multi-Class Classification)
● Multiple column, real values (Multiple target Regression problem)
● Multilabel Classification

● Stratified KFold splitting for Classification
● Normal KFold split for regression
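The splitting rule above can be sketched with scikit-learn. This is a minimal illustration, not the framework's actual code: the helper name `make_folds`, the toy data, and the choice of five folds are assumptions made for the example.

```python
import numpy as np
from sklearn.model_selection import KFold, StratifiedKFold

rng = np.random.RandomState(0)
X = rng.rand(100, 4)                      # toy feature matrix
y_class = np.array([0] * 80 + [1] * 20)   # imbalanced binary labels
y_reg = rng.rand(100)                     # real-valued targets


def make_folds(X, y, task, n_splits=5, seed=42):
    """Return (train_idx, valid_idx) pairs; hypothetical helper for illustration."""
    if task == "classification":
        # Stratification keeps the class ratio roughly equal in every fold.
        splitter = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
        return list(splitter.split(X, y))
    # Regression targets have no classes to balance, so a plain KFold is used.
    splitter = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    return list(splitter.split(X))


class_folds = make_folds(X, y_class, "classification")
reg_folds = make_folds(X, y_reg, "regression")

# Print the per-class counts of each stratified validation fold.
for _, valid_idx in class_folds:
    print(np.bincount(y_class[valid_idx]))
```

With an 80/20 class balance, every stratified validation fold should hold the same share of each class, which a plain KFold does not guarantee.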

Feature Engineering

Types of Variables

● Numerical Variables
  ○ No Processing Required
● Categorical Variables
  ○ Label Encoders
  ○ One Hot Encoders
● Text Variables
  ○ Count Vectorize
  ○ TF-IDF vectorize
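The encoders named for categorical and text variables map onto standard scikit-learn classes. The sketch below applies each one to toy data; the variable names and sample values are made up for illustration.

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.preprocessing import LabelEncoder, OneHotEncoder

colors = np.array(["red", "green", "blue", "green"])   # a categorical column
texts = [                                               # a text column
    "auto ml tunes models",
    "models learn from data",
    "data drives auto ml",
    "ml models need data",
]

# Label encoding: each category becomes an integer (classes sorted alphabetically).
label_codes = LabelEncoder().fit_transform(colors)

# One-hot encoding: one binary column per category, returned as a sparse matrix.
one_hot = OneHotEncoder(handle_unknown="ignore").fit_transform(colors.reshape(-1, 1))

# Text: raw term counts and TF-IDF weights, both returned as sparse matrices.
counts = CountVectorizer().fit_transform(texts)
tfidf = TfidfVectorizer().fit_transform(texts)

print(label_codes, one_hot.shape, counts.shape, tfidf.shape)
```

Label encoding imposes an arbitrary order on the categories, which tree-based models tolerate better than linear ones; one-hot encoding avoids that at the cost of more columns.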

Feature Stacking

● Two Kinds of Stacking
  ○ Model Stacking
    ■ An Ensemble Approach
    ■ Combines the power of diverse models into a single model
  ○ Feature Stacking
    ■ Different features, after processing, are combined
● Our Stacker Module is a feature stacker
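Feature stacking, in the sense used here, amounts to placing the processed feature blocks side by side. A minimal sketch, assuming one numeric block, one one-hot block, and one TF-IDF block (all toy data, not the deck's Stacker Module):

```python
import numpy as np
from scipy import sparse
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.preprocessing import OneHotEncoder

numeric = np.array([[1.0, 20.0], [2.0, 30.0], [3.0, 40.0]])
category = np.array([["a"], ["b"], ["a"]])
text = ["good product", "bad product", "good service"]

cat_block = OneHotEncoder().fit_transform(category)    # sparse, 2 columns
text_block = TfidfVectorizer().fit_transform(text)     # sparse, 4 columns

# Stack column-wise; the dense block is converted so the result stays sparse.
stacked = sparse.hstack([sparse.csr_matrix(numeric), cat_block, text_block]).tocsr()

print(stacked.shape)
```

When every block is dense, `numpy.hstack` does the same job without the sparse conversion.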

Application of models and Decomposition

● We should go for ensemble tree-based models:
  ○ Random Forest Regressor/Classifier
  ○ Extra Trees Regressor/Classifier
  ○ Gradient Boosting Machine Regressor/Classifier
● Linear models can’t be applied without normalization
  ○ For dense features: Standard Scaler normalization
  ○ For sparse features: scale to unit variance only, without centering about the mean
● If the above steps give a “good” model, we can move on to the hyperparameter optimization module; otherwise continue
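The normalization rule for linear models can be sketched as follows. Tree ensembles are fitted on the raw features, while the linear model is wrapped in a pipeline with a scaler; `with_mean=False` is the scikit-learn option that scales to unit variance without centering, which keeps sparse input sparse. The dataset and model settings are assumptions for the example.

```python
from scipy import sparse
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=200, n_features=10, random_state=0)
X_sparse = sparse.csr_matrix(X)   # stand-in for genuinely sparse features

# Tree ensembles are insensitive to feature scale, so no scaler is needed.
tree_model = RandomForestClassifier(n_estimators=50, random_state=0)

# Dense features: center and scale to unit variance.
dense_linear = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# Sparse features: scale to unit variance only; centering would densify the matrix.
sparse_linear = make_pipeline(
    StandardScaler(with_mean=False), LogisticRegression(max_iter=1000)
)

scores = {
    "random_forest": cross_val_score(tree_model, X, y, cv=5).mean(),
    "logreg_dense": cross_val_score(dense_linear, X, y, cv=5).mean(),
    "logreg_sparse": cross_val_score(sparse_linear, X_sparse, y, cv=5).mean(),
}
print(scores)
```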

● For high-dimensional data, PCA is used to decompose
● For images, start with 10-15 components and increase the number as long as results improve
● For other kinds of data, start with 50-60 components
● For text data, we use Singular Value Decomposition after converting the text to a sparse matrix
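A minimal sketch of the two decomposition routes, assuming scikit-learn's `PCA` for dense data and `TruncatedSVD` for a sparse TF-IDF matrix. The random data, the synthetic corpus, and the component counts are placeholders chosen only so the example runs quickly.

```python
import numpy as np
from sklearn.decomposition import PCA, TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer

rng = np.random.RandomState(0)

# Dense, high-dimensional data: start at 50 components, as suggested above.
X_dense = rng.rand(200, 100)
pca = PCA(n_components=50, random_state=0)
X_pca = pca.fit_transform(X_dense)

# Text data: vectorize to a sparse matrix, then apply truncated SVD,
# which works on sparse input without densifying it.
vocab = [f"term{i}" for i in range(30)]
docs = [" ".join(rng.choice(vocab, size=8)) for _ in range(40)]
X_text = TfidfVectorizer().fit_transform(docs)
svd = TruncatedSVD(n_components=5, random_state=0)
X_svd = svd.fit_transform(X_text)

print(X_pca.shape, X_svd.shape)
```

In practice the number of components would be raised step by step while the validation score keeps improving, as the bullets above describe.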

Feature Selection

● Greedy Forward Selection
  ○ Selecting best features iteratively
  ○ Selecting features based on coefficients of model
● Greedy backward elimination
● Use GBM for normal features and Random Forest for sparse features for feature evaluation
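Both selection styles have close counterparts in scikit-learn, sketched below under stated assumptions: `SequentialFeatureSelector` performs the greedy forward search, and `SelectFromModel` keeps features by GBM importance. The estimator choices, the number of features to keep, and the median threshold are illustrative, not taken from the slides.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.feature_selection import SelectFromModel, SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = make_classification(
    n_samples=200, n_features=8, n_informative=3, n_redundant=0, random_state=0
)

# Greedy forward selection: add the feature that most improves the CV score,
# one at a time, until three features are selected.
forward = SequentialFeatureSelector(
    LogisticRegression(max_iter=1000),
    n_features_to_select=3,
    direction="forward",
    cv=3,
)
forward.fit(X, y)
forward_mask = forward.get_support()

# Importance-based selection: keep features whose GBM importance is at
# least the median importance.
gbm_selector = SelectFromModel(
    GradientBoostingClassifier(n_estimators=50, random_state=0),
    threshold="median",
)
gbm_selector.fit(X, y)
X_selected = gbm_selector.transform(X)

print(forward_mask, X_selected.shape)
```

Passing `direction="backward"` to the same selector gives greedy backward elimination; for sparse features, a `RandomForestClassifier` could replace the GBM inside `SelectFromModel`.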

Model selection and HyperParameter tuning

● Most important and fundamental process of Machine Learning

Choice of Model and Hyperparameters

● Classification:
  ○ Random Forest
  ○ GBM
  ○ Logistic Regression
  ○ Naive Bayes
  ○ Support Vector Machines
  ○ k-Nearest Neighbors
● Regression:
  ○ Random Forest
  ○ GBM
  ○ Linear Regression
  ○ Ridge
  ○ Lasso
  ○ SVR
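Joint model selection and hyperparameter tuning can be sketched as a search over several candidate models, each with its own grid. Exhaustive grid search is used here only because it is simple; it stands in for the optimizers mentioned earlier (such as Bayesian optimization). The candidate list, the grids, and the ROC AUC metric are assumptions for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# Each candidate pairs an estimator with a small, illustrative grid.
candidates = {
    "random_forest": (
        RandomForestClassifier(random_state=0),
        {"n_estimators": [50, 100], "max_depth": [3, None]},
    ),
    "logistic_regression": (
        LogisticRegression(max_iter=1000),
        {"C": [0.1, 1.0, 10.0]},
    ),
    "knn": (
        KNeighborsClassifier(),
        {"n_neighbors": [3, 5, 9]},
    ),
}

results = {}
for name, (model, grid) in candidates.items():
    search = GridSearchCV(model, grid, cv=3, scoring="roc_auc")
    search.fit(X, y)
    results[name] = (search.best_score_, search.best_params_)

# The winning model is the one with the highest cross-validated score.
best_name = max(results, key=lambda n: results[n][0])
print(best_name, results[best_name])
```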

Evaluation of Model

Saving all Transformations on Train Data for reuse

Re-Use of saved transformations for Evaluation on validation set
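One way to sketch the save-and-reuse step: fit every transformation on the training split only, serialize the fitted objects, and apply the restored objects unchanged to the validation split. `pickle` and the scaler, PCA, and logistic regression chain are assumptions chosen for the example, not the deck's own pipeline.

```python
import pickle

from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=300, n_features=20, random_state=0)
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Fit every transformation on the training data only.
scaler = StandardScaler().fit(X_train)
pca = PCA(n_components=10, random_state=0).fit(scaler.transform(X_train))
model = LogisticRegression(max_iter=1000).fit(
    pca.transform(scaler.transform(X_train)), y_train
)

# Save the fitted objects (to bytes here; a file would work the same way).
blob = pickle.dumps({"scaler": scaler, "pca": pca, "model": model})

# Later: restore and apply the same transformations, without refitting.
saved = pickle.loads(blob)
X_valid_t = saved["pca"].transform(saved["scaler"].transform(X_valid))
valid_auc = roc_auc_score(y_valid, saved["model"].predict_proba(X_valid_t)[:, 1])
print(X_valid_t.shape, valid_auc)
```

Refitting a scaler or decomposition on the validation data would leak information and make the validation score unreliable, which is why the fitted objects are stored.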

Current Research

Automatic Architecture selection for Neural Network

Automatically Tuned Neural Network

● Auto-Net is a system that automatically configures neural networks
● Achieved the best performance on two datasets in the human expert track of the recent ChaLearn AutoML Challenge
● Works by tuning:
  ○ layer-independent network hyperparameters
  ○ per-layer hyperparameters
● Auto-Net submission reached an AUC score of 90%, while the best human competitor (Ideal Intel Analytics) only reached 80%
● First time an automatically-constructed neural network won a competition dataset
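The split between layer-independent and per-layer hyperparameters can be pictured as a two-level configuration. The sketch below draws one random configuration; every name, range, and choice in it is hypothetical, and random sampling merely stands in for whatever optimizer a real system uses.

```python
import random


def sample_network_config(rng, max_layers=4):
    """Draw one random configuration; all names and ranges are illustrative."""
    n_layers = rng.randint(1, max_layers)
    return {
        # Layer-independent hyperparameters: shared by the whole network.
        "network": {
            "learning_rate": 10 ** rng.uniform(-5, -1),
            "batch_size": rng.choice([32, 64, 128, 256]),
            "n_layers": n_layers,
        },
        # Per-layer hyperparameters: one entry for each layer.
        "layers": [
            {
                "units": rng.choice([64, 128, 256, 512]),
                "dropout": rng.uniform(0.0, 0.5),
                "activation": rng.choice(["relu", "tanh"]),
            }
            for _ in range(n_layers)
        ],
    }


config = sample_network_config(random.Random(0))
print(config["network"]["n_layers"], len(config["layers"]))
```

The number of per-layer entries depends on the sampled depth, which makes this a conditional search space rather than a flat grid.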

Conclusion

● Machine learning (ML) has achieved considerable successes in recent years and an ever-growing number of disciplines rely on it.

● However, its success crucially relies on human machine learning experts to perform various tasks manually

● The rapid growth of machine learning applications has created a demand for off-the-shelf machine learning methods that can be used easily and without expert knowledge

● AutoML is an open research topic and is expected to soon challenge state-of-the-art results in various domains

Thank You
