Chapter 3: SAS Rapid Predictive Modeler


3.1 Introduction

3.2 SAS Rapid Predictive Modeler Process Overview

3.3 SAS Rapid Predictive Modeler Model Settings

3.4 SAS Rapid Predictive Modeler Output

3.5 Saving Model Project Data

3.6 Registering the Model

3.7 Scoring

3.8 SAS Rapid Predictive Modeler Methods in SAS Enterprise Miner

3.9 Opening SAS Rapid Predictive Modeler Diagrams in SAS Enterprise Miner

3.10 Modifying SAS Rapid Predictive Modeler Diagrams

3.1 Introduction

Objectives

Present a typical approach to data mining.

State the key business drivers of SAS Rapid Predictive Modeler.

Present an alternative approach to data mining in which business analysts and subject matter experts develop their own models.

Describe the key capabilities of SAS Rapid Predictive Modeler.

Churn Case Study

Analysis Goal:

A telecommunications company wants to decrease the number of churning customers through the development of a churn classification model.

Data set: CHURN_RPM

Number of rows: 4,708

Number of columns: 15

Contents: account information, call history, equipment and complaint history

Targets: TARGET_CHURN (binary)

Churn Case Study: Basics

Throughout this chapter, you work with data in SAS Enterprise Guide and SAS Enterprise Miner to perform fast and accurate modeling with SAS Rapid Predictive Modeler.

1. Import the CHURN_RPM data.

2. Build the SAS Rapid Predictive Modeler model in SAS Enterprise Guide.

3. Score the CHURN_RPM_SCORE data set.

4. Open the model in SAS Enterprise Miner.

5. Improve the model.

Data Mining and Predictive Analytics: Conventional Approach

Quantitative Modeler / Statistician: model development, deployment, and management (SAS Enterprise Miner, SAS/STAT)

Database Admin / IT: data preparation and data cleansing (SAS Data Integration Studio)

Business Analyst / Subject Matter Expert: apply the model to specific customer issues, for example, finding the customers who are most likely to churn (SAS Enterprise Guide, SAS Add-In for Microsoft Office)

SAS Rapid Predictive Modeler: Key Business Drivers

Numerous models must be generated to support a variety of business problems.

Models need to be developed in a short time frame using a self-service approach.

Business users should not always have to rely on a statistician or modeler.

Business users and modelers collaborate to augment, validate, and deploy models.

SAS Rapid Predictive Modeler: Complementary Approach

Business Analyst / Subject Matter Expert: generate predictive models in a quick, automated fashion; review easy-to-understand reports and charts; register the model in SAS metadata (SAS Enterprise Guide or SAS Add-In for Microsoft Office)

Quantitative Modeler / Statistician: refine the model and perform model comparison; test, validate, and select the champion model; monitor model performance for degradation (SAS Enterprise Miner, SAS Model Manager)

Database Admin / IT: data preparation and data cleansing (SAS Enterprise Data Integration Server)

SAS Rapid Predictive Modeler: Primary Objectives

Generate predictive models quickly and accurately.

Provide self-sufficiency to business users.

Generate easy-to-understand charts and reports.

Integrate analytics and BI for better decisions.

SAS Rapid Predictive Modeler's Target Customers

Across all industries

Those dealing with customer-oriented and marketing-analytics-oriented issues

Those who need to generate numerous models to support a variety of business problems:
– customer acquisition
– up-sell and cross-sell
– customer retention
– customer churn

Business analysts, subject matter experts, and business professionals with little to no statistical knowledge

Statisticians or data miners who need to develop quick baseline models that address common business issues

What Is SAS Rapid Predictive Modeler?

SAS Rapid Predictive Modeler is a customized task that runs prebuilt SAS Enterprise Miner models.

It is an add-in for SAS Enterprise Guide or SAS Add-In for Microsoft Office.

It requires SAS Enterprise Miner and is included in SAS Enterprise Miner packaging.
– It also works with SAS Enterprise Miner for Desktop.

It enables business users, without prior statistical knowledge, to build predictive models quickly and effectively.

Results are presented in simple, easy-to-understand charts that support better decisions.

Key Capabilities

You choose from basic, intermediate, or advanced prebuilt methods.

SAS Rapid Predictive Modeler automatically handles outliers, missing values, rare target events, skewed data, variable selection, and model selection.

Analytic results are presented in easy-to-understand business terms: scorecard, lift charts, and a listing of key variables in the model.

Analytic experts can further customize and improve models developed in SAS Rapid Predictive Modeler using SAS Enterprise Miner.

Models are registered in SAS metadata to
– automate the execution of score code
– ease deployment to other systems.

3.2 SAS Rapid Predictive Modeler Process Overview

Objectives

Give an overview of the SAS Rapid Predictive Modeler process.

SAS Rapid Predictive Modeler Modeling Process: Overview

1. Open SAS Enterprise Guide or Microsoft Excel.

2. Invoke the SAS Rapid Predictive Modeler task.

3. Select the data to model.

4. Define modeling roles (done automatically if variables are aptly named – for example, target_churn).

5. Run.

6. Review results. (You can save and share them.)

7. (optional) Save task to a SAS Enterprise Miner project.

8. (optional) Register the model in SAS metadata.
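Step 4 says that modeling roles are assigned automatically when variables are aptly named. The Python sketch below illustrates the general idea of prefix-based role assignment. The prefix table and every column name other than TARGET_CHURN are assumptions made for illustration; they are not the task's documented rules.

```python
# Hypothetical naming convention: these prefixes are assumptions for
# illustration, not the rules that the Rapid Predictive Modeler task uses.
ROLE_PREFIXES = {
    "target": "target",
    "id": "id",
    "freq": "frequency",
}


def guess_roles(columns):
    """Map each column name to a modeling role based on its name prefix."""
    roles = {}
    for name in columns:
        # Take the text before the first underscore, ignoring case.
        prefix = name.lower().split("_", 1)[0]
        # Anything without a recognized prefix is treated as an input.
        roles[name] = ROLE_PREFIXES.get(prefix, "input")
    return roles


# TARGET_CHURN comes from the case study; the other names are made up.
columns = ["ID_CUSTOMER", "TARGET_CHURN", "CALLS_LAST_MONTH"]
roles = guess_roles(columns)
for name, role in roles.items():
    print(f"{name}: {role}")
```

With a convention like this, the analyst only needs to override the roles that the names do not already make clear.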

Open SAS Enterprise Guide

Invoke SAS Rapid Predictive Modeler

Tasks > Data Mining > Rapid Predictive Modeler

Select the Data for Modeling

Define Modeling Roles

Run

Review Results

Invoking and Running the SAS Rapid Predictive Modeler Task

Churn Case Study

Task: Invoke and execute the Rapid Predictive Modeler task in SAS Enterprise Guide.

3.3 SAS Rapid Predictive Modeler Model Settings

Objectives

Give a high-level overview of the SAS Rapid Predictive Modeler model settings.

SAS Rapid Predictive Modeler: Data Panel

Associate input variables with modeling roles.

Required: dependent variable (target)

Optional:
– Set frequency count.
– Set ID.
– Exclude input variables.
– Edit data and filter.

SAS Rapid Predictive Modeler: Model Panel

Specify the complexity level of the model to build.

Default: Basic

Other methods: Intermediate, Advanced

Other selections: Decisions and priors

SAS Rapid Predictive Modeler: Model Panel

Decisions and Priors

Event level

Prior probabilities

Decision function
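Prior probabilities matter when the share of events in the modeling data differs from the share in the population, for example after oversampling a rare event. The Python sketch below shows one standard way to rescale a predicted probability from the sample proportion to the population prior. The function and parameter names are hypothetical, and the slides do not state that the task uses exactly this formula.

```python
def adjust_for_priors(p_event, sample_rate, prior_rate):
    """Rescale an event probability from sample proportions to population priors.

    p_event:     probability predicted by a model trained on the sample
    sample_rate: proportion of events in the training sample
    prior_rate:  proportion of events assumed in the population
    """
    # Reweight each class by the ratio of its prior to its sample proportion.
    event = p_event * prior_rate / sample_rate
    non_event = (1.0 - p_event) * (1.0 - prior_rate) / (1.0 - sample_rate)
    # Renormalize so the two class probabilities sum to 1.
    return event / (event + non_event)


# A model trained on a balanced sample predicts 0.5; with an assumed
# population event rate of 10 percent the adjusted value is much lower.
adjusted = adjust_for_priors(0.5, sample_rate=0.5, prior_rate=0.1)
print(round(adjusted, 3))
```

When the sample rate equals the prior, the adjustment leaves the probability unchanged.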

SAS Rapid Predictive Modeler: Report Panel

Select additional features to be included in the model summary report.

3.4 SAS Rapid Predictive Modeler Output

Objectives

Review the charts and reports generated as output by SAS Rapid Predictive Modeler.

SAS Rapid Predictive Modeler: Standard Report Output

Model Gains Chart

SAS Rapid Predictive Modeler: Standard Report Output

ROC Chart
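A ROC chart plots the true positive rate against the false positive rate as the classification cutoff varies. The Python sketch below shows how such a curve and the area under it could be computed by hand. The labels and scores are made-up illustration data, not output from the churn model, and the function names are hypothetical.

```python
def roc_points(labels, scores):
    """Return (false positive rate, true positive rate) pairs, one per cutoff."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]
    # Lower the cutoff step by step, from the highest score to the lowest.
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve, computed with the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Made-up data: 1 marks an event, 0 a non-event.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]
curve = roc_points(labels, scores)
print(curve)
print(auc(curve))
```

A curve that hugs the top-left corner, with an area close to 1, indicates a model that separates events from non-events well.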

SAS Rapid Predictive Modeler: Standard Report Output

Scorecard

SAS Rapid Predictive Modeler: Standard Report Output

Project Information

SAS Rapid Predictive Modeler: Optional Report Output

Model Summarization

SAS Rapid Predictive Modeler: Optional Report Output

Variable Ranking

SAS Rapid Predictive Modeler: Optional Report Output

Crosstabulations

SAS Rapid Predictive Modeler: Optional Report Output

Classification Matrix
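A classification matrix cross-tabulates actual outcomes against predicted outcomes at a chosen cutoff. The Python sketch below shows the counting involved. The data, the 0.5 cutoff, and the function name are assumptions for illustration only.

```python
from collections import Counter


def classification_matrix(actual, predicted_prob, cutoff=0.5):
    """Count actual versus predicted outcomes for a binary target."""
    counts = Counter()
    for y, p in zip(actual, predicted_prob):
        # Predict the event when the probability reaches the cutoff.
        predicted = 1 if p >= cutoff else 0
        counts[(y, predicted)] += 1
    return {
        "true_positive": counts[(1, 1)],
        "false_negative": counts[(1, 0)],
        "false_positive": counts[(0, 1)],
        "true_negative": counts[(0, 0)],
    }


# Made-up outcomes and predicted probabilities.
actual = [1, 0, 1, 0, 0, 1]
probs = [0.8, 0.4, 0.3, 0.6, 0.1, 0.9]
matrix = classification_matrix(actual, probs)
print(matrix)
```

Changing the cutoff trades false positives against false negatives, which is why the matrix is usually read together with the ROC chart.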

SAS Rapid Predictive Modeler: Optional Report Output

Fit Statistics

SAS Rapid Predictive Modeler: Optional Report Output

Cumulative Lift Plot
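Cumulative lift compares the event rate among the highest-scored records with the event rate in the whole data set. The Python sketch below computes it for a small made-up sample. The data, the number of bins, and the function name are assumptions for illustration; the task's own binning may differ.

```python
def cumulative_lift(labels, scores, bins=10):
    """Cumulative lift at each depth: top-scored event rate over overall rate."""
    # Rank the outcomes from the highest score to the lowest.
    ranked = [y for _, y in sorted(zip(scores, labels),
                                   key=lambda pair: pair[0], reverse=True)]
    overall_rate = sum(ranked) / len(ranked)
    lifts = []
    for b in range(1, bins + 1):
        # Number of records in the top b bins (at least one record).
        depth = max(1, round(len(ranked) * b / bins))
        top = ranked[:depth]
        lifts.append((sum(top) / len(top)) / overall_rate)
    return lifts


# Made-up data: 1 marks a churner, 0 a non-churner.
labels = [1, 1, 0, 1, 0, 0, 0, 0, 1, 0]
scores = [0.95, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1]
lifts = cumulative_lift(labels, scores, bins=5)
print(lifts)
```

The last value is always 1, because at full depth the selected records are the whole data set.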

SAS Rapid Predictive Modeler: Optional Report Output

Model Comparison*

* Only available with intermediate or advanced methods

3.5 Saving Model Project Data

Objectives

Demonstrate how SAS Enterprise Miner project data from a SAS Rapid Predictive Modeler model can be saved for later inspection and refinement.

SAS Rapid Predictive Modeler: Options Panel

Save SAS Enterprise Miner project data from your SAS Rapid Predictive Modeler model.

3.6 Registering the Model

Objectives

Show how a SAS Rapid Predictive Modeler model can be registered to the SAS Metadata Repository and explain why this might be necessary.

Register the SAS Rapid Predictive Modeler Model

Register the model to the SAS Metadata Repository.

Use Cases:

Import and score using the Model Scoring task in SAS Enterprise Guide.

Import into SAS Enterprise Miner using the Model Import node for integrated model comparison.

Import into SAS Model Manager for champion/challenger model management.

Import into SAS Data Integration Studio to score with mining results transformation.

Publish as a scoring function for Teradata, Netezza, or IBM DB2.

3.7 Scoring

Objectives

Demonstrate how a new data set can be scored with SAS Rapid Predictive Modeler.

Discuss the steps of the model scoring task.

Model Scoring with SAS Rapid Predictive Modeler

Tasks > Data Mining > Model Scoring

Verify Data

Select Scoring Model

Map Variables

Select Output

Save Output Data

Confirm and Finish

Scoring Results

Scoring Review

Score new data using the Model Scoring task in SAS Enterprise Guide.

Score new data with SAS Enterprise Miner.
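Conceptually, scoring means applying a fitted model's formula to records that were not used to build it. The Python sketch below illustrates this with a logistic regression formula. The intercept, coefficients, and variable names are invented for illustration; in practice the score code comes from the registered model, and nothing here reflects the actual churn model.

```python
import math

# Hypothetical coefficients for illustration only; real score code is
# generated from the registered model, not written by hand like this.
INTERCEPT = -2.0
COEFFICIENTS = {"complaints": 0.8, "months_on_contract": -0.05}


def score_row(row):
    """Return the predicted event probability for one record."""
    linear = INTERCEPT + sum(COEFFICIENTS[name] * row[name] for name in COEFFICIENTS)
    # The logistic function maps the linear predictor to a probability.
    return 1.0 / (1.0 + math.exp(-linear))


def score_table(rows, cutoff=0.5):
    """Append a probability and a predicted class to every record."""
    scored = []
    for row in rows:
        p = score_row(row)
        scored.append({**row, "p_churn": p, "predicted_churn": int(p >= cutoff)})
    return scored


# Made-up new customers to score.
new_customers = [
    {"complaints": 4, "months_on_contract": 2},
    {"complaints": 0, "months_on_contract": 36},
]
scored = score_table(new_customers)
for record in scored:
    print(record)
```

The input columns must match the variables the model expects, which is what the Map Variables step of the Model Scoring task takes care of.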

3.8 SAS Rapid Predictive Modeler Methods in SAS Enterprise Miner

Objectives

Discuss the basic, intermediate, and advanced methods.

Show how these methods translate to SAS Enterprise Miner diagrams.

Basic Method: Eight Nodes

Samples the data only if the target event is rare

Uses a decision tree for variable selection

Fits a forward stepwise regression model

Intermediate Method: 18 Nodes

Builds on the basic method

Performs several variable selection techniques

Applies multiple variable transformations

Uses decision tree and regression models

Considers variable interactions

Advanced Method: 32 Nodes

Builds on the intermediate method

Includes neural network, advanced regression, and ensemble models

3.9 Opening SAS Rapid Predictive Modeler Diagrams in SAS Enterprise Miner

Objectives

Demonstrate how a SAS Rapid Predictive Modeler diagram can be opened in SAS Enterprise Miner.

Opening the SAS Rapid Predictive Modeler Project in SAS Enterprise Miner

1. Open SAS Enterprise Miner.

2. Select New Project.

3. Point to the folder where the SAS Rapid Predictive Modeler project resides.

Select Yes for the Project Exist prompt.

Provide the location of the SAS metadata folder.

SAS Rapid Predictive Modeler Project Opened in SAS Enterprise Miner

3.10 Modifying SAS Rapid Predictive Modeler Diagrams

Objectives

Demonstrate how a SAS Rapid Predictive Modeler diagram can be modified in SAS Enterprise Miner.

Demonstrate SAS Rapid Predictive Modeler on a census income data set.

Changing the Default Settings

Census Income Case Study

Analysis Goal:

The goal is to develop a model to predict whether a person makes more than 50K a year based on census data.

Data set: ADULT

Number of rows: 32,562

Number of columns: 14

Contents: age, work class, education, marital status, race, sex, capital gain or loss, hours worked, and so on

Targets: TARGET (binary)

Census Income Case Study: Basics

This case study demonstrates how to invoke SAS Rapid Predictive Modeler from SAS Enterprise Guide and then open the generated project in SAS Enterprise Miner.

1. Import the ADULT data.

2. Build the SAS Rapid Predictive Modeler model in SAS Enterprise Guide.

3. Open the model in SAS Enterprise Miner.

4. Inspect the results.


Invoking and Running the SAS Rapid Predictive Modeler Task

Census Income Case Study

Task: Given a set of attributes, use the Rapid Predictive Modeler task to determine whether a person makes more than 50K a year.


Exercise

This exercise reinforces the concepts discussed previously.

Resources

SAS Global Forum 2010 paper by Wayne Thompson and David Duling: support.sas.com/resources/papers/proceedings10/113-2010.pdf

UCI Machine Learning Repository: Frank, A. and A. Asuncion. 2010. UCI Machine Learning Repository [archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.