24 Hours of PASS -- Enterprise Data Mining with SQL Server

37
Enterprise Data Mining with SQL Server Mark Tabladillo Ph.D. Microsoft MVP MarkTab Consulting March 21, 2012

description

This presentation introduces SQL Server Data Mining (SSDM) for SQL Server Professionals based on the speaker's past presentation for Microsoft TechEd. Starting with SQL Server Management Studio (SSMS), the demo includes the interfaces important for professional development, including Business Intelligence Development Studio (BIDS), highlighting Integration Services, and PowerShell. The interactive demos are based on Microsoft's Contoso Retail sample data. Finally we will evaluate where Microsoft data mining can help you in a practical business environment, which may include Oracle and SAS.

Transcript of 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Page 1: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Enterprise Data Mining with SQL Server

Mark Tabladillo Ph.D.Microsoft MVPMarkTab Consulting

March 21, 2012

Page 2: 24 Hours of PASS -- Enterprise Data Mining with SQL Server
Page 3: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

About Mark Tabladillo

• 20 Years in Atlanta, Georgia• Consulting since 1998; Incorporated 2003

– Part-Time Faculty at University of Phoenix• SAS and Microsoft Expert

– Presenter since 1998 at conferences like Microsoft TechEd and SAS Global Forum

• Taught statistics at undergraduate and graduate level• Blog: http://marktab.net @MarkTabNet

3

Page 4: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Enterprise:Leaders of Leaders of

Leaders

Page 5: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Enterprise Challenge

Page 6: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Enterprise Challenge

Page 7: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Enterprise Challenge

Page 8: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Enterprise Challenge

Page 9: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

“Data Mining”

Page 10: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Definitions

Phrase Goal

“Data Mining” Inform actionable decisions

“Machine Learning”

Determine best performingalgorithm

Page 11: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Data Mining > Just Drilldown

Query Typical Result

T‐SQL Exact values and calculations

MDX Exact values and calculations

DAX Exact values and calculations

DMX Values plus probabilities

Page 12: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

SQL Server 2008 R2:

Physical and Logical

Page 13: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

OLAP EnginePhysical Architecture

• http://msdn.microsoft.com/en-us/library/ms174776.aspx

Page 14: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Analysis Services Logical Architecture

• http://msdn.microsoft.com/en-us/library/ms174587.aspx

Page 15: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Outline

• Contoso Retail and Fundamentals• Enterprise-Level Data Mining Demo for

SQL Server• What is my next step?

Page 16: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

What is Contoso Retail?

• Demonstration dataset for SQL Server Database Engine and Analysis Services

• http://www.microsoft.com/downloads/en/details.aspx?displaylang=en&FamilyID=868662dc-187a-4a85-b611-b7df7dc909fc

Page 17: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

What are the fundamentals?

Reading

Writing

Arithmetic

‘Readin’

‘Ritin’

‘Rithmetic

Page 18: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

What Enterprise Tools support Data Mining?

• SQL Server Management Studio (SSMS)• Business Intelligence Development Studio

(BIDS)– SQL Server Integration Services (SSIS)

• PowerShell version 2

Page 19: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

What Enterprise Tools support Data Mining?

Data Mining

SSMS SSIS PowerShell

Page 20: 24 Hours of PASS -- Enterprise Data Mining with SQL Server
Page 21: 24 Hours of PASS -- Enterprise Data Mining with SQL Server
Page 22: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Variable 0 1 2 3 4 5 6 7

Discretized

Discretized

Continuous

Discrete

Page 23: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Variable 0 1 2 3 4 5 6 7

Discretized

Discretized

Continuous

Discrete

Page 24: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Variable 0 1 2 3 4 5 6 7

Discretized

Discretized

Continuous

Discrete

Page 25: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Variable 0 1 2 3 4 5 6 7

Discretized

Discretized

Continuous

Discrete

Page 26: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Variable 0 1 2 3 4 5 6 7

Discretized

Discretized

Continuous

Discrete

Page 27: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Documentation

• Data Mining Structures– http://msdn.microsoft.com/en-us/library/cc645741.aspx– http://msdn.microsoft.com/en-us/library/ms174757.aspx

• Data Mining Models– http://msdn.microsoft.com/en-us/library/cc645779.aspx

Page 28: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Contoso Retail:Enterprise Data Mining

Demonstration

Page 29: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

What is my next step?

• SQL Server 2008 R2 Enterprise (includes database engine, Analysis Services, SSMS and BIDS)– http://www.microsoft.com/sqlserver/2008/en/us/trial-software.aspx

• Microsoft Office 2010 Professional– http://office.microsoft.com/en-us/try

• PowerShell 2.0– http://support.microsoft.com/kb/968929

• Data Mining Portal and Blog– http://www.marktab.net

Page 30: 24 Hours of PASS -- Enterprise Data Mining with SQL Server
Page 31: 24 Hours of PASS -- Enterprise Data Mining with SQL Server
Page 32: 24 Hours of PASS -- Enterprise Data Mining with SQL Server
Page 33: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

• Data mining leaders can tackle enterprise data mining challenges with– SQL Server Management Studio– Business Intelligence Development Studio– PowerShell version 2

• Become leaders of leaders of leaders

Conclusion

Page 34: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Where Can I Find More Information?

• http://marktab.net Data Mining Resource• http://marktab.net/datamining Data Mining Blog• http://sqlserverdatamining.com SQL Server Data Mining• http://technet.microsoft.com Microsoft’s TechNet

Page 35: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Graphics

• Ship graphics Copyright © 1995-2006 Nova Development and its licensors. All rights reserved. Used with permission.

Page 36: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Abstract

This presentation introduces SQL Server Data Mining (SSDM) for SQL Server Professionals based on the speaker's past presentation for Microsoft TechEd. Starting with SQL Server Management Studio (SSMS), the demo includes the interfaces important for professional development, including Business Intelligence Development Studio (BIDS), highlighting Integration Services, and PowerShell. The interactive demos are based on Microsoft's Contoso Retail sample data. Finally we will evaluate where Microsoft data mining can help you in a practical business environment, which may include Oracle and SAS.

Online Video:http://channel9.msdn.com/Events/TechEd/NorthAmerica/2011/DBI326

36

Page 37: 24 Hours of PASS -- Enterprise Data Mining with SQL Server

Thank You to our Sponsors