data warehousing and data mining

9
Data Mining & Data Warehousing E2MATRIX RESEARCH LAB COMPLETE THESIS & IEEE PROJECT HELP E 2 m a t r i x 1 E2matrix Opp Phagaara Bus Stand, Parmar Complex, Backside Axis Bank. Phagwara, Punjab, Call : +91 9041262727

Transcript of data warehousing and data mining

Page 1: data warehousing and data mining

1

Data Mining & Data WarehousingE2MATRIX RESEARCH LABCOMPLETE THESIS & IEEE PROJECT HELP

E2m

atrix

E2matrixOpp Phagaara Bus Stand,Parmar Complex, Backside Axis Bank.Phagwara, Punjab,Call : +91 9041262727

Page 2: data warehousing and data mining

Introduction Outline

Define data mining Data mining vs. databases Basic data mining tasks Data mining development Data mining issues

E2m

atrix

2

Goal: Provide an overview of data mining.

Page 3: data warehousing and data mining

Introduction Data is produced at a phenomenal rate Our ability to store has grown Users expect more sophisticated information How?

E2m

atrix

3

UNCOVER HIDDEN INFORMATIONDATA MINING

Page 4: data warehousing and data mining

Data Mining

Objective: Fit data to a model Potential Result: Higher-level meta information that may not be

obvious when looking at raw data Similar terms

Exploratory data analysis Data driven discovery Deductive learning

E2m

atrix

4

Page 5: data warehousing and data mining

Data Mining Algorithm

Objective: Fit Data to a Model Descriptive Predictive

Preferential Questions Which technique to choose?

ARM/Classification/Clustering Answer: Depends on what you want to do with data?

Search Strategy – Technique to search the data Interface? Query Language? Efficiency

E2m

atrix

5

Page 6: data warehousing and data mining

Database Processing vs. Data Mining Processing

Query Well defined SQL

Query Poorly defined No precise query language

E2m

atrix

6

Output– Precise– Subset of database

Output– Fuzzy– Not a subset of database

Page 7: data warehousing and data mining

Query Examples Database

Data Mining

E2m

atrix

7

– Find all customers who have purchased milk

– Find all items which are frequently purchased with milk. (association rules)

– Find all credit applicants with last name of Smith.– Identify customers who have purchased more

than $10,000 in the last month.

– Find all credit applicants who are poor credit risks. (classification)

– Identify customers with similar buying habits. (Clustering)

Page 8: data warehousing and data mining

Data Mining Models and Tasks

E2m

atrix

8

Page 9: data warehousing and data mining

Basic Data Mining Tasks Classification maps data into predefined groups

or classes Supervised learning Pattern recognition Prediction

Regression is used to map a data item to a real valued prediction variable.

Clustering groups similar data together into clusters. Unsupervised learning Segmentation Partitioning

E2m

atrix

9