Data Analysis: Basic Data Modeling and Evaluation

DATA ANALYSIS: Basic Data Modeling and Evaluation. Md Main Uddin Rony, Software Engineer, Infolytx, Inc.


Page 1: Data Analysis: Basic Data Modeling and Evaluation

DATA ANALYSIS

Basic Data Modeling and Evaluation

Md Main Uddin Rony

Software Engineer, Infolytx, Inc.

Page 2: Data Analysis: Basic Data Modeling and Evaluation

Outline

❏ Little Insight of Data Analysis

- What is this?

- Steps of Data Analysis

❏ Introduction of a Data Analysis Tool

- Tool's overview, history

- Its offerings

- Why should we use it?

❏ Hands-On of the Tool

- Download

- Introduce its interface

- Building a Model

- Evaluate the Model

Page 3: Data Analysis: Basic Data Modeling and Evaluation

Data Analysis

Analysis of data is a process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, suggesting conclusions, and supporting decision-making. (From Wikipedia)

❏ Acquiring meaningful insights from a dataset

❏ Structuring the findings from survey research or other means of data collection

❏ Breaking a macro picture into a micro one

❏ Basing critical decisions on findings

❏ Ruling out human bias through proper statistical treatment

❏ Used in industry to allow companies and organizations to make better business decisions

❏ Used to determine whether the systems in place effectively protect data, operate efficiently, and succeed in accomplishing an organization's goals

And:

Data analysis and data mining are not the same!
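To make the four verbs in the definition on this slide concrete (inspecting, cleaning, transforming, modeling), here is a minimal Python sketch on a tiny made-up table. The records, the field names, and the choice of a per-group mean as the "model" are all assumptions for illustration; none of them come from the slides.

```python
from statistics import mean

# Hypothetical survey records; None marks a missing answer.
records = [
    {"region": "north", "spend": 120.0},
    {"region": "north", "spend": None},
    {"region": "south", "spend": 80.0},
    {"region": "south", "spend": 100.0},
    {"region": "north", "spend": 140.0},
]

# Inspect: count how many rows have a missing value.
missing = sum(1 for r in records if r["spend"] is None)

# Clean: drop incomplete rows.
clean = [r for r in records if r["spend"] is not None]

# Transform: group the spend values by region.
by_region = {}
for r in clean:
    by_region.setdefault(r["region"], []).append(r["spend"])

# Model: summarize each group with its mean (a deliberately simple model).
summary = {region: mean(values) for region, values in by_region.items()}

print(f"missing rows: {missing}")
print(summary)
```

A real analysis would usually fill or investigate missing values rather than drop them blindly; dropping is used here only to keep the sketch short.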

Page 4: Data Analysis: Basic Data Modeling and Evaluation

Steps of Data Analysis

Page 5: Data Analysis: Basic Data Modeling and Evaluation

RapidMiner

❏ A powerful, easy-to-use, and intuitive graphical user interface for designing analytical processes

❏ Formerly known as YALE (Yet Another Learning Environment)

❏ Developed in 2001 by Ralf Klinkenberg, Ingo Mierswa and Simon Fischer

❏ YALE was renamed RapidMiner in 2007

Page 6: Data Analysis: Basic Data Modeling and Evaluation

What Can RapidMiner Offer?

❏ An integrated environment written in Java for

- machine learning

- data mining

- text mining

- predictive analysis

- business analytics

❏ Provides a GUI to design and execute analytical workflows (Process, Operator)

❏ Provides 99% of an advanced analytical solution through template-based frameworks that speed delivery and reduce errors

❏ Nearly eliminates the need to write code
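The Process/Operator idea mentioned above can be pictured as a chain of small steps, each taking a data table and returning a new one. The Python sketch below is only a conceptual analogue: `Operator`, `run_process`, and the two example operators are hypothetical names invented here, not part of RapidMiner's API, where the same chaining is done visually in the GUI.

```python
from typing import Callable, Dict, List

Row = Dict[str, float]
Table = List[Row]
Operator = Callable[[Table], Table]  # hypothetical: one step in a workflow


def filter_positive(table: Table) -> Table:
    """Keep only rows whose 'value' is positive."""
    return [row for row in table if row["value"] > 0]


def add_square(table: Table) -> Table:
    """Add a derived 'square' column to every row."""
    return [{**row, "square": row["value"] ** 2} for row in table]


def run_process(operators: List[Operator], table: Table) -> Table:
    """Run the operators in order, feeding each output into the next."""
    for op in operators:
        table = op(table)
    return table


data = [{"value": 3.0}, {"value": -1.0}, {"value": 2.0}]
result = run_process([filter_positive, add_square], data)
print(result)
```

Because every operator has the same input and output type, steps can be reordered or swapped without touching the rest of the chain, which is roughly what a drag-and-drop workflow designer relies on.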

Page 7: Data Analysis: Basic Data Modeling and Evaluation

Why Should We Use RapidMiner?

❏ Powerful due to its learning operators and operator framework

❏ Easy to extend for Java programmers

❏ Stable

❏ Scalable

❏ Algorithms are optimized for speed

❏ Great visualization tools

❏ Tools available for data preprocessing

❏ Strong community (http://rapid-i.com/)

❏ Better debugging

❏ Wide range of supported file formats

❏ Easy Hadoop integration

Page 8: Data Analysis: Basic Data Modeling and Evaluation

Download RapidMiner

❏ Open source

❏ Easy to download and set up

❏ Link: https://rapidminer.com/products/studio/

Page 9: Data Analysis: Basic Data Modeling and Evaluation

RapidMiner Interface

Page 10: Data Analysis: Basic Data Modeling and Evaluation

Building a Model Using RapidMiner

1. Importing Data

2. Visualizing Data

3. Creating a Model

4. Applying a Model

5. Evaluation of the Model
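In RapidMiner these five steps are carried out with operators in the GUI. As a rough plain-Python analogue, the sketch below walks through the same sequence on made-up data. Everything in it is an assumption for illustration: the inline CSV, the column names, the 6/2 train/test split, and the nearest-centroid classifier (chosen only because it fits in a few lines).

```python
import csv
import io
import random
from statistics import mean

# Made-up data standing in for an imported file (assumption for illustration).
RAW = """length,width,label
1.0,0.9,small
1.2,1.1,small
0.8,1.0,small
1.1,0.8,small
3.0,3.2,large
3.3,2.9,large
2.8,3.1,large
3.1,3.0,large
"""

# 1. Import the data.
rows = [
    {"x": (float(r["length"]), float(r["width"])), "label": r["label"]}
    for r in csv.DictReader(io.StringIO(RAW))
]

# 2. "Visualize" the data: a text summary of the mean length per label.
for label in sorted({r["label"] for r in rows}):
    lengths = [r["x"][0] for r in rows if r["label"] == label]
    print(f"{label}: mean length = {mean(lengths):.2f}")

# Hold out part of the data so the evaluation uses unseen rows.
random.Random(0).shuffle(rows)
train, test = rows[:6], rows[6:]


# 3. Create a model: one centroid (mean point) per label.
def fit_centroids(data):
    grouped = {}
    for r in data:
        grouped.setdefault(r["label"], []).append(r["x"])
    return {
        label: tuple(mean(coord) for coord in zip(*points))
        for label, points in grouped.items()
    }


# 4. Apply the model: predict the label of the nearest centroid.
def predict(centroids, x):
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


centroids = fit_centroids(train)
predictions = [predict(centroids, r["x"]) for r in test]

# 5. Evaluate the model: accuracy on the held-out rows.
correct = sum(p == r["label"] for p, r in zip(predictions, test))
accuracy = correct / len(test)
print(f"accuracy = {accuracy:.2f}")
```

The key design point is that evaluation uses rows the model never saw during training. With only two held-out rows the accuracy figure is not meaningful in itself; it only shows where evaluation sits in the sequence.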

[Figure: process cycle around the Data: Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment (the phases of the CRISP-DM process model)]

Page 11: Data Analysis: Basic Data Modeling and Evaluation

QUESTIONS?

Page 12: Data Analysis: Basic Data Modeling and Evaluation

THANK YOU