Presentation on Big Data Hadoop (Summer Training Demo)

Post on 22-May-2015

174 views 2 download

Tags:

description

Demo presentation about summer training. for hadoop tutorials visit - hadoop-beginners.blogspot.com

Transcript of Presentation on Big Data Hadoop (Summer Training Demo)

POORNIMA INSTITUTE OF ENGINEERING & TECHNOLOGY, JAIPUR

DEPARTMENT OF COMPUTER ENGINEERING

APRACTICAL TRAINING PRESENTATION

ONBIG DATA HADOOP

SESSION 2014 – 15

Presented By: Guided By:Ashutosh Tiwari Dr. E.S. PilliCE/11/083 Assistant ProfessorAshok Rayal CS, DepartmentCE/11/025 MNIT, Jaipur.

Topics

1. Organization Details

2. Training Details

3. Technology Specification

4. Project Summary

5. Snapshots

6. Conclusion

ORGANIZATION PROFILE

Name-Malviya National Institute of Techonology, Jaipur MNIT, Jaipur is one of 30 national institutes of technology in

India. MNIT, established in 1963 inspired by Pt. Madan Mohan

Malviya. The institute's director is I. K. Bhat and the chairman of the

board of Governors is Dr. K. K. Aggarwal. Organization’s contacts:

Email : espilli.cse@mnit.ac.in Website : www.mnit.ac.in

Training Details

Start Date: 28/05/2014 Last Date: 9/07/2014 No. Of Days: 45(30+15). Timing: 9 AM to 5 PM Our training at MNIT were broadly divided into three phases:

o Case study of Hadoop and related papers (first 30 days).

o Hadoop cluster making (first 30 days).o Implementation of Near Duplicate Detection Using

Hadoop MapReduce (last 15 days).

ABOUT PROJECT

Near Duplicate Detection:

Comparative analysis of millions documents exist in network jargon to find similar document based on a predefined threshold value.

Near duplicate detection is essentially used in web crawls and many others data mining tasks.

TECHNOLOGY SPECIFICATION OF PROJECT

Project: Near Duplicate Detection

Technology Used:

Hadoop Map Reduce HDFS

SSH and Shell Scripting Java

SNAPSHOTS-HDFS

SNAPSHOTS-MAPREDUCE PROCESSING

SNAPSHOTS-OUTPUT

CONCLUSION

Training in big data helped us to know what is the crazy trend in IT industries and how technology is becoming more fruitful to human development.

Big Data is the future. Currently A lot of research is going on in this field. As data is increasing at faster rate thus there is a huge need of such tools and technology which can handle it.

Hadoop is the most emerging framework used by most of big firms like Facebook, Microsoft, IBM, Yahoo, Amazon and lots of other more.

Our experience at MNIT, was absolutely awesome as it has given as the platform and support for our tasks and case study.