Presentation on Big Data Hadoop (Summer Training Demo)


description

Demo presentation about summer training. For Hadoop tutorials, visit hadoop-beginners.blogspot.com

Transcript of Presentation on Big Data Hadoop (Summer Training Demo)

Page 1: Presentation on Big Data Hadoop (Summer Training Demo)

POORNIMA INSTITUTE OF ENGINEERING & TECHNOLOGY, JAIPUR

DEPARTMENT OF COMPUTER ENGINEERING

A PRACTICAL TRAINING PRESENTATION

ON BIG DATA HADOOP

SESSION 2014 – 15

Presented By: Ashutosh Tiwari (CE/11/083), Ashok Rayal (CE/11/025)

Guided By: Dr. E.S. Pilli, Assistant Professor, CS Department, MNIT, Jaipur.

Page 2: Presentation on Big Data Hadoop (Summer Training Demo)

Topics

1. Organization Details

2. Training Details

3. Technology Specification

4. Project Summary

5. Snapshots

6. Conclusion

Page 3: Presentation on Big Data Hadoop (Summer Training Demo)

ORGANIZATION PROFILE

Name: Malaviya National Institute of Technology (MNIT), Jaipur

MNIT, Jaipur is one of 30 National Institutes of Technology in India. It was established in 1963, inspired by Pt. Madan Mohan Malaviya.

The institute's director is I. K. Bhat, and the chairman of the Board of Governors is Dr. K. K. Aggarwal.

Organization's contacts:

Email: [email protected]

Website: www.mnit.ac.in

Page 4: Presentation on Big Data Hadoop (Summer Training Demo)

Training Details

Start Date: 28/05/2014

Last Date: 9/07/2014

No. of Days: 45 (30+15)

Timing: 9 AM to 5 PM

Our training at MNIT was broadly divided into three phases:

o Case study of Hadoop and related papers (first 30 days).

o Hadoop cluster setup (first 30 days).

o Implementation of Near Duplicate Detection using Hadoop MapReduce (last 15 days).

Page 5: Presentation on Big Data Hadoop (Summer Training Demo)

ABOUT PROJECT

Near Duplicate Detection:

Comparative analysis of the millions of documents that exist on a network, to find similar documents based on a predefined threshold value.

Near duplicate detection is essential in web crawling and many other data mining tasks.
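
The idea of comparing documents against a predefined threshold can be sketched in plain Java. The slides do not say which similarity measure or threshold the project used, so this example assumes Jaccard similarity over word shingles; the class name, method names, shingle size and the 0.5 threshold are all hypothetical and chosen only for illustration. It runs on a single machine and does not use Hadoop.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

public class NearDuplicateSketch {

    // Break a document into overlapping runs of k words ("shingles").
    // Assumption: words are separated by whitespace and case is ignored.
    public static Set<String> shingles(String text, int k) {
        String[] words = text.toLowerCase().trim().split("\\s+");
        Set<String> result = new HashSet<>();
        for (int i = 0; i + k <= words.length; i++) {
            result.add(String.join(" ", Arrays.copyOfRange(words, i, i + k)));
        }
        return result;
    }

    // Jaccard similarity: size of the intersection divided by size of the union.
    public static double jaccard(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 1.0; // two empty documents are treated as identical
        }
        Set<String> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        int unionSize = a.size() + b.size() - intersection.size();
        return (double) intersection.size() / unionSize;
    }

    // Two documents are near duplicates when their similarity
    // reaches the predefined threshold.
    public static boolean isNearDuplicate(String d1, String d2, int k, double threshold) {
        return jaccard(shingles(d1, k), shingles(d2, k)) >= threshold;
    }

    public static void main(String[] args) {
        String d1 = "the quick brown fox jumps over the lazy dog";
        String d2 = "the quick brown fox leaps over the lazy dog";
        String d3 = "hadoop stores data in blocks across a cluster";

        // d1 and d2 share 6 of 10 distinct two-word shingles (similarity 0.6).
        System.out.println(isNearDuplicate(d1, d2, 2, 0.5)); // true
        // d1 and d3 share no shingles (similarity 0.0).
        System.out.println(isNearDuplicate(d1, d3, 2, 0.5)); // false
    }
}
```

Comparing every pair of documents this way grows quadratically with the number of documents, which is one reason such a task is typically distributed across a cluster for large collections.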

Page 6: Presentation on Big Data Hadoop (Summer Training Demo)

TECHNOLOGY SPECIFICATION OF PROJECT

Project: Near Duplicate Detection

Technology Used:

o Hadoop MapReduce

o HDFS

o SSH and Shell Scripting

o Java

Page 7: Presentation on Big Data Hadoop (Summer Training Demo)

SNAPSHOTS-HDFS

Page 8: Presentation on Big Data Hadoop (Summer Training Demo)

SNAPSHOTS-MAPREDUCE PROCESSING

Page 9: Presentation on Big Data Hadoop (Summer Training Demo)

SNAPSHOTS-OUTPUT

Page 10: Presentation on Big Data Hadoop (Summer Training Demo)

CONCLUSION

Training in big data helped us understand the current trend in the IT industry and how technology is becoming more beneficial to human development.

Big Data is the future. Currently, a lot of research is going on in this field. As data is growing at an ever faster rate, there is a huge need for tools and technologies that can handle it.

Hadoop is an emerging framework used by many big firms such as Facebook, Microsoft, IBM, Yahoo, Amazon, and many others.

Our experience at MNIT was excellent, as it gave us the platform and support for our tasks and case study.
