System for Consolidation and Classification of Data and Event Logs … · System for Consolidation...

12
System for Consolidation and Classification of Data and Event Logs From MOOCs Pankaj Kumar Under the guidance of Prof.Deepak B.Phatak Dept of CSE IIT BOMBAY May 4, 2015

Transcript of System for Consolidation and Classification of Data and Event Logs … · System for Consolidation...

Page 1: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

System for Consolidation and Classification ofData and Event Logs From MOOCs

Pankaj Kumar

Under the guidance ofProf.Deepak B.Phatak

Dept of CSEIIT BOMBAY

May 4, 2015

Page 2: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Introduction

I The edX log files contain many fields.

I Events are stored in JSON documents, which can be difficultto read at a glance.

I There are three types of event, which are student event,instructor event and alphabetical event.

I Student event has different event types like load video, playvideo, pause video, stop video, create account etc.

Page 3: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Problem statement

System for Consolidation and Classification of Data andEvent Logs From MOOCs

I The main objective of tracking log file is to classify the datain a structured way.

I This file contain many fields, Which are user name, host,event source, event type, context, user id, org id, course id,time, agent and page.

I These fields are at the root level of the event JSONdocuments.

Page 4: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Features of the Designed System

I After tracking log file, we can see easily, the number of usersvisiting an event type.

I Knowing this we can also count, how many times a uservisited same event type.

I Finally the total number of times visited by each user for thesame event type was calculated.

Page 5: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Edx Tracking.log File

Page 6: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Work done

I Initially a python script was written to do the following

I During the tracking edx tracking.log file the number ofdistinct user types were found.

I After this the users and the number of times they visited aparticular event were found.

I In the end the total number of visits by users for each eventwas calculated by finding the aggregate sum.

I The end results were then displayed in the form of an HTMLtable using PHP.

Page 7: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Result of Visited Event Type

Page 8: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Result in Table Format

I Initially the results were accumulated in text file, which is notconvenient to display.

I The text file is then processed with the help of HTML andPHP and table named LOG DATA DETAILS is created.

I It looks very organized manner.

Page 9: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Result in Table Format

Page 10: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Conclusion

The following is the work proposed

I The main objective of our tracking log file is to classify thedata in a structured way.

I So we can extract personally identifiable information and wecan also count how many time each user visited same eventtype.

I Finally we are storing data in table ,which are consists fourcolumn.

I So we can see how many user load video, play video, pausevideo, stop video, create account, seek video, changeenrollment, save problem success, problem save, dashboardand so on.

I All classified and counted data store in table. So we can seeeasily.

Page 11: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

References

Guide Tracking Logs. https://edx.readthedocs.org/en/latest/internal dataformats/tracking logs.html.

Page 12: System for Consolidation and Classification of Data and Event Logs … · System for Consolidation and Classi cation of Data and Event Logs From MOOCs I The main objective of tracking

Thank You