Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

32
CCLTracker Framework Monitoring users learning and ac7vity in web based ci7zen science projects Jose Luis FernandezMarquez [email protected]

Transcript of Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Page 1: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

CCLTracker  Framework    

Monitoring  users  learning  and  ac7vity    in  web  based  ci7zen  science  projects  

Jose  Luis  Fernandez-­‐Marquez  

[email protected]  

Page 2: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Outline  

•  Mo7va7on  •  Background  (Google  Analy7cs)  •  CCLTracker  Framework  

•  Conclusions  •  Experiment  and  demo  

Page 3: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Mo7va7on  

   Bounce  rate,  pages  views,  avg.  7me  per  session,  etc…  might  not  be  relevant  to  measure  user  engagement,  or  relevant  in  our  ci7zen  science  project.    

Page 4: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Google  Analy7cs  

•  Audience  •  Acquisi7on  •  Behaviour  

Page 5: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Audience:  Par7cipa7on  16.000  sessions  8.000  users  108  countries  –  90  languages  Avg.  of  400  sessions  per  day  

Page 6: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Audience:  Who?  

+  browser,  OS,    and  mobile  or  PC  connec7on  

Page 7: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Acquisi7on:  where  do  they  come  from?  

Page 8: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Acquisi7on:  where  do  they  come  from?  

Page 9: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Acquisi7on:  Evalua7ng  dissemina7on  ac7vi7es  

Itc.ua  1st  Release  

Twi^er  Reddit  Post  

Page 10: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Behavior  

Page 11: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  adding  filters  

•  Give  me  previous  stats  focusing  on  visitors  coming  from  reddit.  

•  Give  me  previous  stats  focusing  on  female  visitors,  between  18  and  24  years,  linux  users,  living  in  Switzerland.    

•  Infinite  number  of  possible  combina7ons.    

Page 12: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  adding  filters  

•  Top  10  countries  by  Female  users.  •  Top  10  referrals  (reddit,  facebook,  …)  gathering  female  users.  

•  Top  10  referrals  gathering  visitors  who  immediately  run  away  from  the  site.    

Page 13: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Why  do  we  need  anything  else?  

Page 14: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Bounce  rate,  pages  views,  and  avg.  4me  per  session  might  not  be  relevant  for  a  ci7zen  science  project.    – We  need  to  know  the  ac7ons  performed  in  the  site  to  measure  par7cipants  contribu7on.  

– We  need  to  be  able  to  make  public  analy7c  data.    – We  need  to  be  able  to  create  advance  data  aggrega7on.  I.e.  clustering  analysis,  advance  engagement  func7ons.    

Page 15: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

CCLTracker  framework  

CCLTrackerJS Library Google

Analytics

Google Tag Manager

RGA Library R

Monitoring Storing, aggregating, reporting

Advance Aggregation and reporting

Google Super Proxy

Page 16: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

CCLTracker  events  

-­‐ Is  the  user  scrolling  down  on  the  web  site  (0%,  25%,50%,75%100%)  -­‐   is  the  user  clicking  new  projects,  about,  forum,  etc?  

Page 17: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

CCLtracker  events  

Page 18: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

CCLTracker  events  

Page 19: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  filtering  

•  Stats  only  for  par7cipants  who  has  properly  installed  the  CERN  VM.  (segmenta7on)  

•  Comparing  two  segments  (e.g.  all  sessions  vs  sessions  running  the  virtual  machine.    

Page 20: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  filtering  

Page 21: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  filtering  

•  Most  common  web  API  errors  by  browser  (crossing  data,  filtering)  

Page 22: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  filtering  

•  Top  more  ac7ve  users.    

Page 23: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Segmenta7on,  crossing  data  and  filtering  

•  All  ac7ons  by  a  given  user    

Page 24: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Advance  Data  Aggrega7on  

1 2 3 4 5 6 7 8 9 10 12

Number of days

% of us

ers

05

1015

2025

3035

29.58

17.63

9.5210.18

7.38

2.883.875.31

3.321.84

4.064.43

•  Engagement  

Page 25: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Advance  Data  Aggrega7on  

Page 26: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Advance  Aggrega7on  

Visitors  per  age  group  

Visitors  successful  running  VM  per  age  group    

0  

10  

20  

30  

40  

50  

60  

18-­‐24   25-­‐34   35-­‐44   45-­‐54   55-­‐64   >65  

%  of  session

s  

Users'  age  

User  engagement  per  age  group  

Page 27: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Advance  Data  Aggrega7on  

Desired  Flow  of  ac7ons  

Page 28: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Advance  Data  Aggrega7on  

Page 29: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

Why  do  we  want  all  this  data?  

•  Increase  the  number  of  par7cipants  •  Increase  par7cipants’  engagement.  

•  Improve  the  navigability,  and  accessibility  of  the  website.    

•  Improve  users’  learning  experience.  Are  users  improving  the  quality  of  their  contribu7ons?  

Page 30: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

What  did  we  learn  using  CCLTracker?  

•  Transla7on  to  different  languages  is  important  to  reach  a  large  audience.  E.g.  Russian  referral.  

•  Addressing  technology  sec7ons  in  newspapers.  

•  Web  site  naviga7on  is  not  trivial.    

•  Low  engagement.    

•  Low  female  par7cipa7on.  

Page 31: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

2nd  CERN  Public  Compu7ng    Challenge  

•  Increase  the  number  of  par7cipants:  – Addressing  female  par7cipa7on.  –  Transla7ng  the  website  to  different  languages.  –  Pos7ng  the  informa7on  in  data  hubs,  scien7fic  sec7ons  of  newspapers,  etc…    

•  Segmen7ng  data  by  different  level  of  engagements:  –  Visitors  who  do  not  run  the  VM.    –  Visitors  who  compute  10  jobs,  20  jobs,  …  n  jobs.  –  Visitors  who  compute  for  1  h,  5  h,  …  n  hours.    –  Visitors  who  par7cipate  over  the  whole  challenge…  

Page 32: Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker

“top  10%  of  contributors  responsible  for  almost  80%  of  total  classifica7ons.”

Open  ques7on