Www.BZUPAGES.COM: Log files. Presented to: Sir Adnan. Presented by: Shah Rukh.

Post on 16-Jan-2016







Log file

A file that lists actions that have occurred.

For example, web servers maintain log files listing every request made to the server.


Some terminology

Visit: Each access to a webpage by a specific user is considered a visit. Multiple visits by a single user reflect a high degree of interest in the site content. Frequently updated, content-rich sites generate multiple visits.

Session: A session includes all the activities of a user during a single website visit. Website management can benefit from knowing the time and length of a session and the path that a user follows within a website.

Hit: The access log records each attempt to retrieve a page or file from a website. These "hits" can add up to impressive numbers but are misleading, because hit statistics include not just webpages but also the graphics embedded in them. Watch the visitor count and ignore the hits!

Cookie: Cookies are small data fragments left on a user's computer by a website. These "crumbs" can be used to track a complete visitor web session. With the increased reliance on all forms of electronic commerce, cookies should be the least of users' worries.
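To see why hit counts mislead, the short sketch below counts hits and page views separately for a handful of requested paths. The sample paths and the set of extensions treated as web pages are assumptions made for illustration; they do not come from the slides.

```python
from pathlib import PurePosixPath

# Hypothetical requested paths, as they might appear in an access log.
requests = [
    "/index.html",
    "/images/logo.gif",
    "/images/banner.jpg",
    "/style.css",
    "/products.html",
    "/images/logo.gif",
]

# Assumption: only these extensions (or no extension) count as web pages.
PAGE_EXTENSIONS = {"", ".html", ".htm"}


def count_hits_and_pages(paths):
    """Return (hits, page_views) for a list of requested paths."""
    hits = len(paths)  # every request is a hit, whatever file it fetches
    page_views = sum(
        1 for p in paths if PurePosixPath(p).suffix.lower() in PAGE_EXTENSIONS
    )
    return hits, page_views


hits, page_views = count_hits_and_pages(requests)
print(f"hits={hits} page_views={page_views}")
```

With this sample, six hits correspond to only two page views, which is the gap the warning about hits refers to.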



Personalisation refers to the use of technology and available customer information to tailor e-commerce interactions between a business and each individual customer.

Personalisation helps to make a website more responsive to the unique and individual needs of each user.


Personalisation in e-commerce

There are three main phases in personalisation:

The collection of web data;

Web data pre-processing and analysis; and

Determining which actions to take based on the analysis results.


Customer Profiles

A customer profile is a snapshot of who your customers are, how to reach them and why they buy from you.

In short, a customer profile is a collection of information that describes the customer.

Customer profiles contain comprehensive information about customers' demographic details, preferences, characteristics and activities.


Methods used to establish customer profiles

Explicit and implicit feedback are the two methods used by on-line businesses to establish customer profiles.

In explicit feedback, customers are requested to register on the website by completing on-line forms that include biographical details such as name, age, gender, contact details and occupation.

In some instances, registration continues by asking customers to complete a questionnaire that focuses on their specific preferences and may further ask them to rate products.

In implicit feedback, the on-line system collects user details without the user's explicit input and uses them to model the user's on-line behavior or build profiles.

Examples of implicit feedback include user purchasing patterns, web page visits and web surfing paths.


Updating Customer Profiles

One way to update customer profiles is to implicitly collect customers' browsing behavior as they interact with the website. Web log files are a commonly used source of web visitors' activities and are used to update and maintain on-line customers' profiles.

As shown in the figure below.




The main motives for having the system record a number of user activities in a tailor-made log file were:

To avoid the data pre-processing that would be needed if the activities were collected from another log file, for example a web server log file; and

To prevent users from assigning arbitrary ratings when asked explicitly to rate themselves.

The variables that the system recorded in the log file were used by the decision model to update individual customer profiles.


User tasks in maintaining profile

Two different sets of tasks

Firstly, users were asked to register and complete an on-line product knowledge questionnaire with specific questions on the product categories the system caters for.

The product questionnaire had five general questions for each product category and five options per question, from which a user had to select only one option per question. Each option had a weighting attached.

As shown in the table below.


Option weighting    Meaning
1                   Not at all
2                   To some extent
3                   To a moderate extent
4                   To a larger extent
5                   Always
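The weightings in the table can be turned into a per-category score. The slides do not say how the five answers are combined, so the sketch below simply averages them; that rule, the function name and the sample answers are all assumptions made for illustration.

```python
# Option weightings and their meanings, taken from the table above.
WEIGHT_MEANING = {
    1: "Not at all",
    2: "To some extent",
    3: "To a moderate extent",
    4: "To a larger extent",
    5: "Always",
}

# The questionnaire had five questions per product category.
QUESTIONS_PER_CATEGORY = 5


def category_score(selected_weights):
    """Average the weights of the options chosen for one product category.

    Assumption: the slides do not state how weights are combined; a plain
    average is used here only as an illustration.
    """
    if len(selected_weights) != QUESTIONS_PER_CATEGORY:
        raise ValueError("expected exactly one answer per question")
    if any(w not in WEIGHT_MEANING for w in selected_weights):
        raise ValueError("each weight must be between 1 and 5")
    return sum(selected_weights) / QUESTIONS_PER_CATEGORY


# Hypothetical answers for one product category.
answers = [3, 4, 2, 5, 4]
print(category_score(answers))
```

A score near 1 would then suggest little product knowledge in that category and a score near 5 a great deal, under the averaging assumption.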




Secondly, users were asked to perform two similar tasks: purchasing specified products from each category on the on-line website. The tasks were performed in two separate sessions: the first task in the first session and the second task in the second session.

The system generated a separate activity log file for every user after the user had completed a session.



Decision model and customer profile updating

A decision model was designed, implemented and incorporated in the on-line e-commerce website.

The main purpose of the decision model was to analyse customers' activities recorded in the customer log file and to update the customer profiles where necessary.

The log file recorded a number of activities a customer performed during a session. The following parameters from the log file were selected for use by the decision model in making updating decisions:

Time spent on a particular product category page;

Products browsed per category; and

Number of times more information was requested.
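As a rough sketch of how these three parameters might be derived from a per-session activity log, the example below aggregates a list of records by product category. The record layout, the event names and the choice to count distinct products are hypothetical; the slides do not describe the actual log format or the decision rules.

```python
from collections import defaultdict

# Hypothetical activity records for one session: (category, event, value).
# "page_time" carries seconds spent on the category page, "browse" carries a
# product id, and "more_info" marks one request for more information.
session_log = [
    ("laptops", "page_time", 40),
    ("laptops", "browse", "L-100"),
    ("laptops", "browse", "L-200"),
    ("laptops", "more_info", None),
    ("cameras", "page_time", 15),
    ("cameras", "browse", "C-10"),
    ("laptops", "page_time", 25),
]


def summarise(log):
    """Aggregate the three decision-model parameters per product category."""
    summary = defaultdict(
        lambda: {"seconds": 0, "products": set(), "more_info": 0}
    )
    for category, event, value in log:
        entry = summary[category]
        if event == "page_time":
            entry["seconds"] += value
        elif event == "browse":
            entry["products"].add(value)  # assumption: count distinct products
        elif event == "more_info":
            entry["more_info"] += 1
    return {
        cat: {
            "seconds": e["seconds"],
            "products_browsed": len(e["products"]),
            "more_info": e["more_info"],
        }
        for cat, e in summary.items()
    }


for category, stats in summarise(session_log).items():
    print(category, stats)
```

A decision model could then compare these per-category totals against thresholds of its own choosing before updating a profile.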



Different log files can be created for users during browsing, for example web server access logs, browser caches and proxy logs. Common information contained in web log files includes:

Remote IP address or IP address of the client machine;

User login name;

Time and date of requests;

Requested pages and results of requests; and

Size of data transferred.



Two of the most commonly used web log file entries in updating customer profiles or modeling on-line user behavior are:

Time and date of requests: these show the time users spend on a page before navigating to another page.

Requested pages and results: these show the user's navigation pattern on the website.

The other web log file entries are used for purposes such as improving server performance and facilitating site modifications.
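Time on a page can be estimated from the gap between one request and the next request by the same user. The sketch below does this for three requests; apart from the first timestamp and the two page names that appear in the sample log entry later in these slides, the values are made up for illustration.

```python
from datetime import datetime

# Hypothetical (timestamp, page) requests from one user, in log order.
# Timestamps follow the layout used in common web server access logs.
requests = [
    ("19/Jul/1999:00:00:04 -0600", "/studio/hard.html"),
    ("19/Jul/1999:00:00:49 -0600", "/studio/drives.html"),
    ("19/Jul/1999:00:02:19 -0600", "/studio/index.html"),
]

# Note: %b matches English month abbreviations under the default C locale.
TIME_FORMAT = "%d/%b/%Y:%H:%M:%S %z"


def time_on_pages(reqs):
    """Return (page, seconds) pairs from consecutive request timestamps.

    The last page is omitted because no later request marks its end.
    """
    parsed = [(datetime.strptime(ts, TIME_FORMAT), page) for ts, page in reqs]
    result = []
    for (start, page), (end, _next_page) in zip(parsed, parsed[1:]):
        result.append((page, (end - start).total_seconds()))
    return result


for page, seconds in time_on_pages(requests):
    print(page, seconds)
```

The order of the pages in the same list gives the navigation pattern; the time spent on the final page of a session cannot be recovered this way.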


Comparison of customer profiles

Initial customer profile and customer profile 1: Initial customer profiles were created by the system during registration. Users had to complete an on-line product knowledge questionnaire that determined their product knowledge level, which the on-line system used to generate an initial customer profile.

Customer profile 1 was generated when users completed their first task set.


Let's see the agreement charts.




Types of logs include:

Access logs

Error logs

Referrer logs

Agent logs



1. Access logs record website visits. Most web server logs are kept in common log file format or can be converted to this format. This format makes it possible for statistics programs to analyze web site activity.


Sample log entries



2. Error logs identify file problems such as missing files.

3. Referrer logs list the site that a user came from before accessing a particular page.

4. Agent logs record the type of browser or client software used to access web pages on a particular host.


Raw log file

The wealth of data in the log files is not readily mined with the naked eye. A raw log file entry looks something like this:

- - [19/Jul/1999:00:00:04 -0600] "GET /studio/drives.html HTTP/1.1" 200 20607 "http://www.webdevelopersjournal.com/studio/hard.html" "Mozilla/4.0 (compatible; MSIE 5.0; Windows XP; DigExt)"
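An entry like this can be split into named fields with a regular expression. The sketch below assumes the entry follows the widely used combined log format (host, identity, user, time, request, status, size, referrer, agent). The host address at the start of the sample line is a placeholder added for illustration, since the entry quoted above does not show one.

```python
import re

# One named group per field of a combined-format access log entry.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)

# The host is a placeholder (a reserved documentation address), because the
# entry quoted in the slides does not include one.
line = (
    '192.0.2.1 - - [19/Jul/1999:00:00:04 -0600] '
    '"GET /studio/drives.html HTTP/1.1" 200 20607 '
    '"http://www.webdevelopersjournal.com/studio/hard.html" '
    '"Mozilla/4.0 (compatible; MSIE 5.0; Windows XP; DigExt)"'
)


def parse_entry(text):
    """Return the fields of one log entry as a dict, or None if no match."""
    match = LOG_PATTERN.match(text)
    return match.groupdict() if match else None


entry = parse_entry(line)
print(entry["request"], entry["status"], entry["size"])
print(entry["referrer"])
print(entry["agent"])
```

The referrer and agent fields carry the same kind of information that separate referrer logs and agent logs record.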


Any comments or questions?