Using internet search engines and library catalogs to locate
Internet Search Engines
description
Transcript of Internet Search Engines
Internet Search EnginesInternet Search Engines
Noah A. HabanNoah A. HabanEric LeBarronEric LeBarronAdam BinderAdam BinderSeth HolubarSeth Holubar
Dan KellyDan Kelly
AgendaAgendaTechnology Description and HistoryTechnology Description and History
SMCRSMCR
B-C=VB-C=V
CompetitionCompetition
What does the future hold?What does the future hold?
Summary and QuestionsSummary and Questions
What is a search engine?What is a search engine?
Information retrieval systemInformation retrieval system
Designed to find information on:Designed to find information on:1.1. Computer SystemComputer System
2.2. World Wide WebWorld Wide Web
3.3. Corporate and Proprietary NetworksCorporate and Proprietary Networks
4.4. Personal ComputerPersonal Computer
Description & History SMCR B–C=V Competition Future Summary
Search Engines: The Early YearsSearch Engines: The Early Years
First Internet website: First Internet website: http://info.cern.ch/http://info.cern.ch/
Basically an explanation of the WWWBasically an explanation of the WWW
Left: Tim Berners-Lee, credited with creation of world wide web
Right: Robert Cailliau, collaborator on WWW project. First web surfer
Description & History SMCR B–C=V Competition Future Summary
Pioneer Search EnginesPioneer Search Engines
ArchieArchieGopherGopherVeronicaVeronicaJugheadJugheadWandexWandexAliweb (still runs today)Aliweb (still runs today)WebcrawlerWebcrawlerLycosLycos
Description & History SMCR B–C=V Competition Future Summary
Parts of a search engineParts of a search engine
SpidersSpidersFollow links on web to request pages that have either not been indexed Follow links on web to request pages that have either not been indexed
or have been updated since last indexor have been updated since last index
Index (catalog)Index (catalog)Large, slightly outdated collection of content which roughly Large, slightly outdated collection of content which roughly
represents the content of the webrepresents the content of the web
Search Interface and relevancy softwareSearch Interface and relevancy softwareAccept query, Check relevancy, Gather pages, Request ads. Accept query, Check relevancy, Gather pages, Request ads.
Description & History SMCR B–C=V Competition Future Summary
Parts of a search engineParts of a search engine
Meta TagsMeta Tags
1.1. Allow specification of keywords/conceptsAllow specification of keywords/concepts
2.2. Helpful when words have more than one Helpful when words have more than one meaningmeaning
3.3. Spiders will search page and change Spiders will search page and change meta tags if words do not match up. meta tags if words do not match up. Prevents false categorizing with popular Prevents false categorizing with popular keywords.keywords.
Description & History SMCR B–C=V Competition Future Summary
Building a searchBuilding a search
QueryQuery
A word or combination of words the user A word or combination of words the user submits to search a given topic.submits to search a given topic.
ResultsResults
Put in order according to other searches and Put in order according to other searches and page visitation frequency. page visitation frequency.
Description & History SMCR B–C=V Competition Future Summary
SMCRSMCR
Sender
•Inform
•Deceive
•Get attention
•Persuade
•Educate
Message
•Facts/Figures
•Stories
•Pictures, Video & Sound
•Confidentiality
•Complexity
•Advertising
Channel
•Computer
•Internet
•Webpage
Receivers
•Internet Access
•All
•Ages
•Education Level
•Gender
•Occupation
Description SMCR B–C=V Competition Future Summary
ReceiversReceivers
Who uses search engines?Who uses search engines?
From a March 2004 survey of 1,649 people:From a March 2004 survey of 1,649 people: 56% use search engines on a daily basis56% use search engines on a daily basis Only 13% use different search engines Only 13% use different search engines
for different types of searchesfor different types of searches 57% use the same search engine. Of 57% use the same search engine. Of
those people, the top 3 are Google, those people, the top 3 are Google, Yahoo and MSN.Yahoo and MSN.
Description SMCR B–C=V Competition Future Summary
ReceiversReceivers
Who uses search engines? Who uses search engines? 1.1. Homemakers the finickiest (52% look at Homemakers the finickiest (52% look at
only first page, then move on)only first page, then move on)2.2. Abandonment rates high among Abandonment rates high among
educators (40%), professionals (38%) educators (40%), professionals (38%) and students (27%). and students (27%).
3.3. Age has been shown to make people Age has been shown to make people less patient with searchesless patient with searches
4.4. 49% use one or more search toolbars49% use one or more search toolbars
Description SMCR B–C=V Competition Future Summary
ReceiversReceivers
Who uses search engines?Who uses search engines?
From this info we can conclude:From this info we can conclude: People of all ages use search engines and half People of all ages use search engines and half
use them on a daily basisuse them on a daily basis Most people are loyal to one search engineMost people are loyal to one search engine Most people abandon a search after the first Most people abandon a search after the first
page or two of resultspage or two of results Top search engines have toolbars availableTop search engines have toolbars available
Description SMCR B–C=V Competition Future Summary
BenefitsBenefits
Easy to useEasy to use
Search vast databasesSearch vast databases
Sophisticated searching often availableSophisticated searching often available
GlobalGlobal
Description SMCR B–C=V Competition Future Summary
CostsCosts
The web is growing quicker than it can be The web is growing quicker than it can be indexed.indexed.Search engines slow down because of Search engines slow down because of indexing new web pagesindexing new web pagesQueries currently limited to keywords, Queries currently limited to keywords, resulting in matching random words resulting in matching random words scattered across large pages (as opposed scattered across large pages (as opposed to limiting matches within a paragraph or to limiting matches within a paragraph or phrase)phrase)
Description SMCR B–C=V Competition Future Summary
CostsCosts
Many dynamically generated sites are not Many dynamically generated sites are not indexable, resulting in the “invisible web”indexable, resulting in the “invisible web”
Some search engines do not rank by Some search engines do not rank by relevance, but rather by how much the relevance, but rather by how much the websites paywebsites pay
Hundreds of generated sites use tricks to Hundreds of generated sites use tricks to manipulate a search engine into displaying manipulate a search engine into displaying them higher in results them higher in results
Description SMCR B–C=V Competition Future Summary
CostsCosts
How important are the costs to users and How important are the costs to users and how can usage be increased?how can usage be increased?
Somewhat importantSomewhat important
Sites indexed more by relevancy Sites indexed more by relevancy
Use a “proximity-search” option, which Use a “proximity-search” option, which uses a search bracket to limit matches uses a search bracket to limit matches within a paragraph or phrasewithin a paragraph or phrase
Eliminating linkspam in searchesEliminating linkspam in searches
Description SMCR B–C=V Competition Future Summary
B-C=VB-C=V
Do the benefits outweigh the costs? Do the benefits outweigh the costs? Yes.Yes.
1.1. EasyEasy
2.2. Less time and effortLess time and effort
3.3. Virtually endless result possibilities Virtually endless result possibilities
Description SMCR B–C=V Competition Future Summary
The CompetitionThe Competition
Description SMCR B–C=V Competition Future Summary
Share of Searches: TrendsShare of Searches: Trends
Description SMCR B–C=V Competition Future Summary
What does it look like?What does it look like?
Description SMCR B–C=V Competition Future Summary
GoogleGoogle
Description SMCR B–C=V Competition Future Summary
YahooYahoo
Description SMCR B–C=V Competition Future Summary
MSNMSN
Description SMCR B–C=V Competition Future Summary
DMOZDMOZ
Description SMCR B–C=V Competition Future Summary
Users = $$Users = $$
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Yahoo vs. Google vs. MSNYahoo vs. Google vs. MSN
Description SMCR B–C=V Competition Future Summary
Other CompetitorsOther Competitors
dmoz.orgdmoz.org– Human edited directory of the web. Human edited directory of the web.
Constructed and maintained by community Constructed and maintained by community of volunteersof volunteers
ChaCha.comChaCha.com– Human-powered search engine. Uses Human-powered search engine. Uses
people to sort results by relevance.people to sort results by relevance.
Description SMCR B–C=V Competition Future Summary
DMOZDMOZ
Targeted at other search engines.Targeted at other search engines.Powers directory listings of Netscape, AOL, Google, Lycos, Powers directory listings of Netscape, AOL, Google, Lycos, HotBot, DirectHit, 100s of Others.HotBot, DirectHit, 100s of Others.
BenefitBenefit– Open Source Licensing = 100% FreeOpen Source Licensing = 100% Free
(No cost to submit a site, or to use data retrieved.)(No cost to submit a site, or to use data retrieved.)
CostCost– Community BasedCommunity Based
You Get What You Give! You Get What You Give! Sign up to be an editor, get free tools to do it, and help make Sign up to be an editor, get free tools to do it, and help make the Web a better place.the Web a better place.
Description SMCR B–C=V Competition Future Summary
ChaChaChaCha
Why the name ChaCha? Why the name ChaCha? – Cha = Search in ChineseCha = Search in Chinese– ChaCha is a popular fast-paced danceChaCha is a popular fast-paced dance
BenefitsBenefits– Only Search Engine that provides Human guides that lead Only Search Engine that provides Human guides that lead
searchers (much like they would in the dance) to the most searchers (much like they would in the dance) to the most relevant results on the internet.relevant results on the internet.
– Creates its own database of successful searches and results Creates its own database of successful searches and results to index from.to index from.
CostsCosts– Free to use. (Paid for by advertising?)Free to use. (Paid for by advertising?)– Guides Earn $5-10 per search hour.Guides Earn $5-10 per search hour.
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Description SMCR B–C=V Competition Future Summary
Diffusion of Innovation ModelDiffusion of Innovation Model
Description SMCR B–C=V Competition Future Summary
The Future of Search EnginesThe Future of Search Engines
GoogleGoogle•Google EarthGoogle Earth•MapsMaps•G-mailG-mail•DesktopDesktop•Google VideoGoogle Video•You TubeYou Tube•Froogle Froogle •ToolbarToolbar•Customizable Customizable hompagehompage
Yahoo!Yahoo!•PersonalsPersonals•GeocitiesGeocities•TravelTravel•FinanceFinance•Hot JobsHot Jobs•Yellow PagesYellow Pages•Yahoo! MailYahoo! Mail•Yahoo! MessengerYahoo! Messenger
MSN•City Guides•My MSN•Hotmail•Messenger
Description SMCR B–C=V Competition Future Summary
The Future of Search EnginesThe Future of Search Engines
Current “top dog” is GoogleCurrent “top dog” is Google(Forbes Magazine, January 07)(Forbes Magazine, January 07)
Also Yahoo & MSNAlso Yahoo & MSN Increase TrafficIncrease Traffic Reduce Irrelevant LinksReduce Irrelevant Links Offer More ServicesOffer More Services
Description SMCR B–C=V Competition Future Summary
The Future of Search EnginesThe Future of Search EnginesFuture “top dog” candidates include:Future “top dog” candidates include:
Metasearch sites, or sites which combine Metasearch sites, or sites which combine hits from multiple search engines rather hits from multiple search engines rather than just one. than just one.
www.profusion.comwww.profusion.com
www.metacrawler.comwww.metacrawler.com
Human powered search enginesHuman powered search engineswww.chacha.comwww.chacha.com
Description SMCR B–C=V Competition Future Summary
Summary & ROTSummary & ROT
Description SMCR B–C=V Competition Future Summary
•As long the internet grows in size & complexity, search engines will be needed by users.
•Search Engines make their money from Advertisers, who pay to have their web sites come up more often.
•The most successful search engines provide users with more features than just searching.
•Search Engines must continue to evolve and adapt to attract users.
•Search Engines Feed and Supply each other.
•Most people get used to a search engine, and stick with it.
Description SMCR B–C=V Competition Future Summary