Drupal.org Search Evaluation
-
Upload
isriya-paireepairit -
Category
Economy & Finance
-
view
3.579 -
download
1
description
Transcript of Drupal.org Search Evaluation
![Page 1: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/1.jpg)
Enterprise Search Engine Survey
Isriya Paireepairit
Drupal.org Case
![Page 2: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/2.jpg)
Drupal
• Software
• Content Management System
• Web-based
• PHP
![Page 3: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/3.jpg)
Drupal.org
• Home of Drupal the CMS
• For Drupal users, downloaders, developers
• Definitely use Drupal as CMS
• As well as Drupal Search Function
![Page 4: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/4.jpg)
![Page 5: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/5.jpg)
Drupal.org Content Types
• Projects
• Modules
• Themes
• Translations
• Forums (Support, Discussion, Chit-chat)
• Documents (Manual, Howto)
• Issues (Bugs, Feature Requests)
• API Documents (for Developers)
• User page
• News/Announcement
![Page 6: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/6.jpg)
• As mid April 2008
• Content: 250,000 nodes
• Registered User: 280,000 users
• Page Visits: ~1M/day (Compete.com)
Drupal.org Content Size
![Page 7: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/7.jpg)
Drupal Search Function
• Indexing
• Minimum word length is configurable
• CJK Handling
![Page 8: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/8.jpg)
Drupal Search Function
• Search result ranking
• Weightable
• 3 default factors
• Keyword relevance
• Recency
• Number of comments
![Page 9: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/9.jpg)
Drupal.org Implementation
• Keyword relevance: 10
• Recency: 5
• Number of comments: 1
Source: http://www.civicactions.com/blog/search/part_1
![Page 11: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/11.jpg)
Good
• Simplicity
• Advanced Search
• (Some) Specific content type search
• Detailed result
![Page 12: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/12.jpg)
Simplicity
![Page 13: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/13.jpg)
Advanced Search
![Page 14: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/14.jpg)
(Some) Specific Content Types
![Page 15: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/15.jpg)
Detailed Result
![Page 16: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/16.jpg)
Some Problems
1
2
3
![Page 17: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/17.jpg)
Improvement Ideas
• Add more priority to some content types
• Projects > Documents > Forums
• Add sorting option
• By type
• Also by date, number of comments
![Page 18: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/18.jpg)
More Ideas
• Weight by
• Number of incoming links (like PageRank)
• Tag/Category/Taxonomy
• Misspelling Handler
• Synonym Handler
• e.g. “Category” = “Taxonomy”
![Page 19: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/19.jpg)
More Experimental IdeasFaceted Search
![Page 20: Drupal.org Search Evaluation](https://reader033.fdocuments.in/reader033/viewer/2022051609/546fb90bb4af9f16648b45cf/html5/thumbnails/20.jpg)
Further Issues
• Overall site performance
• Indexing and Searching is resource-consuming
• Solution
• “Outsource” search function to dedicated search software?
• Google Box
• Apache Solr