WCMS and Campus Web Search Updates
description
Transcript of WCMS and Campus Web Search Updates
![Page 1: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/1.jpg)
WCMS and Campus Web Search Updates
![Page 2: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/2.jpg)
Campus Search• Previous campus search used free search providers
• It switched between providers as daily usage limits were exceeded
• Started with Google, then Yahoo then Bing
#watitis2012
![Page 3: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/3.jpg)
Campus Search• Google Search Appliance went live in October
• Indexes web pages in the uwaterloo.ca domain as well as documents and LDAP records
#watitis2012
![Page 4: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/4.jpg)
Campus Search• 1,000,000 document license
– 680,889 documents being served• 59,372 of those “documents” are people from UWLDAP
• The initial crawl quickly used up the license– Duplicate and “retired” servers were omitted– Revision history of wiki pages was ignored– Some sites had pages that presented the same content in different orders
#watitis2012
![Page 5: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/5.jpg)
#watitis2012
![Page 6: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/6.jpg)
#watitis2012
![Page 7: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/7.jpg)
#watitis2012
![Page 8: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/8.jpg)
Campus Search• Content breakdown
– HTML: 514,648– PDF: 82,303– Text files: 37,979– Postscript: 4,938– MS Word: 4,308– MS Powerpoint: 2,496– Flash: 2,140– MS Excel: 399
#watitis2012
![Page 9: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/9.jpg)
Campus Search• We can add suggestions for acronyms
#watitis2012
![Page 10: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/10.jpg)
Campus Search• Or promote pages that should be prominent
#watitis2012
![Page 11: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/11.jpg)
Campus Search• If you want to have something removed from the
campus search, adjust your robots.txt file accordingly or hide it behind a login and it will disappear over time
• If you need something removed immediately, submit an RT ticket
#watitis2012
![Page 12: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/12.jpg)
Campus Search• Future plans
– Upgrade to Google Search Appliance 7.0
– Add different content type options on search page
– Faceted search
#watitis2012
![Page 13: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/13.jpg)
Google Search Appliance 7.0• Universal Search
– Search content across silos– New SharePoint connector
• Relevance and performance improvements– Enhancements to Google’s algorithms– Assisted navigation to refine search results
#watitis2012
![Page 14: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/14.jpg)
Google Search Appliance 7.0• Document preview
– Not just HTML, also includes MS Word, PowerPoint and PDFs
– Good for mobile use as you do not have to download an entire PDF to know it is the one you want
#watitis2012
![Page 15: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/15.jpg)
Google Search Appliance 7.0• Document translation and language capabilities
– Translate titles and snippets from inside the search appliance with support for 60 languages
– Better support for crawling languages such as Arabic and Japanese
#watitis2012
![Page 16: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/16.jpg)
Google Search Appliance 7.0• Index compressed files
– Now crawls and indexes compressed files in .zip, .tar, .tar.gz and .tgz
• Expert search– Find subject matter experts on campus by searching
on keywords• Ex. Search for “network security” and a list of network
security experts will appear on the sidebar
#watitis2012
![Page 17: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/17.jpg)
Other search enhancements
• Auto suggest on search fields
• Not just on the main search page, but on individual Drupal sites as well
#watitis2012
![Page 18: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/18.jpg)
Other search enhancements• Adding content types to the Google search
#watitis2012
![Page 19: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/19.jpg)
Other search enhancements• Faceted search
#watitis2012
![Page 20: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/20.jpg)
WCMS Updates• Lily Yan - New content types
• Liam Morland - Opening up some sites so all CAS users can authenticate
• Chris Shantz – Prototypes for the IT Strategic Plan site and access control with Organic Groups
• Kris Olafson – Feature requests site
#watitis2012
![Page 21: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/21.jpg)
Six content types• Project • Service• Graduate award• Teaching tip• Grebel publication• Exchange board
#watitis2012
![Page 22: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/22.jpg)
Project content type• Proposed, ongoing and completed projects
• It includes project title, description, members (name, role), status, audience and topic
• Projects can be searched by status, topic and audience
• Initially the IST pilot site will use this content type
#watitis2012
![Page 23: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/23.jpg)
Project
#watitis2012
![Page 24: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/24.jpg)
Service content type• Services offered at the University of Waterloo
• Service content type includes service name, description, cost, support for this, link to service, service audience and category
• Can be searched by categories and audience
• IST website will use this content type
#watitis2012
![Page 25: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/25.jpg)
Service
#watitis2012
![Page 26: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/26.jpg)
Graduate award content type• This is a custom content type for graduate-
studies site
• This content type includes all information about graduate awards
• Graduate awards can be searched by name, description, value, program, deadline, type, category and citizenship
#watitis2012
![Page 27: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/27.jpg)
Graduate award
#watitis2012
![Page 28: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/28.jpg)
Graduate award content type• Site managers can view graduate award
report and download a .csv file
• Site mangers can mass publish content
#watitis2012
![Page 29: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/29.jpg)
Graduate award
#watitis2012
![Page 30: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/30.jpg)
Teaching tip content type• A custom content type for the Centre for
Teaching Excellence - this is a tip to help inform users
• This content type includes title, listing image, body, categories, tags and audience.
• Teaching tips can be searched by entering keywords and tag, and by categories
#watitis2012
![Page 31: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/31.jpg)
Teaching tip
#watitis2012
![Page 32: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/32.jpg)
Grebel publication• Grebel publication includes two content types
(Grebel journal and Grebel journal publication)
• The full article of HTML and PDF for past issues can be viewed, PDF can be downloaded
• Only title and author's name can be viewed for current issue
• Issues archived by year
#watitis2012
![Page 33: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/33.jpg)
Grebel publication
#watitis2012
![Page 34: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/34.jpg)
Exchange board content type• This content type is for recreation-committee site
only
• It includes personal information and exchange information
• It can be searched by exchange type (for sale, for rent, free or wanted)
#watitis2012
![Page 35: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/35.jpg)
Exchange board
#watitis2012
![Page 36: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/36.jpg)
#watitis2012
![Page 37: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/37.jpg)
#watitis2012
![Page 38: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/38.jpg)
#watitis2012
![Page 39: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/39.jpg)
#watitis2012
![Page 40: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/40.jpg)
#watitis2012
![Page 41: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/41.jpg)
lkmorlan
#watitis2012
![Page 42: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/42.jpg)
GUID
#watitis2012
![Page 43: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/43.jpg)
Authentication and Performance
• Authenticated traffic skirts around our caching servers (varnish)
• Lack of caching contributed to our last outage
• The Main site is a fairly content heavy site compared to most, but we should still proceed with caution
#watitis2012
![Page 44: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/44.jpg)
Prototypes for New Functionality
• On the IT Strategic Plan site, the following features are being evaluated:
– Organic Groups
– Commenting
– Inline Diffs
#watitis2012
![Page 45: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/45.jpg)
Organic Groups (OG)● Enable users to create and manage their
own 'groups'. ● Each group can have subscribers, and
maintains a group home page where subscribers communicate amongst themselves. (from Drupal.org/project/og)
#watitis2012
![Page 46: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/46.jpg)
Organic Groups (OG)● There are over 35,000 sites that use OG
including whitehouse.gov and groups.drupal.org
● OG allows us to have two sets of permissions for content on a site
● Can have private or public content
● Can have private or public groups
#watitis2012
![Page 47: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/47.jpg)
Comments● With authenticated access we can explore
having comments available for certain content types
● Currently being used on the IT Strategic Plan site with OG content
#watitis2012
![Page 48: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/48.jpg)
Inline Diff● Inline Diff allows you to view changes to
revisions on your site on the node page instead of having to go to the revisions page to see the difference
● The diff widget is in a block on the right, you choose the revision to review from a select box
● Changed text is yellow, added text is green, deleted text is grey
#watitis2012
![Page 49: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/49.jpg)
Inline Diff
#watitis2012
![Page 50: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/50.jpg)
What’s next? You decide!• In the new year, with our next major
release, you will see a link appear on your site dashboards to request a new feature
• We will open up our project management site and you can watch our progress on your requests
#watitis2012
![Page 51: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/51.jpg)
Get involved!• We are starting 3 working groups in
collaboration with Student Technology Services team
– Mobile (apps and web user experience)– Student Portal– Open Data
#watitis2012
![Page 52: WCMS and Campus Web Search Updates](https://reader036.fdocuments.in/reader036/viewer/2022081520/5681685f550346895ddea47c/html5/thumbnails/52.jpg)
Questions?
#watitis2012