SPConnections Amsterdam: Beyond the Search Center - Application or Solution? (Search Based...
-
Upload
agnes-molnar -
Category
Technology
-
view
788 -
download
0
Transcript of SPConnections Amsterdam: Beyond the Search Center - Application or Solution? (Search Based...
About Agnes Molnar• SharePoint Server MVP• Senior Solutions Consultant, BA Insight• Recognized blogger, speaker, writer
• Web: http://www.bainsight.com• Blog: http://aghy.hu• Email: [email protected]• Twitter: @molnaragnes
Search
Search connects people
to the information
they need to get their jobs done.
Search• „I know what I’m searching for and know how to
do that”
• „I know what I’m searching for but I don’t know how to do that”
• „I don’t know what I’m searching for”
• „Am I Searching?...”
Enterprise Search
• Enterprise – is no longer within the firewall• Relevance is critical• Search within the organization• „Transparent” Search• Search Driven Applications
Search Components
Source: http://searchpatterns.org
Search Based Application (SBA)
• Software Application • Built on a Search Engine backbone rather
than a database infrastructure • Purpose is not classic information
retrieval, but rather mission-oriented information access, analysis or discovery
SBA Examples
12
ChallengesUser ExperienceChallenges
Multiple search interfaces, systems, and logons; no unified search results
Data and ExpertiseChallenges
Files and email on desktops; structured and unstructured data silos; untapped expertise
Enterpriseand IT Challenges
Relevance and ranking; security, privacy & compliance; scalability, manageability & extensibility
Customizations for Search Driven ApplicationsBuilding on an extensible platform
Configure Extend CreateUser Context
LOB ConnectivityContent ProcessingBusiness languageFederation Sources
UI Look & Feel…..
Relevance ProfilesUI & Web PartsResult Rollup
Visual ElementsWorkflowsAnalytics
…..
Custom ElementsWork Environments
New Innovations….
Content Sources
Content Sources in SBAs
• Combine (join) data• Connect data
– Existing relationships in the source system– Newly discovered, cross-system relationships
• Aggregate data• Visualize data
Data Collection / Crawling
• Crawler: – Connects to the Content Source– Enumerates the content– Reads the content items– Extracts the metadata– Sends the collected info back to the Indexer
Data Collection / Crawling• Connector: Enables to access different types of content
• OOTB:– SharePoint– File Share– Web site– Exchange Public Folders– Custom Connectors– (Lotus Notes)– (Documentum)
Natural Language Processing• Crawl/Index Time
– Language Detection– Tokenization– Stemming and Lemmatization
• Query Time– Approximate Spelling– Phonetic Spelling– Word Truncation– Regular Expressions– Semantic Expansion– Rules-based Matching
Processing: Crawled and Managed Properties
• Crawled property: metadata extracted from the documents/items during the crawl.
• Managed property: can appear in refined searches and helps users perform more successful queries
Processing: Crawled and Managed Properties
Processing: Ranking• Ranking: produce results that are ordered according
to some computed relevancy score
• Dynamic: Based on weighted managed properties (title, body, social tags, etc.)
• Static: – File Type– Click through relevancy– Depth
Processing: Ranking
Processing: Relevance Tuning
User Interface• OOTB Web Parts
– Refinement Panel– Core Results Web Part
• Federation• People Search• Scopes• Custom Web Parts
– Visual Navigation– Mashups– Etc.
• Workflows – Act on Items Immediately
Search Federation
• Using remote index for queries• Location type:
– SharePoint Search index– FAST index– OpenSearch 1.0/1.1
Search Federation
Search Federation• Benefits:
– No resources needed for indexing– Custom Credentials– Usage restrictions– Prefix / Pattern match– Query Template
• {searchTerms} scope:Documents• {searchTerms} type:.doc type:.docx type:.docm
• BUT:– Live Internet connection is required– Bandwith– No control over results (order, relevance, etc.)– Separated Web Parts
Search Federation
Summary
• Search Based Applications?– Need to Aggregate Heterogeneous Content– Neet to Process Large Volume of Data– Need for Real Time Information– Need for Ad Hoc Reporting
THANK YOU!DON’T FORGET TO FILL IN THE EVALUATION!
Email: [email protected]: @molnaragnes