Building Smart Indexes for Drupal Sites
Anant Corporation
Research & Development
Search – Building Smarter Search Indexes
What do we do?
Streamline, Organize and Unify Business Information
Portals | Integration | Search
Agenda
• Overview – What is Search
• Define – Dumb vs. Smart Indexes
• Patterns: Ingestion & Retrieval
• Technologies: What’s Available?
• Questions & Answers
Search – Information Retrieval
Document Retrieval
• Google Search
• Amazon Search
• LinkedIn Search
• *CMS Search
• *Portal Search
• *CRM Search
• * Search
Document Routing
• Google Alerts
• Amazon’s Recommendations
• Netflix Recommendations
• LinkedIn Recommendations
Usual “Search” / Consumer Apps
• Interface - Frontend Layer (UI) is deployed as static files from a CDN
• Software - Business Logic (API) is deployed as stateless services
• Database - Persistent information (Data) is any of SQL/NoSQL/Graph/Index/*
• Systems - Different applications (Systems) are hosted in private/public/* clouds
Define – Smart vs. Dumb Index?
“Dumb” Index
• No “Index” (SQL/NoSQL)
• Keyword matching
• Term / phrase matching
• Basic highlighting
• No annotation
“Smart” Index
• Metadata
• Named entity extraction
• Concepts / keywords
• “Likeness” / clustering
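To make the contrast concrete, here is a minimal Python sketch of the two styles of lookup. It is an illustration, not the presenters' implementation: the `Doc` class, both function names, and the sample documents are hypothetical, and the entity annotations are written by hand where a real pipeline would obtain them from a named entity extraction step.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    doc_id: int
    text: str
    # Hypothetical annotations; a real index would get these from an extraction step.
    entities: set = field(default_factory=set)


def dumb_search(docs, term):
    """'Dumb' lookup: case-insensitive substring match over the raw text only."""
    needle = term.lower()
    return [d.doc_id for d in docs if needle in d.text.lower()]


def smart_search(docs, entity):
    """'Smart' lookup: match against annotated entities rather than raw text."""
    wanted = entity.lower()
    return [d.doc_id for d in docs if wanted in {e.lower() for e in d.entities}]


docs = [
    Doc(1, "Notes on search in Drupal.", {"Drupal"}),
    # The text never mentions Drupal, but the annotation records that it is about Drupal.
    Doc(2, "Our CMS migration finished last week.", {"Drupal", "Migration"}),
]

print(dumb_search(docs, "drupal"))   # only the document whose text contains the word
print(smart_search(docs, "Drupal"))  # both documents, via their annotations
```

The second document is found only by the annotated lookup, which is the gap between keyword matching and an index that carries metadata.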
Example – Smart vs. Dumb Index Item?
Post
• Title
• Content
• Author
• Date
• Tags
• Categories
• Link
SmartPost
• Title
• Content_Raw
• Content_HTML
• Content_Readable
• Author_Name
• Author_Email
• Date
• Tags_User
• Tags_Alchemy
• Tags_OpenCalais
• Categories
• Link
• Link_Thumbnail_Image
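One way to read the Post and SmartPost field lists is as an enrichment step that runs before indexing. The Python sketch below shows that idea under several assumptions that are not in the slides: the `Author` field is assumed to be a "Name <email>" string, `Content_Raw` and `Content_HTML` are assumed to hold the same stored body, and the taggers are passed in as plain callables rather than calls to any real tagging service. The names `readable_text` and `to_smart_post` are hypothetical.

```python
from email.utils import parseaddr
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def readable_text(html):
    """Strip tags and collapse whitespace to produce a plain-text body."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def to_smart_post(post, taggers=None):
    """Expand a Post dict into a SmartPost dict using the slide's field names.

    `taggers` maps a source name (for example "Alchemy") to a callable that
    takes plain text and returns a list of tags. These are stand-ins only.
    """
    taggers = taggers or {}
    text = readable_text(post["Content"])
    name, email = parseaddr(post["Author"])  # assumes "Name <email>" input
    smart = {
        "Title": post["Title"],
        "Content_Raw": post["Content"],   # assumption: body exactly as stored
        "Content_HTML": post["Content"],  # assumption: same body, kept for display
        "Content_Readable": text,
        "Author_Name": name,
        "Author_Email": email,
        "Date": post["Date"],
        "Tags_User": list(post["Tags"]),
        "Categories": list(post["Categories"]),
        "Link": post["Link"],
        "Link_Thumbnail_Image": None,     # not derivable here; left empty
    }
    for source, tagger in taggers.items():
        smart["Tags_" + source] = tagger(text)
    return smart


post = {
    "Title": "Hello",
    "Content": "<p>Smart <b>indexes</b> for Drupal</p>",
    "Author": "Jane Doe <jane@example.com>",
    "Date": "2015-01-01",
    "Tags": ["search"],
    "Categories": ["Technology"],
    "Link": "http://example.com/hello",
}
# A toy tagger standing in for an external annotation service.
smart = to_smart_post(post, {"Alchemy": lambda t: ["Drupal"] if "Drupal" in t else []})
print(smart["Content_Readable"], smart["Tags_Alchemy"])
```

Keeping user tags and machine-generated tags in separate fields, as the slide does, lets a query weight or filter them independently.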
Patterns: Ingestion and Retrieval
• Stage
• Index
• API
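The slide gives only three labels, so the following Python sketch is one possible reading of them: documents are first staged, then written into an inverted index, and finally queried through a small search function standing in for the API. The class name `SearchPipeline`, its methods, and the in-memory data structures are all hypothetical; a production setup would stage content in a queue or database and index it in a search server.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class SearchPipeline:
    def __init__(self):
        self.staged = []               # Stage: documents waiting to be indexed
        self.documents = {}            # indexed documents by id
        self.index = defaultdict(set)  # Index: term -> ids of documents containing it

    def stage(self, doc_id, text):
        """Ingestion step 1: accept a document without making it searchable yet."""
        self.staged.append((doc_id, text))

    def build_index(self):
        """Ingestion step 2: move staged documents into the inverted index."""
        for doc_id, text in self.staged:
            self.documents[doc_id] = text
            for term in tokenize(text):
                self.index[term].add(doc_id)
        self.staged.clear()

    def search(self, query):
        """Retrieval (the 'API'): ids of documents containing every query term."""
        terms = tokenize(query)
        if not terms:
            return []
        matches = set.intersection(*(self.index.get(t, set()) for t in terms))
        return sorted(matches)


pipeline = SearchPipeline()
pipeline.stage(1, "Smart indexes for Drupal")
pipeline.stage(2, "Dumb keyword search")
before = pipeline.search("drupal")  # nothing is searchable until indexing runs
pipeline.build_index()
print(before, pipeline.search("drupal indexes"), pipeline.search("search"))
```

Separating staging from indexing means enrichment or re-indexing can happen without touching the retrieval side.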
Technologies: What’s Available Now?
• Elasticsearch Connector
• Apache Solr Connector
Anant - D.C. Office
Contact
Rahul Singh
• Web: http://anant.us
• Email: [email protected]
• Phone: 1.855.ANANTCO
• 1010 Wisconsin Ave. NW, Suite 250, Washington, D.C. 20007