Building Smart Indexes for Drupal Sites

11
Anant Corporation Research & Development Search – Building Smarter Search Indexes

Transcript of Building Smart Indexes for Drupal Sites

Page 1: Building Smart Indexes for Drupal Sites

Anant Corporation

Research & DevelopmentSearch – Building Smarter Search Indexes

Page 2: Building Smart Indexes for Drupal Sites

What do we do?

Streamline, Organize and Unify Business InformationPortals | Integration | Search

Page 3: Building Smart Indexes for Drupal Sites

Agenda

•Overview – What is Search•Define – Dumb vs. Smart Indexes•Patterns: Ingestion & Retrieval•Technologies: What’s Available?•Questions & Answers

Page 4: Building Smart Indexes for Drupal Sites

Search – Information RetrievalDocument Retrieval

• Google Search• Amazon Search• LinkedIN Search• *CMS Search• *Portal Search• *CRM Search• * Search

Document Routing

• Google Alerts• Amazon’s

Recommendations• Netflix

Recommendations• LinkedIN

Recommendations

Page 5: Building Smart Indexes for Drupal Sites

Usual “Search” / Consumer Apps• Interface - Frontend Layer (UI) is

Deployed as Static Files from CDN• Software - Business Logic (API) is

Deployed as Stateless Services • Database - Persistent information (Data)

is any of SQL/NoSQL/Graph/Index/*• Systems - Different Applications

(Systems) are hosted in private/public/* clouds

Page 6: Building Smart Indexes for Drupal Sites

Define – Smart vs. Dumb Index?“Dumb” Index

• No “Index” (SQL/NoSQL)

• Keyword matching• Term / phrase

matching• Basic highlighting• No Annotation

“Smart” Index

• Meta Data • Named Entity

Extraction • Concepts /

Keywords • “Likeness” /

Clustering

Page 7: Building Smart Indexes for Drupal Sites

Post

• Title• Content• Author• Date• Tags• Categories• Link

SmartPost

• Title• Content_Raw• Content_HTML• Content_Readable• Author_Name• Author_Email• Date• Tags_User• Tags_Alchemy• Tags_OpenCalais• Categories• Link• Link_Thumbnail_Image

Example – Smart vs. Dumb Index Item ?

Page 8: Building Smart Indexes for Drupal Sites

Patterns : Ingestion and Retrieval

Stage

Index

API

Page 9: Building Smart Indexes for Drupal Sites

Technologies : What’s Available Now?

Elastic Search Connector Apache Solr Connector

Page 10: Building Smart Indexes for Drupal Sites

Anant - D.C. Office

Page 11: Building Smart Indexes for Drupal Sites

ContactRahul Singh

• Web: http://anant.us• Email: [email protected]• Phone: 1.855.ANANTCO• 1010 Wisconsin Ave. NW,

Suite 250Washington, D.C. 20007