Introduction to Solr
erik.hatcher@lucidimagination.com
Sunday, October 16, 2011
Abstract
Apache Solr serves search requests at enterprises and at the largest companies around the world. Built on top of the top-notch Apache Lucene library, Solr makes it straightforward to integrate indexing and searching into your applications.
Solr provides faceted navigation, spell checking, highlighting, clustering, grouping, and other search features. Solr also scales query volume with replication and collection size with distributed capabilities. Solr can index rich documents such as PDF, Word, HTML, and other file types.
Come learn how you can get your content into Solr and integrate it into your applications!
About me...
http://lucene.apache.org/solr/
http://lucene.apache.org/
Solr
• Simple: easy to use
• Powerful: feature rich and scales
• Open Source
• from the "Lucene people"
• encapsulates Lucene best practices
Fire It Up
• cd example
• java -jar start.jar
• [cd example/exampledocs; java -jar post.jar *.xml]
Indexing
• /update[/csv|/json|/extract]
• stream from local, remote, or POST data
• tutorial:
• cd example/exampledocs
• java -jar post.jar *.xml
• Tip: java -jar post.jar -help
Indexing JSON
POST to /update/json
[ {"id" : "1", "title" : "Doc One"},
  {"id" : "2", "title" : "Doc Two"} ]
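The POST above can be sketched in Python using only the standard library. This is a minimal illustration, not part of the original slides: the helper name `build_json_update` is hypothetical, the base URL is borrowed from the curl and search examples elsewhere in this deck, and the `application/json` content type is an assumption. The request is built but not sent, so the sketch runs without a Solr server.

```python
import json
import urllib.request

# Assumed base URL: the local example server used elsewhere in this deck.
SOLR_BASE = "http://localhost:8983/solr"


def build_json_update(docs, base=SOLR_BASE):
    """Build (without sending) a POST request for the /update/json handler.

    Hypothetical helper for illustration only.
    """
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url=base + "/update/json",
        data=body,
        headers={"Content-Type": "application/json"},  # assumed content type
        method="POST",
    )


# The two documents shown on the slide.
docs = [
    {"id": "1", "title": "Doc One"},
    {"id": "2", "title": "Doc Two"},
]
request = build_json_update(docs)
print(request.get_method(), request.full_url)
# With a Solr instance running, urllib.request.urlopen(request) would send it.
```

Building the request separately from sending it makes it easy to inspect the exact body and headers before any data reaches the index.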
Indexing CSV
curl http://localhost:8983/solr/update/csv --data-binary @data.csv -H 'Content-type:text/plain; charset=utf-8'
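A rough Python counterpart to the curl command, for cases where the CSV is generated in code rather than read from `data.csv`. The helper name `build_csv_update` and the sample rows are hypothetical; the URL and content type come from the curl line. The sketch assumes the handler's default of reading field names from the first CSV line, and it only builds the request without sending it.

```python
import csv
import io
import urllib.request

# Host, port, and path prefix taken from the curl command.
SOLR_BASE = "http://localhost:8983/solr"


def build_csv_update(rows, fieldnames, base=SOLR_BASE):
    """Serialize rows to CSV and wrap them in a POST to /update/csv.

    Hypothetical helper; assumes the first CSV line names the fields.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return urllib.request.Request(
        url=base + "/update/csv",
        data=buffer.getvalue().encode("utf-8"),
        headers={"Content-Type": "text/plain; charset=utf-8"},
        method="POST",
    )


# Illustrative rows only; the contents of data.csv are not given in the slides.
rows = [
    {"id": "1", "title": "Doc One"},
    {"id": "2", "title": "Doc Two"},
]
csv_request = build_csv_update(rows, fieldnames=["id", "title"])
print(csv_request.data.decode("utf-8"))
```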
Indexing Rich Documents
http://localhost:8983/solr/update/extract
• ?stream.file=/path/to/file.doc
• &stream.contentType=application/msword
• &literal.id=ds1-file.doc
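Assembling that URL by hand is error-prone once paths contain spaces or other reserved characters. A small sketch of building it with `urllib.parse.urlencode` follows; the helper name `build_extract_url` is hypothetical, and the parameter names and values are the ones on the slide.

```python
from urllib.parse import urlencode

# Host, port, and path prefix as shown on the slide.
SOLR_BASE = "http://localhost:8983/solr"


def build_extract_url(path, content_type, doc_id, base=SOLR_BASE):
    """Build the /update/extract URL for a file readable by the Solr server.

    Hypothetical helper for illustration only.
    """
    params = {
        "stream.file": path,                 # file path as seen by the server
        "stream.contentType": content_type,  # MIME type of the file
        "literal.id": doc_id,                # literal value for the id field
    }
    # safe="/" keeps slashes readable; other reserved characters are escaped.
    return base + "/update/extract?" + urlencode(params, safe="/")


extract_url = build_extract_url(
    "/path/to/file.doc", "application/msword", "ds1-file.doc"
)
print(extract_url)
```

Note that `stream.file` refers to a path on the machine running Solr, not on the client issuing the request.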
Other conduits
• DataImportHandler (DIH)
• APIs: SolrJ, RSolr, (py)solr(.py), etc.
• It's just data over HTTP
• "Enterprise"
• LucidWorks: SharePoint, (split) crawling, S3, HDFS, etc; including access control
Searching
• http://localhost:8983/solr/select?q=*:*
• Typical-looking Solr request: http://localhost:8983/solr/select +
• ?q=ipod
• &facet=on
• &facet.field=cat
• &fq=cat:electronics
• [&rows=10&start=20]
• [&fl=id,name,price&sort=price asc]
• [&wt=xml|json|csv|ruby|python|php|xslt|velocity&indent=on]
• [&debugQuery=true]
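The request parameters listed above can be composed programmatically. The following is a minimal sketch with a hypothetical helper, `build_select_url`, that uses only parameter names from the slide. It assumes zero-based page numbers, so `start` is `page * rows`; the slide's `rows=10&start=20` therefore corresponds to the third page.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Host, port, and path prefix as shown on the slide.
SOLR_BASE = "http://localhost:8983/solr"


def build_select_url(query, page=0, rows=10, filters=(), facet_fields=(),
                     fields=None, sort=None, wt="json", base=SOLR_BASE):
    """Compose a /select URL from the parameters listed on the slide.

    Hypothetical helper; `page` is zero-based, so start = page * rows.
    """
    params = [("q", query)]
    if facet_fields:
        params.append(("facet", "on"))
        params.extend(("facet.field", name) for name in facet_fields)
    params.extend(("fq", clause) for clause in filters)
    params.append(("rows", rows))
    params.append(("start", page * rows))
    if fields:
        params.append(("fl", ",".join(fields)))
    if sort:
        params.append(("sort", sort))
    params.append(("wt", wt))
    return base + "/select?" + urlencode(params)


url = build_select_url(
    "ipod",
    page=2,
    filters=["cat:electronics"],
    facet_fields=["cat"],
    fields=["id", "name", "price"],
    sort="price asc",
)
print(url)
# Decode the query string again to check what the server would receive.
print(parse_qs(urlsplit(url).query)["start"])  # prints ['20']
```

Using a list of pairs instead of a dict allows repeated parameters such as several `fq` or `facet.field` entries in one request.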
/browse
http://www.apache.org/
"Heavy Committing"
... works
search platform
www.lucidimagination.com
Events