Real-time Data De-duplication using Locality-sensitive Hashing powered by Storm and Riak (Berlin Buzzwords 2014)
Hacking Lucene and Solr for Fun and Profit
Data IO: Next Generation Search with Lucene and Solr 4
Open Source Search FTW
Cassandra Explained
Flexible Indexing in Lucene 4.0
Low latency scalable web crawling on Apache Storm
Making Apache Hadoop Secure (Devaraj Das, Yahoo’s Hadoop Team)
Schindler Uwe - Flexible Indexing in Lucene 4.0