Real Time Fuzzy Matching with Spark and Elastic Search (Sonal Goyal, Nube)
© Nube Technologies Real Time Fuzzy Matching With Spark and ElasticSearch
Uploaded by: spark-summit
Category: Data & Analytics
Challenges
● Quadratic problem
● No standard notion of similarity
● Omissions, typos and other issues
● Different languages
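The first three challenges can be made concrete with a small sketch. It is not taken from the talk: the function names, the sample records, and the 0.8 threshold are hypothetical. Python's standard-library `difflib.SequenceMatcher` stands in for "a similarity measure", and it is only one of many possible choices, which is exactly the "no standard notion" point.

```python
from difflib import SequenceMatcher
from itertools import combinations


def pair_count(n: int) -> int:
    """Number of unordered record pairs an all-against-all comparison visits."""
    return n * (n - 1) // 2


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; one possible measure among many."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def naive_matches(records: list[str], threshold: float = 0.8) -> list[tuple[str, str]]:
    """Compare every pair of records and keep those at or above the threshold.

    The threshold is an assumption chosen for this illustration.
    """
    return [
        (a, b)
        for a, b in combinations(records, 2)
        if similarity(a, b) >= threshold
    ]


# Hypothetical sample records, invented for illustration.
records = ["Nube Technologies", "Nube Technologeis", "Spark Summit", "Nube Tech"]
matches = naive_matches(records)
```

With four records the naive approach makes 6 comparisons; `pair_count(1_000_000)` is roughly 5 × 10^11, which is why the all-pairs approach does not scale. The transposed-letter typo still scores above the threshold, while the abbreviation "Nube Tech" does not, showing how much the result depends on the chosen measure and cutoff.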
Other Use Cases
● Cross selling
● Financial Credit Ratings
● Fraud Analytics
● Catalog and inventory management
● Household and individual level analytics
Let's start wishing...
● Data variety
● Scalable
● No manual configuration of rules or algorithms
● Multi language
● Real time
Spark Benefits
● Distributed
● Scalable
● Fast
● Machine Learning
● Sampling
● No need to orchestrate multiple jobs