The DataSift platform

13
Industry Leading Big Data Platform For Social AGGREGATE across sources for real-time & historic data from a single API PROCESS to filter out noise, extract metadata & categorize to add structure DELIVER into BI tools, enterprise & social apps.

description

 

Transcript of The DataSift platform

Page 1: The DataSift platform

Industry Leading Big Data Platform For SocialAGGREGATE across

sources for real-time & historic data from a single

API

PROCESS to filter out noise, extract metadata & categorize to add structure

DELIVER into BI tools, enterprise & social apps.

Page 2: The DataSift platform

Industry Leading Big Data Platform For Social

Page 3: The DataSift platform

Benefits of DataSiftAggregate Process DeliverDescription Description Description

Single API across 20+ sources

Standard data format

Real-time and historical data access

Enrichments: Increase value by adding meaning to the data

Filtering: Get relevant data with unconstrained sophisticated filters

Categorization: Contextualize and add structure to the data

Multiple options for integration using APIs and pre-built connectors

Guarantee data delivery with DataSift PUSH protocol

Configurable database formats to easily map data into tables in your database

Benefits Benefits BenefitsReduce integration costs

Minimize ongoing maintenance

Differentiate with value-added data

Lower infrastructure costs

Speed up time to market

Lower operational costs

Speed up time to market

Page 4: The DataSift platform

One platform for faster integration• Single API• Standard data format• Real-time and historical

data

AGGREGATE

Broad, Unified & Compliant Access

Page 5: The DataSift platform

Broad Access to Data Sources

Twitter Facebook Sina Weibo WordPress Intense Debate Tumblr Google+

YouTube Bitly Instagram NewsCred Reddit WikipediaDailyMotion

Topix IMDb Videos Blogs Message Boards

Historical data available in addition to real-time

Page 6: The DataSift platform

Enrichments: Increase value by adding meaning to the dataFiltering: Get relevant data with unconstrained sophisticated filters Categorization: Contextualize and add structure to the data

PROCESSEnrich, increase relevance and

contextualize data

Page 7: The DataSift platform

Enrichments: Increase Value By Adding Meaning to Data

Add valuable meta data to the raw feeds in real-time

Get more precise by adding enrichment data to your

filters

Page 8: The DataSift platform

Filtering: Get Relevant Data With Unconstrained Sophisticated Filters

Get only the data you need, avoid paying for, and processing, junk data

• Filter across content, meta data and enrichments

• Create filters visually or using code

• Add data sources quickly by applying one filter across multiple sources

It's really important to be able to curate what's interesting out of social content and surface that. And that really requires a robust platform where we can dig in and figure out exactly what is relevant.- Peter Yared, CTO & CIO, CBS Interactive

Page 9: The DataSift platform

Categorization: Contextualize and Add Structure to the Data

Ready the data for consumption – making it easier to analyze and decreasing time to value

• Define rules for classifying and scoring the data

• Use machine learning to categorize and score the data in real-time

• Leverage out of the box data science with the library of pre-built classifiers

Journalist

Tier-1 Customer

Profile

CRMChurn Content

Page 10: The DataSift platform

Simplify integration and leverage your existing tools to consume the data• Stream data in real-time or pull at

your own pace using push/pull APIs• Leverage pre-built connectors to

popular storage solutions and BI tools• Guarantee data delivery with DataSift

PUSH protocol• Configurable database formats to

easily map data into tables in your databases

DELIVERDemocratize Data

Across Your Apps and Org

Page 11: The DataSift platform

Consume The Data In Your Existing Infrastructure

Pull ConnectorHTTP RedisMongoDB

CouchDBAmazon DynamoDB Amazon S3

Google Big Query

FTP

SFTP ZoomData

ElasticSearch

Splunk Enterprise Streaming APIREST API

Page 12: The DataSift platform

1.5 BillionInteractions per day

2.4 PetabytesTotal data archive

2 TerabytesArchive per day

The mission critical nature of social demands an enterprise class social data provider.

Susan Etlinger, Altimeter

Enterprise Class Solution• Built to scale for social• 99.9% reliability with 24/7 support• Onboarding, training and guided deployment services

Page 13: The DataSift platform

Lower Costs and Faster Time To ValueLower Costs Faster time to value

• Get (and pay for) less junk data• Use a single set of filters for real-

time and historical data• Minimize data cleansing costs• Simplify integration with DataSift

PUSH

• Single API for adding new data sources

• Enrichments provide value add data out of the box

• Categorization & scoring provides “analysis-ready” data

• Consume the data using your existing infrastructure

DataSift does the heavy lifting needed to get rich social data, allowing us to focus on building innovative new features for our applications.

Adam Root, Co-founder and CTO, HipLogiq