Real-time data analysis with Azure Stream Analytics

19
Real-time data analysis with Azure Stream Analytics Michael Johnson

Transcript of Real-time data analysis with Azure Stream Analytics

Page 1: Real-time data analysis with Azure Stream Analytics

Real-time data analysis with Azure Stream Analytics

Michael Johnson

Page 2: Real-time data analysis with Azure Stream Analytics

What is streaming data?

Batch data Streaming data

Page 3: Real-time data analysis with Azure Stream Analytics

Where do we find streaming data

Page 4: Real-time data analysis with Azure Stream Analytics

What is Azure Stream Analytics and why would you want to use it

• What• Real-time event processing engine that helps to unlock deep insights from

data.

• Why• Easy to use

• Scalable

• Reliable, Repeatable and quick to recover

• Low cost

Page 5: Real-time data analysis with Azure Stream Analytics

Azure stream analytics architectureAzure stream analytics

Inputs Outputs

Page 6: Real-time data analysis with Azure Stream Analytics

Input source

• Streaming Data• Event hubs

• Blob storage

• IOT Events hubs

• Reference data• Blob storage

Page 7: Real-time data analysis with Azure Stream Analytics

Output sinks

Azure data lake

Power BI

Azure SQL database Azure document DB Azure Blob storage

Azure Service bus

Azure Event hubAzure table storage

Page 8: Real-time data analysis with Azure Stream Analytics

Query language overview

• DML Statements• SELECT• FROM• WHERE• GROUP BY• HAVING• CASE• JOIN• UNION

• Date and time• DATEPART

• DAY

• MONTH

• YEAR

• DATEDIFF

• DATEADD

• Aggregation functions• SUM

• COUNT

• AVG

• MIN

• MAX

• Scaling functions• WITH

• PARTITION BY

• Windowing functions• WITH

• PARTITION BY

• String functions• LEN

• CONCAT

• CHARINDEX

Page 9: Real-time data analysis with Azure Stream Analytics

What does a query look like?

Page 10: Real-time data analysis with Azure Stream Analytics

Tumbling windowsTUMBLINGWINDOW ( timeunit , windowsize, [offsetsize] )

Page 11: Real-time data analysis with Azure Stream Analytics

Hopping windowHOPPINGWINDOW ( timeunit , windowsize , hopsize, [offsetsize] )

Page 12: Real-time data analysis with Azure Stream Analytics

Sliding windowSLIDINGWINDOW ( timeunit , windowsize )

Page 13: Real-time data analysis with Azure Stream Analytics

Javascript UDF intergration

• Supports stateless, compute only functions

• No external connectivity

• Why• REGEX

• Math libraries

Page 14: Real-time data analysis with Azure Stream Analytics

Machine learning integration

Page 15: Real-time data analysis with Azure Stream Analytics

Geospatial Queries (In public preview)• Geospatial Functions

• types• CreatPoint

• CreatePolygon

• CreateLineString

• Functions• ST_Distance

• ST_WITHIN

• ST_OVERLAP

• ST_INTERSECTS

Page 16: Real-time data analysis with Azure Stream Analytics

Deployment options

Azure portal .Net SDK PowerShell

Page 17: Real-time data analysis with Azure Stream Analytics

Pricing

• Streaming unit• 1 streaming unit = 1MB/Second

• 1 x 60 x 60 x 24 = 84.375 GB/day

• Data volume + streaming unit cost• Data volume = $ 0.001 or R 0.014 per GB

• Streaming unit = $ 0.12 or R1.656 per hour

• If run at max through put for a month• Total data ingested = 2.47 TB

• Data = R 35.44

• Streaming Unit = R1 192.32

• Total cost = R 1 227.76

Page 18: Real-time data analysis with Azure Stream Analytics

DEMO

Page 19: Real-time data analysis with Azure Stream Analytics

Questions