AWS_Data_Pipeline
Click here to load reader
-
Upload
ahasan-habib -
Category
Documents
-
view
98 -
download
2
Transcript of AWS_Data_Pipeline
![Page 1: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/1.jpg)
AWS Data Pipeline
~ Ahasan Habib
Technical Project Manager,
Ixora Solutions Ltd.
Dhaka, Bangladesh
![Page 2: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/2.jpg)
What is AWS Data Pipeline?
● Webservice
● Movement & Data transformation
● Data driven workflow
![Page 3: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/3.jpg)
Benefits
● Sequence, Schedule, Run, Manage recurring data processing workloads
reliably.
● Cost effective
● Easy to design ETL
● Support for both structure and unstructure data
● Support on premises and cloud
![Page 4: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/4.jpg)
Data Pipeline Components
● Pipeline Definition
● Pipeline Schedules & run tasks
● Task Runner
![Page 5: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/5.jpg)
Data Pipeline Objects
● ShellCommand Activity
● S3 Data Node
{
"id" : "CreateDirectory",
"type" : "ShellCommandActivity",
"command" : "mkdir new-directory"
}
{
"id" : "OutputData",
"type" : "S3DataNode",
"schedule" : { "ref" : "CopyPeriod" },
"filePath" :
"s3://myBucket/#{@scheduledStartTime}.csv"
}
![Page 6: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/6.jpg)
● EC2 Resource
● Schedule {
"id" : "Hourly",
"type" : "Schedule",
"period" : "1 hours",
"startDateTime" : "2012-09-
01T00:00:00",
"endDateTime" : "2012-10-
01T00:00:00"
}
{
"id" : "MyEC2Resource",
"type" : "Ec2Resource",
"actionOnTaskFailure" : "terminate",
"actionOnResourceFailure" : "retryAll",
"maximumRetries" : "1",
"instanceType" : "m1.medium",
"securityGroups" : [
"test-group",
"default"
],
"keyPair" : "my-key-pair"
}
![Page 7: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/7.jpg)
Work with Other AWS Services
● Amozon Dynamo DB
● Amaxon RDS
● Amazon Redshift
● Amazon S3
● EC2
![Page 8: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/8.jpg)
Accessing Data Pipeline
● Amazon Management Console
● AWS CLI
● AWS SDK
● QUERY API
![Page 9: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/9.jpg)
Create Data Pipeline
● Compose Pipeline Definition objects in a file
● Definition File Structure
{
"id": "S3DataInput",
"type": "S3DataNode",
"schedule": {"ref": "TheSchedule"},
"filePath": "s3://bucket_name",
"myCustomField": "This is a custom value in a custom field.",
"my_customFieldReference": {"ref":"AnotherPipelineComponent"}
}
![Page 10: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/10.jpg)
Step 1
![Page 11: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/11.jpg)
Step 2
![Page 12: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/12.jpg)
Step 3
![Page 13: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/13.jpg)
![Page 14: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/14.jpg)
Notification
● SNS
● Push Delivery
● Pub/sub Model
![Page 15: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/15.jpg)
Q & A
![Page 16: AWS_Data_Pipeline](https://reader038.fdocuments.in/reader038/viewer/2022101314/58ee5c0b1a28abfd578b45d9/html5/thumbnails/16.jpg)
“There's a lot of difference between listening and
hearing.”
~G.K. Chesterton
THANK YOU