Is Crawling Legal?

Post on 02-Dec-2014


description

Crawling means automatically fetching the data you want from a website. But is it legal? Are there rules to follow? More at PromptCloud.com

Transcript of Is Crawling Legal?

IS CRAWLING LEGAL?

COMMON CRAWLING QUERIES…

CAN YOU CRAWL AMAZON.COM?

GET PROFILE DATA FROM LINKEDIN?

E-COMMERCE DATA?

GIVE TWEETS FOR #iPHONE6?

TWITTER DATA?

VITAL QUESTION

IS IT LEGAL TO GET THIS DATA?

YES & NO

NOT REALLY!

WEB CRAWLER?

CRAWLING IS AUTOMATED FETCHING OF WEBPAGE CONTENT

CRAWLING TABOO?

QUITE OFTEN USED AGAINST WEBSITE POLICIES & BREAKS THE GROUND RULES OF CRAWLING

I. ROBOTS.TXT

TELLS YOU WHICH URLS CAN AND CANNOT BE CRAWLED!

RARELY BOT-SPECIFIC

ONLY A GUIDELINE

NOT LEGALLY ENFORCEABLE
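As a minimal sketch of the robots.txt rule, the check below uses Python's standard urllib.robotparser to ask whether a URL may be fetched. The robots.txt content, the bot name example-bot, and the example.com URLs are hypothetical and only for illustration; a real crawler would load the file from the site it intends to crawl.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, inlined so the sketch runs without network access.
# The rules apply to every bot ("*"), which matches the "rarely bot-specific" point.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Crawl-delay: 5
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the parsed rules permit user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(SAMPLE_ROBOTS_TXT)

if __name__ == "__main__":
    for url in ("https://example.com/products/1",
                "https://example.com/private/data"):
        print(url, "allowed" if is_allowed(parser, "example-bot", url) else "disallowed")
```

Because robots.txt is only a guideline, nothing stops a crawler from skipping this check; running it before every fetch is simply the polite default.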

II. PUBLIC CONTENT

CRAWL ONLY PUBLIC CONTENT!

COPYRIGHT MUST NOT BE NEGLECTED

USE IT WELL

III. TERMS OF USE

CHECK BEFORE CRAWLING!

IV. AUTHENTICATION REQUIRED

BEFORE ACCESSING CONTENT

NO BOT POLICY. HUMANS ONLY!

V. CRAWL DELAY

MAINTAIN DELAY BETWEEN CRAWLS

HIT SERVER TOO HARD, TOO FAST…

CHANCES ARE THAT YOUR IPs WILL BE BLOCKED!
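One way to maintain a delay between crawls is a small per-host throttle that is called before every request. This is only a sketch under stated assumptions: the class name PoliteThrottle, the 5-second default, and the example.com URLs are hypothetical, and the right delay depends on the site (some publish one in robots.txt).

```python
import time
from typing import Callable, Dict
from urllib.parse import urlparse


class PoliteThrottle:
    """Enforce a minimum delay between requests to the same host."""

    def __init__(self,
                 delay_seconds: float = 5.0,  # assumed default, not a standard value
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        self.delay_seconds = delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._next_allowed: Dict[str, float] = {}  # host -> earliest next request time

    def wait(self, url: str) -> float:
        """Sleep if the host was hit too recently; return the seconds slept."""
        host = urlparse(url).netloc
        now = self._clock()
        earliest = self._next_allowed.get(host, now)
        pause = max(0.0, earliest - now)
        if pause > 0:
            self._sleep(pause)
        # The request happens at now + pause; the next one must wait a full delay.
        self._next_allowed[host] = now + pause + self.delay_seconds
        return pause


if __name__ == "__main__":
    throttle = PoliteThrottle(delay_seconds=1.0)
    for page in ("https://example.com/a", "https://example.com/b"):
        slept = throttle.wait(page)
        print(f"slept {slept:.1f}s before {page}")  # the actual fetch would go here
```

Tracking the delay per host means a crawler can still work through several sites in parallel without hitting any single server too hard.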

WHY ALLOW CRAWLING?

• CONTENT REACHES PUBLIC
  • Crawling increases content discovery as long as rules are followed

• SITES HAVE TRUCKLOADS OF INFORMATION!
  • Bots assimilate entire site data automatically

• CRAWLING YIELDS PRECIOUS DATA
  • Businesses gain competitive advantage
  • Data Analytics gives the edge here

VERDICT?

CRAWLING ISN’T STRICTLY ‘ILLEGAL’

BE POLITE

FOLLOW THE GROUND RULES

UNLESS…

YOU ASK…

WHAT IS THE DATA BEING GATHERED?

WHAT IS ITS USE?

MORE HERE

www.promptcloud.com/blog

DATA CRAWLING REQUIREMENT?

sales@promptcloud.com