What enterprises do with big data- Part 1

Post on 13-Jan-2015

592 views 1 download

Tags:

description

How enterprises have used big data for their portfolio

Transcript of What enterprises do with big data- Part 1

Big Data made Small

What enterprises do with Big Data- Part 1

© PromptCloud Technologies 2013, All rights reserved 1

Notes

2

a) This is a list of simplified requirement statements from ongoing/past projects (not verbatim and in no specific order).

b) You can easily assume that data delivered was in a

structured format.

© PromptCloud Technologies 2013, All rights reserved

3

1. Collect data from Twitter every 5 minutes from specific geographies and categories based on a set of keywords. Our USP is social listening and this data is a crucial part of our business.

© PromptCloud Technologies 2013, All rights reserved

4 © PromptCloud Technologies 2013, All rights reserved

2. We talk social travel and so are building an intelligent social engine for finding and booking hotels. In order to get this running, we'd like you to provide reviews, the reviewer profiles, hotel and restaurant addresses from these popular travel sites and forums. Data could be in the range of tens of millions.

5 © PromptCloud Technologies 2013, All rights reserved

3. We're in the brand monitoring zone. So we'd like you to first collect all reviews belonging to say Nike, and then index it for us so that we could see what people are saying. If you could give us query formats using which say, I could see how many people said “bad” and how many said “good”, etc. that would be ideal.

6

4. Give me all the stories about my interest list of celebrities based on a list of keywords that I provide (like, eat, drink, travel....) from about 400 sources and Twitter. I'm launching a celebrity gossip website in multiple countries.

© PromptCloud Technologies 2013, All rights reserved

7

5. Get me all products with all its fields (name, descriptions, price, specs) present on these supermarket stores that are all AJAX. And sorry! They are in Hebrew.

© PromptCloud Technologies 2013, All rights reserved

8

6. I'm so obsessed with near-real time data (sorry I belong to media) that I need news of any deals, acquisitions, mergers, or any other news based on the phrases I provide to you and expect the feed within minutes of the same being published somewhere.

© PromptCloud Technologies 2013, All rights reserved

9

7. We are in the used car inventory space. There are few platforms that automobile sites use for a set of cars. Please get me data from such places on a daily basis at these particular times in the day. I need both English and French data and I'm interested in both XML and CSV formats.

© PromptCloud Technologies 2013, All rights reserved

10

8. I need data from all the tech support discussion forums from all of these sites in this particular format. I expect about 2 million records from here.

© PromptCloud Technologies 2013, All rights reserved

11

9. We are looking for laptop reviews that have these operating systems. This is our initial list of sources and would be great if you could come up with others for us. We desire weekly updates of these reviews.

© PromptCloud Technologies 2013, All rights reserved

12

10. You get me all high resolution images from these 200 web stores so that whenever a user bookmarks a product on my website, my algorithm can show them those related products to compare prices and eventually make a buying decision.

© PromptCloud Technologies 2013, All rights reserved

13

11. We offer great discounts on tickets for events and games. Please crawl these ticketing sites for us so we have an inventory of all events with their seating-level prices which we can use to offer discounts and run our analyses.

© PromptCloud Technologies 2013, All rights reserved

14

12. We acquire new and updated ‘Pending Legislation’ (as it is being debated in Govt. prior to becoming a Bill/Law) documents and extract associated metadata from legislature websites. The document metadata may be available on the web page where we download it from, or within the document – which may be in HTML or binary formats – e.g. Word, PDF. We need to extract all of this data for our clients.

© PromptCloud Technologies 2013, All rights reserved

15

13. We're in the video-gaming industry where we perform research on video games and provide consulting. We need to extract data daily from all popular gaming sites and gather news, articles and their popularity.

© PromptCloud Technologies 2013, All rights reserved

16

14. Get me product feeds from the Indian E-commerce market with all product-level details and specifications. I need this to build an analytics engine.

© PromptCloud Technologies 2013, All rights reserved

17

15. We'd be interested in these job portal sites and would like to receive updates on a daily basis on the jobs posted in our country. We're developing solutions for the digital classifieds markets.

© PromptCloud Technologies 2013, All rights reserved

18

16. We are a social strategy and analytics firm and need lot of data to do some social data mining. We have about 200 sites which are a mix of blogs, news, forums, articles, travel sites and others, many of which are non-English. Please extract data as you find relevant to the domain. We need updates on a daily basis.

© PromptCloud Technologies 2013, All rights reserved

19

17. Our clients (who are large-scale manufacturing companies) would like to see how their high-value products are doing in the market. So we'll need reviews of this list of products including review date, author, content, review helpful, recommends and other such details from these set of sites.

© PromptCloud Technologies 2013, All rights reserved

20

18. I'm developing a comparison shopping engine that I can feed in data from other sources and have my users compare and shop and see some price trends. Please facilitate this data.

© PromptCloud Technologies 2013, All rights reserved

21

19. I am in the healthcare industry looking to create an inventory of all healthcare-related products from these stores. I need to go to the last level of detail and capture everything possible.

© PromptCloud Technologies 2013, All rights reserved

22

20. We're interested in creating a database of all companies in India that are less than x years old, greater than y in revenue, belong to these industries and provide these specific services.

© PromptCloud Technologies 2013, All rights reserved

We like to remind “Why Us”?

23

Price

•Flexible Pricing based on size and frequency of crawls

Performance

•Low ETA’s •Precision Extraction •Exhaustive data available as feed

Technology

•Highly Scalable •Access to real-time data

Making big data small to alleviate tech-aches

© PromptCloud Technologies 2013, All rights reserved

24

For details, contact Email: info@promptcloud.com

Phone: +91-96 86 56 70 70

Watch out for the next batch…

© PromptCloud Technologies 2013, All rights reserved