Contact information

PromptCloud Inc, 16192 Coastal Highway, Lewes De 19958, Delaware USA 19958

We are available 24/ 7. Call Now. marketing@promptcloud.com

The travel industry is ever changing with exa-bytes of data being updated on a daily (sometimes hourly) basis.

E-mail: sales@promptcloud.com

For organizations operating is such a dynamic industry, access to accurate and structured data at the right time becomes of utmost importance. Here is a use case of our client, where the requirement was to extract travel data daily. Read on to know more:

Client Details

A well-known budget hotel chain who wants to be updated about the hotel prices across India twice a day, to be able to strategize better in terms of pricing as well improve their offerings in general. With ever-growing competition, access to such data becomes a priority to stand out in the industry!

Crawl requirements: Travel Data Daily Required

The client had a specific set of requirements:

  • Check-in Check out dates to be specified by them
  • Such data was to be uploaded to a file sharing server and our crawlers were to pick them up at predefined times and process them. Possible situations:
  • File available in the morning but not in the afternoon
  • File not available in the morning but available in the afternoon
  • File available both in the morning and afternoon
  • No file uploaded in the morning or afternoon
  • Fields for extraction were predefined and in a set order as specified by the client
  • Crawl frequency was twice in a day

Target Sites and Approximate Monthly Volumes

Most of the major OTAs (Online Travel Agents) in India with a volume of around 30 million records per month. Approximately 300,000 to 400,000 records per site per day were delivered based on the files uploaded by the client.

Related Use Cases

Use Case
Data Extraction From Hotel Booking Portals

Large-Scale Price Data Extraction From Hotel Booking Portals

Read More

Use Case

Web Scraping Hotel Prices On Daily Basis

Read More

Use Case

Get Vacation Rental Data Across The Sites

Read More

Approaches Used to tackle the issues

  • We programmed the crawlers to search for the files on the sharing server at a pre-decided time and pick them up, if available. The crawler would check for files once in the morning and once in the afternoon.
  • Additional scripts were written and additional resources were made use of to ensure data delivery happens before 2359 hrs of a particular day.
  • The crawlers were programmed to detect if the currency was in INR. If not INR, the crawlers were programmed to change the currency to INR. This was important when servers from across the world were being used to crawl the data.
  • The crawlers were also programmed to ensure not to hit the target servers very aggressively to avoid being blocked while, at the same time, ensuring that all the necessary data was captured before 2359 hrs.

Benefits delivered

  • Regular & flexible access to the required data as required. Since the crawler picks up data only if available, the client had the flexibility to upload the files only if required.
  • Considering a dedicated team at the client side who were directly involved with this activity, a cost savings of about 23% was achieved by them
  • With a bimonthly inventory crawl, they had access to updated data regularly (fortnightly) and were in a better position to increase their footprint across the country.
  • With a low turnaround time, the data extracted could be used more effectively.
  • Except for the initial onboarding period, the process was completely automated. Any disruption in service was also automatically updated to the support team to ensure that the crawls run in a smooth manner.

Are you looking for a custom data extraction service?

Contact Us