The client wanted job listings extracted from 20 job sites such as Indeed, CareerBuilder and Monster. The required data points were job titles, locations, wages, company profiles, job descriptions and candidate resumes.
The client provided both the list of source websites and the data points. They wanted fresh data delivered every day. Because a crawler has to be set up specifically for each site in the list, this requirement falls under our site-specific crawl offering. We set up crawlers to extract the required data fields from each of the websites on the client's list. The client wanted the data in CSV format, uploaded to their Dropbox account. Once the initial setup was done, our crawlers began delivering data directly to the client's Dropbox. We delivered close to 2 million job listings in the first crawl and about 200K records of clean, structured data daily thereafter.
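As a rough illustration of the CSV delivery step described above, the following minimal Python sketch turns extracted job records into CSV text. It is not our production pipeline: the column names in `FIELDS` and the `records_to_csv` helper are hypothetical, chosen only to mirror the data points listed earlier, and the upload to Dropbox is left out.

```python
import csv
import io

# Hypothetical column names, assumed from the data points the client requested.
FIELDS = [
    "job_title",
    "location",
    "wage",
    "company_profile",
    "job_description",
    "source_site",
]


def records_to_csv(records):
    """Serialize an iterable of job-record dicts to CSV text with a header row.

    Keys not listed in FIELDS are ignored; missing keys become empty cells.
    """
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, extrasaction="ignore", restval=""
    )
    writer.writeheader()
    for record in records:
        writer.writerow(record)
    return buffer.getvalue()


if __name__ == "__main__":
    sample = [
        {
            "job_title": "Data Engineer",
            "location": "Austin, TX",
            "source_site": "example-job-site",
        }
    ]
    print(records_to_csv(sample))
```

Using a fixed column list keeps every daily file in the same layout even when a given site does not expose a particular field, which makes the files easier to load on the receiving side.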
Benefits to the client:
We took care of the complex technical aspects of data extraction
The initial setup took only a few days, after which data started flowing consistently
Our advanced tech stack handled huge amounts of data effortlessly
The client was able to enrich their job portal with an enormous number of listings within a short period of time
The data cost about one-twentieth of what an in-house setup would have cost the client