Job Data Overview
Our data can be accessed as a stream of daily data files for each country. You can subscribe to Datafeeds of individual countries, buy historical data starting from 2020 as Datasets, or access all data via our API! Furthermore, since July 2023 we have offered our data on AWS Data Exchange (ADX).
Data Samples / Excerpts
To get a first impression and test the workflow, you can access our Luxembourg dataset for free. Luxembourg is largely dependent on the banking, steel, and industrial sectors. As of 2023, Luxembourg's population stands at 650k with a GDP per capita of 132k USD. The dataset starts on January 1st, 2020, comprises approx. 250k job ads, and grows by roughly 10k job listings monthly.
Alternatively, we have published several Datasets on Kaggle that you can explore:
- US Job Postings from May 5th 2023 (33k records; 805 MB GZip file - 4.46 GB uncompressed JSON)
- Ireland Job Postings from October 2022 (37k records; 101 MB GZip file)
- Ireland Job Postings from October 2021 (25k records; 56 MB GZip file)
- Ireland Job Postings from October 2020 (30k records; 58 MB GZip file)
- International Job Postings from September 2021 (3.4m records; 8 GB GZip file - 50 GB uncompressed)
or download a small JSON file with up to 3 records from 30 sources and 144 countries within April 2023:
- Sample of Job Postings from April 2023 (1.4k records; 3.3 MB JSON file)
If you need a more specific sample, please don't hesitate to contact us at email@example.com and let us know which dataset you are interested in. Our team will get back to you with a relevant sample to help you make an informed decision about purchasing the full dataset.
We store job postings as JSON in our database. We make them available via our API, and we export them in daily data files with a more compact subset of essential data fields. With our job posting data, you'll have all the information you need to build cutting-edge applications and drive success for your company.
Exported Data Fields (via Datafeeds & Datasets)
The job postings in our data files are stored as JSON Lines, a popular format for structured data in which each line is one JSON object. Each posting contains a comprehensive set of essential data fields that can be used to extract valuable insights and inform your business decisions. The Percentage column shows the share of job postings in which the field is present.
|Field Key||Field Type||Percentage||Description|
|source||String||100.00%||The origin of the job posting with a country code, such as 'monster_us'.|
|sourceCC||String (ISO3166)||100.00%||The country code (ISO3166) of the country in which the job posting is located, e.g., 'us'.|
|idInSource||String||100.00%||The ID used in the source the job posting originated from.|
|name||String||100.00%||The title of the job posting.|
|url||String (URL)||100.00%||The link where we found the job posting.|
|text||String||100.00%||The text of the job posting extracted from the html field.|
|html||String (HTML)||100.00%||The original HTML used on the Website or stored in the JSON of the job posting page.|
|json||Object||100.00%||The original JSON objects found in the job posting page including schema.org job posting data if available.|
|referenceID||String||30.40%||An ID stated on the job posting's page, assigned by the original company for internal use.|
|position.*||Object||99.96%||Data concerning the job position such as name, contract type (e.g., Permanent), work type (e.g., full-time), or career level (e.g., Junior).|
|salary.*||Object||22.91%||Data concerning the job salary such as amount and period (e.g., 'Weekly').|
|contact.*||Object||30.83%||Data about contact information such as contact name, email, phone, or physical address.|
|orgTags.*||Object||99.73%||Tags found on the job posting page (e.g., skills such as 'Java'), in the JSON (e.g., benefits) or from the browsing hierarchy (e.g., 'IT Jobs').|
|location.orgAddress.*||Object||100.00%||Data about the location of the job as originally stated on the job posting or its JSON.|
|company.*||Object||100.00%||Data about the company of the job as originally stated on the job posting or its JSON.|
|locale||String||28.24%||The language locale the job posting is written for (might not always be correct).|
|dateCreated||Date||100.00%||The date and time the job posting was created by the company (as stated on the page or its JSON).|
|dateScraped||Date||100.00%||The date and time the job posting was loaded and analyzed.|
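As a rough illustration of working with this format, the following minimal Python sketch reads JSON Lines records and counts how often each top-level field is present, similar in spirit to the Percentage column above. The helper names (`iter_postings`, `iter_postings_from_gzip`, `field_coverage`) and the two sample records are made up for this example, and it assumes a GZip-compressed, UTF-8 encoded file with one JSON object per line; it is not our export code.

```python
import gzip
import io
import json
from collections import Counter


def iter_postings(stream):
    """Yield one job posting (dict) per non-empty JSON Lines row."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def iter_postings_from_gzip(path):
    """Stream postings from a GZip-compressed JSON Lines file (path is hypothetical)."""
    with gzip.open(path, "rt", encoding="utf-8") as stream:
        yield from iter_postings(stream)


def field_coverage(postings):
    """Return the percentage of postings that contain each top-level field."""
    counts = Counter()
    total = 0
    for posting in postings:
        total += 1
        counts.update(posting.keys())
    if total == 0:
        return {}
    return {key: 100.0 * n / total for key, n in counts.items()}


# Two made-up records standing in for a real data file.
sample = (
    '{"source": "monster_us", "sourceCC": "us", "name": "Java Developer", "referenceID": "A-1"}\n'
    '{"source": "monster_us", "sourceCC": "us", "name": "Data Analyst"}\n'
)
coverage = field_coverage(iter_postings(io.StringIO(sample)))
print(coverage)
```

For a downloaded file, `field_coverage(iter_postings_from_gzip(path))` would process the records one at a time, which keeps memory use low for the larger multi-gigabyte files.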
Database Data Fields (via API)
The job postings in our database are stored as JSON and contain the following data fields (generated using variety.js from an April 2023 data slice over all countries). Please note that this list omits some internal data fields as well as the JSON taken from the job posting page.
|Field Key||Field Type||Counted||Percentage|
|orgCompany.info.companySize||String, Number||707 K||8.42%|