Job Data Overview
Our Data
Our data is provided in JSON format via an API or through downloadable daily data files for each country. You can subscribe to our Job Posting Datafeeds of individual countries, buy historical data starting from 2020 as Job Posting Datasets, or access specific slices of our Data via our Job Postings API (e.g., all Healthcare jobs for the EU)!
We offer our job posting data files on AWS Data Exchange (ADX) and our Job Postings API on RapidAPI.
Dataset Samples
To get first impression and test the workflow you can access our Luxembourg job dataset for free. Luxembourg largely dependent on the banking, steel, and industrial sectors. As of 2023, Luxembourg's population stands at 650k with a GDP per capita of 132k USD. The dataset starts January 1st, 2020, comprises of approx. 250k job ads, and grows by roughly 10k job listings monthly.
Alternatively, we have published several Datasets on Kaggle that you can explore:
- US Job Postings from May 5th 2023 (33k records; 805 MB GZip file - 4.46 GB uncompressed JSON)
- Ireland Job Postings from October 2022 (37k records; 101 MB GZip file)
- Ireland Job Postings from October 2021 (25k records; 56 MB GZip file)
- Ireland Job Postings from October 2020 (30k records; 58 MB GZip file)
- International Job Postings from September 2021 (3.4m records; 8 GB GZip file - 50 GB uncompressed)
or download a small JSON file with upto 3 records from 30 sources and 144 countries within April 2023:
- Sample of Job Postings from April 2023 (1.4k records; 3.3 MB JSON file)
If you need a more specific sample, please don't hesitate to contact our Data Team and let us know which dataset you are interested in. Our team will get back to you with a relevant sample to help you make an informed decision about purchasing the full dataset.
Dataset Data Fields
We store job postings as JSON in our Database and export it in daily data files with a more compact subset of essential data fields. We make them available via our API and we export them in daily datafiles with a subset of essential data fields. With our job posting data, you'll have all the information you need to build cutting-edge applications and drive success for your company.
Exported Data Fields (via Datafeeds & Datasets)
Our job postings in the datafiles is stored using JSON Lines, a popular format for structured data. Each posting contains a comprehensive set of essential data fields that can be used to extract valuable insights and inform your business decisions. The following Table lists the fields available in the data files as of August 2023.
Field Key | Field Type | Percentage | Description |
---|---|---|---|
source | String | 100.00% | The origin of the job posting with a countrycode such as 'monster_us'. |
sourceCC | String (ISO3166) | 100.00% | The countrycode (ISO3166) the job posting is located, e.g., 'us'. |
idInSource | String | 100.00% | The ID used in the source the job posting originated from. |
name | String | 100.00% | The title of the job posting. |
url | String (URL) | 100.00% | The link where we found the job posting. |
text | String | 100.00% | The text of the job posting extracted from the html field. |
html | String (HTML) | 100.00% | The original HTML used on the Website or stored in the JSON of the job posting page. |
json | Object | 100.00% | The original JSON objects found in the job posting page including schema.org job posting data if available. |
referenceID | String | 30.40% | An ID stated on the job postings page given by the original company for internal use. |
position.* | Object | 99.96% | Data concerning the job position such as name, contract type (e.g., Permanent), work type (e.g., full-time), or career level (e.g., Junior). |
salary.* | Object | 22.91% | Data concerning the job salary such as amount and period (e.g., 'Weekly'). |
contact.* | Object | 30.83% | Data about contact information such as contact name, email, phone, or physical address. |
orgTags.* | Object | 99.73% | Tags found on the job posting page (e.g., skills such as 'Java'), in the JSON (e.g., benefits) or from the browsing hierarchy (e.g., 'IT Jobs'). |
location.orgAddress.* | Object | 100.00% | Data about the location of the job as originally stated on the job posting or its JSON. |
company.* | Object | 100.00% | Data about the company of the job as originally stated on the job posting or its JSON. |
locale | String | 28.24% | The language locale the job posting is written for (might not always be correct). |
dateCreated | Date | 100.00% | The date and time the job posting was created by the company (as stated on the page or its JSON). |
dateScraped | Date | 100.00% | The date and time the job posting was loaded and analyzed. |
Database Data Fields
The job postings in our database are stored using JSON and contain the following data fields (generated using variety.js from an April 2023 data slice over all countries). Please note that we do not list some internal data fields as well as the JSON taken from the job posting page.
Field Key | Field Type | Counted | Percentage |
---|---|---|---|
_id | ObjectId | 8 M | 100.00% |
contact | Object | 3 M | 30.83% |
contact.address | String | 74 K | 0.89% |
contact.email | String | 845 K | 10.06% |
contact.name | String | 281 K | 3.34% |
contact.phone | String | 2 M | 24.12% |
dateCreated | Date | 8 M | 100.00% |
dateExpired | Date | 4 M | 48.71% |
dateScraped | Date | 8 M | 100.00% |
datesUpdated | Array | 452 K | 5.38% |
html | String | 8 M | 100.00% |
idInSource | String | 8 M | 100.00% |
json | Object | 6 M | 73.42% |
lastUpdated | Date | 61 K | 0.73% |
locale | String | 2 M | 28.24% |
name | String | 8 M | 100.00% |
orgAddress | Object | 8 M | 100.00% |
orgAddress.addressLine | String | 8 M | 100.00% |
orgAddress.city | String | 6 M | 75.15% |
orgAddress.companyName | String | 8 M | 100.00% |
orgAddress.country | String | 8 M | 98.31% |
orgAddress.countryCode | String | 8 M | 95.65% |
orgAddress.county | String | 8 M | 99.55% |
orgAddress.district | String | 8 M | 98.04% |
orgAddress.formatted | String | 8 M | 100.00% |
orgAddress.geoPoint | Object | 2 M | 28.98% |
orgAddress.geoPoint.lat | Number | 2 M | 28.98% |
orgAddress.geoPoint.lng | Number | 2 M | 28.98% |
orgAddress.houseNumber | String | 8 M | 99.49% |
orgAddress.level | Number | 8 M | 100.00% |
orgAddress.postCode | String | 3 M | 37.29% |
orgAddress.quarter | String | 8 M | 99.09% |
orgAddress.source | String | 933 K | 11.11% |
orgAddress.state | String | 6 M | 66.03% |
orgAddress.street | String | 4 M | 41.80% |
orgCompany | Object | 8 M | 100.00% |
orgCompany.description | String | 6 M | 73.49% |
orgCompany.idInSource | String | 8 M | 100.00% |
orgCompany.imgCover | String | 8 M | 98.90% |
orgCompany.imgLogo | String | 8 M | 98.97% |
orgCompany.info | Object | 809 K | 9.63% |
orgCompany.info.careerpageURL | String | 16 K | 0.19% |
orgCompany.info.companyEmailJobs | String | 101 K | 1.20% |
orgCompany.info.companyEmailMain | String | 101 K | 1.20% |
orgCompany.info.companyPhoneMain | String | 101 K | 1.20% |
orgCompany.info.companySize | String, Number | 707 K | 8.42% |
orgCompany.info.companySizeRange | String | 166 K | 1.98% |
orgCompany.info.companyType | String | 306 K | 3.64% |
orgCompany.info.dateIncorporated | Date | 154 K | 1.84% |
orgCompany.info.foundingDate | Number | 154 K | 1.84% |
orgCompany.info.revenue | String | 296 K | 3.52% |
orgCompany.name | String | 8 M | 100.00% |
orgCompany.nameOrg | String | 8 M | 100.00% |
orgCompany.orgTags | Object | 306 K | 3.64% |
orgCompany.orgTags.INDUSTRIES | Array | 306 K | 3.64% |
orgCompany.registryID | String | 8 M | 98.45% |
orgCompany.source | String | 8 M | 100.00% |
orgCompany.sourceCC | String | 8 M | 100.00% |
orgCompany.url | String | 4 M | 42.33% |
orgTags | Object | 8 M | 99.73% |
orgTags.CATEGORIES | Array | 5 M | 64.52% |
orgTags.COMPANY_BENEFITS | Array | 273 K | 3.26% |
orgTags.COMPANY_TYPES | Array | 6 K | 0.07% |
orgTags.CONTRACT_TYPES | Array | 86 K | 1.03% |
orgTags.INDUSTRIES | Array | 3 M | 36.63% |
orgTags.JOBNAMES | Array | 3 M | 35.34% |
orgTags.KEYWORDS | Array | 2 M | 21.70% |
orgTags.LANGUAGES | Array | 50 K | 0.59% |
orgTags.QUALIFICATIONS | Array | 644 K | 7.67% |
orgTags.REQUIREMENTS | Array | 462 K | 5.51% |
orgTags.RESPONSIBILITIES | Array | 77 K | 0.91% |
orgTags.SKILLS | Array | 679 K | 8.09% |
orgTags.WORK_TYPES | Array | 75 K | 0.90% |
position | Object | 8 M | 99.96% |
position.careerLevel | String | 2 M | 23.09% |
position.contractType | String | 2 M | 26.72% |
position.department | String | 1 M | 11.93% |
position.name | String | 8 M | 99.79% |
position.workType | String | 5 M | 60.73% |
referenceID | String | 3 M | 30.40% |
salary | Object | 2 M | 22.91% |
salary.period | String | 1 M | 12.08% |
salary.text | String | 1 M | 15.17% |
salary.value | Number | 1 M | 12.01% |
source | String | 8 M | 100.00% |
sourceCC | String | 8 M | 100.00% |
text | String | 8 M | 100.00% |
url | String | 8 M | 100.00% |