Blog  
📰  News: Taylor Jobs – A Good, Clean Jobs Dataset

Good, Clean Jobs Data with Taylor

For the past year, Taylor has been developing powerful tools for cleaning and structuring free-text datasets at scale. We've helped our users label job descriptions with occupation codes, classify candidate bios with their work specializations, enrich business descriptions with NAICS codes, and more.

jobs dataset page

Link: Learn more about Taylor Jobs

A lot of this is necessary because the original data, whether purchased from a vendor, or scraped from the web, is a huge mess. Random stuff is in free text fields, HTML escape sequences abound, jobs are mislabeled as remote when they're not, and so much more. To put it nicely, most data vendors' emphasis is definitely on... quantity. 😬

html escape sequence

The dreaded & HTML escape sequence

We're happy to help our users clean all this data up, and we'll definitely keep doing it. But wouldn't it be better if all this job data was just... already good? In that spirit, we've created our first large-scale dataset: Taylor Jobs (opens in a new tab). Our jobs data is collected directly from the source—job postings from employers. We clean it up, add enrichments with our classification and entity extraction models (like O*NET codes and required skills), and make it available to you, with periodic refreshes you don't even have to think about.

Nurse and Construction Worker Cartoon

We already have jobs from over 200 major companies, and that number will continue to grow. Our tools will allow us to provide the highest-quality data on the market, updated every week. In the future, we're excited to offer:

  • Additional high-quality datasets for candidates, companies, and more.
  • Powerful tools for searching and slicing our data to buy only the precise segments you want: precise geographic locations, semantic similarity, keywords, industries...
  • More enrichments to make our data more useful for a broader variety of HR functions, like benchmarking compensation, offering competitive benefits, and sourcing great candidates for a job.

If you're interested in good, clean jobs data, please reach out to us. We'd be happy to provide a sample of our dataset, or even build a custom database for a specific industry (attorneys, doctors, software engineers, you name it!). You can reach us at contact [at] trytaylor.ai, or send a message here (opens in a new tab).