explaingit

carterperez-dev/exs-cyberjob-scraper

21ElixirAudience · generalComplexity · 2/5Setup · easy

TLDR

An Elixir tool that scans hundreds of thousands of job postings from Greenhouse, Lever, Ashby, and Workday and ranks which cybersecurity certifications employers request most often.

Mindmap

mindmap
  root((exs-cyberjob-scraper))
    What it does
      Scrapes job board feeds
      Filters to security roles
      Ranks certifications by frequency
    Supported Boards
      Greenhouse
      Lever
      Ashby
      Workday
    Sample Results
      CISSP 19 percent
      GIAC 10 percent
      Security+ 7.7 percent
    Setup
      One install script
      certscout binary
      Linux two commands
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Find out which cybersecurity certifications employers demand most so you can prioritize which one to study for next.

USE CASE 2

Run a fresh scan of current job board postings to see if certification demand has shifted since the last documented run.

USE CASE 3

Use the ranked output as market research data when advising students or employees on career development in cybersecurity.

Tech stack

ElixirErlang

Getting it running

Difficulty · easy Time to first run · 30min

Run one install script on a fresh Linux machine, it downloads Erlang, Elixir, and dependencies automatically.

No license information was mentioned in the explanation.

In plain English

This tool scans public job posting feeds from hiring platforms like Greenhouse, Lever, Ashby, and Workday, filters out non-security roles, and tallies which cybersecurity certifications employers mention most often in their listings. The output is a ranked list showing how frequently each credential appears across thousands of postings. It is built in Elixir, a programming language that runs on the Erlang virtual machine. The most recent run the README documents scanned 278,145 job postings from 1,195 employers and isolated 11,511 as genuine cybersecurity roles. Of those, CISSP appeared in 19.1% of postings, making it the most requested certification. GIAC came second at 10.4%, followed by CISM at 8.3%, CompTIA Security+ at 7.7%, and OSCP at 6.3%. The list continues through twelve certifications in total, down to CompTIA CASP+ at 2.3%. The README itself functions as a showcase of one particular run's results rather than as a code walkthrough. The data is presented as a table and as text-based bar charts for each certification. A separate file, README2.md, contains the actual setup instructions for running the scraper yourself. Installation on a fresh Linux machine is described as two commands: run an install script that downloads Erlang, Elixir, and the project's dependencies, then run the certscout binary with a flag specifying which job board to target. The installer builds everything needed, so you do not need to have Elixir installed beforehand. This would be useful to anyone trying to figure out which certifications to pursue for a cybersecurity career, or to anyone doing market research on what credentials employers in the security field currently value. The data is tied to a specific scan date, so the numbers reflect one point in time rather than a continuously updated feed.

Copy-paste prompts

Prompt 1
I want to know which cybersecurity certs employers value most right now. Walk me through installing and running exs-cyberjob-scraper on Linux to scan Greenhouse and Lever job boards.
Prompt 2
Show me how to point the certscout binary at Workday job feeds and explain what the output table means.
Prompt 3
I want to extend exs-cyberjob-scraper to track additional certifications beyond the default list. Where in the Elixir code would I add new credential patterns to match against?
Open on GitHub → Explain another repo

← carterperez-dev on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.