Method & limitations
Where the data comes from
Roles are pulled directly from each organization's hiring system: public applicant-tracking
feeds (Greenhouse, Lever, Ashby) where available, and a hand-maintained supplement for orgs whose
boards have no machine-readable feed (Airtable / Notion / custom). Every role is tagged with its
collection_method (api or
manual) so the two are always separable. The current snapshot
(2026-06-17) holds 69 AIS-native roles, all collected via API; the manual supplement is being added next.
How roles are classified
Two transparent, rule-based signals, both auditable in classify.py:
- Title seniority — explicit markers split titles into entry (intern, fellow, junior, new-grad, graduate, apprentice…), senior (senior, staff, principal, lead, head, director, chief, manager, founding…), or unspecified. An unmarked "Research Engineer" stays unspecified rather than being guessed into a level.
- Minimum years — the smallest credibly-stated "N+ years" figure in the description. The parser anchors on structured "Required experience" fields and year-mentions sitting next to the word "experience", and explicitly rejects non-experience uses of "years" (visa residency, post-employment bans, "in the next N years", etc.), each of which was producing false positives in an earlier pass. 11 of 69 roles name an explicit figure; the rest state requirements qualitatively, so this is structurally sparse and serves as a cross-check, with title seniority remaining the primary signal.
A role counts as entry-accessible if its title is entry-marked or its stated minimum is ≤2 years.
Known limitations
- Postings ≠ need. This measures publicly advertised openings. Senior and network-based hiring often never gets posted, so the data undercounts senior demand and must not be read as "where the field needs people most."
- Partial coverage. Only orgs with a reachable feed are included; the roster lists every org considered and its status.
- Title heuristics are coarse. Counting "Manager"/"Lead" as senior is defensible but imperfect; the rules are published so you can disagree precisely.
- Single snapshot. This is one point in time, not yet a trend.