Data Source Catalog

Plug into BLS, O*NET, Census, NAICS — already AI-callable through the toolbox.

The canonical public People Analytics data sources, wrapped with consistent contracts and exposed through the same MCP gateway every toolbox spoke uses. Build once at the toolbox; every consumer gets the same shape. No more each PA team rewriting the same BLS / Census / O*NET adapter.

26 sources · 6 already AI-callable through a live MCP tool today

Why this catalog exists

The same public sources, wrapped once.

Every People Analytics team writes the same adapter, ingest, and crosswalk. The data is public; the wrapping is duplicated work across the industry. The toolbox catalogs these sources once, exposes them through consistent contracts, and binds them to tools so AI agents can call them by name — no bespoke integration per consumer.

The catalog tracks provider, license, freshness cadence, and integration status per source. The integration status is the honest one: connected means a live MCP tool surfaces the data today; partial means part of the source is live via an existing ; planned means a connector card is filed but not built; blocked means licensing or contractual blockers; and not started is the rest of the roadmap.

The catalog

26 sources, grouped by what they describe.

Labor-market churn, wages, taxonomies, geography, demographics, generational and socio-economic cohorts, regulatory frames. Each entry names the provider, the freshness cadence, the license posture, and the connector spoke (when one exists or is planned).

4 sources

Labor-market dynamics

BLS monthly + quarterly series — the canonical reference for churn, employment, and comp-relevant inflation in the U.S.

BLS Job Openings and Labor Turnover Survey

BLS JOLTS · Government

Connected

Monthly federal survey of job openings, hires, quits, layoffs, and other separations by industry (NAICS) and Census region. The canonical reference for U.S. labor-market churn dynamics; the basis for every voluntary-attrition benchmark.

Freshness
Monthly
License
Public domain
Connector spoke
bls (lib)
Homepage
Open ↗
What this source provides
  • quits rate by NAICS industry
  • hires rate by NAICS industry
  • job-openings rate by NAICS industry
  • layoffs + discharges rate by NAICS industry
  • total separations rate by industry × region

Access: BLS public API via src/lib/connectors/bls/ (PAT-46); helpers expose live national-total JOLTS rates + levels. anycomp.labor_metrics retains the cached SOC × MSA grid (PAT-18-FU-A).

Notes: PAT-46 lib connector at src/lib/connectors/bls/. MCP tools toolbox.bls.jolts.{quits-rate,hires-rate,openings}. Spoke promotion deferred to PAT-46-FU-A.

U.S. Bureau of Labor Statistics, Job Openings and Labor Turnover Survey (JOLTS).

BLS Current Employment Statistics

BLS CES · Government

Connected

Monthly federal payroll survey of employment, hours, and earnings by industry. The reference for total nonfarm employment and industry-level employment trends used in labor-market context calculations.

Freshness
Monthly
License
Public domain
Connector spoke
bls (lib)
Homepage
Open ↗
What this source provides
  • total employment by NAICS industry
  • average weekly hours by industry
  • average hourly earnings by industry
  • national + state-level employment trends

Access: BLS public API via src/lib/connectors/bls/ (PAT-46); helpers expose total payroll employment for an industry code. anycomp.labor_metrics retains cached industry-level employment for labor-rate denominators (PAT-18-FU-A).

Notes: PAT-46 lib connector at src/lib/connectors/bls/. MCP tool toolbox.bls.ces.employment. CES is the denominator companion to JOLTS rates.

U.S. Bureau of Labor Statistics, Current Employment Statistics (CES).

BLS Employment Cost Index

BLS ECI · Government

Connected

Quarterly federal index of changes in labor costs (wages + salaries + benefits). The Federal Reserve's preferred comp-inflation gauge; used to scale historical comp data forward.

Freshness
Quarterly
License
Public domain
Connector spoke
bls (lib)
Homepage
Open ↗
What this source provides
  • wages + salaries index, all civilian workers
  • total compensation index by occupation group
  • total compensation index by industry group
  • private vs public sector comp trends

Access: BLS public API via src/lib/connectors/bls/ (PAT-46); helper returns most-recent N quarters of the civilian wages-and-salaries series.

Notes: PAT-46 lib connector at src/lib/connectors/bls/. MCP tool toolbox.bls.eci.trend. ECI is the right inflation deflator for multi-year compensation analyses; CPI over-states comp-relevant inflation.

U.S. Bureau of Labor Statistics, Employment Cost Index (ECI).

BLS Consumer Price Index

BLS CPI · Government

Connected

Monthly federal measure of average price changes for a basket of consumer goods + services. The default inflation reference; useful for cost-of-living adjustment context but inferior to ECI for comp-equity work.

Freshness
Monthly
License
Public domain
Connector spoke
bls (lib)
Homepage
Open ↗
What this source provides
  • CPI-U (all urban consumers)
  • CPI-W (urban wage earners + clerical workers)
  • core CPI (excluding food + energy)
  • regional CPI (Northeast / Midwest / South / West)

Access: BLS public API via src/lib/connectors/bls/ (PAT-46); helper returns latest observation for any CPI series ID, defaulting to CPI-U All Items.

Notes: PAT-46 lib connector at src/lib/connectors/bls/. MCP tool toolbox.bls.cpi.lookup. Pair with ECI when reporting comp trends — CPI for cost-of-living context, ECI for comp-inflation context.

U.S. Bureau of Labor Statistics, Consumer Price Index (CPI).

3 sources

Compensation benchmarks

Wage and pay-survey references. Public-domain OEWS at the SOC × MSA grain; commercial Mercer + Radford for industry-specific premia.

BLS Occupational Employment and Wage Statistics

BLS OEWS · Government

Connected

Annual federal survey of wages and employment by SOC occupation × MSA. The authoritative U.S. wage reference at the (occupation × geography) grain; the public-domain comp-benchmark substrate.

Freshness
Annual
License
Public domain
Connector spoke
bls (lib)
Homepage
Open ↗
What this source provides
  • mean wage by SOC × MSA
  • median wage by SOC × MSA
  • p10 / p25 / p75 / p90 wage by SOC × MSA
  • employment count by SOC × MSA
  • wage by SOC × industry (national)

Access: BLS public API via src/lib/connectors/bls/ (PAT-46); helpers expose median annual wage for (SOC × MSA) cells via buildOewsSeriesId.

Notes: PAT-46 lib connector at src/lib/connectors/bls/. MCP tool toolbox.bls.oews.lookup. Median only in v0; percentile bands (p25/p75/p90) deferred to PAT-46-FU-A.

U.S. Bureau of Labor Statistics, Occupational Employment and Wage Statistics (OEWS).

Mercer Compensation Surveys

Mercer Comp · Commercial

Not started

Mercer's annual compensation survey portfolio: Total Remuneration Surveys (TRS), MERCER PayMonitor, and industry-specific surveys. Commercial subscription; the de facto benchmark for general-industry compensation in mid-to-large enterprises.

Freshness
Annual
License
Commercial — paid
Connector spoke
Homepage
Open ↗
What this source provides
  • base salary by job × level × industry × geography
  • total cash compensation by job × level × industry × geography
  • long-term incentive prevalence + value by job × level
  • geographic differentials + cost-of-labor indices
  • industry × function comp comparison tables

Access: Mercer customer portal (subscription required). Toolbox does not own a license.

Notes: Customers bring their own Mercer license. Future connector pattern: anycomp ingests a customer-supplied Mercer extract into anycomp.comp_models band midpoints, keeping the licensed data inside the customer tenancy.

Mercer LLC, Total Remuneration Survey (annual).

Radford Global Technology Survey

Radford Global Tech · Commercial

Not started

Radford's tech-industry compensation + equity survey portfolio (now an Aon subsidiary). The reference for tech compensation benchmarking; widely used in Silicon Valley + global tech enterprises.

Freshness
Annual
License
Commercial — paid
Connector spoke
Homepage
Open ↗
What this source provides
  • base salary by tech-job-family × level × geography
  • total cash + equity by tech-job-family × level × geography
  • new-hire equity grant ranges by job × level
  • refresh equity grant ranges by job × level + performance tier
  • tech-industry hiring + attrition pulse data

Access: Aon | Radford customer portal (subscription required). Toolbox does not own a license.

Notes: Same connector shape as Mercer: customers bring their own Radford license; anycomp ingests tenant-scoped extracts. The equity-grant surfaces are the most differentiated content — base salary overlaps materially with public-domain OEWS at the SOC level.

Aon | Radford, Global Technology Compensation + Equity Survey (annual).

2 sources

Job classification

Occupation taxonomies and job-content profiles. O*NET maps occupational content; meta-factory layers a maintained family / function / level taxonomy on top.

O*NET — Occupational Information Network

O*NET · Academic

Planned

U.S. Department of Labor's occupational database: 1,016 SOC-mapped occupations with skills, tasks, knowledge, abilities, work context, and work activities. The canonical reference for job-content taxonomy; the substrate for skills inference + role-similarity work.

Freshness
Annual
License
Public domain
Connector spoke
job-family-agent
Homepage
Open ↗
What this source provides
  • 1,016 SOC occupations with full content profiles
  • skill ratings (importance + level) per occupation
  • task statements per occupation
  • knowledge + ability + work-context ratings
  • occupation-to-occupation similarity / career-ladder data

Access: Bulk database downloads (O*NET Resource Center) + free-tier web services. Canonical toolbox home is meta-factory-prod's job-family-agent.

Notes: Canonical home is meta-factory-prod (separate repo). PAT-47 builds the toolbox-side job-family-agent connector that proxies meta-factory's O*NET-mapped taxonomy through MCP. The toolbox does not own the O*NET ingest pipeline.

U.S. Department of Labor, O*NET Database v28.3.

meta-factory job-family-agent

meta-factory job-family-agent · Internal

Planned

Mike's canonical job-family + function + level taxonomy with O*NET cross-references, AI-driven spec authoring, and a maintained role corpus. The toolbox proxies meta-factory's contract; meta-factory-prod owns the algorithms + corpus.

Freshness
Ad-hoc
License
Internal
Connector spoke
job-family-agent
Homepage
What this source provides
  • universal job-family taxonomy
  • function × level matrix with role specs
  • O*NET SOC ↔ job-family crosswalk
  • AI-authored role-spec drafting service
  • corpus of normalized role descriptions

Access: meta-factory-prod HTTP + MCP gateway. Toolbox connector at src/spokes/job-family-agent/ is reserved namespace only today.

Notes: Canonical home is meta-factory-prod, not this toolbox. PAT-47 builds the toolbox-side connector that wraps meta-factory's HTTP/MCP contract. Performix and other consumers vendor meta-factory's contract directly; the toolbox is one of several proxies, not the canonical surface.

Mike West, meta-factory job-family-agent (internal, ~/Vibe Coding Projects/meta-factory-prod/).

2 sources

Industry classification

Industry taxonomies for U.S. and global work. NAICS is the federal-statistics default; GICS is the equity-research counterpart.

North American Industry Classification System

NAICS · Government

Partial

Standard industry classification system used by federal statistical agencies (Census, BLS, BEA). 5-level hierarchy from 2-digit sector to 6-digit national industry. The default industry taxonomy for U.S. labor-market + benchmark work.

Already AI-callable

Freshness
Stable
License
Public domain
Connector spoke
segmentation-studio
Homepage
Open ↗
What this source provides
  • NAICS 2017 / 2022 hierarchy (all 5 levels)
  • industry titles + descriptions
  • NAICS ↔ ISIC crosswalk
  • historical NAICS revision crosswalks (2002 / 2007 / 2012 / 2017 / 2022)

Access: Census Bureau publication (PDF + CSV). segmentation-studio's canonical-segments catalog (PAT-41) references NAICS sectors at the industry-segment level.

Notes: Revised every 5 years (2017 → 2022 → 2027). Most analytical work today is on the 2017 hierarchy with 2022 forward-mapping. Future connector work pairs NAICS with GICS for cross-mapping commercial benchmarks.

U.S. Census Bureau, North American Industry Classification System (NAICS) Manual.

Global Industry Classification Standard

GICS · Commercial

Blocked

Industry classification standard developed by MSCI + S&P Global, widely used in equity research and global benchmarking. 4-level hierarchy (sector → industry group → industry → sub-industry). Commercial license required for redistribution.

Freshness
Ad-hoc
License
Commercial — licensed
Connector spoke
Homepage
Open ↗
What this source provides
  • GICS 4-level hierarchy
  • sector + industry group + industry + sub-industry definitions
  • GICS ↔ NAICS / ICB / Bloomberg-BICS crosswalks (vendor-specific)
  • company-to-GICS classifications (via vendor data feeds)

Access: MSCI + S&P Global commercial license. Toolbox integration blocked on licensing.

Notes: Commonly requested for benchmarking commercial-equity-comp data (where comp surveys index by GICS). Blocked on licensing — Mike's path is to wait until either an open-data crosswalk exists or a customer brings their own GICS license.

MSCI and S&P Global, Global Industry Classification Standard (GICS).

1 source

Geography

ZIP / metro / state crosswalks. The foundation for translating HRIS employee location to a comp-benchmark or labor-market segment.

U.S. Census ZIP-to-CBSA Crosswalk

Census ZIP↔CBSA · Government

Connected

Mapping from 5-digit ZIP codes to Core-Based Statistical Areas (CBSAs) and Metropolitan Statistical Areas (MSAs). The substrate for geo-rollup in HRIS pipelines — employee ZIP → metro for comp banding + labor-market join.

Freshness
Annual
License
Public domain
Connector spoke
census (lib)
Homepage
Open ↗
What this source provides
  • ZIP code → CBSA mapping
  • ZIP code → MSA mapping
  • CBSA → state crosswalk
  • metro vs micro CBSA flagging

Access: Curated v0 snapshot bundled at src/lib/connectors/census/data/zcta-to-cbsa.json (PAT-48); covers 57 most-populated MSAs as ZIP ranges. Refresh path documented in census/README.md.

Notes: PAT-48 lib connector at src/lib/connectors/census/. MCP tools toolbox.census.{zip.lookup,cbsa.lookup}. Full ~33K ZCTA coverage deferred to PAT-48-FU-A; pairs with BLS OEWS to translate employee ZIPs to MSA-level wage benchmarks.

U.S. Department of Housing and Urban Development, HUD-USPS ZIP Code Crosswalk Files (Census basis).

1 source

Demographics

Population-level demographic, economic, and housing data — the right baseline for representation-ratio benchmarks.

U.S. Census American Community Survey

Census ACS · Government

Connected

Continuous nationwide demographic + economic + housing survey. 1-year and 5-year estimates at multiple geographic levels (national / state / county / tract / block group). The reference for benchmark-population demographic context.

Freshness
Annual
License
Public domain
Connector spoke
census (lib)
Homepage
Open ↗
What this source provides
  • demographic estimates by age + sex + race + ethnicity
  • educational attainment by geography
  • labor-force participation by geography
  • household income distributions by geography
  • industry + occupation distributions by geography

Access: Census Bureau Data API via src/lib/connectors/census/ (PAT-48); helper fetches ACS 5-year demographics for any CBSA (median income, population, age, labor-force participation, education).

Notes: PAT-48 lib connector at src/lib/connectors/census/. MCP tool toolbox.census.acs.demographics. ACS demographics are the right baseline for representation-ratio benchmarks; CENSUS_API_KEY recommended for production volume.

U.S. Census Bureau, American Community Survey (ACS).

1 source

Generational cohorts

Birth-year cohort definitions used to compare attitudes and outcomes across generations.

Pew Research Generational Cohort Definitions

Pew Generational · Research

Connected

Pew Research Center's published birth-year ranges for the Silent Generation, Baby Boomers, Gen X, Millennials, Gen Z, and Gen Alpha. The de facto standard for generational segmentation in U.S. workforce analytics; widely cited in academic and HR-tech research.

Already AI-callable

Freshness
Ad-hoc
License
Open data
Connector spoke
segmentation-studio
Homepage
Open ↗
What this source provides
  • birth-year range per generational cohort
  • Pew Research methodology notes on cohort boundary choices
  • comparative Strauss-Howe variant boundaries (annotated where they differ)
  • Greatest / Silent / Boomer / Gen X / Millennial / Gen Z / Gen Alpha definitions

Access: Pew Research Center publications + reports (https://www.pewresearch.org/). Toolbox vendored the ranges into segmentation-studio's PAT-55 canonical-segments seed.

Notes: Strauss-Howe Generations Inc. publishes a slightly different boundary set (Boomers 1943-1960; Gen X 1961-1981; Millennials 1982-2004). The Pew ranges are authoritative in the canonical-segments catalog; Strauss-Howe variants are flagged as alternativeStrings on the matching Pew entry. Pew refreshes boundaries periodically (most recent material change was Gen Z 1997 lower bound in 2019). PAT-55 ships 7 canonical entries: Greatest / Silent / Boomer / Gen X / Millennial / Gen Z / Gen Alpha.

Pew Research Center, Generations and Age — Defining Generations: Where Millennials End and Generation Z Begins (2019, updated through 2024).

4 sources

Socio-economic classification

Peer-reviewed sociological scales (Hauser-Warren, Nakao-Treas, ISEI, UK NS-SEC) that classify occupations by socio-economic status — distinct from the work they describe.

Ganzeboom International Socio-Economic Index of Occupations

ISEI · Academic

Partial

Internationally comparable socio-economic-status scale developed by Harry Ganzeboom and Donald Treiman, optimally scaled to mediate between education and income across ISCO occupation codes. The international standard for cross-country SES research; mapped to ISCO-08 (current) and ISCO-88 (legacy).

Already AI-callable

Freshness
Ad-hoc
License
Open data
Connector spoke
segmentation-studio
Homepage
Open ↗
What this source provides
  • ISEI score (16-90 scale) per ISCO-08 4-digit occupation code
  • ISCO-08 to ISCO-88 crosswalk with score migration notes
  • methodology paper + Stata / R reproduction code
  • EGP (Erikson-Goldthorpe-Portocarero) class mapping as a companion categorical scale

Access: Ganzeboom's website (https://www.harryganzeboom.nl/isco08/) + Stata `iskoisei` package. Toolbox vendored 4 ISEI bands as canonical segments under PAT-56.

Notes: ISEI is the international research standard for SES — used in OECD, World Bank, ILO, and academic comparative-mobility studies. PAT-56 ships 4 ISEI bands (low 16-30, lower-mid 30-50, upper-mid 50-70, high 70-90). Computing an ISEI score from an HRIS requires an ISCO-08 occupation classification first; the upstream job-family-agent connector (PAT-47) is the long-term path to that crosswalk.

Ganzeboom, H. B. G., De Graaf, P. M. & Treiman, D. J. (1992). A standard international socio-economic index of occupational status. Social Science Research 21(1), 1-56. Updated for ISCO-08 by Ganzeboom (2010).

UK National Statistics Socio-Economic Classification

UK NS-SEC · Government

Partial

Official socio-economic classification published by the UK Office for National Statistics, rebased on the Standard Occupational Classification (SOC). Categorical 8-class structure (vs the continuous-scale ISEI / Hauser-Warren / Nakao-Treas), capturing employment relations + conditions in addition to occupation.

Already AI-callable

Freshness
Ad-hoc
License
Open data
Connector spoke
segmentation-studio
Homepage
Open ↗
What this source provides
  • NS-SEC 8-class analytical model (Classes 1-8)
  • NS-SEC 5-class collapsed model for smaller samples
  • NS-SEC ↔ SOC 2010 occupation-code crosswalk
  • official user manual + rebasing methodology

Access: ONS website (https://www.ons.gov.uk/methodology/classificationsandstandards/otherclassifications/thenationalstatisticssocioeconomicclassificationnssecrebasedonthesoc2010). Toolbox vendored the 8 analytical classes as canonical segments under PAT-56.

Notes: NS-SEC is the official UK SES classification, used in the UK Census and in academic + policy research on inequality + mobility. PAT-56 ships all 8 analytical classes. Operationally, NS-SEC class derivation needs both SOC occupation + employment-relations data (employee vs employer, supervisory status) — the pure SOC-derived ''simplified'' NS-SEC is the practical fallback when employment-relations fields are absent.

UK Office for National Statistics. (2010). The National Statistics Socio-economic Classification (NS-SEC) — Rebased on the SOC 2010 — User Manual.

Hauser-Warren Socioeconomic Index

Hauser-Warren SEI · Academic

Partial

Updated U.S. socio-economic-status index combining occupational education + occupational income, published by Robert Hauser and John Robert Warren in 1997. Built on the 1990 Census occupational classification; widely used in U.S. sociological research on mobility, inequality, and stratification.

Already AI-callable

Freshness
Stable
License
Open data
Connector spoke
segmentation-studio
Homepage
Open ↗
What this source provides
  • Hauser-Warren SEI score (0-100 scale) per 1990 Census occupation code
  • occupation-prestige sub-index alongside the SEI
  • occupational-education + occupational-income component scores
  • methodology paper with reproduction notes

Access: Sociological Methodology 27(1), 177-298 (1997). Toolbox vendored 4 SEI bands as canonical segments under PAT-56.

Notes: Hauser-Warren is the U.S.-specific update to the Duncan SEI tradition; it predates Nakao-Treas (1994) but uses 1990 Census occupation codes rather than the 1989 GSS prestige ratings Nakao-Treas relied on. The two are not interchangeable; researchers usually pick one and cite consistently. PAT-56 ships 4 bands (low 0-25, lower-mid 25-50, upper-mid 50-75, high 75-100).

Hauser, R. M. & Warren, J. R. (1997). Socioeconomic indexes for occupations: A review, update, and critique. Sociological Methodology 27(1), 177-298.

Nakao-Treas Socioeconomic Index

Nakao-Treas SEI · Academic

Partial

U.S. socio-economic-status index derived from NORC General Social Survey (GSS) occupational prestige ratings, published by Keiko Nakao and Judith Treas in 1994. Companion to the Hauser-Warren SEI; uses 1989 GSS prestige scores combined with 1980 Census occupational education + income to produce both prestige and SEI scales.

Already AI-callable

Freshness
Stable
License
Open data
Connector spoke
segmentation-studio
Homepage
Open ↗
What this source provides
  • Nakao-Treas SEI score (0-100 scale) per 1980 Census occupation code
  • GSS-derived occupational prestige score per occupation
  • occupational-education + occupational-income component scores
  • methodology paper with reproduction notes

Access: Sociological Methodology 24, 1-72 (1994). NORC GSS prestige data (https://gss.norc.org/). Toolbox vendored 4 SEI bands as canonical segments under PAT-56.

Notes: Nakao-Treas updates the older Stevens-Featherman and Stevens-Cho SEI scales using 1989 GSS prestige ratings — the prestige series the NORC GSS has maintained since the 1960s. Most U.S. mobility research from the 1990s-2000s uses either Hauser-Warren or Nakao-Treas; researchers should pick one and cite consistently. PAT-56 ships 4 bands (low 0-25, lower-mid 25-50, upper-mid 50-75, high 75-100).

Nakao, K. & Treas, J. (1994). Updating occupational prestige and socioeconomic scores: How the new measures measure up. Sociological Methodology 24, 1-72.

8 sources

Regulatory standards

Compliance frames and reporting taxonomies. The shape required for U.S. and EU pay-equity, workforce-demographic, and disclosure obligations.

EEOC EEO-1 Component 1 Report

EEO-1 · Government

Not started

U.S. EEOC mandatory workforce demographic report for private employers with 100+ employees. Defines the canonical EEOC race/ethnicity + sex categories + 10 EEO-1 job groups. Required filing surface for compliance reporting.

Freshness
Annual
License
Public domain
Connector spoke
Homepage
Open ↗
What this source provides
  • EEO-1 job-group taxonomy (10 categories)
  • EEOC race/ethnicity category definitions
  • EEOC sex category definitions
  • EEO-1 filing data dictionary + technical spec

Access: EEOC published spec + filing portal. Not yet wired into a toolbox spoke.

Notes: Pairs with the Diversity Composition analyses (PAT-42) — the EEO-1 categories are the canonical reporting taxonomy for workforce-demographic dashboards. Future connector work covers filing-format generation, not analytical surfaces.

U.S. Equal Employment Opportunity Commission, EEO-1 Component 1 Survey.

EU Pay Transparency Directive

EU Pay Transparency · International

Not started

EU Directive 2023/970 on pay transparency and equal-pay reporting. Mandates gender-pay-gap disclosure thresholds, methodology for controlled gap calculations, and remediation triggers. The European compliance counterpart to U.S. EEO-1 + state pay-equity laws.

Freshness
Ad-hoc
License
Open data
Connector spoke
Homepage
Open ↗
What this source provides
  • directive text + transposition deadlines per member state
  • reporting thresholds (employer size × pay-gap percentage)
  • controlled-pay-gap methodology guidance
  • remediation-trigger thresholds (5% unexplained gap)

Access: EUR-Lex (directive text) + per-member-state implementing legislation. Not yet wired into a toolbox spoke.

Notes: Member state transposition is in flight through June 2026. The 'pay-equity-controlled-comparison' analysis (PAT-42) already implements the directive's methodology shape; the data-source entry is for the regulatory frame itself.

European Parliament + Council, Directive (EU) 2023/970 (Pay Transparency Directive).

U.S. Department of Labor — Fair Labor Standards Act

DOL FLSA · Government

Partial

The federal statutory framework for U.S. minimum wage + overtime + child-labor + recordkeeping requirements. Authoritative source for the federal wage floor and the floor every state ordinance compares against.

Freshness
Ad-hoc
License
Public domain
Connector spoke
wage-compliance
Homepage
Open ↗
What this source provides
  • federal minimum hourly wage
  • federal overtime threshold + exempt-employee tests
  • tipped-employee federal floor + tip-credit rules
  • minor / student-worker federal wage rules
  • industry-specific FLSA exemptions

Access: DOL Wage and Hour Division publications + statutory text (29 USC §§ 201-219). Toolbox PAT-85 (DOL FLSA connector) will codify automated refresh.

Notes: PAT-80 seeds the federal floor + overtime threshold into wage_compliance.rule_versions directly. PAT-85 wires automated state-labor-site scraping for state baselines. Citation chain lands in wage_compliance.source_citations.

U.S. Department of Labor, Wage and Hour Division — Fair Labor Standards Act.

State Labor Department Websites (Aggregate)

State Labor Sites · Government

Partial

Meta-entry covering each U.S. state's Department of Labor (or equivalent) authoritative wage-rule page. Each state publishes its own minimum-wage announcements, posters, and FAQ — content varies in shape but is the legally-binding source for state-level wage rules.

Freshness
Ad-hoc
License
Public domain
Connector spoke
wage-compliance
Homepage
What this source provides
  • state minimum hourly wage (general / tipped / minor / industry-specific)
  • state effective dates + announced future increases
  • state-mandated paid-leave + sick-leave wage rules
  • state-mandated overtime thresholds (CA daily-OT, AK 8-hour-day, NV)
  • state required-posting + notice rules

Access: Per-state public webpages (50 + DC). Toolbox PAT-85 (state-labor scraper) maintains the per-state roster + handles HTML/PDF variance via PAT-86 AI ordinance extractor fallback.

Notes: Aggregate entry because each state is a separate scraper target, but the catalog needs a single conceptual citation. The connector resolves per-state URLs from a configured roster maintained alongside PAT-85.

Individual state Departments of Labor (50 states + District of Columbia).

UC Berkeley Labor Center — Inventory of US City and County Minimum Wage Ordinances

UC Berkeley Labor Center · Academic

Connected

Curated public roster of every known U.S. city + county minimum-wage ordinance, maintained by the UC Berkeley Center for Labor Research and Education. The gold-standard third-party inventory — covers ~150 active local ordinances with effective-date history.

Freshness
Quarterly
License
Creative Commons
Connector spoke
wage-compliance
Homepage
Open ↗
What this source provides
  • city + county wage ordinances with effective dates
  • tipped-worker tier-out provisions by jurisdiction
  • small-employer vs large-employer thresholds
  • scheduled future-effective increases (multi-year)
  • historical wage-rule changes by jurisdiction

Access: Public webpage with downloadable PDF/Excel inventory. PAT-80 ingests the roster as the seed for ~150 local-ordinance jurisdictions; PAT-86 AI extractor parses individual ordinance PDFs where Berkeley's summary needs validation.

Notes: The single highest-leverage public source for comprehensive US local-ordinance coverage. Manual review of each entry before promoting from validation_status='pending' → 'validated' is the human-in-the-loop discipline.

UC Berkeley Center for Labor Research and Education — Inventory of US City and County Minimum Wage Ordinances.

National Conference of State Legislatures — State Minimum Wages

NCSL · Academic

Partial

Cross-state inventory of enacted + pending minimum-wage legislation maintained by NCSL (nonpartisan state-government NGO). The reference for tracking state-level wage-law changes before they take effect — feeds the upcoming-increase alert path (PAT-87).

Freshness
Quarterly
License
Open data
Connector spoke
wage-compliance
Homepage
Open ↗
What this source provides
  • state minimum wage by year (current + announced future)
  • pending state minimum-wage legislation status
  • ballot-initiative tracking by state
  • indexation / inflation-tied state laws
  • state preemption laws (state blocks local ordinances)

Access: Public webpages + downloadable tables. PAT-85 connector polls quarterly; PAT-87 temporal diff agent uses NCSL pending-legislation feed for 90/30/7-day alert windows.

Notes: NCSL's pending-legislation tracker is what makes wage-compliance.alerts upcoming-increase alerts non-aspirational — there is a public, regularly-updated source of truth for what's coming, not just what's in effect.

National Conference of State Legislatures — State Minimum Wages.

Symmetry Software — Payroll Tax Engine

Symmetry · Commercial

Not started

Commercial payroll-tax engine covering U.S. federal + state + local jurisdictional pay rules. Optional customer-bring-your-own provider — when a tenant licenses Symmetry, wage-compliance can blend Symmetry's rule data with toolbox-cultivated public sources for cross-validation.

Freshness
Monthly
License
Commercial — paid
Connector spoke
Homepage
Open ↗
What this source provides
  • federal + state + local minimum-wage data
  • payroll-tax rates by jurisdiction × employee type
  • Pennsylvania local-services-tax districts
  • Ohio RITA + CCA tax districts
  • garnishment + wage-attachment rules

Access: Symmetry API (customer subscription required). Toolbox provider-abstraction layer lets customers connect their license without coupling to a single vendor.

Notes: Optional vendor — customers bring their own license. The provider-abstraction layer ensures wage-compliance never depends on a single commercial source; Symmetry blends with DOL + state-labor + Berkeley + NCSL via PAT-104 conflict-detection agent when sources disagree.

Symmetry Software, Symmetry Tax Engine + Minimum Wage Compliance.

Toolbox AI Ordinance Extractor (PAT-86)

AI Ordinance Extractor · Internal

Planned

Toolbox-internal AI connector that extracts structured wage rules from public ordinance PDFs + HTML pages. Citation-required + confidence-scored + human-review-gated. Operates on PUBLIC ordinances only — tenant company data never reaches the model.

Freshness
Real-time
License
Internal
Connector spoke
wage-compliance
Homepage
What this source provides
  • structured rule_version payload from ordinance PDF
  • source citation with page/paragraph references
  • multi-factor confidence score per extraction
  • diff detection across versions of the same ordinance
  • language-agnostic extraction (Claude handles foreign-language ordinances)

Access: MCP tool `wage-compliance.ai-extract.ordinance` (admin-scope only). Served via Vercel AI Gateway (`AI_GATEWAY_API_KEY`; ANTHROPIC_API_KEY fallback) through the shared `src/lib/ai/` module (PAT-86-FU-A). Output lands in wage_compliance.rule_versions with validation_status='pending' awaiting human review.

Notes: Case study #1 for the toolbox's AI-in-HR posture (see /ai-in-hr). AI runs on EXTERNAL data only — public ordinances. Tenant rosters / compensation / evaluation results never enter a prompt. Compliance evaluation itself is deterministic Postgres lookup against canonical structures the AI helped build.

People Analytics Toolbox — AI Ordinance Extractor (PAT-86).

One adapter at the toolbox. Every consumer gets it.

The public sources are the same across every PA team. The toolbox does the wrapping once and exposes it as typed contracts plus MCP tools. Bring your consumer; vendor the contract; skip the year of integration glue.