§ Research · paper

Public funding and researcher careers: funder portfolios, career-stage composition, and why the funded-vs-unfunded productivity gap is mostly selection, not effect

Bucket Foundation · research-atlas working groupBucket Foundationpreprint · 1.0 (preprint draft)2026-06-24CC-BY-4.0

Corpus: research-atlas v0.5.0 — 1,740,326 grant-PI edges; a grant_pi_person bridge resolving 528,570 (30.4%) to 59,180 canonical researchers (concept DOI 10.5281/zenodo.20774322)

read the PDF ↗DOI ↗code + data on github ↗

Abstract

Papers 01–03 in this series studied the funding graph without ever touching the people side: a grant's principal investigator (PI) was a name-only node with no ORCID, so funding could not be joined to careers at all. A new conservative resolver closes that gap, producing a grant_pi_person bridge that links 528,570 of the 1,740,326 PI edges (30.4%) to 59,180 distinct canonical researchers (490,839 edges carry an ORCID).

This paper asks how public funding maps onto researcher careers — and its central methodological move is to refuse a tempting false claim. The resolver succeeds precisely on ORCID-era, OpenAlex-indexed, more-productive researchers: the resolved "funded" population is 89.9% ORCID'd vs 69.1% for the canonical researcher pool it is drawn from (a +20.8pp gap), with 2.12× the mean publications. So a naive "funded researchers publish 2× more" is confounded by resolution/selection bias, not a funding effect, and we state that up front and quantify it.

We then make three descriptive claims that survive the bias. (1) Who each funder funds: Sloan funds the most eminence-skewed portfolio (9.5% eminent, median 451.5 citations), DFG and Wellcome the most early-career-skewed (DFG 69.0% rising-stars); NIH's grant-holders are 88.0% biomedical, NSF's span the disciplines with no field above 28%. (2) Career-stage composition: across the funded population, 58.8% are rising-stars, 19.2% established, 4.6% eminent; grant-holding is concentrated (median 4 grants/researcher, max 394). (3) Funded vs comparison, done honestly: restricting to the resolvable population and matching on field × career-stage × entry-era, the naive 2.11× publication ratio collapses to a matched residual of 1.23× works (95% CI [1.18, 1.28]), 1.23× h-index, and only 1.06× citations — roughly 80% of the apparent gap is selection. We make no causal claim; the selection-bias treatment is itself the contribution.

Key findings

The headline is a refusal: the naive 2.1× funded-vs-unfunded publication gap is mostly selection — matching on field × career-stage × entry-era collapses it to 1.23× works [1.18, 1.28], 1.23× h-index, and just 1.06× citations.
The resolution bias is measured, not assumed: funded PIs are 89.9% ORCID'd vs 69.1% for the pool (+20.8pp) with 2.12× the mean publications — the confound, not a finding.
Within-funder portfolios differ in kind: Sloan is the most eminence-skewed (9.5% eminent), DFG/Wellcome the most rising-star-skewed (DFG 69.0%); NIH grant-holders are 88.0% biomedical, NSF's top field is only 28%.
The matched residual is largest for rising-stars (1.375× works) and smallest for established researchers (1.156×) — reported as descriptive structure, explicitly not a causal return to funding.
No dollar column anywhere; the grant_pi_person bridge carries no PII; every constant is pinned by a seeded test (tests/test_funding_careers.py).

Figures

Selection bias vs matched residual — Figure 1. (A) The selection, not an effect: resolved/funded PIs are far more ORCID'd and more productive than the canonical researcher pool — the confound, not a finding. (B) Restricting to the resolvable population and matching on field × career-stage × entry-era collapses the naive 2.1× publication gap to ~1.2× (works, h-index) and ~1.06× (citations); error bars are 2,000-sample bootstrap 95% CIs.

Funder career-stage portfolios — Figure 2. Who each funder funds, by career stage: the career-stage composition of each funder's distinct resolved grant-holders. Sloan is the most eminence-skewed; DFG and Wellcome the most rising-star-skewed. Within-funder structure — the resolution bias inflates every funder alike.

Funder eminence vs productivity — Figure 3. Funder researcher-portfolios differ on eminence (% of grant-holders who are high-impact) versus median grant-holder citations; marker area ∝ √(number of grant-holders). Sloan occupies the high-eminence/high-impact corner, DFG the low/low corner, NIH the high-volume centroid.

Matched residual by career stage — Figure 4. Matched residual works-gap by career stage (same field × entry-era matching within each seniority). The residual is largest for rising-stars (1.38×) and smallest for established researchers (1.16×); the dashed line is parity. Descriptive, not causal.

Cite this paper

DOI: 10.5281/zenodo.20836727

@misc{bucket2026fundingcareers,
  title        = {Public funding and researcher careers: funder portfolios,
                  career-stage composition, and why the funded-vs-unfunded
                  productivity gap is mostly selection, not effect},
  author       = {{Bucket Foundation research-atlas working group}},
  year         = {2026},
  howpublished = {Bucket Foundation preprint},
  doi          = {10.5281/zenodo.20836727},
  url          = {https://doi.org/10.5281/zenodo.20836727},
  note         = {research-atlas v0.5.0}
}

← all papers the research-atlas graph open datasets