bucket foundation — inverse omegabucket.foundation
§ Research · paper

The structure of public research funding, 2015–2025: concentration, cross-funder co-funding, and the funding→output relationship in a reconciled NIH/NSF/EC/UKRI grant graph

Gianangelo Dichio · research-atlas working groupBucket Foundationpreprint · 1.0 (preprint draft)2026-06-19CC-BY-4.0

Corpus: research-atlas v0.1.0 — 887,016 grants / 69 funders / 226,785 linked works

Abstract

We assemble a reconciled graph of the global public-research economy — 887,016 grants from the U.S. National Institutes of Health (NIH), the U.S. National Science Foundation (NSF), the European Commission (EC, via CORDIS), and UK Research and Innovation (UKRI), 2015–2025 — in which recipient organizations are merged on ROR identifiers, investigators on ORCID, and 156,877 research outputs are linked to the grants that funded them through 285,604 OpenAlex acknowledgement edges.

Working only from statistics that are robust to known entity-resolution noise (grant and work counts, country- and funder-level aggregates, the ROR-resolved subset), we characterize three structural features of the funding system. (1) Concentration. Across 4,840 ROR-resolved recipient institutions the distribution of grants is extreme: Gini = 0.929 (95% bootstrap CI [0.919, 0.936]); the top 1% of institutions hold 53.6% of all grants and the top 10% hold 92.0%. (2) Cross-funder structure. Of 156,877 funded works, 26.3% acknowledge two or more distinct funders; the dominant co-funding pair is intra-European (ERC↔EC, 17,502 shared works), with EC↔NSF (3,430) and NIGMS↔NSF (2,944) the leading trans-Atlantic and cross-agency ties. (3) The funding→output relationship. Restricting to the three funders with output linkage (NIH, NSF, EC), the count-based productivity rate ranges from 0.94 linked works per $1M (NIBIB) to 0.05 (NCATS), with NSF at 0.72 and the ERC at 0.67; these differences track each funder's mission (basic-science vs. translational/infrastructure) rather than efficiency.

We state the corpus and entity-resolution limitations plainly — most importantly that the country distribution reflects the NIH-heavy composition of the corpus, not a measurement of global funding — and we release all code, data references, and a Zenodo-ready metadata record.

Key findings

Figures

Lorenz curve of grant counts across recipient institutions
Figure 1. Lorenz curve of grant counts across 4,840 ROR-resolved recipient institutions, 2015–2025. The observed curve departs maximally from the equality line; Gini = 0.929.
Recipient grants by organization country
Figure 2. Recipient grants by organization country (thousands), top 12, 2015–2025. The U.S. bar dwarfs the EU tail; this reflects the NIH-heavy composition of the corpus, not global funding.
Cross-funder co-funding heatmap
Figure 3. Cross-funder co-funding: cell (i,j) is the number of works acknowledging both funder i and funder j. Two communities (US agencies, European funders) joined by NSF↔EC/ERC bridges.
Linked works per $1M awarded by funder
Figure 4. Linked works per $1M awarded, by funder (NIH/NSF/EC only). Basic-science funders cluster high; translational/infrastructure funders cluster low.
Fastest-rising and fastest-declining research topics
Figure 5. Fastest-rising and fastest-declining topics by funded-output growth ratio (2021–24 / 2016–19). COVID-19 and AI rise; virology niches and classical signaling fall.

Cite this paper

DOI: 10.5281/zenodo.20774322

@misc{dichio2026funding,
  title        = {The structure of public research funding, 2015--2025:
                  concentration, cross-funder co-funding, and the
                  funding-to-output relationship in a reconciled
                  NIH/NSF/EC/UKRI grant graph},
  author       = {Dichio, Gianangelo},
  year         = {2026},
  howpublished = {Bucket Foundation preprint},
  doi          = {10.5281/zenodo.20774322},
  url          = {https://doi.org/10.5281/zenodo.20774322},
  note         = {research-atlas v0.1.0}
}