Coverage

Coverage statistics across all dimensions — what percentage of plants have each data type.

Coverage varies by dimension because each depends on different upstream sources. This page shows exactly how many plants and projects have data in each dimension, where the gaps are, and why.

Coverage summary

| Dimension | Coverage | Of total | Primary source |
| --- | --- | --- | --- |
| Base (plants) | 15,528 plants | 100% | EIA-860M + Annual |
| Base (generators) | 29,738 generators | 100% | EIA-860M |
| Ownership (GEM) | 7,818 plants | 26.8% | GEM Trackers |
| Generation | 13,534 plants | 89.9% | EIA Generation API |
| Financial | 3,246 plants | 21.6% | FERC + LBNL + EQR |
| Pricing | 20,618 crosswalk rows | ~74% of ISO generators | EIA + OSM + curated nodes |
| Context (Wikipedia) | 732 plants | 4.9% | Wikidata + Wikipedia |
| News | 8,023 plants + 909 projects | 53% of plants | Google News (Serper) |
| Interconnection | 9,783 active projects | | LBNL + 7 ISO feeds |
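As an illustration of how an "Of total" share can be derived, the sketch below assumes coverage is the number of registry plants with at least one record in a dimension, divided by the number of plants in the base registry. The function name, the plant IDs, and the dimension contents are hypothetical toy values, not platform data or a platform API.

```python
def coverage_pct(covered_ids, registry_ids):
    """Percent of registry plants that also appear in covered_ids."""
    registry = set(registry_ids)
    if not registry:
        raise ValueError("registry is empty")
    # Intersect with the registry so stray IDs cannot push coverage above 100%.
    return 100.0 * len(set(covered_ids) & registry) / len(registry)


# Hypothetical toy registry of ten plants.
registry = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

# Hypothetical per-dimension plant IDs; 99 is not in the registry.
dimensions = {
    "generation": [1, 2, 3, 4, 5, 6, 7, 8, 9],
    "financial": [2, 5],
    "context": [7, 99],
}

summary = {name: coverage_pct(ids, registry) for name, ids in dimensions.items()}
for name, pct in summary.items():
    print(f"{name}: {pct:.1f}%")
```

Counting only IDs present in the registry keeps the denominator and numerator on the same population, which matters when a dimension is sourced from a different upstream dataset than the registry.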

Base layer

The EIA inventory is the foundation: every plant and generator registered with the US Energy Information Administration is included. Coverage is 100% by definition, because the platform uses EIA as the canonical plant registry. The base layer contains 15,528 plants with 29,738 generators.

Ownership

GEM trackers provide parent company identification for 7,818 plants (26.8%). The remaining plants rely on EIA-reported utility names, which often reflect special-purpose vehicle (SPV) legal entities rather than beneficial owners. Wikidata contributes cross-reference identifiers for linking to external knowledge bases.

Generation

The EIA Generation API provides monthly net generation (MWh) for 13,534 plants, or 89.9% of the fleet. The remaining ~10% are typically very small plants, recently commissioned facilities without a full reporting history, or plants exempt from filing Form EIA-923.

Financial

Financial data comes from three sources with distinct coverage:

FERC Form 1

~1,400 regulated electric utilities. Capital costs, O&M, fuel costs, depreciation. Does not cover independent power producers or merchant generators.

LBNL Utility-Scale Solar

~1,569 solar projects. Installed cost ($/W), PPA prices ($/MWh), energy value, and capacity factor.

FERC EQR

~594 plants matched by seller name. Wholesale contract prices, volumes, and counterparty names.

Combined: ~3,246 plants (21.6%) have at least one financial data point.
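Because the combined figure counts plants with at least one financial data point, it behaves like a set union rather than a sum of the per-source counts: a plant matched in two sources is counted once. A minimal sketch with hypothetical plant IDs (the IDs and variable names are illustrative assumptions, not platform data):

```python
# Hypothetical plant IDs matched by each financial source.
ferc_form1 = {"P001", "P002", "P003"}
lbnl_solar = {"P003", "P004"}
ferc_eqr = {"P002", "P005"}

# Plants with at least one financial data point: the union of all sources.
any_financial = ferc_form1 | lbnl_solar | ferc_eqr

# Naive sum of per-source counts, which double-counts overlapping plants.
sum_of_sources = len(ferc_form1) + len(lbnl_solar) + len(ferc_eqr)

# Number of extra counts caused by plants appearing in more than one source.
double_counted = sum_of_sources - len(any_financial)

print(len(any_financial), sum_of_sources, double_counted)
```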

The 21.6% financial coverage rate reflects a structural limit of the current sources, not a data gap: merchant generators, IPPs, and smaller facilities do not file public financial data. Even with every structured public data source added, the coverage ceiling is approximately 40–45%.

Pricing

The pricing crosswalk holds 20,618 plant-to-node entries across seven ISOs (CAISO, ERCOT, MISO, PJM, NYISO, ISO-NE, SPP). Approximately 74% of generators in ISO territories have at least one resolved pricing node. Hub assignment covers 99% of ISO generators.
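Crosswalk row counts and generator coverage measure different things, since one generator may map to several nodes and some rows may be unresolved. The sketch below assumes a crosswalk of (generator ID, node ID) pairs in which an unresolved node is represented as None. The row layout, IDs, and node names are hypothetical.

```python
# Hypothetical crosswalk rows: (generator_id, node_id); None marks an unresolved node.
crosswalk = [
    ("G1", "NODE_A"),
    ("G1", "NODE_B"),  # the same generator mapped to a second node
    ("G2", "NODE_C"),
    ("G3", None),      # row exists, but no node was resolved
]

# Hypothetical set of all generators located in ISO territories.
iso_generators = {"G1", "G2", "G3", "G4"}

# Generators with at least one resolved node, restricted to ISO generators.
resolved = {
    gen for gen, node in crosswalk
    if node is not None and gen in iso_generators
}

resolved_rows = sum(1 for _, node in crosswalk if node is not None)
share_pct = 100.0 * len(resolved) / len(iso_generators)

print(resolved_rows, len(resolved), share_pct)
```

In this toy data there are three resolved rows but only two covered generators, which is why a row count and a generator percentage are reported separately.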

Context

732 plants have Wikipedia summaries (linked via Wikidata Q-IDs). Coverage is naturally concentrated on larger, more notable facilities. Small distributed solar and minor peaker plants typically do not have Wikipedia articles.

News

News covers 8,023 plants and 909 interconnection queue projects, with ~112,000 articles grouped into ~39,000 unique stories. Collection targets plants with at least 10 MW of nameplate capacity and active queue projects. Approximately 51% of articles have been classified into categories by an LLM.
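For a rough sense of scale, the approximate figures above imply just under three articles per story and roughly 57,000 classified articles. The arithmetic below uses only the rounded numbers stated in this section, so the results are estimates; the variable names are illustrative.

```python
# Approximate figures taken from the paragraph above.
articles = 112_000
stories = 39_000
classified_pct = 51  # percent of articles classified by an LLM

# Average number of articles grouped into each unique story.
articles_per_story = articles / stories

# Estimated classified and unclassified article counts (integer arithmetic).
classified = articles * classified_pct // 100
unclassified = articles - classified

print(f"{articles_per_story:.1f} articles per story")
print(f"{classified:,} classified, {unclassified:,} unclassified")
```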

News is in Beta. Classification coverage and accuracy are improving with each build cycle. Unclassified articles are still displayed — they simply lack category badges and entity extraction.

Interconnection

The interconnection dataset holds 38,043 total queue entries from LBNL plus 7 ISO/RTO live feeds, of which 9,783 are active or suspended. Coverage is comprehensive for the 7 major ISOs. Smaller entities (BPA, PacifiCorp, SOCO, etc.) are covered through the LBNL baseline but may lack ISO-specific fields such as study phase and service type.
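The sketch below illustrates one way to select active or suspended entries and to tolerate missing ISO-specific fields. The record type, field names, and status labels are assumptions made for illustration, not the platform's schema. The final line uses the counts stated above to show that active or suspended entries are roughly a quarter of the queue.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class QueueEntry:
    """Hypothetical queue record; field names are illustrative."""
    queue_id: str
    status: str
    study_phase: Optional[str] = None  # may be absent for LBNL-only entries


# Assumed status labels for entries still in the queue.
OPEN_STATUSES = {"active", "suspended"}

entries = [
    QueueEntry("Q-1", "active", study_phase="facilities study"),
    QueueEntry("Q-2", "withdrawn"),
    QueueEntry("Q-3", "suspended"),
    QueueEntry("Q-4", "operational"),
]

# Keep only active or suspended entries.
open_entries = [e for e in entries if e.status in OPEN_STATUSES]

# Open entries that lack the ISO-specific study phase field.
missing_phase = [e.queue_id for e in open_entries if e.study_phase is None]

# Share of all queue entries that are active or suspended, from the stated counts.
active_share_pct = round(100 * 9_783 / 38_043, 1)

print([e.queue_id for e in open_entries], missing_phase, active_share_pct)
```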