Static Upstream Source Registry

Status: pass

metadata registry only; no source is approved for model training yet

Sources: 22 total; 8 priority-one; 0 model-ready.

PrioritySourceCategoryKindLocal statusFirst actionLeakage rule
1Local Government Directoryadmin_code_registrydated_snapshotnot_downloadedCreate dated LGD extract metadata and compare code coverage across boundary sources.Metadata/backbone only; not a target or benchmark signal.
1GeoParquetartifact_formatstable_formatnot_required_yetStandardize future H3 feature tables and coverage probes.Storage format only; no modeling signal.
1Google Open Buildingsbuilding_footprintsdated_snapshotreferenced_by_prior_artCross-check with ramSeraph and Microsoft footprints for disagreement cells.Physical-structure proxy only; never household truth by itself.
1Microsoft Global ML Building Footprintsbuilding_footprintsdated_snapshotreferenced_by_prior_artCompare disagreement cells against Google/ramSeraph building sources.Physical-structure proxy only; combine with population and land-use evidence.
1ESA WorldCoverlandcover_masksfixed_releasenot_downloadedFill the current missing WorldCover raster gap with source metadata and AOI probes.Land cover can suppress impossible cells; it is not income or household truth.
1Official Census India PCA and houselistingofficial_denominatorfixed_official_releasenot_downloadedPrefer official PCA/houselisting extracts over scraped mirrors for denominator QA.Independent source; age must be corrected with non-GeoIQ calibration or internal outcomes.
1GHSL / Global Human Settlement Layerpopulation_builtupfixed_releasenot_downloadedRegister release metadata, then compare against WorldPop/Census/building denominators.May be used as independent feature; never selected by GeoIQ benchmark fit.
1DuckDB Spatialprocessing_toolstable_toolnot_required_yetUse for registry/probe scripts when dependencies are available.Engineering tool only; no demand signal.
2PMTilesartifact_formatstable_formatnot_required_yetUse after AOI layers exist and need visual review.Visualization format only; no modeling signal.
2Google Open Buildings 2.5D Temporalbuilding_footprintsdated_snapshotnot_downloadedProbe only after static footprint coverage QA is in place.Use time slices fixed before evaluation; no post-outcome leakage in time holdouts.
2Dynamic World V1landcover_masksdated_image_windownot_downloadedUse only fixed AOI/date windows after WorldCover probes are stable.Dated independent raster only; no tuning windows against GeoIQ TAM.
2Planetary Computer STACraster_accesscatalog_access_patternnot_downloadedUse for raster provenance manifests, not as a feature by itself.Catalog metadata only; imagery dates must respect time holdouts.
2OpenStreetMap via OSMnx/SRAIroads_pois_placesdated_extractpartially_supported_by_srai_prior_artReplace ad hoc live pulls with versioned extracts plus source completeness flags.Sparse OSM means missingness can be mapping bias, not low opportunity.
2Overture Mapsroads_pois_placesmonthly_releasenot_downloadedUse named releases for H3 POI/road counts and OSM coverage comparison.Independent features only; release must predate any time-holdout outcome window.
2ohsome APIsource_coverage_qadated_query_outputnot_downloadedAdd completeness features before interpreting OSM sparse cells.Coverage QA only; should explain confidence, not become demand truth.
3M-Labconnectivity_proxydated_aggregatenot_downloadedUse city-level aggregates only if Ookla/internal comparison shows value.Measurement availability is biased; treat as diagnostic until validated.
3Ookla Open Dataconnectivity_proxyquarterly_snapshotnot_downloadedProbe on selected AOIs before adding any national feature.External proxy only; not a substitute for internal serviceability or capacity.
3OpenCelliDconnectivity_proxydated_snapshotnot_downloadedReview license/account constraints, then run only AOI coverage probes.Infrastructure presence does not imply serviceability for this product.
4Clay Foundation Modelfoundation_model_referencemodel_referencenot_downloadedDefer until transparent satellite features are stable.No benchmark-driven feature selection against GeoIQ.
4IBM/NASA Prithvifoundation_model_referencemodel_referencenot_downloadedDefer until transparent raster/building features leave clear residual gaps.No fine-tuning or model selection on GeoIQ labels.
4AllenAI SatlasPretrainfoundation_model_referencemodel_referencenot_downloadedUse only as a benchmark against TorchGeo/transparent raster features.No direct or indirect GeoIQ-label tuning.
4TerraTorchfoundation_model_toolingstable_toolnot_required_yetDefer until the project has stable raster labels and compute budget.Advanced tooling only; no GeoIQ-tuned remote-sensing model.