Population-weighted centres of local administrative units (LAU) in Europe

If you need the coordinates of a location, where do you put the centre?

Giorgio Comai https://giorgiocomai.eu (OBCT/EDJNet)https://www.europeandatajournalism.eu/

N.B. This repository includes datasets pre-processed to facilitate common operations and full correspondence with commonly used sources. Full correspondence is possible only through some compromise solutions: ensure that the data provided are fit for purpose for your specific use case. Please open an issue if you find problems with the data or have suggestions for improvement.

This repository presents datasets of the population-weighted centres of local administrative units in Europe, the scripts used to generate them with different approaches, as well as details on quality checks and peculiarities of the data.

It also tries to facilitate matching a given local administrative unit to their correspondent NUTS region.

Motivation

In reference to population-weighted centres, check out the following blog post for context:

How to find the population-weighted centre of local administrative units

The ~30 seconds you see starting with minute 3 of this video show visually how this works overlaying the population grid to Google Earth imagery.

Preliminary notes on terminology and data

See the relevant Wikipedia page for details.

Eurostat releases every year an updated dataset with LAUs, including basic statistics about each of them as well as the spatial boundaries in formats that can be read by a number of software packages. The same is true for NUTS, with the exception that they are not updated yearly: recent years of release of NUTS regions include 2021, 2016, and 2013.

Besides, the coverage of these datasets is not the same across all years, as some countries were not part of the NUTS system earlier on.

This discrepancy in release dates means that, as LAUs are merged or change their boundaries, sometimes across different NUTS regions, there may not be perfect concordance between, for example, LAU for 2019 and NUTS for 2016. Eurostat does distribute concordance table, but these are not perfectly consistent… some LAUs are missing for some countries, some countries are missing, etc. (see this pages to see examples of how they do not match)).

This forces users of these data to make a judgement call: should they use old data? should they manually find where the LAUs missing in the concordance tables are located? Should they mix and match?

This repository outlines some possible solutions and provides consistently pre-processed data. The user should still be mindful of the structural imperfection of the results.

Steps and intermediate datasets

Since fully consistent concordance tables between LAU and NUTS published by the EU are not available for most years, a first step in the data processing involves situating each LAU in a NUTS region. This is achieved by calculating the spatial overlap between the relevant geometries.

This process results in three sets of datasets:

It is finally possible to get back to the original objective of this endeavour: having a ready-made dataset with the population-weighted centres of all LAUs accross Europe. As detailed in the page explaining how population-weighted centres are calculated, there are various elements determining the final coordinates. At this stage, datasets with the following characteristics are available:

All datasets are available in the lau_centres folder in this repository

“I just need a dataset with LAU centres, what should I download?”

The most recent, consistent, and complete datasets are probably the following:

Future and forthcoming

This dataset may be updated to include datasets based on more recent data, to add coverage for other European countries not included in Eurostat datasets, or to recalculate the population-weighted centres based on high-resolution population grids.

Data availability

This repository covers local administrative units as distributed by Europen Union’s Gisco services, and tentatively in nieghbouring jurisdictions.

The main determinants of data availability are:

Additional sources for the population grid include High Resolution Population Density Maps distributed by Facebook.

GADM for administrative boundaries.

Sources

Local Administrative Units (LAU)

The original dataset of Local Administrative Units (LAU) can be downloaded from the following link: https://gisco-services.ec.europa.eu/distribution/v2/lau/download/

LAU can be matched to NUTS via concordance tables: https://ec.europa.eu/eurostat/web/nuts/local-administrative-units

Population grid

The population grid can be downloaded from the following link: https://ec.europa.eu/eurostat/web/gisco/geodata/reference-data/population-distribution-demography/geostat

Details on how the population grid was generated as well as context on the reliability of the data is included in the factsheet that can be downloaded with the latest available dataset (at the time of writing), based on 2018 data and released in 2021.

The data included in this repository are based on various datasets that come with different licensing. Geographic data are not included in this repository and should instead be downloaded from the sources, as current licensing does not allow for redistribution of geographic data nor for their use for commercial purposes (see details below). Indeed, we do not include any of the geo-spatial data provided in the original datasets, and include only standard identifiers of NUTS regions and LAUs as well as basic statistics distributed by Eurostat.

See below and links to the sources for more details on copyright and licensing.

© EuroGeographics for the administrative boundaries (see full licensing details for LAUs)

Licensing for the population grid dataset varies depening on year and the country originally providing the data. See the relevant page for full details. The population grid for 2018 has been created by the European Commission, Joint Research Centre (B.3) and DG REGIO (B.1); ownership: European Commission.

Correspondence tables do not seem to carry additional licensing, besides what is standard to all contents distributed by Eurostat.

Besides considering all of the above, if you use these data or the scripts that generate them you are encouraged, but not required, to credit EDJNet and, if relevant, to link to this repository.