For a quick summary of the dataset, see Section I of Henderson et al. (2012). For detailed discussion on the data, see Doll (2008).
The data is becoming popular among economists.
Henderson et al. (2012) and Pinkovskiy and Sala-i-Martin (2016) use nighttime light to improve the data on national accounts GDP.
Michalopoulos and Papaioannou (2013, 2014), and Alesina et al. (2016) use nighttime light as a measure of living standards across African ethnic groups.
Hodler and Raschky (2014) exploit the annual panel nature of the data to find that the birth place of a new national leader becomes brighter after he assumes power.
Baskaran et al (2015) relate nighttime light to electoral cycles in India.
Storeygard (2016) uses light as a measure of city-level income across cities in Africa.
Bleakey and Lin (2012) use nighttime light as a measure of spatial distribution of contemporary economic activity, to see whether portage sites still predict where economic activities are concentrated today, long after their original advantage became obsolete.
To understand how this dataset is constructed from the original satellite images and the potential data issues, see Elvidge et al. (2001) and Elvidge et al. (2010). Noor et al. (2008) is also useful to understand this data. See also Alexei Abrahams's guest post for Development Impact Blog.
Digital number: it's "not exactly proportional to the physical amount of light received (called true radiance)," quoted from p. 999 of Henderson et al. (2012).
Top-coding: The maximum value of light intensity is 63. This issue shouldn't matter much for poor and middle-income countries. Henderson et al. (2012) remove Singapore and Bahrain from their cross-country analysis for this concern (see footnote 16)
Bottom-censoring: Henderson et al. (2012) notes that there are "remarkably few pixels with digital numbers of 1 or 2" (p. 1000). Storeygard (2016) describes how the data processing algorithm causes bottom-censoring (see Appendix section A.8).
Compatibility across years and satellites: Satellite sensors age over time and are replaced periodically. Thus, the same digital number does not necessarily mean the same level of light intensity across years and satellites. Henderson et al. (2012) deal with this concern by controlling for year fixed effects in a regression of log GDP on log light per area.
- Alternatively, the following book chapter attempts to calibrate values from different satellites to account for inter-satellite differences and inter-annual sensor decay:
- Elvidge, Christopher D., Feng-Chi Hsu, Kimberly E. Baugh and Tilottama Ghosh (2014). "National Trends in Satellite Observed Lighting: 1992-2012." Global Urban Monitoring and Assessment Through Earth Observation. Ed. Qihao Weng. CRC Press. (The working paper version is available here.)
- The calibrated version aggregated to the 0.5x0.5 degree cell level is available as part of the PRIO-GRID data.
Blooming: Light tends to be magnified over certain terrain types such as water and snow cover.
Blurring: A single point source of light would be recorded in several neighbouring cells due to the way the satellite sensor captures the light emission. See Alexei Abrahams's guest post for Development Impact Blog for more detail.
- To deblur the data with Abrahams's Matlab code, you need the pct_lights.tif files. Unfortunately, this file for 2011 is missing on the website. If you have downloaded and kept this file somewhere in your computer, let NOAA people know about it.
High latitude locations: Due to long daytime length, nighttime light cannot be observed in summer for high latitude locations (the raw satellite images are taken between 8:30 and 10:00 pm local time). For this reason, Henderson et al. (2012) exclude observations north of the Arctic Circle.
Validation as a measure of income/wealth
Logarithm of light intensity per area (and its long-run change over the 15-year period) is known to be linearly correlated with
- Logarithm of total GDP (and its change over the 15-year period) at the country-level (Henderson et al. (2012)), at the sub-national region level (Hodler and Raschky (2014)) and at the Chinese city/prefecture level (Storeygard (2016), Table 1 columns 4-5). Estimated elasticity is around 0.3 in all these studies.
- Average DHS wealth index (see this post) across households at the enumeration area level in Africa (Michalopoulos and Papaioannou 2013)
- Logarithm of per capita GDP (and household-survey mean income/expenditure) across the world for 1992-2010 (Pinkovskiy and Sala-i-Martin 2016).
Validation as a measure of public goods provision
Michalopoulos and Papaioannou (2014) shows that logarithm of light intensity per area is correlated with access to electrification, presence of a sewage system, access to piped water, and education (averaged across households in each enumeration area) from Afrobarometer Surveys in 17 African countries.
- Electrified villages are consistently brighter than unelectrified villages across a variety of nighttime satellite images
- Electrified villages appear brighter in satellite imagery because of the presence of streetlights, and brightness increases with the number of streetlights.
- The correlation between light output recorded by the satellite with household electricity use and access is low.
See also Chen and Nordhaus (2011).
The raw data ranges from 0 to 63 at the 30x30 arc-second cells. To be used in regression analysis, there are several ways to aggregate the raw data.
- Henderson et al. (2012) (see footnote 7) obtain the weighted average across pixels within a country, where the weight is the land area of each 30x30 arc-second pixel, obtained from CIESIN/IFPRI/CIAT (2004).
- Michalopoulos and Papaioannou (2013, 2014) and Hodler and Raschky (2014) use the logarithm of light intensity per area within each spatial unit of analysis.
- Logarithmic transformation is used because the distribution of nighttime light intensity is right-skewed with around 10% of observations being zero.
- 0.01 is added to the average before taking log, to use the 10% of the observations without light.
- Alesina et al. (2016) and Baskaran et al (2015) use the average or sum of light values from all pixels within each spatial unit of analysis divided by population.
- Baskaran et al (2015) also measure the proportion of villages with the positive value of nighttime light at the village centroid.
- Storeygard (2016) measure the city-level light intensity as follows: first convert the original data "into one binary grid encoding whether a pixel was lit in at least one satellite-year. These ever-lit areas were then converted to polygons; contiguous ever-lit pixels were aggregated, and their DNs were summed within each satellite-year." (p. 1268)