Past project data and code
Model of shocks in networks code from Marchand et al. (2016)
Shock detection function from Gephart et al. (2017)
Food system resilience code and data from Seekell et al. (2017)
Blue food environmental performance code and data from Gephart et al. (2021)
China re-export code and data from Asche et al. (2022)
Shock detection function from Gephart et al. (2017)
Food system resilience code and data from Seekell et al. (2017)
Blue food environmental performance code and data from Gephart et al. (2021)
China re-export code and data from Asche et al. (2022)
Aquatic Resource Trade in Species (ARTIS) Database
The ARTIS Database is currently under development and will provide estimated species/species group trade flows from source producing country to final consuming country. Check out the ARTIS Database website for more information or to get involved.
Food security and the environment data sources
Much of my work leverages existing data sources. As a result, I often keep track of data that is potentially useful for my research.
I have included some of these below in case it is helpful to others.
I have included some of these below in case it is helpful to others.
Food production and trade
Food and Agriculture Organization (FAO): FAOSTAT
This database provides population data and projections for each country from 1950–2100 and food production, food trade, and agriculture-related resource use data for 1961–2014. There is also a suite of food security indices. In order to provide complete spatial and temporal coverage, some data provided is interpolated or modeled. One should pay close attention to the documentation before proceeding with an analysis.
Food and Agriculture Organization (FAO) Fisheries and Aquaculture Department: FishStatJ
This database consists of productions and trade data in terms of price and quantity by country for 1950–2012. FishStat allows for multiple species and commodity grouping schemes to facilitate matching across data sources. It should be noted though that there is not complete consistency between the production by source and production and trade data sets.
Sea Around Us
This database estimates marine extractions for each country from 1950–present. The data can be disaggregated by industrial, artisanal, recreational, or subsistence sectors, by taxon, and by reported vs. unreported.
United Nations: COMTRADE
This data is maintained by the United Nations and consists of reported bilateral trade data (annually and monthly) for commodities (using HS codes). This data is publicly available when accessing a limited number of records. A subscription allows bulk downloads and access to a derived data product produced by CEPII, the BACI trade data.
U.S. Commodity Flow Survey
The Commodity Flow Survey (CFS), undertaken through a partnership between the Census Bureau and the Bureau of Transportation Statistics (BTS) is conducted every 5 years as part of the Economic Census. The CFS produces data on the movement of goods in the United States. It provides information on commodities shipped, their value, weight, and mode of transportation, as well as the origin and destination of shipments of commodities from manufacturing, mining, wholesale, and selected retail and services establishments.
NOAA Commercial Fisheries Statistics
The National Oceanic and Atmospheric Administration (NOAA) maintains US fisheries landings and trade statistics. Additional US trade data is available through the US Census data portal.
USDA Food Availability Data System
This data system contains the food availability data through 2015, the loss-adjusted food availability data through 2015, and the nutrient availability data through 2010. The 2015 data contains estimates for over 200 commodities.
Health and demographic
Institute for Health Metrics and Evaluation (IHME): Global Health Data Exchange (GHDx)
This data set provides estimates for health and demographic data using Bayesian models which are informed by a range of researcher, government, and NGO data. The estimates include uncertainty and the code generating the estimates is available. Among other estimates, the database contains estimates of deaths by cause (including protein-energy malnutrition, iodine deficiency, iron-deficiency anemia, and other nutritional deficiencies). There is good documentation for the estimates, but one must be careful to know what data went into the models so as not to test an effect when non-independent data were used in the estimate. The article describing the data includes a supplementary file detailing the estimation methods and can be found here.
USAID: Demographic and Health Surveys (DHS)
This is a database of survey data by country. The surveys are based on a set of model questionnaires, but are modified for each country and therefore are not identical. A description of this data is available here.
The World Bank: Subnational Malnutrition
Database with subnational estimates of child malnutrition (prevalence for stunting, underweight, overweight, wasting and severe wasting indicators) using available data up to 2012. Data can be accessed using an API. These statistics by quintile are available in the World Bank’s “Health Nutrition and Population Statistics by Wealth Quintile” database.
The World Bank: Central Microdata Catalog
This database consists of national living standards and health surveys, including access to the surveys used.
World Health Organization (WHO): Mortality Database
The WHO Mortality Database is a compilation of mortality data by age, sex and cause of death, as reported annually by Member States from their civil registration systems.
Tufts’ Global Dietary Database (GDD)
This database compiles global information on food and nutrition consumption levels.
Economic
The World Bank: Global Consumption Database
This database consists of data on household consumption patterns in developing countries based on household surveys. Four levels of consumption are used to segment the market in each country: lowest, low, middle, and higher. They are based on global income distribution data, which rank the global population by income per capita. More details on the data can be found here.
Bureau of Economic Analysis
This database includes a range of economic data for the US, including GDP, employment by industry, price indices, and international trade.
United States Department of Agriculture Economic Research Service: Commodity and Food Elasticities
This database consists of information and estimates on price and income elasticities for agricultural products. Most of the demand elasticities in this data product are from academic and government research conducted in the United States on consumer demand, as published in working papers, dissertations, and peer-reviewed journals and as presented at professional conferences in the United States. The most fully covered countries are the United States and China, and the greatest number of demand studies are for vegetables, fruits, and grocery products such as coffee and ketchup.
United States Department of Labor: Bureau of Labor Statistics
This database consists of economic data for the United States, including data on on inflation, prices, and time use.
Food and the Environment
Water Footprint Network Water Footprint Statistics
This data set contains estimates for green, blue and grey water footprints of crops, derived crop products, biofuels, livestock products, and industrial products. All data are available at national and sub-national level.
Food and Agriculture Organization (FAO): CropWat
This computer program enables the calculation of crop water requirements and irrigation requirements based on soil, climate and crop data. The program allows for exploration of alternate management conditions.
International Fertilizer Association Fertilizer Use by Crop
This data set contains fertilizer use by crop in the main fertilizer-consuming countries. The data covers 27 countries and the EU-27, which accounts for 94% of global fertilizer consumption.
USDA Cropland Data Layer
This database contains annual spatial crop cover data for the US for 1997-present.
WorldClim
WorldClim is a set of global climate layers (gridded climate data) with a spatial resolution of about 1 km2. These data can be used for mapping and spatial modeling.
EORA World MRIO
This global supply chain database consists of a multi-region input-output table with matching environmental and social satellite accounts for 190 countries. The database includes over 15,000 sectors for 1990-2015 and a range of environmental indicators, including GHG emissions, air pollution, energy use, water requirements, land occupation and N and P emissions.
Food and Agriculture Organization (FAO): FAOSTAT
This database provides population data and projections for each country from 1950–2100 and food production, food trade, and agriculture-related resource use data for 1961–2014. There is also a suite of food security indices. In order to provide complete spatial and temporal coverage, some data provided is interpolated or modeled. One should pay close attention to the documentation before proceeding with an analysis.
Food and Agriculture Organization (FAO) Fisheries and Aquaculture Department: FishStatJ
This database consists of productions and trade data in terms of price and quantity by country for 1950–2012. FishStat allows for multiple species and commodity grouping schemes to facilitate matching across data sources. It should be noted though that there is not complete consistency between the production by source and production and trade data sets.
Sea Around Us
This database estimates marine extractions for each country from 1950–present. The data can be disaggregated by industrial, artisanal, recreational, or subsistence sectors, by taxon, and by reported vs. unreported.
United Nations: COMTRADE
This data is maintained by the United Nations and consists of reported bilateral trade data (annually and monthly) for commodities (using HS codes). This data is publicly available when accessing a limited number of records. A subscription allows bulk downloads and access to a derived data product produced by CEPII, the BACI trade data.
U.S. Commodity Flow Survey
The Commodity Flow Survey (CFS), undertaken through a partnership between the Census Bureau and the Bureau of Transportation Statistics (BTS) is conducted every 5 years as part of the Economic Census. The CFS produces data on the movement of goods in the United States. It provides information on commodities shipped, their value, weight, and mode of transportation, as well as the origin and destination of shipments of commodities from manufacturing, mining, wholesale, and selected retail and services establishments.
NOAA Commercial Fisheries Statistics
The National Oceanic and Atmospheric Administration (NOAA) maintains US fisheries landings and trade statistics. Additional US trade data is available through the US Census data portal.
USDA Food Availability Data System
This data system contains the food availability data through 2015, the loss-adjusted food availability data through 2015, and the nutrient availability data through 2010. The 2015 data contains estimates for over 200 commodities.
Health and demographic
Institute for Health Metrics and Evaluation (IHME): Global Health Data Exchange (GHDx)
This data set provides estimates for health and demographic data using Bayesian models which are informed by a range of researcher, government, and NGO data. The estimates include uncertainty and the code generating the estimates is available. Among other estimates, the database contains estimates of deaths by cause (including protein-energy malnutrition, iodine deficiency, iron-deficiency anemia, and other nutritional deficiencies). There is good documentation for the estimates, but one must be careful to know what data went into the models so as not to test an effect when non-independent data were used in the estimate. The article describing the data includes a supplementary file detailing the estimation methods and can be found here.
USAID: Demographic and Health Surveys (DHS)
This is a database of survey data by country. The surveys are based on a set of model questionnaires, but are modified for each country and therefore are not identical. A description of this data is available here.
The World Bank: Subnational Malnutrition
Database with subnational estimates of child malnutrition (prevalence for stunting, underweight, overweight, wasting and severe wasting indicators) using available data up to 2012. Data can be accessed using an API. These statistics by quintile are available in the World Bank’s “Health Nutrition and Population Statistics by Wealth Quintile” database.
The World Bank: Central Microdata Catalog
This database consists of national living standards and health surveys, including access to the surveys used.
World Health Organization (WHO): Mortality Database
The WHO Mortality Database is a compilation of mortality data by age, sex and cause of death, as reported annually by Member States from their civil registration systems.
Tufts’ Global Dietary Database (GDD)
This database compiles global information on food and nutrition consumption levels.
Economic
The World Bank: Global Consumption Database
This database consists of data on household consumption patterns in developing countries based on household surveys. Four levels of consumption are used to segment the market in each country: lowest, low, middle, and higher. They are based on global income distribution data, which rank the global population by income per capita. More details on the data can be found here.
Bureau of Economic Analysis
This database includes a range of economic data for the US, including GDP, employment by industry, price indices, and international trade.
United States Department of Agriculture Economic Research Service: Commodity and Food Elasticities
This database consists of information and estimates on price and income elasticities for agricultural products. Most of the demand elasticities in this data product are from academic and government research conducted in the United States on consumer demand, as published in working papers, dissertations, and peer-reviewed journals and as presented at professional conferences in the United States. The most fully covered countries are the United States and China, and the greatest number of demand studies are for vegetables, fruits, and grocery products such as coffee and ketchup.
United States Department of Labor: Bureau of Labor Statistics
This database consists of economic data for the United States, including data on on inflation, prices, and time use.
Food and the Environment
Water Footprint Network Water Footprint Statistics
This data set contains estimates for green, blue and grey water footprints of crops, derived crop products, biofuels, livestock products, and industrial products. All data are available at national and sub-national level.
Food and Agriculture Organization (FAO): CropWat
This computer program enables the calculation of crop water requirements and irrigation requirements based on soil, climate and crop data. The program allows for exploration of alternate management conditions.
International Fertilizer Association Fertilizer Use by Crop
This data set contains fertilizer use by crop in the main fertilizer-consuming countries. The data covers 27 countries and the EU-27, which accounts for 94% of global fertilizer consumption.
USDA Cropland Data Layer
This database contains annual spatial crop cover data for the US for 1997-present.
WorldClim
WorldClim is a set of global climate layers (gridded climate data) with a spatial resolution of about 1 km2. These data can be used for mapping and spatial modeling.
EORA World MRIO
This global supply chain database consists of a multi-region input-output table with matching environmental and social satellite accounts for 190 countries. The database includes over 15,000 sectors for 1990-2015 and a range of environmental indicators, including GHG emissions, air pollution, energy use, water requirements, land occupation and N and P emissions.