Wayne's Github Page

A place to learn about statistics

Predicting corn yields

One of the biggest challenges in agriculture is to forecast the amount of corn that will be harvested in the US (given the decreasing land dedicated to agriculture). This also commonly one of the headlines in the USDA announcements and drastically affect commodity prices.

There are many sources for corn yield data but the most official survey is the US Department of Agriculture (USDA) National Agricultural Statistics Service (NASS). They survey farmers then aggregate the results to create (surprisingly) accurate forecasts for the total production of corn each year. This survey is different from the census, which is calculating from tallying actual yields reported for tax filing purposes.

To get the data, the NASS QuickStats API is recommended. You’ll need to register for a key here.

An example code to retrieve the data is shown here. It is important to know that “yield” is calculated by total production (the mass of corn, US is special in using bushels, a volume measure for this) divided by the land (the acres of land harvested). There are scenarios where land can be planted but not harvested (e.g. puddling causing the roots to rot).

Tasks: