Lite Learning: Efficient Crop Classification in Tanzania Using Traditional Machine Learning & Crowd Sourcing

19 Pages Posted: 10 Jan 2025

See all articles by Michael Mann

Michael Mann

The George Washington University

Lisa Colson

affiliation not provided to SSRN

Rory Nealon

USAID

Ryan Engstrom

George Washington University - Department of Geography

Stellamaris Nakacwa

Texas Tech University

Abstract

This study introduces a novel methodology for crop type classification in Tanzania by integrating crowdsourced data with time-series features extracted from Sentinel-2 satellite imagery. Leveraging the YouthMappers network, we collected ground validation data on various crops, including challenging types such as cassava, millet, sunflower, sorghum, and cotton across a range of agricultural areas. Traditional machine learning algorithms, augmented with carefully engineered time-series features, were employed to map the different crop classes. Our approach achieved high classification accuracy, evidenced by a Cohen's Kappa score of 0.80 and an F1-micro score of 0.82. The model often match or outperform broadly used land cover models which simply classify 'agriculture' without specifying crop types. By interpreting feature importance using SHAP values, we identified key time-series features driving the model's performance, enhancing both interpretability and reliability. Our findings demonstrate that traditional machine learning techniques, combined with computationally efficient feature extraction methods, offer a practical and effective “lite learning” approach for mapping crop types in data-scarce environments. This methodology facilitates accurate crop type classification using a low-cost, resource-limited approach that contributes valuable insights for sustainable agricultural practices and informed policy-making, ultimately impacting food security and land management in resource-limited contexts, such as sub-Saharan Africa.

Keywords: Agriculture, Feature Extraction, Classification, Time-Series, Crop Type

Suggested Citation

Mann, Michael and Colson, Lisa and Nealon, Rory and Engstrom, Ryan and Nakacwa, Stellamaris, Lite Learning: Efficient Crop Classification in Tanzania Using Traditional Machine Learning & Crowd Sourcing. Available at SSRN: https://ssrn.com/abstract=5090897 or http://dx.doi.org/10.2139/ssrn.5090897

Michael Mann (Contact Author)

The George Washington University ( email )

Washington, DC

Lisa Colson

affiliation not provided to SSRN ( email )

No Address Available

Rory Nealon

USAID ( email )

Dhaka
Bangladesh

Ryan Engstrom

George Washington University - Department of Geography ( email )

1922 F St., NW
Washington
DC 20052
United States

Stellamaris Nakacwa

Texas Tech University ( email )

2500 Broadway
Lubbock, TX 79409
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
33
Abstract Views
320
PlumX Metrics