Untangling crop management and environmental influences on wheat yield variability in Bangladesh: An application of non-parametric approaches

In South Asia, wheat is typically grown in favorable environments, although policies promoting intensification in Bangladesh's stress-prone coastal zone have resulted in expanded cultivation in this non-traditional area. Relatively little is known about howto best manage wheat in these unique environments. Research is thus needed to identify ‘best-bet’ entry points to optimize productivity, but classical parametric analyses offer limited applicability to elucidate the relative importa nce of the multiple factors and interactions that influence yield under such conditions. This problem is most evident in datasets derived from farmer-participatory research, where missing values and skewed data are common. This paper examines the predictive power of three nonparametric approaches, including linear mixed effects models (LMMs), and two binary recursive partitioning methods: classification and regression trees (CARTs)and Random Forests We collected yield, crop management, and environmental observations from 422 wheat fields in the 2012–13 season, across six production environments spanning southern Bangladesh, where nutrient rates and genotypes were imposed, but management of other p roduction factors varied from farmer to farmer. Fields were grouped into categories including early and late-sowing, depending on crop establishment before or after December 15, respectively, and in combination, across both early- and late-sowing groups. For each of these groups, we investigated how each non-parametric analysis predicted the factors influencing yield. All three approaches identified nitrogen rate and environment as the most important factors, regardless of sowing category. CART also identified assemblages of high- and low-yielding environments, although those located in saline and warmer thermal zones were not necessarily the lowest yielding, indicating that farmers can optimize crop management to overcome these constra ints. The number of days farmers sowed wheat before or after December 15, days to maturity, and the number of irrigations and weedings also influenced yield, though each method weighted these factors differently. LMMs also indicated a slight yield advantage when farmers used stress-tolerant genotypes, though CART and Random Forests did not. One-to-one plots for observed vs. predicted yields from LMMs and Random Forests showed better performance by the former than the latter, wit h smaller root mean square and mean absolute error for the combined, early- and late-sowing groups, respectively. While the LMMs were superior in this case, Random Forests may still prove useful in the classification and interpretation of farm survey data in which no treatment interventions have been administered.

Data and Resources

Additional Info

Field Value
Author Timothy J. Krupnik, Zia Uddin Ahmed, Jagadish Timsina, Samina Yasmin, Farhad Hossain, Abdullah Al Mamun, Aminul Islam Mridha, Andrew J. McDonald
Maintainer CIMMYT Research Data & Software Repository Network
Last Updated January 20, 2025, 16:49 (UTC)
Created January 20, 2025, 16:49 (UTC)
contributor Ashok Rai
creator Timothy J. Krupnik
date 2017-09-25T00:00:00
harvest_object_id f3df49ba-d3f1-496b-aacc-fa6ba358bb9b
harvest_source_id a58b0729-e941-4389-816d-5823f01c0d28
harvest_source_title CIMMYT Research Data
identifier https://hdl.handle.net/11529/11037
language English
metadata_modified 2024-10-26T07:00:02
set_spec csisadvn