Bimonthly

ISSN 1006-9585

CN 11-3693/P

+Advanced Search 中文版
Meteorological factors-driven prediction of interannual variability of soybean yield in Northeast China based on machine learning methods
DOI:
Author:
Affiliation:

1.Department of Atmospheric Sciences, Yunnan University;2.National Key Laboratory of Earth System Numerical Modeling and Application, Institute of Atmospheric Physics, Chinese Academy of Sciences;3.National Key Laboratory of Earth System Numerical Modeling and Application, Institute of Atmospheric Physics

Fund Project:

Guangdong Major Project of Basic and Applied Basic Research (2021B0301030007), National Natural Science Foundation of China (2017YFA0604302), National Natural Science Foundation of China (42175074), Natural Science Foundation of Yunnan Province (202302AN360006), Technological Infrastructure project “Earth System Science Numerical Simulator Facility”

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
    Abstract:

    Soybean is one of the world"s four major grain crops and the most important source of vegetable oil and protein. China, as the world"s fourth-largest soybean producer and largest consumer, relies heavily on Northeast China for domestic production, which accounts for approximately half of the national output. The interannual variability of soybean yield in Northeast China is primarily driven by meteorological factors. Its accurate prediction is crucial for food security and market stability. Previous statistical prediction studies have been limited to local areas or single provinces and have only provided fitting skills or short-term (≤5 years) prediction skills. To address these limitations, this study developed prediction models for interannual soybean yield variations at the provincial scale in Northeast China during 1981-2018 using six machine learning methods based on meteorological factors. The main findings are: (1) Ridge regression showed the best overall performance among the six methods, with cross-validation correlation coefficients reaching 0.48 (P<0.01), 0.58 (P<0.001), and 0.72 (P<0.001) in Heilongjiang, Jilin, and Liaoning provinces, respectively; (2) Compared to stepwise linear regression, ridge regression demonstrated superior performance in a correlation coefficient (R) and root mean square error (RMSE), with slightly lower accuracy only in amplitude prediction for Jilin and Liaoning provinces; (3) Predictor selection and sample augmentation generally improved the cross-validation prediction skills of machine learning models; (4) The critical meteorological impact window concentrated in the flowering and pod-setting period (July-August), during which the positive effects of temperature, precipitation, and sunshine duration significantly enhanced final yields through promoting pod formation, grain development, and photosynthesis. These findings provide scientific support for soybean yield prediction and agricultural risk management in Northeast China.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:February 21,2025
  • Revised:April 11,2025
  • Adopted:April 13,2025
  • Online: April 18,2025
  • Published: