Comparative Analysis of Machine Learning Regression Models for Paddy Yield Prediction

Authors

  • Chery Cardinawati Sitohang, Universitas Informatika dan Bisnis Indonesia
  • Fitri Kinkin, Universitas Informatika dan Bisnis Indonesia
  • Budiman, Universitas Informatika dan Bisnis Indonesia

        DOI:

        https://doi.org/10.65780/bima.v1i3.17

        Keywords:

        Paddy yield prediction, machine learning, regression algorithms, agricultural data, Linear Regression

        Abstract

        Accurate paddy yield prediction is essential to support food security, agricultural planning, and data-driven decision-making. The increasing availability of agricultural data has encouraged the adoption of machine learning approaches to overcome the limitations of conventional yield estimation methods. This study presents a comparative analysis of five regression-based machine learning algorithms—Linear Regression, K-Nearest Neighbors Regressor, Decision Tree Regressor, Random Forest Regressor, and Support Vector Regression—for paddy yield prediction. The experiments were conducted using the Paddy dataset from the UCI Machine Learning Repository, which consists of 2,789 samples and 45 variables (44 input features and 1 target variable). The dataset was preprocessed through data cleaning, feature standardization, and an 80:20 train–test split. Model performance was evaluated using Mean Absolute Error, Mean Squared Error, Root Mean Squared Error, and the coefficient of determination (R²). Experimental results show that Linear Regression achieved the best overall performance with an R² value of 0.9896 and an RMSE of 942.09, indicating strong predictive accuracy and stability. Despite its simplicity, Linear Regression outperformed more complex models, suggesting that the underlying relationships between input variables and paddy yield in the dataset are predominantly linear. These findings highlight the importance of systematic model evaluation and demonstrate that simpler regression models can remain effective and interpretable for practical paddy yield prediction and agricultural decision support systems.
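The evaluation workflow described in the abstract (an 80:20 train–test split, feature standardization, model fitting, then MAE, MSE, RMSE, and R²) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses only the Python standard library, a single synthetic feature in place of the 44 inputs of the UCI Paddy dataset, and only the linear model out of the five compared. All function names (`train_test_split`, `standardize`, `fit_simple_linear`, `regression_metrics`) and the synthetic data are hypothetical.

```python
import math
import random


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of rows and return (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def standardize(train_values, test_values):
    """Scale both lists using the mean and std of the training values only."""
    mean = sum(train_values) / len(train_values)
    variance = sum((v - mean) ** 2 for v in train_values) / len(train_values)
    std = math.sqrt(variance) or 1.0  # guard against a constant feature
    return ([(v - mean) / std for v in train_values],
            [(v - mean) / std for v in test_values])


def fit_simple_linear(x, y):
    """Ordinary least squares with one feature; returns (slope, intercept)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, RMSE and R^2 for two equal-length sequences."""
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    # R^2 = 1 - SS_res / SS_tot; undefined when y_true is constant.
    r2 = 1.0 - (mse * n) / ss_tot
    return {"mae": mae, "mse": mse, "rmse": math.sqrt(mse), "r2": r2}


# Synthetic, roughly linear data (hypothetical; NOT the UCI Paddy dataset).
rng = random.Random(42)
rows = [(x, 50.0 * x + 1000.0 + rng.gauss(0.0, 25.0)) for x in range(100)]

train, test = train_test_split(rows, test_fraction=0.2, seed=0)
x_train, x_test = standardize([r[0] for r in train], [r[0] for r in test])
slope, intercept = fit_simple_linear(x_train, [r[1] for r in train])
predictions = [slope * x + intercept for x in x_test]
scores = regression_metrics([r[1] for r in test], predictions)
print({name: round(value, 4) for name, value in scores.items()})
```

As a hand-checkable case, `regression_metrics([10, 20, 30, 40], [12, 18, 33, 37])` gives MAE 2.5, MSE 6.5, RMSE √6.5 ≈ 2.55, and R² 0.948. The scaling statistics are computed on the training portion only, so no information from the held-out 20% enters the fit; whether the study applied standardization before or after splitting is not stated in the abstract.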

        Published

        2026-03-31

        How to Cite

        Comparative Analysis of Machine Learning Regression Models for Paddy Yield Prediction. (2026). Bulletin of Intelligent Machines and Algorithms, 1(3), 93-100. https://doi.org/10.65780/bima.v1i3.17
