ANNOTATED IMAGE SET OF SOYBEAN FIELDS FOR TRAINING PLANT DETECTION MODELS
DOI:
https://doi.org/10.54668/2789-6323-2026-121-1-100-109Keywords:
точное земледелие, соя, детектирование, данные с БПЛА, нейронные сети, YOLOv8.Abstract
This article presents annotated Soybean_2021_Almalybak dataset designed for automatic analysis of agricultural areas based on aerial photographs obtained using unmanned aerial vehicles (UAVs). The dataset includes images of soybean fields covering various stages of plant growth and containing manual labelling of 12 object classes, including cultivated and weed species. The work emphasises the importance of field data reflecting the realistic state of agricultural landscapes for training robust computer vision models.
YOLOv8x, a modern architecture for simultaneous object localisation and classification, was used as the base model. The model was trained on a labelled sample of 300 images without augmentation, which allowed for a benchmark accuracy assessment. The experimental results demonstrated the high efficiency of the model: the F1-score for the Glycine max (soybean) class was 0.933, and the metric mAP@0.5 was 0.72. Despite the limited amount of data and partial annotation, the model showed resistance to variability in field conditions and class imbalance
References
Popa, C. (2011). Adoption of artificial intelligence in agriculture. Bulletin of University of Agricultural Sciences and Veterinary Medicine Cluj-Napoca. Agriculture, 68(1), 1–7.
Lottes, P., Hörferlin, M., Sander, S., Stachniss, C. (2017). Effective vision‐based classification for separating sugar beets and weeds for precision farming. Journal of Field Robotics, 34(6), 1160–1178.
Sa, I., Chen, Z., Popović, M., Khanna, R., Liebisch, F., Nieto, J., & Siegwart, R. (2017). WeedNet: Dense semantic weed classification using multispectral images and MAV for smart farming. IEEE Robotics and Automation Letters, 3(1), 588–595.
Olsen, A., Konovalov, D. A., Philippa, B., Ridd, P., Wood, J. C., Johns, J., White, R. D. (2019). DeepWeeds: A multiclass weed species image dataset for deep learning. Scientific Reports, 9, 2058. https://doi.org/10.1038/s41598-018-38343-3
Leminen Madsen, S., Mathiassen, S. K., Dyrmann, M., Laursen, M. S., Paz, L. C., Jørgensen, R. N. (2020). Open plant phenotype database of common weeds in Denmark. Remote Sensing, 12(8), 1246.
Haug, S., & Ostermann, J. (2015). A crop/weed field image dataset for the evaluation of computer vision based precision agriculture tasks. In Computer Vision – ECCV 2014 Workshops (pp. 105–116). Springer International Publishing.
Lameski, P., Zdravevski, E., Trajkovik, V., & Kulakov, A. (2017). Weed detection dataset with RGB images taken under variable light conditions. In A. Goleva, D. Trajanov, & A. Stojanovski (Eds.), ICT Innovations 2017: Data-Driven Innovation (pp. 112–119). Springer.
Giselsson, T. M., Jørgensen, R. N., Jensen, P. K., Dyrmann, M., & Midtiby, H. S. (2017). A public image database for benchmark of plant seedling classification algorithms. arXiv preprint arXiv:1711.05458.
Teimouri, N., Dyrmann, M., Nielsen, P. R., Mathiassen, S. K., Somerville, G. J., & Jørgensen, R. N. (2018). Weed growth stage estimator using deep convolutional neural networks. Sensors, 18(5), 1580.
dos Santos Ferreira, A., Freitas, D. M., da Silva, G. G., Pistori, H., Folhes, M. T. (2017). Weed detection in soybean crops using ConvNets. Computers and Electronics in Agriculture, 143, 314–324.
Sa, I., Popović, M., Khanna, R., Chen, Z., Lottes, P., Liebisch, F., Siegwart, R. (2018). WeedMap: A large-scale semantic weed mapping framework using aerial multispectral imaging and deep neural network for precision farming. Remote Sensing, 10(9), 1423.
[ML Paper] Explanation of all of YOLO series Part 11 - https://zenn.dev/yuto_mo/articles/14a87a0db17dfa (date of access: 15.10.2025)
Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788.
Набор данных COCO - https://docs.ultralytics.com/ru/datasets/detect/coco/ (date of access: 06.10.2025)
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Надия Юничева, Равиль Мухамедиев, Валентин Смурыгин, Людмила Городецкая, Диас Құсайын, Адилхан Сымагулов

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.




