ANNOTATED IMAGE SET OF SOYBEAN FIELDS FOR TRAINING PLANT DETECTION MODELS

Authors

DOI:

https://doi.org/10.54668/2789-6323-2026-121-1-100-109

Keywords:

точное земледелие, соя, детектирование, данные с БПЛА, нейронные сети, YOLOv8.

Abstract

This article presents annotated Soybean_2021_Almalybak dataset designed for automatic analysis of agricultural areas based on aerial photographs obtained using unmanned aerial vehicles (UAVs). The dataset includes images of soybean fields covering various stages of plant growth and containing manual labelling of 12 object classes, including cultivated and weed species. The work emphasises the importance of field data reflecting the realistic state of agricultural landscapes for training robust computer vision models.

YOLOv8x, a modern architecture for simultaneous object localisation and classification, was used as the base model. The model was trained on a labelled sample of 300 images without augmentation, which allowed for a benchmark accuracy assessment. The experimental results demonstrated the high efficiency of the model: the F1-score for the Glycine max (soybean) class was 0.933, and the metric mAP@0.5 was 0.72. Despite the limited amount of data and partial annotation, the model showed resistance to variability in field conditions and class imbalance

References

Popa, C. (2011). Adoption of artificial intelligence in agriculture. Bulletin of University of Agricultural Sciences and Veterinary Medicine Cluj-Napoca. Agriculture, 68(1), 1–7.

Lottes, P., Hörferlin, M., Sander, S., Stachniss, C. (2017). Effective vision‐based classification for separating sugar beets and weeds for precision farming. Journal of Field Robotics, 34(6), 1160–1178.

Sa, I., Chen, Z., Popović, M., Khanna, R., Liebisch, F., Nieto, J., & Siegwart, R. (2017). WeedNet: Dense semantic weed classification using multispectral images and MAV for smart farming. IEEE Robotics and Automation Letters, 3(1), 588–595.

Olsen, A., Konovalov, D. A., Philippa, B., Ridd, P., Wood, J. C., Johns, J., White, R. D. (2019). DeepWeeds: A multiclass weed species image dataset for deep learning. Scientific Reports, 9, 2058. https://doi.org/10.1038/s41598-018-38343-3

Leminen Madsen, S., Mathiassen, S. K., Dyrmann, M., Laursen, M. S., Paz, L. C., Jørgensen, R. N. (2020). Open plant phenotype database of common weeds in Denmark. Remote Sensing, 12(8), 1246.

Haug, S., & Ostermann, J. (2015). A crop/weed field image dataset for the evaluation of computer vision based precision agriculture tasks. In Computer Vision – ECCV 2014 Workshops (pp. 105–116). Springer International Publishing.

Lameski, P., Zdravevski, E., Trajkovik, V., & Kulakov, A. (2017). Weed detection dataset with RGB images taken under variable light conditions. In A. Goleva, D. Trajanov, & A. Stojanovski (Eds.), ICT Innovations 2017: Data-Driven Innovation (pp. 112–119). Springer.

Giselsson, T. M., Jørgensen, R. N., Jensen, P. K., Dyrmann, M., & Midtiby, H. S. (2017). A public image database for benchmark of plant seedling classification algorithms. arXiv preprint arXiv:1711.05458.

Teimouri, N., Dyrmann, M., Nielsen, P. R., Mathiassen, S. K., Somerville, G. J., & Jørgensen, R. N. (2018). Weed growth stage estimator using deep convolutional neural networks. Sensors, 18(5), 1580.

dos Santos Ferreira, A., Freitas, D. M., da Silva, G. G., Pistori, H., Folhes, M. T. (2017). Weed detection in soybean crops using ConvNets. Computers and Electronics in Agriculture, 143, 314–324.

Sa, I., Popović, M., Khanna, R., Chen, Z., Lottes, P., Liebisch, F., Siegwart, R. (2018). WeedMap: A large-scale semantic weed mapping framework using aerial multispectral imaging and deep neural network for precision farming. Remote Sensing, 10(9), 1423.

[ML Paper] Explanation of all of YOLO series Part 11 - https://zenn.dev/yuto_mo/articles/14a87a0db17dfa (date of access: 15.10.2025)

Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788.

Набор данных COCO - https://docs.ultralytics.com/ru/datasets/detect/coco/ (date of access: 06.10.2025)

Published

2026-04-01

How to Cite

Yunicheva Н., Mukhamediev Р., Smurygin В., Gorodetskaya Л. ., Qusain Д., & Symagulov А. (2026). ANNOTATED IMAGE SET OF SOYBEAN FIELDS FOR TRAINING PLANT DETECTION MODELS. Hydrometeorology and Ecology, (1), 100–109. https://doi.org/10.54668/2789-6323-2026-121-1-100-109

Issue

Section

ECOLOGY

Most read articles by the same author(s)