A Robotic System for Fine-Grained Non-Destructive Grading of Visually Similar Fruits Based on Improved YOLOv11 and Multi-modal Perception
Research Article  ·  Published: 17 April 2026
Issue cover
ICCK Transactions on Intelligent Systematics
Volume 3, Issue 2, 2026: 94-107
Research Article Free to Read

A Robotic System for Fine-Grained Non-Destructive Grading of Visually Similar Fruits Based on Improved YOLOv11 and Multi-modal Perception

1 College of Engineering, China Agricultural University, Beijing 100083, China
2 National Sub-Center for R&D of Agro-Products Processing Technology and Equipment, Beijing 100083, China
* Corresponding Author: Bin Zhang, [email protected]
Volume 3, Issue 2

Article Information

Abstract

To address key challenges in post-harvest fruit grading—namely the difficulty of distinguishing visually similar varieties, the invisibility of internal quality, and mechanical damage during grasping—this study develops an intelligent robotic grading system that integrates advanced computer vision, Near-Infrared (NIR) spectroscopy, and flexible force-controlled grasping. First, an improved object detection algorithm, YOLOv11-TFE, is proposed to mitigate visual confusion between Qixia Fuji apples and Beijing Pinggu peaches and to handle the irregular geometry of Nanshui pears. By embedding the parameter-free SimAM attention mechanism into the backbone to explicitly enhance and decouple surface texture features (gloss versus tomentum) and incorporating the DySample upsampling operator to better preserve complex edge information, the discriminative capability of the detector is significantly improved. Second, a non-destructive internal quality detection module based on NIR spectroscopy (650–1100 nm) is constructed. Through the optimization of preprocessing strategies—specifically Improved Derivative Correction (IDC) and Standard Normal Variate (SNV)—robust PLSR models for soluble solids content (SSC) prediction are established for fruits with differing skin textures. Finally, a dynamic actuation unit for a biomimetic flexible gripper has been designed to achieve non-destructive sorting under continuous flow conditions. Experimental results show that the improved visual algorithm achieves an average [email protected] of 94.6%, with detection accuracies for apples and peaches reaching 96.0% and 97.8%, respectively, markedly reducing inter-class confusion. The root mean square error of prediction (RMSEP) for SSC across all three fruit varieties is kept within 0.65%. System-level validation further demonstrates an overall dynamic grasping success rate of 91.7% without causing visible damage. Overall, the proposed system achieves precise and comprehensive grading for multiple high-value fruit varieties.

Graphical Abstract

A Robotic System for Fine-Grained Non-Destructive Grading of Visually Similar Fruits Based on Improved YOLOv11 and Multi-modal Perception

Keywords

fruit grading robot YOLOv11 fine-grained classification NIR spectroscopy flexible grasping

Data Availability Statement

Data will be made available on request.

Funding

This work was supported without any funding.

Conflicts of Interest

The authors declare no conflicts of interest.

AI Use Statement

The authors declare that no generative AI was used in the preparation of this manuscript.

Ethical Approval and Consent to Participate

Not applicable.

References

  1. Mukhiddinov, M., Muminov, A., & Cho, J. (2022). Improved classification approach for fruits and vegetables freshness based on deep learning. Sensors, 22(21), 8192.
    [CrossRef] [Google Scholar]
  2. Arunima, P. L., Gopinath, P. P., Lekshmi, P. G., & Esakkimuthu, M. (2024). Digital assessment of post-harvest Nendran banana for faster grading: CNN-based ripeness classification model. Postharvest Biology and Technology, 214, 112972.
    [CrossRef] [Google Scholar]
  3. Ghonimy, M., Alayouni, R., Alshehry, G., Barakat, H., & Ibrahim, M. M. (2025). Integrated Physical–Mechanical Characterization of Fruits for Enhancing Post-Harvest Quality and Handling Efficiency. Foods, 14(14), 2521.
    [CrossRef] [Google Scholar]
  4. Kang, H., Zhou, H., Wang, X., & Chen, C. (2020). Real-time fruit recognition and grasping estimation for robotic apple harvesting. Sensors, 20(19), 5670.
    [CrossRef] [Google Scholar]
  5. Tang, Y., Huang, W., Tan, Z., Chen, W., Wei, S., Zhuang, J., ... & Ren, J. (2025). Citrus fruit detection based on an improved YOLOv5 under natural orchard conditions. International Journal of Agricultural and Biological Engineering, 18(3), 176-185.
    [CrossRef] [Google Scholar]
  6. Wang, X., Huang, Y., Wei, S., Xu, W., Zhu, X., Mu, J., & Chen, X. (2025). ELD-YOLO: A lightweight framework for detecting occluded mandarin fruits in plant research. Plants, 14(11), 1729.
    [CrossRef] [Google Scholar]
  7. Yu, H., Qian, C., Chen, Z., Chen, J., & Zhao, Y. (2025). Ripe-Detection: A lightweight method for strawberry ripeness detection. Agronomy, 15(7), 1645.
    [CrossRef] [Google Scholar]
  8. Zhang, Z., Cheng, H., Geng, W., & Guan, J. (2025). Research advances in hyperspectral imaging technology for fruit quality assessment. Smart Agriculture, 7(5), 52–66.
    [CrossRef] [Google Scholar]
  9. Jiang, X., Zhu, M., Yao, J., Zhang, Y., & Liu, Y. (2022). Calibration of near infrared spectroscopy of apples with different fruit sizes to improve soluble solids content model performance. Foods, 11(13), 1923.
    [CrossRef] [Google Scholar]
  10. Zong, H., Tian, S., Liu, Z., Guo, B., Wang, Y., Ren, J., ... & Zhang, E. (2025). Dual-branch feature-enhanced neural network for apple SSC estimation from hyperspectral imaging. Journal of Food Composition and Analysis, 108700.
    [CrossRef] [Google Scholar]
  11. Guo, Z., Zhai, L., Zou, Y., Sun, C., Jayan, H., El-Seedi, H. R., ... & Zou, X. (2024). Comparative study of Vis/NIR reflectance and transmittance method for on-line detection of strawberry SSC. Computers and Electronics in Agriculture, 218, 108744.
    [CrossRef] [Google Scholar]
  12. He, W., Huang, W., Li, Y., Latinović, N., Zhang, Y., & Zhang, X. (2025). Multimodal information fusion and precision harvesting system for fruit growth driven by flexible optoelectronic sensing and hierarchical attention networks. Computers and Electronics in Agriculture, 238, 110784.
    [CrossRef] [Google Scholar]
  13. Huang, W., Xia, J., Wang, Y., Jin, X., Zhu, H., & Zhang, X. (2024). Flexible multimode sensors based on hierarchical microstructures enable non-destructive grading of fruits in cold chain logistics. Materials Today Sustainability, 25, 100691.
    [CrossRef] [Google Scholar]
  14. Ren, S., He, K., Girshick, R., & Sun, J. (2016). Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE transactions on pattern analysis and machine intelligence, 39(6), 1137-1149.
    [CrossRef] [Google Scholar]
  15. Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.
    [Google Scholar]
  16. Teng, H., Sun, F., Wu, H., Lv, D., Lv, Q., Feng, F., ... & Li, X. (2025). DS-YOLO: A Lightweight Strawberry Fruit Detection Algorithm. Agronomy, 15(9), 2226.
    [CrossRef] [Google Scholar]
  17. Liao, Y., Li, L., Xiao, H., Xu, F., Shan, B., & Yin, H. (2025). YOLO-MECD: Citrus detection algorithm based on YOLOv11. Agronomy, 15(3), 687.
    [CrossRef] [Google Scholar]
  18. Khanam, R., & Hussain, M. (2024). Yolov11: An overview of the key architectural enhancements. arXiv preprint arXiv:2410.17725.
    [Google Scholar]
  19. Liu, Y., Wang, Q., Gao, X., & Xie, A. (2019). Total phenolic content prediction in Flos Lonicerae using hyperspectral imaging combined with wavelengths selection methods. Journal of Food Process Engineering, 42(6), e13224.
    [CrossRef] [Google Scholar]
  20. Cheng, J. H., & Sun, D. W. (2017). Partial least squares regression (PLSR) applied to NIR and HSI spectral data modeling to predict chemical properties of fish muscle. Food engineering reviews, 9(1), 36-49.
    [CrossRef] [Google Scholar]

Cite This Article

APA Style
Ma, Z., Zhang, B., & Yin, T. (2026). A Robotic System for Fine-Grained Non-Destructive Grading of Visually Similar Fruits Based on Improved YOLOv11 and Multi-modal Perception. ICCK Transactions on Intelligent Systematics, 3(2), 94-107. https://doi.org/10.62762/TIS.2025.566749
Export Citation
RIS Format
Compatible with EndNote, Zotero, Mendeley, and other reference managers
TY  - JOUR
AU  - Ma, Zhenhao
AU  - Zhang, Bin
AU  - Yin, Tianzhen
PY  - 2026
DA  - 2026/04/17
TI  - A Robotic System for Fine-Grained Non-Destructive Grading of Visually Similar Fruits Based on Improved YOLOv11 and Multi-modal Perception
JO  - ICCK Transactions on Intelligent Systematics
T2  - ICCK Transactions on Intelligent Systematics
JF  - ICCK Transactions on Intelligent Systematics
VL  - 3
IS  - 2
SP  - 94
EP  - 107
DO  - 10.62762/TIS.2025.566749
UR  - https://www.icck.org/article/abs/TIS.2025.566749
KW  - fruit grading robot
KW  - YOLOv11
KW  - fine-grained classification
KW  - NIR spectroscopy
KW  - flexible grasping
AB  - To address key challenges in post-harvest fruit grading—namely the difficulty of distinguishing visually similar varieties, the invisibility of internal quality, and mechanical damage during grasping—this study develops an intelligent robotic grading system that integrates advanced computer vision, Near-Infrared (NIR) spectroscopy, and flexible force-controlled grasping. First, an improved object detection algorithm, YOLOv11-TFE, is proposed to mitigate visual confusion between Qixia Fuji apples and Beijing Pinggu peaches and to handle the irregular geometry of Nanshui pears. By embedding the parameter-free SimAM attention mechanism into the backbone to explicitly enhance and decouple surface texture features (gloss versus tomentum) and incorporating the DySample upsampling operator to better preserve complex edge information, the discriminative capability of the detector is significantly improved. Second, a non-destructive internal quality detection module based on NIR spectroscopy (650–1100 nm) is constructed. Through the optimization of preprocessing strategies—specifically Improved Derivative Correction (IDC) and Standard Normal Variate (SNV)—robust PLSR models for soluble solids content (SSC) prediction are established for fruits with differing skin textures. Finally, a dynamic actuation unit for a biomimetic flexible gripper has been designed to achieve non-destructive sorting under continuous flow conditions. Experimental results show that the improved visual algorithm achieves an average [email protected] of 94.6%, with detection accuracies for apples and peaches reaching 96.0% and 97.8%, respectively, markedly reducing inter-class confusion. The root mean square error of prediction (RMSEP) for SSC across all three fruit varieties is kept within 0.65%. System-level validation further demonstrates an overall dynamic grasping success rate of 91.7% without causing visible damage. Overall, the proposed system achieves precise and comprehensive grading for multiple high-value fruit varieties.
SN  - 3068-5079
PB  - Institute of Central Computation and Knowledge
LA  - English
ER  - 
BibTeX Format
Compatible with LaTeX, BibTeX, and other reference managers
@article{Ma2026A,
  author = {Zhenhao Ma and Bin Zhang and Tianzhen Yin},
  title = {A Robotic System for Fine-Grained Non-Destructive Grading of Visually Similar Fruits Based on Improved YOLOv11 and Multi-modal Perception},
  journal = {ICCK Transactions on Intelligent Systematics},
  year = {2026},
  volume = {3},
  number = {2},
  pages = {94-107},
  doi = {10.62762/TIS.2025.566749},
  url = {https://www.icck.org/article/abs/TIS.2025.566749},
  abstract = {To address key challenges in post-harvest fruit grading—namely the difficulty of distinguishing visually similar varieties, the invisibility of internal quality, and mechanical damage during grasping—this study develops an intelligent robotic grading system that integrates advanced computer vision, Near-Infrared (NIR) spectroscopy, and flexible force-controlled grasping. First, an improved object detection algorithm, YOLOv11-TFE, is proposed to mitigate visual confusion between Qixia Fuji apples and Beijing Pinggu peaches and to handle the irregular geometry of Nanshui pears. By embedding the parameter-free SimAM attention mechanism into the backbone to explicitly enhance and decouple surface texture features (gloss versus tomentum) and incorporating the DySample upsampling operator to better preserve complex edge information, the discriminative capability of the detector is significantly improved. Second, a non-destructive internal quality detection module based on NIR spectroscopy (650–1100 nm) is constructed. Through the optimization of preprocessing strategies—specifically Improved Derivative Correction (IDC) and Standard Normal Variate (SNV)—robust PLSR models for soluble solids content (SSC) prediction are established for fruits with differing skin textures. Finally, a dynamic actuation unit for a biomimetic flexible gripper has been designed to achieve non-destructive sorting under continuous flow conditions. Experimental results show that the improved visual algorithm achieves an average [email protected] of 94.6\%, with detection accuracies for apples and peaches reaching 96.0\% and 97.8\%, respectively, markedly reducing inter-class confusion. The root mean square error of prediction (RMSEP) for SSC across all three fruit varieties is kept within 0.65\%. System-level validation further demonstrates an overall dynamic grasping success rate of 91.7\% without causing visible damage. Overall, the proposed system achieves precise and comprehensive grading for multiple high-value fruit varieties.},
  keywords = {fruit grading robot, YOLOv11, fine-grained classification, NIR spectroscopy, flexible grasping},
  issn = {3068-5079},
  publisher = {Institute of Central Computation and Knowledge}
}

Article Metrics

Citations
Crossref
0
Scopus
0
Views
299
PDF Downloads
89

Publisher's Note

ICCK stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and Permissions

Institute of Central Computation and Knowledge (ICCK) or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
ICCK Transactions on Intelligent Systematics
ICCK Transactions on Intelligent Systematics
ISSN: 3068-5079 (Online) | ISSN: 3069-003X (Print)
Portico
Preserved at
Portico