A NEURAL NETWORK APPROACH TO SEMANTIC SEGMENTATION OF VEHICLES IN VERY HIGH RESOLUTION IMAGES

Authors

  • V. Yu. Kashtan Dnipro University of Technology, Dnipro, Ukraine, Ukraine
  • V. V. Hnatushenko Dnipro University of Technology, Dnipro, Ukraine, Ukraine
  • I. M. Udovyk Dnipro University of Technology, Dnipro, Ukraine, Ukraine
  • O. V. Kazymyrenko Dnipro University of Technology, Dnipro, Ukraine, Ukraine
  • Y. D. Radionov Dnipro University of Technology, Dnipro, Ukraine, Ukraine

DOI:

https://doi.org/10.15588/1607-3274-2025-3-8

Keywords:

semantic segmentation, vehicles, deep neural networks, ResNet-101, DeepLab, multi-scale analysis, very high resolution images

Abstract

Context. The semantic segmentation of vehicles in very high resolution aerial images is essential in developing intelligent transportation systems. It allows for the automation of real-time traffic management and the detection of congestion and emergencies.
Objective. This work aims to develop and evaluate the effectiveness of a neural network approach to semantic segmentation in very high resolution aerial images, which provides high detail and correct reproduction of object boundaries.
Method. The DeepLab architecture with ResNet-101 as a backbone is used for gradient preservation and multiscale feature analysis. We trained on DOTA data and retrained on specialized sets with classes: vehicles, green areas, buildings, and roads. A loss function based on the Dice coefficient was applied to reduce the imbalance of classes. It effectively solves the class imbalance problem and improves the accuracy of segmenting objects of different sizes. Using ResNet-101 instead of Xception in the backbone network allows us to maintain the gradient as the network depth increases.
Results. Experimental studies have confirmed the effectiveness of the proposed approach, which achieves a segmentation accuracy of more than 90%, outperforming existing analogs. The use of multiscale feature analysis allows for preserving the texture features of objects, reducing false classifications. A comparative study with U-Net, SegNet, FCN8s, and other methods confirms the higher performance of the proposed approach in terms of mIoU (82.3%) and Pixel Accuracy (95.1%).
Conclusions. The experiments confirm the effectiveness of the proposed method of semantic segmentation of vehicles in ultrahigh spatial resolution images. Using DeepLab v3+ResNet-101 significantly improves the quality of vehicle segmentation in an urbanized environment. Excellent metric performance makes it promising for infrastructure monitoring and traffic planning tasks. Further research will focus on adapting the model to new datasets

Author Biographies

V. Yu. Kashtan, Dnipro University of Technology, Dnipro, Ukraine

PhD, Associate Professor, Associate Professor of Department of Information Technology and
Computer Engineering

V. V. Hnatushenko , Dnipro University of Technology, Dnipro, Ukraine

Dr. Sc., Professor, Head of Department of Information Technology and Computer Engineering

I. M. Udovyk, Dnipro University of Technology, Dnipro, Ukraine

PhD, Associate Professor, Dean of Information Technologies Department

O. V. Kazymyrenko , Dnipro University of Technology, Dnipro, Ukraine

Postgraduate student of Department of Information Technology and Computer Engineering

Y. D. Radionov , Dnipro University of Technology, Dnipro, Ukraine

Postgraduate student of Department of Information Technology and Computer Engineering

References

Yongtao Yu., Tiannan Gu., Haiyan G., Dilong Li, Shenghua J. Vehicle detection from high resolution remote sensing imagery using convolutional capsule networks, IEEE Geosci. Remote Sens. Lett., 2019, Vol. 16, No. 12, pp. 1894– 1898. DOI:10.1109/LGRS.2019.2912582.

Byun S., Shin I.-K., Moon J., Kang J., Choi S.-I. Road traffic monitoring from UAV images using deep learning networks, Remote Sens, 2021, No. 13, P. 4027. DOI: 10.3390/rs13204027.

Khrissi L., El Akkad N., Satori H., Satori K. Clustering method and sine cosine algorithm for image segmentation, Evol. Intell., 2022, No. 15, pp. 669–682. DOI:10.1007/s12065-020-00544-z.

Osco L., Junior J., Ramos A., de Castro Jorge L., Fatholahi S., de Andrade Silva J., Matsubara E., Pistori H., Gonçalves W., Li J. A review on deep learning in UAV remote sensing, International Journal of Applied Earth Observation and Geoinformation, 2021. DOI:102:102456.

Kemker R., Salvaggio C., Kanan C. Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning, ISPRS Journal of Photogrammetry and Remote Sensing, 2018, Vol. 145. DOI:10.1016/j.isprsjprs.2018.04.014.

Hong D., Danfeng H., Gao L., Yokoya N., Jing Y., Chanussot J., Du Q., Zhang B. More diverse means better: multimodal deep learning meets remote-sensing imagery classification, IEEE Transactions on Geoscience and Remote Sensing, 2020, Vol. 59 (5), pp. 4340–4354.

DOI: 10.1109/TGRS.2020.3016820.

Binge C., Chen X., Lu Y. Semantic segmentation of remote sensing images using transfer learning and deep convolutional neural network with dense connection, IEEE Access2020, No. 8, pp. 116744–116755. DOI:10.1109/Access.6287639.

Singh C., Mishra V., Jain K., Shukla A. FRCNN-based reinforcement learning for real-time vehicle detection, tracking and geolocation from UAS, Drones, 2022, No. 6, P. 406. DOI: 10.3390/drones6120406.

Saqib M., Khan S., Sharma N., Blumenstein M. A study on detecting drones using deep convolutional neural networks, In Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2017, pp. 1–5. DOI: 10.1109/AVSS.2017.8078541.

Kong X., Zhang Y., Tu S., Xu C., Yang W. Vehicle detection in high-resolution aerial images with parallel RPN and density-assigner, Remote Sens, 2023, No. 15, P. 1659. DOI: 10.3390/rs15061659.

Hordiiuk D., Oliinyk I., Hnatushenko V., Maksymov K. Semantic segmentation for ships detection from satellite imagery, 2019 IEEE 39th International Conference on Electronics and Nanotechnology (ELNANO), 2019. DOI:10.1109/elnano.2019.8783822.

Borovyk D., Fedoniuk R., Oliinyk A., Subbotin S., Kolpakova T. Detection of vehicles in aerial photographs using convolutional neural network, Smartindustry, 2024. https://ceur-ws.org/Vol-3699/paper12.pdf

Bochkovskiy A., Wang C., Liao H. YOLOv4: optimal speed and accuracy of object detection, arXiv 2020, arXiv:2004.10934, 2020. DOI: 10.48550/ARXIV.2004.10934.

Radionov Y., Kashtan V., Hnatushenko V., Kazymyrenko O. Aircraft detection with deep neural networks and contour-based methods, Radio Electronics, Computer Science, Control, 2024, №4(71), pp. 121–129. DOI:10.15588/1607- 3274-2024-4-12.

Zhang Y., Guo Z., Wu J., Tian Y., Tang H., Guo X. Realtime vehicle detection based on improved YOLO v5, Sustainability, 2022, No. 14(19), P. 12274. DOI:10.3390/su141912274.

DOTA [Electronic resource]. Access mode: https://captainwhu.github.io/DOTA/index.html

Kashtan V., Hnatushenko V., Shedlovska Y. Processing technology of multispectral remote sensing images, 2017 IEEE International Young Scientists Forum on Applied

Physics and Engineering (YSF), 2017. DOI:10.1109/ysf.2017.8126647.

He K., Zhang X., Ren S., Sun J. Deep residual learning for image recognition, in Proc. IEEE Conf. Comput. Vis. Pattern Recognit, 2016, pp. 770–778. DOI: 10.48550/arXiv.1512.03385.

Chengzhi Y., Hongjun G. A Method of Image Semantic Segmentation Based on PSPNet, Mathematical Problems in Engineering, 2022, pp. 1–9. DOI:10.1155/2022/8958154.

Downloads

Published

2025-09-22

How to Cite

Kashtan, V. Y. ., Hnatushenko , V. V. ., Udovyk, I. M. ., Kazymyrenko , O. V., & Radionov , Y. D. . (2025). A NEURAL NETWORK APPROACH TO SEMANTIC SEGMENTATION OF VEHICLES IN VERY HIGH RESOLUTION IMAGES. Radio Electronics, Computer Science, Control, (3), 77–85. https://doi.org/10.15588/1607-3274-2025-3-8

Issue

Section

Neuroinformatics and intelligent systems