A Comparative Performance Analysis of Classification and Segmentation Models on Bangladeshi Pothole Dataset

Parsa, Antara Firoz; Abdullah, S. M.; Talukder, Anika Hasan; Kabbya, Md. Asif Shahidullah; Hasan, Shakib Al; Islam, Md. Farhadul; Noor, Jannatun


arXiv:2501.06602 (cs)

[Submitted on 11 Jan 2025]

Abstract: This study presents a comprehensive performance analysis of popular classification and segmentation models applied to a Bangladeshi pothole dataset developed by the authors. This custom dataset of 824 samples, collected from the streets of Dhaka and Bogura, performs competitively against the existing industrial and custom datasets used in the present literature. The dataset was further augmented four-fold for segmentation and ten-fold for classification evaluation. We tested nine classification models (CCT, CNN, INN, Swin Transformer, ConvMixer, VGG16, ResNet50, DenseNet201, and Xception) and four segmentation models (U-Net, ResU-Net, U-Net++, and Attention-Unet) on both datasets. Among the classification models, the lightweight models, namely CCT, CNN, INN, Swin Transformer, and ConvMixer, were emphasized because of their low computational requirements and faster prediction times. The lightweight models performed respectably, often matching the performance of the heavyweight models. In addition, augmentation was found to enhance the performance of all the tested models. The experimental results show that, on our dataset, the classification models perform on par with or better than similar models in the existing literature, reaching accuracy and F1-scores over 99%. For segmentation, the dataset also performed on par with existing datasets, with models achieving a Dice Similarity Coefficient of up to 67.54% and IoU scores of up to 59.39%.
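The abstract reports segmentation quality as Dice Similarity Coefficient and IoU. As a minimal sketch of how these two overlap metrics are commonly computed for a single pair of binary masks, the Python below uses plain nested lists. The function names, the mask representation, and the convention that two empty masks score 1.0 are assumptions made for illustration; the authors' own evaluation code and averaging scheme are not described in the abstract.

```python
from typing import Sequence, Tuple

# Hypothetical representation: a 2-D binary mask, 1 = pothole pixel, 0 = background.
Mask = Sequence[Sequence[int]]


def overlap_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted positives, target positives)."""
    if len(pred) != len(target) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, target)
    ):
        raise ValueError("masks must have the same shape")
    inter = pred_sum = target_sum = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p_on, t_on = bool(p), bool(t)
            inter += int(p_on and t_on)
            pred_sum += int(p_on)
            target_sum += int(t_on)
    return inter, pred_sum, target_sum


def dice_coefficient(pred: Mask, target: Mask) -> float:
    """Dice = 2 * |A ∩ B| / (|A| + |B|)."""
    inter, pred_sum, target_sum = overlap_counts(pred, target)
    denom = pred_sum + target_sum
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if denom == 0 else 2.0 * inter / denom


def iou_score(pred: Mask, target: Mask) -> float:
    """IoU = |A ∩ B| / |A ∪ B|."""
    inter, pred_sum, target_sum = overlap_counts(pred, target)
    union = pred_sum + target_sum - inter
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


# Toy example: 2 overlapping pixels, 3 predicted, 3 in the ground truth.
pred = [[1, 1, 0], [0, 1, 0]]
target = [[1, 0, 0], [0, 1, 1]]
print(round(dice_coefficient(pred, target), 4))  # 0.6667
print(iou_score(pred, target))                   # 0.5
```

For a single pair of masks the two metrics are tied by IoU = Dice / (2 - Dice), so Dice is never lower than IoU; scores averaged over a test set need not satisfy this identity exactly.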

Comments:	8 Tables, 7 Figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.06602 [cs.CV]
	(or arXiv:2501.06602v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.06602

Submission history

From: Antara Firoz Parsa
[v1] Sat, 11 Jan 2025 18:03:46 UTC (4,780 KB)
