Pavement Distress Detection From Orthophotos With Two-Stream Convolutional Neural Networks

Deprecation warning

This website is now archived. Please check out the new website for Centre for Intelligent Systems which includes both A-Lab Control Systems Research lab and Re:creation XR lab.

However, the Dynamic System Model Database can still be used and may be updated in the future.

Author: 

Roland Lõuk

Degree: 

M.Sc.

Supervisors: 

Date: 

Wednesday, January 22, 2020

Thesis language: 

English

Abstract: 

Automated pavement distress detection is an important but challenging task towards the goal of timely road maintenance. Given the vastness of road networks across the world, there is a lot of labor involved in manual defect detection for roads. In the recent years, however, convolutional neural networks have been shown to achieve groundbreaking results in the field of image classification. This thesis seeks to research and develop methods for applying convolutional neural networks to pavement distress detection for sections of orthophotos (orthoframes) with a large resolution. To address GPU memory limitations and increase detection localization, a sliding-window approach is used to partition the orthoframe into 224x224-pixel segments, which are subject to binary classification. However, the sliding-window approach does not allow for the model to account for the context surrounding the segment and results may suffer due to the small window size. This thesis proposes a ResNet architecture based convolutional neural network which accounts for two inputs streams, one of which is the 224x224-pixel content segment, which is subject to classification, and the other is the downscaled context view around the content segment. Experiments on two different datasets show an increased classification accuracy for the two-stream approach compared to the single stream approach.

Electronic version: