In road traffic, safety is a “must-have”. This is why motorized vehicles are regularly put to the test in almost all European countries. Do the engine and brakes function properly? Do the wheels have sufficient profile? Is the chassis OK? All this is carefully checked. In addition, test centres and workshops also check the authenticity of vehicle documents and registration plates.
AI prototype for process automation in the automotive environment
The vehicle identification number (VIN) plays an important role in this context: it is as unique as a fingerprint and provides important information on vehicle history and equipment. At present, VINs are still generally recorded manually in workshops, test facilities, and road traffic offices. This bears the risk of errors and also costs time. This process can be fully automated with the help of artificial intelligence and machine learning (ML). We have tested how this can work on behalf of a European platform provider. The result is a prototype for the automatic recording of chassis and master numbers – and all within a development period of only three weeks.
Our AI solution paves the way for road traffic authorities, garages, and test centres to achieve significant time savings and quality improvements – and is also very easy to use. All the user has to do is take a picture of the vehicle registration document on a tablet, and the artificial intelligence does the rest: it extracts the 17-digit VIN from the photo and then transfers it to the desired IT system in the back end. The advantage of this solution: transposed numbers and transmission errors are a thing of the past, the captured VIN is available at the touch of a button.
Sounds pretty simple, actually. But the road to getting there is an enormous challenge – at least for artificial intelligence: It must learn to find the right information from the multitude of data in the vehicle paper and then transform the image into a column of numbers. And that is easier said than done. Because normal neural networks (NN) can detect given objects within an image. However, if the position of the object changes, many NN fail to locate it. In addition, different image qualities resulting from different shooting angles, lighting conditions and distances make it even more difficult to identify the desired information. For while the human brain automatically suppresses this “noise”, intelligent software must learn from scratch to correctly classify such entropies.
To make learning easier for our AI prototype, we have developed the model on the basis of two Convolutional Neural Networks (CNN). The Deep Learning Architecture is specialized in image processing and uses different filters (Convolutional Layer) and aggregation layers (Pooling Layer). These enable CNN, in contrast to conventional NN, to recognize displayed images based on a given matrix and to transform them into the desired data format.
Software model masters complex tasks with confidence
So in our case, the task was: Find the VIN in the photographed vehicle paper and transform it into a 17-digit number. We had to train artificial intelligence accordingly. Ideally, this would require an extensive database. The larger and more varied the training pool, the higher the hit rate in the end. However, for data protection reasons, we only had 40 (mainly expired) registration papers – far too few to show the required amount of deviations and entropies in training. So we artificially enlarged our database so that it ultimately contained around 3,000 different virtual vehicle papers.
Now nothing stood in the way of iterative training. In concrete terms, this means that we continuously fed the artificial intelligence with new learning data, evaluated results and, on this basis, modified and improved the CNN matrix again and again. A complex process that took a lot of time: In the current ERNI prototype, the iterative training of artificial intelligence accounted for around two-thirds of the total development time. An effort that was worthwhile, however: In the end, the software delivered the correct VIN in nine out of ten cases – and that after only two weeks of training.
Wicher Visser
Principal Consultant
ERNI Switzerland
Marek Linka
Senior Software Developer
ERNI Singapore