In this document, we present an object detection system for italian documents. Unlike similar systems which use a single deep-learning solution, this system employs different solutions for a fast and accurated detection. The first is an image segmentation module which process an acquired-scanner image and find every important artificats. The second is a custom CNN for detect every artificat's rotation and then use the information for set the document to the upright (this is important for read the document-text content). The third is a simple CNN for detect each upright element. We present the algorithm used in the first part and the training methods for both types of networks. We also perform analysis on the networks, and present empirical results on a large test set. Finally, we present preliminary results for detecting documents.

An object detection system for automatic document reorientation and identification

Stefani, Massimo
2020/2021

Abstract

In this document, we present an object detection system for italian documents. Unlike similar systems which use a single deep-learning solution, this system employs different solutions for a fast and accurated detection. The first is an image segmentation module which process an acquired-scanner image and find every important artificats. The second is a custom CNN for detect every artificat's rotation and then use the information for set the document to the upright (this is important for read the document-text content). The third is a simple CNN for detect each upright element. We present the algorithm used in the first part and the training methods for both types of networks. We also perform analysis on the networks, and present empirical results on a large test set. Finally, we present preliminary results for detecting documents.
2020-03-13
File in questo prodotto:
File Dimensione Formato  
821736-1194598.pdf

accesso aperto

Tipologia: Altro materiale allegato
Dimensione 7.27 MB
Formato Adobe PDF
7.27 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14247/6489