University of Bahrain
Scientific Journals

DL-based Generation of facial portraits from diverse data sources

Show simple item record

dc.contributor.author BERRAHAL, Mohammed
dc.contributor.author AZIZI, Mostafa
dc.date.accessioned 2023-03-02T10:32:27Z
dc.date.available 2023-03-02T10:32:27Z
dc.date.issued 2023-03-02
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/4781
dc.description.abstract Artificial intelligence has a tremendous potential to reinforce and revolutionize law enforcement. By automating the tedious and time-consuming tasks of data collection and analysis, AI can help police departments become more effective. In this paper, we provide an overview of possible deep learning models that can be utilized by law enforcement to reconstruct the suspect’s face, through generating real and sketch facial portraits. To accomplish this, we collect four types of data from the crime scene, handwritten text and audio from an officer’s note, images from a smartphone, and video from a surveillance camera. Then, employ two pre-trained models: ABM-CNN for attribute multi-label classification and Google Speech API for speech-to-text conversion. In addition, we train three other models, the first of which, a handwritten model trained on the IAM Handwriting dataset, reads and digitizes handwritten notes with an accuracy of 76%, outperforming state-of-the-art results. Second, we train YoloV5 with the Wider Face dataset to detect one or multiple faces on images or videos with an average precision of 93%, a recall of 90%, and a precision of 88%. In the third model, we adapt the Zero-Shot Text-to-Image Generation technology to generate real faces and sketch. Our resulting model outperforms existing models from literature regards to high-quality, and the training reach a loss of 37.4%. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject Portrait Generation, Text-to-image Synthesis, Deep Learning, Handwritten OCR, YOLOv5, Face Detection. en_US
dc.title DL-based Generation of facial portraits from diverse data sources en_US
dc.type Article en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/130175 en
dc.contributor.authoraffiliation MATSI Research Lab, ESTO, Mohammed First University, Oujda, Morocco en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

This item appears in the following Issue(s)

Show simple item record

All Journals


Advanced Search

Browse

Administrator Account