Abstract: Car license plates in Oman feature a unique vertical arrangement of Arabic and English letters. This distinctive format, combined with the often low-resolution images, presents challenging ...
This repository contains the complete implementation of a multilingual ASR system for a 6-credit major project. The system uses wav2vec2 XLS-R for feature extraction and Transformer architecture for ...
Hugging Face has teamed up with NVIDIA, Mistral AI, and the University of Cambridge to launch the Open ASR Leaderboard, a public benchmark for automatic speech recognition (ASR). The researchers noted ...
Abstract: In this paper, we provide an overview of image recognition using deep learning and introduce its applications. Then, challenges in application and how to deal with them are discussed from a ...