Automate extraction of braille text to speech from an image. (October 2022)
- Record Type:
- Journal Article
- Title:
- Automate extraction of braille text to speech from an image. (October 2022)
- Main Title:
- Automate extraction of braille text to speech from an image
- Authors:
- Revelli, Vishnu Preetham
Sharma, Gauri
S.Kiruthika devi, - Abstract:
- Highlights: Our project falls under the domain of Natural Language Processing and Machine Learning. With the advent of digitization, it became necessary to preserve data produced by humans. Voice and text are how we communicate. Visual impairment can be defined as partial or complete loss of vision. The primary goal of this project is to extract text from braille text images, so as to store it in a drive for later use. Although there are various ways to implement this, we'll use Deep Learning. Visually impaired people face numerous difficulties like interacting with the surroundings, navigation, etc. The final accuracy we obtained was 0.9615. These results show the model's robustness against factors like low light illumination. Abstract: Our project falls under the domain of Natural Language Processing and Machine Learning. With the advent of digitization, it became necessary to preserve data produced by humans. Voice and text are how we communicate. Visual impairment can be defined as partial or complete loss of vision. The primary goal of this project is to extract text from braille text images, so as to store it in a drive for later use. Although there are various ways to implement this, we'll use Deep Learning. Visually impaired people face numerous difficulties like interacting with the surroundings, navigation, etc. Different machine learning models like CNN, etc. have been used to solve this problem but usually have a high cost of detection in terms of computationalHighlights: Our project falls under the domain of Natural Language Processing and Machine Learning. With the advent of digitization, it became necessary to preserve data produced by humans. Voice and text are how we communicate. Visual impairment can be defined as partial or complete loss of vision. The primary goal of this project is to extract text from braille text images, so as to store it in a drive for later use. Although there are various ways to implement this, we'll use Deep Learning. Visually impaired people face numerous difficulties like interacting with the surroundings, navigation, etc. The final accuracy we obtained was 0.9615. These results show the model's robustness against factors like low light illumination. Abstract: Our project falls under the domain of Natural Language Processing and Machine Learning. With the advent of digitization, it became necessary to preserve data produced by humans. Voice and text are how we communicate. Visual impairment can be defined as partial or complete loss of vision. The primary goal of this project is to extract text from braille text images, so as to store it in a drive for later use. Although there are various ways to implement this, we'll use Deep Learning. Visually impaired people face numerous difficulties like interacting with the surroundings, navigation, etc. Different machine learning models like CNN, etc. have been used to solve this problem but usually have a high cost of detection in terms of computational time and hence they are not suitable for real-time applications [1] . To meet the project objective, we studied about Convolutional Neural Networks. The plan for picking this field is that it empowers machines to see the world as people do, see it likewise and even utilize the information for a huge number of undertakings like Image Recognition, Image Analysis and Classification, Media Recreation, Recommendation Systems, and so on [2] . This paper proposes a customized CNN model based on Braille Character dataset which has 26 different classes of all alphabets. This model uses a feature extractor that recognizes relevant features corresponding to each image, and then uses a classifier network to classify images into a set of classes. This model will take an image input, then pass those individual images to our classifier and then detect the braille language characters present in that image and give a translated English text and audio output. The final accuracy we obtained was 0.9615. These results show the model's robustness against factors like low light illumination. … (more)
- Is Part Of:
- Advances in engineering software. Volume 172(2022)
- Journal:
- Advances in engineering software
- Issue:
- Volume 172(2022)
- Issue Display:
- Volume 172, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 172
- Issue:
- 2022
- Issue Sort Value:
- 2022-0172-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-10
- Subjects:
- Deep learning -- Convolutional neural networks -- Machine learning
Computer-aided engineering -- Periodicals
Engineering -- Computer programs -- Periodicals
Engineering -- Software -- Periodicals
Periodicals
620.0028553 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09659978 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.advengsoft.2022.103180 ↗
- Languages:
- English
- ISSNs:
- 0965-9978
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0705.450000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23281.xml