Raspberry Pi Based Smart Reader For Visually Impaired People

This presentation describes an assistive device built around a Raspberry Pi that helps visually impaired people read text documents using optical character recognition (OCR) and text-to-speech conversion.

This system aims to address the difficulties that visually impaired people face in accessing text resources independently by developing an affordable and user-friendly image-to-speech conversion system.

The main components of the system are a webcam, Raspberry Pi, text recognition software, text-to-speech conversion module, and speaker. The webcam captures an image of text which is converted to text using OCR software and then to speech using text-to-speech for the user to listen to.

INTRODUCTION
• Visually impaired people struggle to access text using existing technology because of
problems with alignment, focus, accuracy, mobility and efficiency.
• This model is a camera-based assistive device that people can use to read text
documents.
• The Internet plays a vital role in today’s world of communication.
• But some people do not know how to make use of the Internet.
• Some are blind and some are illiterate.
• So it is very difficult for them to live in this world of the Internet.
• Various technologies are available today, such as screen readers, automatic speech
recognition (ASR), text-to-speech (TTS) and speech-to-text (STT).
• This system addresses an important domain of digital image analysis.
• Face detection is a computer technology that determines the locations
and sizes of human faces in arbitrary digital images.
• In object-class detection, the task is to find the positions and sizes of all objects
in an image that belong to a given class.
• For colour images, the literature has shown that it is possible to separate
human skin regions from a complex background.
• The face candidates can be generated from the identified skin regions.
• Other techniques include wavelet packet analysis, template matching for faces, eyes
and mouths, and feature extraction using watersheds and projections.
• While being a sub-domain of the object detection field, face detection
represents a generalization of face localization.
• In face localization, the task is to find the locations and sizes of a known number of
faces (usually one) in a given input image, while in face detection no such prior
information about the faces is available.
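The skin-region step mentioned above can be illustrated with a minimal sketch. The slides do not say which colour rule is used, so the RGB thresholds below are an assumption: they are a commonly cited heuristic for skin under daylight illumination, and the function names are hypothetical.

```python
def is_skin_rgb(r, g, b):
    """Return True if an 8-bit RGB pixel satisfies a simple skin-colour heuristic.

    The thresholds are an assumed, commonly cited daylight rule, not the deck's own.
    """
    return (
        r > 95 and g > 40 and b > 20
        and max(r, g, b) - min(r, g, b) > 15
        and abs(r - g) > 15
        and r > g and r > b
    )


def skin_mask(image):
    """Map a 2-D list of (r, g, b) tuples to a 2-D list of booleans."""
    return [[is_skin_rgb(*pixel) for pixel in row] for row in image]


def bounding_box(mask):
    """Return (top, left, bottom, right) enclosing all True cells, or None if there are none."""
    rows = [y for y, row in enumerate(mask) if any(row)]
    if not rows:
        return None
    cols = [x for row in mask for x, value in enumerate(row) if value]
    return (min(rows), min(cols), max(rows), max(cols))
```

The bounding box of the skin mask is one simple way to obtain a face candidate; a real detector would still have to verify each candidate, for example by template matching.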
LITERATURE SURVEY
1. A Smart Reader for Visually Impaired People Using Raspberry PI
D. Velmurugan, M. S. Sonam, S. Umamaheswari, S. Parthasarathy, K. R. Arun

The paper addresses the integration of a complete text read-out system designed for the visually
challenged. The system consists of a webcam interfaced with a Raspberry Pi, which accepts a page of printed
text. The OCR (Optical Character Recognition) package installed on the Raspberry Pi scans it into a digital
document, which then undergoes skew correction and segmentation before feature extraction and
classification. Once classified, the text is read out by a text-to-speech conversion unit (TTS engine) installed
on the Raspberry Pi. The output is fed to an audio amplifier before it is read out. The proposed
project can be simulated in MATLAB. The system finds applications in libraries, auditoriums and offices
where instructions and notices are to be read, and also in assisted filling of application forms. Results along
with analysis are presented.
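The segmentation stage can be sketched as follows. The paper's actual algorithm is not given in the slides, so this assumes one common approach, splitting a binarized page into text lines with a horizontal projection profile. The function names and the `min_ink` parameter are hypothetical.

```python
def horizontal_projection(binary):
    """Count ink pixels (value 1) in each row of a binarized page."""
    return [sum(row) for row in binary]


def segment_lines(binary, min_ink=1):
    """Return inclusive (start_row, end_row) pairs, one per run of rows containing ink."""
    lines = []
    start = None
    for y, ink in enumerate(horizontal_projection(binary)):
        if ink >= min_ink and start is None:
            start = y                      # first inked row of a new text line
        elif ink < min_ink and start is not None:
            lines.append((start, y - 1))   # blank row closes the current line
            start = None
    if start is not None:
        lines.append((start, len(binary) - 1))  # page ended inside a text line
    return lines
```

The same idea applied column-wise within each line gives a rough character or word segmentation.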

2. Smart Book Reader for Visual Impairment Person using IoT Device

This paper focuses on the development of a Smart Book Reader that will help people who are blind or have low
vision to read books without using Braille. The project uses IoT technology, comprising an IoT
device, IoT infrastructure and services. The IoT device is a Raspberry Pi, which is energy efficient
because it runs from a 5 V supply. It is also highly portable, being only the size of a credit card, and
can be carried anywhere. The book reader captures pictures of book pages using a camera and
processes the images using Optical Character Recognition software. The motivation for developing
this product is to encourage all blind people to read ordinary books. This will help them gain
knowledge from reading without needing to learn Braille.
3. A Smart Reader for Visually Impaired People (Standard Image Vs Real Time Image: A
Comparative Study)
Ram Nivas Duraisamy, Sathya Manoharan

Optical character recognition (OCR) is the process of identifying printed characters using a photoelectric
device and computer software. It converts images of typed, handwritten or printed text into
machine-encoded text, whether from a scanned document or from text superimposed on an image. The
recognized text is then converted into audio output. OCR is mainly used in research on character recognition,
artificial intelligence and computer vision, and it is also used for pattern recognition and Document Image Analysis.
The work focuses on an OCR-based automatic book reader for the visually impaired using a Raspberry Pi.
The aim is to provide low-cost assistance to the visually impaired and to demonstrate an easily
designed version. The results for text detection rate, accuracy, text-to-speech conversion rate and error rate
are tabulated. The proposed model's results are compared with a MATLAB image processing method.
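The slides do not define how accuracy and error rate were measured. As an illustration only, the sketch below assumes one widely used definition, the character error rate: the edit distance between the OCR output and the reference text, divided by the reference length. The function names are hypothetical.

```python
def edit_distance(reference, hypothesis):
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(reference, hypothesis):
    """Edit distance normalised by the length of the reference text."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

Under this definition, character accuracy can be reported as one minus the error rate, although the rate can exceed 1 when the OCR output is much longer than the reference.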

4. A Smart Navguide System for Visually Impaired
Kiran Rakshana R, Chitra C

This system consists of three modules: a voice searching module, an image processing module and a voice
processing module. The modules are driven by a keyword search, which the user gives in the
form of voice. After the keyword is received by the Raspberry Pi, the Pi camera
captures an image according to the given keyword. The system integrates Optical Character
Recognition and a text-to-speech synthesizer. It extracts text from the image and translates
the text into speech, which helps the user to read the text more easily. In computer vision, extracting text
from colour images is a difficult task. The image processing stage includes binarization, de-noising,
de-skewing, segmentation and feature extraction. The system is also used to identify the bus name and bus
number at a bus stop or bus stand for the visually impaired, and it achieves obstacle avoidance using an
ultrasonic sensor, informing the user by vibration.
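The binarization stage named above can be sketched with Otsu's method, which picks the grey level that best separates dark ink from a light background. The slides do not say which thresholding method the paper uses, so Otsu's method is an assumption here and the function names are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the grey level t (0-255) that maximises between-class variance.

    Pixels with value <= t form the dark class. `pixels` is a flat list of 8-bit ints.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the dark class so far
    sum0 = 0    # sum of grey levels in the dark class so far
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image, threshold):
    """Mark dark pixels (<= threshold) as ink (1) and everything else as background (0)."""
    return [[1 if p <= threshold else 0 for p in row] for row in image]
```

A typical use is `binarize(image, otsu_threshold([p for row in image for p in row]))`, with the result passed on to de-skewing and segmentation.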
OBJECTIVES

• To bring relief from the agonizing tasks that visually impaired people have to go through because of this difficulty

• To provide a technical solution that assists visually impaired people in accessing various text resources and enhancing their knowledge

• To study image recognition technology together with speech synthesis

• To develop a cost-effective, user-friendly image-to-speech conversion system


PROBLEM STATEMENT
• Visually impaired people face difficulty in navigating, recognizing obstacles, and identifying or
differentiating objects.
• The traditional method of identification is limited to certain objects.
• For instance, it is not applicable to objects like shirts or a bunch of keys.
Consequently, there is a need for a computerized way to solve this problem.
• This project converts an image to speech. The image, either scanned or captured in real
time, is converted to text using Optical Character Recognition (OCR).
• The text is then converted into speech using text-to-speech conversion. OCR is one of the
simplest methods for the electronic or mechanical conversion of images of printed text or
handwritten documents into machine-encoded text.
• OCR returns the extracted text, along with information about the location of the detected
text in the original image, to the device app for further processing (such as text-to-speech)
or display.
• Text-to-speech is the function that converts text into a digital audio signal.
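The idea of OCR returning both text and its location can be sketched as follows. The slides do not name an OCR engine, so this assumes Tesseract accessed through the `pytesseract` package, whose `image_to_data` call returns parallel lists of words, confidences and box coordinates. The helper names and the `min_conf` parameter are hypothetical.

```python
def words_with_boxes(data, min_conf=0.0):
    """Turn a pytesseract image_to_data dict into a list of word records.

    Rows with empty text or a confidence below min_conf are dropped
    (Tesseract reports -1 for rows that are not words).
    """
    words = []
    for i, text in enumerate(data["text"]):
        conf = float(data["conf"][i])
        if text.strip() and conf >= min_conf:
            words.append({
                "text": text,
                # (left, top, width, height) in pixels of the original image
                "box": (data["left"][i], data["top"][i],
                        data["width"][i], data["height"][i]),
                "conf": conf,
            })
    return words


def ocr_image(path):
    """Run Tesseract on an image file (assumes pytesseract, Pillow and tesseract are installed)."""
    from PIL import Image
    import pytesseract
    data = pytesseract.image_to_data(Image.open(path),
                                     output_type=pytesseract.Output.DICT)
    return words_with_boxes(data)
```

Joining the `text` fields in order gives the string to pass to text-to-speech, while the boxes can be used to highlight words on a display.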
METHODOLOGY
• Power is supplied to the 5 V micro USB connector of the Raspberry Pi from a
switched-mode power supply.

• The web camera is connected to a USB port of the Raspberry Pi. The audio output is taken
from the audio jack of the Raspberry Pi.

• The Internet is connected through the Ethernet port of the Raspberry Pi.

• The page to be read is placed on a base and the camera is focused to capture the image.
The captured image is processed.

• The captured image is converted to text by the software.

• The text is converted into speech by the text-to-speech conversion module.

• The final output is played through the speaker.

• The speaker can also be replaced by headphones for convenience.
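The steps above can be sketched as a small capture, OCR and speech pipeline. The slides do not name the software used, so the libraries here are assumptions: OpenCV for the webcam, Tesseract via `pytesseract` for OCR, and `pyttsx3` for offline speech. All function names and the default file name are hypothetical.

```python
def capture_image(path="page.jpg", camera_index=0):
    """Grab one frame from the USB webcam and save it (assumes OpenCV is installed)."""
    import cv2
    camera = cv2.VideoCapture(camera_index)
    try:
        ok, frame = camera.read()
    finally:
        camera.release()
    if not ok:
        raise RuntimeError("could not read a frame from the camera")
    cv2.imwrite(path, frame)
    return path


def image_to_text(path):
    """Convert the captured image to text (assumes pytesseract and Pillow)."""
    from PIL import Image
    import pytesseract
    return pytesseract.image_to_string(Image.open(path))


def speak(text):
    """Play the text through the audio jack (assumes pyttsx3)."""
    import pyttsx3
    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()


def read_page(capture=capture_image, ocr=image_to_text, tts=speak):
    """Run capture -> OCR -> speech once and return the spoken text."""
    path = capture()
    text = ocr(path).strip()
    if not text:
        text = "No text was detected."
    tts(text)
    return text
```

Passing the three stages in as arguments keeps the pipeline testable without hardware, and lets the speech stage be swapped for an online service if the Ethernet connection is used for that purpose.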


HARDWARE AND SOFTWARE REQUIREMENTS

1. Raspberry Pi

2. Power supply unit

3. Display

4. LED

5. Python