Speech2Braille - Jacky Zhao

Please make sure to check out the 'commit' branch for the full project history.

Over 360 million people in the world have disabling hearing loss. Hearing loss can have debilitating effects on a person that makes day to day communication and life very difficult. They face discrimination, as many people and employers find it too much effort to communicate with the deaf. More importantly, the deaf are not able to receive public announcements, warnings, and alarms, which can serve to be a health and safety hazard. For many, current solutions that allow them alleviate these problems have problems themselves that prevent them from being accessible to everyone.

This project entailed creating an end-to-end speech recognition system using an ANN and a portable device to display braille. The device, made from a Raspberry Pi B, is able to recognize audio and transcribe it into Braille through the haptic feedback device via the ANN. The feedback device is a self-made hat, consisting of 6 solenoids, allows the Raspberry Pi to control the 6 solenoids via the GPIO outputs. The neural network itself is 2 layered LSTM network with 256 hidden cells in each layer. The model was trained on the LibriSpeech ‘clean-100’ dataset for 44 hours and 10 minutes, going through 37 epoches, and attained a final accuracy of 74.77% on the training set and 71.50% on the test set. Accuracy of the network was determined using a metric called the Levenshtein edit distance. The model accuracy could not be improved due to time and hardware constraints.

Network Details:

Character Wise Decoding
Training Data - LibriSpeech "clean-100"
Preprocessing for features - 13 MFCCs + 1st Derivatives, normalize to stddev of 1
Optimizer: RMSPropOptimizer
Learning Rate: 1e-4
Number of Hidden Cells: 256
Number of Layers: 2
Regulurization: White Noise (stddev: 0.01)
CTC Loss
Greedy Decoder

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
_dir		_dir
best_chkpt		best_chkpt
ext		ext
totalsummary		totalsummary
wavs		wavs
.gitignore		.gitignore
README.md		README.md
ReportPDF.pdf		ReportPDF.pdf
_config.yml		_config.yml
braille_util.py		braille_util.py
braille_util.pyc		braille_util.pyc
denoise.sh		denoise.sh
gpiostates.py		gpiostates.py
gpiostates.py.org		gpiostates.py.org
init.sh		init.sh
init.sh.org		init.sh.org
load_meta.py		load_meta.py
main		main
noisemask		noisemask
test.wav		test.wav
test_wav.py		test_wav.py
tf_model.py		tf_model.py
wakeup-screen		wakeup-screen

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech2Braille - Jacky Zhao

Please make sure to check out the 'commit' branch for the full project history.

Network Details:

About

Releases

Packages

Languages

jackyzha0/Speech2Braille

Folders and files

Latest commit

History

Repository files navigation

Speech2Braille - Jacky Zhao

Please make sure to check out the 'commit' branch for the full project history.

Network Details:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages