The idea is to use recent advances in machine learning to help visually impaired persons to "see" using audio.
- First, use a camera, and machine learning algorithms to get a text description of a scene. Several learning algorithms can be combined for this (object recognition, attention model...)
- Then, use a text to speech synthetizer to read the generated text via audio. It can be listened in real time with headphones.