About¶

Introduction¶

In recent past, Deep Learning models have proven their potential in many application areas, however its entry into embedded world has its own twists and practical difficulties.

Problem Statement¶

To come up with a framework that enables a fast prototyping of Deep Learning models for Audio (to start with!) and provides an easy way to port the models on to Android using TFLite.

Proposed Solution¶

Come up with following modular components which can be then used as plug and play components:

Dataset modules with preprocessing modules
DataIterator modules
Tensorflow Models (Estimators)
Engine to run the models
Tensorflow model serving using TFLite
- Web app
- Mobile

Architecture¶

Dataset ¶

FreeSound from Kaggle
Speech Recognition
- Kaggle
- Google

Notebooks ¶

Python Environment¶

conda create -n shabda python=3.6
source activate shabda
pip install -r requirements.txt

Examples ¶