项目作者: voletiv

项目描述 :
My experiments in lip reading using deep learning with the LRW dataset
高级语言: Python
项目地址: git://github.com/voletiv/lipreading-in-the-wild-experiments.git


lipreading-in-the-wild-experiments

This repository contains my experiments with lip reading using deep learning in Keras. I train and test on the LRW dataset.

process-lrw/

Codes to convert the videos in LRW Dataset to:

  1. - frames pertaining to words
  2. - mouth area of those frames
  3. - audio

Instructions are provided in README file in directory.

shape-predictor/

Directory to place the “shape_predictor_68_face_landmarks.dat” file, required by process-lrw and head-pose

image-retrieval/

Codes and files —- considering the lipreader as an image retrieval system

head-pose/

Codes and files —- to compute head pose in all frames in LRW dataset (extracted using process-lrw)

Head pose is determined using voletiv/deepgaze (my fork of deepgaze).

Instructions are provided in README file in directory.