Detecting head orientation (Euler's angles), lips opening, face expression and bouding box size. Construct a decoder with these data.