According to MPEG-4's TTS architecture, facial animation can be driven by two streams simultaneously—text and Facial Animation Parameters. A Text-To-Speech converter drives the mouth shapes of the face. An encoder sends Facial Animation Parameters to the face. The text input can include codes, or bookmarks,...http://www.google.com/patents/US7844463?utm_source=gb-gplus-sharePatent US7844463 - Method and system for aligning natural and synthetic video to speech synthesis