Achieving real-time lip-synch via SVM-based phoneme classification and lip shape refinement

T Kim, Y Kang, H Ko - Proceedings. Fourth IEEE International Conference on Multimodal …, 2002 - ieeexplore.ieee.org
In this paper, we develop a real-time lip-synch system that animates a 2D avatar's lip motion in synch with the incoming speech utterance. To achieve real-time operation, we bound the processing time with a merge-and-split procedure that performs coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply a support vector machine (SVM) to limit the computational load while attaining desirable accuracy. Coarse-to-fine phoneme classification is accomplished in 2 stages of feature extraction: each speech frame is first classified acoustically into one of 3 classes of lip opening, using mel-frequency cepstral coefficients (MFCC) as the feature, and is then further classified into a detailed lip shape using formant information. We implemented the system with 2D lip animation, which shows the effectiveness of the proposed 2-stage procedure for the real-time lip-synch task.
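
The two-stage control flow described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the class labels (`closed`, `half_open`, `open`, `wide`, `round`), the feature dimensions, and all weights are hypothetical placeholders, and each classifier is reduced to a hand-set linear decision function f(x) = w·x + b standing in for a trained SVM. It does not attempt to reproduce the paper's merge-and-split procedure.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple

Features = Sequence[float]
Classifier = Callable[[Features], str]


def linear_decision(weights: Features, bias: float) -> Callable[[Features], float]:
    """Return f(x) = w.x + b, the form of a linear SVM decision function."""
    def f(x: Features) -> float:
        return sum(w * xi for w, xi in zip(weights, x)) + bias
    return f


def one_vs_rest(models: Dict[str, Callable[[Features], float]]) -> Classifier:
    """Pick the label whose decision function scores highest."""
    def predict(x: Features) -> str:
        return max(models, key=lambda label: models[label](x))
    return predict


@dataclass
class TwoStageLipClassifier:
    """Coarse-to-fine dispatch: stage 1 picks a lip-opening class,
    stage 2 refines it with a model specific to that class."""
    coarse: Classifier             # stage 1: MFCC features -> lip-opening class
    fine: Dict[str, Classifier]    # stage 2: formants -> lip shape, per coarse class

    def classify(self, mfcc: Features, formants: Features) -> Tuple[str, str]:
        opening = self.coarse(mfcc)
        refine = self.fine.get(opening)
        # Coarse classes without a refinement model keep the coarse label.
        shape = refine(formants) if refine is not None else opening
        return opening, shape


# Hypothetical, hand-set weights (NOT trained): 2-D stand-in for an MFCC vector.
coarse = one_vs_rest({
    "closed": linear_decision([-1.0, 0.0], 0.5),
    "half_open": linear_decision([0.0, 0.0], 0.6),
    "open": linear_decision([1.0, 0.0], -0.5),
})

# Hypothetical refinement for the "open" class only, using (F1, F2) in Hz.
fine = {
    "open": one_vs_rest({
        "wide": linear_decision([0.0, 0.001], -1.5),
        "round": linear_decision([0.0, -0.001], 1.5),
    }),
}

clf = TwoStageLipClassifier(coarse=coarse, fine=fine)
opening, shape = clf.classify(mfcc=[2.0, 0.1], formants=[700.0, 1200.0])
```

In a setup like this, only the refinement model for the selected coarse class is evaluated per frame, which is one plausible way a coarse-to-fine scheme keeps per-frame cost low; whether the paper's system gains its speed in exactly this way is an assumption.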