How to generate speech signal from spectral envelope, aperiodicity, fundamental frequency, V/UV signal

4 views (last 30 days)
I have implemented the network as shown in fig which takes 2 inputs namely, video input and mfcc(audio) input. Video input consists of lip images and audio input is mfcc of corresponding video frame. The video and mfcc frames are passed through several layers and then added to generate speeech parameters. I have found fundamental frequency, spectral envelope, V/UV speech, fundamental frequency. I have taken ifft of spectral envelope to generate sound but it generates random signal.
please guide how to generate speech signal from speech parameters.

Answers (0)

Categories

Find more on Simulation, Tuning, and Visualization in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!