In our daily lives, there occur many forms of communication, for instance: speech, pictorial language, textual language, and body
language, etc. However, amongst those forms speech is always regarded as the most powerful form because of its rich dimensions
character. Except for the speech text (words), the rich dimensions also refer to the gender, attitude, emotion, health situation, and
identity of a speaker. Such information is very important for effective communication. From the signal processing view, speech can
be distinguished in terms of the signal transmitting message information. The waveform could be the representation of speech, also
this kind of signal has been most useful in practical utilization. Extracting from the speech signal, we could get three main kinds of
information: Speech Text, Language, and Speaker Identity