Poulose Jacob,K; Sonia, Sunny; David, Peter S(Computer Science, 2013)
[+]
[-]
Abstract:
Speech processing and consequent recognition are important areas of Digital Signal Processing
since speech allows people to communicate more natu-rally and efficiently. In this work, a
speech recognition system is developed for re-cognizing digits in Malayalam. For recognizing
speech, features are to be ex-tracted from speech and hence feature extraction method plays an
important role in speech recognition. Here, front end processing for extracting the features is
per-formed using two wavelet based methods namely Discrete Wavelet Transforms (DWT) and
Wavelet Packet Decomposition (WPD). Naive Bayes classifier is used for classification purpose.
After classification using Naive Bayes classifier, DWT produced a recognition accuracy of
83.5% and WPD produced an accuracy of 80.7%. This paper is intended to devise a new
feature extraction method which produces improvements in the recognition accuracy. So, a new
method called Dis-crete Wavelet Packet Decomposition (DWPD) is introduced which utilizes
the hy-brid features of both DWT and WPD. The performance of this new approach is evaluated
and it produced an improved recognition accuracy of 86.2% along with Naive Bayes classifier.
Description:
Computer Science & Information Technology (CS & IT)
Poulose Jacob,K; Sonia, Sunny; David, Peter S(IEEE, August 9, 2012)
[+]
[-]
Abstract:
Speech signals are one of the most important means
of communication among the human beings. In this paper, a
comparative study of two feature extraction techniques are
carried out for recognizing speaker independent spoken
isolated words. First one is a hybrid approach with Linear
Predictive Coding (LPC) and Artificial Neural Networks
(ANN) and the second method uses a combination of Wavelet
Packet Decomposition (WPD) and Artificial Neural Networks.
Voice signals are sampled directly from the microphone and
then they are processed using these two techniques for
extracting the features. Words from Malayalam, one of the
four major Dravidian languages of southern India are chosen
for recognition. Training, testing and pattern recognition are
performed using Artificial Neural Networks. Back propagation
method is used to train the ANN. The proposed method is
implemented for 50 speakers uttering 20 isolated words each.
Both the methods produce good recognition accuracy. But
Wavelet Packet Decomposition is found to be more suitable for
recognizing speech because of its multi-resolution
characteristics and efficient time frequency localizations
Description:
Advances in Computing and Communications (ICACC), 2012 International Conference on