This paper presents a Robust Content Based Video
Retrieval (CBVR) system. This system retrieves similar videos
based on a local feature descriptor called SURF (Speeded Up
Robust Feature). The higher dimensionality of SURF like
feature descriptors causes huge storage consumption during
indexing of video information. To achieve a dimensionality
reduction on the SURF feature descriptor, this system employs
a stochastic dimensionality reduction method and thus
provides a model data for the videos. On retrieval, the model
data of the test clip is classified to its similar videos using a
minimum distance classifier. The performance of this system is
evaluated using two different minimum distance classifiers
during the retrieval stage. The experimental analyses
performed on the system shows that the system has a retrieval
performance of 78%. This system also analyses the
performance efficiency of the low dimensional SURF
descriptor.
Description:
2013 Third International Conference on Advances in Computing and Communications
Poulose Jacob,K; Sonia, Sunny; David, Peter S(IEEE, August 9, 2012)
[+]
[-]
Abstract:
Speech signals are one of the most important means
of communication among the human beings. In this paper, a
comparative study of two feature extraction techniques are
carried out for recognizing speaker independent spoken
isolated words. First one is a hybrid approach with Linear
Predictive Coding (LPC) and Artificial Neural Networks
(ANN) and the second method uses a combination of Wavelet
Packet Decomposition (WPD) and Artificial Neural Networks.
Voice signals are sampled directly from the microphone and
then they are processed using these two techniques for
extracting the features. Words from Malayalam, one of the
four major Dravidian languages of southern India are chosen
for recognition. Training, testing and pattern recognition are
performed using Artificial Neural Networks. Back propagation
method is used to train the ANN. The proposed method is
implemented for 50 speakers uttering 20 isolated words each.
Both the methods produce good recognition accuracy. But
Wavelet Packet Decomposition is found to be more suitable for
recognizing speech because of its multi-resolution
characteristics and efficient time frequency localizations
Description:
Advances in Computing and Communications (ICACC), 2012 International Conference on