
3.2.3 Spectrum 
The spectrum of frequencies present in the voiced 
sample is used in two steps of processing. First, the 
spectrum is used for the cepstral analysis, which 
serves for to find the fundamental frequency precise 
value. The second use of the spectrum provides 
input data for the filtering using harmonic 
frequencies filters. 
The spectrum is calculated by Fourier transform 
using its fast form (3). 
1, ... ,0     
1
0
2
NkexX
N
n
N
n
ki
nk
 
(3)
3.2.4  Cepstrum, Fundamental Frequency 
The next step provides a cepstrum. The cepstra 
analysis, as described above, provides cepstral 
coefficients. 
The real cepstrum is used to find the value of the 
fundamental frequency. The value is expected in the 
range from 60 to 400 Hz for the human voice 
(Campbell, 1997). The peak is to be found in this 
range (Figure 1) and converted from the cepstral 
coefficient number to the frequency domain. The 
fundamental frequency is the base for the calculating 
of the harmonic frequencies to be used for the 
filtering. 
3.2.5  Harmonic Spectrum Vector 
When the harmonic frequency filters are set using 
the fundamental frequency, the spectrum is filtered 
(Figure 6). Because the power at the specific 
frequency depends on the volume of the input signal, 
the absolute values can not be used. The power 
values are related to the power at the fundamental 
frequency. 
0 100 200 300 400 500 600 700
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
Frequency c ontent
frequency (Hz)
  f
  2f
  5f
 
Figure 6: The frequency content after filtering. 
The power relations between given harmonic and 
the fundamental frequency constitute the values of 
harmonic frequency vector, we expect to be specific 
for the given speaker. The vectors are calculated 
from more voiced segments to be ready to process 
by statistic methods. 
The Figure 5 shows the powers of harmonic 
frequencies obtained from the spectrum using 
harmonic filters set by the fundamental frequency 
value (2). 
4 CONCLUSIONS 
The proposed technique is in the testing phase. All 
the computations are processed in the MATLAB 
environment. 
The partial results are before the deeper process 
of comparision with another methods. If the testing 
shows and confirm the measurable dependency of 
the voice harmonic spectrum on the given speaker, it 
will be usable to improve the reliability of the 
speaker identification process based on the 
charasteristic features of the speaker’s voice. 
ACKNOWLEDGEMENTS 
This work was supported by the project No. 
CZ.1.07/2.2.00/28.0327 Innovation and support of 
doctoral study program (INDOP), financed from EU 
and Czech Republic funds. 
REFERENCES 
Abdulla, W. H., 2002. Auditory based feature vectors for 
speech recognition systems. In: Advances in 
Communications and Software Technologies.WSEAS 
Press, Stevens Point, Wisconsin, USA. 
Atassi, H., 2008. Metody detekce základního tónu řeči. In: 
Elektrorevue, Vol.4. 
Campbell, Jr, J. P., 1997. Speaker recognition: a tutorial. 
In: IEEE 85. 
Horák, O., 2012. The Voice Segment Type Determination 
using the Autocorrelation Compared to Cepstral 
Method. In: WSEAS Transactions on Signal 
Processing, vol. 8, issue 1. 
Horák, O., 2012. Phoneme Recognizer Based Verification 
of the Voice Segment Type Determination. In: 
Proceedings of the 3rd International conference on 
Applied Informatics and Computing Theory (AICT 
'12). WSEAS Press, Stevens Point, Wisconsin, USA. 
Jung, J. S., Kim, J. K., and Bae, M. J., 2004. Speaker 
Recognition System Using the Prosodic Information. 
ICSOFT2013-8thInternationalJointConferenceonSoftwareTechnologies
276