EEG Signal Analysis via a Cleaning Procedure based on Multivariate

Empirical Mode Decomposition

Esteve Gallego-Jutglà

, Tomasz M. Rutkowski

2,3

, Andrzej Cichocki

and Jordi Solé-Casals

Digital Technologies Group, University of Vic, Sagrada Família 7,08500 Vic, Spain

LABSP, RIKEN Brain Science Institute, 2-1 Hirosawa, Saitama, 351-0106 Wako-Shi, Japan

Multimedia Lab,Computer Science Department & Tara Life Science Center, University of Tsukuba, Tsukuba, Japan

Keywords: EEG, Alzheimer Disease, Multivariate Empirical Mode Decomposition, Artifacts, Linear Discriminant

Analysis, Neural Networks.

Abstract: Artifacts are present in most of the electroencephalography (EEG) recordings, making it difficult to interpret

or analyze the data. In this paper a cleaning procedure based on a multivariate extension of empirical mode

decomposition is used to improve the quality of the data. This is achieved by applying the cleaning method

to raw EEG data. Then, a synchrony measure is applied on the raw and the clean data in order to compare

the improvement of the classification rate. Two classifiers are used, linear discriminant analysis and neural

networks. For both cases, the classification rate is improved about 20%.

1 INTRODUCTION

Electroencephalogram (EEG) signals recorded from

the scalp, commonly present different interference

signals due to muscle artifacts, such as eye blinks or

eye movement. Electric potentials due to these

artifacts can be orders of magnitude larger than the

EEG and can propagate across the scalp, masking

and distorting brain signals (Croft and Barry, 2000).

This paper focuses on improving the quality of

the data, removing artifacts from EEG data using a

new signal processing technique, Multivariate

Empirical Mode Decomposition (mEMD). This

technique is an extension of the Empirical Mode

Decomposition (EMD), and provides a

decomposition of the original EEG data into several

oscillatory modes computed along multichannel data

(Rehman and Mandic, 2010). Then the efficiency of

the proposed method of cleaning artifacts is

evaluated on real EEG data from an Alzheimer

Disease (AD) data base. The evaluation of this

cleaning procedure is calculated in terms of

classification rate. Obtained results with clean data

are much better that those obtained with raw data,

hence the detection of AD is simplified.

Recently it was shown that EMD is a good

method to separate eye movements from

neurophysiological signals as pointed out in

(Rutkowski et al., 2009a, Rutkowski et al., 2009b,

Molla et al., 2012), where results were obtained

comparing the extracted modes with the modes of

the EOG.

A previous study using mEMD (Gallego-Jutglà

et al., 2011) presented promising results using this

decomposition on simulated EEG data, where the

cleaned data presented always a correlation higher

than 0.8 with the simulated data without artifacts.

Another study had used mEMD for Seizure

detection in EEG signals (Rehman et al., 2010c). In

this study, Hilbert Huang transform and mEMD are

combined to extract spectral features form

multichannel EEG signals. The spectral feature used

is the mean frequency of the signals derived from

the Hilbert-Huang spectrum, and the method have

shown to be helpful for epileptic seizure detection.

At the end of this article, it is also suggested that

some artifacts can be removed by subtracting the

unwanted signals from the decomposition.

This paper is organized as follows. First,

methods used, including EMD and mEMD

description, the cleaning method, the synchrony

measure used and the classifiers used are presented

in Section 2. Section 3 describes the experimental

results obtained. Finally, discussion and conclusions

are depicted in Section 4 and Section 5 respectively.

670

Gallego-Jutglà E., Rutkowski T., Cichocki A. and Solé-Casals J..

EEG Signal Analysis via a Cleaning Procedure based on Multivariate Empirical Mode Decomposition.

DOI: 10.5220/0004182206700676

In Proceedings of the 4th International Joint Conference on Computational Intelligence (SSCN-2012), pages 670-676

ISBN: 978-989-8565-33-4

 2012 SCITEPRESS (Science and Technology Publications, Lda.)

2 METHODS

To eliminate EEG artifacts, the use of mEMD is

proposed. mEMD is a new technique to decompose

EEG data based on EMD. mEMD decomposition is

applied in an Alzheimer disease data base and then

data is cleaned using the cleaning procedure

presented in (Gallego-Jutglà et al., 2011). It is

important to note that now we deal with any kind of

artefacts and not only eyeblinks, therefore we

generalise the method to be more useful. In order to

evaluate the improvement of the cleaning procedure,

we don’t have a reference cleaned signals to

compare with, phase synchrony is computed and

then a classifier is set up in order to discriminate

between Alzheimer disease subjects and control

subjects. Two types of classifiers, Linear

Discriminant Analysis (LDA) and Neural Network

(NN), are explored in order to see the effect of the

cleaning method.

EEG dataset is composed of 15 healthy Ctrl

subjects and 15 patients with mild AD. The EEG

time series were recorded using 21 electrodes at a

sampling frequency of 128 Hz.

2.1 Empirical Mode Decomposition

EMD algorithm is a method designed for multiscale

decomposition and time –frequency analysis, which

can analyze nonlinear and non-stationary data

(Huang et al., 1998).

With this method, any time-series data set can be

decomposed into a finite and often small number of

oscilatory modes. These oscillatory modes are called

Intrinsic Mode Functions (IMFs). IMFs are defined

so as to exhibit locality in time and to represent a

single oscillatory mode. Each IMF satisfies two

basic conditions: (i) the number of zero-crossings

and the number of extrema must be the same or

differ at most by one in the whole dataset, and (ii) at

any point, the mean value of the envelope defined by

the local maxima and the envelope defined by the

local minima is zero (Huang et al., 1998).

The EMD algorithm (Huang et al., 1998) for the

signal x(t) can be summarized as follows.

(i) Determine the local maxima and minima of

x(t);

(ii) Generate an upper and a lower signal

envelope by connecting the local maxima and

minima computed previously respectively by an

interpolation method;

(iii) Compute the local mean 









, by

averaging the upper and lower signal envelopes;

(iv) Subtract the local mean from the data:

ℎ









=







−









;

(v) If ℎ









obeys the stopping criterion, then

define 







=ℎ









as an IMF, otherwise set









=ℎ









and repeat the process from step i.

Then, the empirical mode decomposition of a

signal x(t) can be written as:





=IMF







+ε











(1)

Where n is the number of extracted IMFs, and the

final residue ε







is the mean trend or a constant.

2.2 Multivariate Empirical Mode

Decomposition (mEMD) Applied to

EEG Signals

Multivariate Empirical Mode Decomposition, is an

extension for multivariate signals of EMD.

Even though EMD has achieved optimal results

in data processing (Diez et al. 2009, Molla et al.,

2010), several shortcomings are presented when this

technique is used in multichannel data sets such as

EEG. The IMFs from different time series do not

necessarily correspond to the same frequency, and

different time series may end up having a different

number of IMFs. These shortcomings complicate the

use of this technique to work with multichannels

data sets, because it is difficult to match the different

obtained IMFs from different channels (Mutlu and

Aviyente, 2011).

To solve the presented shortcomings of working

with multichannels data sets, several extensions of

EMD have been proposed. This extensions are

Bivariate Empirical Mode Decomposition (Molla et

al. 2010), to decompose two time series at the same

time, and Trivariate Empirical Mode Decomposition

(Rehman and Mandic, 2010a), to decompose three

time series at the same time. Recently, for

multichannel data sets, such as EEG, an extension of

EMD to mEMD was proposed (Rehman and

Mandic, 2010b).

In mEMD the local mean is computed by

tanking an average of upper and lower envelopes

obtained from all the sensors. The upper and lower

envelopes, in turn are obtained by interpolating

between the local maxima and minima. However, in

general, for multivariate signals, the local maxima

and minima may not be defined directly. To deal

with these problems multiple n-dimensional

envelopes are generated by taking signal projections

along different direction in n-dimensional spaces

EEGSignalAnalysisviaaCleaningProcedurebasedonMultivariateEmpiricalModeDecomposition

671

(Rehman and Mandic, 2010b). mEMD is the

technique used in this paper to compute all the

decompositions.

The algorithm (Rehman and Mandic, 2010b) can be

summarized as follows:

(i) Choose a suitable pointset for sampling on an



−1



sphere (this



−1



 sphere resides in an 

dimensional Euclidean coordinate system).

(ii) Calculate the projection, p















, of the

input signal v









along the direction vector, x





for all k giving p















(iii) Find the time instants t







corresponding to

the maxima of the set of projected

signalsp















(iv) Interpolate t







,vt







 to obtain

multivariate envelope curvese















(v) For a set of K direction vectors, the mean of

the envelope curves is calculated as 







⁄∑













(vi) Extract the detail 







using 







=







−









. If the detail 







fulfills the stopping criterion

for a multivariate IMF, apply the above procedure

to







−







, otherwise apply it to 







Then, the mEMD of a signal x







can be written as

detailed in equation 1

The used stopping criterion is defined in (Rilling

et al., 2003).

2.3 Cleaning Pprocedure

The used cleaning procedure was previously

presented in (Gallego-Jutglà et al., 2011). In this

article the proposed procedure was applied to

simulated EEG data with eyeblink artefacts. Now,

the extension to any kind of artefacts and the

performance on real EEG data is evaluated.

The cleaning procedure is based on mEMD and

seeks the common modes which are present in all

the electrodes. Here the key idea is that if a mode is

present in all the electrodes, it is probably due to

artifacts and not to EEG signals, so this mode is

suppressed in the reconstruction process.

The cleaning procedure can be summarized as

follows:

(i) Apply mEMD to raw EEG data of N

electrodes, in order to obtain M oscillatory modes of

the multivariate data.

(ii) Construct a matrix containing the same mode

for all the channels. Therefore a total of M matrices

are obtained.

(iii) Calculate the Correlation Matrix (CM) of

each one of these previous matrices, obtaining

 ∈ ℝ

··

(iv) Compute the Communality Index  ∈ ℝ



containing the mean correlation of each mode for all

the sensors. The CI is computed using the following

expression:

 =

















(2)

(v) Normalize CI between 0 and 1.

(vi) Threshold CI in order to find which of these

modes are common within all the channels. Modes

with high correlation

|

>0.8



are eliminated

(vii) Reconstruct clean signals without taking

into account the eliminated modes

The proposed cleaning procedure was applied

independently to all the subjects contained in the

data base.

2.4 Measure

In order to evaluate the efficiency of the proposed

cleaning method, each one of the subjects was

characterized with a measure.

Different studies have shown that Alzheimer

disease cause a change in EEG synchrony, so to

characterize the presents subjects in the data base,

the phase synchrony measure was used.

Phase synchrony measure the phase dependence

between two time series x and y, computing the

dependence between their instantaneous phases 



and 



. Even though the amplitudes of x and y are

independent, their instantaneous phases may be

synchronized. The instantaneous phase 



of a time

serie x may be extracted as:











=arg











+









(3)

where 

is the Hilbert transform of x. The phase

synchrony index



for two instantaneous phases 



and 



is defined as:

=















∈



0,1



(4)

where

and

are integers (usually

mn 1

The phase synchrony value that characterized

each subject was computed as presented in (Dauwels

et al., 2009). For each subject, the synchrony

between all the possible pairs of electrodes was

computed. Then, 5 regions of the head were defined

(frontal, right temporal, left temporal, central and

occipital areas). To evaluate local synchrony, the

average of the synchrony values obtained between

the electrodes of each region was computed. Then,

IJCCI2012-InternationalJointConferenceonComputationalIntelligence

672

to compute the global synchrony, an average of the

computed local synchrony was done.

Phase synchrony was computed in different

frequency bands, according to the classical used

division on  (2 to 4 Hz.),  (4 to 8 Hz.), 

(8 to 10

Hz.), 

(10 to 12 Hz.) and  (12 to 25 Hz.) bands.

Signals were band-pass filtered between the selected

frequencies ranges. 3rth order Butterworth filters

were used, as they can be implemented easily and

offer good transition band characteristics at low

coefficient orders.

2.5 Classification

Two different types of classifiers were used to

classify the synchrony measures obtained with the

raw and the clean EEG data. Synchrony measures

obtained in the 5 frequency bands were used as input

features of the classifier.

2.5.1 Linear Discriminant Analysis (LDA)

Linear Discriminant Analysis (LDA) is a well-

known scheme for feature extraction and dimension

reduction. It has been used widely in many

applications involving high-dimensional data, such

as face recognition and image retrieval. Classical

LDA projects the data onto a lower-dimensional

vector space such that the ratio of the between-class

distances to the within-class distance is maximized,

thus achieving maximum discrimination. The

optimal projection (transformation) can be readily

computed by applying the eigendecomposition on

the scatter matrices. See (Duda et al., 2000,

Fukunaga, 1990) for details on the algorithm.

LDA was used to classify the computed

synchrony measures obtained from the EEG data of

Alz and Ctr subjects. As the number of subjects in

the data base is small, Leave-One-Out (LOO)

procedure was used. In this LOO crossvalidation

scheme of N observations, N-1 are used for training

and the last is used for evaluation. This process is

repeated N times, leaving one different observation

for evaluation each time. The mean success

classification value in percentage (%) is obtained as

a final result.

2.5.2 Neural Network

In recent years several classification systems have

been implemented using different techniques, such

as Neural Networks.

The widely used Neural Networks techniques are

very well known in pattern recognition applications.

An artificial neural network (ANN) is a

mathematical model that tries to simulate the

structure and/or functional aspects of biological

neural networks. It consists of an interconnected

group of artificial neurons and processes information

using a connectionist approach to computation. In

most cases an ANN is an adaptive system that

changes its structure based on external or internal

information that flows through the network during

the learning phase.

Neural networks are non-linear statistical data

modelling tools. They can be used to model complex

relationships between inputs and outputs or to find

patterns in data.

One of the simplest ANN is the so called

perceptron that consist of a simple layer that

establishes its correspondence with a rule of

discrimination between classes based on the linear

discriminator. However, it is possible to define

discriminations for non-linearly separable classes

using multilayer perceptrons (MLP).

The Multilayer Perceptron (Multilayer

Perceptron, MLP), also known as Backpropagation

Net (BPN), is one of the best known and used

artificial neural network model as pattern classifiers

and functions approximators (Lippman, 1987),

(Freeman and Skapura, 1991). It belongs to the so-

called feedforward networks class, and its topology

is composed by different fully interconnected layers

of neurons, where the information always flows

from the input layer, whose only role is to send input

data to the rest of the network, toward the output

layer, crossing all the existing layers (called hidden

layers) between the input and output. Essentially the

inner layers are responsible for carrying out

information processing, extracting features of the

input data.

Although there are many variants, usually each

neuron in one layer has directed connections to the

neurons of the subsequent layer but there is no

connection or interaction between neurons on the

same layer (Bishop, 1995, Hush and Horne, 1993).

In this work we have used a multilayer

perceptron with one hidden layer of 30 (empirically

obtained value) different neurons (nodes). Each

neuron is associated with weights and biases. These

weights and biases are set to each connections of the

network and are obtained from training in order to

make their values suitable for the classification task

between the different classes.

The number of input neurons is equal to the

number of frequency bands considered, and the

number of output neurons is just one as we needs to

EEGSignalAnalysisviaaCleaningProcedurebasedonMultivariateEmpiricalModeDecomposition

673

Figure 1: Comparison of the cleaning procedure. The top

image presents a 5-sec portion of raw EEG time series for

an Alzheimer subject. The bottom image presents the

same 5-sec of data after applying the cleaning procedure.

discriminate between only two classes (binary

problem).

For the neural network classifier, again the LOO

crossvalidation was used. To compute the

classification rate the LOO was computed 3 times,

the final classification was the mean of these 3

different values.

3 RESULTS

The proposed cleaning method was applied to all the

subjects contained in the data base. Then the phase

synchrony was computed for raw and clean data and

the classification of each type of data was computed.

The improvement of the quality of the data after

applying the cleaning procedure can be seen in

Figure 1, where some of the visible artifacts are not

present in the image of the clean data (bottom

image). The eliminated IMFs during the cleaning

process for this subject are presented in Figure 2,

where 11 IMFs were obtained. The used threshold

|

>0.8



is presented with a dotted line.

As can be seen in Figure 2, the presented CI has

several values higher than the threshold. The IMF

that hold the lower frequencies of the decomposition

(IMF7, IMF8, IMF9, IMF10, IMF11) and the

residue ε







, are the ones that are eliminated for

this subject during the reconstruction process. IMF 5

was also eliminated by the cleaning process. For all

the subjects the eliminated modes were those that

hold the low frequency oscillation.

Figure 2: Obtained Communality Index for an Alzheimer

diseases patient. 11 IMF where obtained during the

decomposition. Eliminated modes during the cleaning

procedure are the ones that present |r|>0.8. The threshold

|r|>0.8 is presented with a dotted line.

Obtained Classification Rates (CR) of synchrony

measures after classifying each type of data with the

two classifiers, LDA and NN, are presented in

Figure 3. With LDA, 56.67% of CR was obtained

with raw data and 76.67% was obtained with clean

data. On the other hand, the results obtained with

NN presented a CR of 58.89% for raw data and 80%

with clean data.

The presented results improve the classification

rate for both classifiers. For LDA an improvement of

20% was obtained and for NN an improvement of

21.11%.

4 DISCUSSION

The cleaning method presented an improvement of

the quality of the data. The classification results

obtained for both types of classifiers presented better

results for the clean data, than the classification rate

obtained with the raw data.

The eliminated modes presented in Figure 2 and

the modes eliminated from all the subjects,

correspond to low frequency oscillation. These

results are consistent with previous knowledge of

artifacts, in which the artifact interference is found

to be in the low frequencies.

These results point out that the criterion used to

select the modes to be discarded, based on the

Communality Index (CI), is reliable and can be used

for any kind of artifacts.

Also, results emphasizes that the use of mEMD

to correct artifacts may be a good procedure for EEG

IJCCI2012-InternationalJointConferenceonComputationalIntelligence

674

Figure 3: Classification Rates obtained after classifying

the synchrony values with LDA and NN. In both

classifiers, grey bars correspond to results obtained with

raw EEG data, black bars correspond to results obtained

with clean EEG data.

signal preprocessing, a necessary step to be taken

before any kind of EEG signal analysis.

5 CONCLUSIONS

In this paper a procedure for removing artifacts from

EEG data is tested in real data. This method is based

on an EEG decomposing technique, which allows

flexible signal decomposition of the original time

series in different oscillatory modes. The so-

obtained components from each EEG channel have

been analyzed and those that were present in all the

electrodes have been removed from the

reconstructed signal. Then phase synchrony has been

computed for all the subjects, and the obtained

values have been classified using two different

classifiers, linear discriminant analysis and neural

network.

Future work will include the comparison of this

method with ICA-based cleaning procedures (Solé-

Casals et al., 2010), or Wavelet-based cleaning

procedures (Krishnaveni et al., 2006, Vialatte et al.,

2008).

Of course, it is important to point out that the

data set at hand is fairly small. A larger sample size

and a more diverse data set will be used in order to

generalize the results of this study.

ACKNOWLEDGEMENTS

This work has been partially supported by the

Secretaria d’Universitats i Recerca of the

Departament d’Economia i Coneixement of the

Generalitat de Catalunya under the grant 2010BE1-

00772 to Dr. Jordi Solé-Casals; and under a

predoctoral grant from the University of Vic to Mr.

Esteve Gallego-Jutglà, ("Amb el suport de l'ajut

predoctoral de la Universitat de Vic").

REFERENCES

Croft, R. J., Barry, R. J., (2000). Removal of ocular

artefact form the EEG: a review. Neurophysiol

Clin,30, 5-19.

Bishop, C.M., (1995). Neural Networks for Pattern

Recognition. Oxford University Press.

Dauwels, J., Vialatte, F., Musha, T., Cichocki, A. (2010)

A comparative study of synchrony measures for the

early diagnosis of alzheimer’s disease based on EEG.

NeuroImage, 49, 668-693,

Diez, P. F., Mut, V., Laciar, E., Torres, A., Avilla, E.

(2009). Application of the Empirical Mode

Decomposition to the Extraction of Features form

EEG signals for Mental Task Classification. 31

Annual International Conference of the IEEE EMBS.

2579-2582.

Duda, R. O., Hart, P. E., Stork, D. (2000). Pattern

Classification. Wiley.

Freeman, J.A., Skapura, D.M. (1991). Neural Networks:

Algorithms, Applications and Programming

Techniques. Addison-Wesley Publishing Company,

Inc. Reading, MA

Fukunaga, K. (1990). Introduction to Statistical Pattern

Classification. Academic Press, San Diego, California,

USA.

Gallego-Jutglà, E., Solé-Casals, J., Rutkowski, T.,

Cichocki, A. (2011). Application of multivariate

empirical mode decomposition for cleaning eye blinks

artifacts from EEG signals. NCTA conference

proceedings, INSTICC 2011.

Huang, N. E., Shen, Z., Long,S. R., Wu, M. C., Shih, H.

H:, Zheng, Q., Yen, N.C., Tung, C. C., Liu, H. H.

(1998). The empirical mode decomposition and the

Hilbert spectrum for nonlinear and non-stationary time

series analysis. Proc. R. Soc. Lond., 495, 2317-2345.

Hush, D. R., Horne, B. G. (1993). Progress in supervised

neural networks. IEEE Signal Processing Magazine,10

(1), pp. 8-39.

Krishnaveni, V., Jayaraman, S., Aravind, S.,

Hariharasudhan, V., Ramadoss, K. (2006). Automatic

Identification and Removal of Ocular Artifacts from

EEG using Wavelet Transform. Measurement Science

Review, Vol. 6, Sec. 2, No. 4.

Lippmann, D. E. (1987). An Introduction to Computing

with Neural Networks. IEEE ASSP Magazine, 3(4),

pp. 4-22

Molla, K. I., Tanaka, T., Rutkowski, T. M., Cichocki, A.,

(2010). Separation of EOG artifacts from EEG singals

using bivariate EMD. Acoustics Speech and Signal

EEGSignalAnalysisviaaCleaningProcedurebasedonMultivariateEmpiricalModeDecomposition

675

Processing (ICASSP), 2010 IEEE Interational

Conference On. 562-565.

Molla, K. I., Islam, R.,Tanaka, T.,Rutkowski, T. M.

(2012). Artifact suppression from EEG signals using

data adaptive time domain filtering. Neurocomputing.

(Accepted manuscrit)

Mutlu, A. Y., Aviyente, S. (2011). Mutivariate Empirical

Mode Decomposition for Quantifying Multivariate

Phase Synchronization. EURASIP Jounal on Advances

in Signal Processing. Article ID 615717.

Rehman, N., Mandic, D. P. (2010a). Empirical Mode

Decomposition for Trivariate Signals. IEEE

Transactions on signal processing 58.3.

Rehman, N., Mandic, D. P., (2010b). Multivariate

empirical mode decomposition. Proc. R. Soc. A. 466,

1291-1302.

Rehman, N., Xia, Y., Mandic, D. P. (2010c). Application

of Multivariate Empirical Mode Decomposition for

Seizure detection in EEG signals. 32nd Annual

International Conference of the IEEE EMBS.

Rilling, G., Flandrin, P., Goncalves, P. (2003). On

Empirical Mode Decomposition and its Algorithms.

Proc of the IEEE-EURASIP, Workshop on Nonlinear

Signal and Image Processing, NSIP-03.

Rutkowski, T. M., Cichocki, A., Tanaka, T., Mandic, D.

P., Cao, J., Ralescu, A. L., (2009a). Multichannel

spectral pattern separation – An EEG processing

application. 2009 IEEE International Conference on

Acoustics, Speech and Signal Processing.

Rutkowski, T. M., Cichocki, A., Tanaka, T., Ralescu,

A.L., Mandic, D.P., (2009b). ICONIP’08 Proceedings

of the 15

international conference of Advances in

neuro-information processing. Vol Part I.

Solé-Casals, J., Vialatte, F. B., Pantel, J., Prvulovic, D.,

Haenschel, C., Cichocki, A.: ICA Cleaning Procedure

for EEG Signals Analysis - Application to Alzheimer's

Disease Detection. BIOSIGNALS 2010: 484-489

Vialatte, F. B., Solé-Casals, J., Cichocki, A. (2008). EEG

windowed statistical wavelet scoring for evaluation

and discrimination of muscular artifacts. Physiol.

Meas. 29 1435–1452.

IJCCI2012-InternationalJointConferenceonComputationalIntelligence

676