APPLICATION OF THE MP THEORY TO SYSTEMS BIOLOGY

Vincenzo Manca and Luca Marchetti

University of Verona, Department of Computer Science, Strada Le Grazie 15, 37134 Verona, Italy

Keywords:

Biomathematical discrete modelling, Systems biology, Metabolic P Systems.

Abstract:

The main framework analysis for the most part of biological dynamics remains the theory of ordinary differ-

ential equations (ODEs). However, ODEs present some intrinsic limitations in the evaluation of the kinetic

reaction rates. In contrast to ODEs, Metabolic P systems (MP systems), based on P˘aun’s P systems, were

introduced for modelling metabolic systems by means of suitable multiset rewriting grammars. In this work

three applications of MP systems are presented, for discovering the internal regulation logic of three phenom-

ena relevant in systems biology: i) the Goldbeter’s mitotic oscillator; ii) the glucose/insulin dynamics in the

Intravenous Glucose Tolerance Test; iii) the HER-2 oncogene-regulated transcriptome in human SUM-225

cells. Despite the differences between the considered phenomena, in all the cases a model was found that

exhibits good approximation of the observed time series and highlights results which are new or that have

been only theorized before.

1 INTRODUCTION

Systems biology (Ideker et al., 2001; Kitano, 2002)

has been brought to the forefront of life-science re-

search. Its goal is to understand biology at the sys-

tem level by examining the whole structure and the

dynamics of cellular and organismal function. How-

ever, the huge amount of experimental data which

very often can be measured by means of high through-

put technologies makes this job very difﬁcult. For

overcoming the problem, mathematical modelling is

emerging as a suitable way for analysing data and de-

veloping new knowledge (Bailey, 1998). In partic-

ular, an important problem of systems biology is the

mathematical deﬁnition of dynamical systems that ex-

plain observed dynamics of phenomena under investi-

gation, by taking into account what is already known

about each phenomenon.

The main framework analysis for the most part

of biological dynamics remains the theory of ordi-

nary differential equations (ODEs). However, ODEs

present some intrinsic limitations in the evaluation

of the kinetic reaction rates. In fact, very often,

the evaluation of the kinetic reaction rates in dif-

ferential models is problematic because it may re-

quire measurements hardly accessible in living or-

ganisms. Moreover, these measurements dramati-

cally alter the context of the investigated processes.

In contrast to ODEs, Metabolic P systems (MP sys-

tems) (Manca et al., 2005; Manca, 2010), based

on P˘aun’s P systems (P˘aun, 2002), were introduced

for modelling metabolic systems by means of suit-

able multiset rewriting grammars (see (Decraene and

Hinze, 2010; Hinze et al., 2007) for other discrete ap-

proaches compared to ODE systems).

A Metabolic P system is essentially a multiset

grammar where multiset transformations are regu-

lated by functions (MP grammar). Namely, a rule

like a + b → c means that a number u of molecules

of kind a and u of kind b are replaced by u molecules

of type c. The value of u is the ﬂux of the rule ap-

plication. Assume to consider a system at some time

steps i = 0, 1, 2, . . . , t, and consider a substance x that

is produced by rules r

, r

and is consumed by rule r

If u

[i], u

[i] are the ﬂuxes of the rules r

, r

respectively, in the passage from step i to step i + 1,

i ∈ N, the set of natural numbers, then the variation of

substance x is given by:

x[i+ 1] − x[i] = u

[i] − u

[i] + u

[i].

In an MP system it is assumed that in any state

the ﬂux u

of rule r

is provided by a state function

, called regulator of the rule. A state is essen-

tially determined by the values of the system vari-

ables, that is, substances and parameters (quantities

which are not transformed by the rules). However,

usually only some variables enter as arguments of reg-

ulators, therefore if u

= ϕ

(x, y, . . .), the arguments

x, y, . . . of ϕ

will be called tuners of the regulator.

Substances, reactions, and regulators specify the fol-

303

Manca V. and Marchetti L..

APPLICATION OF THE MP THEORY TO SYSTEMS BIOLOGY.

DOI: 10.5220/0003852003030308

In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSIGNALS-2012), pages 303-308

ISBN: 978-989-8425-89-8

 2012 SCITEPRESS (Science and Technology Publications, Lda.)

Table 1: Goldbeter’s oscillator, which has a cycle of about 25 min (Goldbeter, 1991).

= v

− v

− k

= V

(1−M)

+(1−M)

−V

= V

(1−X)

+(1−X)

−V

, V

= MV

lowing discrete dynamics (x[i]|i ∈ N) for any sub-

stance x, starting from the given value x[0], called

Equational Metabolic Algorithm (EMA):

x[i+ 1] = x[i] +

∑

j=1

[i] (1)

where m is the number of rules and α

are integer sto-

ichiometric coefﬁcients determined by the reactions

acting on substance x. Moreover, a temporal interval

τ, a conventional mole size ν, and substances masses

are considered, which specify the time and population

(discrete) granularities respectively. In the following

the MP dynamics we will present are computed in

MATLAB by applying the EMA formula given in (1).

MP systems are equipped with a powerful re-

gression algorithm, called Log-Gain Stoichiomet-

ric Stepwise Regression (LGSS), which derives MP

models from the time series of observed dynam-

ics and that can be applied independently from any

knowledge about reaction rate kinetics (Manca and

Marchetti, 2011). LGSS represents the most recent

solution, in terms of MP systems, of the dynamical

inverse problem, that is, of the identiﬁcation of (dis-

crete) mathematical models exhibiting an observed

dynamics and satisfying all the constraints required

by the speciﬁc knowledge about the modelled phe-

nomenon. The LGSS algorithm combines and ex-

tends the log-gain principles developed in the MP

system theory (Manca, 2008; Manca, 2009) with

the classical method of Stepwise Regression (Hock-

ing, 1976), which is a statistical regression technique

based on Least Squares Approximation and statistical

F-tests (Draper and Smith, 1981).

LGSS has been implemented by Luca Marchetti in

2010 as a set of MATLAB functions

. All the func-

tions have been ad hoc implemented (including the

stepwise regression function), and do not require ad-

ditional MATLAB toolboxes. The code which needs

harder computation (regression, simulation and tun-

ing of regression parameters) has been implemented

by taking advantage of the parallel processing fa-

See http://www.mathworks.it/index.html for details on

the MATLAB software.

cilities offered by the MATLAB Parallel Comput-

ing Toolbox (the software, however, runs also when

this toolbox is not installed). When the Optimiza-

tion Toolbox is installed in the system, LGSS supports

also the usage of the lsqlin function which computes

constrained linear least squares problems. This last

feature is very important when we need to force com-

plex constraints on the least squares estimation of the

computed regressor coefﬁcients.

The size of the systems of equations solved by

LGSS depends on the number of substances and re-

actions of the MP system under examination and on

its temporal interval τ (a smaller temporal interval re-

quire longer time series and so larger system of equa-

tions). However, the regression usually ends in few

minutes (less than one minute in many cases, using a

common laptop with a dual core CPU and 4 Gbyte of

RAM memory), but it can increase to hours when the

system is very big (i.e. a system with many hundreds

of thousands of equations, and a regression dictionary

of hundreds of regressors).

Even if computational tools are available for eval-

uating unknown parameters of ODE models (Mai-

wald and Timmer, 2008; Hoops et al., 2006), LGSS

seems to point out a general methodology for solving

dynamical inverse problems. In fact, LGSS not only

discovers unknowns parameters, but suggests also the

form of regulators as a combination of basic func-

tions. This possibility could be very important in the

case where the knowledge about the phenomenon un-

der investigation is so poor that no clear idea is avail-

able about the kind of model underlying the observed

behaviour.

In the following, three applications of MP systems

will be presented for discovering, by means of LGSS,

the internal regulation logic of three phenomena rele-

vant in systems biology:

1. the Goldbeter’s mitotic oscillator (Goldbeter,

1991);

2. the glucose/insulin dynamics in the Intravenous

Glucose Tolerance Test (IVGTT);

3. the HER-2 oncogene-regulated transcriptome in

human SUM-225 cells (working in progress with

BIOSIGNALS 2012 - International Conference on Bio-inspired Systems and Signal Processing

304

the Karmanos Cancer Institute, Wayne State Uni-

versity, Detroit).

Despite the differences between the considered phe-

nomena, in all the cases a model was found that ex-

hibits good approximation of the observed time series

and highlights results which are new or that have been

only theorized before (Manca and Marchetti, 2010a;

Manca et al., 2011; Marchetti and Manca, 2011).

2 MP GOLDBETER’S MITOTIC

OSCILLATOR

Rhythmic phenomena represent one of the most strik-

ing manifestations of dynamic behaviour in biological

systems. Understanding the molecular and cellular

mechanisms responsible for oscillations is crucial for

unravelling the dynamics of life (Goldbeter, 2002).

The Goldbeter’s mitotic oscillator represents the

simplest form of mitotic trigger mechanism found in

early amphibian embryos (Goldbeter,1991). The fun-

damental mechanism of mitotic oscillations concerns

the periodic change in the activation state of a pro-

tein produced by the cdc2 gene in ﬁssion yeast or by

homologous genes in other eukaryotes. The simplest

form of this mechanism is found in early amphibian

embryos. Here (see the picture in the left part of Ta-

ble 1) cyclin (C) is synthesized at a constant rate and

triggers the transformation of inactive (M

) into ac-

tive (M) cdc2 protein, which leads to the formation

of a complex known as M-phase promoting factor

(MPF). MPF triggers mitosis, but at the same time

M elicits the activation of a protease from state X

X. The active protease then degrades cyclin resulting

in the inactivation of cdc2. This brings the cell back

to initial conditions and a new division cycle can take

place. The ODE presented in Table 1 is the differ-

ential model of dynamics described in the right part

of Table 1, where C, M, X are the concentrations of

C, M, X respectively and 1− M, 1− X are the concen-

trations of M

, X

respectively (the deﬁnitions of the

parameters of the ODE model are not simple and are

not relevant for our further discussion, however they

can be found in (Goldbeter, 1991)).

In (Manca and Marchetti, 2010a) LGSS has been

applied to Goldbeter’s oscillator for showing that MP

systems yield a robust method for biological mod-

elling. In this manner, were automatically generated

700 models of this oscillator, which, for the most part,

provide the same order of approximation of Gold-

beter’s model (see Table 2). Moreover, by consid-

ering the phenomenon at different values of τ, dif-

ferent models have been obtained and in many cases

the analytical form of these models is simpler than

Goldbeter’s model. These models have been also cat-

egorised with respect to the analytical form of their

regulators. In this way a set of grammatical schemata

was obtained which express the regulation relation-

ship acting on the systems in different intervals of the

temporal interval τ.

Table 2: Example of MP mitotic oscillator (τ = 10.4 sec).

Constants and initial values as in (Manca and Marchetti,

2010a).

0 → C ϕ

= v

:C →

0 ϕ

= k

+ k

M + k

X −k

: M

→ M ϕ

= k

+ k

: M → M

= k

M + k

: X

→ X ϕ

= k

C+ k

: X → X

= k

+ k

X +k

+ k

3 GLUCOSE-INSULIN

INTERACTIONS IN THE IVGTT

The Intra Venous Glucose Tolerance Test (IVGTT) is

an experimental procedure used to study the glucose-

insulin endocrine regulatory system. Glucose is the

primary source of energy for body’s cells. It is trans-

ported from the intestines or liver to body cells via

the bloodstream, and is absorbed by the cells with

the intervention of the hormone insulin produced by

the pancreas. Normally, in mammals the blood glu-

cose concentration is tightly regulated as a part of

metabolic homeostasis (see Figure 1).

Figure 1: The glucose homeostasis.

If the plasma glucose concentration level is con-

stantly out of the usual range, then we are in presence

of blood glucose problems. In particular, when this

level is constantly higher than the range upper bound,

we are in presence of Diabetes: a dreadfully severe

and pervasive illness which concerns a good number

APPLICATION OF THE MP THEORY TO SYSTEMS BIOLOGY

305

of structures in the body. This motivates researches

to study the glucose-insulin endocrine regulatory sys-

tem.

The intravenous glucose tolerance test focuses on

the metabolism of glucose in a period of 3 hours start-

ing from the infusion of a bolus of glucose at time

t = 0. It is based on the assumption that, in a healthy

person, the glucose concentration decreases exponen-

tially with time followingthe loading dose. In (Manca

et al., 2011) Metabolic P systems theory has been ap-

plied for developing new physiologically based mod-

els of the glucose-insulin system which can be applied

to the IVGTT. In that work, ten data-sets obtained

from literature were considered and, for each of them,

an MP model which ﬁts the data and explains the reg-

ulations of the dynamics was found (see Figure 2).

Figure 2: The calculated insulin dynamics related to four of

the considered data-sets (Manca et al., 2011) (τ = 2 min).

In the differential models proposed in literature,

the delay of the insulin release is approached by

adding artiﬁcial substances or by considering a delay

integral kernel. Here, instead, the problem has been

solved by assuming that the insulin production is reg-

ulated by the plasma glucose concentration level both

at the current time and at some previous simulation

steps (glucose memories as introduced in (Manca and

Marchetti, 2010b)). This has permitted to point out,

in a more natural and detailed way, the delays which

act in the insulin production. Moreover, even if dif-

ferences were found in the regulation governing the

release of insulin, it was possible to observe a com-

mon logic which before was only theorized during the

development of the differential models (see (Manca

et al., 2011) for details). These preliminary results

and analysis suggest that MP models seem to pro-

vide comprehensive tools for discovering personal-

ized glucose-insulin dynamics.

4 MP ANALYSIS OF THE HER-2

ONCOGENE-REGULATED

TRANSCRIPTOME IN HUMAN

SUM-225 CELLS

The identiﬁcation of new gene networks are now

an important part of systems biology. In addition to

high-throughputexperimental methods, mathematical

and computational approaches are indispensable for

the analysis of gene networks. Given the large num-

ber of components of most networks of biological in-

terest, connected by positive and negative feedback

loops, an intuitive comprehension of the dynamics

of the system is often difﬁcult, if not impossible to

obtain. Mathematical modelling supported by com-

puter tools can contribute to the analysis of a regu-

latory network by allowing the biologist to focus on

a restricted number of plausible hypotheses. Many

reviews of the modelling and simulation of gene net-

works have been published in recent years (e.g. (Cao

et al., 2010; Bolouri and Davidson, 2002; Gilman

and Arkin, 2002; Jong, 2002; Hasty et al., 2001;

Smolen et al., 2000)), presenting the wide variety

of formalisms that have been proposed in the liter-

ature, such as oriented graphs, Bayesian networks,

Boolean networks, differential equations, stochastic

master equations and stochastic P systems.

MP systems were initially introduced to model

metabolic processes, but they can be successfully

used in each context where we want to infer models of

a system from a given set of time series. In (Marchetti

and Manca, 2011) an application of the MP theory to

gene expression analysis was developed. In this case,

a standard way for translating MP grammars involv-

ing gene expressions into corresponding quantitative

gene networks was found (see Table 3).

The number of the raw microarray time series

which need to be processed for a generic experiment

on human cells is usually of the order of tens of thou-

sands. Generally, however, only a small part of them

are related to the phenomenon under examination.

For this reason, before to start with the modelling of

the MP model, raw data need to be preprocessed fol-

lowing a methodology which comprises normaliza-

tion, ﬁltering and clustering. This methodology has

been developed during a work in progress where the

MP theory has been successfully applied for deﬁning

the gene network underlying the regulations acting on

the HER-2 oncogene-regulated transcriptome in hu-

man SUM-225 cells in order to deﬁne new therapies

for the breast cancer.

HER-2 is an epidermal growth factor receptor

which have been implicated in radioresistance in

BIOSIGNALS 2012 - International Conference on Bio-inspired Systems and Signal Processing

306

Table 3: An example of MP grammar related to a gene network (Marchetti and Manca, 2011).

MP grammar Quantitative gene network

: G1 →

0 ϕ

= k

· G1

0 → G2 ϕ

= k

· G3+ k

· G4

: G2 →

0 ϕ

= k

· G2

: G2 → G3 ϕ

= k

· G1

: G3 →

0 ϕ

= k

· G3

0 → G4 ϕ

= k

· G2

: G4 →

0 ϕ

= k

· G4

breast cancer and other malignancies (see Figure 3).

The analysis started by considering more than 24000

time series and ﬁnished by pointing out 1175 genes

which seem to be HER-2 oncogene-regulated. These

genes have been clustered following the ad hoc pro-

cedure deﬁned in (Marchetti and Manca, 2011) and,

ﬁnally, the MP model and the corresponding gene net-

work have been provided, which seem to explain the

regulation of the phenomenon.

Figure 3: The action of the HER-2 growth factor on the cell

transcriptome.

5 CONCLUSIONS

In contrast to ODEs, Metabolic P systems (MP sys-

tems), based on P˘aun’s P systems, were introduced

for modelling metabolic systems by means of suitable

multiset rewriting grammars. In this work, three ap-

plications of MP systems for discovering the internal

regulation logic of three phenomena relevant in sys-

tems biology have been presented.

The last two modelled phenomena are currently

under development in order to extend the MP method-

ology in cases more complex (i.e. the insulin-glucose

dynamics where C-peptide time series are taken into

account) or other kind of gene expression analysis re-

lated to other pathological situations. In this perspec-

tive we intend to develop algorithmic and computa-

tional tools for making the MP modelling more ade-

quate and useful in biomedical applications.

REFERENCES

Bailey, J. (1998). Mathematical modeling and analysis in

biochemical engineering: past accomplishments and

future opportunities. Biotechnology Progress, 14:8–

20.

Bolouri, H. and Davidson, E. (2002). Modeling transcrip-

tional regulatory networks. BioEssays, 24(12):1118–

1129.

Cao, H., Romero-Campero, F., Heeb, S., C´amara, M., and

Krasnogor, N. (2010). Evolving cell models for sys-

tems and synthetic biology. Systems and Synthetic Bi-

ology, 4(1):55–84.

Decraene, J. and Hinze, T. (2010). A Multidisciplinary Sur-

vey of Computational Techniques for the Modelling,

Simulation and Analysis of Biochemical Networks.

Journal of Universal Computer Science, 16(9):1152–

1175.

Draper, N. and Smith, H. (1981). Applied Regression Anal-

ysis, 2nd Edition. John Wiley & Sons, New York.

Gilman, A. and Arkin, A. (2002). Genetic “code”: Rep-

resentations and dynamical models of genetic compo-

nents and networks. Annual Review of Genomics and

Human Genetics, 3:341–369.

Goldbeter, A. (1991). A minimal cascade model for the

mitotic oscillator involving cyclin and cdc2 kinase.

PNAS, 88(20):9107–9111.

Goldbeter, A. (2002). Computational approaches to cellular

rhythms. Nature, 420:238–245.

Hasty, J., McMillen, D., Isaacs, F., and Collins, J. (2001).

Computational studies of gene regulatory networks:

In numero molecular biology. Nature Review Genet-

ics, 2(4):268–279.

Hinze, T., Hayat, S., Lenser, T., Matsumaru, N., and Dit-

trich, P. (2007). Hill Kinetics Meets P Systems: A

Case Study on Gene Regulatory Networks as Comput-

ing Agents in silico and in vivo. In Eleftherakis, G.,

Kefalas, P., and Paun, G., editors, Proceedings of the

Eight Workshop on Membrane Computing (WMC8),

pages 363–381. SEERC Publishers.

Hocking, R. (1976). The Analysis and Selection of Vari-

ables in Linear Regression. Biometrics, 32.

APPLICATION OF THE MP THEORY TO SYSTEMS BIOLOGY

307

Hoops, S., Sahle, S., Gauges, R., Lee, C., and Pahle, J.

(2006). COPASI–a COmplex PAthway SImulator.

Bioinformatics, 22(24).

Ideker, T., Galitski, T., and Hood, L. (2001). A new ap-

proach to decoding life: Systems biology. Annual Re-

view of Genomics and Human Genetics, 2:343–372.

Jong, H. (2002). Modeling and simulation of genetic regu-

latory systems: A literature review. Journal of Com-

putational Biology, 9(1):69–105.

Kitano, H. (2002). Systems biology: A brief overview. Sci-

ence, 295(5560):1662–1664.

Maiwald, T. and Timmer, J. (2008). Dynamical model-

ing and multi-experiment ﬁtting with PottersWheel.

Bioinformatics, 24(18):2037–2043.

Manca, V. (2008). The metabolic algorithm for P systems:

Principles and applications. Theoretical Computer

Science, 404:142–155.

Manca, V. (2009). Algorithmic Bioprocesses, chapter 28:

Log-Gain Principles for Metabolic P Systems, pages

585–605. Natural Computing. Springer-Verlag.

Manca, V. (2010). Metabolic P systems. Scholarpedia,

5(3):9273.

Manca, V., Bianco, L., and Fontana, F. (2005). Evolu-

tions and Oscillations of P systems: Theoretical Con-

siderations and Application to biological phenomena.

LNCS, (3365):63–84.

Manca, V. and Marchetti, L. (2010a). Goldbeter’s Mitotic

Oscillator Entirely Modeled by MP Systems. LNCS,

6501:273–284.

Manca, V. and Marchetti, L. (2010b). Metabolic approx-

imation of real periodical functions. The Journal of

Logic and Algebraic Programming, 79:363–373.

Manca, V. and Marchetti, L. (2011). Log-Gain Stoi-

chiometic Stepwise regression for MP systems. Inter-

national Journal of Foundations of Computer Science,

22(1):97–106.

Manca, V., Marchetti, L., and Pagliarini, R. (2011). MP

modelling of glucose-insulin interactions in the Intra-

venous Glucose Tolerance Test. International Journal

of Natural Computing Research, 2(3):13–24.

Marchetti, L. and Manca, V. (2011). A methodology based

on MP theory for gene expression analysis. LNCS. In

print.

P˘aun, G. (2002). Membrane Computing. An Introduction.

Springer.

Smolen, P., Baxter, D., and Byrne, J. (2000). Modeling tran-

scriptional control in gene networks: Methods, recent

results, and future directions. Bulletin of Mathemati-

cal Biology, 62(2):247–292.

BIOSIGNALS 2012 - International Conference on Bio-inspired Systems and Signal Processing

308