ARTICLE
Received 16 Mar 2011 | Accepted 1 Sep 2011 | Published 4 Oct 2011 DOI: 10.1038/ncomms1496
Daniel Silk1,2, Paul D.W. Kirk1,2, Chris P. Barnes1,2, Tina Toni1,2,3,, Anna Rose3,4, Simon Moon3, Margaret J. Dallman3,4 & Michael P.H. Stumpf1,2,3,5
Chaos and oscillations continue to capture the interest of both the scientic and public domains. Yet despite the importance of these qualitative features, most attempts at constructing mathematical models of such phenomena have taken an indirect, quantitative approach, for example, by tting models to a nite number of data points. Here we develop a qualitative inference framework that allows us to both reverse-engineer and design systems exhibiting these and other dynamical behaviours by directly specifying the desired characteristics of the underlying dynamical attractor. This change in perspective from quantitative to qualitative dynamics, provides fundamental and new insights into the properties of dynamical systems.
Designing attractive models via automated identication of chaotic and oscillatory dynamical regimes
1 Centre for Bioinformatics, Imperial College London, London SW7 2AZ, UK. 2 Institute of Mathematical Sciences, Imperial College London, London SW7 2AZ, UK. 3 Centre for Integrative Systems Biology at Imperial College London, London SW7 2AZ, UK. 4 Division of Cell and Molecular Biology, Imperial College London, London SW7 2AZ, UK. 5 Institute of Chemical Biology, Imperial College London, London SW7 2AZ, UK. Present address: Department for Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. Correspondence and requests for materials should be addressed to M.P.H.S. (email: [email protected]).
NATURE COMMUNICATIONS | 2:489 | DOI: 10.1038/ncomms1496 | www.nature.com/naturecommunications
2011 Macmillan Publishers Limited. All rights reserved.
ARTICLE
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms1496
Mathematical modelling requires a combination of experimentation, domain knowledge and, at times, a measure of luck. Beyond the intrinsic challenges of describing com
plex and complicated phenomena, the difficulty resides at a very fundamental level with the diversity of models that could explain a given set of observations. This is a manifestation of the so-called inverse problem1, which is encountered whenever we aim to reconstruct a model of the process from which data have been generated. Exploring the potential space of solutions computationally can be prohibitively expensive and will generally require sophisticated numerical approaches or search heuristics, as well as expert guidance and manual interventions. Parameter estimation2, model inference3 and model selection4,5 all address aspects of this problem.
The inverse problem also applies in a dierent context: the design of systems with specied or desired outputs. Here again we have a multitude of dierent modelsor, for sufficiently complicated models, a potentially vast range of parametersthat full a given set of design objectives. Therefore, system design can be fraught with the same challenges as statistical inference or reverse-engineering tasks: in the former case, we want to learn the existing structure and properties of a system that has produced certain types of data, whereas in the latter we want to design constructible systems that will reliably and robustly exhibit certain types of behaviour.
These challenges are oen further exacerbated by unsuitable or insufficient encoding of the behaviour that we observe (in natural systems) or would like to see (in designed systems). For example, if we aim to estimate parameters describing an oscillating system from a series of observations, it is possible to get good and even globally optimal ts to the data, without nding a qualitatively acceptable solution. Various methods of qualitative inference have been developed to address this issue; the topology of bifurcation diagrams6,7, local stability properties of dynamically invariant sets8,9, symbolic sequences of chaotic systems10 and temporal logic constraints11,12 have variously been used to drive parameter searches, or for model checking. However, these methods are either limited in the complexity of behaviour they can detect, or by conditioning on surrogate data (for example, forcing solutions through a small number of points), they suer in the same way as quantitative approaches. The method proposed here extends the scope of the promising, but underdeveloped, class of qualitative parameter estimation algorithms13, allowing detection and control of the most complex and elusive dynamical behaviours, such as oscillations, chaos and hyperchaos.
We consider models of the general form dd
a
b
0< 0
0= 0
0> 0
0 > 0
1> 0
?
X0 X0 + 0
t
c
Vk1
L( k1 , y0; )
target
Vk Vk+1
k1
k
k+1
L( k , y0; ) L( k+1 , y0; )
target
k1
k+1
k
Figure 1 | Encoding and inferring the desired dynamics. (a) LEs, 0, n, characterize the contraction/expansion of an initially small perturbation, 0, to the system. (b) The leading LE determines the principal dynamics and characteristics of the attractor of a dynamical system. For 0 < 0, the attractor will be a stable xed point; stable oscillating solutions will be obtained, if 0 = 0; for 0 > 0 we observe chaos and the system will exhibita so-called strange attractor; if more than one LE is positive, then we speak of hyperchaos and the attractor will exhibit behaviour with similar statistical properties to white noise. (c) Key steps in the UKF for qualitative inference. At the kth iteration, the current prior parameter distribution is formed by perturbing the previous posterior, k, with the process noisevk. The distribution of LEs for the model f induced by the prior parameter distribution is calculated via the LE estimation routine L and the unscented transform. Comparing the mean LE, lk, to the target LE, target, the prior parameters are updated using the UKF update equations. As the lter proceeds, parameters are found that locally minimise the sum of squared error between target and estimated LEs.
) ( ( ), ; ),
= 0 q
where y(t) denotes the n-dimensional state of the system at time t; f is the gradient eld characterized by a parameter vector; , and y0 = y(0) are the initial conditions, which may also be unknown.
Coaxing the solutions of such systems into exhibiting a desired dynamical behaviour is reliant on the ability to, rst, encode the behaviour sufficiently as constraints on a set of model properties that may be conveniently evaluated, and second, to identify regions in parameter space for which these constraints are satised. Here we meet these challenges using a combination of statistical and dynamical systems techniques. In particular, we pose the problem within a statespace framework, where the observation function corresponds to evaluating the type of attractor exhibited by the model with given parameters and initial conditions. We then exploit the exibility and efficiency of the unscented Kalman lter (UKF) to systematically move in parameter space until the desired or expected dynamical behaviour is exhibited. The approach, outlined in Figure 1 and developed fully in the Methods and Supplementary Information, is demonstrated below within dierent contexts, covering some classical dynamical model systems and electronic circuits that
(1)(1)
y t
( t f y t y
exhibit oscillations, chaos and hyperchaos, and a biological regulatory system that exhibits oscillatory behaviour.
ResultsOscillations and chaos in electronic circuits. The elimination of chaos from a system, or conversely its chaotication, have potential applications to biological, medical, information processing and other technological systems14. Here we use a simple electric circuit15 (shown in Fig. 2a), to illustrate how our method can be used to tune the system parameters such that the dynamics are driven into and out of chaos. The circuit model includes a parameter a, representing the scaled resistance of a variable resistor, R2, which we make the lone subject of the inference. In turn, we start the system in an oscillatory regime and tune the parameter according to the posterior predictions at each step of the UKF, until we enter a chaotic regime, and vice versa (see Fig. 2b). The two desired behaviours are encoded as constraints only on the target maximal Lyapunov exponent (LE), specifying, 1 = 0, for oscillations, and, 1 = d > tol for chaos, where
NATURE COMMUNICATIONS | 2:489 | DOI: 10.1038/ncomms1496 | www.nature.com/naturecommunications
2011 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms1496
ARTICLE
a
b
regulatory dynamics with mRNA transcription modelled by a Hill function,
M k M P P
= deg h
0.8
0.0
+Vb
R0
D
+ +
a
a
0.0
C L
1 1 2 0
/( ( / ) )
P k P M k P
deg
1 1 1 1
= +
n
P k P k P
deg
2 2 1 1
= + ,
where state variables M, P1 and P2, are the molecular concentrations of Hes1 mRNA, cytoplasmic and nuclear proteins, respectively. The parameter kdeg is the Hes1 protein degradation rate that we assume to be the same for both cytoplasmic and nuclear proteins; k1 is the rate of transport of Hes1 protein into the nucleus; P0 is the amount of Hes1 protein in the nucleus, when the rate of transcription of Hes1 mRNA is at half its maximal value; is the rate of translation of Hes1 mRNA, and h is the Hill coefficient. For the inference we take, kdeg, to be the experimentally determined value of 0.03 min 1
(ref. 19).
In Figure 2c, we show the results for the inference using our algorithm on the model shown above. Note that the value inferred for parameter k1, is signicantly lower than the range of values investigated for the continuous deterministic delay model of Momiji and Monk17. Interestingly, repeating the inference with dierent initial parameter sets leads to similar values of k1 (k1 < 0.01), but to a broad range of values for the other parameters, all of which result in oscillatory behaviour. Our qualitative inference thus suggests that oscillations of Hes1 protein and mRNA levels are strongly dependent on maintaining a low rate of transport of Hes1 protein into the nucleus, and that the dependence on other system parameters is less strong. As 1/k1 is the expected time Hes1 spends in the cytoplasm, this
corresponds to the delay that had previously been posited to be necessary for such oscillations to occur17. Our approach readily identies a parameter regime exhibiting oscillatory dynamics without explicitly requiring (discrete) time delays.
Next, we used the qualitative inference result as the basis to estimate the model parameters from the Hes1 data described below. An approximate Bayesian computation algorithm (ABC SMC20), capable of sampling from non-Gaussian and multimodal posteriors, was employed and Figure 2d shows the ts of simulated trajectories for 20 parameters drawn randomly from the resulting posterior distribution; these are in good agreement with the condence intervals (the blue bands in Fig. 2d), which can be obtained from the time-course data via a Bayesian nonparametric method21. It is worth noting that using the UKF alone, we could in principle consider the LEs and data together to infer parameters that are both qualitatively and quantitatively acceptable. However, by splitting the inference, we take advantage of the strengths of each algorithm within the Bayesian framework; rst we exploit the efficiency of the UKF to work with a sophisticated encoding of the desired behaviour that is computationally expensive to calculate; subsequently we use this qualitative information to construct suitable priors for an ABC method capable of dealing with non-Gaussian posteriors.
Designing attractors. Although the maximal LE alone is sufficient to encode xed points, limit cycles and strange attractors, we may include extra target exponents to design the complete Lyapunov spectrum (Fig. 3a), design the (KaplanYorke) fractal dimension22
(D k ik i k
= + = +
1 1
l l
5 15 25
0 10 20 30 40 50
(2)(2)
(3)(3)
(4)(4)
OA C*
+
Chaos
R R1 R2
0.8
Oscillations
UKF iteration
c
d
K 1 v hP 0
1.00.07.04.00.60.00.60.0
0 40 80 120
E ~ 0.01E ~ 1e -4
Hes1 fold change
8
6
4
2
0 50 150 250
Time (mins)
UKF iteration
E ~ 0.1
Figure 2 | Detecting and controlling chaos and oscillations. Plots show the estimated parameters at successive iterations of the UKF. Snapshotsof the developing attractor are shown above the plots. The sum of squared error (E) is indicated for different sections of the parameter trajectories. (a) Diagram of the circuit investigated. (b) The lter is able to drive the electric circuit between oscillations and chaos in less than 10 iterations. (c) Parameter trajectories for a simple model of the Hes1 regulatory system that yield oscillations. Several regions in parameter space can be identied that exhibit oscillatory behaviour. (d) Examples of trajectories generated from a region in parameter space that was found using our approach. Here we used the qualitative inference procedure to elicit a prior to be used for parameter inference. Trajectories were sampled from the prior. Data are indicated by red circles and represent fold change in Hes1 mRNA; the blue strips indicate the condence intervals obtained using Gaussian process regression, in which standard Gaussian noise is assumed, with maximum marginal likelihood estimates for the other hyperparameters21.
tol is taken larger-than-the-expected error in the LE estimation procedure, as discussed in the Supplementary Information.
For systems of this size, the qualitative dynamical regimes can be explored exhaustively and in short time (nding the desired behaviour takes minutes even for moderate sized systems).
Detecting oscillations in immune signalling. Oscillations appear to be ubiquitous in nature, yet, for reasons noted above, they oen remain elusive to quantitatively driven parameter inference techniques. Here we consider a dynamical system describing the expression levels of the transcription factor Hes1, which is involved in regulating the segmentation of vertebrate embryos16.
Oscillations of Hes1 expression levels have been observed in vitro in mouse cell lines, and reproduced using various modelling approaches, including continuous deterministic delay16,17 and
discrete stochastic delay models18. We investigate a simple three-component ordinary dierential equation (ODE) model of the
/ | |, where k is the largest integer for which ik i
=
1 0
l ) of a systems attractor (Fig. 3b), or drive models to behave hyper-chaotically (Fig. 3c).
The rst two of these applications are illustrated with the Lorenz system23 which has become a canonical example of how sensitivity to initial conditions can give rise to unpredictable behaviour. The model is known to exhibit a chaotic regime with LEs, = (0.906, 0, 14.57), for parameter vector (, , ) = (10, 28, 8/3). Here we
NATURE COMMUNICATIONS | 2:489 | DOI: 10.1038/ncomms1496 | www.nature.com/naturecommunications
2011 Macmillan Publishers Limited. All rights reserved.
ARTICLE
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms1496
a b
20 11.6
10.827.827.5
2.5
0.5
5
1 < EE ~ 1 E ~0.1E ~0.01
1 < EE ~ 4e-3 E ~ 7e-5
16
12
8
25
15
5 15
UKF iteration
8
c
4
0 20 60 100
UKF iteration
Figure 3 | Designing attractive models. (a) Inferring a complete spectrum. After only 22 iterations, the characteristic buttery strange attractor emerges. The nal parameters and LEs are = 10.2, = 29.2, = 2.45 and (0.899, 2.7410 4, 14.6). (b) A function of the LEs, the KaplanYorke fractal dimension may also be used to specify the desired attractor. Here parameters for a target dimension of 1 are found for the Lorenz system within 20 iterations, giving rise to a limit cycle as required by the theory. (c) Three-dimensional projections of the hyperchaotic system with parameter vector (a, b, c, d, e, f) = (49.98, 35.86, 30.5, 1.35, 36.6, 33.8) and corresponding LEs (31.8, 16.8, 19.1, 71.4). Within a few iterations, our algorithm was able to drive the system towards an attractor characterized by LEs twice the size of any that had previously been reported.
infer back these parameters, starting with dierent prior means, by setting our target Lyapunov spectrum to . If we restrict the parameter search to the region [0, 30]3, as described in the Supplementary Information, we are able to do this reliably from random starting positions. The parameter trajectories and evolving attractor of a representative run of the inference algorithm is shown in Figure 3a, where the sum of squared error between estimated and target LEs is less than 810 5 aer the 100th iteration. Without constraints on the parameters, the inference algorithm converges to dierent parameter combinations that display indistinguishable LEs. This allows us to assess (for example) the robustness of chaotic dynamics by mapping systematically the regions of parameter space that yield similar LEs. Figure 3b shows how the fractal dimensiona function of the LEsmay also be tuned (in this example, halved). Although computational difficulties have in the past precluded such investigations, our approach allows us to map attractor structures (and the range of parameters giving rise to similar attractors) very efficiently.
For the third application of driving a system into hyper-chaos, we investigate a four-dimensional system with six parameters, whose signicance lies in having two very large LEs (1[10.7741, 12.9798] and 2[0.4145, 2.6669]) over a broad parameter range24. The resulting highly complex deterministic dynamics share statistical properties with white noise, making it attractive for engineering applications such as communication encryption and random number generation. By setting large target values of 1 and 2, we use our method to obtain parameters for which the system displays LEs that are over two times bigger than previously found for the system. Figure 3c shows the three-dimensional projections of the resulting hyper-chaotic attractor.
These are, of course, toy applications, but they demonstrate the exibility and potential uses of this approach: we can really start to explore qualitative behaviour in a numerically efficient and speedy
manner. For example, it becomes possible to map (or design) the qualitative characteristics of complex systems and to test robustness of qualitative system features systematically.
Discussion
We have demonstrated that it is possible to use statistical inference techniques to condition dynamical systems on observed (biological oscillations in Hes1) or desired qualitative characteristics (oscillations, chaos and hyper-chaos in natural and engineered systems). This provides us with unprecedented ability to probe the workings of dynamical systems. Here we have only used the approach for inference and design of the attractors of dynamical systems, as encoded by their LEs. This, however, has already been enough to show that it is not necessary to impose discrete time delays to explain the oscillations in the Hes1 system1618.
A focus on qualitative features has several advantages: rst, it is notoriously difficult, for example, to ensure that parameter inference preferentially (let alone exclusively) explores regions in parameter space that correspond to the correct qualitative behaviour such as oscillations. This is the case for optimization as well as the more sophisticated estimation procedures. Arguably, however, solutions that display the correct qualitative behaviour are more interesting than those that locally minimize some cost-function in light of some limited data. Obviously, in a design setting, ensuring the correct qualitative behaviour is equally important.
Second, the numerical performance of the current approach allows us to study fundamental aspects related to the robustness of qualitative behaviour. This allows us, for the rst time, to ascertain how likely a system is to produce a given Lyapunov spectrum (and hence attractor dimension) for dierent parameter values, . Our approach, coupled with means of covering large-dimensional parameter spaces, such as Latin-hypercube or Sobol sampling25,
allows us to explore such qualitative robustness. Or more specically, we can map out boundaries separating areas in phase space with dierent qualitative types of behaviour. We can also drive systems into regions with LEs of magnitudes not previously observed. The last aspect will have particular appeal to information and communication scientists as such hyper-chaos shares important properties with white noise and potential applications in cryptography and coding theory abound26.
Finally, our approach can also be used to condition dynamical systems on all manner of observed or desired qualitative dynamics, such as threshold behaviour, bifurcations, robustness, temporal ordering and so on. To rule out that a mathematical model can exhibit a certain dynamical behaviour will, however, require exhaustive numerical sampling of the parameter space; but coupled to ideas from probabilistic computing27; our procedure lends itself to such investigations. Both for inference and design problems, we foresee vast scope for applying this type of qualitative inference-based modelling. There is still a lack of understanding about the interplay between qualitative and quantitative features of dynamical systems28; this becomes more pressing to address as the systems we are considering become more complicated and the data collected more detailed. Flexibility in parameter estimationwhether based on qualitative or quantitative system featureswill be an important feature for the analysis of such system, as well as the design of synthetic systems in engineering and synthetic biology.
Methods
Encoding dynamics through Lyapunov exponents. Consider a continuous time dynamical systemsimilar results hold for the discrete casedescribed by,dd
yt f y
t t
= ( ),
where f is an n-dimensional gradient eld. To study the sensitivity of f to initial conditions, we consider the evolution of an initially orthonormal axes of n vectors,
(5)(5)
NATURE COMMUNICATIONS | 2:489 | DOI: 10.1038/ncomms1496 | www.nature.com/naturecommunications
2011 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms1496
ARTICLE
{1, 2, , n}, in the tangent space at y0. At time t, each i satises the linear equation,
d d
e e
i t i
t Df y
efficiency), the deterministic and minimal sampling scheme at the heart of the lter requires relatively few LE evaluations at each iteration (2np + 1, where np is the number of parameters to be inferred). Further, the function that is the subject of the inference may be highly-nonlinear and can take any parametric form, such as a feed-forward neural network36, or as in our case, a routine for estimating the LEs of a model with a given parameter set.
With [X]i denoting the ith column of the matrix X, the UKF algorithm for parameter estimation is given by,
Initialize: ( )
(( )( ) )
q q
q q q q
( ) ,
where Df(yt) is the Jacobian of f evaluated along the orbit yt. Equations (5) and (6) describe the expansion/contraction of an n-dimensional ellipsoid in the tangent space at yt, and we denote the average exponential rate of growth over all t of the ith principal axis of the ellipsoid as i. The quantities, 1 2 n, are called the global LEs of f. In particular, the sign of the maximal LE, 1, determines the fate of almost all small perturbations to the systems state, and consequently, the nature of the underlying dynamical attractor. For 1 < 0, all small perturbations die out and trajectories that start sufficiently close to each other converge to the same stable xed point in state-space; for 1 = 0, initially close orbits remain close but distinct, corresponding to oscillatory dynamics on a limit-cycle or torus (for tori, at least one other exponent must be zero); and nally for 1 > 0, small perturbations grow exponentially, and the system evolves chaotically within the folded space of a so-called strange attractor (for two or more positive denite LEs, we speak of hyperchaos).
In general, nonlinear system equations and the asymptotic nature of the LEs precludes any analytic evaluation. Instead, various methods of numerical approximation of these quantities, both directly from ODE models and from time-series data2931 have been developed. In this paper, Lyapunov spectra are calculated using a Python implementation of a method proposed by Benettin et al.32 and Shimada and Nagashima33 (outlined in the Supplementary Information) for inference of LEs when the dierential equations are known.
Lyapunov spectrum driven parameter estimation. Unlike in the case for linear systems, where identifying suitable parameters that produce observed or desired dynamics is trivial, inference for highly nonlinear systems is far from straightforward. Indeed, exact inferences are prohibitively expensive for even small systems, and so a host of dierent approximation methods have been proposed20,34,35. In our
case, two further complications arise from using LEs to encode the desired behaviour. First, the form of the mapping between model parameters and LEs is not closed, making methods that rely on an approximation of the estimation routine or its derivatives, such as the extended Kalman lter, difficult to apply. Second, LEs are signicantly more expensive to compute than more traditional cost functions, ruling out the use of approaches such as particle ltering or sequential Monte-Carlo methods that require extensive sampling of regions of parameter space and calculation of the corresponding LEs at each iteration.
To overcome these challenges, we exploit the efficiency and exibility of the UKF3638, seeking here to infer the posterior distribution over parameters that give rise to the desired LEs. Typically, the UKF is applied for parameter estimation ofa nonlinear mapping g() from a sequence of noisy measurements, yk, of the true states, xk, at discrete times k = t1, , tN. A dynamical state-space model is dened,
q q
k k k
v
=
(6)(6)
po
po T
E
P E
=
=
For each time point k = t1, , tN: Prediction step:
( | )
q q
q
k
0
0 0 0
pr
E y
P P R
=
=
= +
i k
po
k
1
1
1 1
pr
po
k
k
k
Update step:
po
pr
K y y
P P K P K
( )
q q
k
po
= +
=
where
k
k k k
pr
T
k
k
k yk k
Y g x
y Y
P Y y Y
k k k
k
i
L
=
=
=
( , )
2 [ ]
2 ([ ] )([
qkpr
w
L w ]] )
2 ([ ] )([ ] )
i k
T
m
i
k i
0
=
=
c
qkpr
y Q
P Y y
K P
+
=
=
yk
i
k i k k
k
i
0
q w q
0
pr yk i
c
i
k i
prk k i k
T
k
k
P1
qkpr qkpr
L qkpr qkpr
qqk
= pr yk yk
Various schemes for sigma-point selection exist including those for minimal set size, higher than third-order accuracy and (as dened and used in this study) guaranteed positive-deniteness of the parameter covariance matrices3942, which is
necessary for the square roots obtained by Cholesky decomposition when calculating the sigma points. The scaled sigma-point scheme thus proceeds as,
[ ]
[ ] ( ) , ,
0 0 0
qkpr
qkpr
qkpr
k k
(7)(7)
(8)(8)
= +
pr m
k i k
pr
= = + =
= + +
q w l l
q l w
1 00
2
1 0
L i
L P i L
1
y g x u
k k k k
pr
k
=
L i
L P i L L
= + + + =
= +
l l a b
( )
= +
( , )
q where uk(0, Qk) represents the measurement noise, vk(0, Rk) is the articial process noise driving the system, and g() is the mapping for which parametersk are to be inferred. The UKF (described in full below) is then characterized by the iterative application of a two step, predict and update, procedure. In the prediction step the current parameter estimate is perturbed by the driving process noise vk forming a priori estimates (which are conditional on all but the current observation) for the parameter mean and covariance. These we denote as qkpr and
Pk
i
[ ] ( ) , ,
ww w l
i
q l
pr
= +
L i L
= = + =
pr
m
c
k i k
k
1 2
c
i
1
2 1 2
( ) , , ,
i
where
l a k
= +
2( )
L L and parameters , and may be chosen to control the positive deniteness of covariance matrices, spread of the sigma-points, and error in the kurtosis, respectively. Our choices for the UKF parameter and noise covariances are discussed in the Supplementary Information.
To apply to the UKF for qualitative inference, we amend the dynamical state-space model to,q q
k k k
v
pr, respectively. The update step then updates the a priori statistics using the further measurement, yk, to form a posteriori estimates, qkpo and Pk
po. Aer all observations have been processed, we arrive at the nal parameter estimate, qtNpo
(with covariance PtN
po).
A crucial step in the algorithm is the propagation of the a priori parameter distribution statistics through the model, g(). Assuming linearity of this transformation, a closed form optimal lter may be derived (known as the Kalman lter). However, this assumption would make the algorithm inappropriate for use with the highly nonlinear systems and the choice of g() considered here. It is how the UKF copes with this challenge, namely its use of the unscented transform, that makes it particularly suitable for our method of qualitative feature-driven parameter estimation.
The unscented transform is motivated by the idea that probability distributions are easier to approximate than highly nonlinear functions39. In contrastto the Extended Kalman lter where nonlinear state transition and observation functions are approximated by their linearized forms, the UKF denes a set, k,
of sigma-pointsdeterministically sampled particles from the current posterior
parameter distribution (given by qkpo1 and Pk
po), that along with corresponding
weights, { , }
w w
= +
1
l q
target = +
L y f u
k k
( , ; ) ,
0 where L() maps parameters to the encoding of the dynamical behaviour (here a numerical routine to calculate the Lyapunov spectrum), target is a constant target vector of LEs, y0 denotes the initial conditions, and f is the dynamical system under investigation (with unknown parameter vector , considered as a hidden state of the system and not subject to temporal dynamics). To see how equations (9) and (10) t the state-space model format for UKF parameter estimation, it is helpful to consider the time series (target, target, target, ) as the observed data from which
we learn the parameters of the nonlinear mapping L(). Our use of the UKF is characterized by a repeated comparison of the simulated dynamics for each sigma point to the same (as specied) desired dynamical behaviour. In this respect, we use the UKF as a smoother; there is no temporal ordering of the data supplied to the lter because all information about the observed (target) dynamics is given at each iteration. From an optimization viewpoint, the lter aims to minimize the prediction-error function,
E g y f Q g y f i
im ic k, completely capture its mean and covariance. The sigma points can be propagated individually through the nonlinear observation function, and recombined to estimate the mean and covariance of the predicted observation, yk,
to third-order accuracy in the Taylor expansion, using the equations given below. Under the approximate assumption of Gaussian prior and posterior distributions (higher order moments may be captured if desired at the cost of computational
( k T k
q q l q l
=
=
) [ ( , ; ) ] ( ) [ ( , ; ) ],
0 1 0
target target
(9)(9)
(10)(10)
(11)(11)
1
NATURE COMMUNICATIONS | 2:489 | DOI: 10.1038/ncomms1496 | www.nature.com/naturecommunications
2011 Macmillan Publishers Limited. All rights reserved.
ARTICLE
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms1496
thus moving the parameters towards a set for which the system exhibits the desired dynamical regime.
Hes 1 quantitative real-time PCR. Dendritic cells were dierentiated from bone marrow, as described previously43. Rat Jgd1/humanFc fusion protein (R&D Systems) or human IgG1 (Sigma Aldrich) (control samples) were immobilized onto tissue culture plates (10 g ml 1 in PBS) overnight at 4 C. Dendritic cells were spun onto the plate and cells were collected at the appropriate time. Total RNA was isolated using the Absolutely RNA micro prep kit (Stratagene). Complementary DNA was generated from 125 ng of total RNA using an archive kit (Applied Biosystems). 1 l of cDNA was used with PCR Mastermix and TaqMan primer and probes (both Applied Biosystems) and analysed on an Applied Biosystems 7500 PCR system. Cycle thresholds were normalized to 18S and calibrated to a PBS-treated control sample for relative quantication.
Computational implementation. All routines were implemented in Python using LSODE for integrating dierential equations. ABC inference was performed using the ABC-SysBio package44. Code is available from the authors on request.
References
1. Tarantola, A. Inverse Problem Theory and Methods for Model Selection (SIAM,
2005).
2. Moles, C. G., Mendes, P. & Banga, J. R. Parameter estimation in biochemical pathways: a comparison of global optimization methods. Genome Res. 13, 24672474 (2003).
3. Hartemink, A. Reverse engineering gene regulatory networks. Nat. Biotechnol. 23, 554555 (2005).
4. Vyshemirsky, V. & Girolami, M. A. Bayesian ranking of biochemical system models. Bioinformatics 24, 833839 (2008).
5. Toni, T. & Stumpf, M. Simulation-based model selection for dynamical systems in systems and population biology. Bioinformatics 26, 104110 (2010).
6. Lu, J., Engl, H. W. & Schuster, P. Inverse bifurcation analysis: application to simple gene systems. Algorithms Mol. Biol. 1, 11 (2006).
7. Khinast, J. & Luss, D. Mapping regions with dierent bifurcation diagrams of a reverseow reactor. AIChE J. 43, 20342047 (1997).
8. Chickarmane, V., Paladugu, S., Bergmann, F. & Sauro, H. Bifurcation discovery tool. Bioinformatics 21, 3688 (2005).
9. Conrad, F. Parameter estimation in some diusion and reaction models: an application of bifurcation theory. Chem. Eng. Sci. 39, 705711 (1984).10. Wu, X. & Hu, H. Parameter estimation only from the symbolic sequences generated by chaos system. Chaos 22, 359366 (2004).
11. Batt, G., Yordanov, B., Weiss, R. & Belta, C. Robustness analysis and tuning of synthetic gene networks. Bioinformatics 23, 24152422 (2007).
12. Rizk, A., Batt, G., Fages, F. & Soliman, S. On a continuous degree of satisfaction of temporal logic formulae with applications to systems biology: in Computational Methods in Systems Biology, 251268 (Springer, 2008).
13. Endler, L. et al. Designing and encoding models for synthetic biology. J. R. Soc. Interface 6, S405S417 (2009).
14. Chen, G. Control and anticontrol of chaos. Control of Oscillations and Chaos, 1997. Proceedings., 1997 1st International Conference 2, 181186 (1997).
15. Tamaeviius, A., Mykolaitis, G. & Pyragas, V. A simple chaotic oscillator for educational purposes. Eur. J. Phys. 26, 6163 (2005).
16. Momiji, H. & Monk, N. A. M. Dissecting the dynamics of the Hes1 genetic oscillator. J. Theor. Biol. 254, 784798 (2008).
17. Monk, N. A. M. Oscillatory expression of Hes1, p53, and NF-kappaB driven by transcriptional time delays. Curr. Biol. 13, 14091413 (2003).
18. Barrio, M., Burrage, K., Leier, A. & Tian, T. Oscillatory regulation of Hes1: discrete stochastic delay modelling and simulation. PLoS Comput. Biol. 2, e117 (2006).
19. Hirata, H. et al. Oscillatory expression of the bhlh factor hes1 regulated by a negative feedback loop. Science 298, 840843 (2002).
20. Toni, T., Welch, D., Strelkowa, N., Ipsen, A. & Stumpf, M. P. Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems. J. R. Soc. Interface 6, 187202 (2009).
21. Kirk, P. & Stumpf, M. Gaussian process regression bootstrapping: exploring the eects of uncertainty in time course data. Bioinformatics 25, 13001306 (2009).
22. Castiglione, P., Falcioni, M., Lesne, A. & Vulpiani, A. Chaos and Coarse Graining in Statistical Mechanics, Cambridge University Press, 2008.
23. Lorenz, E. Deterministic nonperiodic ow. Atmos. Sci. 20, 130141 (1963).24. Qi, G., van Wyk, B. & van Wyk, M. Analysis of a new hyperchaotic system with two large positive Lyapunov exponents. J. Phys.: Conf. Ser. 96, 012057 (2008).
25. Saltelli, A., Ratto, M., Andres, T. & Campolong, F. Global Sensitivity Analysis: The Primer (Wiley, 2008).
26. Schroeder, M. Number Theory in Science and Communication (Springer, 2008).
27. Mitzenmacher, M. & Upfal, E. Probability and Computing: Randomized Algorithms and Probabilistic Analysis (Cambridge University Press, 2005).
28. Kirk, P., Toni, T. & Stumpf, M. Parameter inference for biochemical systems that undergo a Hopf bifurcation. Biophys. J. 95, 540549 (2008).
29. Rosenstein, M., Collins, J. & Luca, C. D. A practical method for calculating largest lyapunov exponents from small data sets. Physica D 65, 117134 (1993).
30. Bryant, P., Brown, R. & Abarbanel, H. Lyapunov exponents from observed time series. Phys. Rev. Lett. 65, 15231526 (1990).
31. Gencay, R. & Dechert, W. An algorithm for the n Lyapunov exponents of an n-dimensional unknown dynamical system. Physica D: Nonlinear Phenomena 59, 142157 (1992).
32. Benettin, G., Galgani, L., Giorgilli, A. & Strelcyn, J. Lyapunov characteristic exponents for smooth dynamical systems and for hamiltonian systems; a method for computing all of them. Part 1: theory. Meccanica 15, 920 (1980).
33. Shimada, I. & Nagashima, T. A numerical approach to ergodic problem of dissipative dynamical systems. Prog. Theor. Phys. 61, 16051616 (1979).
34. Newton, M. & Raery, A. Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B (Methodol.) 56, 348 (1994).
35. Moral, P. D., Doucet, A. & Jasra, A. An adaptive sequential Monte Carlo method for approximate Bayesian computation. Ann. Appl. Stat. 112 (2009).
36. Wan, E. & van der Merwe, R. The unscented Kalman lter for nonlinear estimation. Adaptive Systems for Signal Processing, Communications, and Control Symposium 2000 153158 (2000).
37. Wan, E. A. & van der Merwe, R. Kalman Filtering and Neural Networks (John Wiley & Sons, 2002).
38. Quach, M., Brunel, N. & DAlch-Buc, F. Estimating parameters and hidden variables in non-linear state-space models based on odes for biological networks inference. Bioinformatics 23, 32093216 (2007).
39. Julier, S. & Uhlmann, J. A general method for approximating nonlinear transformations of probability distributions. Tech. Rep. RRG, Dept. of Engineering Science, University of Oxford (1996).
40. Tenne, D. & Singh, T. The higher order unscented lter. Proceedings of the American Control Conference 3, 24412446 (2003).
41. Julier, S. Reduced sigma point lters for the propagation of means and covariances through nonlinear transformations. Proceedings of the American Control Conference 2, 887892 (2002).
42. Julier, S. The scaled unscented transformation. Proceedings of the American Control Conference 6, 45554559 (2002).
43. Bugeon, L., Gardner, L., Rose, A., Gentle, M. & Dallman, M. Notch signaling induces a distinct cytokine prole in dendritic cells that supports T cell-mediated regulation and IL-2-dependent IL-17 production. J. Immunol. 181, 81898193 (2008).
44. Liepe, J. et al. ABC-SysBioapproximate Bayesian computation in Python with GPU support. Bioinformatics 26, 17971799 (2010).
Acknowledgements
This research was supported by the UK Biotechnology and Biological Sciences Research Council under the Systems Approaches to Biological Research (SABR) initiative (BB/FO05210/1) and the Centre for Integrative Systems Biology an Imperial College (CISBIC) (D.S., T.T., C.B., S.M., A.R., M.J.D. and M.P.H.S.) and a project grant to M.P.H.S.; P.K. was supported by the Wellcome Trust. M.P.H.S. is a Royal Society Wolfson Research Fellow. We dedicate this article to the memory of Jaroslav Stark.
Author contributions
D.S., A.R., M.J.D. and M.P.H.S. designed the research; D.S., P.D.W.K., C.B., T.T., A.R. and S.M. carried out the research; D.S., A.R. and M.P.H.S. wrote the manuscript. All authors have read and approved the nal version of the manuscript.
Additional information
Supplementary Information accompanies this paper at http://www.nature.com/ naturecommunications
Competing nancial interests: The authors declare no competing nancial interests.
Reprints and permission information is available online at http://npg.nature.com/ reprintsandpermissions/
How to cite this article: Silk, D. et al. Designing attractive models via automated identication of chaotic and oscillatory dynamical regimes. Nat. Commun. 2:489 doi: 10.1038/ncomms1496 (2011).
License: This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivative Works 3.0 Unported License. To view a copy of this license, visit http:// creativecommons.org/licenses/by-nc-nd/3.0/
NATURE COMMUNICATIONS | 2:489 | DOI: 10.1038/ncomms1496 | www.nature.com/naturecommunications
2011 Macmillan Publishers Limited. All rights reserved.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright Nature Publishing Group Oct 2011
Abstract
Chaos and oscillations continue to capture the interest of both the scientific and public domains. Yet despite the importance of these qualitative features, most attempts at constructing mathematical models of such phenomena have taken an indirect, quantitative approach, for example, by fitting models to a finite number of data points. Here we develop a qualitative inference framework that allows us to both reverse-engineer and design systems exhibiting these and other dynamical behaviours by directly specifying the desired characteristics of the underlying dynamical attractor. This change in perspective from quantitative to qualitative dynamics, provides fundamental and new insights into the properties of dynamical systems.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer