INTRODUCTION
Host-microbe interactions can have variable outcomes that result in at least four states, namely mutualism, commensalism, latency, and disease (1, 2). These states can be distinguished by the amount of damage incurred on the host. When damage is sufficient to affect homeostasis, disease ensues and is manifested as symptoms of distress. When damage is sufficient to cripple essential systems, death ensues. Studying the dynamics of host-microbe interactions is inherently difficult, since measurements of host-microbe interactions, such as survival time post-infection, are often very difficult to reproduce quantitatively even when the initial conditions appear to be nearly identical. There are several non-exclusionary explanations for the lack of detailed inter-experimental reproducibility even when the overall result is reproducible. On one hand, microbial pathogenesis is dependent on a dizzying array of variables that include microbe- and host-specific factors, such as inoculum size, route of infection, host state, temperature, associated microbiota, etc. (3) Even in the most carefully conducted experiment, it is unlikely that all variables are perfectly controlled from experiment to experiment. This is apparent even in experiments where a qualitative variable such as host death is consistent but quantitative outcomes such as time to death can vary significantly from experiment to experiment (4). For instance, even within an experiment where a set of genetically identical hosts in the form of inbred mice are infected with the same inoculum of a bacterium or parasite, organ microbial burden across individuals can vary by 100-fold (for an example, see Fig. 2 in reference 5). Such ranges are often attributed to experimental variation, although to our knowledge the sources of such variability have not been exhaustively investigated in any host-microbe interaction, and such studies may not be feasible with current experimental capabilities. This interpretation assumes that the system is deterministic since it operates on the implicit assumption that if an experimenter could control for all experimental variables, the results would be perfectly reproducible from experiment to experiment. Yet, this assumption has never been formally tested, and it is possible that experimental noise and lack of reproducibility in host-microbe interactions also reflect the mathematical properties of the systems under study.
Dynamical systems are either stochastic or deterministic. Stochastic systems are dominated by random variables and are inherently unpredictable. Deterministic systems are predictable in that they produce the same output given the same initial conditions. Chaotic systems are a subset of deterministic systems where the long-term course of the system is highly unpredictable, even while the system is theoretically predictable. This seeming contradiction stems from the high sensitivity to initial conditions possessed by a chaotic system, popularly captured by the notion of “the butterfly effect,” where a small change to the weather system such as a butterfly flapping its wings cascades onto a large change in the system later, potentially culminating in a tornado. In a deterministic system, the future course of the system is theoretically possible to determine, if given perfect knowledge of the current state of the system and its dynamics. But since in practice, we can never account fully and with complete precision for all the state variables, such long-term prediction is impossible. By the same token, experiments carried out in the context of a chaotic system cannot be expected to be repeatable since we cannot control for all variables with perfect precision. Here, we carried out an exploration of the dynamics of host-microbe interactions by first posing a thought experiment, or thought hypothesis, where we consider whether an experiment would be perfectly reproducible if all the variables could be controlled. Since chaotic systems are deterministic, such an experiment would, theoretically, in principle, be reproducible, but in reality, small variations in repeating the experiment are inevitable, and consequently, final outcomes would differ. We emphasize to the non-specialist reader that mathematical chaos does not mean “chaotic” in vernacular parlance. The free online dictionary defines chaos as “a condition or place of great disorder or confusion,” but the mathematical definition is that of a system that is highly sensitive to the initial conditions, which is quantified by the maximal Lyapunov exponent being greater than zero, indicating that nearby trajectories at a given point in time diverge exponentially with respect to time in the future. Chaotic systems are to be contrasted with stochastic, or random, dynamical systems. In stochastic systems, neither short- nor long-term prediction is possible due to an irreducible element of randomness. For example, the outcome of a gambling game where the cards are dealt at random cannot be predicted because complete chance determines the combinations at various points. Importantly, it has been observed that in biology, chaotic behavior is prevalent; however, as noted by Rogers et al. (6), it is often somewhat difficult to conclusively establish a chaotic signature due to a lack of sufficient data, as is the case in the current study.
There is very little information available on the dynamical characteristics of host-microbe interactions, i.e., whether these dynamics are stochastic, predictably deterministic, or chaotic.
RESULTS
Detecting chaos in time-to-death distributions
When faced with a time series sampled from some dynamical process, one might want to detect whether the underlying process is chaotic or stochastic. Of course, there could be some stochasticity on top of an underlying chaotic process, in which case one might still want to detect the presence of chaos. The difficulty arises from the fact that many statistics of a chaotic process resemble a stochastic one—the determinism of the process rendered barely detectable due to the unstable dynamical trajectories characteristic of chaos. However, methods exist for distinguishing chaos (18, 19). One such method is the permutation spectrum test, which studies the distribution of ordinal patterns in short consecutive subsequences of the time series (20, 21). In particular, the existence of forbidden ordinal patterns—those with no or almost no occurrence—is known as a strong indication of chaos, which distinguishes it from a truly stochastic time series. In the experiments described here, a population of hosts is exposed to pathogens, and the resulting time-to-death distribution is compared to that of a naturally occurring time-to-death distribution of a control population. In this case, the problem of detecting underlying chaos in the dynamics of host-microbe interactions, when the readout is time-to-death, is complicated by two factors. First, we do not have access to the internal dynamics of the system, only to the time at which a terminal event (death) occurs. Second, since we only gather one data point from each individual (time until death), we must study an assemblage of systems rather than a single continual system. This forces a degree of randomness on the data, especially as we cannot control the system’s internal parameters. In particular, we can no longer use the permutation spectrum test in the traditional way since we have a distribution of time-to-deaths pulled from a sample population rather than a sample of time series from single individual subjects. In addition, we note that in both worms and flies, the infection may not begin at the same time for all animals, as they may ingest the inoculum at different times post-exposure. As such, the time zero reflects the beginning of the experiment rather than the time of initial infection. Nevertheless, as stated in the methods, for both systems, infection is expected to occur shortly after the experiments begin. Given that this is a physiological infection method where the animals are infected by ingesting the pathogen, repeated infections are also theoretically likely, but neither are common since infected worms remain infected after first injecting bacteria and flies incur such gut damage from the initial infection that they are unlikely to be candidates for subsequent infection. Hence, our dynamic analysis is for the whole infection system rather than for individual animals, and we note that this approach is more akin to what happens in natural epidemics resulting from the encounter of a pathogen with a susceptible host population.
Faced with these problems, we devised a new test that we term an “inversion measure,” which we suggest can be used in detecting chaos based only on the distribution of time-to-event in an ensemble system. This method is based on a histogram representation of the time-to-event distribution. Similar to the permutation spectrum test, we subdivide the histogram’s bins into consecutive non-overlapping sequences of
To begin validating this measure, we apply it to simulated time-to-event data from two processes, one chaotic and one stochastic. Specifically, for a chaotic process, we generate data from the logistic map (18)
Fig 1
Time-to-event distributions. (A) Sampling geometric distribution. (B) Sampling from the chaotic quadratic map
We find a higher proportion of inversions in the chaotic data as opposed to the stochastic data, and we find statistical significance for rejecting the null hypothesis of a locally linear distribution in the chaotic case (
Fig 2
Inversion method of bootstrap for locally linear approximation of the distributions (histograms). (A) Sampling from a geometric distribution. (B) Sampling from the chaotic quadratic map. Red line indicates the actual inversion measure of the distribution. Cautionary note: the results derived by the inversion method presented in panel A, though in agreement with our hypothesis, are sensitive to parameter choice and can vary from run to run, even with a large sample size. In the text, please see a discussion of the false-positive levels as a function of the histogram’s number of bins, for data coming from the Weibull, a generalized exponential distribution.
To further test the ability of the inversion measure to reliably detect a chaotic signature, we applied the same test to time-to-event distributions associated with two other known chaotic systems: the Henon map and the Lorenz system, which are, respectively, two- and three-dimensional. In each of these, we picked a small window in space that intersects the attractor of the system (so it will be repeatedly visited by the system) and defined this window as our “event.” The Henon map is given by the dynamics
As stochastic control, we sampled data from various Weibull distributions, which is a common family of distributions of waiting times, with a wide variety of shapes. We swept over different choices of parameters for the number of bins for the histogram, and for each one ran the inversion measure multiple times to calculate the percentage of false positives. We found that for a certain range of bin size parameters (around 100–300 bins, given 10,000 sample points), false positives were low for a variety of Weibull distributions (ranging between 3% and 7%). However, if more bins were used, the number of false positives increased significantly. Still, given an appropriate choice for the number of bins, we were broadly able to distinguish between chaotic systems and stochastic ones. Specifically, when using 256 bins on 1,000 runs of 10,000 sample points each for data coming from Weibull distribution, Lorenz, and Henon maps, we obtained a 9.8% false-negative rate for the Lorenz process, an 11% false rate for the Henon process, and a 6.6% false-negative rate for the Weibull distribution. For these tests, we used a kernel-smoothed density (MATLAB function ksdensity with its default settings) as the null hypothesis rather than a local linearization.
We repeated the above experiments with an alternative choice of “terminal event,” where the event is given by passing a certain extreme threshold rather than landing inside of a small region,
We note that the Lorenz system as shown in Fig. 3 is characterized by erratic flips between two regions characterized by
Fig 3
Characteristic behavior of the Lorenz dynamics. (A) A typical dynamical trajectory of the first variable (
An additional problem associated with the inversion method needs further investigation. The inversion method uses the distribution of time to events as its input rather than a dynamic time series. This fact makes it harder to discriminate between chaotic and stochastic dynamics for certain chaotic systems whose time-to-event stationary distributions are similar to those of stochastic processes, an issue which can be heightened when taking into account the distribution of longer waiting times, for which the chaotic signal tends to fade away.
Biological data analysis
For the biological model systems described below, time-to-death data are obtained as described in the experimental setup in Materials and Methods, with no preprocessing or change of units. These experiments were conducted using automated
Caenorhabditis elegans
We first constructed the time-to-death histograms for both non-infected and infected
As can be observed in Fig. 4, the two histograms differ substantially. While the time-to-death distribution of the non-infected individuals resembles the normal distribution, the infected individuals’ histogram is far more irregular. This irregularity may partially result from a smaller sample size of the infected population (Fig. 3). The two histograms, however, come from two different underlying biological processes and so cannot be directly compared. The former results from a process of aging-related death, while the latter reflects the pathogenic process of the worm-microbe interaction and the consequent immune system response.
Fig 4
Survival analysis of non-infected (data from Kulp [21],
To discriminate whether our system of host-pathogen interactions exhibited chaos, we employed the inversion measure on these time distributions, as described above and in Materials and Methods. We find statistical significance for rejecting the null hypothesis in the infected population (
Fig 5
Inversion method of bootstrap for locally linear approximation of the distributions (histograms). (A) Sampling from non-infected
This observation, though not definitive due to the preliminary nature of our inversion-based test for chaos, lends credence to the hypothesis that the time-to-death data result from the infected
We repeated a similar procedure to analyze infected and non-infected fly data sets (Fig. 6). Here, however, we had to contend with data that were collected manually instead of by machine. While observations were mostly carried out in regular intervals, there was a specific time interval used between data collection points. This means that the number of deaths observed at the subsequent time point needed to be distributed over a wider time interval. To do this, we randomly redistributed the number of deaths observed at wider time intervals into regular subintervals. This introduces an additional element of randomness into the analysis, which we would expect to drown out the chaotic signature. Nevertheless, we still detected statistical significance (
Fig 6
Survival analysis of non-infected (
Fig 7
Comparing the null and sample distributions of the inversion measure. (A) Non-infected. (B) Infected.
Power analysis
To add context to our results, we conducted a power analysis to determine how many data points can be removed while maintaining statistical significance (see Materials and Methods for more details). We conducted this analysis with the infected worm data and saw that the statistical significance threshold of
Fig 8
Power analysis for the
DISCUSSION
In this study, we first established that a time-to-death distribution provided adequate information to analyze dynamical signatures in
The observation of potential chaotic signatures in time to death of a population of infected animals may have important implications for the predictability of microbial pathogenesis. Both models are invertebrates, which lack adaptive immunity. We do not know the generalizability of these observations to other host-microbe systems, but the occurrence of chaotic dynamics in vertebrates would affect our ability to predict future threats. For example, the coronavirus disease 2019 pandemic has highlighted the importance of identifying other potential threats in nature. The challenge of virulence forecasting is highlighted by the fact that there are more than 300,000 viruses among the more than 6,000 known mammalian species (24). Could the subset of mammalian viruses that threaten humans be predicted? Some may argue that the answer to this question must await additional mechanistic information that could be used to model human infections. However, even if we knew all the steps by which a single virus could interact with host cells, the immune response to that virus, the potential for transmissibility, prediction of virulence in a population, etc., it would not be possible if the overall process is stochastic or chaotic. This is because small differences in the host-microbe interaction would have major effects on the outcome of an infection, as manifested by a range of outcomes from asymptomatic infection to pathogenicity, which in the extreme results in the death of the host. An example of this phenomenon is the unpredictability of infection at the level of the individual, as evident in the current SARS-CoV-2 pandemic where outcomes range from asymptomatic infection to death and are determined by a set of variables that in combination produce uncertain outcomes (3). Chaotic systems are inherently unpredictable although it is possible to generate probabilistic models for short-term prediction of such systems, as is done in applications of stochastic processes and machine learning to weather forecasting. Hence, our finding that some host-microbe interactions may manifest as chaotic dynamical systems implies that absolute predictability with regard to the outcome of these interactions may be impossible if these dynamics are widespread in nature. Nevertheless, knowledge of whether some host-microbe interactions are chaotic could inform efforts to control such diseases including the possibility of exploiting the properties of chaos, itself, to our advantage. For example, once the relevant driving forces have been identified in the analysis of chaotic systems, tools from the field of control theory, in general, and chaos control theory, in particular, can be applied to hypothesize perturbations that could be applied to stabilize a chaotic system (25).
In our prior exploration of chaos with
In addition, further validation of our inversion measure and the bootstrap method for obtaining the statistical significance is necessary to solidify our conclusions. In this regard, we identify areas for future investigation, including obtaining a better understanding of how the choice of number (individuals infected) and the size of bins (time intervals) affect the results when a finer time resolution of the data sampling becomes available. In fact, this analysis indicates the need to develop new methods for following the dynamics of host-microbe interactions inside the host since the currently used variables to measure outcome, such as time to death, weight loss, fever, etc., do not have sufficient resolution to provide a conclusive result. In addition, since the inversion measure appears sensitive to parameters such as the number of bins and can result in false positives/negatives in certain parameter settings, further work will be needed to refine the measure and increase its accuracy. Finally, a deeper theoretical study is needed to understand the limits of discerning chaos from time-to-event data, given the nature of ergodic processes.
In recent years, there has been great concern about the reproducibility of biomedical research (26, 27). The detection of chaotic signatures in host-microbe interactions implies that these systems may have inherent challenges to reproducibility. Experimenters studying virulence or the efficacy of a therapeutic intervention in an animal model often choose an inoculum that rapidly kills all members of the infected group, leaving no time for the host-pathogen dynamics to present itself. Such doses can ensure conclusive results with relatively few animals but translate into making the experimental system deterministic. Consequently, experiments that force an outcome are not informative of the dynamical nature of natural host-microbe interactions. However, even when using a large inoculum to ensure mortality, there is considerable variation in such variables as time to death. In this regard, our observation of chaotic signatures suggests that differences in the microbial load administered and/or the site of infection could translate into large differences in effect size, negating the consistency and reproducibility of inter-experimental comparisons.
The fact that variation in experimental conditions is present should not affect the dynamical properties of the underlying dynamical response of the immune system to pathogens. In our simulation, we start with a random initial condition and are nonetheless able to distinguish between chaotic and stochastic dynamical behavior.
In summary, this work should be considered an exploration of the dynamics of host-microbe interactions. While our work suggests that a time-to-death distribution may provide adequate information to analyze for chaotic signatures in host-microbe interactions and that such signatures can be observed in
MATERIALS AND METHODS
Wild-type (N2)
The inversion measure on a time distribution
Given a distribution of times, we first convert the distribution into a histogram. In the case when the time points are naturally discrete, such as in our simulated data of waiting times sampled from discrete chaotic or stochastic processes, then we use each discrete time point as a bin. In the case of a continuous process, then we pick a parameter
Given a histogram with n bins, where each bin contains a whole number, we first add a random noise, distributed uniformly at random between 0 and epsilon, where epsilon < 1, to the histogram so as to break ties, ensuring each bin has a unique value while preserving the order of the unequal counts. Then, we subdivide the bins into consecutive nonoverlapping sequences of four bins each, with the remainder discarded. For each such sequence
To obtain a
For the fly data sets, in which some observations were not made in regular intervals, we redistributed events along gaps as follows. Whenever one or more observations were missed, we randomly uniformly distributed the next-observed number of events along the interval between the given observation and the prior observation. We then formed a histogram from this redistributed data and proceeded as before to calculate the inversion measure. Due to the randomness of redistribution, we carried this process out 1,000 times, thus generating a sample distribution of inversion measures. For each of these 1,000 histograms, we ran the bootstrap process with a sample size of 1,000, thereby generating a total of 1,000,000 inversion measures that make up the null distribution. We used these two distributions to generate a
For the uninfected flies, we combined two distinct data sets. In this case, we redistributed each data set along gaps as mentioned above and only then combined the two histograms into one and proceeded with the analysis as just described.
Power analysis
For the power analysis, we varied a parameter
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2024 Sella et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
ABSTRACT
Host-microbe interactions constitute dynamical systems that can be represented by mathematical formulations that determine their dynamic nature and are categorized as deterministic, stochastic, or chaotic. Knowing the type of dynamical interaction is essential for understanding the system under study. Very little experimental work has been done to determine the dynamical characteristics of host-microbe interactions, and its study poses significant challenges. The most straightforward experimental outcome involves an observation of time to death upon infection. However, in measuring this outcome, the internal parameters and the dynamics of each particular host-microbe interaction in a population of interactions are hidden from the experimentalist. To investigate whether a time-to-death (time-to-event) data set provides adequate information for searching for chaotic signatures, we first determined our ability to detect chaos in simulated data sets of time-to-event measurements and successfully distinguished the time-to-event distribution of a chaotic process from a comparable stochastic one. To do so, we introduced an inversion measure to test for a chaotic signature in time-to-event distributions. Next, we searched for chaos in the time-to-death of
IMPORTANCE
Is microbial pathogenesis a predictable scientific field? At a time when we are dealing with coronavirus disease 2019, there is intense interest in knowing about the epidemic potential of other microbial threats and new emerging infectious diseases. To know whether microbial pathogenesis will ever be a predictable scientific field requires knowing whether a host-microbe interaction follows deterministic, stochastic, or chaotic dynamics. If randomness and chaos are absent from virulence, there is hope for prediction in the future regarding the outcome of microbe-host interactions. Chaotic systems are inherently unpredictable, although it is possible to generate short-term probabilistic models, as is done in applications of stochastic processes and machine learning to weather forecasting. Information on the dynamics of a system is also essential for understanding the reproducibility of experiments, a topic of great concern in the biological sciences. Our study finds preliminary evidence for chaotic dynamics in infectious diseases.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer