ARTICLE
Received 1 Jun 2015 | Accepted 6 Sep 2015 | Published 14 Oct 2015
Jessica L. Nielson1, Jesse Paquette2, Aiwen W. Liu1, Cristian F. Guandique1, C. Amy Tovar3, Tomoo Inoue4, Karen-Amanda Irvine5, John C. Gensel6, Jennifer Kloke7, Tanya C. Petrossian8, Pek Y. Lum9,Gunnar E. Carlsson7,10, Geoffrey T. Manley1, Wise Young11, Michael S. Beattie1, Jacqueline C. Bresnahan1 & Adam R. Ferguson1,12
Data-driven discovery in complex neurological disorders has potential to extract meaningful syndromic knowledge from large, heterogeneous data sets to enhance potential for precision medicine. Here we describe the application of topological data analysis (TDA) for data-driven discovery in preclinical traumatic brain injury (TBI) and spinal cord injury (SCI) data sets mined from the Visualized Syndromic Information and Outcomes for Neurotrauma-SCI (VISION-SCI) repository. Through direct visualization of inter-related histopathological, functional and health outcomes, TDA detected novel patterns across the syndromic network, uncovering interactions between SCI and co-occurring TBI, as well as detrimental drug effects in unpublished multicentre preclinical drug trial data in SCI. TDA also revealed that perioperative hypertension predicted long-term recovery better than any tested drug after thoracic SCI in rats. TDA-based data-driven discovery has great potential application for decision-support for basic research and clinical problems such as outcome assessment, neurocritical care, treatment planning and rapid, precision-diagnosis.
1 Department of Neurosurgery, Brain and Spinal Injury Center, University of California, San Francisco, 1001 Potrero Avenue, Building 1, Room 101, San Francisco, California 94143, USA. 2 Tagb.io, 1 Quartz Way, San Francisco, California 94131, USA. 3 Department of Neuroscience, Ohio State University, 460 West 12th Avenue, 670 Biomedical Research Tower, Columbus, Ohio 43210, USA. 4 Department of Neurosurgery, Tohoku University Graduate School of Medicine, Sendai city, Miyagi prefecture 980-0856, Japan. 5 Department of Neurology, San Francisco VA Medical Center, University of California San Francisco, San Francisco, California 94110, USA. 6 Department of Physiology, Spinal Cord and Brain Injury Research Center, Chandler Medical Center, University of Kentucky Lexington, B463 Biomedical & Biological Sciences Research Building, 741 South Limestone Street, Kentucky 40536, USA. 7 Ayasdi Inc., 4400 Bohannon Drive Suite #200, Menlo Park, California 94025, USA. 8 GenePeeks, Inc., 777 Avenue of the Americas, New York, New York 10001, USA. 9 Capella Biosciences, 550 Hamilton Avenue, Palo Alto, California 94301, USA. 10 Department of Mathematics, Stanford University, Building 380, Stanford, California, 94305, USA.
11 Department of Cell Biology and Neuroscience, W.M. Keck Center for Collaborative Neuroscience, Rutgers University, Piscataway, New Jersey 08854, USA.
12 Department of Neurosurgery, San Francisco VA Medical Center, University of California San Francisco, San Francisco, California 94110, USA. Correspondence and requests for materials should be addressed to A.R.F. (email: mailto:[email protected]
Web End [email protected] ).
NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 1
& 2015 Macmillan Publishers Limited. All rights reserved.
DOI: 10.1038/ncomms9581 OPEN
Topological data analysis for discovery in preclinical spinal cord injury and traumatic brain injury
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581
Bioinformatics approaches for precision medicine are gaining momentum as biomedical researchers grapple with overwhelming amounts of data generated by all areas of
science in the era of big-data1,2. The central nervous system (CNS) injury literature seeks to understand the multifaceted effects of injuries to the brain and spinal cord by collecting high-volumes of detailed information on individual subjects, ranging from histological, physiological and bio-behavioral outcomes to health records from therapeutic trials. The sheer volume of data presents a problem for managing and interpreting therapeutic ndings without computational assistance35. Informatics tools are currently being developed in preclinical and clinical CNS injury studies6,7, and resources such as the Neuroscience Information Framework (NIF, http://www.neuinfo.org/
Web End =http://www.neuinfo.org/ ), ClinicalTrials.gov (http://www.clinicaltrials.gov
Web End =www.clinicaltrials.gov) and PubMed (http://www.ncbi.nlm.nih.gov/pubmed/
Web End =http:// http://www.ncbi.nlm.nih.gov/pubmed/
Web End =www.ncbi.nlm.nih.gov/pubmed/ ) offer user-friendly query interfaces to bridge knowledge that exists in biomedical research. However, there remains a lack of user-friendly statistical integration and visualization tools that can be applied to primary research data from multifaceted CNS disorders.
In this sense, translation of basic research into clinical therapeutics can be conceptualized as a big-data integration issue. Thousands of studies have been published aiming to characterize spinal cord injury (SCI) and traumatic brain injury (TBI) from a basic scientic view point, yet we still do not fully understand these complicated disorders. In addition, few therapies have navigated successfully through clinical trials into standards for patient care810. The emerging eld of precision medicine seeks to apply analytics and data-visualization tools3,4 to improve understanding and treatment of complex disorders such as SCI and TBI11.
The present study applies a data-analytic approach, topological data analysis (TDA)12, for improved discovery of fundamental syndromic injury patterns and assessment of precision therapeutic targeting from preclinical drug trials in SCI and TBI. TDA couples unsupervised pattern detection13 and network visualization12 to rapidly extract the full syndromic injury disease taxonomy from the full set of inter-correlated biological, behavioral and health outcomes in diverse SCI14 and TBI15 animal research data. Harnessing a TDA framework, data-driven navigation of the syndromic space is performed by rapid colour-based re-mapping of individual outcomes onto the network to improve interpretation of histopathology, functional recovery and experimental therapeutic effects. TDA helps facilitate the identication of novel relationships in complex, heterogeneous data sets, allowing for data-driven hypothesis generation that may uncover mechanisms for increased morbidity following SCI and TBI. TDA uncovered location-specic impact of SCI and TBI polytrauma on recovery of forelimb function, and differential sensitivity of forelimb measures and locomotion measures in cervical SCI. Application of TDA to preclinical therapeutic trials revealed irreproducible efcacy of methylprednisolone (MP) and minocycline treatment between cervical and thoracic SCI, yet uncovered the novel discovery that perioperative hypertension predicts worse neurological recovery following thoracic SCI.
ResultsInitial attempts to visualize the syndromic space following CNS injury in rodents and nonhuman primates have revealed proof-of-concept multivariate relationships of tissue pathology and functional recovery, with each dimension showing specic sensitivity to different injury models13,14,16. Visualizing the syndromic space through traditional methods such as principal components analysis (PCA) requires database querying, statistical coding and graphical programming. These requirements
disempower basic researchers and clinicians by limiting rapid and actionable access to syndromic ndings (Fig. 1a,b). In contrast, TDA can apply PCA through singular value decomposition (SVD) to reveal the complex multivariate relationship of all predictor and outcome variables simultaneously as a network diagram, where similar individuals are clustered into nodes, and clusters that share one or more individuals are joined by an edge (Fig. 1c). The full syndromic topological map provides a platform for rapid and intuitive exploration of the data set in an unbiased, data-driven manner (Fig. 1d). Once the network is generated, the shape of the data set can be investigated to understand the relationship of each variable across the topological syndromic space to identify groups of clustered individuals that can be further probed for specic relationships among outcomes, validation and targeted hypothesis testing.
TDA uncovers complex SCI and TBI outcome by injury location. To test the application of TDA to CNS injury research, we assessed the syndromic network topology of a recently developed rodent model of combined SCI and TBI. The results of the functional and histopathological decits of this model have been described at the univariate level15, with only a subset of endpoints reaching signicance (Fig. 2, bar graphs), leading to potentially unclear conclusions about outcome. TDA combined with SVD rapidly re-evaluated the ndings across all endpoints simultaneously. Subjects were mapped into the network based on functional (Fig. 2a,b; Supplementary Software 1, dropdown) and histopathological outcomes (Fig. 2c,d; Supplementary Software 1, dropdown), showing a distinct separation of each injury model into sub-networks (Fig. 2e; Supplementary Software 1, dropdown). Sham and TBI-only subjects clustered into distinct regions in the network. SCI-only subjects and SCI TBI contralateral to each
other clustered together into a separate sub-network in the topology, demonstrating worse outcome than the other injury groups (Supplementary Software 1, dropdown). In contrast, subjects with SCI TBI on the ipsilateral side (Fig. 2, circled)
clustered near the sham condition in a sub-network that mapped to better performance on measures of forelimb function (Fig. 2a; Supplementary Software 1, dropdown). This multidimensional difference between ipsilateral and contralateral TBI occurred despite equally-sized lesions (Fig. 2b; Supplementary Software 1, dropdown). Although the univariate effects of lesion location were subtle and varied in their statistical signicance across endpoints, (Fig. 2a,c), TDA uncovered a dramatic multidimensional effect when the full ensemble of endpoints was used to render the full syndromic space. Together, these TDA ndings reveal the clear separation of syndromic features of compound injuries according to location, providing a proof-of-concept for application of TDA in poly-traumatic CNS injury.
TDA reveals forelimb outcomes most sensitive to cervical SCI. To test the application of TDA combined with SVD to SCI, we assembled raw data from several common SCI models, including hemisections, weight-drop and force-driven hemi-contusion injuries to the cervical spinal cord (Fig. 3; Supplementary Software 2, dropdown). Grooming behaviour and paw preference in a cylinder reveal graded levels of recovery (Fig. 3a), that map to lesion size, tissue sparing and deformation (Fig. 3b; Supplementary Software 2, dropdown). However, measures of open-eld locomotion for both forelimb (Fig. 3a; Supplementary Software 2, dropdown) and hindlimb (Supplementary Software 2, dropdown) do not show much variability in recovery of function. In the syndromic topology, grooming function has the strongest visual mapping to lesion size (Fig. 3b; Supplementary Software 2,
2 NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581 ARTICLE
a Metric: normed correlation
all outcome measures
b Point cloud: subjects in syndromic space
c Topological resampling:
d Extracted syndromic network topology
Sham
TBI
SCI
TBI Contra + SCI Ipsi
TBI Ipsi + SCI Ipsi
All outcome measures
Lens:
SVD
1.50.5 0.5 1.5
1.50.5 0.5 1.5
PC3
PC3
2.5
2.5
1.5
1.5
0.5
0.5 1.5
0.5
0.5 1.5
PC2
2.5
2.5 1.5 0.5
PC1
PC2
2.5
2.5 1.5 0.5
PC1
1.0 0 +1.0
Pearson correlation
Figure 1 | Topological representation of the syndromic space using TDA. (a) Data sets containing functional and histological outcomes were analysed using TDA from a bivariate correlation matrix of all outcomes. (b) Data were processed with the principal and secondary metric singular value decomposition (SVD) lens to generate the syndrome space. (c) TDA resamples the syndromic space many times to link subjects into nodes (red/blue circles) and connects overlapping subjects with edges (purple line) (d) to create a robust network topology for rapid visualization and interpretation of outcomes over time.
dropdown), whereas recovery of paw preference in the cylinder shows a stronger visual mapping with white matter sparing. Little to no variability is seen in the hindlimb open eld (Supplementary Software 2, dropdown), most likely because it was designed to measure hindlimb coordination following bilateral thoracic injuries17, whereas these subjects received various grades of unilateral cervical injuries. There was some variance in the measure of forelimb open eld for the most severe injuries (Fig. 3a, circle), however, it did not map to the full range of lesion pathology. TDA enabled rapid multivariate visualization of group differences with a quicker turn-around for interpretation.
Differential mapping of injuries in the syndromic network. To understand how injury models map onto the SCI syndromic space, we recoloured the network using categorical experimental SCI groups (Fig. 3c; Supplementary Software 2, dropdown: sham, hemisection, contusion and so on), and observed biomechanical tissue deformation (mm) measured at the time of injury by servo-feedback position detectors on the SCI contusion devices (Fig. 3b; Supplementary Software 2, dropdown). Each injury group occu-pies a distinct section of the network (Supplementary Software 2, sham, hemisection, 75 kdyn and 100 kdyn force-driven contusions18; 6.25 mm and 12.5 mm weight-drop contusions19; red nodes), validating TDA syndromic comparisons of pathology and function across multiple injury models. Mapping tissue changes onto the network (Fig. 3b; Supplementary Software 2, dropdown) conrmed that tissue changes vary as a function of injury group (Fig. 3c; Supplementary Software 2, dropdown) and predict
subject positions within the full syndromic network space (Supplementary Software 2). Lesion size shows less variability between the different injury models, with the exception of the most severe 12.5 mm contusions (Supplementary Software 2, dropdown). These larger lesions are conrmed visually in the network, where larger lesion pathology corresponds to 12.5 mm weight-drop injuries and 100 kdyn force-driven injuries. White matter sparing (Fig. 3b; Supplementary Software 2) shows a wide range of graded severities for the contusion injuries (weight drop and force driven), and hemisection injuries show a substantial loss in white matter (Supplementary Software 2, dropdown). This pattern is conrmed in the network, with the distribution of nodes with the most white matter sparing appearing on the perimeter of the bottom are. Motor neuron (MN) sparing along the rostro-caudal axis of the lesion (Supplementary Software 2, dropdown) is the histological feature most sensitive to injury in this data set; nearly all of the contusion subjects had large-scale loss of MNs, even with the mildest of injuries. This sensitivity of MN loss is visually reected in the network topology where only the shams and hemisection regions of the ares show MN sparing (Supplementary Software 2, dropdown), illustrating the vulnerability of this cell type to contusive spinal cord damage.
Visually guided data exploration uncovers drug effects. We identied the nodes within the topology that stood out as having poor functional recovery on grooming and forelimb open eld (Fig. 3a, circles), despite nodes in this region showing less-severe
NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 3
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581
a
e
Grooming
Paw preference Forelimb openfield
Hindlimb openfield
Extracted TBI-SCI syndromic network topology injury groups
0
1
2
Behavior
3
4
5
Sham TBI SCI TBI contra + SCI Ipsi TBI Ipsi + SCI Ipsi
Lowest values Highest values
Grooming
Laterality index
Forelimb openfield
Hindlimb openfield
Sham
SCI + TBI contra
100%
Group membership
5
1.0
Paw preference
Martinez scale
21
0
21
0
18
Univariate grooming score
0
4
0.8
18
15
15
0.6
12
2
9
BBB score
3
12
0.4
9
1
0.2
6
6
3
3
0
b
TBI
Grooming Hindlimb openfield
Forelimb openfield
Paw preference
Syndromic TDA
c
SCI tissue sparing
Motor neuron sparing
Epicenter
SCI
Histopathology
Contra TBI sparing Ipsi TBI sparing
Sham TBI SCI TBI contra + SCI Ipsi TBI Ipsi + SCI Ipsi
Ipsi TBI sparing
SCI sparing
100
Contra TBI sparing
100
100
20
0
Univariate
% Sparing
60
Cell counts
Motor neuron sparing
80
80
% Sparing
% Sparing
80
15
60
60
10
40
40
40
20
20
20
5
0
0
0
d
Lowest values Highest values
SCI + TBI Ipsi
Contra TBI sparing Ipsi TBI sparing
SCI tissue sparing Motor neuron sparing
Syndromic TDA
0%
Figure 2 | Histo-behavioural network topology of combined TBI-SCI model. (a,b) Behavioral outcomes of forelimb function and (c,d) histopathology were mapped onto the topological network using TDA. Data from this model shows a distinct recovery pattern depending on whether the combined TBI is contralateral (contra) or ipsilateral (ipsi) to the SCI. (e) Each injury group occupies a distinct region of the network topology, highlighted as red nodes for 100% enrichment (heat map) for each particular injury model. Sham controls (n 9) and TBI-only (n 10) subjects are located in the right cluster.
SCI-only (n 10) and SCI TBI contra (n 10) are both located in the left cluster. SCI TBI ipsi (n 10) interestingly are grouped next to the sham
subjects in the right cluster (circled part of the network), due to a syndromic functional recovery similar to shams (a), despite showing no difference in pathology compared with subjects with SCI alone or SCI TBI contra (c). All outcome averages and injury models were exported into an HTML
gure (Supplementary Software 1) for rapid visualization and user-guided exploration of the syndromic topological space in this data set.
injuries based on the degree biomechanical tissue deformation (Fig. 3b, circle). This sub-network was also signicantly enriched for 12.5 mm weight-drop contusions, yet we noticed that not all injuries of this type performed so poorly. To probe factors that might contribute to abnormally bad function, we drilled into this effect. We compared subjects in these nodes with 12.5 mm contusions that performed well on forelimb open eld using a ranked KolmogorovSmirnov (KS) test. The ranked KS test is analogous to a gene-set enrichment analysis here applied to identify predictor and outcome metric sets (rather than gene sets) that are most sensitive to group conditions (hypothesis testing)20. KS tests between nodes within the high and low functioning groups uncovered an external predictor (not included in the generation of the network) that could account for functional differences:
subjects were part of a preclinical trial of two anti-inammatory drugs: minocycline and MP and no-drug controls. The network was then recoloured based on treatment condition to highlight nodes enriched for drugs and 12.5 mm weight-drop contusions (Fig. 3d, red nodes, no-drug control (n 11 original subjects;
pure nodes 7, n 8), minocycline (n 11 original subjects;
pure nodes 4, n 6) and MP (n 10 original subjects; pure
nodes 2, n 4)). KS test results comparing treatment groups
enriched in the network suggested signicant differences on several outcomes based on t-test and KS test P values between groups (Po0.05; Fig. 3e) in the TDA-identied responder
subjects. To independently conrm this, we performed a one-way analysis of variance (ANOVA) on the TDA-identied subject subsets, conrming signicant drug effects on MN sparing
4 NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581 ARTICLE
a
Paw preference Grooming Forelimb openfield
d
No drug(7 nodes, 8 subjects)
Minocycline(4 nodes, 6 subjects)
Methylprednisolone (2 nodes, 4 subjects)
Behavior
b
e
Data-driven discovery
Lesion size Tissue deformation
Histopathology
White matter sparing
Motor neuron sparing P = 0.02
Tissue area at epicenterP = 0.002
Grooming 28DPLP = 0.07
% Of contralateral
30
20
10
0
% Area
80
70
60
50
40
Grooming score
6
4
2
0
#
Lowest values Highest values
Numerical heat map
No drug (n = 8)
#
Minocycline (n = 6)
Methylprednisolone (n = 4)
c
f
Superset cross-validation
Sham Force-driven
contusions
Weight-driven contusions
Motor neuron sparing P = 0.006
Tissue area at epicenter P < 0.0001
Grooming 28DPL P = 0.04
Methylprednisolone (n = 10)
#
*
Injury device
% Of contralateral
30
20
10
0
#
% Area
80
70
60
50
40
Grooming score
6
4
2
0
Low High Injury device categorical enrichment
No drug (n = 11)
Minocycline (n = 11)
Figure 3 | Data-driven discovery of decits in rats in cervical SCI drug trials. (a) Behavioural decits in forelimb function were identied in the syndromic network (circled area). (b) Visual mapping of histopathology patterns in the network did not identify similar patterns to explain behavioral decits, despite less tissue deformation in this portion of the network. (c) Enrichment for injury condition revealed these subjects were given the same type injury (weight-drop contusions, 12.5 mm, Supplementary Software 2). Data-driven exploration of these subjects within the network identied a no-drug controlled trial of minocycline and methylprednisolone (MP). (d) Nodes containing subjects signicantly enriched for respective drug condition, and 12.5 mm weight-drop injuries were isolated (red nodes) for group comparisons using the KolmogorovSmirnov test (KS test). (e) The three outcomes with the smallest P values from the KS test results were identied in the sub-selection of subjects identied for each treatment condition. Results revealed signicant MN loss in subjects receiving MP (n 2 nodes, 4 subjects) compared with minocycline (n 4 nodes, 6 subjects) and no-drug controls (n 7 nodes, 8 subjects)
(P 0.02, F(2,15) 5.21, Z2 0.41, 1 b 0.74), and signicantly less tissue area at the injury epicentre in both minocycline and MP-treated subjects,
compared with no-drug controls (P 0.002, F(2,15) 10.02, Z2 0.57, 1 b 0.96). Non-signicant functional decits in grooming were observed 28
days post lesion (DPL) (P 0.07, F(2,15) 3.18, Z2 0.30, 1 b 0.52). (f) Validation of these signicant detrimental treatment effects were found in the
entire superset of subjects for both MN sparing (P 0.006, F(2,29) 6.08, Z2 0.30, 1 b 0.85) and total tissue area at epicentre (Po0.0001,
F(2,29) 19.94, Z2 0.60, 1 b 1.0), and grooming at 28 DPL was also signicant (P 0.04, F(2,29) 3.68, Z2 0.20, 1 b 0.63). Box and whisker
plots show mean and minimum/maximum range of values. P values represent overall treatment effect using one-way ANOVA. Post hoc pairwise comparisons between each drug condition identied signicant decreases in MN sparing in MP-treated subjects, and more tissue area in no-drug controls (#,signicantly different from both groups; *Po0.05). All outcomes at each time point, location of injury conditions and treatment groups are mapped onto the HTML network Supplementary Software 2.
(P 0.02) and tissue area at the epicentre (P 0.002), with other
outcomes approaching signicance, including grooming at 28 days post lesion (DPL) (P 0.07). The effect size (Z2) and power
calculation (1 b) values (detailed in Fig. 3e legend) suggest that
the TDA-identied subset of subjects and outcome metrics for each group had large effect sizes21, yielding high power, despite the limited n in the subpopulations of interest. Post hoc means testing was performed on signicant main effects of treatment using pairwise comparisons between all treatment conditions with multiple-comparison correction. MP-treated subjects had signicantly less MN sparing compared with both no-drug controls (P 0.03) and minocycline-treated subjects (P 0.006),
but no difference in MN sparing was found between no-drug and minocycline treatment groups (P 0.33). For total tissue area at
epicentre, control subjects showed signicantly greater tissue compared with both MP (P 0.001) and minocycline (P 0.006)
subjects, but no difference in tissue area was found between MP and minocycline (P 0.24). These statistical results suggest that
MP signicantly reduced MN sparing, and both MP and minocycline impacted total tissue area at the epicentre. After interviewing the original data donors, we discovered that data from this drug trial was not previously published because treatments were thought not to show functional benets (the le-drawer phenomenon).
The TDA-identied subpopulation analysis suggested that minocycline and MP had effects on a subset of endpoints, in a subset of the individuals. To conrm the generality of these effects, we next tested for the effects of minocycline and MP on MN sparing, total tissue area and grooming function on the full data set (superset cross-validation) from this drug trial (Fig. 3f). One-way ANOVA and post hoc testing of individual treatment groups was performed with the same criteria as the comparisons in Fig. 3e. Results conrmed a signicant main treatment effect for MN sparing (P 0.006), with the MP group showing
signicantly less MN sparing than either minocycline (P 0.002) or no-drug controls (P 0.04), but not between
NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 5
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581
no-drug and minocycline (P 0.21). Total tissue area at the
epicentre also had a main treatment effect (Po0.0001), with no-drug controls showing signicantly more total tissue compared with both minocycline (Po0.0001) and MP (Po0.0001) groups, but not between minocycline and MP (P 0.79) groups. Last,
grooming function at 28 DPL had a signicant main treatment effect (P 0.04), with the MP group showing signicant
functional decits compared with no-drug controls only (P 0.02), with non-signicant grooming decits found in
minocycline compared with no-drug controls (P 0.06) or MP
(P 0.49). Together this suggests that MN sparing and tissue
area at the epicentre were the major drivers of the network-detected effects, with more modest contributions by other variables.
Conicting cross-validation and irreproducible drug effects. TDA-based data-driven discovery revealed a hidden nding in legacy data that MP was potentially detrimental in cervical SCI. To test whether the same might be true in thoracic SCI, we pooled data from the VISION-SCI database14 containing other subjects that were part of controlled MP drug trials. A previously conducted trial from the Multicenter Animal Spinal Cord Injury Study (MASCIS)19 was identied, which contained a larger cohort of subjects (1996, N 72). TDA was performed using the
same PCA/SVD lens and norm correlation metric used on the cervical data set (Fig. 3; Supplementary Software 2, dropdown) to cross-validate the detrimental effects of MP on this independent
thoracic data set. Identication of nodes in the thoracic network receiving either vehicle control or different doses of MP (coded as MP1, MCP in Supplementary Software 3, dropdown) did not show the same detrimental effects in either functional recovery measured by locomotion with the BBB or tissue sparing at the injury epicentre (Supplementary Software 3, dropdown). A separate analysis on the same data set using TDA was performed with the L-innity centrality lens, which attempts to cluster subjects in the network based on maximal distance between subjects and how far they are from the group norm12. TDA revealed that subjects were distributed along three main ares in the network. Identication of nodes enriched for either vehicle (Fig. 4a) or MP-treated subjects (Fig. 4b) revealed that a maximum of 50% group membership was represented in the red nodes in the network. Location of these nodes for each treatment condition within the network was visually mapped to functional recovery of BBB (Fig. 4c) and tissue sparing at the injury epicentre (Fig. 4d). Querying all subjects within this trial that received either vehicle control (N 10) or MP (N 12) did not show signicant group
difference on either BBB locomotor recovery (Fig. 4e, P 0.73)
or tissue sparing at the epicentre (Fig. 4f, P 0.15). The results
suggest that MP treatment had no signicant effect in the thoracic SCI, in contrast to the deleterious effect observed in the cervical SCI trial (Fig. 3; Supplementary Software 2, dropdown). On the whole, these previously unpublished preclinical ndings seem to conrm the lack of denitive data from preclinical trials of MP in SCI.
a b
Vehicle Methylprednisolone
BBB Tissue sparing
BBB
43%
50%
0%
0%
c d
10.5
31%
0
3%
e f
Tissue sparing
8 P=0.73
25 P=0.15
6
20
Locomotor score
15
4
% Sparing
10
2
5
0
0
Vehicle Methylprednisolone
Figure 4 | Cross-validation attempt of MP in thoracic SCI L-innity centrality network. TDA was performed on data mined from the VISION-SCI repository, queried based on subjects that were part of treatment trials testing MP (MP1 and MCP) following SCI (N 72). Location of treatment groups
within the network for either (a) vehicle-treated control or (b) MP-treated subjects are shown, however, no nodes were 100% pure for either treatment condition, suggesting treatment was not a signicant predictor of placement of subjects within this network. (c) BBB recovery and (d) total tissue sparing at the injury epicentre were mapped into the network to identify the range of recovery in this data set (red better recovery, blue worse recovery).
Grouping subjects in the data set based on treatment condition did not reveal the same signicant decits observed in the cervical trial for MP for either (e) recovery of locomotor recovery measured by the BBB (P 0.73), or (f) the total tissue sparing at the epicentre (P 0.15). However, there was a trend
towards less tissue sparing in subjects that received MP, similar to histopathology observed in cervical SCI (Fig. 3). The most striking difference in the network were subjects who had very large differences in tissue sparing along the top arm of the network, yet showed similar ranges of BBB functional recovery, which are explored further in Fig. 5. Histograms plotted as means.e. Student t-test used for signicance testing between treatment groups.
6 NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581 ARTICLE
Data-driven discovery that hypertension predicts dysfunction. Application of TDA in the context of cross-validation testing of MP treatment following thoracic SCI revealed an unexpected and much stronger predictor of neurological recovery than any of the drug conditions. Visually guided exploration of TDA sub-networks uncovered unusually large differences in functional recovery on the BBB locomotor scale in putatively identical injury severities. We isolated these disparate subject populations and categorized them into groups for further comparisons (Fig. 5a, circles). Nodes containing subjects that received identical 25 mm weight-drop contusions were grouped and compared with a KS test to identify measures that signicantly differed between these two groups and also mapped to signicant functional differences on the BBB (P 0.0002). This data-drill down revealed that blood
pressure spikes at the time of SCI signicantly differed between high and low locomotor recovery subgroups (KS 0.7, P 0.03),
suggesting that subjects with poorer outcome may have had hypertensive mean arterial pressure (MAP) at the time of injury.
Cross-validation and conrmation of hypertension hypothesis. The data-driven discovery of hypertension as a major predictor of SCI recovery from semi-structured big-data could potentially represent a capitalization on chance22. To explicitly rule this out, we performed two waves of additional analyses. First, we independently cross-validated the TDA-based data-driven discovery, by curating an additional data set from subjects with less-severe injuries (12.5 mm) from a separate round of the MASCIS trial (19941995, N 154)
(Fig. 5b) queried from the VISION-SCI repository. Nodes within the network that received thoracic 12.5 mm weight-drop injuries showed distinct subpopulations with signicant differences in BBB locomotor recovery (Fig. 5b, circles, P 0.01). KS testing of the
good versus bad recovery subgroups within this network conrmed
that hypertensive events (maximum MAP) during surgery predicted lower locomotor recovery in the chronic phase (KS 0.6,
P 0.0009).
Second, we explicitly tested the formal hypothesis that perioperative hypertension predicts long-term outcome using a repeated measures general linear model (GLM) on the 1996 and the 19941995 data sets. Explicit hypothesis testing separately conrmed the hypothesis that perioperative MAP (covariate) predicted poorer functional recovery of BBB (dependent) between 1 and 6 weeks post injury (repeated measure). In both data sets, post-injury MAP (15 min after SCI) signicantly predicted the main effect of recovery of BBB locomotion following injury (1996, F(5,20) 3.701, P 0.02; 19941995, F(5,110) 2.671, P 0.03).
TDA-based data-driven discovery versus traditional tools. The fact that TDA-guided discovery uncovered a novel nding that was hiding in plain sight in 20-year-old data, provides strong potential support for this approach. However, we wondered whether a similar set of results could have been revealed using traditional analytics. To test this, we pooled all data from MASCIS (N 334) in the VISION-SCI repository and performed side-
by-side bivariate correlational analysis and TDA (Fig. 6). Pearson correlation conrmation of the signicant inverse correlation between elevated perioperative blood pressure and BBB functional recovery was performed by plotting a bivariate correlation matrix for MASCIS OSU trial subjects (N 334) for all measures
of survival, histology, perioperative vitals and blood gases, functional recovery, bladder health and weight over 16 weeks post SCI (Fig. 6a). Blood pressure measures showing the most signicant inverse correlations to BBB recovery were conrmed, with elevated diastolic blood pressure at the time of injury, showing the most signicant negative correlations at multiple
a b
Data-driven discovery - MASCIS 1996 trial Independent cross-validation - MASCIS 19941995 trials
0 Bottom flare
Injury condition
Injury condition
50mm Injury
12.5mm Injury
25 mm injury
12.5 mm injury
Upper group
Lower group
Bottom flare
Outer flare
BBB
MAP at injury
125
80
BBB
Average MAP
110
90
Maximum MAP
143
125
10.5
0
12
0
P = 0.0002
P = 0.03
P = 0.01 12
P = 0.06
P = 0.0009
150
110
Transformed BBB
10
8 6 4 2
MAP (mm Hg)
160 140 120 100
80 60
Transformed BBB
10
140
8 6 4 2 0
MAP (mm Hg)
140
120
100
80
MAP (mm Hg)
130
120
Outer flare
40 Upper group
Lower group
Bottom flare
Outer flare
60 Upper group
Lower group
100 Upper group
Lower group
Figure 5 | Perioperative hypertension predicts worse recovery after thoracic SCI. (a) Exploration of the TDA network from the MASCIS OSU 1996 methylprednisolone trial (N 72) revealed a cluster of subjects in the network given the same targeted injury (circled bottom and outer ares) that showed
very signicant differences in BBB function (P 0.0002). A query of variables with signicant differences based on KS test results between these two
groups uncovered subjects with signicant hypertension during SCI surgery (P 0.03) clustering in the groups with poorer functional recovery. (b) Cross-
validation of these relationships between perioperative blood pressure and functional recovery was performed in a separate group of test subjects from the same 3-year drug trial (MASCIS 19941995, N 154) with matching outcome measures and subject grouping. Visually guided identication of subjects in
the network given the same injury condition (circled upper and lower groups) but showing poorer functional recovery on the BBB scale (P 0.01)
uncovered the same signicant detrimental effect of hypertension during SCI surgery on recovery (P 0.06), specically when assessing peak MAP values
recorded during surgery (P 0.0009). Box and whisker plots show mean and minimum/maximum range of values. P values obtained using student t-test
for signicant differences between groups.
NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 7
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581
a b
MASCIS OSU 19941996 trials - bivariate correlation matrix
MASCIS OSU 19941996 trials - TDA
Survival
0 +1
Tissue deformation Tissue sparing
Histology
Blood pressure
Heart rate
Body temp
Blood gases
Locomotion
Bladder care
Urine content
Weight
Diastolic BP at hit
MAP at hit
Survival
Weight change
BBB
Bladder size
Histology
Survival
Pearson correlation
1
Blood pressure
Heart rate
Body temp
Blood
Gases
Locomotion
Bladder care
Urine content
Weight
P<0.05
Low values
High values
Figure 6 | Comparing traditional tools to TDA in MASCIS data set. (a) A bivariate correlation matrix was generated for every outcome measured over time along with measures of heart rate, blood pressure and blood gases before, during and after surgery. Each variable is correlated to every other variable, with clusters of similar measures represented with larger text, with specics about each measure and collected as the same times post injury (16 weeks post injury). Histology includes tissue deformation and tissue sparing. Blood pressure includes diastolic pressure, mean arterial pressure and systolic pressure at time of hit, 15 min after hit (PostHit) and 15 min before hit (PreOP). Similar time bins were collected for heart rate and body temperature. Blood gases were measured only either before (PreOP) or after (PostOP) injury. Locomotion was measured between 1 and 6 weeks using the BBB scale. Bladder function was monitored daily and binned across each week post injury for bladder voiding/expression, rmness, size and urine content was recorded for colour and pH. Weight change between each week post injury was also recorded to assess health. Numbers along the y-axis are reected in the x-axis to line up variable comparisons. The heat map represents either negative (blue) or positive correlations between each variable within the matrix, with signicant correlations (Po0.05) highlighted with black boxes. Although this method of visualizing correlations is useful for understanding how different measures all relate to each other within the context of all other comparisons, it does not allow for mapping of each test subjects placement within the network based on all these complex relationships. (b) TDA of the same data set reveals the distribution of every subject within the network, from all subjects in the entire OSU MASCIS trial (19941996, N 334). TDA revealed the same visually guided relationships between perioperative blood pressure
and autonomic and locomotor dysfunction following SCI identied in Fig. 5. Complete mapping of all outcomes and perioperative measures of vitals and
blood gases over time were exported into an HTML viewer (Supplementary Software 4).
time points (36 weeks), with MAP and systolic blood pressure only showing a signicant correlation to BBB decits at 5 weeks post injury. Additional measures also showed signicant correlations to BBB recovery in this large correlation matrix, including expected ones such as tissue pathology, bladder care complications and weight gain, with additional measures of body temperature and several blood gas measures during surgery (bicarbonate; blood pH; total carbon dioxide; and partial pressure of carbon dioxide) also showing signicant correlations to BBB. Taken together, we found that the major changes in MAP predicting long-term motor impairment, typically occurring in the range between 100 and 143 mm Hg for upwards of 5 min at a time, occurred immediately post injury, during surgery and in the recovery phase in the animal neuroICU.
Although visualization and interpretation of the complex interactions between all the variables in this data set can, in theory, be achieved by simple correlation, this approach does not identify clusters of subjects that are most sensitive to these interactions across the full spectrum of variables. Navigating the same data set with TDA creates a syndromic map of all subjects based on the full network of correlations, enabling rapid comparative hypothesis testing about factors such as injury condition, recovery rate, autonomic factors or even gender differences (Fig. 6b, Supplementary Software 4, dropdown). All
outcomes measured over time, including BBB locomotion, bladder function, weight, and perioperative blood pressure and blood gases were mapped onto the network for each time point (Supplementary Software 4, dropdown). Additional mapping of enrichment for gender differences in the network revealed that subjects within the nodes that showed the strongest relationship between perioperative hypertension and BBB recovery were mostly males. Due to bladder complications being more pronounced in males following SCI14, and the strong correlation between bladder function and health and recovery of locomotion, males may be more sensitive to the complications of hypertension during surgery, potentially contributing to more autonomic complications post injury, leading to increased morbidity. This TDA-based hypothesis discovery has served to accelerate ongoing interests in our centre in linked preclinical experimental and prospective clinical observational trials of critical care variables including MAP and the use of pressors23,24 to assess the impact of hypertension, as well as hypotension on recovery of function after SCI.
DiscussionWe report the novel application of TDA to extract the fundamental shape of the multidimensional syndromic space
8 NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581 ARTICLE
after CNS damage, using preclinical TBI and SCI data as illustrating examples. TDA-based data-driven analyses revealed a set of important ndings, some of which dramatically conrmed existing hunches in the published literature, whereas others represented novel ndings that were not previously identied, even in legacy data sets (for example, 20-year-old MASCIS studies). As an illustration of value for conrmatory analysis, TDA revealed a dramatic interaction between SCI and concurrent TBI that depended on anatomical location of brain lesions. Although this effect has previously been reported, it only reached signicance on a subset of univariate endpoints15, whereas TDA revealed this effect to be very large and robust at the network level. As an illustration of value for exploratory hypothesis generation, TDA identied potential detrimental consequences of MP treatment on tissue pathology in cervical SCI, and to a lesser extent in thoracic SCI. In attempting to cross-validate this observation, we discovered a novel previously unrecognized relationship between perioperative hypertension and poorer long-term functional recovery. The relationship between perioperative hypotension (o85 mm Hg MAP) and poorer neurological recovery was recently reported in humans with SCI23, providing a basis for clinical relevance in early neurocritical care on outcomes. However the relationship between acute hypertension has not been reported. Further investigation in humans with SCI needs to be conducted to determine if the ndings presented in the current study translate into humans. While the practice guidelines for treatment of acute SCI include avoidance of hypotension, there is little experimental data to support this, and the role of hypertension in outcomes has received less attention23,25,26. The ndings presented here suggest that both extremes in MAP may contribute to increased morbidity following injury. Additional prospective experiments are currently underway in rats to test the mechanism by which hypertension exacerbates functional decits following SCI. Based on the literature, the leading mechanistic candidates include increased oedema cord27, increased hemorrhage28, bloodbrain-barrier breakdown and inux of inammatory cells and cytokines around the injury. Such effects may promote a more cytotoxic spinal cord micro-environment contributing to increased morbidity25,2935. Taken together with the increased autonomic complications that exist in patients following SCI36 (for example, autonomic dysreexia), which may be triggered by haemodynamic events, hypertension during acute management of SCI may pose a signicant risk for patients. This suggests that careful monitoring of blood pressure in acute patients may need to be considered for both the upper (hypertension) and lower limits (hypotension), as both extremes may impact neurological recovery.
The present application of a big-data-analytic tool for novel discovery has broad implications for translational SCI research. Previous work has demonstrated both the univariate and multivariate impact of graded cervical SCI13 in rats and primates14,16, as well as a univariate assessment of a combined SCI and TBI preclinical model in rats15. The results of prior studies show that information about impairment of function and tissue pathology can be understood at the univariate level, and to a greater degree at the multivariate level, providing powerful opportunities for therapeutic discovery that harnesses CNS trauma big-data in ensemble. However, progress is hampered by the intensive data pre-processing typically required before and after analysis to generate a full view of the multidimensional syndromic state in CNS injury37. The difculty of sophisticated analytics may partially account for the slow progress to both understand and successfully treat these complicated CNS injury syndromes. Typical CNS injury studies generate enormous quantities of data, yet only a few of these measures are assessed
at a time. The ability to interpret the full pattern of disease pathology and recovery is further confounded by basic visualization attempts using bar graphs and/or recovery curves combined with potentially inappropriate statistical techniques to detect signicant effects38 in a manner that is at once both prone to false-positives (familywise type-1 error) and wasteful of information (multivariate type-2 error)39. Taken together, these factors may hinder progress to rapidly translate promising preclinical studies into clinical trials, and may point researchers in the wrong direction regarding the conclusions that can be drawn from their studies.
TDA applies the mathematical concepts from geometric topology to unlock relationships in data that would be considered as noise by traditional parametric approaches such as regression and GLMs40. By extracting the fundamental shape from the entire multidimensional data set, TDA ascribes meaning to an otherwise unforeseen pattern of relationships among individuals. The TDA algorithm achieves this goal by iterating through multiple views of lower dimensional shape of the data to extract the persistent shape of the syndromic space across these multiple views using ensemble machine learning. Through this process, TDA can resolve meaningful signal from noise by identifying its true source, improving our understanding of the whole data set. TDA has been used previously to navigate complicated, high-dimensional biological data sets including functional brain connectivity41,42 and biomolecular folding pathways43. Novel applications for TDA in precision medicine are also beginning to appear in the literature. For example, TDA has been used to uncover novel relationships in immune cell reactivity between patients with type-1 and type-2 diabetes44, and in identifying novel subgroups of patients with asthma and the unique relationships of specic T-cell mediated interleukins with these patient subgroups45. Another prominent example of TDAs application towards precision medicine was found in breast cancer data regarding genetic inuences on patient survival that had not been previously identied, even-though the data sets containing this information had been publically available for over 10 years46. Similar methods can now be applied for neurotrauma data sets at both the preclinical and clinical level, given the emergence of large-scale multicentre repositories targeting precision medicine for the CNS11,37.
In the present paper, we expand the concepts of precision medicine to the application of TDA for preclinical translational discovery, using CNS injury data sets containing diverse information from multiple preclinical treatment trials with histopathological, functional and health outcomes. TDA allows for both rapid analysis and rapid visualization of all measures collected in a particular study to increase efciency of recovery testing following injury, and allows drill down into subpopulation clusters for targeted hypothesis testing regarding treatment efcacy across the complex variability that exists in SCI. Due to the high dimensionality of many SCI and TBI data sets, it can be difcult to interpret which measures are sensitive to improved recovery in therapeutic trials, and whether particular subgroups are selectively responsive. As shown in the present paper, the network generated from TDA can then be harnessed to test the generality of therapiesthat is, whether treatments are effective within the full syndromic spaceas well as specic therapeutic features such as determining whether particular outcome measures are more sensitive to therapeutic targeting.
It should be noted that the current work does have limitations. Perhaps the most translationally signicant nding was the identication of a detrimental relationship between perioperative hypertension and long-term locomotor recovery following SCI. It is unclear from mining the animal hospital records what specic mechanisms may lead to animals having hypertensive episodes
NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 9
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581
during SCI operation and recovery. One potential confounder is that variability in anesthesia may impact blood pressure. Although it is difcult to completely discount this possibility in a retrospective study, there is no evidence of systematic variability in anesthesia reected in the detailed perioperative animal care records. In addition, the MASCIS pilot study tested multiple anesthetics and developed a rigid protocol of pentobarbital anesthesia, with the contusion injury delivered at a standardized time point of 1 h, with surgical plane conrmed by areexia for the multicentre study data presented here. However, further experimental studies are needed to assess the impact of anesthetics as a potential mediator of the perioperative hypertensionlocomotion relationship47. Regarding the potential for TDA as a precision medicine tool applied to SCI and TBI research, the current study was performed in inbred animals with consistent graded injuries living in optimal conditions that were tightly controlled within a given study (but highly variable across centres and strain). This intrinsic multicentre variability is useful for providing a proof-of-concept validation of TDA for neurotrauma in the face of potential cross-laboratory variance. However, it remains an open question whether TDA could overcome high variability seen in human clinical data for SCI and TBI, though previous studies using TDA in other diseases demonstrate its value for clinical decision support4446.
In conclusion, rapid visualization and analysis of CNS injury big-data may facilitate rapid, accurate big-data analysis of preclinical and clinical studies, allowing for quicker validation of hypotheses tested. By exploring a large preclinical data set with multiple injury models, outcome measures and study designs, TDA discovered unique features of TBI SCI determined by
injury location, and detrimental inuences of perioperative hypertension on locomotor recovery and bladder function that were previously unpublished. In this sense, TDA presents a powerful and novel bioinformatics tool for the eld of neurotrauma research for testing large, heterogeneous data sets. By mapping all data collected across an entire test subject population as a multidimensional topology, TDA helps extract new knowledge about neurotrauma populations and their associated states of disease and recovery. This may expedite the translational pipeline for therapeutic discovery in neurological disease research.
Methods
Proof-of-concept application of TDA to neurotrauma data sets. TDA was used to rapidly analyse and visualize clustering of individuals based on their similarity across hundreds of variables simultaneously (Fig. 1). TDA is an adaptation of the methods of topology, the mathematical discipline which studies robust methods of measuring and representing shape, to create compact visual representations of high-dimensional data sets40,48. This is performed automatically within the software, by deploying an ensemble machine learning algorithm that iterates through overlapping subject bins of different sizes that resample the metric space (with replacement), thereby using a combination of the metric location and similarity of subjects in the network topology. After performing millions of iterations, the algorithm returns the most stable, consensus vote for the resulting golden network (Reeb graph), representing the multidimensional data shape12,40. The application of this method to our data sets creates clusters of subjects which appear as nodes (points) and relations among clusters are represented as interconnections (edges or lines) between the nodes (Fig. 1d). Once the topological network is developed, rapid exploration of the full neurotrauma syndrome and its various manifestations across different measures can be performed (Supplementary Software 14). Although the application of a licensed version of TDA software was used for the present study through the Ayasdi cloud-based platform (http://www.ayasdi.com
Web End =www.ayasdi.com, v 2.0), open source versions of the program code are available in either Python48 or R49,50.
TDA applied to combined TBI and cervical SCI in rats. We applied TDA to a data set containing several controlled models for combined TBI and SCI in 23-month-old female Long Evans rats (n 49, P 94) from a previously published
study15. Data were analysed using the variance-normalized Euclidean metric
(VNE), which nds the mean and s.d., and rescales the value of the coordinate around its mean by dividing by the s.d. of the set of values taken by the coordinate. This metric calculates the distance between two points, taking into account that each column in the data set could have signicantly different variance. VNE distance between two points X and Y is given by:
VNE X; Y
XN
i1
v
u
u
t
Xi Yi2 Vi
1
Where Vi is the variance associated with each column i and is given by:
Vi
1 M
XM
j1
Zj;i
2 2
And
Zi is the mean of column i and is given by:
Zi
1 M
Zi
XM
j
Zj;i 3
VNE was combined with the principal and secondary metric SVD lenses, which are analogous to PCA. The network was set at a resolution of 30 and a gain of 4.0
(equalized) from which subjects with shared syndromic features were clustered together and distributed into a syndromic network topology (Fig. 2). Adjusting the resolution and gain alters the number of bins and the degree of overlap of these bins. Once the network is extracted, resolution and gain are used to focus the network similar to focusing a microscope on an image. We begin with a standard resolution of 30 and gain of 4.0 and then adjust these parameters to ensure that the majority of subjects are included in a connected node (as opposed to isolated from the network), and that all nodes are connected as a single network (if possible). Changing the resolution and gain alters the number of bins and the degree of overlap of these bins respectively, spreading subjects out across more nodes (high resolution) or forcing more subjects into each node (high gain). Network extraction, focusing and face validation of the syndromic space is based solely on primary outcomes of interest (for example, locomotion), while remaining blind to experimental conditions/predictors. In this sense we begin with the full outcome pattern and then reverse engineer the largest predictors in a data-driven manner.
Variables that were analysed included all available endpoint data, excluding predictor data such as categorical injury condition, gender or treatment. For networks in Figs 13, these endpoint data included injury biomechanics of brain and spinal cord tissue displacement, force and velocity, terminal tissue sparing, weight change, and 6-week time-course data points for measures of grooming, paw preference in the cylinder13,51, the Basso Beattie Bresnahan (BBB) hindlimb locomotor scale13,17, the Martinez scale of forelimb locomotion52 and the Irvine Beatties Bresnahan (IBB) scale for object manipulation53,54. The majority of these variables are conceptualized and listed in Fig. 2a,c, and in the drop-down menu in the living gure Supplementary Software 1. Topologies were colour coded for each injury group,PC1 and PC2 distributions, histopathology and a few key examples of averages over time of functional outcomes (grooming, paw preference, object manipulation, forelimb and hindlimb open eld). These were exported from the cloud into an HTML viewer to rapidly visualize and interpret the relationshipof functional recovery to injury group and tissue pathology (Supplementary Software 1). For visualized distribution of injury models (Fig. 2c), red nodes indicated a pure population for each particular category, which included uninjured sham controls (n 9), mild TBI (n 10), unilateral 75 kdyn force-driven
contusions (n 10), mild TBI contralateral to 75 kdyn force-driven contusion
(n 10, SCI TBI Contra) and mild TBI ipsilateral to 75 kdyn force-driven
contusion (n 10, SCI TBI Ipsi). Schematic diagrams of each injury model
illustrate the placement of each injury (black ellipses) or sham controls (open ellipses) to demonstrate the laterality of each injury model that was tested.
Schematic diagrams for measures of functional recovery (Fig. 2a) and histopathology (Fig. 2b) were created for animal model visualization. Terminal outcomes were then visualized at the univariate level (Fig. 2a,b), which is the current standard in the SCI preclinical literature, showing the distribution of subjects for each injury group (Fig. 2c) for grooming, preference for the uninjured forepaw during vertical exploration in a Plexiglas cylinder, and forelimb and hindlimb locomotion in the open eld (Fig. 2a). Histological measures of tissue in the brain and spinal cord, and MN sparing along the rostro-caudal extent of the injury were also plotted in the same manner (Fig. 2b).
TDA applied to graded unilateral cervical SCI in rats. We applied TDA to graded unilateral cervical SCI in 23-month-old female Long Evans rats (n 132
subjects, P 119 variables, Fig. 3; Supplementary Software 2) from previously
published studies13,51. Data were analysed using the norm correlation metric equationequation (4). This metric normalizes the columns to become comparable. This metric is used when the data columns have ranges and means that vary signicantly. The norm correlation (Corr) distance between two points is given by the Pearson correlation and is given by Corr (X, Y) 1 r(X0, Y0), where
X0, Y0 are the column-wise, mean-centred and variance-normalized versions of X
10 NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581 ARTICLE
and Y.
rX; Y
N
q
N
PNi1 XiYi PNi1 Xi PNi1 Yi
N
Pi X2i Pi Xi
2
q
4
This was combined with the principal and secondary metric SVD lenses. These lenses generate a factorization of the data matrix into linearly uncorrelated components. The principal SVD lens is the highest variance component and the secondary SVD is the second highest variance component. These lenses assume that your data is using the Euclidean metric.
f X
min
Z
Pi Y2i Pi Yi
2
2 5
The analysis was set at a resolution of 50 and a gain of 5.0 (equalized) from
which subjects with shared syndromic features were clustered together and distributed into the syndromic network topology (Fig. 3; Supplementary Software 2, dropdown).
Variables that were analysed included all endpoint data, excluding predictor information about categorical injury condition, gender or treatment. Endpoint data used for Figs 35 include a standardized measure of tissue compression for injury biomechanics across different contusion devices, terminal tissue pathology measured by lesion size and white/grey matter and MN sparing, and 6-week time-course data points for measures of daily or weekly weight change, CatWalk55, grooming, paw preference in the cylinder13,51, BBB hindlimb locomotion17,56,a 4-point measure of forelimb locomotion13 and the IBB scale for object manipulation53,54. Topologies were colour coded for each injury model, PC1 and PC2 distributions, histopathology and a few key examples of functional outcomes (grooming, paw preference, forelimb and hindlimb open eld) at 7, 21 and 42 DPL. These were exported from the cloud into an HTML viewer to monitor recovery of each outcome over time in relation to injury model and tissue pathology (Supplementary Software 2, dropdown). Heat maps for the colour schemes of the ares represent the range of highest values (red) to lowest values (blue) for each respective outcome being visualized (for example, lesion size; blue 0%,
red 100% lesion, Fig. 3b). For visualized distribution of injury models, red nodes
indicated a pure population for each particular category of graded SCI, which included uninjured sham controls (n 16), hemisections (n 9), 75 kdyn (n 31)
and 100 kdyn (n 34) unilateral contusions with the force-driven impactor, and
6.25 mm (n 10) and 12.5 mm (n 32) unilateral contusions with the weight-drop
impactor (Fig. 3; Supplementary Software 2, dropdown).
A detailed interpretation of the syndromic space for graded unilateral cervical SCI has been reported previously13, however, those analyses were performed in SPSS v. 19, and do not allow for rapid analysis and visualization of the syndromic SCI space that is presented here.
Data-driven exploration of preclinical drug trial efcacy. Comparison of continuous variables was performed by two tests: KS test and t-test. The KS test was used to investigate the non-parametric probabilistic distributions of samples across each (one-dimensional) variable, while the t-test explores whether the null hypothesis (mean value of both samples) is supported. Comparison of categorical variables was performed by Fisher exact test. These methods were used to identify group differences from the graded cervical SCI data set between selected nodes that were classied based on a combination of purity for both injury condition (for example, 12 mm weight-drop) and treatment condition (for example, MP, minocycline, No drug; Fig. 3d, red nodes). Nodes satisfying both these criteria were designated as groups and analysed for measures that differentiated the groups from each other. Signicant differences between these groups were based on KS scores with the largest absolute values (0.751.0) and KS P values.
Data-driven exploration of MASCIS as a cross-validation test. Data mined from the VISION-SCI repository14 for previous trials of MP in SCI resulted in identication of the MASCIS preclinical trial from the OSU testing site. This was an NIH-sponsored multicentre trial (19941997) to validate the contusion model for SCI using the weight-drop contusion device19, and to test the efcacy of pharmacological treatments for SCI. Only subjects from year 3 (1996, N 72) had
un-blinded treatment codes in the current version of the database. A norm correlation metric and L-innity Centrality lens (resolution 30, Gain 4.0 ,
equalized) was used to generate the network from 23-month-old rats receiving graded thoracic (T9) bilateral contusions (12.5 and 25 mm injuries, in both males and females, across 6 MP combination treatment conditions) with 49 separate outcome measures. Only endpoint data were used in the analysis, excluding predictor information about categorical injury condition, gender or treatment. Endpoint data used in the analysis included tissue deformation injury biomechanics and vitals measured during the SCI operation, including body temperature, heart rate and blood pressure (systolic, diastolic, mean). Vitals, along with blood gases, were measured using an intra-arterial tail catheter, and averages and maximum values were taken from 15 min before injury (PreOP), at the time of injury and 15 min post injury (PostOP). Post injury functional outcomes included averages for recovery of bladder function, urine content, weight gain and locomotor recovery on the BBB scale during the 6-week time period prior to sacrice and terminal total tissue sparing. The full list of these variables is provided
in Fig. 6b and the drop-down menu of Supplementary Software 4. These data were analysed using L-innity centrality, which groups subjects into nodes in the network using the maximal distance of each subject from all other subjects.
f x
max
Y d X; Y
6 Only subjects in the vehicle (N 10) and MP (N 12) treated groups had
complete data for BBB locomotion and tissue sparing for hypothesis testing about treatment effects (Fig. 4).
Testing perioperative hypertension-recovery association. KS tests were used to compare group differences in the networks generated for the MASCIS OSU trial year 3 data set (N 72) to identify signicant group differences between BBB
functional recovery that were predicted by MAP levels at the time of injury(Fig. 5a). Rats with an age range of 25 months from years 12 of the MASCIS trial (19941995, N 154) with the same 49 outcome measures were analysed using the
same TDA parameters as the 1996 data set to validate the hypothesis that elevated MAP during SCI surgery signicantly predicted poorer functional recovery(Fig. 5b). Conrmation of perioperative MAP levels predicting poorer neurological recovery was performed in SPSS v. 19 using a GLM repeated measures ANOVA. The dependent variable was BBB locomotor score, time points of 16 weeks post injury were the repeated measures, and MAP values at either PreOP, PostOP, or at the time of injury were each used separately as covariates within the GLM and tested on each data set separately (1996 and 19941995). The bivariate correlation matrix comparing all variables measured over time in the entire MASCIS OSU trial (N 334) was generated in SPSS v.19, and two versions were overlaid to depict
both Pearson correlation values and valence (Fig. 6a, redblue heat map for positive or negative correlations, respectively), and the signicance of each correlation (outlined boxes). Comparison of the bivariate correlation matrix to TDA on the same data set and set of variables measured over 6 weeks post SCI (N 334,
P 150; TDA metric norm correlation, L-innity centrality lens, resolution 50,
Gain 4.0 , equalized) was plotted together to assess the greater efcacy of TDA to
perform visually guided comparisons of the networked interactions between all test subjects based on correlations of outcome variables for a more comprehensive, holistic view and exploration of the SCI syndrome (Fig. 6b, Supplementary Software 4, dropdown).
Statistical analysis. Statistical analysis testing between groups for the identied measures were performed in Ayasdi v2.0 for group differences in the network, and plotted for box plots or histograms in GraphPad Prism 5 and analysed for signicance using two-tailed t-tests and one-way ANOVAs in SPSS v19 (Figs 35).
References
1. Celi, L. A., Mark, R. G., Stone, D. J. & Montgomery, R. A. "Big data" in the intensive care unit. Closing the data loop. Am. J. Respir. Crit. Care Med. 187, 11571160 (2013).
2. Larson, E. B. Building trust in the power of "big data" research to serve the public good. JAMA 309, 24432444 (2013).
3. Akil, H., Martone, M. E. & Van Essen, D. C. Challenges and opportunities in mining neuroscience data. Science 331, 708712 (2011).
4. Fox, P. & Hendler, J. Changing the equation on scientic data visualization. Science 331, 705708 (2011).
5. Gough, N. R. & Yaffe, M. B. Focus issue: conquering the data mountain. Sci. Signal. 4, eg2 (2011).
6. Cohen, M. J. et al. Identication of complex metabolic states in critically injured patients using bioinformatic cluster analysis. Crit. Care 14, R10 (2010).
7. Sorani, M. D., Hemphill, 3rd J. C., Morabito, D., Rosenthal, G. & Manley, G. T. New approaches to physiological informatics in neurocritical care. Neurocrit. Care 7, 4552 (2007).
8. Filli, L. & Schwab, M. E. The rocky road to translation in spinal cord repair. Ann. Neurol. 72, 491501 (2012).
9. Siddiqui, A. M., Khazaei, M. & Fehlings, M. G. Translating mechanisms of neuroprotection, regeneration, and repair to treatment of spinal cord injury. Prog. Brain Res. 218, 1554 (2015).
10. Ramer, L. M., Ramer, M. S. & Bradbury, E. J. Restoring function after spinal cord injury: towards clinical translation of experimental strategies. Lancet Neurol. 13, 12411256 (2014).
11. Manley, G. T. & Maas, A. I. Traumatic brain injury: an international knowledge-based approach. JAMA 310, 473474 (2013).
12. Lum, P. Y. et al. Extracting insights from the shape of complex data using topology. Sci. Rep. 3, 1236 (2013).
13. Ferguson, A. R. et al. Derivation of multivariate syndromic outcome metrics for consistent testing across multiple models of cervical spinal cord injury in rats. PLoS ONE 8, e59712 (2013).
14. Nielson, J. L. et al. Development of a database for translational spinal cord injury research. J. Neurotrauma 31, 17891799 (2014).
15. Inoue, T. et al. Combined SCI and TBI: recovery of forelimb function after unilateral cervical spinal cord injury (SCI) is retarded by contralateral traumatic
d Xi; Xj
X L2 Zi; Zj
NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 11
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9581
brain injury (TBI), and ipsilateral TBI balances the effects of SCI on paw placement. Exp. Neurol. 248, 136147 (2013).16. Rosenzweig, E. S. et al. Extensive spontaneous plasticity of corticospinal projections after primate spinal cord injury. Nat. Neurosci. 13, 15051510 (2010).
17. Basso, D. M., Beattie, M. S. & Bresnahan, J. C. A sensitive and reliable locomotor rating scale for open eld testing in rats. J. Neurotrauma 12, 121 (1995).18. Scheff, S. W., Rabchevsky, A. G., Fugaccia, I., Main, J. A. & Lumpp, Jr J. E. Experimental modeling of spinal cord injury: characterization of a force-dened injury device. J. Neurotrauma 20, 179193 (2003).
19. Young, W. Spinal cord contusion models. Prog. Brain Res. 137, 231255 (2002).20. Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression proles. Proc. Natl Acad. Sci. USA 102, 1554515550 (2005).
21. Cohen, J. A power primer. Psychol. Bull. 112, 155159 (1992).22. MacCallum, R. C., Roznowski, M. & Necowitz, L. B. Model modications in covariance structure analysis: the problem of capitalization on chance. Psychol. Bull. 111, 490504 (1992).
23. Hawryluk, G. W. et al. Mean arterial blood pressure correlates with neurological recovery following human spinal cord injury: analysis of high frequency physiologic data. J. Neurotrauma doi:http://dx.doi.org/10.1089/neu.2014.3778
Web End =10.1089/neu.2014.3778 (2015).
24. Inoue, T., Manley, G. T., Patel, N. & Whetstone, W. D. Medical and surgical management after spinal cord injury: vasopressor usage, early surgerys, and complications. J. Neurotrauma 31, 284291 (2014).
25. Guha, A., Tator, C. H. & Rochon, J. Spinal cord blood ow and systemic blood pressure after experimental spinal cord injury in rats. Stroke 20, 372377 (1989).
26. Kong, C. Y. et al. A prospective evaluation of hemodynamic management in acute spinal cord injury patients. Spinal Cord 51, 466471 (2013).
27. Scallan, J., Huxley, V. H. & Korthuis, R. J. Capillary Fluid Exchange: Regulation, Functions, and Pathology (Morgan & Claypool Life Sciences, 2010).
28. Gorelick, P. B. New horizons for stroke prevention: PROGRESS and HOPE. Lancet Neurol. 1, 149156 (2002).
29. Choi, D. W. Excitotoxic cell death. J. Neurobiol. 23, 12611276 (1992).30. Crowe, M. J., Bresnahan, J. C., Shuman, S. L., Masters, J. N. & Beattie, M. S. Apoptosis and delayed degeneration after spinal cord injury in rats and monkeys. Nat. Med. 3, 7376 (1997).
31. Faden, A. I. & Simon, R. P. A potential role for excitotoxins in the pathophysiology of spinal cord injury. Ann. Neurol. 23, 623626 (1988).
32. Ferguson, A. R. et al. Cell death after spinal cord injury is exacerbated by rapid TNF alpha-induced trafcking of GluR2-lacking AMPARs to the plasma membrane. J. Neurosci. 28, 1139111400 (2008).
33. Park, E., Velumian, A. A. & Fehlings, M. G. The role of excitotoxicity in secondary mechanisms of spinal cord injury: a review with an emphasis on the implications for white matter degeneration. J. Neurotrauma 21, 754774 (2004).
34. Tator, C. H. & Fehlings, M. G. Review of the secondary injury theory of acute spinal cord trauma with emphasis on vascular mechanisms. J. Neurosurg. 75, 1526 (1991).
35. Beattie, M. S., Hermann, G. E., Rogers, R. C. & Bresnahan, J. C. Cell death in models of spinal cord injury. Prog. Brain Res. 137, 3747 (2002).
36. Hou, S. & Rabchevsky, A. G. Autonomic consequences of spinal cord injury. Compr. Physiol. 4, 14191453 (2014).
37. Ferguson, A. R., Nielson, J. L., Cragin, M. H., Bandrowski, A. E. & Martone, M.E. Big data from small data: data-sharing in the long tail of neuroscience. Nat. Neurosci. 17, 14421447 (2014).38. Burke, D. A., Whittemore, S. R. & Magnuson, D. S. Consequences of common data analysis inaccuracies in CNS trauma injury basic research. J. Neurotrauma 30, 797805 (2013).
39. Bacchetti, P., Deeks, S. G. & McCune, J. M. Breaking free of sample size dogma to perform innovative translational research. Sci. Transl. Med. 3, 24 (2011).
40. Carlsson, G. Topology and data. Bull. Am. Math. Soc. 46, 255308 (2009).41. Lee, H., Kang, H., Chung, M. K., Kim, B. N. & Lee, D. S. Persistent brain network homology from the perspective of dendrogram. IEEE Trans. Med. Imaging 31, 22672277 (2012).
42. Singh, G. et al. Topological analysis of population activity in visual cortex.J. Vis. 8, 11.111.18 (2008).43. Yao, Y. et al. Topological methods for exploring low-density states in biomolecular folding pathways. J. Chem. Phys. 130, 144115 (2009).
44. Sarikonda, G. et al. CD8 T-cell reactivity to islet antigens is unique to type 1 while CD4 T-cell reactivity exists in both type 1 and type 2 diabetes.J. Autoimmun. 50, 7782 (2014).45. Hinks, T. S. et al. Innate and adaptive T cells in asthmatic patients: relationship to severity and disease mechanisms. J. Allergy Clin. Immunol. 136, 323333 (2015).
46. Nicolau, M., Levine, A. J. & Carlsson, G. Topology based data analysis identies a subgroup of breast cancers with a unique mutational prole and excellent survival. Proc. Natl Acad. Sci. USA 108, 72657270 (2011).
47. Nout, Y. S., Beattie, M. S. & Bresnahan, J. C. Severity of locomotorand cardiovascular derangements after experimental high-thoracic spinal cord injury is anesthesia dependent in rats. J. Neurotrauma 29, 990999
2012:
48. Singh, G., Memoli, F. & Carlsson, G. in Eurographics Symposium on Point-Based Graphics. (eds Botsch, M. & Pajorola, R., 2007 http://danifold.net/mapper/introduction.html
Web End =http://danifold.net/ http://danifold.net/mapper/introduction.html
Web End =mapper/introduction.html .
49. Tausz, A. Persistent Homology in R v. 1.0.3. (2014) http://cran.r-project.org/web/packages/phom/index.html
Web End =http://cran.r-project.org/ http://cran.r-project.org/web/packages/phom/index.html
Web End =web/packages/phom/index.html .
50. Fasy, B. T., Jisu Kim, J., Lecci, F. & Maria C. TDA: Statisticial tools for topological data analysis (2014) https://cran.r-project.org/web/packages/TDA/index.html
Web End =https://cran.r-project.org/web/packages/TDA/ https://cran.r-project.org/web/packages/TDA/index.html
Web End =index.html .
51. Gensel, J. C. et al. Behavioral and histological characterization of unilateral cervical spinal cord contusion injury in rats. J. Neurotrauma 23, 3654 (2006).
52. Martinez, M., Brezun, J. M., Bonnier, L. & Xerri, C. A new rating scale for open-eld evaluation of behavioral recovery after cervical spinal cord injury in rats.J. Neurotrauma 26, 10431053 (2009).53. Irvine, K. A. et al. A novel method for assessing proximal and distal forelimb function in the rat: the Irvine, Beatties and Bresnahan (IBB) forelimb scale.J. Vis. Exp. 46, 2246 (2010).54. Irvine, K. A. et al. The Irvine, Beatties, and Bresnahan (IBB) Forelimb Recovery Scale: an assessment of reliability and validity. Front. Neurol. 5, 116 (2014).
55. Hamers, F. P., Koopmans, G. C. & Joosten, E. A. CatWalk-assisted gait analysis in the assessment of spinal cord injury. J. Neurotrauma 23, 537548 (2006).
56. Ferguson, A. R. et al. A simple post hoc transformation that improves the metric properties of the BBB scale for rats with moderate to severe spinal cord injury. J. Neurotrauma 21, 16011613 (2004).
Acknowledgements
This work was funded by the Craig H. Neilsen Foundation Grant 224308, the National Institutes of Health (NIH) NS067092 (A.R.F.), NS069537 (A.R.F), NS038079 (J.C.B. and M.S.B.), NS031193 and AG032518 (M.S.B. and J.C.B.), NS079030 (J.L.N.), NS032000 (W.Y.; M.S.B. site PI, multicentre study); NYSCoRE CO19772 (M.S.B. and J.C.B.), the Department of Defense (DoD) grants W81XWH-10-1-0910 (M.S.B.) and W81XWH-13-1-0297 (M.S.B.), and Wings for Life Foundation grants WFLUS008/12 and WFLUS006/ 14 (A.R.F.). We thank J.R. Huie and J. Haefeli for useful comments, and A. Lin andS. Visuthikraisee for technical support.
Author contributions
J.L.N. performed all analyses, prepared all gures and drafted the manuscript. J.L.N., J.P. and A.R.F. conceived of the application of TDA to the data sets presented. J.L.N., A.W.L., C.F.G. and A.R.F. all contributed to data wrangling of original data. J.L.N., A.W.L., C.F.G., G.T.M., J.C.B., M.S.B. and A.R.F. all contributed to data provenance in preparation of data visualization with TDA. T.I., K.-A.I., J.C.G., A.C.T., J.C.B. and M.S.B. collected primary experimental data. G.E.C. developed TDA. J.P., J.K., P.Y.L. and G.E.C. worked on the development of the data analytics and visualization software. J.L.N., J.P., J.K. and A.R.F. developed the optimal parameters to apply TDA to the presented data. T.C.P., P.Y.L., G.E.C., M.S.B., J.C.B. and A.R.F. edited the manuscript. G.T.M., W.Y., M.S.B., J.C.B. and A.R.F. were involved in study design and interpretation of the original data sets. J.L.N., J.P., G.T.M., J.C.B., M.S.B. and A.R.F. contributed to the interpretation of the results.
Additional information
Supplementary Information accompanies this paper at http://www.nature.com/naturecommunications
Web End =http://www.nature.com/ http://www.nature.com/naturecommunications
Web End =naturecommunications
Competing nancial interests: J.P., J.K., T.C.P, P.Y.L, and G.E.C. are current or former employees of Ayasdi Inc. and own shares of the company. All other authors declare no competing nancial interests.
Reprints and permission information is available online at http://npg.nature.com/reprintsandpermissions/
Web End =http://npg.nature.com/ http://npg.nature.com/reprintsandpermissions/
Web End =reprintsandpermissions/
How to cite this article: Nielson, J. L. et al. Topological data analysis for discovery in preclinical spinal cord injury and traumatic brain injury. Nat. Commun. 6:8581doi: 10.1038/ncomms9581 (2015).
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the articles Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Web End =http://creativecommons.org/licenses/by/4.0/
12 NATURE COMMUNICATIONS | 6:8581 | DOI: 10.1038/ncomms9581 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright Nature Publishing Group Oct 2015
Abstract
Data-driven discovery in complex neurological disorders has potential to extract meaningful syndromic knowledge from large, heterogeneous data sets to enhance potential for precision medicine. Here we describe the application of topological data analysis (TDA) for data-driven discovery in preclinical traumatic brain injury (TBI) and spinal cord injury (SCI) data sets mined from the Visualized Syndromic Information and Outcomes for Neurotrauma-SCI (VISION-SCI) repository. Through direct visualization of inter-related histopathological, functional and health outcomes, TDA detected novel patterns across the syndromic network, uncovering interactions between SCI and co-occurring TBI, as well as detrimental drug effects in unpublished multicentre preclinical drug trial data in SCI. TDA also revealed that perioperative hypertension predicted long-term recovery better than any tested drug after thoracic SCI in rats. TDA-based data-driven discovery has great potential application for decision-support for basic research and clinical problems such as outcome assessment, neurocritical care, treatment planning and rapid, precision-diagnosis.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer