1. Introduction
Data visualization and multivariate data analysis are an active and current research area in applied statistics, engineering, and data mining. They become an increasingly popular area for displaying and exploring complex and multidimensional data involving several application domains (finances, engineering, and healthcare) in which the relationships between many attributes are of vital interest [1,2]. Multivariate statistical methods are designed to simultaneously analyze data sets of multiple variables and are used to model different forms of relationships among variables.
Biomedical data are usually multidimensional and multimodal [3,4]. For that reason, biomedical data mining research requires the use of multivariate analysis to study the association between various variables whilst controlling for several other variables [5]. Multivariate data visualization is therefore strongly motivated to investigate the interrelationships between different data attributes, to identify, cluster, and correlate the underlying data.
In this paper, we investigate the multivariate relationship between knee kinematics and clinical parameters of patients who suffer from end-stage knee osteoarthritis (OA). The kinematic parameters are first derived from a gait analysis, which is one of the promising assessment tools used in orthopedic research for assessing changes in gait of knee OA patients. Gait analysis, including joint kinematics, kinetics, and dynamic EMG data, provides objective and shareable data. This analysis was initially introduced to clinical practice [6] and is one of the specific methods used to gain useful clinical knowledge on the functional status of the join.
Clinical gait analysis is the process of recording and interpreting biomechanical measurements during gait in order to assist in treatment decision-making for groups of patient with gait dysfunction. Thereby, appropriateness of gait analysis prescription and reliability of data obtained are required in the clinical environment. Clinical gait analysis [7] is performed to allow help in the clinical decision making regarding treatment options including the conservative (non-surgical) or surgical treatment. This is based on one or more diagnosis, assessment, monitoring, and prediction. The most common use is for assessment of patients with a known condition prior to planning treatment [8]. According to Baker [6], the usefulness of clinical gait analysis is now widely accepted and there is now a growing demand for this service. The principal clinical domains of the clinical use of gait analysis are: cerebral palsy, stroke, traumatic brain injury, and lower limb amputation. At present, the clinical gait analysis operates in the classification of a group of patients in specific normal or abnormal patterns of movement with particular impairments of body structure and function. Certain criteria must be fulfilled for gait analysis to be useful in the clinical evaluation of patients, the biomechanical parameters must (1) correlate with the functional capacity of the patient, (2) supply additional, more relevant information than the clinical examination, (3) be accurate and repeatable, (4) result from a test which does not alter the natural performance of the patient, and (5) be interpreted by experienced clinicians [6].
In this context of clinical gait analysis, we propose a technical approach that aims at understanding the association between kinematic measurements and clinical data related to the functional status of the patient. In that way, we will be able to identify some gait characteristics associated with clinical measures that should therefore be considered in the treatment decision process.
Many studies analyze the correlation between kinematic features of gait waveform data and clinical parameters by using bivariate analysis [9,10]. Through some bivariate analysis results, gait and biomechanical changes were associated with knee OA severity levels [11,12]. For the end-staged knee OA patients, correlations were found between kinematics features in the sagittal plane and clinical measures such as symptoms during specific tasks, active range of motion, functional test, and clinical frailty scale [10]. Associations were also found between radiographic features and clinical functional status in severe knee OA [13], and between physical activity level and physical performance for patients with end-stage knee osteoarthritis [14]. However, a multivariate analysis is more adapted to investigate the correlation between kinematic and clinical parameters, because it can evaluate the strength of the relationship considering the complexity of biomechanical data [15] and the interrelationships of the two datasets of parameters, as we showed in our previous study [16].
The aim of this paper is to investigate visually the multivariate correlation between two datasets of kinematic measurements and clinical parameters which are measured during specific tasks and functional tests of patients with end-stage knee OA. 2. Materials and Methods This study performs, first, a regularized canonical correlation analysis (RCCA) to evaluate the multivariate relationship between clinical and kinematic datasets and, second, a combined visualization method to better understand the relationships between these multivariate data. Univariate and multivariate statistical techniques are two important methods for understanding and analyzing detailed statistical data. Univariate analysis is the precursor to multivariate analysis, and knowledge of the first is needed to understand the second. Univariate correlation analysis is a simple linear correlation in which the relationship between two variables depends on their constant. Multivariate (canonical) correlation analysis (CCA) is used to explain as much as possible the variance derived from the relationships between two groups of numerical variables in a reduced dimension space. The CCA may suffer from instability. That is why a regularized approach of CCA (RCCA) is investigated in this study. 2.1. Regularized Canonical Correlation Analysis (RCCA)
Canonical correlation analysis (CCA) is the multivariate extension of linear correlation analysis that has been described by Hotelling [17]. The main purpose of CCA is the exploration of sample correlations between two sets of variables whose roles in the analysis are strictly symmetric and are observed on the same individual. The CCA is currently used in some disciplines such as ecology [18,19], biology [20,21], hydrology [22,23], and health [16].
In this study, the CCA is performed to investigate the relationships between two sets of variables, namely the kinematicXand clinicalYdatasets, measured on the same individuals.
LetX(X1,X2,…,Xp)andY(Y1,Y2,…,Yq)be two matrices of standardized variables (centered and reduced) of ordern×pandn×q, respectively. Wheren=143OA patients,p=69kinematic variables andq=42clinical variables. The CCA consists in finding in stepwise optimization problem two vectorsAsandBs, fors=1,…,min(p,q), that maximize the correlations between the linear combinations:
Us=a1s X1+a2s X2+…+aps Xp=XAs
Vs=b1s Y1+b2s Y2+…+bqs Yq=YBs
assuming that vectorsAsandBsare normalized so that var(Us) = var(Vs) = 1.
In other words, the problem consists in solving
ρs=cor(Us,Vs)=maxAs,Bs cor(XAs,YBs)
subject to var(XAs) = var(YBs) = 1 and the restriction thatcor(Ut,Us)=cor(Vt,Vs)=0for1≤t<s.ρsis called the canonical correlation,UsandVs(UtandVt, respectively), are known as the canonical variate vectors. The coefficient vectorsAsandBs are known as the canonical weights (or loadings), canonical vectors, or canonical coefficients. The Pillai’s trace test is used for the statistical significance of the canonical correlation model [24,25].
The CCA may suffer from instability. Indeed, when the variable related to a data set are highly correlated (nearly collinear), the sample correlation matrix tends to be ill-conditioned and its inverse unreliable. In that case, a regularized approach of CCA (RCCA) solves the instability of the loadings due to multicollinearity by adding a regularization term on the diagonal of the ill-conditionned matrices, i.e., the covariance matrices [20]. A regularization step in the computations of classical CCA consists in the regularization of the covariance matricesSXXandSYYofXandY, respectively, by adding a multiple number of the corresponding identity matrices:
SXX(λ1)= SXX + λ1 Ip
SYY(λ2) = SYY+ λ2 Iq
whereλ1andλ2are non-negative numbers (also called regularization terms) imposed to the diagonal of the covariance matrices such thatSXX(λ1)andSYY(λ1) becomes regularized and nonsingular [20]. The regularization terms,λ1andλ2, associated to each dataset are chosen by cross-validation in order to maximize the first canonical correlation.
2.2. RCCA Cross-Correlation Matrix
TheXandYvariables are projected onto a low-dimensional space. Lets=2for the chosen dimension to adequately account for the data correlation. The projection is done on the equiangular vectorWkbetween the canonical variatesUkandVk(k=1,2) represented byWk=(Uk+Vk)/2. In CCA, the variablesXandYare symmetrically analyzed, and it is clear thatWare the closest (most correlated) variable toXandY [26]. To identify and grouping each pair of strongly correlated variables, we use a standard measure to quantify the most important cross-correlations, which is based on the scalar product of theXandYcoordinates on the canonical axesWk(k=1,2).
Letxandybe the coordinates of the variablesXandY, respectively, on the axesW1andW2, which are the correlations between the variablesX(orY) andWk:x=(cor(X,W1);cor(X,W2))andy=(cor(Y,W1);cor(Y,W2)). The MatrixZ=xy′(0≤Z≤1) represents an approximation of the cross-correlation score ofXandY, using the scalar product between the variables vectorsxandy. Indeed, the scalar product is defined as the product of the two vectors lengths and their cosine angle:Z=∥x∥ ∥y∥ cosθ(x,y), whereθ(x,y)is the angle betweenxandy. Therefore, the cross-correlation value should be stronger or weaker depending on the lengths and the angles between the variable vectors; the higher the norm of the projected vectors, the stronger their relationship. Furthermore, the variablesXandYhave a strong relationship if they are projected in the same (or opposite) direction, so the cosine is close to 1 (resp. of −1).
2.3. Graphical Visualizations Different graphical representations to visualize and interpret the results of RCCA are investigated: (1) the correlation variable representation that has the ability to identify the between-correlated clusters of each type of variables and (2) the cross-correlation variable representation which provides additional visual inferences, allowing the combination between a selected subsets of the between-correlated clusters that have the higher correlation. The relevance of relationships can be controlled by tuning a threshold of correlation. The combined visualization of these graphical outputs is important and immediately becomes imminent for clustering the most relevant correlated variables and highlighting omitted knowledge about multivariate correlation.
2.3.1. Representation of Canonical Variable Correlation
The correlation variable representation is performed using a biplot to visualize the variables of two different types. This representation discerns the structure of correlation between the two sets of variablesXandY. For a given pair(t,s)of axes such that1≤t<s, variables plots can be considered with respect toWtandWs. The correlation is between the two types of projected variablesXandYonto the space spanned by the first components retained in the analysis (W1andW2). The correlation points of each variables are visualized as a scatter plot on the axesW1andW2 . Therefore, the correlation variable is represented by a correlation circle plot [20] by plotting the coordinates vectors ofxandyin a 2-dimensional cartesian coordinate system. The closer the distance to the circle of radius1, the stronger is the correlation between the variables. The cosine angle between any two segments related to each points represents the nature of correlation (negative, positive, or null) between two variables.
2.3.2. Cross-Correlation Variable Representation
As a simple graphical display of a correlation matrix, heatmap, and relevance network are two visualizations of the cross-correlation matrixZthat give an alternative view of the biplot and shows clearly the patterns of negative/positive correlations.
- Visualizing the cross-correlation variable by the heatmap.
Heatmap for one matrix is a 2-dimensional visualization of the correlation matrix with rows and/or columns reordered according to some hierarchical clustering method to identify interesting patterns. Generated dendrograms (tree diagrams) from clustering are added to the image to represent the arrangement of the clusters. The nature of the correlation between variables (positive, negative, strong, or weak) is represented by color, whereas the proximity between correlated variables is represented by the dendrogram. Usually, we try to find well-visible rectangles of the same color corresponding to the long sections of the dendrograms to identify the clustered correlations [27].
- Visualizing the cross-correlation variable by the relevance network.
Relevance network is a simple approach for modeling correlation structures between two datasets. This approach generates a graph where nodes represent variables, and edges represent variable correlations. The relevance network, in our study of interest, is displayed through the use of connections between variables of two different types. The relevance networks represent simultaneously positive and negative correlations; the edge colors indicate the nature of the correlation (positive, negative, strong, or weak). It allows the identification of clusters or sub-networks of subsets of variables. Each of these clusters highlight a specific correlation structure between the two types of variables [28].
2.4. Data Collection and Parameters Extraction
The dataset consists in 143 severe or end-staged knee OA patients with clinically and radiographically confirmed knee osteoarthritis. Participants were asked to answer the Oxford Knee Score (OKS) and Pain Catastrophizing Scale (PCS) questionnaires and some measures related to the clinical symptoms during specific tasks and functional tests. For the biomechanical data, participants underwent an in-clinic 3D knee kinematic analysis using the KneeKG TM system (Emovi, Canada), which is an objective tool of gait analysis, enabling the biomechanical assessment of the behavior of the knee joint during gait and providing a precise quantitative information [29].
The data collection was approved by Nova Scotia Health Authority Research Ethics Board (Reference number: NSHA ROMEO 1016253), institutional ethics committees of the University of Montreal Hospital Research Center (Reference numbers: CE 10.001-BSP and BD 07.001-BSP), and of the École de technologie supérieure (Reference numbers: H20100301 and H20170901). All subjects provided an informed consent before participation.
2.4.1. Kinematic Data
For each participant, continuous series of 3D knee kinematics of full gait cycles during treadmill walking were collected. 3D kinematics describes the three rotations occurring at the knee joint, flexion/extension (in the sagittal plane), abduction/adduction (Abd/Add) (in the frontal plane), and internal/external rotation (Int/Ext rot) (in the transverse plane). Each gait cycle kinematic pattern was then interpolated to a hundred points (from 1% to 100% of the gait cycle) and averaged to obtain a representative mean gait pattern for each participants and for each rotations. Then, among these 100 measurements for each kinematics curves, a set of 69 parameters routinely assessed in clinical biomechanical studies for knee OA populations has been extracted. These parameters are based on maximum and minimum, varus and valgus thrust, angles at initial contact, and mean values and range of motion (ROM) throughout gait cycles or specific gait subcycles (i.e., loading, stance, swing, etc.). Hereafter, the variables representing these parameters are notedX1,X2,…,X69.
2.4.2. Sociodemographic and Clinical Data
Clinical and sociodemographic data regroup 42 measures (Table 1): (1) Sociodemographic characteristics, (2) the degree of end-stage OA severity, (3) the symptoms during specific tasks, and (4) functional tests.
1.
Sociodemographic characteristics (Y1–Y5) such as age and gender.
2.
The degree of end-stage OA severity (Y6) is estimated using Kellgren and Lawrence (KL) scores.
3.
The symptoms during specific tasks are measured using the questionnaires OKS (Y7–Y19) and PCS (Y20–Y33 ). The OKS is a 12-item questionnaire developed specifically for measuring the outcome of total knee replacement (TKR) [30]. It consists of questions covering function and pain associated with the knee. The OKS has demonstrated good validity, reliability, and sensitivity in patients undergoing TKR [31] and thus can be considered a reliable and valid measurement for outpatients with OA. The PCS is a 13-item questionnaire along with three subscale scores assessing rumination, magnification, and helplessness (Y34–Y36 ) developed tohelp quantify an individual’s pain experience, asking about how they feel and what they think about when they are in pain [32].
4.
The functional tests (Y37–Y42) simulates the forces encountered during sport-specific activity under controlled clinical conditions.
3. Results and Discussion
Kinematic (Xmatrix), sociodemographic, and clinical data (Ymatrix) were structured in a single database in order to identify the relationships between the two datasets. In this study, the analyses were carried out using mainly the package MixOmics of R.
We evaluated the statistical significance of the RCCA. The p-value for Pillai’s trace statistic is<0.001. The first two canonical correlations values corresponding to the first two dimensions are equal to0.80and0.79, respectively. These high correlation values show that we can consider a multivariate correlation model and that two dimensions are sufficient to understand the association between the two sets of variables. For the graphical outputs, we empirically adjust a threshold value to0.35such that we can highlight the most important correlations between the two sets of dataXandY.
The value of the threshold has been fixed to 0.35 on the basis of the graphical representation of canonical variable correlation (Figure 1). Indeed, most of the variable points are located around the inner-correlation circle with cut-off equal to 0.5. Therefore, we fixed the correlation threshold below this value, i.e., 0.35.
The representation of correlation and cross-correlation variable are displayed in Figure 1, Figure 2 and Figure 3. We distinguish and deduce three clusters (C1, C2, and C3) of highly correlated variables of both sets of data, i.e., each cluster regroups the variablesX(kinematic measurements) and the variablesY (clinical parameters). The clusters are visualized in Figure 1, only the variables that are closely located inside the circle of radius between 0.35 and 1 are represented. The clusters C1 and C2 regroup subsets of parameters fromXandYvariables. C3 regroups the variablesY3andX46which are highly and positively correlated. The correlation between C1 and C2 is strongly negative. The correlation between C3 and each one of C1 and C2 is negligible because the angle between each pair of clusters (C1, C3) and (C2, C3) is apparently right, so that the correlation is almost null.
CCA, which is a linear subspace analysis, involves the projection of two variable setsXandYonto a joint subspace such that the correlations between the projected vectors of variables are maximized. The correlation between clusters (between all variables) is quantifying, as explained before, by means of the first two canonical correlations values corresponding to the first two dimensions which are equal to0.80and0.79, respectively.
The representation of cross-correlation variable is shown by the heatmap in Figure 2 and relevance network in Figure 3. For the hierarchical clustering in the heatmap, the Euclidean distance was used as proximity measure and the Ward’s method was used for the agglomeration hierarchical clustering procedure [33,34]. The correlation between groups of the variableXandY is represented by a colored block. The two clusters C1 and C2 identified in Figure 1 are clearly highlighted in Figure 2 by the blue and red blocks that correspond to the two long sections of the dendrograms and are summarized in a single subnetwork in Figure 3, which visualizes simultaneously positive and negative correlations of variables to represent only the strongest cross-correlations.
The representation of canonical variables (Figure 1) and of the cross-correlation variables (Figure 2 and Figure 3) highlights the importance of a combined analysis. For instance, the cluster C3 in Figure 1, which combines the pair of parameters{Y3,X46} , is not visualized by the cross-correlation variable representations in Figure 2 and Figure 3 because of the neglected correlation between C3 and the other two clusters C1 and C2. For that reason, the combination of correlation and cross-correlation variable representations implies to consider the cluster 3 with the clusters 1 and 2 as the groups of variables highlighting the highest correlations. From a clinical point of view the cluster C3 is important. As, it emphases the relationship between the BMI and internal/external tibial rotation angle at the end of push-off phase.
Following the combination of these representations, the correlated clinical and kinematic parameters are described in Table 2. Cluster C1 regroups kinematic parameter in the sagittal plane related to flexion/extension ROM (X52) and transverse plane, which include only the tibial rotation variation value between the end of loading phase and initial contact. Each of these parameters in sagittal and transverse plane are highly and positively correlated with the functional tests indicating the active knee flexion ROM (Y37) and flexion contracture (Y38) (or extension ROM). The couple of parameters that is important is the one that connects the flexion/extension ROM (X52) with the index of joint extension ROM (Y38), this corresponds to the red rectangle in the heatmap (2) where correlation are higher than0.5.
The second cluster C2 regroups five clinical parameters and seven kinematic parameters. The clinical parameters consist in the functional test of the Clinical Frailty Scale (CFS) index and the Oxford questionnaire (using the scale 1 to 5 of feeling experiencing activities of daily living during the past 4 weeks) that includes four measures Q4, Q6, Q11, and the total Oxford score, which reflects the severity of problems that the patients have with their knee. The kinematic parameters comprise only those of the sagittal plane: flexion angle at initial contact (X13andX48) and the end of terminal stance phase (X25), and minimum flexion/extension angle during the loading phase, the stance phase, the stance and early swing phases, and during the gait cycle. The functional test of CFS (Y42) and the Q4 Oxford Question (Y10) have the largest positive correlations with the kinematic parameters in the sagittal plane.
The third cluster C3 relates the internal/external tibial rotation angle at the end of push-off phase and the BMI by a considerable positive correlation (near the inner circle of correlation with a cut-off =0.5 in Figure 1).
Finally, the between cluster analysis shows that the correlation between the clinical variables in cluster C1 and the kinematic variables in cluster C2 are strongly negative. The same statement is observed for the correlation between the clinical variables in cluster C2 and the kinematic variables in cluster C1. Moreover, the correlations are negligible between the third cluster C3 and the other two clusters (C1 and C2). As explained above, the angle between each pair of clusters (C1, C3) and (C2, C3) is apparently right, so that the correlation is almost null.
The correlation analysis is expanded to a higher level of threshold. Figure 4 shows a cross-correlation variable representation by relevance networks with a cut-off=0.40. We notice that only one subset (of few variables) remains important with higher correlation level. The number of correlated variables is reduced for the most important clinical parametersY38(in cluster C1) strongly correlated with nine kinematic parameters (in clusters C1 and C2). This analysis shows that increasing the threshold value establishes the most important correlated parameters. However, it leads to the loss of certain correlations that remain significant.
The analysis of correlation between kinematic features of gait waveform data and clinical parameters has been, basically, done using a univariate correlation analysis [9,10,11,12]. The later investigate the relationship between a pair of two variables individually, which could occult important correlation as shown in our previous study [16]. CCA belongs to the family of exploratory analysis and allows investigation of the (simultaneous) relationship between two sets of variables. The multivariate relationship depends on the covariance derived from the two sets of variables which is not the case with the univariate correlation. Moreover, the former is more adapted to investigate the correlation between kinematic and clinical parameters considering the complexity of biomechanical data [15].
4. Conclusions The purpose of this research was to explore some interesting visual approaches for better analyzing and explaining the multivariate relationships between kinematic measurements and clinical parameters. We performed RCCA and implemented three types of insightful graphical outputs to better understand and interpret the results. The complementarity of these graphical displays was illustrated and allowed us to extract and deduce clusters of sub-sets of variables that are highly correlated, taking into account the positive and negative correlations between variables. Some parameters of the OKS are clustered with more particular kinematic parameters than others. This was evident by the graphical representation of the data. These associations would not be seen with a regular univariate analysis. The kinematic parameters are in the sagittal plane (according to cluster C2), and it would therefore be possible to consider paying particular attention to these parameters to improve the OKS score. Furthermore, the BMI must be considered if it would act to reduce the rotation deficits in the transverse plane (according to cluster C3). In conclusion, the visualizations allow a fairly accurate description of the multivariate correlation and aid to identify the clusters underlying higher cross-correlation. Moreover, the combined visualization of the graphical outputs summarizes multivariate data into understandable and meaningful forms for determining the patterns of cross-correlation of the variables relating to the two sets of data. Finally, the method proposed in this study highlights hidden information and allows more understanding of the multivariate correlation. This study could be expanded using a canonical partial correlation analysis that studies the correlation between two sets of data by controlling another set of confounding variables whose effect will give misleading results if they are not partialled out.
Figure 1. Correlation variable representation with a threshold=0.35. The radius of the inner circle represents the correlation with a cut-off=0.5.
Figure 2. Cross-correlation variable representation by heatmap with a threshold=0.35. The red and blue color indicate the positive and negative correlation, respectively.
Figure 3. Cross-correlation variable representation by relevance networks with a cut-off=0.35.
Figure 4. Cross-correlation variable representation by relevance networks with a cut-off=0.40.
Y | Variable Description and Range | Mean (SD) |
---|---|---|
Y1 | Age (years) (40 to 90) | 64.75 (8.89) |
Y2 | Gender (0:Women; 1:Men) | Women: n = 83; Men: n = 60 |
Y3 | BMI-Body Mass Index (kg/m2) (21.8 to 58.5) | 32.37 (6.05) |
Y4 | Measure of health-related quality of life (0.03 to 1) | 0.61 (0.22) |
Y5 | Visual Analogue Scale for Pain (0 to 95) | 44.34 (24.60) |
Y6 | Degree of end-stage OA severity (0:Moderate; 1:Severe) | (Moderate: n = 71; severe: n = 72) |
Y7,…,18 | Q1, …, Q12 Oxford Question 1 to 12 from Oxford Questionnaire (1 to 5) | 1.75 to 3.81 (0.82 to 0.76) |
Y19 | Total Oxford Score: Normalized overall Oxford score (15 to 54) | 33.15 (9.12) |
Y20,…,32 | Q1,..., Q32 PCS: Question 1 to 13 from PCS Questionnaire (0 to 4) | 0.61 to 1.38 (1.01 to 1.33) |
Y33 | Total PCS score: Normalized overall PCS score (0 to 52) | 13.14 (13.22) |
Y34 | Rumination score related to depression (0 to 16) | 4.43 (4.58) |
Y35 | Magnification: Measure of enlargement (0 to 12) | 2.98 (3.14) |
X36 | Helplessness: Measure of Helplessness-related quality of life (0 to 24) | (5.85) (6.11) |
Y37 | Index Joint Flex: knee flexion ROM (Range Of Motion) (45 to 154) | 123.30 (12.15) |
Y38 | Index Joint Ext: knee extension ROM (−29 to 11) | −7.75 (7.02) |
Y39 | 4 metres Walk (S): Time taken to walk 4 meters, Gait speed in m/s (1.6 to 10.8) | 3.40 (1.25) |
Y40 | Timed Up and Go (S): The progress of balance, sit to stand, and walking (4.7 to 20.9) | 9.21 (3.08) |
Y41 | 30-s Chair (Reps): Sit-to-stand repetitions (0 to 20) | 9.36 (3.73) |
Y42 | Frailty Index: Clinical Frailty Scale (CFS) (1 to 6) | 2.68 (1.12) |
Clusters | Variables | Characteristics |
---|---|---|
C1 | X67 X51, X52, X19 | Absolute variation in tibial rotation between the end of loading phase and heel strike Flex/Ext total range of motion, absolute value of the Flex/Ext angle, Flex/Ext angle. |
Y38 Y37 | Index Joint Ext: knee extension range of motion (ROM) Index Joint Flex: knee flexion ROM | |
X13 | Flexion angle at heel strike (at initial contact) | |
X48 | Absolute value of flexion angle at heel strike | |
X25 | Flexion angle at the end of terminal stance phase | |
X49 | Absolute value of the minimum Flex/Ext angle during loading phase | |
X22 | Minimum Flex/Ext angle (Min Flex/Ext) during the gait cycle | |
C2 | X20, X23 | Min Flex/Ext during stance phase, Min Flex/Ext during stance and early swing phases |
Y42 | Frailty Index: Clinical Frailty Scale (CFS) | |
Y10 | Q4 OKS: For how long have you been able to walk before pain becomes severe? | |
Y12 | Q6 OKS: Have you been limping when walking, because of your knee? | |
Y17 | Q11 OKS: Could you do the household shopping on your own? | |
Y19 | Total Oxford Score | |
C3 | X46 | Internal/external tibial rotation angle at the end of push-off phase |
Y3 | BMI |
Author Contributions
Formal analysis, methodology, and investigation, F.B. Validation and supervision, N.M. Data curation: Y.O. Writing-original draft preparation, F.B. Writing-review and editing, F.B., N.M., N.H. and A.F. Clinical interpretation: G.R., A.F., N.H. and M.D. All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by the Canada Research Chair on Biomedical Data Mining (950-231214).
Conflicts of Interest
The authors declare no conflict of interest.
1. Chan, W.W.Y. A survey on multivariate data visualization. Department of Computer Science and Engineering. Hong Kong Univ. Sci. Technol. 2006, 8, 1-29.
2. McLeod, A.I.; Provost, S.B. Multivariate data visualisation. Wiley StatsRef: Statistics Reference Online. 2014. Available online: http://fisher.stats.uwo.ca/faculty/aim/2003/mviz/ (accessed on 1 September 2019).
3. Garcia-Milian, R.; Hersey, D.; Vukmirovic, M.; Duprilot, F. Data challenges of biomedical researchers in the age of omics. PeerJ 2018, 6, e5553.
4. Arrais, J.P.; Lopes, P.; Oliveira, J.L. Challenges storing and representing biomedical data. In Proceedings of the Symposium of the Austrian HCI and Usability Engineering Group, Graz, Austria, 25-26 November 2011; Springer: Berlin/Heidelberg, Germany, 2011; pp. 53-62.
5. Singanamalli, A.; Wang, H.; Lee, G.; Shih, N.; Rosen, M.; Master, S.; Madabhushi, A. Supervised multi-view canonical correlation analysis: Fused multimodal prediction of disease diagnosis and prognosis. In Medical Imaging 2014: Biomedical Applications in Molecular, Structural, and Functional Imaging; International Society for Optics and Photonics: San Diego, CA, USA, 2014; Volume 9038, p. 903805.
6. Baker, R.; Esquenazi, A.; Benedetti, M.G.; Desloovere, K. Gait analysis: Clinical facts. Eur. J. Phys. Rehabil. Med. 2016, 52, 560-574.
7. Brand, R.A. Can biomechanics contribute to clinical orthopaedic assessments? Iowa Orthop. J. 1989, 9, 61.
8. Baker, R. Gait analysis methods in rehabilitation. J. Neuroeng. Rehabil. 2006, 3, 4.
9. Wilson, J.A.; Deluzio, K.; Dunbar, M.; Caldwell, G.; Hubley-Kozey, C. The association between knee joint biomechanics and neuromuscular control and moderate knee osteoarthritis radiographic and pain severity. Osteoarthr. Cartil. 2011, 19, 186-193.
10. Bensalma, F.; Dunbar, M.; Whynot, S.; Fuentes, A.; Macdonald, H.; Ouakrim, Y.; Richardson, G.; Mezghani, N. Correlations between kinematics and clinical measures in end-staged knee osteoarthritis patients. In Proceedings of the 20th Biennial Meeting of the Canadian Society for Biomechanics, Halifax, Nova Scotia, 14-17 August 2018; SCITEPRESS: Setúbal, Portugal, 2018; p. 165(P074).
11. Astephen, J.L.; Deluzio, K.J.; Caldwell, G.E.; Dunbar, M.J. Biomechanical changes at the hip, knee, and ankle joints during gait are associated with knee osteoarthritis severity. J. Orthop. Res. 2008, 26, 332-341.
12. Astephen, J.L.; Deluzio, K.J.; Caldwell, G.E.; Dunbar, M.J.; Hubley-Kozey, C.L. Gait and neuromuscular pattern changes are associated with differences in knee osteoarthritis severity levels. J. Biomech. 2008, 41, 868-876.
13. Barker, K.; Lamb, S.E.; Toye, F.; Jackson, S.; Barrington, S. Association between radiographic joint space narrowing, function, pain and muscle power in severe osteoarthritis of the knee. Clin. Rehabil. 2004, 18, 793-800.
14. Thomas, S.G.; Pagura, S.M.; Kennedy, D. Physical activity and its relationship to physical performance in patients with end stage knee osteoarthritis. J. Orthop. Sports Phys. Ther. 2003, 33, 745-754.
15. Mezghani, N.; Mechmeche, I.; Ouakrim, Y.; Mitiche, A.; de Guise, J.A. An analysis of 3d knee kinematic data complexity in knee osteoarthritis and asymptomatic controls. PLoS ONE 2018, 13, e0202348.
16. Bensalma, F.; Mezghani, N.; Ouakrim, Y.; Fuentes, A.; Choinière, M.; Bureau, N.J.; Durand, M.; Hagemeister, N. A multivariate relationship between the kinematic and clinical parameters of knee osteoarthritis population. BioMed. Eng. Online 2019, 18, 58.
17. Harold Hotelling. Relations between two sets of variates. Biometrika 1936, 28, 321-377.
18. Robert, G. Canonical Analysis: A Review with Applications in Ecology; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012; Volume 12.
19. Legendre, P.; Legendre, L.F. Numerical Ecology, 2nd ed.; Elsevier: Amsterdam, The Netherlands, 1998.
20. González, I.; Déjean, S.; Martin, P.G.; Baccini, A. CCA: An R package to extend canonical correlation analysis. J. Stat. Softw. 2008, 23, 1-14.
21. González, I.; Déjean, S.; Martin, P.G.; Gonçalves, O.; Besse, P.; Baccini, A. Highlighting relationships between heterogeneous biological data through graphical displays based on regularized canonical correlation analysis. J. Biol. Syst. 2009, 17, 173-199.
22. Rice, R.M. Using canonical correlation for hydrological predictions. Hydrol. Sci. J. 1972, 17, 315-321.
23. Cavadias, G.S. The canonical correlation approach to regional flood estimation. Reg. Hydrol. 1990, 191, 171-178.
24. Seber, G.A. Multivariate Observations. Biom. J. 1986, 28, 766-767.
25. Pillai, K.C.S. Some new test criteria in multivariate analysis. Ann. Math. Stat. 1955, 26, 117-121.
26. González, I.; Lê Cao, K.A.; Davis, M.; Déjean, S. Insightful graphical outputs to explore relationships between two 'omics' data sets. BioData Min. 2013, 5, 19.
27. González, I.; Lê Cao, K.A.; Davis, M.J.; Déjean, S. Visualising associations between paired 'omics' data sets. BioData Min. 2012, 5, 19.
28. Rohart, F.; Gautier, B.; Singh, A.; Lê Cao, K.A. mixomics: An R package for 'omics feature selection and multiple data integration. PLoS Comput. Biol. 2017, 13, e1005752.
29. Lustig, S.; Magnussen, R.A.; Cheze, L.; Neyret, P. The KneeKG system: A review of the literature. Knee Surg. Sports Traumatol. Arthrosc. 2012, 20, 633-638.
30. Xie, F.; Ye, H.; Zhang, Y.; Liu, X.; Lei, T.; LI, S.C. Extension from inpatients to outpatients: Validity and reliability of the oxford knee score in measuring health outcomes in patients with knee osteoarthritis. Int. J. Rheum. Dis. 2009, 14, 206-210.
31. Dawson, J.; Fitzpatrick, R.; Murray, D.; Carr, A. Questionnaire on the perceptions of patients about total knee replacement. J. Bone Jt. Surg. Br. Vol. 1998, 80, 63-69.
32. Sullivan, M.J.; Bishop, S.R.; Pivik, J. The pain catastrophizing scale: Development and validation. Psychol. Assess. 1995, 7, 524.
33. Ward, J.H., Jr. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 1963, 58, 236-244.
34. De Amorim, R.C. Feature relevance in ward's hierarchical clustering using the L p norm. J. Classif. 2015, 32, 46-62.
Fatima Bensalma1,2,*, Glen Richardson3, Youssef Ouakrim1,2, Alexandre Fuentes2, Michael Dunbar3, Nicola Hagemeister2 and Neila Mezghani1,2,*
1Centre de Recherche LICEF, TELUQ University, Montréal, QC H2S 3L5, Canada
2Laboratoire de recherche en imagerie et orthopédie (LIO), CRCHUM, Montréal, QC H2X 0A9, Canada
3Division of Orthopaedic Surgery, Dalhousie University, Halifax, NS B3H 4R2, Canada
*Authors to whom correspondence should be addressed.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2020. This work is licensed under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
This paper aims to analyze the correlation structure between the kinematic and clinical parameters of an end-staged knee osteoarthritis population. The kinematic data are a set of characteristics derived from 3D knee kinematic patterns. The clinical parameters include the answers of a clinical questionnaire and the patient’s demographic characteristics. The proposed method performs, first, a regularized canonical correlation analysis (RCCA) to evaluate the multivariate relationship between the clinical and kinematic datasets, and second, a combined visualization method to better understand the relationships between these multivariate data. Results show the efficiency of using different and complementary visual representation tools to highlight hidden relationships and find insights in data.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer