1. Introduction
In a world that is rapidly embracing precision medicine and patient-centric state-of-the-art tools and technologies designed for healthcare, the quantitative classification of non-specific low back pain (NSLBP) and hence effective personalized treatment, remain elusive in most clinical settings. Indeed, while, several NSLBP classification systems were developed in the last few decades, their incompleteness and/or complexity, have led to the use of simpler tools. These include observational assessment, surveys and questionnaires, such as the STarT Back Screening Tool (STarT) [
1]. This tool classifies NSLBP patients into three specific subgroups—low risk, medium risk and high risk based on 9 questions spanning educational therapy, physical therapy, as well as physical therapy combined with cognitive behavioral therapy. Risk is defined as the risk of developing persistent disabling symptoms [
1]. Simplicity, ease of application and attention to psychosocial factors are key reasons that have made this multi-language questionnaire a valuable screening tool in clinical practice [
2,
3,
4]. Abedi et al. found an approximately 80% correlation between the translated STarT questionnaire and other screening questionnaires, including the Roland Morris Disability Questionnaire (RMDQ), Tampa Scale for Kinesiophobia (TSK), Coping Strategies Questionnaire (CSQ) and Hospital Anxiety and Depression Scale (HADS) [
4]. The McKenzie [
5] method has also demonstrated practicality in clinical settings. This approach divides patients into three mechanical subgroups (derangement, dysfunction or postural syndrome) and applies appropriate exercises [
5].
Marras et al. [
6,
7] used the Quebec Task Force Questionnaire to categorize patients with low back disorders (LBDs), in conjunction with the Lumbar Motion Monitor system (LMM) developed by these authors. They aimed to demonstrate that kinematic parameters, particularly angular velocity and acceleration, can be effectively used for patient classification and distinction between healthy individuals and low back pain (LBP) patients. Using a three-dimensional electromagnetic tracking system, Barrett et al. [
8] indicated that the plane and direction of movement could influence the kinematic variables of low back patients. Sheeran et al. [
9] also used a three-dimensional kinematics motion capture system to assess spinal and pelvic kinematics during sitting and standing activities in healthy and LBP patients. O’Sullivan et al. [
10] introduced a multi-dimensional classification system (MDCS) to stratify patients based on movement and pain-based subsets (flexion pattern (FP), active extension pattern (AEP) and passive extension pattern (PEP)). Based on the novel Cardiff Classifier method, they were able to discriminate between different LBP subgroups with an accuracy of 96.8%, 87.7% and 70.27% for FP from PEP, FP from AEP and AEP from PEP subtypes, respectively. The MDCS system, however, is based on the range of motion and pain score of individuals during task performance without consideration of any psychosocial aspects. The need for the presence of a trained expert for patient evaluation is another limitation of this method, especially during remote evaluation. In general, despite their success in research settings, marker-based motion capture systems have limited accessibility and functionality for clinical applications, emphasizing the need for portable systems with sufficient accuracy and reliability.
Wearable inertial sensors have recently emerged as effective tools for objective quantitative LBP assessment due to their low cost, accuracy and portability [
11,
12,
13]. Inertial Measurement Units (IMUs) are a combination of accelerometers, gyroscopes and/or magnetometers [
14]. An accelerometer measures the linear acceleration of the sensor’s 3D local coordinate system [
15], while an orthogonal gyroscope measures the angular velocity with respect to the sensor-embedded reference frame [
16]. Magnetometers measure the direction and magnitude of the earth’s magnetic field [
16]. Adding a magnetometer reduces orientation errors as calculated by the accelerometer and gyroscope [
17]. Data fusion algorithms, such as Kalman or Complimentary filters, are typically utilized to estimate 3D orientations, based on the combined signals obtained from the accelerometer, gyroscope and magnetometer [
14]. Navigation errors, which may occur during high acceleration movements and magnetic disturbance, should be accounted for [
14,
18]. Commercially available IMUs incorporate approaches such as a rotation matrix or Euler angle to quantify the captured kinematics in a global fixed frame [
19]. Ashouri et al. [
20] used a Support Vector Machine classifier to discriminate between LBP patients and healthy controls, by capturing the angular velocity and acceleration vectors during flexion-extension movements using two inertial sensors placed on the trunk and Pelvis. They concluded that simple flexion-extension movement in the sagittal plane, measured with only one sensor placed on the trunk, could distinguish healthy individuals from LBP patients with an accuracy of 96%, sensitivity of 100% and specificity of 92%. Esfahani et al. [
21,
22] developed and validated the inertial sensors used in this study (SHARIF-HMIS). Repeatability and reliability of these sensors were investigated by Abedi et al. [
23]. Gorganbeik et al. [
24] also assessed knee kinematics during landing tasks using the SHARIF-HMIS.
The main objective of the current study is to present a “sensor-based” practical methodology, that combines our previously validated inertial sensor with the STarT clinical screening tool. The methodology will be applied for quantitatively assessing and classifying NSLBP patients in clinical settings, towards tailored personalized interventions for LBP patients, as well the ability to objectively monitor and evaluate the efficacy of these interventions.
2. Materials and Methods
2.1. Participants
One hundred males with NSLBP were recruited from the physiotherapy clinic of Milad Hospital in Tehran. The inclusion criteria comprised of male patients aged between 20 and 50 years, with a history of LBP in the last year lasting at least three months and without any history of spinal deformities or surgery. The pain intensity of the patients needed to be less than 5 based on the Visual Analogue Scale (VAS) at the time of the test. Patients were excluded if they were unwilling or unable to participate. Prior to performing the experiment, the consent form (approved by Institutional Review Board (IRB) of Shahid Beheshti University of Medical Sciences, Tehran, IR) was reviewed and signed by all participants.
The demographic characteristics of the patients in each subgroup are presented in
Table 1. There were no significant differences among the three subgroups in terms of age, height, weight and pain level based on VAS (
Table 1).
2.2. Experimental Design
An inertial sensor (9DOF SHARIF-HMIS [
21]), enclosing a 3-axis accelerometer (Analog Devices Inc., Norwood, MA, USA), a 3-axis gyroscope (InvenSense Inc., San Jose, CA, USA) and a 3-axis magnetometer (Honeywell Inc., Morris Plains, NJ, USA) with a 100 Hz frequency, was mounted vertically on the Xiphoid process of all participants. The captured data was transmitted to a PC-based data logger using an AVR microcontroller (Atmel Corp., San Jose, CA, USA). The technical information on the SHARIF-HMIS system is detailed in Esfahani et al. [
21,
22].
All patients were asked to complete the STarT questionnaire before the experimental test. They wore a stretchable shirt, on which the inertial sensor was placed, in a manner minimizing relative movement. The participants were then asked to perform trunk flexion-extension movements in multiple planes to familiarize themselves with the setup. The experimental methodology was designed according to Marras et al. [
6,
7]. The patients performed flexion-extension trunk movements in five different planes of motion (PLMs) (sagittal plane (0°), 15° and 30° of right and left rotation, respectively) for a duration of 14 s, using their maximum comfortable and pain free speed. The angular velocity and acceleration data were directly acquired by the gyroscope and accelerometer.
Figure 1 depicts the defined experimental planes of motion, as well as, the direction of the trunk’s flexion/extension. Due to the position of the sensor (vertically placed), the angular rate was dominant along the
Y-axis of the gyroscope during flexion-extension and hence the data for this axis (angular velocity and linear acceleration) were used for subsequent analysis.
To avoid moving off plane, especially during asymmetrical movements, 5 lines were clearly marked on the ground at 5 different positions, ensuring that the participants restricted their movement to the intended plane (
Figure 1). Here, asymmetry was defined as the amount of trunk twist (i.e., rotation in the transverse plane around the proximal-distal axis). The participants were asked not to use their ankle, knee or hip joints during twisting. The order of the planes of motion for each patient during the test was randomized. To avoid fatigue, the patients had a rest period for two minutes after each testing stage [
23].
2.3. Data Processing
All data were analyzed using codes written in MATLAB (Mathworks, Inc., Natick, MA, USA). A 4th order Butterworth low pass filter, with a 10 Hz cutoff frequency, was applied to the raw data. A Fast Fourier Transform (FFT) was also utilized to compute the frequency components of the signals [
25]. Although the duration of movement was the same for all patients (14 s), the number of cycles differed among them, since they were asked to do flexion-extension movements as fast as they comfortably could. The method employed by Ashouri et al. [
20] was also used here to define the onset of movement. In each identified cycle, five kinematic variables, including maximum and average angular velocity, linear acceleration and maximum jerk (the second derivative of the angular velocity) were obtained from the sensors’ filtered data in both flexion and extension directions.
2.4. Statistical Analysis
One-way ANOVA and Kruskal-Wallis tests were used to compare the demographic data and VAS among the three groups. A mixed ANOVA model was applied on the dependent variables of velocity, acceleration and jerk, to investigate the kinematic differences among the 3 subgroups. The between-subjects factors included the three different subgroups (low risk, moderate risk and high risk NSLBP), while the within-subjects factors consisted of the direction (flexion and extension) and planes of motion (0° rotation, 15° right rotation, 15° left rotation, 30° right rotation, 30° left rotation). All statistical analyses were performed using IBM® SPSS, version 20 (IBM Corporation, Armonk, NY). The level of significance for all tests was set at p < 0.05. Normality was tested using a Kolmogorov-Smirnov test.
2.5. Linear Discriminant Analysis
In this study, the Linear Discriminant Analysis (LDA) method was used to classify patients based on kinematic data [
26]. Using IBM
® SPSS software (IBM SPSS Statistics for Windows, Version 25.0. Armonk, NY: IBM Corp), the purpose of the LDA was to determine the appropriate conditions in which subgroup separation is performed. All conditions (10 permutations: 5 PLMs × 2 directions) were evaluated for each of the 5 measured kinematic variables (
Section 2.3), in order to discriminate among the three subgroups. The 15 degrees right axial pre-rotation position during extension was identified here as the condition with the highest discrimination based on kinematic behavior (see Table 6 in
Section 3.2). Hereon, the 15° right axial pre-rotation during extension model will be labeled as Model 1.
In general, different methods can be used for analyzing data using LDA. Here, the feature selection stepwise method was selected. In this method, the appropriate variables to separate the groups can be identified by considering the remaining variables in the last step of the analysis. No other measures need to be included due to correlation with the selected variables. Upon examining the output of Model 1 (15° right axial pre-rotation during extension), Model 2 (extension in the planes of 15° right axial pre-rotation and 30° left axial pre-rotation) was developed to study the effect of adding conditions on the classifier’s performance. Using stepwise approach, only measures selected as the result of Model 1 were considered as suitable input variables for model 2.
Section 3.2 provides further explanation.
Next, a discriminative analysis was performed to distinguish the high-risk subgroup from the low-moderate risk combination group, since this group is the most critical form diagnostic and prognostic perspectives. 32 subjects were randomly selected out of 68 from the low and moderate subgroups. The conditions in models 1 and 2 were fixed and the procedure was repeated 10 times to decrease random selection.
4. Discussion
The current study presented a novel methodology towards the practical clinical application of inertial sensors as a quantitative tool to classify NSLBP. Various relevant kinematic variables among three subgroups of patients (low, high and medium risk) with NSCLBP were measured and compared. The results can be summarized as follows—(1) Increasing asymmetry in the planes of motion, resulted in decreasing the acceleration and angular velocity, while increasing the jerk; (2) The angular velocity and jerk were higher during extension, while the acceleration was higher during flexion. (3) Trunk extension position of 15 degrees rotation to the right can potentially be used to identify LBP patients, especially in the high-risk group.
4.1. Planes of Motion Effect
The results demonstrated a decrease in the angular velocity and acceleration with increasing asymmetry. We hypothesize that during trunk flexion-extension in a symmetrical plane, the movement is mostly controlled by the large stabilizing spinal muscles. As the angle of motion changes from zero, either to the right or left, the contribution of the smaller external and internal abdominal muscles is increased to control the movement [
27]. This shift from large stabilizer muscles to secondary smaller muscles may have caused a decrease in the velocity and acceleration. Control of asymmetric motion typically requires a high degree of muscle co-contraction. Increasing the number of the involved muscles, increases the time needed to control the activity of these muscles, which may lead to slower movement [
28].
Ogon et al. [
29] reported jerk as a natural part of relatively rapid movements. They suggested that changes in the strain rate of the viscoelastic biological lumbar tissues, as well as the axis of spine rotation, during flexion-extension may contribute to the jerk. Indeed, in the present study, the observed increase in jerk was associated with increasing asymmetry. It is possible that as the angular velocity and acceleration became more non uniform with increasing asymmetry, producing higher jerk values.
4.2. Direction Effect
The higher velocity observed during trunk extension, as compared to flexion during asymmetric movement, may be attributed to the stronger extensor muscles (vs. flexor muscles). Although Kim et al. [
30] proposed a flexion-relaxation phenomenon for extensor muscles in healthy individuals, they reported that this phenomenon is not replicated in LBP patients. The extensor muscles of these patients are constantly active during flexion, which may impair flexion. This may be considered as a potential reason for the decreased flexion velocity seen in the three subgroups of LBP patients in the present study.
The results also revealed less acceleration during extension and yet greater jerk in the same direction, which seems to indicate control challenges during extension in these patients, especially in the high-risk group, who had the least acceleration (
Figure 8). Various factors, such as fear of pain and a history of pain during extension, in addition to weak lumbar extension muscles can affect the patient’s movement. Moreover, structural, neurophysiological and behavioral changes in the lumbar spine may lead to a decrease in the deep sense and awareness of the patient’s lumbar movements, especially during extension [
31].
4.3. Subgrouping Effect
Using inertial sensors for the assessment of the trunk kinematic indices, Ashouri et al. [
20] were able to classify the participants into 2 subgroups of healthy and LBP patients. They found that an experimental protocol, based on a single sensor placed on the chest during simple flexion-extension tasks in the sagittal plane, can identify LBP patients with an accuracy of 96% and a sensitivity of 100%. In our study, all subjects were NSLBP. They had already been divided into three subgroups of low, medium and high risk (based on the clinical classification). Our results indicate that the kinematic indices were not always significantly different between the subgroups in flexion (
Figure 8). The mean acceleration was significantly affected by subgrouping, while the interaction of Subgroup by PLM approached significance. In addition the three-way interaction of Subgroup by PLM by direction was significant.
One of the reasons that subgrouping did not have a significant effect on some of the kinematic variables, may be the bio-psycho-social STarT questionnaire, which despite its simplicity and clinical popularity, remains a subjective tool that is not designed to account for kinematic parameters. In addition, in this study, all patients were at approximately similar pain level (VAS < 5) during testing. As can be seen in
Figure 8 and the results of the LDA analysis, identifying high-risk individuals can well be made based on their kinematic status.
4.4. Linear Discriminant Analysis
The results of the discriminant analysis in the current study suggest that by examining the trunk kinematic profiles of a patient in defined conditions, the health care provider may be able to better assess the level of the patient’s impairment, especially if it is high risk, towards prescribing more effective treatment/rehabilitation modalities. Evidence-based and theoretical considerations indicate that correctly identifying high-risk patients is critical and yet often neglected in clinical practice, despite the negative consequences of missing the chance to provide timely cognitive therapy, in addition to physiotherapy.
This study aimed to introduce a specific methodology/protocol for the discrimination of LBP patients, based on the risk/level of LBP, by using inertial sensors. Contrary to the study of Sheeran et al. [
9], which requires a well-equipped laboratory space, the portable wearable inertial sensors and protocol presented here, can be easily applied in clinical settings. Furthermore, using the STarT questionnaire, unlike other subgrouping methods of LBP patients such as O’Sullivan [
10], does not require any specific expertise and can be easily integrated in modern clinical practice. Moreover, flexion/extension tasks are among the simplest possible movements that a patient can safely perform under supervision. As recommended by Ashouri et al. [
20], only one sensor is needed, which makes the application inexpensive and convenient.
This study revealed a major difference in extension between the LBP subgroups, particularly for the acceleration variable. However, contrary to the results of Ashouri et al. [
20], who distinguished LBP patients from healthy subjects in a zero-degree symmetrical plane with an accuracy of 96%, our study indicated that a 15-degree of asymmetry was needed to better classify the three subgroups. Based on the identified differences, three simple equations were formulated for quantification. By inserting the acceleration value in these equations, the subgroup could be predicted, with some error, especially the high-risk group. The best discrimination percentage obtained in this study is well within the range described in the literature using inertial sensors in clinical applications (~70%) [
32,
33,
34,
35]. Brandt et al. [
36] used LDA on trunk angular data for discriminating people who performed high-risk and low risk lifting tasks. They were able to separate these individuals with 65% accuracy, which was enhanced to 75% by adding electromyography data.
In summary, the following are some notable points of discussion:
All participants in the present study were patients with LBP. In the previous study, using the same inertial sensors, the discrimination accuracy between healthy individuals and LBP patients was approximately 96% [
20]. Since the current investigation only included patients, the trunk kinematic variability among these three subgroups may be less than those between healthy participants and patients. Further research is needed for validation.
According to the STarT questionnaire, a patient with scores of 3 and 4 is classified as low risk and medium risk, respectively. Many patients, however, may be on the boundary between these two subgroups. Classified as “cusp patients,” these individuals are currently separated by only one affirmative answer on the questionnaire. This process may clearly increase the classification error, potentially explaining the low achieved discrimination percentage between the low and medium risk subgroups observed here.
Sheeran et al. [
9] revealed good classification results among the defined subtypes of LBP patients. However, they did not evaluate the pain score of these patients. Significant differences in the pain score can affect the separation results. In the present study, the pain score was not significantly different among the subgroups of patients during testing. Subgrouping in the O’Sullivan approach was performed by an expert clinician through examining the lumbar range of motion during active and passive flexion/extension, as long as the patient feels pain [
10]. Therefore, it can be postulated that pain is an important factor for distinguishing patients, as it directly affects classification and should not be eliminated from assessment protocols.
In a companion study, Abdollahi et al. (published in preprint) [
37], used inertial sensors for the classification of subgroups of LBP patients. They could distinguish high risk subgroups from low and medium risk patients with a 76% accuracy by applying the SVM method on trunk movement patterns. Using a simpler approach, such as LDA applied in the present study, makes it easier for non-engineers, especially therapists and clinicians, to better understand the biomechanical factors in play and kinematic variability among different subgroups. The simplistic quantitative approach adopted in this study may bridge the wide existing gap between biomechanists and health care providers to assist patients more effectively.
Our sample size was 100 patients with approximately 33 participants in each subgroup. Ashouri et al. [
20] studied 28 patients and 24 healthy individuals, while Sheeran et al. study assessed 87 patients in total [
9]. The sample sizes, including three subgroups with flexion pattern, active extension pattern and passive extension pattern, were 49, 23 and 14 LBP patients, respectively. Since both previous studies reported higher classification percentage, the effect of sample size on subgrouping performance should be examined in future studies. Esfehani et al. [
38] used a method known as Bayes [
39] to investigate the maximum possible accuracy of classification using wearable sensors. This method may be utilized to identify the best possible accuracy that can be achieved using the data collected in the present study, independent of any specific classification method.
5. Limitations
LDA was the only method used in this study to discriminate among the three LBP patient subgroups. With the help of machine learning and nonlinear dynamic methods, more improved discriminant analyses tools can be exploited in future studies towards better results of accuracy and sensitivity. By analyzing the temporal patterns of the variables measured by the sensor, instead of standard indicators, such as maximum and mean, the behavior of patients in the three subgroups can be compared in real time during dynamic movement. Another limitation of this study is that due to the hospital’s logistics, only male participants were recruited and hence the kinematic profiles of females were not examined. Moreover, all the participants were right-handed, which may have influenced the results, as it is usually easier to move to the dominant side. Although the sensor was tightly attached, some error may have resulted from sensor placement. An adaptive Kalman filter [
40] was implemented in the SHARIF-HMIS system in order to reduce the error based on high-acceleration movement and to separate the linear acceleration variable from the gravity component [
21,
22]. In the current study, due to the lack of access to these programs and the limitations in the original clinical study that precluded the inclusion of added static postures needed for proper calibration [
12,
21,
22], the output of the IMU sensors was directly used without adaptation. This also prevented the estimation of trunk angular position, as well as the compensation of the gravitational acceleration in the output of linear accelerometers. We have partially compensated for this shortcoming by dividing the repetitive movements into four phases of motion—acceleration/deceleration phases of trunk flexion and extension, respectively. Limited ankle movement or hip flexion was also unavoidable while performing the various movements. Furthermore, in the current study, the features were extracted only from the signals obtained in the
Y axis of the sensor’s local reference frame due to the domination of the angular velocity in this direction during flexion/extension as previously mentioned. The signals captured by the sensor in the X and Z directions were not included in our analysis. Locally measured
Y-axis-signals provided enough information on patients’ trunk kinematics for this study. Future work, however, should study other available signals.