Provisional THOR Chest Injury Risk Function With Frontal and Oblique Sled Tests
Provisional THOR Chest Injury Risk Function With Frontal and Oblique Sled Tests
Provisional THOR Chest Injury Risk Function With Frontal and Oblique Sled Tests
January 8, 2016
This document explains the functional form of the chest injury risk function relative to frontal
crash test conditions described in Appendix A. Results are preliminary and presented only to
provide a general impression for the metrics under current consideration as well as the likely risk
function.
1. There is an approximate rule of thumb that each considered predictor requires 10 events (AIS
3+ injuries) and 10 non-events for a reliable model fit (Vittinghoff et al., 2012, p.422). These
data contain 25 subject observations with AIS 3+ injuries resulting and 19 observations not
sustaining AIS 3+ injuries, thus limiting the model to two independent predictors.
2. Age is considered an important predictor of PMHS injury and should be included in the
model.
3. The remaining predictor should be left/right symmetric. That is if the deflection patterns were
reflected across the dummy’s left/right aspect, the estimated injury risk should be unchanged.
∗
[email protected]
†
[email protected]
1
3.0
2.5
Variances
2.0
1.5
1.0
0.5
Component
Figure 1 shows that almost all the variability in the test data can be explained by a single
principal component. Moreover, that principal component is an approximately equally weighted
sum of the 4 inputs scaled to have mean 0 and variance 1.
2
1.2 Calculating the Principal Components Score
Since one principal component explains almost all of the deflection patterns observed in the data
set (87.1%, Fig. 1), loadings from the first principal component are used as the basis for the primary
predictor of injury. The first principal component can be calculated by:
updif − m3
uptot − m1 lowtot − m2 lowdif − m4
P C1 = l1 + l2 + l3 + l4 . (2)
s1 s2 s3 s4
The variables in equation (2) are defined as follows. uptot , lowtot , updif , and lowdif are the variables
described in (1).
l1 , . . . , l4 are the principal components loadings (first column in Table 1). Their values are
l1 l2 l3 l4
0.486 0.492 0.496 0.526
m1 , . . . , m4 are the means of the 4 input deflection metrics (described in (1)). Their values are
m1 m2 m3 m4
59.856 47.417 27.389 26.345
s1 , . . . , s4 are the standard deviations of the 4 input deflection metrics (described in (1)). Their
values are
s1 s2 s3 s4
17.439 14.735 9.672 12.384
3
2 Preliminary Fitted Models
Statistical modeling was conducted in two stages. The first stage, presented in this section, is
to assess the fitted model for goodness of fit, etc. There is a failed assumption with the models
presented in this section with regard to the independence of observations, as 4 of the cadavers were
tested twice, the first time at a slow, non-injurious impact speed, and the second at a higher speed.
These repeated observations on the same subject cannot be treated as independent. Remedying
this failed assumption is addressed in Section 3.1.
Two types of models were fit using the predictor described by (3) and age of the PMHS as
a potential confounding factor. The first is a logistic regression including age and the translated
principal components score as predictors. The second is a parametric survival model, specifically,
an accelerated failure time (AFT) model with Weibull distribution, treating the deflection metric
P C1trans as the failure “time”, and age as a covariate. In the present setting the two models are
expected to produce results of similar quality due to the double-censored nature of the data. The
first approach (i.e., logistic modeling) has the advantage of being more commonly used and having
well-established diagnostic procedures. The logistic regression model output is:
Call:
glm(formula = ais3 ~ respcaTrans + age, family = binomial, data = thor)
Deviance Residuals:
Min 1Q Median 3Q Max
-2.1762 -0.7262 0.2952 0.7209 1.5378
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) -12.76301 4.19781 -3.040 0.00236 **
respcaTrans 1.04992 0.33120 3.170 0.00152 **
age 0.11757 0.04638 2.535 0.01125 *
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
The second modeling approach (AFT) has the advantage of producing an estimated 0 risk
corresponding to 0 deflection, and it makes it easier to account for repeated measures. The fitted
accelerated failure time model is:
Call:
survreg(formula = Surv(respcaTrans, time2 = NULL, event = 2 *
ais3, type = "interval") ~ age, data = thor, dist = "weibull")
Value Std. Error z p
(Intercept) 2.931 0.47419 6.18 6.37e-10
4
age -0.019 0.00786 -2.41 1.58e-02
Log(scale) -1.216 0.32432 -3.75 1.76e-04
Scale= 0.296
Weibull distribution
Loglik(model)= -19.6 Loglik(intercept only)= -23.5
Chisq= 7.88 on 1 degrees of freedom, p= 0.005
Number of Newton-Raphson Iterations: 5
n= 44
The diagnostics for the logistic regression showed one potential outlier, a 40 year old subject
whose cause of death was deemed respiratory failure as a result of scleroderma. Upon review, the
severity of the scleroderma was deemed to increase the risk of fracture, so the subject was removed
for all analysis.
2
●● ● ● ●
● ●
●● ●
Std. deviance resid.
1
●● ●●
1
●●
●
● ●●●●
●
●● ●
●● ●●●
●●● ●●
Residuals
● ● ●●●●●
●●●
0
●●●●
● ●●●
● ●●
●
●●
●● ●●
−1
●
● ●
●●●
−1
●
● ●●
● ●●
●1 ●
4
−2
4 ●1 ●
−2
26 ●
● 26
−4 −2 0 2 −2 −1 0 1 2
26 ●
●4
1 ● 39 22 ●
●● ●
1
Std. deviance resid.
● ●
● ● ●
Std. Pearson resid.
●● ● ●
● ●
● ●● ●
●●
1.0
● ●
●●● ●
●
●
● ●
0
●● ●●
● ●●●
● ● ●
● ● ● ●
● ● ●
●
● ●
●● ● ● ●
−1
●● ● ●
●●
● ●
●
0.5
●
●
−2
●●
●● ●
0.5
−3
● 26
Cook's distance
0.0
Since the fitting processes for both models are similar, the diagnostics for the AFT model will
be similar, and are therefore omitted.
Both logistic and AFT models produce non-significant Hosmer-Lemeschow goodness of fit
(GOF) statistics, which is an indicator of good model fit. Presented below are the results for
logistic regression.
5
1.0
●
●
● ●●●
●●
0.8
●●
●
●
●
●●
●
0.6
●
●●
●
0.4 ●
● ●
●
●
●
● ●
●
0.2
● ●
●
0.0
● ●
●
Figure 3: The logistic and survival models produce very similar estimates of the probability of
injury.
The Hosmer-Lemeschow test requires grouping the data into g groups by quantile. In both cases,
the test results hold for all reasonable choices of g. Test results for g = 8 are shown above.
6
ROC Curves
1.0 ● ● ● ● ● ● ● ● ● ● ● ●
● ● ● ●
● ● ●
● ● ● ●
0.8
●
●
probability of detection
● ●
●
●
0.6
(Sensitivity)
●
●
● ●
●
●
0.4
●
● ●
● ●
● ●
0.2
●
●
●
●
AFT
●
●
●
Logistic
0.0
Call:
survreg(formula = ss ~ age, data = thorRM, dist = "weibull")
Value Std. Error z p
(Intercept) 2.8677 0.47407 6.05 1.46e-09
age -0.0181 0.00785 -2.30 2.15e-02
Log(scale) -1.1975 0.32246 -3.71 2.04e-04
7
Scale= 0.302
Weibull distribution
Loglik(model)= -19.2 Loglik(intercept only)= -22.6
Chisq= 6.74 on 1 degrees of freedom, p= 0.0095
Number of Newton-Raphson Iterations: 5
n= 40
It is seen from Figure 4 that the two models produce fitted values that are quite similar.
1.0
●
●●
●
●
●
●●
●
●
●
0.8
●
●●
●
●
●
0.6
●
RM Model
●
●
●
0.4
●●
●
●
●●
●●
●
●
0.2
●●
0.0
●
●
Original Model
Figure 4: Comparison between the repeated measures survival model and the original independent
data survival model. The two models perform similarly.
id Max dfbeta
323 0.31
433 0.36
444 0.33
481 0.31
674 0.37
736 0.37
8
1.0
● ● ●● ●
● ● ●
0.8
P(AIS 3+Injury)
0.6
0.4
0.2
0.0
● ● ● ● ● ● ●
●●
0 5 10 15
PC Score
Figure 5: Injury risk function for a 55 year old (solid) with 95% pointwise horizontal confidence
limits (dashed). The confidence interval for a 50% chance of injury is illustrated by the arrow.
4 Example
Suppose a series of frontal crash tests were run using the THOR-metric dummy. The associated
AIS 3+ injury risk can be estimated through the following stepwise process:
1. Calculate uptot , lowtot , updif , and lowdif as described in (1). For the remainder of the
example, we use the values
uptot = 75.206
lowtot = 51.751
updif = 33.461
lowdif = 36.753
2. Calculate the translated PC Score as given by equation (3). Use the values for uptot , lowtot ,
9
updif , and lowdif calculated in Step 1, and the the values for li and si given under equation (2).
3. Calculate the probability of injury using equation (4). Substitute P C1trans calculated in Step
2 for x, and input the occupant’s age. For this example, we assume the occupant is 60 years
old.
We estimate that a 60 year old subject has an 83.3% chance of injury given the crash test
conditions.
Scale= 0.298
Weibull distribution
Loglik(model)= -19.8 Loglik(intercept only)= -22.9
Chisq= 6.3 on 1 degrees of freedom, p= 0.012
Number of Newton-Raphson Iterations: 5
n= 40
10
1.0
●
●
●●
● ●●
●
●
0.8
●● ●
●
●
●
0.6
●
●●
●
●
●
●
0.4
●
●
●●
●●
● ●●
0.2
●
●
●
●
0.0
●
Deviance Residuals:
Min 1Q Median 3Q Max
-2.0873 -0.7540 0.3397 0.6459 1.6057
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) -11.72791 3.78122 -3.102 0.00192 **
overall_max 0.14170 0.04424 3.203 0.00136 **
age 0.10398 0.04211 2.469 0.01354 *
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
11
Value Std. Error z p
(Intercept) 5.6211 0.41735 13.47 2.40e-41
age -0.0151 0.00688 -2.20 2.81e-02
Log(scale) -1.3027 0.35838 -3.64 2.78e-04
Scale= 0.272
Weibull distribution
Loglik(model)= -20.8 Loglik(intercept only)= -23.9
Chisq= 6.34 on 1 degrees of freedom, p= 0.012
Number of Newton-Raphson Iterations: 5
n= 40
●
●
●●
●
●
●●
●
Total Deflection AIS 3+ Injury
0.8
●
●
●● ●
●
●
0.6
●
● ● ● ●●
●
● ● ●
0.4
●
●●
●●
● ●●
0.2
●
●
●
0.0
●
●
12
14
●
8
7 ●
●
7
12
● 4 ●
● 6 ●
●
4
10
9 ● ●
THOR PC Score
● 2
6
5 11 ●
●
8
8
●
14 9 ●
●
11
6
4
● ●
●
5
1312 14
●
●
4
3 13 ●
12 ●
2
2
●
3
●
10 10
0
20 30 40 50 0 2 4 6 8 10 12 14
References
Eric Vittinghoff, David V Glidden, Stephen C Shiboski, and Charles E McCulloch. Regression
methods in biostatistics: linear, logistic, survival, and repeated measures models. Springer, New
York, 2nd edition, 2012.
13