Introduction

Anterior cruciate ligament (ACL) injury is one of the most common injuries in the knee joint occurring amongst adolescents and young adults during sports that involve sudden stops or changes in direction, jumping, and landing.1 Diagnosis of ACL injury in the initial medical consultation is difficult because of the common acute signs and symptoms of an ACL injury, such as immediate pain, swelling, and hemarthrosis. According to two previous studies in the U.K,1,2 correct ACL injury diagnosis was made in 28.2% and 14.4% of cases in the acute phase, respectively, which suggests that diagnostic accuracy is lower and difficult in the acute phase, as the median time to diagnosis was six weeks after ACL injury. Poor diagnostic accuracy within the first six weeks after injury is often related to an unnecessary delay in the diagnosis and subsequent treatment, and increases the risk of secondary injury to the knee joint.1 Accurate diagnosis in the acute stage is needed to reduce these secondary risks and appropriately manage ACL injuries.

Diagnostic imaging modalities and physical examinations are used to diagnose ACL injuries. Magnetic resonance imaging (MRI) is the gold standard owing to its high diagnostic accuracy. A previous systematic review indicated that the positive likelihood ratio (LR+) and negative likelihood ratio (LR−) were 14.4 (95% confidence interval [CI] = 9.2–22.5) and 0.1 (95% CI = 0.1–0.3), respectively,3 indicating that MRI is a useful independent tool for diagnosing ACL injury.

Conversely, physical examinations performed by athletic trainers, physical therapists, and medical doctors are inexpensive and can be performed without the use of highly specialized machines. Four tests are commonly used to diagnose or determine ACL injury: the Lachman test, the anterior drawer test, the pivot shift test, and the lever sign test. The diagnostic accuracy of these tests has been examined in systematic reviews and meta-analyses. The Lachman test had the lowest LR− of 0.17 (95% CI = 0.11–0.25), and the pivot shift test had the highest LR+ of 16.00.4 Additionally, the latest systematic review, conducted in 2019 regarding the lever sign test, suggested that it may be used in addition to other tests to rule out the presence of an ACL injury.5

Based on these findings of systematic reviews, physical examination is useful in clinical practice for diagnosing ACL injury. However, previous meta-analyses have not analyzed acute and chronic conditions individually, and no systematic review has examined whether these physical examination tests are useful in acute settings for diagnosing ACL injuries. There is a possibility that the diagnostic accuracy for ACL injury differs between acute and chronic conditions because of the effects of signs and symptoms in acute conditions, such as pain, swelling, and hemarthrosis. Additionally, there has been no systematic review examining the diagnostic accuracy of these four physical examinations, including the lever sign test, in the same analysis, and the evidence for the efficacy of these tests for the diagnosis of acute ACL injury is limited. Clarifying the diagnostic accuracy of physical examination tests can help healthcare providers with the decision-making process in clinical practice.

Thus, this study aimed to perform a systematic review and meta-analysis of original research studies that involved the diagnostic accuracy of four physical examination tests for ACL injuries in acutely after an injury in adolescents and young adults.

Materials and methods

This study was conducted according to the Preferred Reporting Items for Systematic Review and Meta-analysis of Diagnostic Test Accuracy (PRISMA-DTA).6 The study protocol of this systematic review was registered in the University Hospital Medical Information Network (UMIN), the biggest registration system in Japan. The study approval number was UMIN000044766.

Physical Examination tests

This systematic review included studies evaluating the accuracy of four physical examination tests (Lachman, anterior drawer, pivot shift, and lever sign tests) in diagnosing ACL injuries. All tests are performed in the supine position. The Lachman test is performed with the knee flexed at 15°. The examiner stabilizes the distal femur with one hand and grasps the proximal tibia with the other hand. Then, the examiner applies an anterior tibial force to the proximal tibia. The anterior drawer test is performed with the knee flexed at 90°. The examiner sits on the patient’s foot and grasps behind the proximal tibia with the thumbs palpating the tibial plateau and index fingers palpating the tendons of the hamstring muscle group medially and laterally. The anterior tibial force is then applied by the examiner. In these two tests, a positive test for a torn ACL is indicated by greater anterior tibial displacement on the affected side when compared to the unaffected side. In the pivot shift test, the examiner stands on the side of the patient’s knee under examination. The examiner wraps one arm around the patient’s leg, pinning it firmly and flexing the knee to 90° while using the palm of the other hand to rotate the tibia medially, effectively subluxing the lateral tibial plateau. The examiner slowly extends the knee while maintaining the rotation of the tibia. As the patient’s knee reaches full extension, the tibial plateau relocates. A positive test traditionally is indicated by an audible or palpable click. In the lever sign test, the patient lies in the supine position with both legs fully extended. One fist of the examiner is placed under the proximal third of the calf of one leg. Then, with the other hand, a downward force is applied over distal third of the patient’s quadriceps of same leg. A positive test is indicated by a heel rise (Figure 1).

Figure 1
Figure 1.Descriptions of physical examination tests

a, Lachman test; b, Anterior drawer test; c, Pivot shift test; d, Lever sign test.

Eligibility criteria

Studies were included in this review if they met the following inclusion criteria: (1) original articles in English or Japanese; (2) single-gate studies, such as prospective cohort and cross-sectional studies including acute settings; (3) reported findings that enabled the calculation of the number of true-positive (TP), true-negative (TN), false-positive (FP), and false-negative (FN) values for the test accuracy of index tests (the present study included articles that reported sensitivity, specificity, and prevalence to manually calculate TP, TN, FP, and FN if the study did not show specific numbers); (4) reported data on adolescents and young adults; (5) reported findings of partial or complete ACL injury; and (6) evaluated MRI and/or arthroscopy as the reference standard. Based on previous studies,1,7 the acute phase was defined as six weeks in this study. No limits regarding the date of publication were established, although the following types of studies were excluded: (1) participants who had undergone surgery in knee joints affected by ACL injury, (2) participants with other associated diseases such as osteoarthritis and meniscus injury, (3) diagnosis determined using questionnaires, (4) diagnosis determined using devices, and (5) participants who were diagnosed with laxity.

Search strategy

The languages used in the search were English and Japanese. The electronic search was conducted using the following databases: PubMed, Medical Literature Analysis and Retrieval System Online (MEDLINE), Cumulative Index to Nursing and Allied Health Literature (CINAHL), Web of Science, and Ichushi (a Japanese medical database). The most recent search was conducted on April 23, 2021. The details of the search strategy are presented in Table 1. The same approach was used for all searches and was adopted as necessary, according to the specifics of each database.

Table 1.Search strategy
#1 "Anterior cruciate ligament"
#2 Injury
#3 Rupture
#4 Tear
#5 #2 OR #3 OR #4
#6 "Lachman test"
#7 "Pivot shift test"
#8 "Anterior drawer test"
#9 "Lever sign test"
#10 #6 OR #7 OR #8 OR #9
#11 #1 AND #5 AND #10

Selection of the studies

First, duplicate studies were removed after searching databases. One evaluator (ST) read the titles and abstracts of the identified articles and excluded irrelevant studies. The full texts of the selected studies were evaluated, and the suitability for inclusion was determined by two independent evaluators (ST and YI). Disagreements between evaluators were resolved by consensus. In cases in which no consensus was reached, a third evaluator (RT) was consulted to determine eligibility.

Risk of bias

The methodological quality of the diagnostic studies was evaluated using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS)-2 tool.8 This tool comprises four domains: patient selection, index test, reference standard, and flow and timing. Each domain of the QUADAS-2 was assessed in terms of the risk of bias and classified accordingly as low, high, or unclear by two independent evaluators, ST and RT. Applicability concerns of the three domains of the QUADAS-2, such as patient selection, index test, and reference standard, were assessed by ST and RT.

Statistical analysis

To analyze the data, free software Meta-Disc version 1.4 was used. The statistical methodology was based on a previous study described by Devillé et al.9 Summary sensitivity, specificity, LR+, and LR– with 95% CI were calculated using random-effects models with the Der-Simonian and Laird method or fixed-effects models with the Mantel-Haenszel method, depending on the level of heterogeneity of the study. Data are presented as forest plots and summary receiver operating characteristic (SROC) curve plots. The percentage of variability was crossed using the chi-squared test (p < 0.10) and I2 statistics. A random-effects model was used if the heterogeneity test showed statistical significance (I2 > 50%, p < 0.01). Otherwise, a fixed effects model was used. In addition, the area under the curve (AUC) was calculated using 95% CI. A mean AUC-ROC value of > 0.70, was defined as an effective risk predictor.

Results

Figure 2
Figure 2.Flow diagram of the literature screening process

A systematic electronic search of PubMed, MEDLINE, CINAHL, Web of Science, and Ichushi resulted in 3,836 studies. After duplicates were removed, 1,536 studies were included. The titles and abstracts of these studies were screened, and 86 full-text articles were evaluated for eligibility. Of the 86 articles, 40 were not applicable according to the study design, 14 did not describe test accuracy, 10 were review articles, eight articles did not use physical examination tests, and six articles did not include the acute phase. This, eight studies were included in this review. A summary of the literature search is shown in Figure 2. The characteristics of the included studies are summarized in Table 2.

A total of 620 participants were evaluated in the included studies. Six of the articles involved assessment using an index test by medical doctors, and two of the articles reported assessment using index test by physical therapists. The Lachman test was used in eight studies, the anterior drawer test in six studies, the pivot shift test in five studies, and the lever sign test in four studies. For the reference test, seven studies used MRI, one study used arthroscopy, and two studies used both MRI and arthroscopy. Index tests, which included the four physical examination tests of interest in this review, were performed while awake in four studies, under anesthesia in one study, and patient status was unclear in four studies (Table 2). In included studies, a test was considered positive if it met the following criteria: (1) > 1+ in the International Knee Documentation Committee parameters; (2) perceivable anterior subluxation of the tibia; (3) no positive endpoint to the subluxation of the knee in 10°–20° of flexion; and (4) the foot did not raise passively in the lever sign test.

Table 2.Summary of characteristics of the included studies
Source (year) No. of subjects Male Female Age years mean (range) Assessor Anesthesia Reference test Index test TP TN
Jarbo,10 2017 102 58 44 23 (15-66) Medical doctor 29 patients under anesthesia MRI Lachman test 40 56
15 patients while awake Anterior drawer test 39 55
Pivot-shift test 26 57
Lever sign test 28 52
Lee,11 1988 79 NA NA NA (14-59) Medical doctor NA MRI Lachman test 21 56
Anterior drawer test 18 56
Massey,12 2017 91 61 30 28 (NA) Medical doctor while awake MRI Lachman test 63 17
Anterior drawer test 58 16
Pivot-shift test 44 15
Lever sign test 59 16
Gürpınar,13 2019 78 69 9 26 (17-44) Medical doctor while awake MRI Lachman test 50 10
Anterior drawer test 48 11
Pivot-shift test 32 15
Lever sign test 57 15
Kıyak,14 2018 62 NA NA NA Medical doctor while awake MRI Lachman test 14 20
Anterior drawer test 10 18
Pivot-shift test 8 22
Mulligan,15 2015 45 21 24 40 (20-64) Physical therapist NA MRI Lachman test 13 29
Arthroscopy
Mulligan,16 2017 60 38 22 42 (18-65) Physical therapist NA MRI Lachman test 9 26
Arthroscopy Lever sign test 20 32
Kostov,17 2014 103 81 22 29 (16-58) Medical doctor NA Arthroscopy Lachman test 67 30
Anterior drawer test 69 30
Pivot-shift test 45 29

TP, true positive; TN, true negative; MRI, magnetic resonance imaging.

The risk of bias assessment of the included studies is described under the headings of the QUADAS-2 assessment tool (Figure 3). For the patient selection domain, only one study had an unclear risk of bias because there was no information regarding consecutive sampling. For the index test domain, one study had a high risk of bias. This study did not state whether the index test was interpreted without the knowledge of the results of the reference test, and it was unclear how this study judged a positive result. For the domain of flow and timing, three studies had a high risk of bias because these studies did not assess the timing between reference and index tests. All applicability concerns in the three domains were judged to have a low risk of bias.

Figure 3
Figure 3.Summary of risk of bias

The sensitivity and specificity with 95% CI and forest plots for the four physical examinations for all included studies are shown in Figure 4. Pooled sensitivities for the Lachman, anterior drawer, pivot shift, and lever sign tests were 0.79 (95% CI = 0.74 – 0.83), 0.78 (95% CI = 0.73–0.83), 0.55 (95% CI = 0.49–0.60), and 0.82 (95% CI = 0.76–0.87), respectively. Pooled specificities were 0.91 (95% CI = 0.87–0.94), 0.91 (95% CI = 0.86–0.95), 0.96 (95% CI = 0.91–0.98), and 0.88 (95% CI = 0.82–0.93), respectively. Forest plots for LR+ and LR− are shown in Figure 5. The lever sign test had the lowest LR− (0.21) and the pivot shift test had the highest LR+ (11.60). The SROC curves for the four physical examinations are shown in Figure 6. The AUC of the four physical examinations were > 0.70.

Figure 4
Figure 4.Forest plots of sensitivity and specificity
Figure 5
Figure 5.Forest plots of positive/negative likelihood ratio
Figure 6
Figure 6.SROC curve plots

Discussion

This systematic review included eight studies. The risk of bias of the included studies was evaluated using the QUADAS-2 tool. For the domains of patient selection, the index test, and the reference standard, the risk of bias was low. For the domain of flow and timing, three studies were evaluated as having a high risk of bias because they did not assess the timing between the reference and index tests. The present study included two new studies that were not included in previous systematic reviews. Thus, this study provides additional evidence and emphasize the method healthcare providers should select for ACL injury diagnosis in an acute setting in adolescents and young adults.

The AUC values of the Lachman, anterior drawer, pivot shift, and lever sign tests were large enough (0.9270, 0.9193, 0.8910, and 0.9038, respectively) for the physical examination tests to be considered effective in diagnosing ACL injuries. Furthermore, the pooled sensitivity was highest in the lever sign test, and the pooled specificity was highest in the pivot shift test. The lever sign test had the lowest LR− (0.21) and the pivot shift test had the highest LR+ (11.60). In general, LR+ above 10 and LR- below 0.1 are considered to provide strong evidence to rule in or rule out diagnoses, respectively, under most circumstances.18 Based on the results, the lever sign test is useful to rule out among the tests, and the pivot shift test is useful to confirm an ACL injury in an acute setting.

Previous systematic reviews have assessed the diagnostic accuracy of physical examination tests for ACL injuries. In a previous systematic review in 2016, it was elucidated that the pivot shift test was the most specific, showing a pooled specificity of 97.5% (95% CI = 0.95–0.99), and had the highest LR+ of 16.00 (95% CI = 7.34–34.87).4 Reiman et al.19 conducted a systematic review of diagnostic accuracy regarding the lever sign test and found that LR+ and LR− were 9.2 (95% CI = 0.70–46.1) and 0.58 (95% CI = 0.18–1.28), respectively. This study supports these findings, and the pivot shift and lever sign tests are proficient tools for the diagnosis of ACL injuries in acute settings. Although LR+ and LR- are sufficient to diagnose ACL injuries, it is difficult to make a complete diagnosis using a single physical examination test. Therefore, a combination of these four physical examination tests with high diagnostic accuracy may help reduce the number of incorrect diagnoses in clinical practice.

Among the four physical examinations, the lever sign test was introduced as a new physical examination test in the 2010s.20 This test is relatively easy to perform and requires less examiner strength than the other three commonly used tests. The lever sign test uses the common force of gravity instead of relying on the sensation of the examiner, thus increasing test accuracy. The results of the current study revealed that the diagnostic accuracy of the lever sign test is favorable. Furthermore, Lichtenberg et al.21 examined the reliability of four physical examinations and indicated that the kappa values for the lever sign test were the highest among the physical examinations. This is a notable point for considering the use of the lever sign test in clinical practice.

This systematic review and meta-analysis have some limitations. First, this study searched for both English and Japanese articles in various databases; however, EMBASE, a database frequently used in other systematic reviews, could not be used. Furthermore, this systematic review focused only on diagnosis in acute conditions and excluded chronic conditions; thus, only eight studies were included. There may be a difference in the included studies and results between other systematic reviews and those in the present study. Additionally, this study did not include subjects with meniscal injury and laxity. These factors may limit the generalizability of the results. Second, this systematic review did not consider the relationship between anesthesia and test result, and included patients with complete or partial ACL injuries. In particular, anesthesia might be related to test accuracy due to complete relaxation of the patient. Therefore, clinicians should be cautious about its clinical applications. Additionally, two types of reference standards, MRI and arthroscopy, were used, which may have led to mixing bias.

Conclusion

The AUC of the Lachman, anterior drawer, pivot shift, and lever sign tests suggests high accuracy for use of each in the diagnosis of ACL injury. The lever sign test had the lowest LR− and the pivot shift test had the highest LR+. Thus, the lever sign test is recommended to rule out among the tests, and the pivot shift test is recommended to confirm an ACL injury in an acute setting.


COI statement

The authors have no conflicts of interest. This study did not receive any specific grants from funding agencies in the public, commercial, or not-for-profit sectors.