Advertisement for orthosearch.org.uk
Results 1 - 20 of 383
Results per page:
The Bone & Joint Journal
Vol. 102-B, Issue 4 | Pages 478 - 484
1 Apr 2020
Daniels AM Wyers CE Janzing HMJ Sassen S Loeffen D Kaarsemaker S van Rietbergen B Hannemann PFW Poeze M van den Bergh JP

Aims. Besides conventional radiographs, the use of MRI, CT, and bone scintigraphy is frequent in the diagnosis of a fracture of the scaphoid. However, which techniques give the best results remain unknown. The investigation of a new imaging technique initially requires an analysis of its precision. The primary aim of this study was to investigate the interobserver agreement of high-resolution peripheral quantitative CT (HR-pQCT) in the diagnosis of a scaphoid fracture. A secondary aim was to investigate the interobserver agreement for the presence of other fractures and for the classification of scaphoid fracture. Methods. Two radiologists and two orthopaedic trauma surgeons evaluated HR-pQCT scans of 31 patients with a clinically-suspected scaphoid fracture. The observers were asked to determine the presence of a scaphoid or other fracture and to classify the scaphoid fracture based on the Herbert classification system. Fleiss kappa statistics were used to calculate the interobserver agreement for the diagnosis of a fracture. Intraclass correlation coefficients (ICCs) were used to assess the agreement for the classification of scaphoid fracture. Results. A total of nine (29%) scaphoid fractures and 12 (39%) other fractures were diagnosed in 20 patients (65%) using HR-pQCT across the four observers. The interobserver agreement was 91% for the identification of a scaphoid fracture (95% confidence interval (CI) 0.76 to 1.00) and 80% for other fractures (95% CI 0.72 to 0.87). The mean ICC for the classification of a scaphoid fracture in the seven patients diagnosed with scaphoid fracture by all four observers was 73% (95% CI 0.42 to 0.94). Conclusion. We conclude that the diagnosis of scaphoid and other fractures is reliable when using HR-pQCT in patients with a clinically-suspected fracture. Cite this article: Bone Joint J 2020;102-B(4):478–484


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 4 | Pages 484 - 488
1 Apr 2006
Rogers BA Thornton-Bott P Cannon SR Briggs TWR

We assessed the reproducibility and accuracy of four ratios used to measure patellar height, namely the Blackburne-Peel, Caton-Deschamps, Insall-Salvati and modified Insall-Salvati, before and after total knee arthroplasty. The patellar height was measured, by means of the four ratios, on the pre- and post-operative lateral radiographs of 44 patients (45 knees) who had undergone total knee arthroplasty. Two independent observers measured the films sequentially, in identical conditions, totalling 720 measurements per observer. Statistical analysis, comparing both observers and ratios, was carried out using the intraclass correlation coefficient. Before operation there was greater interobserver variation using either the Insall-Salvati or modified Insall-Salvati ratios than when using the Caton-Deschamps or Blackburne-Peel methods. This was because of difficulty in identifying the insertion of the patellar tendon. Before operation, there was a minimal difference in reliability between these methods. After operation the interobserver difference was greatly reduced using both the Caton-Deschamps and Blackburne-Peel methods, which use the prosthetic joint line, compared with the Insall-Salvati and modified Insall-Salvati, which reference from the insertion of the patellar tendon. The theoretical advantage of using the Insall-Salvati and modified Insall-Salvati ratios in measuring true patellar height after total knee arthroplasty needs to be balanced against their significant interobserver variability and inferior reliability when compared with other ratios


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was ĸ = 0.44 for the GA classification and ĸ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, ĸ = 0.55 (moderate); for muscle, ĸ = 0.44 (moderate); for arterial injury, ĸ = 0.74 (substantial); for contamination, ĸ = 0.35 (fair); and for bone loss, ĸ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 4 | Pages 670 - 672
1 Jul 1998
Flinkkilä T Nikkola-Sihto A Kaarela O Päakkö E Raatikainen T

Interobserver reliability of the AO system of classification of fractures of the distal radius was assessed using plain radiographs and CT. Five observers classified 30 Colles’-type fractures using only plain radiographs; two months later they were reclassified using CT in addition. Interobserver reliability was poor in both series when detailed classification was used. By reducing the categories to five, interobserver reliability was slightly improved, but was still poor. When only two AO types were used, the reliability was moderate using plain radiographs and good to excellent with the addition of CT. The use of CT as well as plain radiographs brings interobserver reliability to a good level in assessment of the presence or absence of articular involvement, but is otherwise of minor value in improving the interobserver reliability of the AO system of classification of fractures of the distal radius


The Journal of Bone & Joint Surgery British Volume
Vol. 72-B, Issue 2 | Pages 202 - 204
1 Mar 1990
Simmons E Graham H Szalai J

Fifteen independent observers of three levels of experience (consultant staff, fellows, residents) assessed 40 radiographs of children presenting with Perthes' disease using the Catterall and the Salter-Thompson grading systems. Each observer was supplied with descriptions and illustrations of the classifications and each hip was grouped by both systems by each observer. The results were statistically analysed using 'kappa' statistics. The level of interobserver agreement was higher for the Salter-Thompson system and correlated with the level of experience of the observer. Both systems can give acceptable levels of interobserver agreement, but the Salter-Thompson grouping is simpler and easier to apply in the earlier stages of the disease when treatment must be decided, and has a higher degree of reproducibility amongst more experienced observers


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 7 | Pages 950 - 954
1 Sep 2002
Brorson S Bagger J Sylvest A Høbjartsson A

We investigated whether training doctors to classify proximal fractures of the humerus according to the Neer system could improve interobserver agreement. Fourteen doctors were randomised to two training sessions, or to no training, and asked to categorise 42 unselected pairs of plain radiographs of fractures of the proximal humerus according to the Neer system. The mean kappa difference between the training and control groups was 0.30 (95% CI 0.10 to 0.50, p = 0.006). In the training group the mean kappa value for interobserver variation improved from 0.27 (95% CI 0.24 to 0.31) to 0.62 (95% CI 0.57 to 0.67). The improvement was particularly notable for specialists in whom kappa increased from 0.30 (95% CI 0.23 to 0.37) to 0.79 (95% CI 0.70 to 0.88). These results suggest that formal training in the Neer system is a prerequisite for its use in clinical practice and research


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 48 - 49
1 Jan 2002
Javed A Siddique M Vaghela M Hui ACW

We carried out a prospective study in order to establish to what extent the intra-articular evaluation undertaken during arthroscopy of the knee differed between surgeons. Two senior specialist registrars and a consultant orthopaedic surgeon with a special interest in knee surgery were involved. A total of 78 knee arthroscopies (78 patients) was studied. Arthroscopy was first carried out by the trainee and then by the senior author (ACWH). The intra-articular evaluation during the arthroscopy was recorded independently by a third person in the operating theatre. Data were collected to record variations in examination under anaesthesia, the morphology and pathology of the menisci and anterior cruciate ligament and the state of the articular surfaces. The overall interobserver variation was 20% in all categories. We question the published results of intra-articular evaluation during knee arthroscopy when surgeons of different levels of experience are involved in a single study


The Journal of Bone & Joint Surgery British Volume
Vol. 82-B, Issue 5 | Pages 636 - 642
1 Jul 2000
Wainwright AM Williams JR Carr AJ

We assessed the inter- and intraobserver variation in classification systems for fractures of the distal humerus. Three orthopaedic trauma consultants, three trauma registrars and three consultant musculoskeletal radiologists independently classified 33 sets of radiographs of such fractures on two occasions, each using three separate systems. For interobserver variation, the Riseborough and Radin system produced ‘moderate’ agreement (kappa = 0.513), but half of the fractures were not classifiable by this system. For the complete AO system, agreement was ‘fair’ (kappa = 0.343), but if only AO type and group or AO type alone was used, agreement improved to ‘moderate’ and ‘substantial’, respectively (kappa = 0.52 and 0.66). Agreement for the system of Jupiter and Mehne was ‘fair’ (kappa = 0.295). Similar levels of intraobserver variation were found. Systems of classification are useful in decision-making and evaluation of outcome only if there is agreement and consistency among observers. Our study casts doubt on these aspects of the systems currently available for fractures of the distal humerus


The Journal of Bone & Joint Surgery British Volume
Vol. 84-B, Issue 1 | Pages 15 - 18
1 Jan 2002
Whelan DB Bhandari M McKee MD Guyatt GH Kreder HJ Stephen D Schemitsch EH

The reliability of the radiological assessment of the healing of tibial fractures remains undetermined. We examined the inter- and intraobserver agreement of the healing of such fractures among four orthopaedic trauma surgeons who, on two separate occasions eight weeks apart, independently assessed the radiographs of 30 patients with fractures of the tibial shaft which had been treated by intramedullary fixation. The radiographs were selected from a database to represent fractures at various stages of healing. For each radiograph, the surgeon scored the degree of union, quantified the number of cortices bridged by callus or with a visible fracture line, described the extent and quality of the callus, and provided an overall rating of healing. The interobserver chance-corrected agreement using a quadratically weighted kappa (κ) statistic in which values of 0.61 to 0.80 represented substantial agreement were as follows: radiological union scale (κ = 0.60); number of cortices bridged by callus (κ = 0.75); number of cortices with a visible fracture line (κ = 0.70); the extent of the callus (κ = 0.57); and general impression of fracture healing (κ = 0.67). The intraobserver agreement of the overall impression of healing (κ = 0.89) and the number of cortices bridged by callus (κ = 0.82) or with a visible fracture line (κ = 0.83) was almost perfect. There are no validated scales which allow surgeons to grade fracture healing radiologically. Among those examined, the number of cortices bridged by bone appears to be a reliable, and easily measured radiological variable to assess the healing of fractures after intramedullary fixation


The Bone & Joint Journal
Vol. 105-B, Issue 10 | Pages 1123 - 1130
1 Oct 2023
Donnan M Anderson N Hoq M Donnan L

Aims. The aim of this study was to investigate the agreement in interpretation of the quality of the paediatric hip ultrasound examination, the reliability of geometric and morphological assessment, and the relationship between these measurements. Methods. Four investigators evaluated 60 hip ultrasounds and assessed their quality based the standard plane of Graf et al. They measured geometric parameters, described the morphology of the hip, and assigned the Graf grade of dysplasia. They analyzed one self-selected image and one randomly selected image from the ultrasound series, and repeated the process four weeks later. The intra- and interobserver agreement, and correlations between various parameters were analyzed. Results. In the assessment of quality, there a was moderate to substantial intraobserver agreement for each element investigated, but interobserver agreement was poor. Morphological features showed weak to moderate agreement across all parameters but improved to significant when responses were reduced. The geometric measurements showed nearly perfect agreement, and the relationship between them and the morphological features showed a dose response across all parameters with moderate to substantial correlations. There were strong correlations between geometric measurements. The Graf classification showed a fair to moderate interobserver agreement, and moderate to substantial intraobserver agreement. Conclusion. This investigation into the reliability of the interpretation of hip ultrasound scans identified the difficulties in defining what is a high-quality ultrasound. We confirmed that geometric measurements are reliably interpreted and may be useful as a further measurement of quality. Morphological features are generally poorly interpreted, but a simpler binary classification considerably improves agreement. As there is a clear dose response relationship between geometric and morphological measurements, the importance of morphology in the diagnosis of hip dysplasia should be questioned. Cite this article: Bone Joint J 2023;105-B(10):1123–1130


The Bone & Joint Journal
Vol. 106-B, Issue 3 | Pages 227 - 231
1 Mar 2024
Todd NV Casey A Birch NC

The diagnostic sub-categorization of cauda equina syndrome (CES) is used to aid communication between doctors and other healthcare professionals. It is also used to determine the need for, and urgency of, MRI and surgery in these patients. A recent paper by Hoeritzauer et al (2023) in this journal examined the interobserver reliability of the widely accepted subcategories in 100 patients with cauda equina syndrome. They found that there is no useful interobserver agreement for the subcategories, even for experienced spinal surgeons. This observation is supported by the largest prospective study of the treatment of cauda equina syndrome in the UK by Woodfield et al (2023). If the accepted subcategories are unreliable, they cannot be used in the way that they are currently, and they should be revised or abandoned. This paper presents a reassessment of the diagnostic and prognostic subcategories of cauda equina syndrome in the light of this evidence, with a suggested cure based on a more inclusive synthesis of symptoms, signs, bladder ultrasound scan results, and pre-intervention urinary catheterization. Cite this article: Bone Joint J 2024;106-B(3):227–231


The Bone & Joint Journal
Vol. 106-B, Issue 9 | Pages 898 - 906
1 Sep 2024
Kayani B Wazir MUK Mancino F Plastow R Haddad FS

Aims. The primary objective of this study was to develop a validated classification system for assessing iatrogenic bone trauma and soft-tissue injury during total hip arthroplasty (THA). The secondary objective was to compare macroscopic bone trauma and soft-tissues injury in conventional THA (CO THA) versus robotic arm-assisted THA (RO THA) using this classification system. Methods. This study included 30 CO THAs versus 30 RO THAs performed by a single surgeon. Intraoperative photographs of the osseous acetabulum and periacetabular soft-tissues were obtained prior to implantation of the acetabular component, which were used to develop the proposed classification system. Interobserver and intraobserver variabilities of the proposed classification system were assessed. Results. The BOne trauma and Soft-Tissue Injury classification system in total Hip arthroplasty (BOSTI Hip) grades osseous acetabular trauma and periarticular muscle damage during THA. The classification system has an interclass correlation coefficient of 0.90 (95% CI 0.86 to 0.93) for interobserver agreement and 0.89 (95% CI 0.84 to 0.93) for intraobserver agreement. RO THA was associated with improved BOSTI Hip scores (p = 0.002) and more pristine osseous surfaces in the anterior superior (p = 0.001) and posterior superior (p < 0.001) acetabular quadrants compared with CO THA. There were no differences between the groups in relation to injury to the gluteus medius (p = 0.084), obturator internus (p = 0.241), piriformis (p = 0.081), superior gamellus (p = 0.116), inferior gamellus (p = 0.132), quadratus femoris (p = 0.208), and vastus lateralis (p = 0.135), but overall combined muscle injury was reduced in RO THA compared with CO THA (p = 0.023). Discussion. The proposed BOSTI Hip classification provides a reproducible grading system for stratifying iatrogenic bone trauma and soft-tissue injury during THA. RO THA was associated with improved BOSTI Hip scores, more pristine osseous acetabular surfaces, and reduced combined periarticular muscle injury compared with CO THA. Further research is required to understand if these intraoperative findings translate to differences in clinical outcomes between the treatment groups. Cite this article: Bone Joint J 2024;106-B(9):898–906


The Bone & Joint Journal
Vol. 106-B, Issue 5 | Pages 468 - 474
1 May 2024
d'Amato M Flevas DA Salari P Bornes TD Brenneis M Boettner F Sculco PK Baldini A

Aims. Obtaining solid implant fixation is crucial in revision total knee arthroplasty (rTKA) to avoid aseptic loosening, a major reason for re-revision. This study aims to validate a novel grading system that quantifies implant fixation across three anatomical zones (epiphysis, metaphysis, diaphysis). Methods. Based on pre-, intra-, and postoperative assessments, the novel grading system allocates a quantitative score (0, 0.5, or 1 point) for the quality of fixation achieved in each anatomical zone. The criteria used by the algorithm to assign the score include the bone quality, the size of the bone defect, and the type of fixation used. A consecutive cohort of 245 patients undergoing rTKA from 2012 to 2018 were evaluated using the current novel scoring system and followed prospectively. In addition, 100 first-time revision cases were assessed radiologically from the original cohort and graded by three observers to evaluate the intra- and inter-rater reliability of the novel radiological grading system. Results. At a mean follow-up of 90 months (64 to 130), only two out of 245 cases failed due to aseptic loosening. Intraoperative grading yielded mean scores of 1.87 (95% confidence interval (CI) 1.82 to 1.92) for the femur and 1.96 (95% CI 1.92 to 2.0) for the tibia. Only 3.7% of femoral and 1.7% of tibial reconstructions fell below the 1.5-point threshold, which included the two cases of aseptic loosening. Interobserver reliability for postoperative radiological grading was 0.97 for the femur and 0.85 for the tibia. Conclusion. A minimum score of 1.5 points for each skeletal segment appears to be a reasonable cut-off to define sufficient fixation in rTKA. There were no revisions for aseptic loosening at mid-term follow-up when this fixation threshold was achieved or exceeded. When assessing first-time revisions, this novel grading system has shown excellent intra- and interobserver reliability. Cite this article: Bone Joint J 2024;106-B(5):468–474


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1345 - 1350
1 Aug 2021
Czubak-Wrzosek M Nitek Z Sztwiertnia P Czubak J Grzelecki D Kowalczewski J Tyrakowski M

Aims. The aim of the study was to compare two methods of calculating pelvic incidence (PI) and pelvic tilt (PT), either by using the femoral heads or acetabular domes to determine the bicoxofemoral axis, in patients with unilateral or bilateral primary hip osteoarthritis (OA). Methods. PI and PT were measured on standing lateral radiographs of the spine in two groups: 50 patients with unilateral (Group I) and 50 patients with bilateral hip OA (Group II), using the femoral heads or acetabular domes to define the bicoxofemoral axis. Agreement between the methods was determined by intraclass correlation coefficient (ICC) and the standard error of measurement (SEm). The intraobserver reproducibility and interobserver reliability of the two methods were analyzed on 31 radiographs in both groups to calculate ICC and SEm. Results. In both groups, excellent agreement between the two methods was obtained, with ICC of 0.99 and SEm 0.3° for Group I, and ICC 0.99 and SEm 0.4° for Group II. The intraobserver reproducibility was excellent for both methods in both groups, with an ICC of at least 0.97 and SEm not exceeding 0.8°. The study also revealed excellent interobserver reliability for both methods in both groups, with ICC 0.99 and SEm 0.5° or less. Conclusion. Either the femoral heads or acetabular domes can be used to define the bicoxofemoral axis on the lateral standing radiographs of the spine for measuring PI and PT in patients with idiopathic unilateral or bilateral hip OA. Cite this article: Bone Joint J 2021;103-B(8):1345–1350


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1339 - 1344
1 Aug 2021
Jain S Mohrir G Townsend O Lamb JN Palan J Aderinto J Pandit H

Aims. This aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with Cohen kappa statistic. Validity was tested on surgically managed UCS type B PFFs where stem stability was documented in operation notes (n = 50). Validity was assessed using percentage agreement and Cohen kappa statistic between radiological assessment and intraoperative findings. Kappa statistics were interpreted using Landis and Koch criteria. All six observers were blinded to operation notes and postoperative radiographs. Results. Interobserver reliability percentage agreement was 58.5% and the overall kappa value was 0.442 (moderate agreement). Lowest kappa values were seen for type B fractures (0.095 to 0.360). The mean intraobserver reliability kappa value was 0.672 (0.447 to 0.867), indicating substantial agreement. Validity percentage agreement was 65.7% and the mean kappa value was 0.300 (0.160 to 0.4400) indicating only fair agreement. Conclusion. This study demonstrates that the UCS is unsatisfactory for the classification of PFFs around PTS stems, and that it has considerably lower reliability and validity than previously described for other stem types. Radiological PTS stem loosening in the presence of PFF is poorly defined and formal intraoperative testing of stem stability is recommended. Cite this article: Bone Joint J 2021;103-B(8):1339–1344


The Bone & Joint Journal
Vol. 103-B, Issue 8 | Pages 1380 - 1385
2 Aug 2021
Kim Y Ryu J Kim JK Al-Dhafer BAA Shin YH

Aims. The aim of this study was to assess arthritis of the basal joint of the thumb quantitatively using bone single-photon emission CT/CT (SPECT/CT) and evaluate its relationship with patients’ pain and function. Methods. We retrospectively reviewed 30 patients (53 hands) with symptomatic basal joint arthritis of the thumb between April 2019 and March 2020. Visual analogue scale (VAS) scores for pain, grip strength, and pinch power of both hands and Patient-Rated Wrist/Hand Evaluation (PRWHE) scores were recorded for all patients. Basal joint arthritis was classified according to the modified Eaton-Glickel stage using routine radiographs and the CT scans of SPECT/CT, respectively. The maximum standardized uptake value (SUVmax) from SPECT/CT was measured in the four peritrapezial joints and the highest uptake was used for analysis. Results. According to Eaton-Glickel classification, 11, 17, 17, and eight hands were stage 0 to I, II, III, and IV, respectively. The interobserver reliability for determining the stage of arthritis was moderate for radiographs (k = 0.41) and substantial for CT scans (k = 0.67). In a binary categorical analysis using SUVmax, pain (p < 0.001) and PRWHE scores (p = 0.004) were significantly higher in hands with higher SUVmax. Using multivariate linear regression to estimate the pain VAS, only SUVmax (B 0.172 (95% confidence interval (CI) 0.065 to 0.279; p = 0.002) showed a significant association. Estimating the variation of PRWHE scores using the same model, only SUVmax (B 1.378 (95% CI, 0.082 to 2.674); p = 0.038) showed a significant association. Conclusion. The CT scans of SPECT/CT provided better interobserver reliability than routine radiographs for evaluating the severity of arthritis. A higher SUVmax in SPECT/CT was associated with more pain and functional disabilities of basal joint arthritis of the thumb. This approach could be used to complement radiographs for the evaluation of patients with this condition. Cite this article: Bone Joint J 2021;103-B(8):1380–1385


The Bone & Joint Journal
Vol. 105-B, Issue 1 | Pages 21 - 28
1 Jan 2023
Ndlovu S Naqshband M Masunda S Ndlovu K Chettiar K Anugraha A

Aims. Clinical management of open fractures is challenging and frequently requires complex reconstruction procedures. The Gustilo-Anderson classification lacks uniform interpretation, has poor interobserver reliability, and fails to account for injuries to musculotendinous units and bone. The Ganga Hospital Open Injury Severity Score (GHOISS) was designed to address these concerns. The major aim of this review was to ascertain the evidence available on accuracy of the GHOISS in predicting successful limb salvage in patients with mangled limbs. Methods. We searched electronic data bases including PubMed, CENTRAL, EMBASE, CINAHL, Scopus, and Web of Science to identify studies that employed the GHOISS risk tool in managing complex limb injuries published from April 2006, when the score was introduced, until April 2021. Primary outcome was the measured sensitivity and specificity of the GHOISS risk tool for predicting amputation at a specified threshold score. Secondary outcomes included length of stay, need for plastic surgery, deep infection rate, time to fracture union, and functional outcome measures. Diagnostic test accuracy meta-analysis was performed using a random effects bivariate binomial model. Results. We identified 1,304 records, of which six prospective cohort studies and two retrospective cohort studies evaluating a total of 788 patients were deemed eligible for inclusion. A diagnostic test meta-analysis conducted on five cohort studies, with 474 participants, showed that GHOISS at a threshold score of 14 has a pooled sensitivity of 93.4% (95% confidence interval (CI) 78.4 to 98.2) and a specificity of 95% (95% CI 88.7 to 97.9) for predicting primary or secondary amputations in people with complex lower limb injuries. Conclusion. GHOISS is highly accurate in predicting success of limb salvage, and can inform management and predict secondary outcomes. However, there is a need for high-quality multicentre trials to confirm these findings and investigate the effectiveness of the score in children, and in predicting secondary amputations. Cite this article: Bone Joint J 2023;105-B(1):21–28


The Bone & Joint Journal
Vol. 105-B, Issue 6 | Pages 696 - 701
1 Jun 2023
Kurisunkal V Morris G Kaneuchi Y Bleibleh S James S Botchu R Jeys L Parry MC

Aims. Intra-articular (IA) tumours around the knee are treated with extra-articular (EA) resection, which is associated with poor functional outcomes. We aim to evaluate the accuracy of MRI in predicting IA involvement around the knee. Methods. We identified 63 cases of high-grade sarcomas in or around the distal femur that underwent an EA resection from a prospectively maintained database (January 1996 to April 2020). Suspicion of IA disease was noted in 52 cases, six had IA pathological fracture, two had an effusion, two had prior surgical intervention (curettage/IA intervention), and one had an osseous metastasis in the proximal tibia. To ascertain validity, two musculoskeletal radiologists (R1, R2) reviewed the preoperative imaging (MRI) of 63 consecutive cases on two occasions six weeks apart. The radiological criteria for IA disease comprised evidence of tumour extension within the suprapatellar pouch, intercondylar notch, extension along medial/lateral retinaculum, and presence of IA fracture. The radiological predictions were then confirmed with the final histopathology of the resected specimens. Results. The resection histology revealed 23 cases (36.5%) showing IA disease involvement compared with 40 cases without (62%). The intraobserver variability of R1 was 0.85 (p < 0.001) compared to R2 with κ = 0.21 (p = 0.007). The interobserver variability was κ = 0.264 (p = 0.003). Knee effusion was found to be the most sensitive indicator of IA involvement, with a sensitivity of 91.3% but specificity of only 35%. However, when combined with a pathological fracture, this rose to 97.5% and 100% when disease was visible in Hoffa’s fat pad. Conclusion. MRI imaging can sometimes overestimate IA joint involvement and needs to be correlated with clinical signs. In the light of our findings, we would recommend EA resections when imaging shows effusion combined with either disease in Hoffa’s fat pad or retinaculum, or pathological fractures. Cite this article: Bone Joint J 2023;105-B(6):696–701


The Bone & Joint Journal
Vol. 102-B, Issue 8 | Pages 1041 - 1047
1 Aug 2020
Hamoodi Z Singh J Elvey MH Watts AC

Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the reliability and reproducibility of this classification system. Methods. This was a blinded study using radiographs and CT scans of 48 consecutive patients managed according to the Wrightington classification system between 2010 and 2018. Four trauma and orthopaedic consultants, two post CCT fellows, and one speciality registrar based in the UK classified the injuries. The seven observers reviewed preoperative radiographs and CT scans twice, with a minimum four-week interval. Radiographs and CT scans were reviewed separately. Inter- and intraobserver reliability were calculated using Fleiss and Cohen kappa coefficients. The Landis and Koch criteria were used to interpret the strength of the kappa values. Validity was assessed by calculating the percentage agreement against intraoperative findings. Results. Of the 48 patients, three (6%) had type A injury, 11 (23%) type B, 16 (33%) type B+, 16 (33%) Type C, two (4%) type D+, and none had a type D injury. All 48 patients had anteroposterior (AP) and lateral radiographs, 44 had 2D CT scans, and 39 had 3D reconstructions. The interobserver reliability kappa value was 0.52 for radiographs, 0.71 for 2D CT scans, and 0.73 for a combination of 2D and 3D reconstruction CT scans. The median intraobserver reliability was 0.75 (interquartile range (IQR) 0.62 to 0.79) for radiographs, 0.77 (IQR 0.73 to 0.94) for 2D CT scans, and 0.89 (IQR 0.77 to 0.93) for the combination of 2D and 3D reconstruction. Validity analysis showed that accuracy significantly improved when using CT scans (p = 0.018 and p = 0.028 respectively). Conclusion. The Wrightington classification system is a reliable and valid method of classifying fracture-dislocations of the elbow. CT scans are significantly more accurate than radiographs when identifying the pattern of injury, with good intra- and interobserver reproducibility. Cite this article: Bone Joint J 2020;102-B(8):1041–1047


The Bone & Joint Journal
Vol. 102-B, Issue 1 | Pages 102 - 107
1 Jan 2020
Sharma N Brown A Bouras T Kuiper JH Eldridge J Barnett A

Aims. Trochlear dysplasia is a significant risk factor for patellofemoral instability. The Dejour classification is currently considered the standard for classifying trochlear dysplasia, but numerous studies have reported poor reliability on both plain radiography and MRI. The severity of trochlear dysplasia is important to establish in order to guide surgical management. We have developed an MRI-specific classification system to assess the severity of trochlear dysplasia, the Oswestry-Bristol Classification (OBC). This is a four-part classification system comprising normal, mild, moderate, and severe to represent a normal, shallow, flat, and convex trochlear, respectively. The purpose of this study was to assess the inter- and intraobserver reliability of the OBC and compare it with that of the Dejour classification. Methods. Four observers (two senior and two junior orthopaedic surgeons) independently assessed 32 CT and axial MRI scans for trochlear dysplasia and classified each according to the OBC and the Dejour classification systems. Assessments were repeated following a four-week interval. The inter- and intraobserver agreement was determined by using Fleiss’ generalization of Cohen’s kappa statistic and S-statistic nominal and linear weights. Results. The OBC showed fair-to-good interobserver agreement and good-to-excellent intraobserver agreement (mean kappa 0.68). The Dejour classification showed poor interobserver agreement and fair-to-good intraobserver agreement (mean kappa 0.52). Conclusion. The OBC can be used to assess the severity of trochlear dysplasia. It can be applied in clinical practice to simplify and standardize surgical decision-making in patients with recurrent patella instability. Cite this article: Bone Joint J 2020;102-B(1):102–107