Advertisement for orthosearch.org.uk
Results 1 - 20 of 128
Results per page:
Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_14 | Pages 3 - 3
10 Oct 2023
Verma S Malaviya S Barker S
Full Access

Technological advancements in orthopaedic surgery have mainly focused on increasing precision during the operation however, there have been few developments in post-operative physiotherapy. We have developed a computer vision program using machine learning that can virtually measure the range of movement of a joint to track progress after surgery. This data can be used by physiotherapists to change patients’ exercise regimes with more objectively and help patients visualise the progress that they have made. In this study, we tested our program's reliability and validity to find a benchmark for future use on patients. We compared 150 shoulder joint angles, measured using a goniometer, and those calculated by our program called ArmTracking in a group of 10 participants (5 males and 5 females). Reliability was tested using adjusted R squared and validity was tested using 95% limits of agreement. Our clinically acceptable limit of agreement was ± 10° for ArmTracking to be used interchangeably with goniometry. ArmTracking showed excellent overall reliability of 97.1% when all shoulder movements were combined but there were lower scores for some movements like shoulder extension at 75.8%. There was moderate validity shown when all shoulder movements were combined at 9.6° overestimation and 18.3° underestimation. Computer vision programs have a great potential to be used in telerehabilitation to collect useful information as patients carry out prescribed exercises at home. However, they need to be trained well for precise joint detections to reduce the range of errors in readings


Bone & Joint Open
Vol. 5, Issue 6 | Pages 524 - 531
24 Jun 2024
Woldeyesus TA Gjertsen J Dalen I Meling T Behzadi M Harboe K Djuv A

Aims. To investigate if preoperative CT improves detection of unstable trochanteric hip fractures. Methods. A single-centre prospective study was conducted. Patients aged 65 years or older with trochanteric hip fractures admitted to Stavanger University Hospital (Stavanger, Norway) were consecutively included from September 2020 to January 2022. Radiographs and CT images of the fractures were obtained, and surgeons made individual assessments of the fractures based on these. The assessment was conducted according to a systematic protocol including three classification systems (AO/Orthopaedic Trauma Association (OTA), Evans Jensen (EVJ), and Nakano) and questions addressing specific fracture patterns. An expert group provided a gold-standard assessment based on the CT images. Sensitivities and specificities of surgeons’ assessments were estimated and compared in regression models with correlations for the same patients. Intra- and inter-rater reliability were presented as Cohen’s kappa and Gwet’s agreement coefficient (AC1). Results. We included 120 fractures in 119 patients. Compared to radiographs, CT increased the sensitivity of detecting unstable trochanteric fractures from 63% to 70% (p = 0.028) and from 70% to 76% (p = 0.004) using AO/OTA and EVJ, respectively. Compared to radiographs alone, CT increased the sensitivity of detecting a large posterolateral trochanter major fragment or a comminuted trochanter major fragment from 63% to 76% (p = 0.002) and from 38% to 55% (p < 0.001), respectively. CT improved intra-rater reliability for stability assessment using EVJ (AC1 0.68 to 0.78; p = 0.049) and for detecting a large posterolateral trochanter major fragment (AC1 0.42 to 0.57; p = 0.031). Conclusion. A preoperative CT of trochanteric fractures increased detection of unstable fractures using the AO/OTA and EVJ classification systems. Compared to radiographs, CT improved intra-rater reliability when assessing fracture stability and detecting large posterolateral trochanter major fragments. Cite this article: Bone Jt Open 2024;5(6):524–531


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 275 - 275
1 Sep 2012
Dawoodi A Perera A
Full Access

Background. Metatarsus adductus is the most common forefoot deformity. Variable prevalence values were reported in literature using different techniques in different populations. Numerous radiological measurements have been proposed to assess this deformity with a paucity of studies reporting the reliability of these methods. The metatarsus adductus angle was shown to correlate with the severity of hallux abductovalgus in normal feet and preselected populations of juvenile hallux valgus. Materials & Methods. Weight bearing dorsoplantar radiographs of 150 feet were examined for 5 angles commonly used in assessing metatarsus adductus: angle between the second metatarsus and the longitudinal axis of the lesser tarsus (using the 4th or 5th metatarso-cuboid joint as a reference), Engel's angle and modified angle's angle. The prevalence of metatarsus adductus was assessed according to published criteria for different techniques. Inter and intra-observer reliabilities of these angles were evaluated on 50 X-rays. Linear regression tests were used to assess the correlation between hallux valgus and different angles used in assessing metatarsus adductus. Results. Intraclass correlation coefficients were high for intra- as well as inter-observer reliability for the 5 angles tested. Prevalence of metatarsus adductus ranged (45–70%) depending on the angle used in the same population. Only the metatarsus adductus angle using the 4th metatasro-cuboid joint as a reference demonstrated significant correlation between metatarsus adductus and hallux abductovalgus angles. Conclusions. Five techniques commonly used in assessing metatarsus adductus demonstrated high inter and intra-observer reliability values. Prevalence of metatarsus adductus and the correlation between the severity of this deformity and hallux valgus angle is sensitive to the assessment method


The Journal of Bone & Joint Surgery British Volume
Vol. 80-B, Issue 4 | Pages 670 - 672
1 Jul 1998
Flinkkilä T Nikkola-Sihto A Kaarela O Päakkö E Raatikainen T

Interobserver reliability of the AO system of classification of fractures of the distal radius was assessed using plain radiographs and CT. Five observers classified 30 Colles’-type fractures using only plain radiographs; two months later they were reclassified using CT in addition. Interobserver reliability was poor in both series when detailed classification was used. By reducing the categories to five, interobserver reliability was slightly improved, but was still poor. When only two AO types were used, the reliability was moderate using plain radiographs and good to excellent with the addition of CT. The use of CT as well as plain radiographs brings interobserver reliability to a good level in assessment of the presence or absence of articular involvement, but is otherwise of minor value in improving the interobserver reliability of the AO system of classification of fractures of the distal radius


The Journal of Bone & Joint Surgery British Volume
Vol. 88-B, Issue 9 | Pages 1204 - 1206
1 Sep 2006
Malek IA Machani B Mevcha AM Hyder NH

Our aim was to assess the reproducibility and the reliability of the Weber classification system for fractures of the ankle based on anteroposterior and lateral radiographs. Five observers with varying clinical experience reviewed 50 sets of blinded radiographs. The same observers reviewed the same radiographs again after an interval of four weeks. Inter- and intra-observer agreement was assessed based on the proportion of agreement and the values of the kappa coefficient. For inter-observer agreement, the mean kappa value was 0.61 (0.59 to 0.63) and the proportion of agreement was 78% (76% to 79%) and for intra-observer agreement the mean kappa value was 0.74 (0.39 to 0.86) with an 85% (60% to 93%) observed agreement. These results show that the Weber classification of fractures of the ankle based on two radiological views has substantial inter-observer reliability and intra-observer reproducibility


The Journal of Bone & Joint Surgery British Volume
Vol. 91-B, Issue 6 | Pages 766 - 771
1 Jun 2009
Brunner A Honigmann P Treumann T Babst R

We evaluated the impact of stereo-visualisation of three-dimensional volume-rendering CT datasets on the inter- and intraobserver reliability assessed by kappa values on the AO/OTA and Neer classifications in the assessment of proximal humeral fractures. Four independent observers classified 40 fractures according to the AO/OTA and Neer classifications using plain radiographs, two-dimensional CT scans and with stereo-visualised three-dimensional volume-rendering reconstructions. Both classification systems showed moderate interobserver reliability with plain radiographs and two-dimensional CT scans. Three-dimensional volume-rendered CT scans improved the interobserver reliability of both systems to good. Intraobserver reliability was moderate for both classifications when assessed by plain radiographs. Stereo visualisation of three-dimensional volume rendering improved intraobserver reliability to good for the AO/OTA method and to excellent for the Neer classification. These data support our opinion that stereo visualisation of three-dimensional volume-rendering datasets is of value when analysing and classifying complex fractures of the proximal humerus


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 400 - 400
1 Sep 2012
Odri G Fraquet N Isnard J Redon H Frioux R Gouin F
Full Access

Cam type femoroacetabular impingement (FAI) is due to an aspheric femoral head, which is best quantified by the alpha angle described on MRI and CT-scan. Radiographic measurement of the alpha angle is not well codified and studies from the literature cannot conclude on the best view to measure it. Most authors also describe a mixed type FAI which associates an aspheric femoral head with an excessive anterior acetabular coverage of the femoral head. Anterior center edge (ACE) angle has been described on the false profile view to measure anterior acetabular coverage in hip dysplasia and has never been evaluated in FAI. In this study, we developed a new lateral hip view which associates a lateral view of the femoral neck and a false profile view of the acétabulum, which we called profile view in impingement position (PVIP). Twenty six patients operated for FAI had CT-scan, the PVIP and the false profile view of one or two hips according to pain. A control group of 19 patients who did not suffer from the hip had the PVIP. Alpha angles were measured twice on 17 CT scan of FAI patients by two observers and compared with the alpha angles measured on the corresponding hip PVIP by a correlation analysis. Alpha angles were measured twice on 45 PVIP in FAI patient and on 19 PVIP in the control group by three observers. ACE angles were measured once on 15 PVIP and on 15 false profile views. Means were compared by two tail paired t-tests, intra- and inter-observer reliability were measured by intraclass correlation coefficient. Mean alpha angle on CT scan was 65.8° and 65.6° for observers 1 and 2 respectively (p>0.05). It was 63.6° and 64.3° on the PVIP (p>0.05). No significant difference was found between CT scan and radiographic measurements, and Pearson's correlation coefficients were good at 0.74 and 0.8. ICC was 0.86 for inter-rater reliability, and 0.91 for intra-rater reliability for CT-scan alpha angle measures. ICC for PVIP measures varied from 0.82 to 0.9 for intra-rater reliability and from 0.6 to 0.9 for inter-rater reliability. Mean alpha angle measured on PVIP in FAI patients was 63.3° and was 44.9° in control subjects and the difference was significant (p<0.001) for the three observers. None of the FAI patients and 88% of the control subjects had an alpha angle < 50°. Mean ACE angle was 26.8° on PVIP and 32.8° on the false profile view, the difference was significant (p=0.015), and the Pearson's correlation coefficient was moderate (r=0.58). The PVIP is a reliable radiographic view to measure the alpha angle. It allows a good quantification of the alpha angle comparable to CT-scan measurements and permits to differentiate patients from control subjects. PVIP is not a good view to quantify anterior edge angle probably because of acetabular retroversion due to the hip flexion needed in this view. Mean ACE angle measured on the false profile view in FAI patient was comparable to ACE angle in general population reported in the literature


The Bone & Joint Journal
Vol. 100-B, Issue 2 | Pages 242 - 246
1 Feb 2018
Ghoshal A Enninghorst N Sisak K Balogh ZJ

Aims. To evaluate interobserver reliability of the Orthopaedic Trauma Association’s open fracture classification system (OTA-OFC). Patients and Methods. Patients of any age with a first presentation of an open long bone fracture were included. Standard radiographs, wound photographs, and a short clinical description were given to eight orthopaedic surgeons, who independently evaluated the injury using both the Gustilo and Anderson (GA) and OTA-OFC classifications. The responses were compared for variability using Cohen’s kappa. Results. The overall interobserver agreement was ĸ = 0.44 for the GA classification and ĸ = 0.49 for OTA-OFC, which reflects moderate agreement (0.41 to 0.60) for both classifications. The agreement in the five categories of OTA-OFC was: for skin, ĸ = 0.55 (moderate); for muscle, ĸ = 0.44 (moderate); for arterial injury, ĸ = 0.74 (substantial); for contamination, ĸ = 0.35 (fair); and for bone loss, ĸ = 0.41 (moderate). Conclusion. Although the OTA-OFC, with similar interobserver agreement to GA, offers a more detailed description of open fractures, further development may be needed to make it a reliable and robust tool. Cite this article: Bone Joint J 2018;100-B:242–6


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 272 - 272
1 Sep 2012
Rolfson O Salomonsson R Dahlberg L Garellick G
Full Access

This randomised methodological study sought to test the reliability of an Internet questionnaire and investigate the differences in response rates between traditional pen-and-paper questionnaires and Internet questionnaires for measuring patient-reported outcome after total hip arthroplasty (THA) surgery. From the Swedish Hip Arthroplasty Register, 2 400 patients were chosen at random but stratified by age, sex and diagnosis for inclusion in a four-year follow-up using the health-related quality of life (HRQoL) tool EQ-5D and visual analogue scales for pain and satisfaction. The patients were randomized to answer the follow-up model protocol either via a password-protected Internet questionnaire or via a mailed pen-and-paper questionnaire. A reliability test for the Internet follow-up instrument showed adequate correlation. However, the Internet group and the pen-and-paper group differed significantly (p<0.001) with a 92% response rate in the latter and 49% in the former. Adjusted to the normal age distribution of the THA population, the Internet response rate was 34%. The patient-administered Internet questionnaire alone does not give a sufficient response rate in the THA population to replace the pen-and-paper questionnaire. However, the system is reliable and could be used for measuring patient- reported outcome if supplemented with traditional pen-and-paper questionnaires for Internet non-respondents. It is expected that this answer procedure will soon predominate in view of the general development of Internet functions. Register work may then become less resource-consuming and the results may be analysed in real time


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 122 - 122
1 Sep 2012
Jensen C Overgaard S Aagaard P
Full Access

Introduction. Total leg muscle function in hip OA patients is not well studied. We used a test-retest protocol to evaluate the reproducibility of single- and multi-joint peak muscle torque and rapid torque development in a group of 40–65 yr old hip patients. Both peak torque and torque development are outcome measures associated with functional performance during activities of daily living. Material and Methods. Patients: Twenty patients (age 55.5±3.3, BMI 27.6±4.8) who underwent total hip arthroplasty participated in this study. Reliability: We used the intra-class correlation (ICC) and within subject coefficients of variation (CVws) to evaluate reliability. Agreement: Relative Bland-Altman 95% limits of agreements (LOA) and smallest detectable difference (SDD) were calculated and used for evaluation of measurement accuracy. Parameters: Maximal muscle strength (peak torque, Nm) and rate of torque development (Nm•sec-1) for affected (AF) and non-affected (NA) side were measured during unilateral knee extension-flexion (seated), hip extension-flexion, and hip adduction-abduction (standing), respectively. Contractile RTD100, 200, peak was derived as the average slope of the torque-time curve (torque/time) at 0–100, 0–200 and 0 peak relative to onset of contraction. Protocol: After 5 min level walking at self-selected and maximum speeds each muscle group was tested using 1–2 sub-maximal contraction efforts followed by 3 maximal contractions 4s duration. Statistics: The variance components were estimated using STATA12, with muscle function and occasion as independent variable and patients as random factor, using the restricted maximum likelihood method (=0.05). Results. For all exercises and sides, the ICC's for peak torque were good (0.81–0.96) with CVws ranging from 5.0–10.8%. Similar good ICC's were observed for RTD200 on the non-affected side (0.83–0.93), whereas most exercises (4/6) on the affected side showed moderate to good ICC (0.72–0.82). We found moderate CVws for RTD200 with 12.8–18.7% and 10.3–18.9%, affected and non-affected, respectively. With few exceptions the ICC's and CVws for RTD100 were moderate to poor on the affected side but good to moderate on the non-affected side. The SDD's for peak torque ranged from 14.9 Nm to 39.0 Nm, equal to relative LOA of 13.9–23.8%. For RTD200, the SDD's were 77–257 Nm•sec-1 and 29.2–86.2%, absolute and relative, respectively. With few exceptions interventions measuring RTD100 and RTDpeak would have to find changes exceeding 60% for them to be statistical significant. Conclusions. Our novel set-up for lower limb isometric muscle testing showed overall good reproducibility for peak torque, moderate for RTD200, while poor for RTD100 and RTDpeak. The results for peak torque and RTD200 are promising for defining relevant changes in muscle function in future longitudinal clinical trials in this patient group


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 29 - 29
1 Sep 2012
Bajada S Harrison P Mofidi A Richardson J
Full Access

Introduction. Regenerative medicine is a rapidly expanding discipline. However due to a lack of validated outcome measures, clinical trials have been far few. This study aims to assess the validity, inter-observer reliability and intra-observer reproducibility of experimental fracture healing assessment on plain radiographies. This technique involves implantation of mesenchymal stem cell (MSC) seeded constructs on only one side of the fracture after randomisation. Methods. We examined inter/intraobserver agreement on the area and “bridging length” of callus formed on opposite sides of the fracture. Among 16 orthopaedic surgeons with trauma commitments (8 consultants, 8 registrars) on two separate occasions (average 52 days apart). They independently assessed the radiographs (AP or lateral) of 28 patients with fractures of the tibial or femoral shaft. The fractures chosen included non-unions treated with MSC/constructs and fresh fractures at 4–9 months. For each radiograph the assessor assigned which side (medial or lateral) is there more callus. Chase-corrected agreement using Fleiss kappa was used to compare opinions. Digital analysis software (Image-J) was used to quantify extent/bridging callus and correlate it with surgeons opinion. Results. Inter-observer variation showed a substantial overall agreement (k = 0.716) on the fracture side containing a larger “area” of callus but moderate agreement (k = 0.489) on side with more “bridging length”. These results were reproducible with a substantial overall intraobserver agreement. MSC/construct treated non-union showed a larger amount of agreement than fresh fractures for area (k = 0.754 vs 0.613) and bridging (0.550 vs 0.406). Utilizing digital analysis, non-unions showed a significant larger quantifiable difference between sides than fresh fractures (p = 0.009) for area but not bridging length (p = 0.269). Digital analysis quantification and surgeons opinion showed an almost perfect agreement for area (k = 0.867) and bridging (k = 0.846). Discussion. In this study we aimed to validate a novel method at studying the efficacy and effect of regenerative techniques on fracture healing. In particular, plain radiographs for comparing a treatment/internal control side. In this study we showed this method assessing area of callus is valid, reliable and reproducible. This is particularly so for MSC/construct treated non-union where the difference in both sides is higher as quantified in digital analysis. This is a novel method of experimental fracture healing using an internal control which decreases the variation between groups and sample size needed. This makes regenerative medicine clinical trials easier


Orthopaedic Proceedings
Vol. 95-B, Issue SUPP_33 | Pages 7 - 7
1 Sep 2013
Lavery J Blyth M Jones B Anthony I
Full Access

To validate the Modified Forgotten Joint Score (MFJS) as a new patient-reported outcome measure (PROM) in hip and knee arthroplasty (THR/TKR) against the UK's gold standard Oxford Hip and Knee Scores (OHS/OKS).

The MFJS is a new assessment tool devised to provide a greater discriminatory power, particularly in well performing patients. It measures an appealing concept; the ability of a patient to forget about their artificial joint in everyday life.

Postal questionnaires were sent out to 400 THR and TKR patients who were 1–2 years post-op. The data collected from the 212 returned questionnaires was analysed in relation to construct and content validity. 77 patients took part in a test-retest repeatability assessment.

The MFJS proved to have an increased discriminatory power in high-performing patients in comparison to the OHS and OKS, highlighted by its more normal frequency of distribution and reduced ceiling effects. 30.8% of patients (n=131) achieved excellent OHS/OKS scores of 42–48 this compared to just 7.69% of patients who achieved a proportionately equivalent MFJS score of 87.5–100. The MFJS proved to have an increased test-retest repeatability based upon its intra-class correlation coefficient of 0.97 compared to the Oxford's 0.85.

The MFJS provides a more sensitive tool in the assessment of well performing hip and knee arthroplasties in comparison to the OHS/OKS. The MFJS tests the concept of awareness of a prosthetic joint, rather than pain and function and therefore should be used as adjunct to the OKS/OHS.


Orthopaedic Proceedings
Vol. 95-B, Issue SUPP_16 | Pages 6 - 6
1 Apr 2013
Sakagoshi D Sawaguchi T Shima Y Inoue D Oshima T Goldhahn S
Full Access

Introduction

Tip apex distance (TAD) is reported as a predictor for cut outs of lag screws in the treatment of intertrochanteric fractures, and surgeons are adviced to strive for TAD within 20 mm. However the definition of neck axis and the limb position of lateral radiograph are not clearly described in the original literature. We propose the refined TAD by defining these factors. The objective of this study was to analyze the interobserver agreement of this refined TAD.

Materials and Method

X rays of 130 cases of unstable trochanteric fractures were used for the analysis of the refined TAD. In the refined TAD, neck axis was defined as the line between the center of femoral head and midpoint of narrowest part of the femoral neck, and lateral radiograph was taken with hip flexion 90 degrees and abduction 45 degrees. The refined TAD was independently measured by 2 experienced (observer 1,2) and 2 inexperienced (observer 3,4) orthopaedic surgeons who were trained with the new method before the measurement. Intraclass correlation coefficient (ICC [2,4]) was calculated to assess the interobserver agreement.


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXXVII | Pages 207 - 207
1 Sep 2012
Chandrasenan J Rajan R Price K
Full Access

The lateral pillar classification (LPC) is a widely used tool in determining prognosis and planning treatment in patients who are in the fragmentation stage of Perthes disease. The original classification has been modified to help increase the accuracy of the classification system by the Herring group. The purpose of our study was to independently assess this modified Herring classification.

35 standardized true antero-posterior radiographs of children in various stages of fragmentation were independently assessed by 6 senior observers on 2 separate occasions (6 weeks apart). Kappa analysis was used to assess the inter and intraobserver agreement between observations made. The degrees of agreement were as follows: poor, fair, moderate, good and very good.

Intraobserver analysis revealed at best only moderate agreement for two observers. 3 observers showed fair consistency, whilst 1 remaining observer showed poor consistency between repeated observations (p<0.01). The highest scores for interobserver agreement varying between moderate to good could only be established between 2 observers. For the remaining observers results were just fair (p<0.01).

This study highlights the lack of agreement between senior clinicians when applying the modified LPC. This has clinical implications when applying the classification to the decision making process in treating patients at risk of developing adverse outcomes from the disease. To our knowledge, this is the first time the modified LPC has been independently tested for its reproducibility by another specialist paediatric orthopaedic unit.


The Bone & Joint Journal
Vol. 97-B, Issue 8 | Pages 1139 - 1143
1 Aug 2015
Hutt JRB Ortega-Briones A Daurka JS Bircher MD Rickman MS

The most widely used classification system for acetabular fractures was developed by Judet, Judet and Letournel over 50 years ago primarily to aid surgical planning. As population demographics and injury mechanisms have altered over time, the fracture patterns also appear to be changing. We conducted a retrospective review of the imaging of 100 patients with a mean age of 54.9 years (19 to 94) and a male to female ratio of 69:31 seen between 2010 and 2013 with acetabular fractures in order to determine whether the current spectrum of injury patterns can be reliably classified using the original system.

Three consultant pelvic and acetabular surgeons and one senior fellow analysed anonymous imaging. Inter-observer agreement for the classification of fractures that fitted into defined categories was substantial, (κ = 0.65, 95% confidence interval (CI) 0.51 to 0.76) with improvement to near perfect on inclusion of CT imaging (κ = 0.80, 95% CI 0.69 to 0.91). However, a high proportion of injuries (46%) were felt to be unclassifiable by more than one surgeon; there was moderate agreement on which these were (κ = 0.42 95% CI 0.31 to 0.54).

Further review of the unclassifiable fractures in this cohort of 100 patients showed that they tended to occur in an older population (mean age 59.1 years; 22 to 94 vs 47.2 years; 19 to 94; p = 0.003) and within this group, there was a recurring pattern of anterior column and quadrilateral plate involvement, with or without an incomplete posterior element injury.

Cite this article: Bone Joint J 2015;97-B:1139–43.


Bone & Joint Open
Vol. 5, Issue 11 | Pages 962 - 970
4 Nov 2024
Suter C Mattila H Ibounig T Sumrein BO Launonen A Järvinen TLN Lähdeoja T Rämö L

Aims. Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before. Methods. Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver reliability and intraobserver reproducibility of RUSHU scoring using intraclass correlation coefficients (ICCs). Additionally, we determined the optimal cut-off thresholds for predicting nonunion using the receiver operating characteristic (ROC) method. Results. The RUSHU demonstrated good interobserver reliability with an ICC of 0.78 (95% CI 0.72 to 0.83) at six weeks and 0.77 (95% CI 0.71 to 0.82) at 12 weeks. Intraobserver reproducibility was good or excellent for all analyses. Area under the curve in the ROC analysis was 0.83 (95% CI 0.77 to 0.88) at six weeks and 0.89 (95% CI 0.84 to 0.93) at 12 weeks, indicating excellent discrimination. The optimal cut-off values for predicting nonunion were ≤ eight points at six weeks and ≤ nine points at 12 weeks, providing the best specificity-sensitivity trade-off. Conclusion. The RUSHU proves to be a reliable and reproducible radiological scoring system that aids in identifying patients at risk of nonunion at both six and 12 weeks post-injury during non-surgical treatment of humeral shaft fractures. The statistically optimal cut-off values for predicting nonunion are ≤ eight at six weeks and ≤ nine points at 12 weeks post-injury


Orthopaedic Proceedings
Vol. 104-B, Issue SUPP_6 | Pages 4 - 4
1 Jun 2022
Hoban K Downie S Adamson D MacLean J Cool P Jariwala AC
Full Access

Mirels’ score predicts the likelihood of sustaining pathological fractures using pain, lesion site, size and morphology. The aim is to investigate its reproducibility, reliability and accuracy in upper limb bony metastases and validate its use in pathological fracture prediction. A retrospective cohort study of patients with upper limb metastases, referred to an Orthopaedic Trauma Centre (2013–18). Mirels’ was calculated in 32 patients; plain radiographs at presentation scored by 6 raters. Radiological aspects were scored twice by each rater, 2-weeks apart. Inter- and intra-observer reliability were calculated (Fleiss’ kappa test). Bland-Altman plots compared variances of individual score components &total Mirels’ score. Mirels’ score of ≥9 did not accurately predict lesions that would fracture (11% 5/46 vs 65.2% Mirels’ score ≤8, p<0.0001). Sensitivity was 14.3% &specificity was 72.7%. When Mirels’ cut-off was lowered to ≥7, patients were more likely to fracture (48% 22/46 versus 28% 13/46, p=0.045). Sensitivity rose to 62.9%, specificity fell to 54.6%. Kappa values for interobserver variability were 0.358 (fair, 0.288–0.429) for lesion size, 0.107 (poor, 0.02–0.193) for radiological appearance and 0.274 (fair, 0.229–0.318) for total Mirels’ score. Values for intraobserver variability were 0.716 (good, 95% CI 0.432–0.999) for lesion size, 0.427 (moderate, 95% CI 0.195–0.768) for radiological appearance and 0.580 (moderate, 0.395–0.765) for total Mirels’ score. We showed moderate to substantial agreement between &within raters using Mirels’ score on upper limb radiographs. Mirels’ has poor sensitivity &specificity predicting upper limb fractures - we recommend the cut-off score for prophylactic surgery should be lower than for lower limb lesions


Orthopaedic Proceedings
Vol. 105-B, Issue SUPP_14 | Pages 8 - 8
10 Oct 2023
Leow J Oliver W Bell K Molyneux S Clement N Duckworth A
Full Access

To develop a reliable and effective radiological score to assess the healing of isolated ulnar shaft fractures (IUSF), the Radiographic Union Score for Ulna fractures (RUSU). Initially, 20 patients with radiographs six weeks following a non-operatively managed ulnar shaft fracture were selected and scored by three blinded observers. After intraclass correlation (ICC) analysis, a second group of 54 patients with radiographs six weeks after injury (18 who developed a nonunion and 36 who united) were scored by the same observers. In the initial study, interobserver and intraobserver ICC were 0.89 and 0.93, respectively. In the validation study the interobserver ICC was 0.85. The median score for patients who united was significantly higher than those who developed a nonunion (11 vs 7, p<0.001). A ROC curve demonstrated that a RUSU ≤8 had a sensitivity of 88.9% and specificity of 86.1% in identifying patients at risk of nonunion. Patients with a RUSU ≤8 (n = 21) were more likely to develop a nonunion (n = 16/21) than those with a RUSU ≥9 (n = 2/33; OR 49.6, 95% CI 8.6–284.7). Based on a PPV of 76%, if all patients with a RUSU ≤8 underwent fixation at 6-weeks, the number of procedures needed to avoid one nonunion would be 1.3. The RUSU shows good interobserver and intraobserver reliability and is effective in identifying patients at risk of nonunion six weeks after fracture. This tool requires external validation but may enhance the management of patients with isolated ulnar shaft fractures


The Bone & Joint Journal
Vol. 102-B, Issue 1 | Pages 17 - 25
1 Jan 2020
Trickett RW Mudge E Price P Pallister I

Aims. The aim of this study was to develop a psychometrically sound measure of recovery for use in patients who have suffered an open tibial fracture. Methods. An initial pool of 109 items was generated from previous qualitative data relating to recovery following an open tibial fracture. These items were field tested in a cohort of patients recovering from an open tibial fracture. They were asked to comment on the content of the items and structure of the scale. Reduction in the number of items led to a refined scale tested in a larger cohort of patients. Principal components analysis permitted further reduction and the development of a definitive scale. Internal consistency, test-retest reliability, and responsiveness were assessed for the retained items. Results. The initial scale was completed by 35 patients who were recovering from an open tibial fracture. Subjective and objective analysis permitted removal of poorly performing items and the addition of items suggested by patients. The refined scale consisted of 50 Likert scaled items and eight additional items. It was completed on 228 occasions by a different cohort of 204 patients with an open tibial fracture recruited from several UK orthoplastic tertiary referral centres. There were eight underlying components with tangible real-life meaning, which were retained as sub-scales represented by ten Likert scaled and eight non-Likert items. Internal consistency and test-retest reliability were good to excellent. Conclusion. The Wales Lower Limb Trauma Recovery (WaLLTR) Scale is the first tool to be developed from patient data with the potential to assess recovery following an open tibial fracture. Cite this article: Bone Joint J 2020;102-B(1):17–25


Bone & Joint Research
Vol. 5, Issue 4 | Pages 153 - 161
1 Apr 2016
Kleinlugtenbelt YV Nienhuis RW Bhandari M Goslings JC Poolman RW Scholtes VAB

Objectives. Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality of these studies. Our aim was to systematically review the methodological quality of clinimetric studies that evaluated measurement properties of PROMs used in patients with distal radial fractures, and to make recommendations for the selection of PROMs based on the level of evidence of each individual measurement property. Methods. A systematic literature search was performed in PubMed, EMbase, CINAHL and PsycINFO databases to identify relevant clinimetric studies. Two reviewers independently assessed the methodological quality of the studies on measurement properties, using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Level of evidence (strong / moderate / limited / lacking) for each measurement property per PROM was determined by combining the methodological quality and the results of the different clinimetric studies. Results. In all, 19 out of 1508 identified unique studies were included, in which 12 PROMs were rated. The Patient-rated wrist evaluation (PRWE) and the Disabilities of Arm, Shoulder and Hand questionnaire (DASH) were evaluated on most measurement properties. The evidence for the PRWE is moderate that its reliability, validity (content and hypothesis testing), and responsiveness are good. The evidence is limited that its internal consistency and cross-cultural validity are good, and its measurement error is acceptable. There is no evidence for its structural and criterion validity. The evidence for the DASH is moderate that its responsiveness is good. The evidence is limited that its reliability and the validity on hypothesis testing are good. There is no evidence for the other measurement properties. Conclusion. According to this systematic review, there is, at best, moderate evidence that the responsiveness of the PRWE and DASH are good, as are the reliability and validity of the PRWE. We recommend these PROMs in clinical studies in patients with distal radial fractures; however, more clinimetric studies of higher methodological quality are needed to adequately determine the other measurement properties. Cite this article: Dr Y. V. Kleinlugtenbelt. Are validated outcome measures used in distal radial fractures truly valid?: A critical assessment using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Bone Joint Res 2016;5:153–161. DOI: 10.1302/2046-3758.54.2000462