Advertisement for orthosearch.org.uk
Results 1 - 20 of 55
Results per page:

Aims

Classifying trochlear dysplasia (TD) is useful to determine the treatment options for patients suffering from patellofemoral instability (PFI). There is no consensus on which classification system is more reliable and reproducible for the purpose of guiding clinicians’ management of PFI. There are also concerns about the validity of the Dejour Classification (DJC), which is the most widely used classification for TD, having only a fair reliability score. The Oswestry-Bristol Classification (OBC) is a recently proposed system of classification of TD, and the authors report a fair-to-good interobserver agreement and good-to-excellent intraobserver agreement in the assessment of TD. The aim of this study was to compare the reliability and reproducibility of these two classifications.

Methods

In all, six assessors (four consultants and two registrars) independently evaluated 100 axial MRIs of the patellofemoral joint (PFJ) for TD and classified them according to OBC and DJC. These assessments were again repeated by all raters after four weeks. The inter- and intraobserver reliability scores were calculated using Cohen’s kappa and Cronbach’s α.


Orthopaedic Proceedings
Vol. 104-B, Issue SUPP_2 | Pages 1 - 1
1 Mar 2022
Lacey A Chiphang A
Full Access

16 to 34% of the population suffer from shoulder pain, the most common cause being rotator cuff tears. NICE guidance recommends using ultrasound scan (USS) or MRI to assess these patients, but does not specify which is preferable. This study assesses the accuracy of USS and MRI in rotator cuff tears in a DGH, to establish the most appropriate imaging modality. Patients who had at least two of shoulder ultrasound, MRI or arthroscopy within a seven month period (n=55) were included in this retrospective study. Sensitivity, Specificity, Positive Predictive Value (PPV) and Negative Predictive Value (NPV) were calculated using arthroscopy as the true result, and kappa coefficients calculated for each pairing. 59 comparisons were made in total. Sensitivity for MRI in full supraspinatus tears was 0.83, and for USS 0.75. Specificity for MRI in these tears was 0.75, and for USS 0.83. Values were much lower in other tears, which occurred less frequently. USS and MRI completely agreed with each other 61.3% of the time. Both modalities were only completely accurate 50% of the time. Kappa coefficient between arthroscopy and MRI for supraspinatus tears was 0.658, and for USS was 0.615. There was no statistical difference between MRI and USS sensitivity or specificity (p=1), suggesting that one modality cannot be recommended over the other for full supraspinatus tears. They also do not tend to corroborate one another, suggesting that there is no benefit from doing both scans. Further research is needed to see how both modalities can be improved to increase their accuracy


Orthopaedic Proceedings
Vol. 106-B, Issue SUPP_11 | Pages 12 - 12
4 Jun 2024
Chapman J Choudhary Z Gupta S Airey G Mason L
Full Access

Introduction. Treatment pathways of 5. th. metatarsal fractures are commonly directed based on fracture classification, with Jones types for example, requiring closer observation and possibly more aggressive management. Primary objective. To investigate the reliability of assessment of subtypes of 5. th. metatarsal fractures by different observers. Methods. Patients were identified from our prospectively collected database. We included all patient referred to our virtual fracture clinic with a suspected or confirmed 5. th. metatarsal fracture. Plain AP radiographs were reviewed by two observers, who were initially trained on the 5. th. metatarsal classification identification. Zones were defined as Zone 1.1, 1.2, 1.3, 2, 3, diaphyseal shaft (DS), distal metaphysis (DM) and head. An inter-observer reliability analysis using Cohen's Kappa coefficient was carried out, and degree of observer agreement described using Landis & Koch's description. All data was analysed using IBM SPSS v.27. Results. 878 patients were identified. The two observers had moderate agreement when identifying fractures in all zones, apart from metatarsal head fractures, which scored substantial agreement (K=.614). Zones 1.1 (K=.582), 2 (K=.536), 3 (K=.601) and DS (K=.544) all tended towards but did not achieve substantial agreement. Whilst DS fractures achieved moderate agreement, there was an apparent difficulty with distal DS, resulting in a lot of cross over with DM (DS 210 vs 109; DM 76 vs 161). Slight agreement with the next highest adjacent zone was found when injuries were thought to be in zones 1.2, 1.3 and 2 (K=0.17, 0.115 and 0.152 respectively). Conclusions. Reliability of sub-categorising 5. th. metatarsal fractures using standardised instructions conveys moderate to substantial agreement in most cases. If the region of the fracture is going to be used in an algorithm to guide a management plan and clinical follow up during a virtual clinic review, defining fractures of zones 1–3 needs careful consideration


Bone & Joint Research
Vol. 10, Issue 12 | Pages 759 - 766
1 Dec 2021
Nicholson JA Oliver WM MacGillivray TJ Robinson CM Simpson AHRW

Aims. The aim of this study was to establish a reliable method for producing 3D reconstruction of sonographic callus. Methods. A cohort of ten closed tibial shaft fractures managed with intramedullary nailing underwent ultrasound scanning at two, six, and 12 weeks post-surgery. Ultrasound capture was performed using infrared tracking technology to map each image to a 3D lattice. Using echo intensity, semi-automated mapping was performed to produce an anatomical 3D representation of the fracture site. Two reviewers independently performed 3D reconstructions and kappa coefficient was used to determine agreement. A further validation study was undertaken with ten reviewers to estimate the clinical application of this imaging technique using the intraclass correlation coefficient (ICC). Results. Nine of the ten patients achieved union at six months. At six weeks, seven patients had bridging callus of ≥ one cortex on the 3D reconstruction and when present all achieved union. Compared to six-week radiographs, no bridging callus was present in any patient. Of the three patients lacking sonographic bridging callus, one went onto a nonunion (77.8% sensitive and 100% specific to predict union). At 12 weeks, nine patients had bridging callus at ≥ one cortex on 3D reconstruction (100%-sensitive and 100%-specific to predict union). Presence of sonographic bridging callus on 3D reconstruction demonstrated excellent reviewer agreement on ICC at 0.87 (95% confidence interval 0.74 to 0.96). Conclusion. 3D fracture reconstruction can be created using multiple ultrasound images in order to evaluate the presence of bridging callus. This imaging modality has the potential to enhance the usability and accuracy of identification of early fracture healing. Cite this article: Bone Joint Res 2021;10(12):759–766


Orthopaedic Proceedings
Vol. 103-B, Issue SUPP_8 | Pages 9 - 9
1 May 2021
Nicholson JA Oliver WM Perks F Macgillivray T Robinson CM Simpson AHRW
Full Access

Sonographic callus may enable assessment of fracture healing. The aim of this study was to establish a reliable method for three-dimensional reconstruction of sonographic callus. Patients that underwent non-operative management of displaced midshaft clavicle fractures and intramedullary nailing of tibia fractures were prospectively recruited and followed to union. Ultrasound scanning was performed at periodical time points following injury. Infra-red tracking technology was used to map each image to a three-dimensional lattice. Criteria was fist established for two-dimensional bridging callus detection in a pilot study. Using echo intensity of the ultrasound image, semi-automated mapping was used to create an anatomic three-dimensional representation of fracture healing. Agreement on the presence of sonographic bridging callus was assessed using the kappa coefficient and intra-class-correlation (ICC) between observers. 112 clavicle fractures and 10 tibia fractures completed follow-up at six months. Sonographic bridging callus was detected in 62.5% (n=70/112) of the clavicles at six weeks post-injury. If present, union occurred in 98.6% of the fractures (n=69/70). If absent, nonunion developed in 40.5% of cases (n=17/42)(73.4%-sensitive and 100%-specific to predict union). Out of 10 tibia fractures, 7 had bridging callus of at least one cortex at 6 weeks and when present all united. Of the three patients lacking sonographic bridging callus, one went onto a nonunion (77.8%-sensitive and 100%-specific to predict union). The ICC for sonographic callus between four reviewers was 0.82 (95% CI 0.68–0.91). Three-dimensional ultrasound reconstruction of bridging callus has the potential to identify impaired fracture healing at an early stage in fracture management


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_X | Pages 72 - 72
1 Apr 2012
Sundaram R Shaw D De Matas M Pillay R
Full Access

To review the accuracy of our systematic process in preventing wrong level lumbar microdiscectomy. X-ray is used to identify the correct level for the skin incision to be made, x-ray is again used if the surgeon is in doubt prior performing the flavotomy. Following a lumbar microdiscectomy a Watson Chane is inserted into the empty disc space and an intra-operative x-ray is taken to confirm the level the discectomy has occurred. Observers A and B independently reviewed intra-operative x-ray in patients undergoing lumbar microdiscectomies and correlated the accuracy of the x-ray in determining correct level surgery against the pre-operative MRI scan and the preposed level of surgery. 123 patients, 66 males and 57 females underwent 127 lumbar microdiscectomy procedures between 2007 and 2009. The levels where surgery occurred are;- L2/3 -1 patient, L3/4–8 patients, L4/5–53 patients and L5/S1-65 patients. Kappa coefficient was used to determine inter-observer and Pearson Correlation coefficient was used to determine the X-ray and MRI relationship. Percentage of patients who required a pre-flavotomy x-ray level check are:- L2/3–100%, L3/4-63%, L4/5–45%, and L5/S1–40%. Pearson's correlation in confirming the level lumbar microdiscectomy was performed using final x-ray and the pre-operative MRI scan was 1. Kappa coefficient between observer A and B was 1. This process of using intra-operative x-ray in determining the exact level where lumbar microdiscectomy was performed is 100% accurate. This is our standard process in preventing wrong level surgery for lumbar microdiscectomy


Orthopaedic Proceedings
Vol. 103-B, Issue SUPP_2 | Pages 65 - 65
1 Mar 2021
Nicholson J
Full Access

Abstract. Objectives. Three-dimensional visualisation of sonographic callus has the potential to improve the accuracy and accessibility of ultrasound evaluation of fracture healing. The aim of this study was to establish a reliable method for producing three-dimensional reconstruction of sonographic callus. Methods. A prospective cohort of ten patients with a closed tibial shaft fracture managed with intramedullary nailing were recruited and underwent ultrasound scanning at 2-, 6- and 12-weeks post-surgery. Ultrasound B-mode capture was performed using infrared tracking technology to map each image to a three-dimensional lattice. Using echo intensity, semi-automated mapping was performed by two independent reviewers to produce an anatomic three-dimensional representation of the fracture. Agreement on the presence of sonographic bridging callus on three-dimensional reconstructions was assessed using the kappa coefficient. Results. Nine of the ten patients achieved union at six months. At six weeks, seven patients had bridging callus at ≥1 cortex on the three-dimensional reconstruction; when present all united. Compared to radiographs, no bridging callus was present in any patient. Of the three patients lacking sonographic bridging callus, one went onto a nonunion (77.8%-sensitive and 100%-specific to predict union). At twelve weeks, nine patients had bridging callus at ≥1 cortex on three-dimensional reconstruction and all united (100%-sensitive and 100%-specific to predict union). Compared to radiographs, seven of the nine patients that united had bridging callus. Three-dimensional reconstruction of the anteromedial and anterolateral tibial surface was achieved in all patients, and detection of sonographic bridging callus on the three-dimensional reconstruction demonstrated substantial inter-observer agreement (kappa=0.78, 95% confidence interval 0.29–1.0, p=0.011). Conclusions. Three-dimensional fracture reconstruction can be created using multiple ultrasound images in order to evaluate the presence of bridging callus. This imaging modality has the potential to identify impaired healing at an early stage in fracture management. Declaration of Interest. (b) declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported:I declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research project


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXV | Pages 116 - 116
1 Jun 2012
Konan S Rayan F Meermans G Witt J Haddad FS
Full Access

Introduction. In recent years, there has been a significant advancement in our understanding of femoro-acetabular impingement and associated labral and chondral pathology. Surgeons worldwide have demonstrated the successful treatment of these lesions via arthroscopic and open techniques. The aim of this study is to validate a simple and reproducible classification system for acetabular chondral lesions. Methods. In our classification system, the acetabulum is first divided into 6 zones as described by Ilizalithurri VM et al [Arthroscopy 24(5) 534-539]. The cartilage is then graded as 0 to 4 as follows: Grade 0 – normal articular cartilage lesions; Grade 1 softening or wave sign; Grade 2 - cleavage lesion; Grade 3 - delamination and Grade 4 –exposed bone. The site of the lesion is further typed as A, B or C based on whether the lesion is 1/3 distance from acetabular rim to cotyloid fossa, 1/3 to 2/3 distance from acetabular rim to cotyloid fossa and > 2/3 distance from acetabular rim to cotyloid fossa. For validating the classification system, six surgeons reviewed 14 hip arthroscopy video clips. All surgeons were provided with written explanation of our classification system. Each surgeon then individually graded the cartilage lesion. A single observer then compared results for observer variability using kappa statistics. Results. We observed a high inter-observer reliability of the classification system with a kappa coefficient of 0.89 (range 0.78 to 0.91) and high intra-observer reliability with a kappa coefficient of 0.91 (range 0.89 to 0.96). In conclusion we have developed a simple reproducible classification system for acetabular cartilage lesions seen at hip arthroscopy


Orthopaedic Proceedings
Vol. 93-B, Issue SUPP_III | Pages 304 - 304
1 Jul 2011
Konan S Rayan F Meermans G Witt J Haddad F
Full Access

Introduction: In recent years, there has been a significant advancement in our understanding of femoro-acetabular impingement and associated labral and chondral pathology. Surgeons worldwide have demonstrated the successful treatment of these lesions via arthroscopic and open techniques. The aim of this study is to validate a simple and reproducible classification system for acetabular chondral lesions. Methods: In our classification system, the acetabulum is first divided into 6 zones as described by Ilizalithurri VM et al [Arthroscopy 24(5) 534–539]. The cartilage is then graded as 0 to 4 as follows: Grade 0 – normal articular cartilage lesions; Grade 1 softening or wave sign; Grade 2 – cleavage lesion; Grade 3 – delamination and Grade 4 -exposed bone. The site of the lesion is further typed as A, B or C based on whether the lesion is 1/3 distance from acetabular rim to cotyloid fossa, 1/3 to 2/3 distance from acetabular rim to cotyloid fossa and > 2/3 distance from acetabular rim to cotyloid fossa. For validating the classification system, six surgeons reviewed 14 hip arthroscopy video clips. All surgeons were provided with written explanation of our classification system. Each surgeon then individually graded the cartilage lesion. A single observer then compared results for observer variability using kappa statistics. Results: We observed a high inter-observer reliability of the classification system with a kappa coefficient of 0.89 (range 0.78 to 0.91) and high intra-observer reliability with a kappa coefficient of 0.91 (range 0.89 to 0.96). Discussion: In conclusion we have developed a simple reproducible classification system for acetabular cartilage lesions


Orthopaedic Proceedings
Vol. 92-B, Issue SUPP_II | Pages 281 - 281
1 May 2010
El-Husseiny M Coleman N
Full Access

The study was designed to assess the reproducibility and reliability of Mirels scoring system and the conventional scoring system for impending pathological fractures. The results of both classification systems influence the choice of therapeutic procedures offered to these patients. Blinded plain antero-posterior radiographs from forty-seven patients with bone metastases were scored by eight independent observers (four orthopaedic surgeons and four radiologists with varying clinical experience). Each observer scored the radiographs as per the Mirels and the conventional systems. After twelve weeks, the radiographs were scored again by the same observers. Inter– and intra-observer agreement was assessed based on the weighted kappa coefficient values for both systems. For intra-observer reproducibility, kappa values for the conventional system had a mean of 0.495 (SD 0.12) showing a moderate agreement, while Mirels scoring system had a mean of 0.386 (SD 0.143) showing a fair agreement. For inter-observer reliability, kappa values for the conventional scoring system were 0.322 for the first test and 0.47 for the second test giving fair and moderate agreements respectively. For Mirels’ scoring system, kappa coefficient for inter-observer reliability was 0.183 for the first test and 0.218 for the second giving poor and fair agreements respectively. The conventional scoring system showed a better inter and intra-observer agreement compared to Mirels’ scoring system. Consultants had a better intra-observer agreement compared to their registrars. We also found that radiologists, in our study, had better intra-observer agreement compared to orthopaedic surgeons. Radiographs should be scored cautiously by experienced doctors in a multi-disciplinary setting to achieve the best decision of whether to proceed with a prophylactic surgery or not


Orthopaedic Proceedings
Vol. 86-B, Issue SUPP_II | Pages 113 - 113
1 Feb 2004
Behensky H Giesinger K Ogon M Krismer M
Full Access

Objective: To compare multi surgeon reliability of the classification systems of H. A. King and R.W. Coonrad and to analyse controversial classified curve patterns. Design: Three scoliosis surgeons and one orthopedic fellow were presented the AP radiographs of seventy adolescent idiopathic scoliosis patients. All reviewers assigned a type to each curve according to the classification systems of H. A. King [. 1. ] and R. W. Coonrad [. 2. ]. Subjects: Interobserver agreement and intraobserver reproducibility were tested. Kappa coefficients were used to test reliability. Between the observers, the divergent assignments to curve patterns were analysed in quantitative as well as in qualitative terms. An error analysis was performed. Results: For King’s classification, paired comparisons revealed a mean interobserver kappa coefficient of 0.45, and for Coonrad’s classification system 0.38, respectively. According to Svanholm et al., these values indicate poor reliability in terms of interobserver agreement. Error analyses for both classification systems revealed that the reason for poor reproducibility is disagreement on structural upper thoracic and structural lumbar curves among the observers. Conclusion: Neither King’s nor Coonrad’s method appear to have sufficient interobserver reliabilty. In order to improve reliability we recommend unequivocal description of structural stigmata of upper thoracic and lumbar curves


Orthopaedic Proceedings
Vol. 87-B, Issue SUPP_III | Pages 356 - 357
1 Sep 2005
Howie D Wallace R Wimhurst J MacDowell A
Full Access

Introduction and Aims: To aid the comparison of results of different techniques of femoral revision at total hip replacement and in choosing types of revision, a number of radiographic classifications have been proposed. We aimed to determine the reliability of five popular radiographic classification systems for grading the extent of femoral bone deficiency. Method: Twenty pre-revision total hip replacement femoral radiographs were assessed by a senior consultant specialist in revision surgery, a junior consultant, a fellow and a trainee registrar. The femoral bone deficiency was classified using the systems of the American Academy of Orthopaedic Surgeons (AAOS) and EndoKlinik, and those described by Paprosky, Gross and Gustillo. Intra-observer agreement and inter-observer agreement between assessors were determined using the kappa coefficient. Radiographs were reassessed after a minimum of two weeks. Kappa coefficients of 0.6–0.8 (substantial) or > 0.8 (almost perfect) were considered to indicate acceptable agreement. Intra-operative measurement of deficiency was also undertaken. Results: Intra-oberser agreement was rated as acceptable for the Paprosky, Gross and Gustillo systems, each giving substantial agreement, but was unacceptable for the AAOS and EndoKlinik systems. Inter-observer agreement was unacceptable for all systems except the Gross classification system, which was rated as having substantial agreement. Conclusion: Comparing results of femoral revision between different surgeons based on bone deficiency according to the most popular radiographic classification systems is doubtful because of poor reliability. These classifications can be used by an individual surgeon as a guide to management, but most classifications should not be used to recommend the type of femoral revision to other surgeons


Orthopaedic Proceedings
Vol. 93-B, Issue SUPP_II | Pages 192 - 192
1 May 2011
Kobayashi A Ishii Y Takeda M Noguchi H Higuchi H
Full Access

Introduction: Preoperative planning is an important part of the total knee arthroplasty(TKA) surgical procedure. In joint arthroplasty, the use of a templating system has been recommended and it is routinely used with most designs. The aim of this study was to compare the accuracy of preoperative templating in TKA between conventional two-dimensional (2D) and computed tomography (CT)-based 3D procedures in order to confirm the necessity of using 3D evaluations for preoperative planning. Method: One-hundred consecutive primary TKAs performed during the period between December 2005 and May 2009 were analyzed. The mean age of the patients was 73.3 years (range, 33 to 90 years). Preoperative templating was performed for each TKA using both conventional 2D radiographs (both anteroposterior and lateral views) which were analyzed by a single senior surgeon. Preoperative CT scans of the knee were performed and a CT-based 3D image model (superimposing the computer aided design model of the implant) was generated using KneeCAS (KneeCAS: Knee Computer-Assisted System) and then was analyzed by a radiology technologist without any knowledge of the 2D procedure. Based on the operation notes, we determined which size implant had been inserted at the time of surgery and used this as the gold standard. The accuracy and reliability were assessed for all measurements of the two different templating procedures (2D and CT-based 3D procedures). The Chi-square test for independence for paired observations was used to analyze the accuracy. The weighted kappa test was used to analyze reliability. Results: 56% of the 2D procedures were found to be an exact match. This increased to 98% for the template sizes within one size above or below that used and 2% were two sizes or more adrift. Otherwise, 59% of the CT-based 3D procedures were an exact match; 98% were within one size and 2% were two sizes or more adrift. The CT-based 3D procedure was slightly more accurate than the 2D procedure. However, the difference was not statistically significant (p = 0.67). The weighted kappa coefficient of the 2D procedure was 0.49 (which indicates a moderate agreement), while that of the CT-based 3D procedure was 0.49 (which indicates a moderate agreement). The results of the weighted kappa coefficients were not statistically significant (p = 0.65). Conclusion: Computer-assisted surgery systems are used often for preoperative planning in TKA. However, our results do not support the superiority of 3D preoperative templating to 2D conventional evaluation in predicting implant size. Thus, 3D templating may not be necessary for preoperative planning in TKA


Orthopaedic Proceedings
Vol. 102-B, Issue SUPP_7 | Pages 7 - 7
1 Jul 2020
Schaeffer E Teo T Cherukupalli A Cooper A Aroojis A Sankar W Upasani V Carsen S Mulpuri K Bone J Reilly CW
Full Access

The Gartland extension-type supracondylar humerus fracture is the most common elbow fracture in the paediatric population. Depending on fracture classification, treatment options range from nonoperative treatment such as taping, splinting or casting to operative treatments such as closed reduction and percutaneous pinning or open reduction. Classification variability between surgeons is a potential contributing factor to existing controversy over nonoperative versus operative treatment for Type II supracondylar fractures. The purpose of this study was to investigate levels of agreement in classification of extension-type supracondylar humerus fractures using the Gartland classification system. A retrospective chart review was conducted on patients aged 2–12 years who had sustained an extension-type supracondylar fracture and received either operative or nonoperative treatment at a tertiary children's hospital. De-identified baseline anteroposterior (AP) and lateral plain elbow radiographs were provided along with a brief summary of the modified Gartland classification system to surgeons across Canada, United States, Australia, United Kingdom and India. Each surgeon was blinded to patient treatment and asked to classify the fractures as Type I, IIA, IIB or III according to the classification system provided. A total of 21 paediatric orthopaedic surgeons completed one round of classification, of these, 15 completed a second round using the same radiographs in a reshuffled order. Kappa values using pre-determined weighted kappa coefficients were calculated to assess interobserver and intraobserver levels of agreement. In total, 60 sets of baseline elbow radiographs were provided to survey respondents. Interobserver agreement for classification based on the Gartland criteria between surgeons was a mean of 0.68, 95% CI [0.67, 0.69] (0.61–0.80 considered substantial agreement). Intraobserver agreement was a mean of 0.80 [0.75, 0.84]. (0.61–0.80 substantial agreement, 0.81–1 almost perfect agreement). Radiographic classification of extension-type supracondylar humerus fractures at baseline demonstrated substantial agreement both between and within surgeon raters. Levels of agreement are substantial enough to suggest that classification variability is not a major contributing factor to variability in treatment between surgeons for Type II supracondylar fractures. Further research is needed to compare patient outcomes between nonoperative and operative treatment for these fractures, so as to establish consensus and a standardized treatment protocol for optimal patient care across centres


Orthopaedic Proceedings
Vol. 101-B, Issue SUPP_9 | Pages 31 - 31
1 Sep 2019
Broekema A Molenberg R Kuijlen J Groen R Reneman M Soer R
Full Access

Introduction. The Odom's criteria are, since 1958, a widely used 4-point rating scale for assessing the clinical outcome after cervical spine surgery. Surprisingly, the Odom's criteria have never been validated. The aim of this study was to investigate the reliability and validity of the Odom's criteria for the evaluation of surgical procedures of the cervical spine. Methods. Patients with degenerative cervical spine disease were included and divided into two groups, based on their most predominant symptom: myelopathy or radiculopathy. Reliability was assessed with inter-rater and test-retest design using a quadratic weighted Kappa coefficient. Construct validity was assessed by means of hypothesis testing with related constructs. To evaluate if the Odom's criteria could act as a global perceived effect (GPE) scale, we assessed concurrent validity by comparing the areas under the curves (AUCs) of the receiver operating characteristic curves (ROCs) with both the Odom's criteria, as the GPE as an anchor. Results. A total of 110 patients were included in the study. Overall inter-rater reliability was k=0.77 and the test-retest reliability k=0.93. Inter-rater reliability for the radiculopathy patients was κ=0.81 and for myelopathy patients κ=0.68. More than 75% of the hypotheses were met. The AUCs showed similar characteristics between the Odom's criteria and GPE. Conclusion. The Odom's criteria meet the predefined criteria for reliability and validity. Therefore, the Odom's criteria may be used to measure surgical outcome after a cervical spine procedure. Results of previous studies that have been deemed less trustworthy, because of the use of the Odom's criteria, should be reconsidered. No conflicts of interests. No funding obtained


Introduction:. Inadequate reduction and fixation of ankle fractures leads to poor clinical outcomes although there are no well-established criteria to evaluate the quality of surgical fracture fixation of the ankle. The aim of our study was to validate Pettrone's criteria that can be used in the radiological assessment of the quality of ankle fracture fixation that predict the functional outcome. Methods:. A retrospective study was completed following the operative management of ankle fractures at a University teaching hospital between 1. st. January 2009 and 31. st. December 2009 were included in the study. Exclusion criteria were paediatric fractures, polytrauma, and fractures involving the tibial plafond. The fracture pattern was classified using the AO classification system. Three independent Foot and Ankle Consultants assessed the quality of surgical ankle fracture fixation using Pettrone's criteria. Approximately one year following the surgery, functional outcome was obtained using Lower Extremity Function Score (LEFS) and a modified American Orthopaedic Foot and Ankle Society score (AOFAS). The Mann-Whitney test was used for the LEFS and AOFAS functional scores. Logistic regression was performed upon age and gender with regards to functional outcome. Given that the Kappa coefficient is a pair wise statistic, the average pair wise agreement for each category of the Pettrone criteria was also determined. Results:. Sixty-one consecutive patients were included in the study with a mean age of 51 years (17–74 years) and a mean follow-up of 17.41 months (13–24 months). Using Pettrone's criterias, mean interobserver agreement was 90.0% (89.4–92.6%) with inadequate reduction in 20 cases (32.5%). Mean LEFS following inadequate reduction was 47.5 (1–79) and following satisfactory reduction was 55.9 (9–80) p=0.03. Conclusion:. Pettrone's criteria has high interobserver agreement for the quality of surgical fracture fixation of the ankle which correlates with functional outcome


Orthopaedic Proceedings
Vol. 95-B, Issue SUPP_11 | Pages 3 - 3
1 Feb 2013
Baker M Davis B Hutchinson R Sanghrajka A
Full Access

Aim. Our aim was to compare the Wilkins'-modified Gartland classification and AO classifications of supracondylar humeral fracture with respect to: -Inter-observer reliability; Association of fracture-grade with radiological quality of reduction; Association of fracture-grade with complications. Methods. The unit database was interrogated to identify all operated supracondylar fractures between 2007–2011. Radiographs from each case were evaluated by four observers (three consultants, one trainee) and classified according to Gartland and the AO system. Inter-observer reliability was calculated using Cohen's Kappa coefficient. Radiological quality of reduction was evaluated using a new scoring system, combining the anterior humeral line, the lateral capitello-humeral angle (LCHA) and Baumann's angle, (compared to reference values). Analysis of variance (ANOVA) was performed to determine whether there was a statistically significant difference in scores between the fracture grades. Case notes were reviewed for pre- and post-operative complication, and chi-squared test performed to compare the incidence between different fracture grades. Results. 83 cases were identified for fracture classification and 66 for assessment of reduction and complications. Inter-observer reliability was 0.58 for Gartland (moderate agreement), and 0.70 for the AO classification (good agreement). ANOVA demonstrated no significant difference in the overall reduction scores between the different grades of either classification. Mean scores for LCHA were lower for Gartland-2b (p=0.02) and AO-3 (p=0.007) groups (compared to Gartland-3 and AO-4). Complications were more frequent with the higher grades of fracture in both classifications (p<0.05). Conclusion. The AO classification has greater reliability and should be considered as an alternative to the Gartland classification. A higher grade of fracture is not associated with inferior radiographic outcomes. Surgeons should be aware that restoration of the LCHA is more difficult with the Gartland-2b and AO-3 fractures, probably because of the intact posterior cortex. A greater rate of complications should be expected with higher grades of fracture


Orthopaedic Proceedings
Vol. 93-B, Issue SUPP_IV | Pages 592 - 593
1 Nov 2011
Bailey CS Alsaleh K Ho D Rosas-Arellano P Bailey SI Gurr KR
Full Access

Purpose: Magnetic resonance imaging (MRI) and Computerized tomography (CT) are commonly used for the diagnosis and assessment of lumbar spinal stenosis. The available literature has not identified which modality is superior. We compared the reliability and accuracy of CT and MRI in the assessment of lumbar spinal stenosis. Method: We performed a prospective review of CT and MRI scans of 54 patients referred for surgical consultation. One orthopaedic spine fellow and one neuro-radiologist reviewed the CTs and MRIs. A qualitative and quantitative analysis was performed. Intra-observer and inter-observer reliability was determined using Kappa coefficient. The patient’s official reports were correlated with analysis performed by the two reviewers. Owsestry and SF-36 data was correlated with the qualitative and qualitative assessment of stenosis on CT, MRI using the Pearson’s R coefficient. Results: MRI – substantial inter-observer agreement was achieved between surgeon and neuro-radiologist as well as between surgeon and reporting radiologist (κ= 0.74 and κ=0.64 respectively). Moderate agreement was found between neuro-radiologist and reporting radiologist (κ=0.57). Almost perfect intra-observer reliability for MRI was achieved by the two expert reviewers (κ=0.91 for surgeon and κ=0.92 for neuro-radiologist). CT – moderate inter-observer agreement (κ=0.58) was found between surgeon and neuro-radiologist. Fair agreement was found between neuro-radiologist and reporting radiologist and between surgeon and reporting radiologist (κ=0.30 and 0.32 respectively). Substantial intra-observer agreement was found for the surgeon (κ=0.77) while the neuro-radiologist achieved almost perfect agreement (κ=0.96). Conclusion: This study directly demonstrates that MRI is likely a more reliable tool than CT, but neither correlates with functional status


Orthopaedic Proceedings
Vol. 94-B, Issue SUPP_XXV | Pages 107 - 107
1 Jun 2012
Kailathuvalapil JT Sasidharan B
Full Access

Introduction. Proper alignment of the components and soft tissue balance are the two factors that determine the long term outcome of total knee arthroplasty (TKA). On the femoral side a distal cut made perpendicular to the MA will restore the MA of the leg. Different methods are commonly used to resect the femur perpendicular to its MA. In uncomplicated cases, most surgeons routinely use a fixed valgus cut angle (VCA) of 5° or 6°. Various studies have questioned the use of fixed valgus angle resection to restore the mechanical axis. The purpose of this prospective study is to analyze the variability in the valgus angle following computer assisted TKA. Materials and methods. Twenty-three patients who underwent computer assisted TKA in our institution in 2009 were involved in the study. A total of 40 knees were available for analysis. All the knees underwent a CT scanogram postoperatively. Each scanogram was analyzed using the Amrita medvision(r) software. The angle subtended between the mechanical axis and the distal femoral anatomic axis is the valgus angle. Two independent observers calculated all the values and the interobserver reliability was calculated. Results. The average age of the patients was 65.6 years. The kappa coefficient of agreement was 0.8, which shows good interobserver reliability. The average angle formed by the femoral component with the mechanical axis was 91.6. 0. and the average valgus angle calculated was 7.41. 0. 14 knees out of 40 (35%) were lying within the range of 4 - 7 degrees. In 25 knees (65%) valgus angle was more than 7 degrees. In one case the valgus angle was less than 4 degrees. Conclusions. Fixed valgus angle resection is not reliable in restoring mechanical axis in total knee arthroplasty. In the absence of facilities for surgical navigation, a pre operative planning with long leg films is extremely important to achieve long-term success


Orthopaedic Proceedings
Vol. 92-B, Issue SUPP_IV | Pages 553 - 553
1 Oct 2010
Malek I Hyder N Machani B Mevcha A
Full Access

Introduction: Large numbers of studies have been conducted to help the decision making of appropriate management of an ankle fracture. Aim: To assess intra-observer and inter-observer agreement of treatment for ankle fracture based on plain radiographs. Materials and Methods: Fifty patients with ankle fracture were randomly selected. Antero-posterior and lateral view ankle radiographs were blinded by the first author and then reviewed by five orthopaedic surgeons with varying clinical experience. The observers were asked for their opinion about how they would like to treat the fracture? They were provided with additional basic information of patient age, sex, mechanism of injury and associated comorbidities. This exercise was repeated again after four weeks. The kappa coefficient and observed agreement values were used for statistical analysis. Results: The kappa values on both occasions were 0.41(SE: 0.084, 95% Confidence interval: 0.248–0.576, P< 0.00001) and 0.29(SE: 0.099, 95% confidence interval: 0.098 TO 0.487, P< 0.00001). These results show only fair inter-observer agreement. The kappa values for intra-observer agreement were from 0.34 to 0.69 (P< 0.001) for different observers. The observed agreement for these observers was from 70% to 94%. Only two most senior observers had good agreement. Conclusion: These results show that there is only fair inter-observer agreement of the treatment for ankle fracture based on plain radiographs and only senior clinicians were consistent about their preferred mode of treatment on both occasions. There is a need of clear guidelines and better understanding of biomechanics of Ankle Fractures to minimize the ‘grey’ zone of when to intervene