Aims. The Wrightington classification system of fracture-dislocations of the elbow divides these injuries into six subtypes depending on the involvement of the coronoid and the radial head. The aim of this study was to assess the reliability and reproducibility of this classification system. Methods. This was a blinded study using radiographs and CT scans of 48 consecutive patients managed according to the Wrightington classification system between 2010 and 2018. Four trauma and orthopaedic consultants, two post CCT fellows, and one speciality registrar based in the UK classified the injuries. The seven observers reviewed preoperative radiographs and CT scans twice, with a minimum four-week interval. Radiographs and CT scans were reviewed separately. Inter- and intraobserver reliability were calculated using Fleiss and Cohen kappa coefficients. The Landis and Koch criteria were used to interpret the strength of the kappa values.
Reimers migration percentage (MP) is a key measure to inform decision-making around the management of hip displacement in cerebral palsy (CP). The aim of this study is to assess validity and inter- and intra-rater reliability of a novel method of measuring MP using a smart phone app (HipScreen (HS) app). A total of 20 pelvis radiographs (40 hips) were used to measure MP by using the HS app. Measurements were performed by five different members of the multidisciplinary team, with varying levels of expertise in MP measurement. The same measurements were repeated two weeks later. A senior orthopaedic surgeon measured the MP on picture archiving and communication system (PACS) as the gold standard and repeated the measurements using HS app. Pearson’s correlation coefficient (r) was used to compare PACS measurements and all HS app measurements and assess validity. Intraclass correlation coefficient (ICC) was used to assess intra- and inter-rater reliability.Aims
Methods
The Manchester-Oxford Foot Questionnaire (MOxFQ) is an anatomically specific patient-reported outcome measure (PROM) currently used to assess a wide variety of foot and ankle pathology. It consists of 16 items across three subscales measuring distinct but related traits: walking/standing ability, pain, and social interaction. It is the most used foot and ankle PROM in the UK. Initial MOxFQ validation involved analysis of 100 individuals undergoing hallux valgus surgery. This project aimed to establish whether an individual’s response to the MOxFQ varies with anatomical region of disease (measurement invariance), and to explore structural validity of the factor structure (subscale items) of the MOxFQ. This was a single-centre, prospective cohort study involving 6,637 patients (mean age 52 years (SD 17.79)) presenting with a wide range of foot and ankle pathologies between January 2013 and December 2021. To assess whether the MOxFQ responses vary by anatomical region of foot and ankle disease, we performed multigroup confirmatory factor analysis. To assess the structural validity of the subscale items, exploratory and confirmatory factor analyses were performed.Aims
Methods
We aimed to assess the reliability and validity of OpenPose, a posture estimation algorithm, for measurement of knee range of motion after total knee arthroplasty (TKA), in comparison to radiography and goniometry. In this prospective observational study, we analyzed 35 primary TKAs (24 patients) for knee osteoarthritis. We measured the knee angles in flexion and extension using OpenPose, radiography, and goniometry. We assessed the test-retest reliability of each method using intraclass correlation coefficient (1,1). We evaluated the ability to estimate other measurement values from the OpenPose value using linear regression analysis. We used intraclass correlation coefficients (2,1) and Bland–Altman analyses to evaluate the agreement and error between radiography and the other measurements.Aims
Methods
The purpose of this study was to assess the reliability and responsiveness to hip surgery of a four-point modified Care and Comfort Hypertonicity Questionnaire (mCCHQ) scoring tool in children with cerebral palsy (CP) in Gross Motor Function Classification System (GMFCS) levels IV and V. This was a population-based cohort study in children with CP from a national surveillance programme. Reliability was assessed from 20 caregivers who completed the mCCHQ questionnaire on two occasions three weeks apart. Test-retest reliability of the mCCHQ was calculated, and responsiveness before and after surgery for a displaced hip was evaluated in a cohort of children.Aims
Methods
We have previously developed a radiographic technique, the oblique posterior condylar view, for assessment of the posterior aspect of the femoral condyles after total knee arthroplasty. The purpose of this study was to confirm the validity of this radiographic view based upon intra-operative findings at revision total knee arthroplasty. Lateral and oblique posterior condylar views were performed for 11 knees prior to revision total knee arthroplasty, and radiolucent lines or osteolysis of the posterior aspect of the femoral condyles were identified. These findings were compared with the intra-operative appearance of the posterior aspects of the femoral condyles. Statistical analysis showed that sensitivity and efficacy were significantly better for the oblique posterior condylar than the lateral view. This method can, therefore, be considered as suitable for routine follow-up radiographs of the femoral component and in the pre-operative planning of revision surgery.
The objective of this study was to determine if a synthetic bone
substitute would provide results similar to bone from osteoporotic
femoral heads during Pushout studies were performed with the dynamic hip screw (DHS)
and the DHS Blade in both cadaveric femoral heads and artificial
bone substitutes in the form of polyurethane foam blocks of different
density. The pushout studies were performed as a means of comparing
the force displacement curves produced by each implant within each
material.Introduction
Methods
Patients undergoing limb reconstruction surgery often face a challenging and lengthy process to complete their treatment journey. The majority of existing outcome measures do not adequately capture the patient-reported outcomes relevant to this patient group in a single measure. Following a previous systematic review, the Stanmore Limb Reconstruction Score (SLRS) was designed with the intent to address this need for an effective instrument to measure patient-reported outcomes in limb reconstruction patients. We aim to assess the face validity of this score in a pilot study. The SLRS was designed following structured interviews with several groups including patients who have undergone limb reconstruction surgery, limb reconstruction surgeons, specialist nurses, and physiotherapists. This has subsequently undergone further adjustment for language and clarity. The score was then trialled on ten patients who had undergone limb reconstruction surgery, with subsequent structured questioning to understand the perceived suitability of the score.Aims
Methods
The patient-rated wrist evaluation (PRWE) and the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire are patient-reported outcome measures (PROMs) used for clinical and research purposes. Methodological high-quality clinimetric studies that determine the measurement properties of these PROMs when used in patients with a distal radial fracture are lacking. This study aimed to validate the PRWE and DASH in Dutch patients with a displaced distal radial fracture (DRF). The intraclass correlation coefficient (ICC) was used for test-retest reliability, between PROMs completed twice with a two-week interval at six to eight months after DRF. Internal consistency was determined using Cronbach’s α for the dimensions found in the factor analysis. The measurement error was expressed by the smallest detectable change (SDC). A semi-structured interview was conducted between eight and 12 weeks after DRF to assess the content validity.Objectives
Methods
Aims. This aim of this study was to assess the reliability and validity of the Unified Classification System (UCS) for postoperative periprosthetic femoral fractures (PFFs) around cemented polished taper-slip (PTS) stems. Methods. Radiographs of 71 patients with a PFF admitted consecutively at two centres between 25 February 2012 and 19 May 2020 were collated by an independent investigator. Six observers (three hip consultants and three trainees) were familiarized with the UCS. Each PFF was classified on two separate occasions, with a mean time between assessments of 22.7 days (16 to 29). Interobserver reliability for more than two observers was assessed using percentage agreement and Fleiss’ kappa statistic. Intraobserver reliability between two observers was calculated with Cohen kappa statistic.
The aim of this study was to validate the Mirels score in predicting
pathological fractures in metastatic disease of the lower limb. A total of 62 patients with confirmed metastatic disease met
the inclusion criteria. Of the 62 patients, 32 were female and 30
were male. The mean age of patients was 65 years (35 to 89). The
primary malignancy originated from the breast in 27 (44%) patients,
prostate in 15 (24%) patients, kidney in seven (11%), and lung in
four (6%) of patients. One patient (2%) had metastatic carcinoma
from the lacrimal gland, two patients (3%) had multiple myeloma,
one patient (2%) had lymphoma of bone, and five patients (8%) had
metastatic carcinoma of unknown primary. Plain radiographs at the
time of initial presentation were scored using Mirels system by
the four authors. The radiographic components of the score (anatomical
site, size, and radiographic appearance) were scored two weeks apart.
Inter- and intraobserver reliability were calculated with Fleiss’
kappa test. Bland-Altman plots were created to compare the variances
of the individual components of the score and the total Mirels score.Aims
Patients and Methods
The aim of this study was to assess the current evidence relating
to the benefits of virtual reality (VR) simulation in orthopaedic
surgical training, and to identify areas of future research. A literature search using the MEDLINE, Embase, and Google Scholar
databases was performed. The results’ titles, abstracts, and references
were examined for relevance.Aims
Materials and Methods
Aims. This study investigates the use of the metabolic equivalent of task (MET) score in a young hip arthroplasty population, and its ability to capture additional benefit beyond the ceiling effect of conventional patient-reported outcome measures. Methods. From our electronic database of 751 hip arthroplasty procedures, 221 patients were included. Patients were excluded if they had revision surgery, an alternative hip procedure, or incomplete data either preoperatively or at one-year follow-up. Included patients had a mean age of 59.4 years (SD 11.3) and 54.3% were male, incorporating 117 primary total hip and 104 hip resurfacing arthroplasty operations. Oxford Hip Score (OHS), EuroQol five-dimension questionnaire (EQ-5D), and the MET were recorded preoperatively and at one-year follow-up. The distribution was examined reporting the presence of ceiling and floor effects.
Aims. The aim of this study was to evaluate the reliability and validity of a patient-specific algorithm which we developed for predicting changes in sagittal pelvic tilt after total hip arthroplasty (THA). Methods. This retrospective study included 143 patients who underwent 171 THAs between April 2019 and October 2020 and had full-body lateral radiographs preoperatively and at one year postoperatively. We measured the pelvic incidence (PI), the sagittal vertical axis (SVA), pelvic tilt, sacral slope (SS), lumbar lordosis (LL), and thoracic kyphosis to classify patients into types A, B1, B2, B3, and C. The change of pelvic tilt was predicted according to the normal range of SVA (0 mm to 50 mm) for types A, B1, B2, and B3, and based on the absolute value of one-third of the PI-LL mismatch for type C patients. The reliability of the classification of the patients and the prediction of the change of pelvic tilt were assessed using kappa values and intraclass correlation coefficients (ICCs), respectively.
Aims. To develop a core outcome set of measurements from postoperative radiographs that can be used to assess technical skill in performing dynamic hip screw (DHS) and hemiarthroplasty, and to validate these against Van der Vleuten’s criteria for effective assessment. Methods. A Delphi exercise was undertaken at a regional major trauma centre to identify candidate measurement items. The feasibility of taking these measurements was tested by two of the authors (HKJ, GTRP).
The principles of evidence-based medicine (EBM) are the foundation of modern medical practice. Surgeons are familiar with the commonly used statistical techniques to test hypotheses, summarize findings, and provide answers within a specified range of probability. Based on this knowledge, they are able to critically evaluate research before deciding whether or not to adopt the findings into practice. Recently, there has been an increased use of artificial intelligence (AI) to analyze information and derive findings in orthopaedic research. These techniques use a set of statistical tools that are increasingly complex and may be unfamiliar to the orthopaedic surgeon. It is unclear if this shift towards less familiar techniques is widely accepted in the orthopaedic community. This study aimed to provide an exploration of understanding and acceptance of AI use in research among orthopaedic surgeons. Semi-structured in-depth interviews were carried out on a sample of 12 orthopaedic surgeons. Inductive thematic analysis was used to identify key themes.Aims
Methods
The metabolic equivalent of task (MET) score examines patient performance in relation to energy expenditure before and after knee arthroplasty. This study assesses its use in a knee arthroplasty population in comparison with the widely used Oxford Knee Score (OKS) and EuroQol five-dimension index (EQ-5D), which are reported to be limited by ceiling effects. A total of 116 patients with OKS, EQ-5D, and MET scores before, and at least six months following, unilateral primary knee arthroplasty were identified from a database. Procedures were performed by a single surgeon between 2014 and 2019 consecutively. Scores were analyzed for normality, skewness, kurtosis, and the presence of ceiling/floor effects. Concurrent validity between the MET score, OKS, and EQ-5D was assessed using Spearman’s rank.Aims
Methods
Children with spinal dysraphism can develop various musculoskeletal deformities, necessitating a range of orthopaedic interventions, causing significant morbidity, and making considerable demands on resources. This systematic review aimed to identify what outcome measures have been reported in the literature for children with spinal dysraphism who undergo orthopaedic interventions involving the lower limbs. A PROSPERO-registered systematic literature review was performed following PRISMA guidelines. All relevant studies published until January 2023 were identified. Individual outcomes and outcome measurement tools were extracted verbatim. The measurement tools were assessed for reliability and validity, and all outcomes were grouped according to the Outcome Measures Recommended for use in Randomized Clinical Trials (OMERACT) filters.Aims
Methods
Literature surrounding artificial intelligence (AI)-related applications for hip and knee arthroplasty has proliferated. However, meaningful advances that fundamentally transform the practice and delivery of joint arthroplasty are yet to be realized, despite the broad range of applications as we continue to search for meaningful and appropriate use of AI. AI literature in hip and knee arthroplasty between 2018 and 2021 regarding image-based analyses, value-based care, remote patient monitoring, and augmented reality was reviewed. Concerns surrounding meaningful use and appropriate methodological approaches of AI in joint arthroplasty research are summarized. Of the 233 AI-related orthopaedics articles published, 178 (76%) constituted original research, while the rest consisted of editorials or reviews. A total of 52% of original AI-related research concerns hip and knee arthroplasty (n = 92), and a narrative review is described. Three studies were externally validated. Pitfalls surrounding present-day research include conflating vernacular (“AI/machine learning”), repackaging limited registry data, prematurely releasing internally validated prediction models, appraising model architecture instead of inputted data, withholding code, and evaluating studies using antiquated regression-based guidelines. While AI has been applied to a variety of hip and knee arthroplasty applications with limited clinical impact, the future remains promising if the question is meaningful, the methodology is rigorous and transparent, the data are rich, and the model is externally validated. Simple checkpoints for meaningful AI adoption include ensuring applications focus on: administrative support over clinical evaluation and management; necessity of the advanced model; and the novelty of the question being answered. Cite this article:
Patients with femoral neck fractures (FNFs) treated with total hip arthroplasty (THA) have an almost ten-fold increased risk of dislocation compared to patients undergoing elective THA. The surgical approach influences the risk of dislocation. To date, the influence of differing head sizes and dual-mobility components (DMCs) on the risk of dislocation has not been well studied. In an observational cohort study on 8,031 FNF patients with THA between January 2005 and December 2014, Swedish Arthroplasty Register data were linked with the National Patient Register, recording the total dislocation rates at one year and revision rates at three years after surgery. The cumulative incidence of events was estimated using the Kaplan-Meier method. Cox multivariable regression models were fitted to calculate adjusted hazard ratios (HRs) with 95% confidence intervals (CIs) for the risk of dislocation, revision, or mortality, stratified by surgical approach.Aims
Methods
To estimate the measurement properties for the Oxford Knee Score (OKS) in patients undergoing revision knee arthroplasty (responsiveness, minimal detectable change (MDC-90), minimal important change (MIC), minimal important difference (MID), internal consistency, construct validity, and interpretability). Secondary data analysis was performed for 10,727 patients undergoing revision knee arthroplasty between 2013 to 2019 using a UK national patient-reported outcome measure (PROM) dataset. Outcome data were collected before revision and at six months postoperatively, using the OKS and EuroQol five-dimension score (EQ-5D). Measurement properties were assessed according to COnsensus-based Standards for the selection of health status Measurement Instruments (COSMIN) guidelines.Aims
Methods
This study aimed to determine the minimal detectable change (MDC), minimal clinically important difference (MCID), and substantial clinical benefit (SCB) under distribution- and anchor-based methods for the Mayo Elbow Performance Index (MEPI) and range of movement (ROM) after open elbow arthrolysis (OEA). We also assessed the proportion of patients who achieved MCID and SCB; and identified the factors associated with achieving MCID. A cohort of 265 patients treated by OEA were included. The MEPI and ROM were evaluated at baseline and at two-year follow-up. Distribution-based MDC was calculated with confidence intervals (CIs) reflecting 80% (MDC 80), 90% (MDC 90), and 95% (MDC 95) certainty, and MCID with changes from baseline to follow-up. Anchor-based MCID (anchored to somewhat satisfied) and SCB (very satisfied) were calculated using a five-level Likert satisfaction scale. Multivariate logistic regression of factors affecting MCID achievement was performed.Aims
Methods
To validate the Sydney Hamstring Origin Rupture Evaluation (SHORE), a hamstring-specific clinical assessment tool to evaluate patient outcomes following surgical treatment. A prospective study of 70 unilateral hamstring surgical repairs, with a mean age of 47.3 years (15 to 73). Patients completed the SHORE preoperatively and at six months post-surgery, and then completed both the SHORE and Perth Hamstring Assessment Tool (PHAT) at three years post-surgery. The SHORE questionnaire was validated through the evaluation of its psychometric properties, including; internal consistency, reproducibility, reliability, sensitivity to change, and ceiling effect. Construct validity was assessed using Pearson’s correlation analysis to examine the strength of association between the SHORE and the PHAT.Aims
Methods
The aim of this study was to assess the reproducibility and validity
of cross table radiographs for measuring the anteversion of the
acetabular component after total hip arthroplasty (THA) and to compare
it with measurements using CT scans. A total of 29 patients who underwent THA between June 2010 and
January 2016 were included. There were 17 men and 12 women. Their
mean age was 43 years (26 to 65). Seven patients underwent a bilateral
procedure. Thus, 36 THAs were included in the study. Lateral radiographs
and CT scans were obtained post-operatively and radiographs repeated
three weeks later. The anteversion of the acetabular component was
measured using the method described by Woo and Morrey and the ischiolateral
method described by Pulos et al and these were compared with the
results obtained from CT scans.Aims
Patients and Methods
The primary aim of this study was to define and quantify three
new measurements to indicate the position of the greater trochanter.
Secondary aims were to define ‘functional antetorsion’ as it relates
to abductor function in populations both with and without torsional
abnormality. Three new measurements, functional antetorsion, posterior tilt,
and posterior translation of the greater trochanter, were assessed
from 61 CT scans of cadaveric femurs, and their reliability determined.
These measurements and their relationships were also evaluated in
three groups of patients: a control group (n = 22), a ‘high-antetorsion’ group
(n = 22) and a ‘low-antetorsion’ group (n = 10).Aims
Patients and Methods
Outcome measures quantifying aspects of health in a precise,
efficient, and user-friendly manner are in demand. Computer adaptive
tests (CATs) may overcome the limitations of established fixed scales
and be more adept at measuring outcomes in trauma. The primary objective
of this review was to gain a comprehensive understanding of the
psychometric properties of CATs compared with fixed-length scales
in the assessment of outcome in patients who have suffered trauma
of the upper limb. Study designs, outcome measures and methodological
quality are defined, along with trends in investigation. A search of multiple electronic databases was undertaken on 1
January 2017 with terms related to “CATs”, “orthopaedics”, “trauma”,
and “anatomical regions”. Studies involving adults suffering trauma
to the upper limb, and undergoing any intervention, were eligible.
Those involving the measurement of outcome with any CATs were included.
Identification, screening, and eligibility were undertaken, followed
by the extraction of data and quality assessment using the Consensus-Based
Standards for the Selection of Health Measurement Instruments (COSMIN) criteria.
The review is reported according to the Preferred Reporting Items
for Systematic Reviews and Meta-Analyses (PRISMA) criteria and reg istered (PROSPERO: CRD42016053886).Aims
Materials and Methods
To explore whether orthopaedic surgeons have adopted the Proximal Fracture of the Humerus: Evaluation by Randomisation (PROFHER) trial results routinely into clinical practice. A questionnaire was piloted with six orthopaedic surgeons using a ‘think aloud’ process. The final questionnaire contained 29 items and was distributed online to surgeon members of the British Orthopaedic Association and British Elbow and Shoulder Society. Descriptive statistics summarised the sample characteristics and fracture treatment of respondents overall, and grouped them by whether they changed practice based on PROFHER trial findings. Free-text responses were analysed qualitatively for emerging themes using Framework Analysis principles.Objectives
Methods
Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality of these studies. Our aim was to systematically review the methodological quality of clinimetric studies that evaluated measurement properties of PROMs used in patients with distal radial fractures, and to make recommendations for the selection of PROMs based on the level of evidence of each individual measurement property. A systematic literature search was performed in PubMed, EMbase, CINAHL and PsycINFO databases to identify relevant clinimetric studies. Two reviewers independently assessed the methodological quality of the studies on measurement properties, using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Level of evidence (strong / moderate / limited / lacking) for each measurement property per PROM was determined by combining the methodological quality and the results of the different clinimetric studies.Objectives
Methods
The aim of this study was to validate the use of three models of fracture fixation in the assessment of technical skills. We recruited 21 subjects (six experts, seven intermediates, and eight novices) to perform three procedures: application of a dynamic compression plate on a cadaver porcine model, insertion of an unreamed tibial intramedullary nail, and application of a forearm external fixator, both on synthetic bone models. The primary outcome measures were the Objective Structural Assessment of technical skills global rating scale on video recordings of the procedures which were scored by two independent expert observers, and the hand movements of the surgeons which were analysed using the Imperial College Surgical Assessment Device. The video scores were significantly different for the three groups in all three procedures (p <
0.05), with excellent inter-rater reliability (α = 0.88). The novice and intermediate groups specifically were significantly different in their performance with dynamic compression plate and intramedullary nails (p <
0.05). Movement analysis distinguished between the three groups in the dynamic compression plate model, but a ceiling effect was demonstrated in the intramedullary nail and external fixator procedures, where intermediates and experts performed to comparable standards (p >
0.6). A total of 85% (18 of 21) of the subjects found the dynamic compression model and 57% (12 of 21) found all the models acceptable tools of assessment. This study has validated a low-cost, high-fidelity porcine dynamic compression plate model using video rating scores for skills assessment and movement analysis. It has also demonstrated that Synbone models for the application of and intramedullary nail and an external fixator are less sensitive and should be improved for further assessment of surgical skills in trauma. The availability of valid objective tools of assessment of surgical skills allows further studies into improving methods of training.
Epidemiological studies enhance clinical practice
in a number of ways. However, there are many methodological difficulties
that need to be addressed in designing a study aimed at the collection
and analysis of data concerning fractures and other injuries. Most
can be managed and errors minimised if careful attention is given
to the design and implementation of the research. Cite this article:
This study validates the short-form WOMAC function scale for assessment of conservative treatment of osteoarthritis of the knee. Data were collected before treatment and six and nine months later, from 100 patients with osteoarthritis of the knee to determine the validity, internal consistency, test-retest reliability, floor and ceiling effects, and responsiveness of the short-form WOMAC function scale. The scale showed high correlation with the traditional WOMAC and other measures. The internal consistency was good (Cronbach α: 0.88 to 0.95) and an excellent test-retest reliability was found (Lin’s concordance correlation coefficient (ρc): 0.85 to 0.94). The responsiveness was adequate and comparable to that of the traditional WOMAC (standardised response mean 0.56 to 0.44 and effect size 0.64 to 0.57) and appeared not to be significantly affected by floor or ceiling effects (0% and 7%, respectively). The short-form WOMAC function scale is a valid, reliable and responsive alternative to the traditional WOMAC in the evaluation of patients with osteoarthritis of the knee managed conservatively. It is simple to use in daily practice and is therefore less of a burden for patients in clinical trials.
A variety of radiological methods of measuring
version of the acetabular component after total hip replacement (THR)
have been described. The aim of this study was to evaluate the reliability
and validity of six methods (those of Lewinnek; Widmer; Hassan et
al; Ackland, Bourne and Uhthoff; Liaw et al; and Woo and Morrey)
that are currently in use. In 36 consecutive patients who underwent
THR, version of the acetabular component was measured by three independent
examiners on plain radiographs using these six methods and compared
with measurements using CT scans. The intra- and interobserver reliabilities
of each measurement were estimated. All measurements on both radiographs
and CT scans had excellent intra- and interobserver reliability
and the results from each of the six methods correlated well with
the CT measurements. However, measurements made using the methods
of Widmer and of Ackland, Bourne and Uhthoff were significantly
different from the CT measurements (both p <
0.001), whereas
measurements made using the remaining four methods were similar
to the CT measurements. With regard to reliability and convergent
validity, we recommend the use of the methods described by Lewinnek,
Hassan et al, Liaw et al and Woo and Morrey for measurement of version
of the acetabular component.
The Vancouver classification has been shown by its developers to be a valid and reliable method for categorising the configuration of periprosthetic proximal femoral fractures and for planning their management. We have re-validated this classification system independently using the radiographs of 30 patients with periprosthetic fractures. These were reviewed by six experienced consultant orthopaedic surgeons, six trainee surgeons and six medical students in order to assess intra- and interobserver reliability and reproducibility. Each observer read the radiographs on two separate occasions. The results were subjected to weighted kappa statistical analysis. The respective kappa values for interobserver agreement were 0.72 and 0.74 for consultants, 0.68 and 0.70 for trainees on the first and second readings of the radiographs and 0.61 for medical students. The intra-observer agreement for the consultants was 0.64 and 0.67, for the trainees 0.61 and 0.64, and for the medical students 0.59 and 0.60 for the first and second readings, respectively. The validity of the classification was studied by comparing the pre-operative radiological findings within B subgroups with the operative findings. This revealed agreement for 77% of these type-B fractures, with a kappa value of 0.67. Our data confirm the reliability and reproducibility of this classification system in a European setting and for inexperienced staff. This is a reliable system which can be used by non-experts, between centres and across continents.
We have developed an illustrated questionnaire, the Hand20, comprising 20 short and easy-to-understand questions to assess disorders of the upper limb. We have examined the usefulness of this questionnaire by comparing reliability, validity, responsiveness and the level of missing data with those of the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire. A series of 431 patients with disorders of the upper limb completed the Hand20 and the Japanese version of the DASH (DASH-JSSH) questionnaire. The norms for Hand20 scores were determined in another cross-sectional study. Most patients had no difficulty in completing the Hand20 questionnaire, whereas the DASH-JSSH had a significantly higher rate of missing data. The standard score for the Hand20 was smaller than the reported norms for the DASH. Our study showed that the Hand20 questionnaire provided validation comparable with that of the DASH-JSSH. Explanatory illustrations and short questions which were easy-to-understand led to better rates of response and fewer missing data, even in elderly individuals with cognitive deterioration.
We developed the Oxford ankle foot questionnaire to assess the disability associated with foot and ankle problems in children aged from five to 16 years. A survey of 158 children and their parents was carried out to determine the content, scaling, reliability and validity of the instrument. Scores from the questionnaire can be calculated to measure the effect of foot or ankle problems on three domains of children’s lives: physical, school and play, and emotional. Scores for each domain were shown to be internally consistent, stable, and to vary little whether reported by child or parent. Satisfactory face, content and construct validity were demonstrated. The questionnaire is appropriate for children with a range of conditions and can provide clinically useful information to supplement other assessment methods. We are currently carrying out further work to assess the responsiveness of questionnaire scores to change over time and with treatment.
We developed a questionnaire to assess patient-reported outcome after surgery of the elbow from interviews with patients. Initially, 17 possible items with five response options were included. A prospective study of 104 patients (107 elbow operations) was carried out to analyse the underlying factor structure, dimensionality, internal and test-retest reliability, construct validity and responsiveness of the questionnaire items. This was compared with the Mayo Elbow performance score clinical scale, the Disabilities of the Arm, Shoulder and Hand questionnaire, and the Short-Form (SF-36) General Health Survey. In total, five questions were considered inappropriate, which resulted in the final 12-item questionnaire, which has been referred to as the Oxford elbow score. This comprises three unidimensional domains, ‘elbow function’, ‘pain’ and ‘social-psychological’; with each domain comprising four items with good measurement properties. This new 12-item Oxford elbow score is a valid measure of the outcome of surgery of the elbow.