Aims. The extended wait that most patients are now experiencing for hip and knee arthroplasty has raised questions about whether reliance on waiting time as the primary driver for prioritization is ethical, and whether additional factors should be included in determining surgical priority. Our Prioritization of THose aWaiting hip and knee ArthroplastY (PATHWAY) project will explore which perioperative factors are important to consider when prioritizing those on the waiting list for hip and knee arthroplasty, and how these factors should be weighted. The final product will include a weighted benefit score that can be used to aid in surgical prioritization for those awaiting elective primary hip and knee arthroplasty. Methods. There will be two linked work packages focusing on opinion from key stakeholders (patients and surgeons). First, an online modified Delphi process will determine a consensus set of factors that should be involved in patient prioritization. This will be performed using standard Delphi
Aims. Total hip arthroplasty (THA) and total knee arthroplasty (TKA) are common orthopaedic procedures requiring postoperative radiographs to confirm implant positioning and identify complications. Artificial intelligence (AI)-based image analysis has the potential to automate this postoperative surveillance. The aim of this study was to prepare a scoping review to investigate how AI is being used in the analysis of radiographs following THA and TKA, and how accurate these tools are. Methods. The Embase, MEDLINE, and PubMed libraries were systematically searched to identify relevant articles. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for scoping reviews and Arksey and O’Malley framework were followed. Study quality was assessed using a modified Methodological Index for Non-Randomized Studies tool. AI performance was reported using either the area under the curve (AUC) or accuracy. Results. Of the 455 studies identified, only 12 were suitable for inclusion. Nine reported implant identification and three described predicting risk of implant failure. Of the 12, three studies compared AI performance with orthopaedic surgeons. AI-based implant identification achieved AUC 0.992 to 1, and most algorithms reported an accuracy > 90%, using 550 to 320,000 training radiographs. AI prediction of dislocation risk post-THA, determined after five-year follow-up, was satisfactory (AUC 76.67; 8,500 training radiographs). Diagnosis of hip implant loosening was good (accuracy 88.3%; 420 training radiographs) and measurement of postoperative acetabular angles was comparable to humans (mean absolute difference 1.35° to 1.39°). However, 11 of the 12 studies had several
Aims. The purpose of this study was to develop a convolutional neural network (CNN) for fracture detection, classification, and identification of greater tuberosity displacement ≥ 1 cm, neck-shaft angle (NSA) ≤ 100°, shaft translation, and articular fracture involvement, on plain radiographs. Methods. The CNN was trained and tested on radiographs sourced from 11 hospitals in Australia and externally validated on radiographs from the Netherlands. Each radiograph was paired with corresponding CT scans to serve as the reference standard based on dual independent evaluation by trained researchers and attending orthopaedic surgeons. Presence of a fracture, classification (non- to minimally displaced; two-part, multipart, and glenohumeral dislocation), and four characteristics were determined on 2D and 3D CT scans and subsequently allocated to each series of radiographs. Fracture characteristics included greater tuberosity displacement ≥ 1 cm, NSA ≤ 100°, shaft translation (0% to < 75%, 75% to 95%, > 95%), and the extent of articular involvement (0% to < 15%, 15% to 35%, or > 35%). Results. For detection and classification, the algorithm was trained on 1,709 radiographs (n = 803), tested on 567 radiographs (n = 244), and subsequently externally validated on 535 radiographs (n = 227). For characterization, healthy shoulders and glenohumeral dislocation were excluded. The overall accuracy for fracture detection was 94% (area under the receiver operating characteristic curve (AUC) = 0.98) and for classification 78% (AUC 0.68 to 0.93). Accuracy to detect greater tuberosity fracture displacement ≥ 1 cm was 35.0% (AUC 0.57). The CNN did not recognize NSAs ≤ 100° (AUC 0.42), nor fractures with ≥ 75% shaft translation (AUC 0.51 to 0.53), or with ≥ 15% articular involvement (AUC 0.48 to 0.49). For all objectives, the model’s performance on the external dataset showed similar accuracy levels. Conclusion. CNNs proficiently rule out proximal humerus fractures on plain radiographs. 
Despite rigorous training
Aims. The diagnosis of periprosthetic joint infection (PJI) continues to present a significant clinical challenge. New biomarkers have been proposed to support clinical decision-making; among them, synovial fluid alpha-defensin has gained interest. Current research
Aims. To identify unanswered questions about the prevention, diagnosis, treatment, and rehabilitation and delivery of care of first-time soft-tissue knee injuries (ligament injuries, patella dislocations, meniscal injuries, and articular cartilage) in children (aged 12 years and older) and adults. Methods. The James Lind Alliance (JLA)
There is increasing popularity in the use of artificial intelligence and machine-learning techniques to provide diagnostic and prognostic models for various aspects of Trauma & Orthopaedic surgery. However, correct interpretation of these models is difficult for those without specific knowledge of computing or health data science
The anterior cruciate ligament (ACL) is frequently injured in elite athletes, with females up to eight times more likely to suffer an ACL tear than males. Biomechanical and hormonal factors have been thoroughly investigated; however, there remain unknown factors that need investigation. The mechanism of injury differs between males and females, and anatomical differences contribute significantly to the increased risk in females. Hormonal factors, both endogenous and exogenous, play a role in ACL laxity and may modify the risk of injury. However, data are still limited, and research involving oral contraceptives is potentially associated with
Aims. Arthroplasty is being increasingly used for the management of distal humeral fractures (DHFs) in elderly patients. Arthroplasty options include total elbow arthroplasty (TEA) and hemiarthroplasty (HA); both have unique complications and there is not yet a consensus on which implant is superior. This systematic review asked: in patients aged over 65 years with unreconstructable DHFs, what differences are there in outcomes, as measured by patient-reported outcome measures (PROMs), range of motion (ROM), and complications, between distal humeral HA and TEA? Methods. A systematic review of the literature was performed via a search of MEDLINE and Embase. Two reviewers extracted data on PROMs, ROM, and complications. PROMs and ROM results were reported descriptively and a meta-analysis of complications was conducted. Quality of
Research into COVID-19 has been rapid in response to the dynamic global situation, which has resulted in heterogeneity of
Aims. Return to sport following total (TKA) and unicompartmental knee arthroplasty (UKA) has been researched with meta-analyses and systematic reviews of varying quality. The aim of this study is to create an umbrella review to consolidate the data into consensus guidelines for returning to sports following TKA and UKA. Methods. Systematic reviews and meta-analyses written between 2010 and 2020 were systematically searched. Studies were independently screened by two reviewers and
Aims. The tibial component of total knee arthroplasty can either be an all-polyethylene (AP) implant or a metal-backed (MB) implant. This study aims to compare the five-year functional outcomes of AP tibial components to MB components in patients aged over 70 years. Secondary aims are to compare quality of life, implant survivorship, and cost-effectiveness. Methods. A group of 130 patients who had received an AP tibial component were matched for demographic factors of age, BMI, American Society of Anesthesiologists (ASA) grade, sex, and preoperative Knee Society Score (KSS) to create a comparison group of 130 patients who received an MB tibial component. Functional outcome was assessed prospectively by KSS and quality of life by the 12-Item Short-Form Health Survey questionnaire (SF-12); range of motion (ROM) and implant survivorship were also compared. The SF six-dimension (6D) was used to calculate the incremental cost-effectiveness ratio (ICER) for AP compared to MB tibial components using quality-adjusted life year
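For readers unfamiliar with the ICER mentioned above, the following is a minimal sketch of the standard calculation (difference in cost divided by difference in quality-adjusted life years). The function name, argument names, and all figures are hypothetical and are not taken from the study.

```python
def icer(cost_new, cost_ref, qaly_new, qaly_ref):
    """Incremental cost per QALY of a new option versus a reference option."""
    delta_cost = cost_new - cost_ref
    delta_qaly = qaly_new - qaly_ref
    if delta_qaly == 0:
        # The ratio is undefined when both options yield the same QALYs.
        raise ValueError("QALY difference is zero; ICER is undefined")
    return delta_cost / delta_qaly


# Hypothetical illustration only: the new option costs 500 less and
# yields 0.5 fewer QALYs than the reference option.
example_icer = icer(cost_new=4500.0, cost_ref=5000.0, qaly_new=3.5, qaly_ref=4.0)
```

A positive ratio arising from lower cost and lower QALYs, as in this illustration, reads as the saving per QALY forgone, so the sign of each difference should always be reported alongside the ratio.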
Aims. The amount of glenoid bone loss is an important factor in deciding between soft-tissue and bony reconstruction when managing anterior shoulder instability. Accurate and reproducible measurement of glenoid bone loss is therefore vital in evaluation of shoulder instability and recommending specific treatment. The aim of this systematic review is to identify the range of methods and measurement techniques employed in clinical studies treating glenoid bone loss. Methods. A systematic review of the PubMed, MEDLINE, and Embase databases was undertaken to cover a ten-year period from February 2011 to February 2021. We identified clinical studies that incorporated bone loss assessment in the
Aims. The aim of this study was to identify the minimal clinically important difference (MCID), minimal important change (MIC), minimal detectable change (MDC), and patient-acceptable symptom state (PASS) in the Forgotten Joint Score (FJS) according to patient satisfaction six months following total hip arthroplasty (THA) in a UK population. Methods. During a one-year period, 461 patients with a mean age of 67.2 years (22 to 93) underwent a primary THA and completed preoperative and six-month FJS. At six months, patient satisfaction was recorded as very satisfied, satisfied, neutral, dissatisfied, or very dissatisfied. The difference between patients recording neutral (n = 31) and satisfied (n = 101) was used to define the MCID. MIC for a cohort was defined as the change in the FJS for those patients declaring their outcome as satisfied, whereas receiver operating characteristic curve analysis was used to determine the MIC for an individual and the PASS. Distribution-based
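The anchor-based definitions above can be sketched in code. This is an illustration only: it assumes the MCID is taken as the difference in mean FJS change between the satisfied and neutral groups, which the abstract does not state explicitly, and the function names and scores are hypothetical.

```python
from statistics import mean


def mcid_anchor(changes_by_group, lower="neutral", upper="satisfied"):
    """Difference in mean score change between two adjacent anchor groups."""
    return mean(changes_by_group[upper]) - mean(changes_by_group[lower])


def mic_cohort(changes_by_group, group="satisfied"):
    """Mean score change among patients who declared themselves satisfied."""
    return mean(changes_by_group[group])


# Hypothetical change scores (six-month FJS minus preoperative FJS).
changes = {
    "neutral": [10.0, 20.0, 30.0],
    "satisfied": [30.0, 40.0, 50.0],
}
mcid = mcid_anchor(changes)      # mean 40 minus mean 20
mic = mic_cohort(changes)        # mean of the satisfied group
```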
Aims. Sarcopenia is characterized by a generalized progressive loss of skeletal muscle mass, strength, and physical performance. This systematic review primarily evaluated the effects of sarcopenia on postoperative functional recovery and mortality in patients undergoing orthopaedic surgery, and secondarily assessed the methods used to diagnose and define sarcopenia in the orthopaedic literature. Methods. A systematic search was conducted in MEDLINE, EMBASE, and Google Scholar databases according to the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines. Studies involving sarcopenic patients who underwent defined orthopaedic surgery and recorded postoperative outcomes were included. The quality of the criteria by which a diagnosis of sarcopenia was made was evaluated. The quality of the publication was assessed using the Newcastle-Ottawa Scale. Results. A total of 365 studies were identified and screened, 26 full-texts were reviewed, and 19 studies were included in the review. A total of 3,009 patients were included, of whom 2,146 (71%) were female and 863 (29%) were male. The mean age of the patients was 75.1 years (SD 7.1). Five studies included patients who underwent spinal surgery, 13 included hip or knee surgery, and one involved patients who underwent fixation of a distal radial fracture. The mean follow-up was 1.9 years (SD 1.9; 5 days to 5.6 years). There was wide heterogeneity in the measurement tools which were used and the parameters for the diagnosis of sarcopenia in the studies. Sarcopenia was associated with at least one deleterious effect on surgical outcomes in all 19 studies. The postoperative rate of mortality was reported in 11 studies (57.9%) and sarcopenia was associated with poorer survival in 73% (8/11) of these. The outcome was most commonly assessed using the Barthel Index (4/19), and sarcopenic patients recorded lower scores in 75% (3/4) of these.
Sarcopenia was defined using the gold-standard three parameters (muscle strength, muscle quantity or quality, and muscle function) in four studies (21%), using two parameters in another four (21%) and one in the remaining 11 (58%). The
Aims. Our objective was to conduct a systematic review and meta-analysis, to establish whether differences arise in clinical outcomes between autologous and synthetic bone grafts in the operative management of tibial plateau fractures. Methods. A structured search of MEDLINE, EMBASE, the online archives of Bone & Joint Publishing, and CENTRAL databases from inception until 28 July 2021 was performed. Randomized, controlled, clinical trials that compared autologous and synthetic bone grafts in tibial plateau fractures were included. Preclinical studies, clinical studies in paediatric patients, pathological fractures, fracture nonunion, or chondral defects were excluded. Outcome data were assessed using the Risk of Bias 2 (ROB2) framework and synthesized in random-effect meta-analysis. The Preferred Reporting Items for Systematic Review and Meta-Analyses guidance was followed throughout. Results. Six studies involving 353 fractures were identified from 3,078 records. Following ROB2 assessment, five studies (representing 338 fractures) were appropriate for meta-analysis. Primary outcomes showed non-significant reductions in articular depression at immediate postoperative (mean difference -0.45 mm, p = 0.25, 95% confidence interval (CI) -1.21 to 0.31, I² = 0%) and long-term (> six months, standard mean difference -0.56, p = 0.09, 95% CI -1.20 to 0.08, I² = 73%) follow-up in synthetic bone grafts. Secondary outcomes included mechanical alignment, limb functionality, and defect site pain at long-term follow-up, perioperative blood loss, duration of surgery, occurrence of surgical site infections, and secondary surgery. Mean blood loss was lower (90.08 ml, p < 0.001, 95% CI 41.49 to 138.67) and surgery was shorter (16.17 minutes, p = 0.04, 95% CI 0.39 to 31.94) in synthetic treatment groups. All other secondary measures were statistically comparable. Conclusion. All studies reported similar
Objectives. Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the
Aims. The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid; which methods are applied for external validation (EV); and what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs? Methods. The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS). Results. Out of 1,349 studies, 36 reported development of a CNN for fracture detection and/or classification. Of these, only four (11%) reported a form of EV. One study used temporal EV, one conducted both temporal and geographical EV, and two used geographical EV. When comparing the CNN’s performance on the IV set versus the EV set, the following were found: AUCs of 0.967 (IV) versus 0.975 (EV), 0.976 (IV) versus 0.985 to 0.992 (EV), 0.93 to 0.96 (IV) versus 0.80 to 0.89 (EV), and F1-scores of 0.856 to 0.863 (IV) versus 0.757 to 0.840 (EV). Conclusion. The number of externally validated CNNs in orthopaedic trauma for fracture recognition is still scarce. This greatly limits the potential for transfer of these CNNs from the developing institute to another hospital to achieve similar diagnostic performance.
We recommend the use of geographical EV and statements such as the Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT-AI), the Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence (SPIRIT-AI) and the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis–Machine Learning (TRIPOD-ML) to critically appraise performance of CNNs and improve
Aims. There is concern that aggressive target pricing in the new Bundled Payment for Care Improvement Advanced (BPCI-A) penalizes high-performing groups that had achieved low costs through prior experience in bundled payments. We hypothesize that this
Aims. This systematic review asked which patterns of complications are associated with the three reverse total shoulder arthroplasty (RTSA) prosthetic designs, as classified by Routman et al, in patients undergoing RTSA for the management of cuff tear arthropathy, massive cuff tear, osteoarthritis, and rheumatoid arthritis. The three implant design philosophies investigated were medial glenoid/medial humerus (MGMH), medial glenoid/lateral humerus (MGLH), and lateral glenoid/medial humerus (LGMH). Methods. A systematic review of the literature was performed via a search of MEDLINE and Embase. Two reviewers extracted data on complication occurrence and patient-reported outcome measures (PROMs). Meta-analysis was conducted on the reported proportion of complications, weighted by sample size, and PROMs were pooled using the reported standardized mean difference (SMD). Quality of
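As an illustration of pooling complication proportions weighted by sample size, the sketch below weights each study's proportion by its number of patients. This is a simplified fixed-weight pooling and may differ from the model the review actually fitted; the function name and the study counts are hypothetical.

```python
def pooled_proportion(studies):
    """Sample-size-weighted mean of per-study proportions.

    `studies` is a list of (events, n) pairs; each study's proportion
    events / n is weighted by its sample size n.
    """
    total_n = sum(n for _, n in studies)
    if total_n == 0:
        raise ValueError("no patients in any study")
    weighted_sum = sum(n * (events / n) for events, n in studies if n > 0)
    return weighted_sum / total_n


# Hypothetical counts: 2 complications in 50 shoulders, 9 in 150 shoulders.
pooled = pooled_proportion([(2, 50), (9, 150)])
```

With these weights the result equals total events divided by total patients, so larger studies dominate the pooled estimate.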
Aims. The aim of this study was to identify the origin and development of the threshold for surgical intervention, highlight the consequences of residual displacement, and justify the importance of accurate measurement. Methods. A systematic review of three databases was performed to establish the origin and adaptations of the threshold, with papers screened and relevant citations reviewed. This search identified papers investigating functional outcome, including presence of arthritis, following injury. Orthopaedic textbooks were reviewed to ensure no earlier mention of the threshold was present. Results. Knirk and Jupiter (1986) were the first to quantify a threshold, with all their patients developing arthritis with > 2 mm displacement. Some papers have discussed using 1 mm, although 2 mm is most widely reported. Current guidance from the British Society for Surgery of the Hand and a Delphi panel support 2 mm as an appropriate value. Although this paper is still widely cited, the authors published a re-examination of the data showing