Acessibilidade / Reportar erro

Reproducibility of three classifications of proximal humeral fractures

Abstracts

OBJECTIVE: To propose a new system for classifying proximal humeral neck fractures, and to evaluate intra- and interobserver agreement using the Neer system that is the most commonly used in the area and the Arbeit Gemeinschaft für Osteosynthesefragen system created by an European group, and a new classification system proposed by the authors of this study. METHODS: A total of 56 patients with proximal humeral fractures were selected, and submitted to digitized simple radiography in antero-posterior shoulder and scapular profile. Radiographs were analyzed by three observers at time one, and then three and six weeks later. The kappa coefficient modified by Fleiss was used for the analysis. RESULTS: The mean intra-observer Kappa agreement index (k=0.687) of the new classification, was higher than both the Neer classification (k=0.362) and the Arbeit Gemeinschaft für Osteosynthesefragen (k=0.46). The mean interobserver Kappa agreement index (0.446) of the new classification, also had better results than both the Neer classification (k=0.063) and the Arbeit Gemeinschaft für Osteosynthesefragen (k=0.028). CONCLUSION: the new classification considering bone compression had higher results for intra- and interobserver compared to the Neer system, and the Arbeit Gemeinschaft für Osteosynthesefragen system.

Shoulder fractures; Observer variation


OBJETIVO: Propor um novo sistema para classificação das fraturas que envolvem o colo do úmero e avaliar a concordância intra e interobservador do sistema de classificação de Neer (a mais utilizada em nosso meio), do sistema proposto pelo grupo europeu Arbeit Gemeinschaft für Osteosynthesefragen e de um novo sistema de classificação proposto pelos autores. MÉTODOS: Foram selecionados 56 pacientes com fraturas da região proximal do úmero, com radiografias simples digitalizadas nas incidências anteroposterior do ombro e perfil da escápula. Quatro observadores avaliaram as imagens radiográficas em três momentos diferentes, com intervalos de 3 semanas. Os dados foram analisados segundo o coeficiente de concordância kappa, modificado por Fleiss. RESULTADOS: O índice de concordância kappa intraobservador médio da nova classificação (k=0,687) foi superior ao da classificação de Neer (k=0,362) e da Arbeit Gemeinschaft für Osteosynthesefragen (k=0,46). O índice de concordância kappa interobservador da nova classificação (k=0,446) também apresentou um valor médio maior que o da classificação de Neer (k=0,063) e da Arbeit Gemeinschaft für Osteosynthesefragen (k=0,028). CONCLUSÃO: A nova classificação, que considera o conceito de compressão óssea, apresentou melhores resultados inter e intraobservador, em relação às classificações de Neer e Arbeit Gemeinschaft für Osteosynthesefragen.

Fraturas do ombro; Variações dependentes do observador


ORIGINAL ARTICLE

IHospital Israelita Albert Einstein - HIAE, São Paulo (SP), Brazil

IIUniversidade Federal de São Paulo - UNIFESP, São Paulo (SP), Brazil

Corresponding author

ABSTRACT

OBJECTIVE: To propose a new system for classifying proximal humeral neck fractures, and to evaluate intra- and interobserver agreement using the Neer system that is the most commonly used in the area and the Arbeit Gemeinschaft für Osteosynthesefragen system created by an European group, and a new classification system proposed by the authors of this study.

METHODS: A total of 56 patients with proximal humeral fractures were selected, and submitted to digitized simple radiography in antero-posterior shoulder and scapular profile. Radiographs were analyzed by three observers at time one, and then three and six weeks later. The kappa coefficient modified by Fleiss was used for the analysis.

RESULTS: The mean intra-observer Kappa agreement index (k=0.687) of the new classification, was higher than both the Neer classification (k=0.362) and the Arbeit Gemeinschaft für Osteosynthesefragen (k=0.46). The mean interobserver Kappa agreement index (0.446) of the new classification, also had better results than both the Neer classification (k=0.063) and the Arbeit Gemeinschaft für Osteosynthesefragen (k=0.028).

CONCLUSION: the new classification considering bone compression had higher results for intra- and interobserver compared to the Neer system, and the Arbeit Gemeinschaft für Osteosynthesefragen system.

Keywords: Shoulder fractures/classification; Observer variation

INTRODUCTION

Proximal humerus fractures are the seventh most common fracture in frequency and correspond to approximately 80% of all humeral fractures(1). The incidence of fractures at this site may vary from 63 to 73% per 100,000 person-years, being more predominant among the elderly population. This type of fracture often occurs in approximately 75% of the population over 60 years old, and is more common among women. The proportion of women to men is three to one(1,2).

The classification system proposed by Neer in 1970 is widely used as a way to assess and determine guidelines to treat fractures of the proximal humerus(3). Recently, an European group(4) the Arbeit Gemeinschaft für Osteosynthesefragen (AO/ASIF)(4) proposed a system that also became acknowledged as a classification for these fractures.

Currently classification systems used for fractures of the proximal extremity of the humerus have low agreement and reproducibility in radiographies and also in computed tomography. Such difference towards classification' agreement could be due to multiple variables, for example, studies of low quality because of difficulties in placing patients with a fracture of proximal humerus, and also the surgeon's lack of experience to analyze these fractures(5-10).

In spite of improvements demonstrated by some authors in the agreement of intra- and interobserver results using more complex tests like simple tomography and tridimensional (3D) reconstructions, these results are inconclusive when the relationship between classifications and suggestions of treatment is evaluated towards physician's experience(7-9).

OBJECTIVE

This study aimed to evaluate intra- and interobserver agreement of classification systems proposed by Neer and by the AO/ASIF group, and to compare their results with our classification using radiographic exams of patients with fracture of the proximal extremity of the humerus.

METHODS

This study was carried out at the Orthopedic and Traumatology Department at Universidade Federal de São Paulo (UNIFESP), and was approved by the Ethical and Research Committee of the same institution (process #0234/08). From January 2002 to January 2008 antero-posterior and lateral shoulder radiographic views at the scapular plane of adult patients with isolated fracture or fracture-dislocation of proximal humerus were selected.

Radiographs were performed with the patient sitting down or standing upright. For the anteroposterior radiography the posterior face of the affected shoulder was placed close to the film and the patient's shoulder was externally rotated in about 40º. To perform the lateral radiographs, the anterior face of the affected shoulder was placed close to the film and to the patient's opposite shoulder also rotated in about 40º. The axillary view was not performed in all patients of the study mainly for the difficulty in placing them. Hence, because not all patients had this view, it was not included in this analysis.

Exclusion criteria: (1) radiographs of a pathologic fracture in the proximal humerus region, or any other abnormal change or tumor that could interfere in the joint normal anatomy; (2) skeletally immature patient' radiographs; (3) a previous fracture; (4) a tuberosity isolated fracture; (5) low quality X-ray exams.

To calculate the sample size the kappa index modified by Fleiss et al. was used as the main variable(11). Type I error was pre-stated as 5% (confidence of 95%) and type II error as 20% (test power of 80%). A population standard deviation of 0.40 of kappa value and a minimal difference to be detected of 0.30 of Kappa value were used. A total sample of 14 radiographs per observer was calculated after the definition of such values.

Simple shoulder radiographs in antero-posterior and lateral position of the scapula were digitized by a radiologist not involved in the study. To take the radiographs a Nikon® Coolpix 4500 model camera was used. Images were analyzed by four observers after digitalization. In the analysis they considered the Neer classification(3), the AO/ASIF European group system(4) and the new classification suggested in this study.

The observers were: a sixth year medical student (1), an orthopedist specialized in shoulder (2), a radiologist specialized in musculoskeletal diseases (3) and one orthopedist specialized in trauma(4).

Classifications were illustrated and explained on a pamphlet describing each classification (Figures 1 to 3 ). To each observer a ruler and goniometer were given to be used during the fracture' assessment. The participant' personal identifications in the radiographs were protected and then they were randomly numbered. Images were assessed and classified by all observers at time one (T1); then they were randomly numbered and the procedure redone after three (T2) and six (T3) weeks. The four observers assessed the same radiography on three different times (T1, T2, and T3).


According to the Neer classification(3), the fractures were divided into six groups (Figure 1):

Group I: minimally displaced fracture, with displacement of <1 cm or angle <45º;

Group II: fracture with displacement of the proximal humerus anatomical neck;

Group III: fracture with displacement of the proximal humerus surgical neck;

Group IV: fracture with displacement of the greater humerus tuberosity dividing into two parts with no surgical neck displacement; into three parts with surgical neck displacement; or into four parts with fracture and displacement of the lesser tuberosity;

Group V: fracture with displacement of the lesser tuberosity presenting the same features of group IV fractures regarding the subdivision into two, three or four parts;

Group VI: fractures associated with glenoumeral dislocation being also subdivided into two, three or four parts.

In the AO/ASIF system(4), fractures were classified in three types, structured in three groups and totalizing nine types of fractures (Figure 2).


In the classification system proposed by the present study, fractures were divided as follow: compression fractures and non-compression fractures (Figure 3 ).

Fractures were considered as compression fractures when a plastic or permanent deformity of the spongious metaphyseal bone occurred by compression or shearing among the fractured fragments no matter the number of fragments involved. Radiographic image of fractured fragments are usually not well identified or defined in compression fractures. Non-compression fractures are often identified by well-defined fragments on radiographic images. In this type of fracture no loss of bone tissue is seen due to compression mechanism or shearing among fragments.

Statistical analysis

The kappa coefficient agreement modified by Fleiss et al. was used for intra- and interobserver analyses(11).

The kappa coefficient provided an agreement proportion matched among observers. Values of kappa coefficient varied from -1 to +1. When the value was -1 it meant total disagreement, if +1 meant total agreement, and when zero meant disagreement. The kappa coefficient values may also be attributed arbitrarily to subdivisions. If values between 0.00 and 0.20 are found, they suggest unsatisfactory agreement; between 0.21 and 0.40 little agreement; between 0.41 and 0.60 moderate agreement; between 0.61 and 0.80 satisfactory and adequate agreement; values over 0.80 indicated almost perfect agreement(12-14).

A significance level of 5% was adopted, and there was rejection for a null coefficient for descriptive levels (p) <0.05. It is important to note that a coefficient different from zero does not indicated high agreement. Agreement percentage was also calculated among several measures that had easier interpretation. Calculations were done using the R statistical package.

RESULTS

A total of 174 patients with fracture of proximal extremity of the humerus were assisted from January 2002 to January 2008. From this total, only 71 patients met the inclusion criteria. Radiographs of the 71 patients were evaluated by an orthopedist and a radiologist not involved in the study, who selected 56 images in anteroposterior view of the shoulder and lateral of the scapula. The demographic data of patients are presented on table 1.

The highest kappa intra-observer agreement index obtained in the three moments (M1, M2 and M3) was seen for the classification proposed in this study (k=0.687) followed by the AO/ASIF classification (k=0.460), and by the Neer classification (k=0.362) (Table 2).

As for the kappa interobserver index the classification of this study had a high mean value in the three moments (k=0.446) as well, followed by the Neer classification (k=0.063), and by the AO/ASIF classification (k=0,028) (Table 3).

The less experienced observer (1) obtained the lowest values for the kappa intra-observers index in all classification systems. The orthopedist specialized in trauma had the highest values of the kappa index in the classification proposed in this study and in the AO/ ASIF. The orthopedist specialized in shoulder presented the highest kappa index for the Neer classification (Table 2).

DISCUSSION

Classification systems should provide tools to support clinical evaluation. A good classification system has to be valid, reliable and reproducible. However, to be ideal, a system must have a standard language for safety communication, create guidelines for treatments, and help to determine the disease prognosis. In addition, it should be an instrument to evaluate and compare results achieved by the treatment of similar diseases at several research centers registered in the literature at different times(15).

The concept that the main systems for the classification of proximal extremity humeral fractures had low agreement and are of little reproducibility was demonstrated by the results of this study. The classification models proposed by Neer and the AO/ ASIF group are widely accepted and used nowadays, however, there are many criticisms related to the difficulty to reproduce them(5,10,16,17).

In a literature review by Brorson and Hrobjartsson(17) evaluating the Neer classification, involving 11 studies, 88 observers and 468 fractures, the kappa interobserver agreement varied between 0.17 and 0.52, that is, unsatisfactory to moderate. This study evaluated a heterogeneous group of observers and had low agreement index in the Neer classification, that is, unsatisfactory.

Siebenrock e Gerber(16) evaluated 96 proximal humerus fractures using at least three radiographic views. Those radiographs were classified by five observers with interest and experience in shoulder surgery at two different moments (8 months difference between the first and the second analysis) using the Neer classification and the AO/ASIF. The interobserver agreement by the Neer classification was 0.40 and by the AO/ASIF 0.42 considering its nine groups. In addition, the mean intraobserver kappa coefficient agreement for both classifications was 0.60 and 0.68, respectively. The results of the present study as related to the AO/ASIF classification presented an unsatisfactory interobserver mean agreement index and a moderate intraobserver mean agreement index.

Nowadays it is discussed if the appraiser' experience can influence intra and interobservers agreements. Studies have been shown that less experienced observers achieve the lowest intraobserver agreement indexes than specialist physicians(18,19). On the other hand, other studies using a specific classification comparing a more experienced group with a less experienced one did not find significant differences in the interobservers agreement indexes(15).

So, it is believed that when an observer is comfortable using a specific classification the result is higher. However, some studies have been observing that if the same classification is used repeatedly in different times it does not change intra- and interobservers reproducibility(20). This study included a less experienced observer (a medical student). In spite of the fact that a student participation could be a complication factor for the data analysis, his participation was on purpose in order to assess the validity and reproducibility of the classifications used.

A randomized clinical trial(21) noted an improvement in the agreement using the Neer classification after training 14 observers, being the kappa index before the training 0.34 and after it 0.62, which was not seen in the control group. Although, a systematic review (16) done in 1993 did not show that more experienced observers had less disagreement than those with less experience.

Because classifications most used by investigators present low agreement and reproducibility to identify of proximal humerus fractures, it is proposed a new concept to classify these fractures in order to improve interpretations and investigate new treatment alternatives.

This study has a number of limitations such as: (1) this is a retrospective study in which bias rate may affect results; (2) only two radiographs were used to evaluate the fractures not being possible to examine the three perpendicular plane radiographies therefore it was not possible to precisely evaluate fracture dislocations; (3) this study did not include more complex and widely imaging exams used in clinical practice, as simple tomography or with reconstruction, causing bias and misunderstandings. This approach reflects the reality of some developing countries.

Bone compression determines the great difference between the two types of the proximal humerus extremity fractures (the so called compression and non-compression fractures) enabling to identify complex fractures that often present poor prognosis. The metaphyseal spongious bone which is compressed between the fractured shaft fragment and the humeral head, constitutes a barrier to reduce and maintain the reduction during the intra- and post-operative periods, being these fractures classified as complex or of poor prognosis. However, if this bone abnormal situation is considered at reduction or fixation time, it will provide a better understanding for the surgeon regarding the fracture type and its reduction, therefore, enabling a better prognosis(22).

The literature describes some unsatisfactory prognosis criteria for fractures of the proximal humerus, among them it is possible to highlight the metaphyseal comminution and the epiphyseal varus displacement, the elderly patients over 65 years old, the associated fractures, and the humeral head dislocation(23,24). It is believed that one of the most important factors to define the fracture complexity is the displacement with shearing or compression among fragments that create a loss of the spongious metaphyseal bone support. To consider this feature in fractures, no matter the number of involved fragments, is the concept that was applied to the classification proposed by this study.

CONCLUSION

The classification considering bone compression had better results in intra- and interobserver interpretation than the Neer and the AO/ASIF classifications.

Corresponding author:

Eduardo da Frota Carrerra

Avenida Albert Einstein, 627, Building A1, office #322

Zip code: 05652-900 - São Paulo (SP), Brazil

Phone: (55 11) 2151 2417

E-mail: e.carrera@me.com

Received on: Jun 29, 2012

Accepted on: Nov 5, 2012

Conflicts of interest: none.

Study carried out at Universidade Federal de São Paulo - UNIFESP, São Paulo (SP), Brazil.

  • 1. Court-Brown CM, Caesar B. Epidemiology of adult fractures: A review. Injury. 2006;37(8):691-7.
  • 2. Lind T, Kroner K, Jesen J. Epidemiology of fractures of the proximal humerus. Arch Orthop Trauma Surg. 1989;108(5):285-87.
  • 3. Neer CS 2nd. Displaced proximal humeral fractures. Part I. Classification and evaluation. J Bone Joint Surg Am. 1970;52(6):1077-89.
  • 4. Mueller ME, Nazarian S, Koch P, Schatzker J. The comprehensive classification of fractures of long bones. New York: Springer; 1990. p. 54-63.
  • 5. Brien H, Noftall F, MacMaster S, Cummings T, Landells C, Rockwood P. Neer's classification system: a critical appraisal. J Trauma. 1995;38(2):257-60.
  • 6. Sidor M, Zuckerman J, Lyon T, Koval K, Cuomo F, Schoenberg N. The Neer classification system for proximal humeral fractures. An assessment of interobserver reliability and intraobserver reproducibility. J Bone Joint Surg Am. 1993;75-A(12):1745-50.
  • 7. Bernstein J, Adler LM, Blank JE, Dalsey RM, Williams GR, Iannotti JP. Evaluation of the Neer system of classification of proximal humeral fractures with computerized tomographic scans and plain radiographs. J Bone Joint Surg Am. 1996;78(9):1371-5.
  • 8. Sjödén G, Movin T, Güntner P, Aspelin P, Ahrengart L, Ersmark H, Sperber A. Poor reproducibility of classification of proximal humeral fractures. Additional CT of minor value. Acta Orthop Scand. 1997;68(3):239-42.
  • 9. Sjödén GO, Movin T, Aspelin P, Güntner P, Shalabi A. 3D-radiographic analysis does not improve the Neer and AO classifications of proximal humeral fractures. Acta Orthop Scand. 1999;70(4):325-8.
  • 10. Sallay PI, Pedowitz RA, Mallon WJ, Vandemark RM, Dalton JD, Speer KP. Reliability and reproducibility of radiographic interpretation of proximal humeral fracture pathoanatomy. J Shoulder Elbow Surg. 1997;6(1):60-9.
  • 11. Fleiss JL, Slakter MJ, Fischman SL, ParK MH, Chinton NW. Inter-examiner reliability in caries trials. J Dent Res. 1979;58(2):604-9.
  • 12. Landis JR, Koch GC. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159-74.
  • 13. Martin JS, Marsh JL. Bonar SK, DeCoster TA, Found EM, Brandser EA. Assessment of the AO/ASIF fracture classification for distal tibia. J Orthop Trauma. 1997;11(7):477-83.
  • 14. Svanholm H, Starklint H, Gundersen HJ, Fabricius J, Barlebo H, Olsen S. Reproducibility of histomorphological diagnosis with special reference to the kappa statistic. APMIS. 1989;97(8):689-98.
  • 15. Martin JS, Marsh JL. Current classification of fractures. Rationale and utility. Radiol Clin North Am. 1997;35(3):491-506.
  • 16. Siebenrock KA, Gerber C. The reproducibility of classification of fractures of the proximal end of the humerus. J Bone Joint Surg Am. 1993;75(12):1751-5.
  • 17. Brorson S, Hróbjartsson A. Training improves agreement among doctors using the Neer system for proximal humeral fractures in a systematic review. J Clin Epidemiol. 2008;61(1):7-16.
  • 18. Kreder HJ, Hanel DP, Mckee M, Jupiter J, Mcgillivary G, Swiontkowski MF. Consistency of AO fractures classification for the distal radius. J Bone Joint Surg Br. 1996;78(5):726-31.
  • 19. Oliveira Filho OM, Belangero WD, Teles JB. Fraturas do rádio distal: avaliação das classificações. Rev Assoc Med Bras. 2004;50(1):55-61.
  • 20. Andersen DJ, Blair WR, Steyers CM, Adams BD, El-Khouri GY, Brandser EA. Classification of distal radius fractures: an analysis of interobserver reliability and intraobserver reproducibility. J Hand Surg. 1996;21(4):574-82.
  • 21. Brorson S, Bagger J, Sylvest A, Hrøbjartsson A. Improved interobserver variation after training of doctors in the Neer system. A randomised trial. J Bone Joint Surg Br. 2002;84(7):950-4.
  • 22. Russo R, Lombardi LV, Ciccarelli M, Giudice G, Cautiero F. A new osteosynthesis device for the treatment of proximal humerus fractures. Description of the technique and preliminary results. Chir Organi Mov. 2008;91(1):27-34.
  • 23. Soliman OA, Koptan WM. Four-part fracture dislocations of the proximal humerus in young adults: Results of fixation. Injury. 2012 Sep 28. pii: S00201383(12)00389-0. Ahead of print.
  • 24. Okike K, Lee OC, Makanji H, Harris MB, Vrahas MS. Factors associated with the decision for operative versus non-operative treatment of displaced proximalhumerus fractures in the elderly. Injury. 2012 Sep 27. pii: S00201383(12)00386-5. Ahead of print.
  • Reproducibility of three classifications of proximal humeral fractures

    Eduardo da Frota CarrerraI; Andre WajnsztejnII; Mario LenzaI; Nicola Archetti NettoII
  • Publication Dates

    • Publication in this collection
      22 Jan 2013
    • Date of issue
      Dec 2012

    History

    • Received
      29 June 2012
    • Accepted
      05 Nov 2012
    Instituto Israelita de Ensino e Pesquisa Albert Einstein Avenida Albert Einstein, 627/701 , 05651-901 São Paulo - SP, Tel.: (55 11) 2151 0904 - São Paulo - SP - Brazil
    E-mail: revista@einstein.br