Original Investigation

Diaphragm Thickness Measurement in Computed Tomography: Intra- and Inter-Observer Agreement

10.4274/imj.galenos.2018.65471

  • Furkan Ufuk
  • Pınar Çakmak
  • Ergin Sağtaş
  • Duygu Herek
  • Muhammet Arslan
  • Ahmet Baki Yağcı

Received Date: 03.08.2018 Accepted Date: 03.09.2018 İstanbul Med J 2019;20(2):101-106

Introduction:

Diaphragm thickness (DT) measurement in computed tomography (CT) has become popular in recent years. Our aim was to assess the intra- and inter-observer agreement of DT measurement and to investigate the best points for DT assessment in CT based on the most reliable measurement.

Methods:

Thoraco-abdominal CT angiography scans of 44 patients (23 males, mean age: 49.9±17.8 years) were retrospectively evaluated. All of the CT images were evaluated independently by four radiologists. Each observer evaluated images twice to assess intra-observer reproducibility. On both axial and coronal reconstructed CT images, the crura of 88 hemidiaphragms were measured at five different points. A p value less than 0.05 was considered to indicate statistical significance. Intra- and inter-observer agreement was evaluated by using intraclass correlation coefficient (ICC) scores with a 95% confidence interval.

Results:

In intra-observer analysis, ICC scores demonstrated substantial to almost perfect agreement for all measurements (ICC score range: 0.758-1). When we analyzed the inter-observer agreement, there was a moderate to almost perfect agreement for each measurement point (ICC score range: 0.523-0.895). All axial measurements showed the highest inter-observer agreement (ICC scores were 0.926 and 0.886 for first and second measurements, respectively). The maximum inter- and intra-observer agreement for single point DT measurements were found in maximum DT at the level of the origin of the celiac artery and in anterior DT at the level of the upper part of the L1 vertebral body.

Conclusion:

CT is a reliable tool DT measurement with excellent intra- and inter-observer agreement.

Keywords: Computed tomography, diaphragm thickness, measurement, reliability

Introduction

The diaphragm is the major inspiratory muscle with a dome-like shape and measurement of diaphragm thickness (DT) in various disorders has become popular in recent years. Decreased DT on ultrasound was shown to correlate with reduced myocyte cross-sectional area in a porcine model (1). Measurement of DT in computed tomography (CT) has been shown to be successful in the assessment of the success of diaphragm pacing system in amyotrophic lateral sclerosis (ALS) patients (2), detection of diaphragmatic thinning due to mechanical ventilator therapy and sepsis (3,4), and the diagnosis of unilateral diaphragm paralysis (5). In previous studies (2-5), DT measurement in CT was performed by single observers. The main limitation of CT in DT measurement is the absence of a consensus on reliable measurement locations (points). This may lead to variability in the performance of CT in DT measurements, which may weaken the reliability of CT measurements. To the best of our knowledge, no study has investigated the best points for DT measurement and agreement between observers.

Our aim was to assess the intra- and inter-observer agreement of DT measurement and to investigate the best points for DT assessment in CT based on the most reliable measurement.


Methods

This retrospective study started after our institutional Pamukkale University Local Ethics Committee approved letter of application (decision no: 60116787-20/85543). Informed consent was not obtained from the patients because of retrospective nature of the study.


Patients

Patients who underwent thoraco-abdominal CT angiography for suspected acute aortic syndrome between January 2016 and July 2017 were included the study. The exclusion criteria were as follows: pulmonary pathology that can affect DT measurement (such as lower lob atelectasis, mass or pleural effusion), focal or diffuse diaphragmatic crus defect and CT examinations with inadequate diagnostic quality due to motion artifacts. A total of 44 patients (23 males, mean age: 49.9+17.8 years, range: 18-85 years) were included in the study.


Computed Tomography Scanning Protocol

All thoraco-abdominal CT angiography scans were obtained in the supine position with maximum inspiration using a multi-detector CT scanner (Brilliance 16; Philips Healthcare, Best, Netherlands). The area between the thoracic inlet and the deep costophrenic sulcus was scanned. The scanning parameters were as follows: tube voltage, 120 kV; tube current, 100 mAs; collimation, 16x0.75 mm; field of view, 300 mm; matrix, 512x512; rotation time, 0.75 s; table speed, 15 mm/s and beam pitch, 0.94. All CT images were reconstructed in transverse and coronal planes at 1.5 mm slice thickness.


Radiological Evaluation

All CT images were evaluated independently by four radiologists with various experience (3, 9, 12 and 20 years) on the same workstation (Extended Brilliance Workspace, Philips Healthcare). Observers were blinded to measurements of each other. Each observer evaluated the images twice to assess intra-observer reproducibility. The second assessment was done at least three months after the first to prevent recall.

Five measurements were performed on both axial and coronal images for each hemidiaphragm (88 hemidiaphragms) thickness. CT images were evaluated in the mediastinal window (window center, 90 HU; window width, 350 HU) and magnification was freely modifiable. Measurements were performed as described by Sukkasem et al. (5). On both axial and coronal reconstructed CT images, the crura of the hemidiaphragms were measured at the level of the origin of the celiac artery, and minimum and maximum DT were recorded. In addition, measurements were recorded for each crura at the level of the upper part of the L1 vertebral body along the anterior, middle and posterior aspects of the vertebral body on axial images and upper, middle and lower aspects of the vertebral body on coronal images (Figure 1a ,1b, 1c, 1d, 1e).

 


Statistical Analysis

Statistical analysis was performed using Statistical Package for the Social Sciences (SPSS) for Windows (Version 24.0, SPSS Inc. IBM Corp, Chicago, IL). The distribution of normality for each continuous variable group was calculated by Kolmogorov-Smirnov Z-test. Student’s t-test was used for comparisons of paired samples. A p value less than 0.05 was considered to indicate statistical significance. Intra- and inter-observer agreement was evaluated by using intraclass correlation coefficient (ICC) scores with a 95% confidence interval (CI). Intra- and inter-observer agreement was categorized as follows: 0.01-0.20 as poor, 0.21-0.40 as fair, 0.41-0.60 as moderate, 0.61-0.80 as substantial and 0.81-1.00 as almost perfect.


Results

In all groups, DT values showed normal distribution. The results of DT measurements of 44 patients and intra-observer agreement scores for each measurement are shown in Table 1. The inter-observer agreement for the first and second measurements is given in Table 2. There was a statistically significant difference between the measurements of observer 2 and observer 4 when all DT measurements were compared in the axial and coronal plane (p=0.045) (Table 3).

In intra-observer analysis, ICC scores demonstrated substantial to almost perfect agreement for all measurements (ICC scores range=0.758-1) (Table 1). When we analyzed the inter-observer agreement, there was a moderate to almost perfect agreement for each measurement points (ICC score range=0.523-0.895). All axial measurement values showed the highest inter-observer agreement (ICC scores were 0.926 and 0.886 for first and second measurements, respectively). There was a substantial agreement between observers for all coronal measurement values with ICC scores of 0.790 (95% CI, 0.684 - 0.870) and 0.776 (95% CI, 0.662 - 0.838) for the first and second measurements, respectively (Table 2).

The maximum inter- and intra-observer agreement for single point DT measurements were found in maximum DT at the level of the origin of the celiac artery and in anterior DT at the level of the upper part of the L1 vertebral body. In the coronal plane, the intra- and inter-observer agreement for DT measurements varied widely for each measurement point (Tables 1 and 2).


Discussion

This study showed that the assessment of DT by CT has high intra and inter-observer reliability. All DT measurements showed the highest inter-observer agreement at five points (maximum, minimum thickness at the level of celiac artery origin, anterior, middle and posterior thickness at the level of L1 vertebral body) on axial CT images. There was substantial agreement between observers for all measurements on coronal CT images.

The maximum inter- and intra-observer agreement for single point DT measurements were found in maximum DT at the level of the origin of the celiac artery and in anterior DT at the level of the upper part of the L1 vertebral body. In the coronal plane, the intra- and intra-observer agreement for DT measurements varied widely for each measurement point. Therefore, we suggest that if DT will be evaluated in the coronal plane, it would be appropriate to evaluate by measuring from all described 5 points.

Despite the increased use of CT for diaphragm evaluation, there is a lack of reference locations for DT measurement in CT. To the best of our knowledge, this is the first study that assessed the intra- and inter-observer agreement for DT measurement in CT and that investigated the best points for measurement in CT. The inter-observer variability in our study varied from moderate to almost perfect according to the measurement points and image plane. Therefore, we can say that the image plane (axial or coronal) and location is very important for DT measurement in CT. In our study, DT measurements were performed from the upper part L1 vertebra corpus level due to blurring of the diaphragm crus by psoas muscle in the middle and lower parts of the L1 vertebra corpus level. Unlike Sukkasem et al. (5), we did not measure DT at the level of Smooth Muscle Actin (SMA) origin, due to blurring of the diaphragm crus by psoas muscle at SMA origin level in many patients.

In mechanically ventilated patients and patients with paralyzed hemidiaphragm, diaphragmatic dysfunction is an important cause of complications, such as pulmonary atelectasis, infection and hypoxemia. These complications have been shown to correlate with mortality, poor prognosis and significantly higher health costs (6-9). In addition, weakness of the diaphragm is a major cause of difficult weaning from mechanical ventilation (10). Therefore, evaluation of DT and functions is very important. It has been shown that ultrasound (US) measurements of DT have a high degree of reproducibility and decreased DT on ultrasound has been shown to correlate with reduced myocyte volume (1,9-14). Experience is important when performing DT measurement by US. For an inexperienced observer, it may be difficult to accurately measure the DT by UA in incompatible patients (14). Compared to US, CT better depicts the anatomy of the diaphragm and also demonstrates associated lung and mediastinal pathologies (15-18). However, CT is not a suitable method for only assessing DT due to radiation exposure. Therefore, it is more appropriate to assess DT on CT images which were obtained to evaluate lung or mediastinal pathology.

There are only a few studies on DT evaluation in CT (2-5). Sanli et al. (2) found that DT is an important feature in patients with ALS in whom diaphragm pacing system implantation is planned. In their study, a single observer measured DT at the level of thoracic 11-12 vertebra corpus on coronal images. The location specified for DT measurement is not a specific point and represents a very large area for DT measurement. In addition, they did not specify the CT slice thickness used in the study (2). Lee et al. (3) demonstrated that DT decreased in patients who underwent mechanical-ventilation therapy in CT. In that study, a radiologist measured the DT at the level of the celiac artery by using software on axial and coronal images. However, the number of patients included in the study was thirteen, which is quite small. Also, they did not specify the thickness of the CT slice used in the study (3). Jung et al. (4) found that diaphragm volume was decreased in patients with sepsis in CT. In their study, the diaphragm volumes were calculated semi-automatically by a single radiologist.

However, in this study, it was not specified in which phase (inspiration-expiration) the CT images were obtained (4). Sukkasem et al. (5) found a high sensitivity and specificity in the differentiation of paralyzed and non-paralyzed hemidiaphragms when they assumed 2.5 mm as the threshold value of a minimum DT on axial images at the level of celiac artery in CT. In their study, all measurements were performed by one observer and the slice thickness of the CT was variable (range: 0.625 to 5 mm) (5). We used a standard 1.5 mm slice thickness in our study. Our results demonstrated a better inter-observer agreement for axial CT images than coronal CT images. This may be due to relatively thick CT slices or due to the natural shape of the diaphragm (the shape of the dome).

Our study also has some limitations. The relatively small sample size is the first one. However, this study was not intended to determine DT values in the population. The aim of this study was to determine the best DT measurement points in CT and the reliability of DT measurement in CT. The retrospective design of the study is the second one. However, these are acceptable limitations due to the lack of similar CT studies.


Conclusion

CT is a reliable tool for DT measurement with excellent intra and inter-observer reliability. This is the first study that reveals the best and reproducible measurement points and plane for DT measurements in CT. All DT measurement values at five identified points on axial CT images showed the highest inter-observer reliability. The most reliable single measurement point in the axial plane was maximum DT at the level of the celiac artery or anterior DT at the level of the upper part of the L1 vertebral body.


Ethics Committee Approval: This retrospective study started after our institutional Pamukkale University Local Ethics Committee approved letter of application (decision no: 60116787-20/85543).

Informed Consent: Informed consent was not obtained from the patients because of retrospective nature of the study.

Peer-review: Externally peer-reviewed.

Author Contributions: Concept - F.U., P.Ç.; Design - F.U., P.Ç., D.H., E.S.; Supervision - F.U., A.B.Y.; Data Collection and/or Processing - F.U., P.Ç., D.H.; Analysis and/or Interpretation - F.U., D.H., A.B.Y.; Literature Search - F.U., M.A., E.S.; Writing Manuscript – F.U., D.H., M.A.; Critical Review - D.H., A.B.Y.

Conflict of Interest: Authors have no conflicts of interest to declare

Financial Disclosure: The authors declared that this study has received no financial support.

1. Reynolds SC, Meyyappan R, Thakkar V, Tran BD, Nolette MA, Sadarangani G, et al. Mitigation of ventilator-induced diaphragm atrophy by transvenous phrenic nerve stimulation. Am J Respir Crit Care Med 2017; 195: 339-48.
2. Sanli A, Sengun IS, Tertemiz KC, Alpaydin AO, Karacam V, Sanli BA, et al. Importance of diaphragm thickness in amyotrophic lateral sclerosis patients with diaphragm pacing system implantation. Surg Endosc 2016; 30: 154-8.
3. Lee GD, Kim HC, Yoo JW, Lee SJ, Cho YJ, Bae K, et al. Computed tomography confirms a reduction in diaphragm thickness in mechanically ventilated patients. J Crit Care 2016; 33: 47-50.
4. Jung B, Nougaret S, Conseil M, Coisel Y, Futier E, Chanques G, et al. Sepsis is associated with a preferential diaphragmatic atrophy: a critically ill patient study using tridimensional computed tomography. Anesthesiology 2014; 120: 1182-91.
5. Sukkasem W, Moftah SG, Kicska G, Godwin JD, Pipavath S, Stern E. Crus atrophy: Accuracy of computed tomography in diagnosis of diaphragmatic paralysis. J Thorac Imaging 2017; 32: 383-90.
6. Vassilakopoulos T, Petrof BJ. A Stimulating Approach to Ventilator induced Diaphragmatic Dysfunction. Am J Respir Crit Care Med 2017; 195: 281-2.
7. Kim WY, Suh HJ, Hong SB, Koh Y, Lim CM. Diaphragm dysfunction assessed by ultrasonography: influence on weaning from mechanical ventilation. Crit Care Med 2011; 39: 2627-30.
8. Damuth E, Mitchell JA, Bartock JL, Roberts BW, Trzeciak S. Long-term survival of critically ill patients treated with prolonged mechanical ventilation: a systematic review and meta-analysis. Lancet Respir Med 2015; 3: 544-53.
9. Dhungana A, Khilnani G, Hadda V, Guleria R. Reproducibility of diaphragm thickness measurements by ultrasonography in patients on mechanical ventilation. World J Crit Care Med 2017; 6: 185-9.
10. Dres M, Dubé B-P, Mayaux J, Delemazure J, Reuter D, Brochard L, et al. Coexistence and Impact of Limb Muscle and Diaphragm Weakness at Time of Liberation from Mechanical Ventilation in Medical Intensive Care Unit Patients. Am J Respir Crit Care Med 2017; 195: 57-66.
11. Vivier E, Mekontso Dessap A, Dimassi S, Vargas F, Lyazidi A, Thille AW, et al. Diaphragm ultrasonography to estimate the work of breathing during non-invasive ventilation. Intensive Care Med 2012; 38: 796-803.
12. Boussuges A, Gole Y, Blanc P. Diaphragmatic motion studied by m-mode ultrasonography: methods, reproducibility, and normal values. Chest 2009; 135: 391-400.
13. Goligher EC, Laghi F, Detsky ME, Farias P, Murray A, Brace D, et al. Measuring diaphragm thickness with ultrasound in mechanically ventilated patients: feasibility, reproducibility and validity. Intensive Care Med 2015; 41: 642-9.
14. Sarwal A, Walker FO, Cartwright MS. Neuromuscular ultrasound for evaluation of the diaphragm. Muscle Nerve 2013; 47: 319-29.
15. Caskey CI, Zerhouni EA, Fishman EK, Rahmouni AD. Aging of the diaphragm: a CT study. Radiology 1989; 171: 385-9.
16. Dovgan DJ, Lenchik L, Kaye AD. Computed tomographic evaluation of maximal diaphragmatic crural thickness. Conn Med 1994; 58: 203-6.
17. Harlaar L, Ciet P, van der Ploeg AT, Brusse E, van der Beek NAME, Wielopolski PA, et al. Imaging of respiratory muscles in neuromuscular disease: A review. Neuromuscul Disord 2018; 28: 246-56.
18. Goo HW. Four-Dimensional CT of the diaphragm in children: initial experience. Korean J Radiol 2018; 19: 111-8.