Introduction

Acute respiratory distress syndrome (ARDS) accounts for an estimated 75,000 deaths and 3.6 million hospital days annually in the USA [1]. Mechanical ventilation with low tidal volumes [2, 3] and early neuromuscular blockade [4] have been shown definitively to reduce mortality from ARDS. Despite these advancements, mortality from ARDS remains high, ranging from 27 to 45 % for mild to severe disease [5, 6].

Prone positioning has been used for four decades in patients with ARDS [7]. Its physiological benefits are well described: improved ventilation-perfusion matching [8], recruitment of dependent lung regions [9, 10], optimized chest wall mechanics [11], and enhanced drainage of tracheobronchial secretions [12]. Yet, these physiological benefits have failed to translate into reduced mortality in multiple clinical trials of prone positioning for ARDS [1318].

By contrast, a recent multicenter randomized trial by Guérin et al. [19] found that prone positioning reduced the risk of death by half. While previous meta-analyses and post hoc analyses have suggested benefit in patients with the highest overall illness severity [13, 16, 20] and with severe ARDS [13, 21, 22], no such benefit was seen even in the largest prior randomized trials enrolling 300–800 patients each [13, 14, 18].

Therefore, we performed the following meta-analysis of randomized trials of prone positioning in ARDS to identify sources of heterogeneity between studies that would account for their divergent findings. We anticipated that the gradual adoption of low tidal volume ventilation over the last decade would largely explain the conflicting results of these trials [2325]. We hypothesized that prone positioning reduces 60-day mortality from ARDS only when injurious high tidal volumes are avoided.

Methods

Identification of trials

Studies were identified using computerized literature searches of MEDLINE, EMBASE, LILACS, and Cochrane Central Register of Controlled Trials. The following Medical Subject Headings (MeSH) and keywords were combined to retrieve relevant articles: randomized controlled trial OR controlled clinical trial OR random* OR trial OR groups AND prone position (MeSH) OR supine position (MeSH) OR patient positioning (MeSH) OR prone OR proning OR supine AND respiratory distress syndrome, adult (MeSH) OR acute lung injury OR ARDS OR respiratory distress syndrome OR respiratory failure.

Selection of studies

Using the PICOS framework [26], two independent reviewers (SBM, SS) applied criteria to evaluate studies for inclusion. Disagreement in study selection was addressed through evaluation by a third reviewer (JRB) and final determination achieved by team consensus. Eligible studies involved adult patients meeting the Berlin definition of ARDS [5, 27], including those previously classified as having acute lung injury (PaO2:FiO2 201–300 mmHg) under the 1994 American-European Consensus Conference definition [28]. Studies were required to assess the intervention prone positioning compared to supine positioning during conventional mechanical ventilation. Co-interventions in addition to prone positioning were permitted. Studies that did not report mortality were excluded. Only randomized controlled trials were eligible.

Data extraction

Two authors (JRB, SBM) independently extracted data using a standardized data collection form. Discrepancies in collected data were addressed through team consensus. One included trial, Taccone et al. [18], stratified at randomization according to moderate or severe ARDS (PaO2:FiO2 100–200 or <100 mmHg). It was decided before data extraction to treat this trial as two separate studies in the meta-analysis because prior post hoc analyses have suggested the benefit of prone positioning may vary with ARDS severity [13, 21, 22].

Evaluation of primary outcome

The primary outcome was risk ratio (RR) of death at 60 days among patients assigned to prone compared to supine positioning. When not directly reported, 60-day mortality was determined by digitally enhancing the Kaplan-Meier plot to measure directly the relative distance from zero of the product limit estimator at 60 days [29]. In one study [15], neither 60-day mortality nor a Kaplan–Meier plot was available; the corresponding author provided the required information.

Quality assessment

Study quality was evaluated by considering concealment of allocation, completeness of follow-up, blinded analysis, crossover between study arms, post hoc exclusions, and early trial discontinuation. The Jadad score for randomized trials [30] was not considered because blinding of patients or caregivers to the study intervention was not possible.

Statistical analysis

The primary outcome of 60-day mortality was chosen a priori because it represents the maximum follow-up interval reported in all studies selected for analysis. Potential sources of heterogeneity considered a priori were use of low tidal volume ventilation, duration of respiratory failure prior to prone positioning, and dose (in hours/day) of prone positioning [25]. Before data extraction, an analysis plan was developed to account for anticipated sources of heterogeneity, consisting of stratification by high (>8 ml/kg predicted body weight, PBW) versus low (≤8 ml/kg PBW) mean baseline tidal volume.

Initial meta-analysis was carried out using random effects models to calculate risk ratios with 95 % confidence intervals for each included study and a combined risk ratio for all studies. This method was selected for its proven ability to account for within- and between-study variability [31]. Heterogeneity was quantified with the Q-statistic and I 2 index. Cumulative meta-analysis was also performed to evaluate the potential effects of gradual widespread adoption of lung-protective ventilation and other advances in critical care over time.

To evaluate whether tidal volume was a significant source of inter-study heterogeneity, additional random effects models were developed after stratifying by mean baseline tidal volume. Unrestricted maximum likelihood mixed effects regression was performed to evaluate for a dose-response relationship between mortality (log risk ratio) and mean baseline tidal volume (ml/kg PBW). Random effects models also were used to evaluate the effect of high (≥12 h/day) versus low (<12 h/day) proning dose.

To evaluate for publication bias, a funnel scatter plot of sample and effect size was constructed. Publication bias was also assessed via Begg-Mazumdar rank correlation and Egger’s regression to account for selection bias impact on the overall combined significance of the results. The meta-analysis was conducted using Comprehensive Meta-Analysis v2 (Biostat, Eaglewood, NJ, USA) with conscious effort to follow PRISMA guidelines [32].

Results

The initial search yielded 643 citations comprising 336 unique abstracts (Fig. 1). After excluding 108 abstracts that did not represent peer-reviewed primary research and 217 that did not meet the inclusion criteria, 11 articles were identified for full-text review to assess for eligibility. Four of these 11 studies were excluded after review for non-conventional ventilation in the control arm or non-randomized design. Seven randomized trials encompassing 2,119 patients were included for meta-analysis. A total of 1,088 patients were mechanically ventilated in the prone position and 1,031 in the supine position. One included study, Taccone et al. [18], stratified patients at randomization according to moderate or severe ARDS (PaO2:FiO2 100–200 or <100 mmHg) and was treated as two separate trials throughout the analysis.

Fig. 1
figure 1

Literature search strategy and study selection

Study quality

All studies were multicenter randomized controlled trials. In all cases, concealed allocation was achieved through either a centralized process or sealed envelopes. Loss to follow-up occurred rarely but was accounted for in detail. Post hoc exclusions occurred only for withdrawal of consent or mistaken inclusion in enrollment. Three trials were terminated early for slow enrollment [13, 16, 17], leading to a risk of insufficient statistical power to detect differences between groups.

Patient and study characteristics

Table 1 summarizes key features of the included studies. The primary endpoint for five of the seven trials was mortality at a specified time of follow-up [13, 14, 1719]. One trial’s primary endpoint was ICU mortality [16] and the other duration of mechanical ventilation [15]. Between 40 and 802 subjects were enrolled in each trial. Degree of hypoxemia required for study eligibility became increasingly severe over time. Targeted daily dose of prone positioning ranged from 6 to >20 h/day, with higher doses in more recent trials. All trials used lower levels of PEEP than currently recommended [33].

Table 1 Key design features of included studies

Patient characteristics and individual study outcomes are summarized in Table 2. The earliest studies are characterized by the highest tidal volumes and highest mortality. Four studies [1316] began enrollment before publication of the landmark NHLBI ARDS Network trial that demonstrated reduced mortality with low tidal volumes [2]. Four studies [15, 1719] targeted tidal volumes in the current recommended range of 6–8 ml/kg [34, 35]. Adverse events, including unintended extubation, were rare.

Table 2 Patient characteristics and outcomes by study

Prone positioning and mortality

Among individual studies, only the most recent trial by Guérin et al. [19] demonstrated significantly lower mortality with prone positioning. Collectively, risk ratio of death at 60 days was 0.83 (95 % CI 0.68–1.02; p = 0.073) (Fig. 2). Significant heterogeneity was found across studies (Q-statistic = 19.6; I 2 = 64 %; p = 0.006). Cumulative meta-analysis with successive addition of increasingly recent studies demonstrated a clear time trend toward reduced mortality with prone positioning (Fig. 3).

Fig. 2
figure 2

Effect of prone positioning on mortality, overall and stratified by mean baseline tidal volume. High tidal volume was defined as >8 ml/kg predicted body weight and low tidal volume as ≤8 ml/kg predicted body weight

Fig. 3
figure 3

Cumulative meta-analysis forest plot over time, demonstrating a trend favoring prone positioning with inclusion of each successive study

Effect of proning stratified by tidal volume

To test the a priori hypothesis that prone positioning reduces mortality only when injurious high tidal volumes are avoided, stratified analysis was performed by high (>8 ml/kg PBW) versus low (≤8 ml/kg PBW) mean baseline tidal volume (Fig. 2). After stratification, prone positioning was associated with a significant decrease in risk ratio of death for studies that used low tidal volumes (RR = 0.66; 95 % CI 0.50–0.86; p = 0.002) but not high tidal volumes (RR 1.00; 95 % CI 0.88–1.13; p = 0.949). The difference in risk ratio of death between strata was highly significant (p < 0.001). Stratification by tidal volume also substantially reduced heterogeneity. I 2 decreased from 64 % in the combined model to 11 and 25 % in the stratified high and low tidal volume models, respectively. The Q-statistic similarly decreased from 19.6 in the combined model to 3.4 and 4.0 in the stratified high and low tidal volume models, respectively. Within-stratum heterogeneity was not significant in either group. Meta-regression demonstrated a dose–response relationship between mean baseline tidal volume (in ml/kg PBW) and log-transformed risk ratio of death at 60 days with proning. A decrease in mean baseline tidal volume of 1 ml/kg PBW was associated with a decrease in risk ratio of death by 16.7 % (95 % CI 6.1–28.3; p = 0.001). Adjusted analysis accounting for potential confounders and other predictors to yield a more reliable meta-regression effect estimate could not be performed because of the small number of included studies.

In post hoc analysis, data were re-analyzed incorporating Taccone et al. [18] as a single study to determine whether the a priori decision to treat this reference as two studies affected the results. Treating Taccone et al. as a single study had no substantial impact on the risk ratio of death with prone positioning (RR = 0.67; 95 % CI 0.49-0.92; p = 0.014 for studies with low mean baseline tidal volume). Moreover, stratifying the analysis instead by whether the protocol specified a low tidal volume strategy produced the same groupings achieved with stratification by mean baseline tidal volume.

Other potential sources of heterogeneity

Stratified analysis by high (≥12 h/day) or low (<12 h/day) proning dose demonstrated a significant reduction in mortality with high doses (RR = 0.71; 95 % CI 0.56–0.90; p = 0.004) but not low doses (RR = 1.05; 95 % CI 0.92–1.19; p = 0.472). However, substantial within-stratum heterogeneity remained among studies of high proning dose, with a Q-statistic of 5.4 and I 2 of 44 %. Duration of respiratory failure prior to proning was not pursued because it was only reported in the four most recent studies [1619] (Table 1).

Mean SAPS score was strikingly similar across all studies (Table 2) such that analysis by overall illness severity was not pursued as a source of heterogeneity. Similarly, while maximum allowed PaO2:FiO2 for patient inclusion decreased with successive studies (Table 1), mean PaO2:FiO2 at enrollment ranged between 100 and 161 mmHg for six of the seven included trials [13, 14, 1619] (Table 2). The lone trial with a higher PaO2:FiO2 also was the smallest study included in the analysis [15]. Therefore, analysis by mean baseline PaO2:FiO2 was not pursued.

Assessment for publication bias

Funnel plot analysis, Egger’s regression, and Begg-Mazumdar rank correlation did not suggest evidence of publication bias (Kendall’s tau = −0.286, p = 0.322; Egger’s regression intercept −1.94; p = 0.129).

Discussion

This meta-analysis demonstrates that prone positioning significantly reduces mortality from ARDS when used with low tidal volume ventilation. No such significant benefit was seen among studies employing higher tidal volumes or when all studies were considered irrespective of tidal volume. Stratification by low or high mean tidal volume at baseline accounted for more than half the heterogeneity observed in the unstratified analysis. Two unique aspects of this study may account for these findings differing from prior meta-analyses.

First, prior meta-analyses focused on disease-related factors, namely degree of hypoxemia [21, 22] and overall illness severity [20], to account for heterogeneity and to suggest reasons for failure of demonstrable mortality benefit in clinical trials. Yet, mean SAPS score and PaO2:FiO2 were remarkably similar across studies, even as inclusion criteria for later studies targeted more severely ill patients. In fact, only one trial reported a mean baseline PaO2:FiO2 above 161 mmHg [15]. Additionally, only one trial performed stratified randomization by degree of hypoxemia [18]. All other trials included, without stratification, patients who would be classified with both moderate and severe ARDS by the Berlin definition [5], and three trials [1315] included patients with mild ARDS in addition to moderate and severe forms. Similarly, only one trial performed stratified randomization by illness severity [17]. No other trials accounted a priori for effect modification by illness severity in the study design. Differences in hypoxemia and illness severity are patient-specific factors that would best be evaluated with an individual patient-level meta-analysis.

By contrast, this study focuses on tidal volume, a modifiable treatment-related factor. Because tidal volume was protocolized in most trials, it can be assessed reasonably with a study-level meta-analysis. As a result, between-study heterogeneity measured by the Q and I 2 statistics is reduced by more than half when stratified by mean baseline tidal volume, indicating tidal volume alone accounted for over half the heterogeneity in the combined meta-analysis. Importantly, post hoc analysis stratified by protocol-specified tidal volume strategy yielded a near-identical effect estimate, alleviating concern about using mean baseline tidal volume to infer management for the length of the study.

Second, this analysis includes the recent study by Guérin et al. [19], the first randomized trial to demonstrate that prone positioning reduces mortality from ARDS. This trial achieved the lowest tidal volumes of all proning studies and had near-universal use of neuromuscular blockade early in the course of ARDS, two proven therapies [2, 4] that may augment the potential benefits of prone positioning. Additionally, Guérin et al. maintained patients in the prone position for an average of 17 ± 3 h/day. This dose is comparable to the most recent trials of prone positioning but far longer than earlier such trials. Proning dose has been suggested previously to correlate with survival [22].

This meta-analysis is limited by the small number of robust studies published on prone positioning for ARDS. Only randomized controlled trials were included to avoid risk of confounding by indication in observational studies from use of prone positioning as rescue for refractory hypoxemia [3638]. Studies excluded from this analysis but considered in previous meta-analyses involved patients without ARDS [39, 40], neonatal and pediatric patients only [41], high-frequency oscillatory ventilation as the control arm [42], or observational study design without a clearly defined control group [43]. Some of the heterogeneity in prior meta-analyses may have been avoided here by defining more focused inclusion criteria.

It is possible the effect modification attributed here to low tidal volumes is due to other explanatory factors [25]. Stratification by high (≥12 h/day) or low (<12 h/day) proning doses demonstrated a significant reduction in mortality in the high-dose group. However, this approach explained less than one-third of overall heterogeneity between studies (I 2 = 44 % in the high-dose proning stratum, compared to I 2 = 64 % overall). By comparison, little heterogeneity remained with stratification by mean baseline tidal volume (I 2 = 25 % in the low tidal volume stratum), indicating stratification by tidal volume better explained heterogeneity across studies. Still, it is likely that both low tidal volumes and increased proning duration make important contributions to the findings here.

Duration of respiratory failure prior to proning could not be considered here because of underreporting in included trials and may partially account for divergent study findings. While prone positioning may be more effective with increasing ARDS severity [21], similar mean baseline PaO2:FiO2 among six of the seven included trials precluded meaningful trial-level analysis. Furthermore, critical care for other conditions associated with ARDS, most notably sepsis [44], has improved considerably over the same period that low tidal volumes became widely adopted. Regardless, the conclusion regarding the benefit of prone positioning in the modern era would not change. An individual patient-level meta-analysis would provide important insights into the relative contribution of tidal volume, ARDS severity, proning dose, and other potential explanatory factors.

Importantly, the current literature does not capture the potential learning curve of newly introducing prone positioning to an inexperienced intensive care unit. Complications were rare in the experienced centers performing clinical trials of prone positioning [21]. However, complication rates during an introductory period, including dislodgment of life-sustaining tubes and equipment, may well be higher when proning is done in previously inexperienced centers.

Finally, the existing literature fails to consider the role for prone positioning with high-PEEP strategies (Table 1). Most recent trials targeting alveolar recruitment and prevention of atelectrauma have advocated for applying considerably higher PEEP for a given FiO2 requirement [4550] as part of an open lung-protective approach. A high-PEEP strategy is supported by a previous patient-level meta-analysis that demonstrated reduced mortality from moderate or severe ARDS with this approach [33]. Yet, even the most recent study by Guérin et al. [19] used precisely the low-PEEP strategy from the ARDS Network ALVEOLI trial [45]. Whether the clinical benefit of prone positioning holds true for patients receiving higher PEEP strategies is unknown, although physiological effects may be synergistic [51]. Confirmatory trials must evaluate prone positioning with concurrent open lung-protective ventilation to determine the potential role for broad adoption of prone positioning.