Article Text


Idiopathic pulmonary fibrosis: a paradigm of late-onset, single-gene human disease?
  1. Stephen J Chapman1 and
  2. Matt P Wise2
  1. 1Oxford University Hospitals and Oxford Biomedical Research Centre, Oxford, UK
  2. 2University Hospital of Wales, Cardiff, UK
  1. Correspondence to Dr Stephen J Chapman; schapman{at}

Statistics from

Idiopathic pulmonary fibrosis (IPF) remains as one of the most devastating respiratory diseases. Incompletely understood and difficult to characterise clinically, there are only limited treatments of proven benefit available and the prognosis is typically poor. Early studies were hampered by a lack of clear classification and diagnostic precision among interstitial lung diseases, and until recently the pathogenesis of IPF has only been poorly understood. However, the last few years have seen major advances in our understanding of the molecular basis of this disease, and in particular a series of landmark studies have begun to uncover the genetic basis of IPF.

Familial forms of IPF are well recognised, although apparently account for only a minority (approximately 10%) of adult cases.1 A number of genes have been implicated in familial IPF through linkage analysis, including genes involved in regulation of telomere length (eg, TERT and TERC) and those encoding surfactant proteins (eg, SFTPA2 and SFTPC) and mucin 5B (MUC5B).1–3 Recently, sporadic (non-familial) IPF has been subject to extensive genome-wide analysis in an attempt to identify underlying genetic susceptibility variants. Genome-wide association studies (GWAS) comprise the genotyping of hundreds of thousands of common genetic variants (termed polymorphisms) spread at intervals across the genome, and the frequency of these variants are typically compared between disease cases and healthy controls. The attraction of such studies is that they are not based on existing assumptions about disease pathogenesis—which are usually incomplete and may be misleading—and hence have the potential to identify associations with previously unsuspected genes and pathways. The cost of this unbiased approach is that large sample sizes (typically in the thousands) with independent replication case–control collections are required in order to power robust associations, given the very large number of statistical tests performed.

Such GWAS approaches have confirmed the role of genetic variation in TERT and TERC and MUC5B in susceptibility to sporadic as well as familial IPF.4–6 These studies have also identified novel associations, for example with polymorphism in the genes TOLLIP involved in innate immune signalling and DSP and DPP9 involved in cell–cell adhesion.5 ,6 Furthermore they have extended the phenotypic spectrum by describing associations between MUC5B polymorphism and early-stage interstitial lung abnormalities in the general population as well as outcome from IPF.7 ,8 The role of MUC5B in particular in sporadic IPF is remarkable, as the functional MUC5B promoter polymorphism appears to exert a disproportionately large effect size (an approximately eightfold increase in risk) for such a common variant.9 Nevertheless, as the heritability of IPF is unknown, it is unclear how much of the genetic component of this disease is accounted for by these variants. For other complex human adult diseases, the common polymorphisms identified through GWAS together appear to account for only a small proportion of known heritability. A popular explanation for the remaining ‘missing heritability’ is multiple different rare mutations of individually large effect size, which sit below the frequency cut-off identifiable through GWAS: identification of such rarer variants requires gene resequencing approaches.10

Aaron Hamvas and colleagues now report a candidate gene resequencing study of adults with IPF, compared with a disease-control group of patients with chronic obstructive pulmonary disease (COPD) and population-based database controls.11 Six genes were selected for resequencing based on their known involvement in cases of familial IPF and/or childhood interstitial lung disease: genes encoding surfactant proteins (SFTPA2, SFTPC), ATP-binding cassette member A3 (ABCA3), telomerase (TERT), thyroid transcription factor (NKX2–1) and mucin 5B (MUC5B). Unlike association studies of common polymorphism, the analysis of single variants is probably not appropriate for rare variant analysis and lacks statistical power, and instead a collapsing analysis was performed, which aggregated rare variants to assess mutational load across genes. Remarkably, a significant excess of rare, computationally predicted deleterious mutations were identified in the genes SFTPC and TERT in adult patients with IPF when compared to patients with COPD and population-based controls.11 Common polymorphisms in these genes were present at similar frequencies between the groups. Interestingly, the majority of patients with IPF with a rare, predicted deleterious mutation (11 of 14 cases) did not report a positive family history and had apparently sporadic, adult-onset IPF.11 The association between IPF and the common MUC5B promoter polymorphism was again replicated; of note there was no demonstrable enrichment of rare coding mutations in MUC5B in IPF cases compared to controls, although exon 49 was not sequenced.

There did not appear to be any interaction between these rare variants and the common MUC5B promoter polymorphism, although it should be noted that the sample size is relatively small and the study lacks power to demonstrate such epistatic (gene–gene) interactions. The study has other limitations common to IPF research, in particular that the diagnosis of IPF can be difficult to make with absolute certainty and may be mistaken for other idiopathic interstitial pneumonias or other forms of interstitial lung disease, which may have led to misclassification of cases. The recruitment of candidates for lung transplantation into the current study has resulted in patients at the younger end of the disease spectrum (average age 54 years), who may arguably be enriched for host genetic susceptibility factors. Nevertheless, the findings raise the intriguing possibility that rare genetic mutations of individually large effect may account for a significant fraction of adults with apparently sporadic IPF. Taken together with the recently described major role of MUC5B polymorphism in IPF susceptibility, an emerging paradigm is that adult-onset IPF may be in the large part a genetically determined condition, with some of the longest latency periods observed in human single-gene disease. Why do such large-effect mutations not manifest as clear-cut familial disease? In some cases they may represent familial IPF but a clear family history of other affected members is not apparent; alternatively the size of effect may not be sufficient to result in classical Mendelian segregation, or in some cases they may even represent de novo mutations which have arisen in affected individuals.

As with all genetic association studies, the findings require confirmatory replication in an independent study group. In addition it would be worthwhile extending the list of candidates to include other genes recently found to associate with IPF at genome-wide level, as in some cases an apparently disease-associated common polymorphism may simply be a marker which reflects a true association driven by causative rare variants on the same haplotype.5 Indeed, the study suggests that whole exome-based or genome-based sequencing approaches may be fruitful in this disease, and perhaps lead to the identification of novel mutations in previously unsuspected genes and pathways, some of which may be amenable to drug therapy. Such larger studies may also uncover the genetic architecture of IPF, in particular the relative contribution of common and rare variants, as well as possible epistatic interactions between variants of different frequency and potential gene–environment interactions, for example with cigarette smoking and ageing.

Even following the reclassification of interstitial lung disease, IPF is likely to be a rather crude umbrella term which contains a number of clinically and radiologically indistinguishable molecular subphenotypes within it. In addition to the use of genetic association studies, complementary approaches to subphenotype IPF include proteomics and microarray analysis of lung and peripheral blood cell gene expression and DNA methylation profile.12–15 Identification of these molecular subphenotypes will lend a new level of precision to clinical trials and outcome prediction, as well as valuable insights into the underlying pathophysiology and indeed aetiology—perhaps at last taking the ‘idiopathic’ out of pulmonary fibrosis.

View Abstract


  • Competing interests None.

  • Provenance and peer review Not commissioned; internally peer reviewed.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Linked Articles