ARTICLE
Received 12 Jan 2015 | Accepted 21 Sep 2015 | Published 30 Oct 2015
DOI: 10.1038/ncomms9727 OPEN
Gut mucosal microbiome across stages of colorectal carcinogenesis
Geicho Nakatsu1,2, Xiangchun Li1,2,*, Haokui Zhou3,*, Jianqiu Sheng4,*, Sunny Hei Wong1,2,*, William Ka Kai Wu1,2,5,*, Siew Chien Ng1,2, Ho Tsoi1,2, Yujuan Dong1,2, Ning Zhang6, Yuqi He4, Qian Kang4, Lei Cao1,2, Kunning Wang1,2, Jingwan Zhang1,2, Qiaoyi Liang1,2, Jun Yu1,2 & Joseph J.Y. Sung1,2
Gut microbial dysbiosis contributes to the development of colorectal cancer (CRC). Here we catalogue the microbial communities in human gut mucosae at different stages of colorectal tumorigenesis. We analyse the gut mucosal microbiome of 47 paired samples of adenoma and adenoma-adjacent mucosae, 52 paired samples of carcinoma and carcinoma-adjacent mucosae and 61 healthy controls. Probabilistic partitioning of relative abundance profiles reveals that a metacommunity predominated by members of the oral microbiome is primarily associated with CRC. Analysis of paired samples shows differences in community configurations between lesions and the adjacent mucosae. Correlations of bacterial taxa indicate early signs of dysbiosis in adenoma, and co-exclusive relationships are subsequently more common in cancer. We validate these alterations in CRC-associated microbiome by comparison with two previously published data sets. Our results suggest that a taxonomically defined microbial consortium is implicated in the development of CRC.
1 Department of Medicine and Therapeutics, Institute of Digestive Disease, State Key Laboratory of Digestive Disease, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, 30-32 Ngan Shing Street, Shatin, Hong Kong SAR, China. 2 CUHK Shenzhen Research Institute, 2 Yuexing Road, Nanshan District, Shenzhen 518057, China. 3 Department of Microbiology, The Chinese University of Hong Kong, 30-32 Ngan Shing Street, Shatin, Hong Kong SAR, China. 4 Department of Gastroenterology, Beijing Military General Hospital, 28 Fuxing Road, Haidian, Beijing 100853, China. 5 Department of Anaesthesia and Intensive Care, The Chinese University of Hong Kong, 30-32 Ngan Shing Street, Shatin, Hong Kong SAR, China. 6 Department of Gastroenterology, The First Affiliated Hospital of Sun Yat-sen University, 58 Zhongshan Second Road, Yuexiu, Guangzhou 510080, China. * Co-first authors. Correspondence and requests for materials should be addressed to J.Y. (email: [email protected]) or to J.J.Y.S. (email: [email protected]).
NATURE COMMUNICATIONS | 6:8727 | DOI: 10.1038/ncomms9727 | www.nature.com/naturecommunications
© 2015 Macmillan Publishers Limited. All rights reserved.
The human intestinal mucosa is a dynamic interface between host cells and a network of microbial ecosystems1. Sustained gut microbial dysbiosis is a potential risk factor for exacerbating colorectal lesions towards carcinogenesis2. Progression of colorectal neoplasia has been linked to alterations of tumour microenvironment and mucosal barrier function, which facilitate the interaction of microbial products with host pathways3. A variety of gut commensals and their metabolites, such as butyrate and hydrogen sulphide4,5, are known for triggering inflammatory cascades and oncogenic signalling, thereby promoting genetic and epigenetic alterations in the development of colorectal cancer (CRC)3. Given our lack of understanding of how microbiome profiles change during the transition from normal mucosae through adenomatous to malignant lesions, assigning potential causative roles in CRC to certain members or a consortium of the gut microbes remains a grand challenge. Although the enrichment of Fusobacterium species and their regulation of tumour microenvironment have been described6–11, increasing evidence suggests that colorectal lesions are home to various other members of the gut microbiota12–14. Thus, variations in the taxonomic footprints of microbial communities across major stages of CRC development need to be clarified.
Here we perform 16S ribosomal RNA (rRNA) gene sequencing on mucosal microbiome of normal colorectal mucosae, adenomatous polyps and adenocarcinomas. Our approach focuses on the identification of distinct taxonomic configurations, or metacommunities. To determine associations of metacommunities with disease status, we adopt an approach similar to that published by Ding and Schloss15. By further analyses of paired samples and microbial relationships, we demonstrate that mucosal microbial communities show distinct alterations across stages of colorectal carcinogenesis.
Results
Metacommunities associated with colorectal tumour statuses. To determine associations of microbiome profiles with mucosal phenotypes, we performed 16S rRNA gene sequencing on mucosal biopsy samples collected from subjects with normal colons (n = 61), subjects with histology-proven adenoma (n = 47), and subjects with invasive adenocarcinoma (n = 52) at
the Prince of Wales Hospital of the Chinese University of Hong Kong and the First Affiliated Hospital of the Sun Yat-Sen University (see Supplementary Table 1 for overview of patient demographics). We implemented the sequence curation pipeline optimized for analyses of amplicon libraries as described in mothur software package16. This approach for quality control has been shown to result in a low sequencing error rate (0.06% or less)17. Using the reference Greengenes taxonomies (version 13.8), post-quality control reads were assigned to bacterial phylotypes. Phylotypes with the deepest taxonomic annotations were fitted to Dirichlet multinomial mixture (DMM) models to partition microbial community profiles into a finite number of clusters, using the Laplace approximation as previously described (see Supplementary Fig. 1 for comparison of ordination results from DMM and partitioning around medoid (PAM)-based clustering)15,18. We identified five metacommunities designated in Fig. 1 as A–E and observed strong associations with phenotypes of colorectal mucosae (Fisher's exact test with Monte Carlo simulation; q = 7.0 × 10⁻⁵; see also Supplementary
Data 1 for associations of metacommunities with clinical features). Subsequently, we screened for taxa that distinguished the metacommunities using the LEfSe algorithms (Fig. 1; see Supplementary Data 2 for the summary of linear discriminant analysis scores), and performed receiver operating characteristic
analyses to confirm that these markers confidently differentiated normal mucosae from lesions (Supplementary Fig. 2). The performance of metacommunity markers was comparable to that of the markers identified by Random Forests (see Supplementary Data 3 for the list of markers selected by tenfold cross-validations of the Random Forests algorithm)19. Furthermore, using metacommunity markers, we designed a two-way index, termed Microbial Community Polarization index (MCPI), to quantify the degree of mucosal dysbiosis associated with colorectal lesions (Fig. 1; see Supplementary Fig. 2a for the performance of the index).
Metacommunity A was represented by phylotypes of major bacterial phyla, including Bacteroidetes, Firmicutes, Proteobacteria and Fusobacteria. The representative members included Bacteroides, Bacteroides fragilis, Fusobacterium, Escherichia coli, Faecalibacterium prausnitzii and Blautia (Fig. 1). Metacommunity B was predominated by E. coli and had the least diverse community profile (Mann–Whitney U-test; mean q = 1.5 × 10⁻⁴; Supplementary Fig. 1). Metacommunity C differed by high intra-cluster variability due to inconsistent appearances of taxa (Supplementary Fig. 1). Metacommunity D was overrepresented by members of the Firmicutes with Bacteroides being equally abundant. Of all the metacommunity compositions examined, metacommunity E was particularly interesting in that Fusobacterium as well as some other Firmicutes associated with periodontal diseases were enriched. Indeed, metacommunity E had significantly higher levels of oral and/or potentially pathogenic taxa sharing nearly identical sequences with the reference 16S rRNA genes from the Human Oral Microbiome (Mann–Whitney U-test; mean q = 1.1 × 10⁻³; Supplementary Data 4) and the PATRIC bacterial pathogen databases (Mann–Whitney U-test; mean q = 8.4 × 10⁻³; Supplementary Data 4). Metacommunities C and E were strongly associated with adenomas and carcinomas (Fisher's exact test; q < 1.0 × 10⁻⁵), respectively. In total, 40% of adenomas were classified as metacommunity C whereas 48% of carcinomas were classified as metacommunity E. Metacommunities A and D together represented 59% of the normal controls.
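As a rough illustration of how an association between metacommunity membership and mucosal phenotype can be tested by Monte Carlo simulation, the sketch below shuffles metacommunity labels across samples and compares a chi-square statistic with its observed value. This is a stand-in for, not a reproduction of, the Fisher's exact test with Monte Carlo simulation used in the study; the function names, the choice of statistic and the toy labels are all assumptions made for illustration.

```python
import random
from collections import Counter


def chi_square_stat(groups, labels):
    """Pearson chi-square statistic for the groups x labels contingency table."""
    n = len(groups)
    group_totals = Counter(groups)
    label_totals = Counter(labels)
    observed = Counter(zip(groups, labels))
    stat = 0.0
    for g, g_total in group_totals.items():
        for l, l_total in label_totals.items():
            expected = g_total * l_total / n
            stat += (observed[(g, l)] - expected) ** 2 / expected
    return stat


def monte_carlo_association_p(groups, labels, n_rep=2000, seed=0):
    """Permutation p-value: share of label shufflings at least as extreme as observed."""
    rng = random.Random(seed)
    observed = chi_square_stat(groups, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_rep):
        rng.shuffle(shuffled)
        if chi_square_stat(groups, shuffled) >= observed - 1e-12:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_rep + 1)


# Hypothetical phenotype and metacommunity labels (not data from the study).
phenotype = ["normal"] * 20 + ["carcinoma"] * 20
metacommunity = ["D"] * 18 + ["E"] * 2 + ["E"] * 17 + ["D"] * 3
print(monte_carlo_association_p(phenotype, metacommunity))
```

With several phenotypes and five metacommunities the same shuffling scheme applies unchanged, since the statistic is computed over whatever categories appear in the two label lists.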
To validate the consistent enrichments of metacommunities in independent cohorts, we analysed the publicly available data sets of similar experimental design7,20. By training logistic regression models with LASSO penalization21 on relative abundance profiles in our discovery cohort that was previously subjected to DMM partitioning, we classified these independent samples into the metacommunities A–E. Fisher's exact tests showed that the enrichment of metacommunity E and depletion of metacommunity D in carcinomas are significant and consistent in both studies (Fisher's exact test; q < 0.005 for both data sets; Supplementary Data 5). For metacommunity markers, we fitted multiple linear regression models to their fold changes in carcinoma relative to corresponding carcinoma-adjacent mucosa and demonstrated statistically significant agreements between our discovery cohort and the two studies (Fig. 2a,b). Furthermore, we performed real-time PCR amplification of the most abundant 16S rRNA marker gene sequences of representative bacterial phylotypes in an independent Chinese cohort comprising 116 individuals (normal colon, n = 25; adenoma-affected, n = 41; carcinoma-affected, n = 50; see Supplementary Table 2 for overview of patient demographics) and confirmed the consistent enrichments of these markers (Fig. 2c).
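The agreement of marker fold changes between cohorts is summarized in Fig. 2a,b by adjusted R² values. A minimal sketch of that kind of summary follows, simplified to a single predictor (the study fitted multiple linear regression models whose exact covariates are not given here); the function name and the fold-change values are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x with (adjusted) R^2."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need two equal-length sequences with at least 3 points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r2 = 1.0 - ss_res / ss_tot
    n_predictors = 1
    # Adjusted R^2 penalizes the fit for the number of predictors used.
    adj_r2 = 1.0 - (1.0 - r2) * (n - 1) / (n - n_predictors - 1)
    return {"slope": slope, "intercept": intercept, "r2": r2, "adj_r2": adj_r2}


# Hypothetical mean log2 fold changes of the same markers in two cohorts.
validation_cohort = [-1.2, -0.5, 0.1, 0.8, 1.5, 2.1]
discovery_cohort = [-0.9, -0.6, 0.3, 0.5, 1.1, 1.9]
print(fit_line(validation_cohort, discovery_cohort))
```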
Paired analysis of mucosal metacommunities. The availability of paired samples allowed us to investigate how the microbiome changed at colorectal lesions when compared with adjacent
[Figure 1 graphic not reproduced. Recoverable labels: tracks for PATRIC and HOM relative abundance, mean ISDI (alpha diversity) and MCPI (negative to positive); columns grouped by phenotypes of colorectal mucosae (normal control, adenoma-adjacent, adenoma, carcinoma-adjacent, carcinoma) and by metacommunities A–E; rows are metacommunity marker taxa shaded by relative taxon abundance (min. to max.) and coloured by phylum (Bacteroidetes, Firmicutes, Proteobacteria, Fusobacteria, Actinobacteria).]
Figure 1 | Characterization of 16S rRNA gene catalogue for mucosal microbial communities in colorectal carcinogenesis. Fitting microbiome data to DMM models defined five metacommunities. Reads that are considered as being potentially originated from oral strains or known pathogenic strains in the human gut were classified against the 16S rRNA gene collections from the Human Oral Microbiome (HOM; version 13) database and PATRIC bacterial pathogen database as defined by pseudo-bootstrapped (n = 1,000) confidence scores of 100 at species-level taxa or deeper, using the naive Bayesian classifier. The panels of metacommunity markers are ranked in the descending order of linear discriminant analysis scores from top to bottom. Columns represent microbiome profiles (arcsine square root-transformed) of 269 mucosal biopsies from individuals with or without adenoma or adenocarcinomas. (MCPI < 0 for changes characteristic of adenomas; MCPI > 0 for changes characteristic of carcinomas).
mucosae at different stages. Although the proportions of discordant metacommunities between tumour and tumour-adjacent mucosae were similar in adenoma (36%) and carcinoma (39%) samples, we observed significant patterns of change in community configurations specifically among cancerous mucosae (Fig. 3a; Supplementary Data 6). Remarkably, the sampling of metacommunity E at lesion-adjacent mucosae was almost always accompanied by sampling of the same metacommunity at lesions (92%). The discordances in metacommunity D were mainly explained by the sampling of metacommunity E at lesions relative to lesion-adjacent tissues (69%) (Fig. 3b; see Supplementary Fig. 3 for metacommunity pairs across individuals). Using paired Wilcoxon's signed-rank test, we found no statistical differences in inverse Simpson's diversity index (ISDI) between lesion-adjacent mucosae and lesions (P = 0.804 in adenoma group; P = 0.158 in carcinoma group). Nevertheless, there was a significant increase in diversity within carcinomas as compared with adenomas (false discovery rate (FDR) = 0.0386).
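For readers unfamiliar with the diversity measure, the inverse Simpson's diversity index is the reciprocal of the summed squared relative abundances, so it rises as a community becomes richer and more even. The following is a minimal sketch; the function name and the read counts are hypothetical and are not taken from the study.

```python
def inverse_simpson(counts):
    """Inverse Simpson's diversity index: 1 / sum(p_i ** 2) over observed taxa."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must contain at least one positive value")
    return 1.0 / sum((c / total) ** 2 for c in counts if c > 0)


# Hypothetical read counts per phylotype in two biopsies.
even_community = [25, 25, 25, 25]
dominated_community = [97, 1, 1, 1]

print(inverse_simpson(even_community))       # 4.0: four equally abundant taxa
print(inverse_simpson(dominated_community))  # close to 1: one taxon dominates
```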
Using all microbiome parameters described in Fig. 1, we tested whether there were any differences among the subset of individuals with concordant community types between lesions and lesion-adjacent mucosae. Among the matched samples with concordant metacommunity D, the relative abundance of taxa that were classified to the Human Oral Microbiome database was moderately higher in lesions than lesion-adjacent tissues (P = 0.0361; FDR = 0.239). This difference was also reflected as an increase in ISDI for lesions (P = 0.0289; FDR = 0.239). By contrast, among samples with matched metacommunity E, there was a moderate decrease in diversity as well as increase in dysbiosis indexes for lesions as compared with lesion-adjacent tissues (ISDI: P = 0.0479, FDR = 0.239; MCPI: P = 0.0105, FDR = 0.210). As for other metacommunities, no difference was found between lesion and lesion-adjacent tissues.
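The gap between the nominal P values and the FDR values reported above reflects correction for multiple testing. A minimal sketch of the Benjamini-Hochberg step-up adjustment (the procedure named in the legend of Fig. 2) is given below; the function name and the input P values are hypothetical, and the full set of tests entering the study's correction is not reproduced here.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw P values from four comparisons.
raw_p = [0.01, 0.04, 0.03, 0.005]
print(benjamini_hochberg(raw_p))
```

Because every raw value is scaled by the number of tests divided by its rank, a nominally significant P value can yield an FDR well above 0.05 when many comparisons are made.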
To examine changes in bacterial markers across disease stages, we calculated the fold change of each metacommunity marker relative to lesion-adjacent mucosae. In early-stage CRC, Fusobacterium, Parvimonas, Gemella and Leptotrichia were most significantly enriched (Fig. 3c), which was accompanied by significant losses of Bacteroides, Blautia, F. prausnitzii, Sutterella, Collinsella aerofaciens and Alistipes putredinis. These changes were not significant in either pathological stage of adenoma or in late-stage CRC (Fig. 3c).
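The fold-change calculation described above can be sketched as the mean, over paired samples, of the log2 ratio of a marker's relative abundance in the lesion to that in the matched lesion-adjacent mucosa. The function name, the pseudocount used to avoid division by zero and the abundance values are assumptions for illustration; the study's handling of zero abundances is not stated in this section.

```python
import math


def mean_log2_fold_change(lesion, adjacent, pseudocount=1e-6):
    """Mean of per-pair log2(lesion / adjacent) relative abundances."""
    if len(lesion) != len(adjacent) or not lesion:
        raise ValueError("need two non-empty sequences of equal length")
    logs = [
        math.log2((les + pseudocount) / (adj + pseudocount))
        for les, adj in zip(lesion, adjacent)
    ]
    return sum(logs) / len(logs)


# Hypothetical relative abundances of one marker in three paired biopsies.
lesion_abundance = [0.08, 0.12, 0.05]
adjacent_abundance = [0.02, 0.03, 0.04]

# Positive values indicate a gain in lesions, negative values a loss.
print(mean_log2_fold_change(lesion_abundance, adjacent_abundance))
```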
Interactions of microbial taxa in disease states. We next inferred all pairwise taxonomic correlations within and/or between normal control, lesion and lesion-adjacent mucosae, using the SparCC algorithm22. After iteratively correcting for spurious correlation coefficients and controlling for false discovery rates, we demonstrated that the distributions of taxonomic correlations were significantly different across disease stages (Fig. 4; Supplementary Fig. 4). Among taxa
[Figure 2 graphic not reproduced. Recoverable labels: (a,b) mean of log2 fold change in discovery cohort data set plotted against mean of log2 fold change in the Kostic et al. data set and in the Zeller et al. data set, with points labelled Leptotrichia, Fusobacterium, Parvimonas, Gemella, Granulicatella, Streptococcus, Peptostreptococcus and Bacteroides fragilis and coloured by marker type (A–E); annotations in extraction order: adjusted R² = 0.51, P = 2.1 × 10⁻⁶; adjusted R² = 0.66, P = 2.7 × 10⁻⁹. (c) Relative abundance (arcsine square root) of Bacteroides fragilis, Granulicatella, Gemella, Peptostreptococcus and Parvimonas across mucosal phenotypes (normal control, adenoma-adjacent, adenoma, carcinoma-adjacent, carcinoma), with significance asterisks.]
Figure 2 | Validations of metacommunity markers in independent cohorts. (a,b) Fold-change analyses in paired carcinoma and carcinoma-adjacent samples in two additional cohorts demonstrated significant agreement with our discovery cohort: (a) Kostic et al.7 data set (n = 74) and (b) Zeller et al.20 data set (n = 48). Shown are adjusted R² and P values for goodness of fit from multiple linear regression models. (c) Real-time PCR amplifications of the most abundant sequences of representative bacterial phylotypes showed consistent enrichments in an additional Chinese cohort consisting of 207 mucosal biopsies (normal control, n = 25; adenoma, n = 41; adenocarcinoma, n = 50). Error bars represent s.e.m. P values from Mann–Whitney U-tests are adjusted by Benjamini-Hochberg (BH) step-up procedure; *q < 0.05; **q < 0.01; ***q < 0.001; ****q < 0.0001.
colonizing the normal control mucosae, we found the highest number of significant positive correlations with strengths of 0.5 or above (mean q < 0.01; Fig. 4a). Interestingly, trans-phylum relationships with strengths of 0.5 or above were less common in disease states than normal colonic mucosae (Fig. 4b,c; see Supplementary Data 7 for the complete list of correlation coefficients with FDR < 0.05). Members of the Firmicutes were more likely to form strong co-occurring relationships with one another in normal colonic mucosae than lesions and lesion-adjacent samples. These results indicate that members of the gut microbiota can form niche-specific relationships, which may be a response to an altered colonic mucosal microenvironment or could be one of the reasons for the disease state.
Our network analysis identified significant interactions among several prominent taxonomic members (Fig. 4; Supplementary Data 7). For example, Parvimonas and Peptostreptococcus, which are members of the oral microbiota, formed one of the strongest positive relationships exclusively within carcinoma and carcinoma-adjacent mucosae. Although Fusobacterium was positively related to the oral members of the Firmicutes, the strengths were relatively weak. Nevertheless, the occurrence of Fusobacterium was specific to carcinomas as indicated by relatively weak correlation between carcinoma-adjacent mucosae and carcinomas. This was in contrast to the occurrences of Parvimonas and Peptostreptococcus, which showed strong correlations between carcinoma-adjacent mucosae and carcinomas (Fig. 4c). We also identified several negative relationships of Fusobacterium with other taxa, including Subdoligranulum variabile, F. prausnitzii, Blautia, Clostridium clostridioforme, and Sutterella within and between carcinomas and carcinoma-adjacent mucosae. Among members of the gut commensals, the positive relationship between F. prausnitzii and Blautia was among the strongest of the Firmicutes in normal control and paired cancerous mucosae. Despite a weaker positive association within and between paired adenoma samples, F. prausnitzii exhibited a progressively stronger positive association with members of the Ruminococcaceae toward carcinogenesis. Conversely, the co-occurrence of Blautia and Bacteroides was remarkably stronger in normal mucosae but weakened with tumour development. Though E. coli and members of the Enterobacteriaceae were among the most abundant in paired adenoma samples, their co-occurrence relationship was weaker in paired carcinomas. Besides, Pseudomonas veronii correlated positively with low-abundance taxa such as Massilia, Pedobacter cryoconitis, and members of the Sphingomonadaceae and Erythrobacteraceae, and negatively with Bacteroides, F. prausnitzii and members of the Lachnospiraceae.
To validate our correlation analyses in independent cohorts, we performed Fisher's exact tests on the total number of significant positive and negative taxonomic relationships that had false discovery rates of 0.25 or less between the two studies being compared. The directions of taxonomic correlations were significantly concordant between our discovery cohort and the two studies (P < 1.0 × 10⁻³⁵ for both Kostic et al.7 and Zeller et al.20 data
[Figure 3 graphic not reproduced. Recoverable labels: (a) percentage of metacommunities (A–E) showing change or no change between lesion-adjacent mucosae and lesions, for adenoma and carcinoma; annotations in extraction order: P = 0.719; P = 2.0 × 10⁻⁴. (b) Percentage of change between metacommunities from lesion-adjacent mucosae to lesions for LGDP, HGDP, ECRC and LCRC (legend: 10%, > 10%, > 30%). (c) Mean of log2 fold change (gain or loss) of metacommunity markers in colorectal polyps (low-grade dysplasia), colorectal polyps (high-grade dysplasia), early CRC (stage I–II) and late CRC (stage III–IV), with significance legend q < 0.05, q < 0.1, q < 0.25, q ≥ 0.25. Labelled markers: A: 1. Bacteroides, 3. Parabacteroides distasonis; B: 2. Enterobacteriaceae; C: 1. Pseudomonas veronii; D: 1. Faecalibacterium prausnitzii, 2. Blautia, 9. Sutterella, 13. Collinsella aerofaciens, 17. Alistipes putredinis; E: 1. Fusobacterium, 2. Parvimonas, 6. Gemella, 7. Leptotrichia.]
Figure 3 | Community-wide alterations of microbiome profiles are important aspects of multistage colorectal tumour progression. (a) Discordance of taxonomic configurations between lesions and lesion-adjacent tissues was significantly associated with the metacommunities identified within carcinoma. Shown are mean P values from 1,000 iterations of Fisher's exact tests with Monte Carlo simulation (10,000 replicates). (b) Percentages of change between metacommunities from lesion-adjacent mucosae to lesions within each clinicopathologic stage of tumours. LGDP, colorectal polyps with low-grade dysplasia (n = 39); HGDP, colorectal polyps with high-grade dysplasia (n = 13); ECRC, early-stage CRC (n = 26); LCRC, late-stage CRC (n = 26). (c) Significances of fold change in metacommunity markers, as estimated by paired Mann–Whitney U-tests, were greatest at early-stage CRC.
sets). We also subjected the concordant taxonomic relationships to multiple linear regression analysis to show that the strengths of correlations are significantly supported by the two studies (Supplementary Fig. 5).
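The concordance test on correlation directions can be pictured as a 2 × 2 table in which taxon pairs are cross-classified by the sign of their correlation in each of two cohorts. Below is a minimal sketch of a two-sided Fisher's exact test on such a table, using only the hypergeometric distribution; the function name and the counts are hypothetical and are not the study's values.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x counts in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


# Hypothetical counts of taxon pairs by correlation sign in two cohorts:
# rows = sign in the discovery cohort, columns = sign in a validation cohort.
positive_positive, positive_negative = 40, 3
negative_positive, negative_negative = 4, 35
print(fisher_exact_two_sided(positive_positive, positive_negative,
                             negative_positive, negative_negative))
```

A small P value here indicates that pairs tend to carry the same sign in both cohorts more often than expected by chance.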
Discussion
Inter-individual variations in tumour-associated mucosal microbiome have posed a long-standing challenge for deciphering microbial signatures implicated in colorectal tumorigenesis. In this study, we demonstrate that as colorectal neoplasm progresses along the adenoma-carcinoma sequence, mucosal microbial communities can establish micro-ecosystems of their own, giving rise to metacommunities of specific structure with functional features that can be predicted (Supplementary Figs 6–8). Although a myriad of factors, such as lifestyle and dietary habits, could contribute to CRC, our systematic analysis highlighted the importance of microbial consortia as a potential player in colorectal tumour development. In this regard, the rediscovery of CRC-specific enrichment of Fusobacterium7–9 and B. fragilis23 and the identification of novel CRC-associated candidates, such as Gemella, Peptostreptococcus and Parvimonas, expand the current scope of bacterial involvement in CRC development. In particular, Gemella, Peptostreptococcus and Parvimonas along with other microbes of oral origin formed a strong symbiotic network, which characterized the CRC-associated metacommunity E. Future studies on their potential oncogenic functions using murine models of CRC will delineate whether
[Figure 4 graphic not reproduced. Recoverable labels: correlation networks for (a) normal control, (b) adenoma and adenoma-adjacent and (c) carcinoma and carcinoma-adjacent mucosae, with nodes grouped by phylum (Proteobacteria, Bacteroidetes, Actinobacteria, Firmicutes, Fusobacteria). Node shading (min. to max.) distinguishes taxa classified to PATRIC, to HOM, or to neither. Edge legend: co-occurring, r > 0.6, r > 0.5, r > 0.3; co-excluding, r < −0.4, r < −0.3. Node key: 1. Escherichia coli; 2. Bacteroides; 3. Fusobacterium; 4. Faecalibacterium prausnitzii; 5. Bacteroides fragilis; 6. Blautia; 7. Streptococcus; 8. Enterobacteriaceae; 9. Parvimonas; 10. Oscillospira; 11. Haemophilus parainfluenzae; 12. Lachnospiraceae; 13. Granulicatella; 14. Ruminococcaceae; 15. Ruminococcus; 16. Gemella; 17. Peptostreptococcus; 18. Bacteroides uniformis; 19. Pseudomonas veronii; 20. Coprococcus; 21. Sutterella; 22. Subdoligranulum variabile; 23. Blautia obeum; 24. Clostridiales; 25. Parabacteroides distasonis; 26. SMB53; 27. Collinsella aerofaciens; 28. Leptotrichia; 29. Ruminococcus bromii; 30. Clostridium clostridioforme; 31. Pedobacter cryoconitis; 32. Parabacteroides; 33. Mogibacterium; 34. Alistipes putredinis; 35. Butyricicoccus pullicaecorum; 36. Eggerthella lenta; 37. Odoribacter; 38. Ruminococcus gnavus.]
Figure 4 | Microbial community ecology at the mucosal interface is different across stages of colorectal carcinogenesis. (a–c) Correlation network of taxonomic partners in: (a) normal (n = 61), (b) adenomatous polyps (n = 52) and (c) cancerous mucosae (n = 52). Correlation coefficients were estimated and corrected for compositional effects using the SparCC algorithm. A subset of correlations with strengths of at least 0.3 was selected for visualization. Node size represents mean taxon abundance in each mucosal phenotype; metacommunity markers are denoted by node numbers accordingly. Taxa that are classified as members of the same bacterial phylum are encircled by dashed lines.
these candidates are drivers or passengers in colorectal tumorigenesis.
A unique feature of our experimental design is the sampling of mucosa near the site of a lesion at distinct stages of colorectal neoplasia. With this approach in mind, we have illustrated patterns of discordances in metacommunities between lesions and lesion-adjacent mucosae (Fig. 3b). A novel aspect of CRC pathogenesis that has been recently described is the association of biofilm-forming bacterial communities and their capacity to modulate cancer metabolism24–26. Thus, sub-networks of co-occurring and co-excluding microbes at and around neoplastic sites may reflect disease-specific colonic microenvironment (Fig. 4). In particular, we have identified co-exclusive relationships between members of Proteobacteria and Firmicutes in adenoma-adjacent samples. Such changes persisted in carcinoma-adjacent samples, implying that a substantial degree of dysbiosis may have already occurred in the greater colonic environment in tumour-bearing colons. This has a major implication as many gut microbiome studies were based on stool samples, which may reflect the disease state but possibly not the tumour microenvironment.
Our study identified alterations of taxonomic relationships at trans-phylum levels in tumours and tumour-adjacent mucosae. These could be a response to altered host cellular processes, such as energy metabolism and inflammation, at tumour niches. For example, dietary carbohydrate can promote intestinal epithelial cell proliferation4 and has been associated with incidences of CRC27,28. Inflammation or colitis-associated niche may also favour the growth of specific bacterial populations that could elicit oncogenesis29–32. In adenomatous lesions, the enrichments of E. coli and P. veronii are intriguing, raising the possibility of bacteria-triggered mutagenesis (see Supplementary Data 8 for detection of pks genomic islands for E. coli) as well as host-microbiome lateral gene transfer33,34, both of which may drive transformation of otherwise benign colonocytes by influencing genomic stability. Similarly, predicted enrichments in functional
potentials for xenobiotics metabolism35, utilization of polyamines36, and degradation of polycyclic aromatic compounds37 in metacommunities C and E (Supplementary Fig. 6) may suggest an increased susceptibility of colonocytes to pro-tumorigenic bacterial metabolites. Furthermore, the association of bacterial peptidoglycan biosynthesis pathways with metacommunity E (Supplementary Fig. 6) may modulate local inflammation in evolving neoplasms38 by enhancing intestinal cell permeability30, which may allow for a vicious cycle of tumour-potentiating activities of co-occurring invasive bacterial species. However, given the hypothetical nature and potential database biases in metagenome imputation, it remains to be determined whether or how such functional traits of gut microbial communities affect host cells during colon tumorigenesis.
An important issue that could not be directly addressed by our study is the identification of adenoma-associated metacommunities that are predictive of cancer progression. On the other hand, we have identified bacterial operational taxonomic units (OTUs) with progressively increasing abundance, such as B. fragilis and Granulicatella (Supplementary Fig. 9), in the adenoma-carcinoma sequence. B. fragilis is known to induce signal transducer and activator of transcription 3 and Th17-dependent pathway in colitis-associated CRC (see Supplementary Data 8 for detection of bft genes from enterotoxigenic B. fragilis)37 whereas the abundance of Granulicatella adiacens in saliva is associated with chronic pancreatitis and pancreatic cancer39. These bacterial candidates will require functional validations to assess their prognostic values for tumour recurrence in polypectomized adenoma patients in future prospective studies. Another limitation of our study is that mucosa-associated microbiota could be altered by bowel cleansing preparation and reagents40. However, this is inevitable given the necessary procedure for sample acquisition.
Our study marks an additional step towards defining mucosal community configurations in colorectal tumorigenesis. Perhaps the most practically challenging step is the temporal association of metacommunities with pre-onset monitoring and post-manifestation follow-up of diseases. Future genomic analyses interrogating the cross-talk between subtypes of immune cell populations, host cell epigenomes and microbial consortia will be essential to define the multifaceted roles of gut microbiome in human health and diseases.
Methods
Patient recruitment and informed consent. We enrolled individuals who had undergone standardized colonoscopic examinations at the Prince of Wales Hospital of the Chinese University of Hong Kong and the First Affiliated Hospital of Sun Yat-Sen University in Guangzhou between March 2011 and January 2014. Mucosal biopsies were obtained from a total of 160 individuals with tumour-free colon (n = 61), with confirmed histology of colorectal polyps (n = 47), or with invasive adenocarcinomas (n = 52). We also recruited an independent cohort of 116 individuals from the Beijing Military General Hospital, of whom 25 had normal colons, 41 had colorectal adenomas and 50 were diagnosed with CRC. Written informed consent was obtained from subjects or their authorized representatives. Samples originating from Hong Kong were collected as part of a screening cohort, which has been previously described41–43. Eligibility criteria for colonoscopy included: (1) age 50–70 years; (2) absence of existing or previous CRC symptoms, such as haematochezia, tarry stool, change in bowel habit in the past 4 weeks, or a weight loss of >5 kg in the past 6 months; and (3) not having received any CRC screening tests in the past 5 years. Samples originating from Chinese populations in Guangzhou and Beijing were collected through routine colonoscopy services for conventional indications, including (1) CRC symptoms such as haematochezia, tarry stool, change in bowel habit or weight loss; (2) positive faecal occult blood; and (3) abnormal imaging such as barium enema, computed tomography, magnetic resonance imaging or positron emission tomography. The exclusion criteria for colonoscopy included: (1) personal history of CRC, inflammatory bowel disease, prosthetic heart valve or vascular graft surgery and (2) the presence of medical disorders that were contraindications for colonoscopy.
Polyethylene glycol powders (Klean-Prep, Helsinn Birex Pharmaceuticals, Ireland) were mixed with 4 l of cathartic suspension for use as the standard bowel preparation regimen among all participants. Air insufflation was used for all procedures, which were performed by experienced colonoscopists in the endoscopy centres of each hospital in this study; we strictly aimed for caecal intubation and a withdrawal time of more than 6 min according to the current quality indicators for colonoscopy. Multiple mucosal biopsies were taken from each colorectal tumour with the greatest dimension of at least 0.5 cm and subsequently evaluated by H&E staining at the pathology suite. Biopsies were snap-frozen in cryovials immediately after polypectomy and stored at −80 °C until DNA extraction. Adjacent normal tissues were taken at least 4 cm away from lesions. Colorectal mucosae were obtained using cold biopsy forceps separately for lesions and lesion-adjacent tissues to avoid cross-contamination between samples. The histopathology reports were made according to the checklist recommended by the College of American Pathologists (3.1.0.0). Control biopsy samples were provided by individuals who had no lesion detected during colonoscopy. Although the biopsies originated in various anatomical regions throughout the caecum, colon and rectum, we observed no significant biogeographical bias in metacommunities sampled (Supplementary Data 1). Any nucleic acid or remaining biopsy samples from participants who withdrew consent after endoscopic examinations were destroyed. As enrolled subjects had highly stratified medical records, we tested whether the observed inter-individual differences in mucosal microbiome profiles were due to potentially confounding effects of subject demographics and laboratory-proven clinical diagnoses (Supplementary Fig. 10; Supplementary Data 1, 9 and 10).
The study conformed to the ethical principles outlined by the Declaration of Helsinki and was approved by the Institutional Review Boards of the Chinese University of Hong Kong, the Sun Yat-Sen University and the Beijing Military General Hospital.
Preparation of DNA amplicon library. For optimal isolation of bacterial DNA44, mucosal biopsies were disrupted by bead-beating on digestion in an enzymatic cocktail of mutanolysin and lysozyme (Sigma) before extraction and purification by QIAamp DNA Mini Kit, and quantification by Agilent 2100 Bioanalyzer. The amplicon library for unidirectional sequencing (Lib-L) on the 454 GS FLX Titanium platform was constructed using fusion primers ligated by Roche adaptor sequences, Multiplex Identifier (MID) tags, library keys, and template-specific sequences (27F-800R) targeted across the hypervariable regions 1–4 of 16S rRNA genes. The DNA library was subsequently purified (AMPure XP), quantified (Quant-iT PicoGreen dsDNA Assay Kit), and subjected to quality control by cleanup of short amplicon fragments according to the manufacturers' instructions.
Sequence curation pipeline. Quality control of sequencing reads was implemented as described in the mothur software suite16. Flowgrams were pre-processed by retaining all that had fewer than two mismatches and one or zero mismatch to the primer and barcode, respectively, and trimmed to 1,050 flows before the removal of pyrosequencing noise using the PyroNoise algorithm45. The de-noised reads were demultiplexed by removing sample-specific barcodes, further processed by removing any that had homopolymers longer than 10 nucleotide bases and/or had an ambiguous base call, and aligned against the non-redundant SILVA database (version 119) using the NAST algorithm46. Any sequence that failed to align with the V1-4 region as predicted by the primer set was discarded; the remaining sequences were trimmed to the same alignment coordinates over which they fully overlapped, clustered with more abundant sequences by a maximum difference of five nucleotide bases17, and checked for the presence of chimeras by de novo UChime47. The resulting sequences were classified against the Greengenes database (version 13.8) and annotated with deepest level taxa represented by pseudo-bootstrap confidence scores of at least 80% averaged over 1,000 iterations of the naive Bayesian classifier48. Any sequences that were classified as originating from eukarya, archaea, mitochondria, chloroplasts or unknown kingdoms were removed. The annotated sequences were assigned to phylotypes according to their consensus taxonomy with which at least 80% of the sequences agreed (see Supplementary Fig. 11 for taxonomic breakdown at class level). The final sequence count table contained 8,197 ± 4,471 (mean ± s.d.) reads per sample with a minimum and maximum read length of 450 and 623 nucleotide bases, respectively, and was rarefied at 1,000 reads per sample to reduce the effects of variable sequencing depths on downstream analyses (Supplementary Fig. 12).
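The rarefaction step at the end of this pipeline can be illustrated with a minimal Python sketch. It is not the mothur implementation; it simply subsamples one sample's reads to a fixed depth without replacement. The function name `rarefy`, the taxon counts and the choice to return `None` for samples below the target depth are assumptions made for illustration only.

```python
import random
from typing import Dict, Optional


def rarefy(counts: Dict[str, int], depth: int = 1000,
           seed: int = 0) -> Optional[Dict[str, int]]:
    """Subsample one sample's read counts to `depth` reads without replacement.

    Returns None when the sample has fewer than `depth` reads, so the caller
    can decide how to handle it (an assumption, not stated in the text).
    """
    total = sum(counts.values())
    if total < depth:
        return None
    # Expand the counts into one taxon label per read, then draw `depth` reads.
    reads = [taxon for taxon, n in counts.items() for _ in range(n)]
    drawn = random.Random(seed).sample(reads, depth)
    rarefied = {taxon: 0 for taxon in counts}
    for taxon in drawn:
        rarefied[taxon] += 1
    return rarefied


# Invented read counts for a single sample.
sample = {"Bacteroides": 5200, "Fusobacterium": 1800, "Granulicatella": 45}
sub = rarefy(sample, depth=1000, seed=42)
print(sum(sub.values()))  # 1000
```

Every sample that passes then carries exactly 1,000 reads, so differences in downstream diversity metrics are less likely to reflect sequencing depth alone.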
Determination of optimal microbial community clusters. Effects of binning rare phylotypes by their total relative abundance in the rarefied data set containing 592 taxa were assessed to determine whether two general methods agreed over a certain range of rarity thresholds in detecting optimal numbers of clusters: PAM49 and DMM modelling18. Procrustes analysis of truncated data sets, which were generated by applying rarity cutoffs of up to 10%, consistently demonstrated minimal cutoff-by-cutoff variations to the results of the non-metric multidimensional scaling: R = 0.994 ± 0.005 (mean ± s.d.)50. When changing rarity definitions between 0–1% for model fitting, the total number of reads per sample was preserved by grouping rare phylotypes. At around 0.1% rarity cutoff, we observed that a core list of 99 taxa was sufficient to detect the most comprehensive number of microbial community clusters as identified by both the Calinski–Harabasz index and the Laplace approximation to the model evidence. When changing rarity definitions above 0.1% at increments, the robustness of the PAM-based approach varied in contrast to the DMM-based approach (Supplementary Fig. 1a).
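The grouping of rare phylotypes described above can be sketched as follows. This is a minimal illustration under the assumption that rarity is judged by a taxon's share of all reads in the data set; the function name `bin_rare_phylotypes`, the `rare` label and the toy table are hypothetical.

```python
from typing import Dict, List


def bin_rare_phylotypes(table: List[Dict[str, int]], cutoff: float = 0.001,
                        rare_label: str = "rare") -> List[Dict[str, int]]:
    """Merge phylotypes whose data-set-wide relative abundance is below `cutoff`.

    Reads from merged phylotypes are summed into one bin, so the total number
    of reads per sample is unchanged.
    """
    totals: Dict[str, int] = {}
    for sample in table:
        for taxon, n in sample.items():
            totals[taxon] = totals.get(taxon, 0) + n
    grand_total = sum(totals.values())
    keep = {t for t, n in totals.items() if n / grand_total >= cutoff}

    binned = []
    for sample in table:
        row = {t: sample.get(t, 0) for t in sorted(keep)}
        row[rare_label] = sum(n for t, n in sample.items() if t not in keep)
        binned.append(row)
    return binned


# Two invented samples, each rarefied to 1,000 reads.
table = [{"A": 600, "B": 399, "C": 1}, {"A": 500, "B": 500, "C": 0}]
binned = bin_rare_phylotypes(table, cutoff=0.001)
print(binned[0])  # {'A': 600, 'B': 399, 'rare': 1}
```

Because the rare bin absorbs the merged reads, each sample keeps its rarefied depth, which matters for count-based models such as DMM.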
Prediction of metabolic potentials. Sequences that passed quality control were assembled into reference-free OTUs at 3% distance using the average neighbour algorithm as implemented within mothur16 (see Supplementary Fig. 13 for number of shared OTUs between disease states). Consensus taxonomy with a confidence score of at least 80% was generated for each OTU and the OTU count table was picked against the Greengenes reference OTU identifiers (version 13.5) for use in the two-step functional inference pipeline PICRUSt (ref. 51). PICRUSt uses precomputed gene copy numbers for KEGG Orthologous families based on finished bacterial genomes available in the Integrated Microbial Genome database to predict the gene family content for all microorganisms represented by the 16S-based Greengenes phylogeny, including OTUs with unknown gene content for which previously sequenced evolutionary relatives are available. The input OTU table was normalized by the predicted 16S rRNA gene copy numbers to estimate the true organismal abundances before the multiplication of the pre-calculated set of gene family counts for each taxon by the abundance of that OTU. The resulting metagenomic copy number table consisted of 6,909 KEGG Orthologous entries and served as input data in the HUMAnN pipeline that outputs the relative abundances of known microbial metabolic modules and pathways as defined by KEGG for each sample based on the user-provided table of gene family counts52. A total of 118 and 169 KEGG functional modules and pathways, respectively, were derived from the predicted metagenomic data. See Supplementary Figs 6–8 and Supplementary Data 11 and 12 for results of differential abundance analyses on gene families using the LEfSe algorithm.
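The two-step arithmetic described above (copy-number normalization, then multiplication by per-taxon gene family counts) can be sketched in a few lines. This is not PICRUSt code; the function name `predict_metagenome`, the OTU labels and all numbers are invented for illustration, and the KO identifiers are used only as placeholder keys.

```python
from typing import Dict


def predict_metagenome(otu_counts: Dict[str, float],
                       copy_numbers: Dict[str, float],
                       gene_content: Dict[str, Dict[str, float]]) -> Dict[str, float]:
    """Predict gene family abundances for one sample.

    Step 1: divide each OTU count by its predicted 16S rRNA copy number.
    Step 2: multiply the normalized abundance by the OTU's gene family counts
            and sum over OTUs.
    """
    predicted: Dict[str, float] = {}
    for otu, count in otu_counts.items():
        organisms = count / copy_numbers[otu]
        for ko, copies in gene_content[otu].items():
            predicted[ko] = predicted.get(ko, 0.0) + organisms * copies
    return predicted


otu_counts = {"otu1": 100, "otu2": 40}    # 16S read counts
copy_numbers = {"otu1": 5, "otu2": 2}     # predicted 16S copies per genome
gene_content = {
    "otu1": {"K00001": 1, "K00002": 2},   # gene copies per genome
    "otu2": {"K00001": 3},
}
print(predict_metagenome(otu_counts, copy_numbers, gene_content))
# {'K00001': 80.0, 'K00002': 40.0}
```

In this toy case both OTUs correspond to 20 organisms after normalization, although their raw read counts differ by a factor of 2.5.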
Correlation network inferred by phylogenetic marker genes. The rarefied data set containing 99 phylotypes, which were previously selected for the detection of microbial community clusters through DMM modelling, was subjected to compositional data analysis using the SparCC algorithm, which is known for its robustness to the compositional effects that are influenced by the diversity and sparsity of correlation in human microbiome data sets22. Taxon–taxon correlation coefficients were estimated as the average of 20 inference iterations refined by 100 exclusion iterations with the default strength threshold. A total of 10,000 simulated data sets were generated to calculate the corresponding empirical P values. This set of iterative procedures was applied separately to normal control, adenoma and carcinoma data sets to infer the basis correlation values within and/or between paired sampling sites. Correlation coefficients with magnitude of 0.3 or above were selected for visualization in Cytoscape (version 3.1.1).
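The post-processing of the correlation estimates can be illustrated with a short sketch. It does not implement SparCC itself. It assumes that correlations for the observed and simulated data sets are already available, uses one common convention for a two-sided empirical P value, and applies the 0.3 magnitude cutoff. The function names and all values are hypothetical.

```python
from typing import Dict, List, Tuple


def empirical_p(observed: float, simulated: List[float]) -> float:
    """Share of simulated correlations at least as extreme as the observed one."""
    extreme = sum(1 for r in simulated if abs(r) >= abs(observed))
    return extreme / len(simulated)


def select_edges(correlations: Dict[Tuple[str, str], float],
                 threshold: float = 0.3) -> List[Tuple[str, str, float]]:
    """Keep taxon pairs whose correlation magnitude is at least `threshold`."""
    return sorted(
        (a, b, r) for (a, b), r in correlations.items() if abs(r) >= threshold
    )


# Invented correlation estimates for three taxon pairs.
correlations = {
    ("Fusobacterium", "Parvimonas"): 0.62,
    ("Bacteroides", "Fusobacterium"): -0.35,
    ("Bacteroides", "Blautia"): 0.12,
}
for edge in select_edges(correlations):
    print(edge)
# ('Bacteroides', 'Fusobacterium', -0.35)
# ('Fusobacterium', 'Parvimonas', 0.62)

print(empirical_p(0.5, [0.1, -0.6, 0.2, 0.55]))  # 0.5
```

Filtering on magnitude keeps both co-occurring (positive) and co-exclusive (negative) relationships in the network.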
Definition of microbial community polarization index. Inspired by how one's microbiome profile can be summarized by the Microbial Dysbiosis index (MDI) as an important indicator of disease53, we designed a composite index of the MDI to describe how the level of microbial diversity is associated with colonic tumour burden. We calculated the fold change for each representative taxon from a community cluster by dividing the mean abundance in paired samples by that of normal controls and required a marker taxon to have a minimum fold change of 1.5 to be selected as an elemental variable of the MDI. We intended to define the microbial community polarization index (MCPI) as a measure of overall dysbiotic shifts that were more characteristic of adenoma over carcinoma, or vice versa (Fig. 1a, top upper panel; Supplementary Fig. 2). The MCPI of sample j was computed as follows:

$$
\mathrm{MCPI}_j = \log_{10}\left[\frac{\sum_{i\in C} TI_{ij} + \sum_{i\in C'} TI_{ij} + \sum_{i\in A} TD_{ij} + \sum_{i\in A'} TD_{ij}}{\sum_{i\in C} TD_{ij} + \sum_{i\in C'} TD_{ij} + \sum_{i\in A} TI_{ij} + \sum_{i\in A'} TI_{ij}}\right] \tag{1}
$$
References
1. Sears, C. L. & Garrett, W. S. Microbes, microbiota, and colon cancer. Cell Host Microbe 15, 317–328 (2014).
2. Zackular, J. P., Rogers, M. A., Ruffin, M. T. 4th & Schloss, P. D. The human gut microbiome as a screening tool for colorectal cancer. Cancer Prev. Res. 7, 1112–1121 (2014).
3. Irrazabal, T., Belcheva, A., Girardin, S. E., Martin, A. & Philpott, D. J. The multifaceted role of the intestinal microbiota in colon cancer. Mol. Cell 54, 309–320 (2014).
4. Belcheva, A. et al. Gut microbial metabolism drives transformation of Msh2-deficient colon epithelial cells. Cell 158, 288–299 (2014).
5. Ijssennagger, N. et al. Gut microbiota facilitates dietary heme-induced epithelial hyperproliferation by opening the mucus barrier in colon. Proc. Natl Acad. Sci. USA 112, 10038–10043 (2015).
6. Rubinstein, M. R. et al. Fusobacterium nucleatum promotes colorectal carcinogenesis by modulating E-cadherin/beta-catenin signaling via its FadA adhesin. Cell Host Microbe 14, 195–206 (2013).
7. Kostic, A. D. et al. Genomic analysis identifies association of Fusobacterium with colorectal carcinoma. Genome Res. 22, 292–298 (2012).
8. Castellarin, M. et al. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res. 22, 299–306 (2012).
9. Tahara, T. et al. Fusobacterium in colonic flora and molecular features of colorectal carcinoma. Cancer Res. 74, 1311–1318 (2014).
10. Mima, K. et al. Fusobacterium nucleatum and T cells in colorectal carcinoma. JAMA Oncol. 1, 653–661 (2015).
11. Gur, C. et al. Binding of the Fap2 protein of Fusobacterium nucleatum to human inhibitory receptor TIGIT protects tumors from immune cell attack. Immunity 42, 382–392 (2015).
12. Warren, R. L. et al. Co-occurrence of anaerobic bacteria in colorectal carcinomas. Microbiome 1, 16 (2013).
13. Geng, J., Fan, H., Tang, X., Zhai, H. & Zhang, Z. Diversified pattern of the human colorectal cancer microbiome. Gut Pathog. 5, 2 (2013).
14. Chen, W., Liu, F., Ling, Z., Tong, X. & Xiang, C. Human intestinal lumen and mucosa-associated microbiota in patients with colorectal cancer. PLoS ONE 7, e39743 (2012).
15. Ding, T. & Schloss, P. D. Dynamics and associations of microbial community types across the human body. Nature 509, 357–360 (2014).
16. Schloss, P. D. et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 75, 7537–7541 (2009).
17. Schloss, P. D., Gevers, D. & Westcott, S. L. Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies. PLoS ONE 6, e27310 (2011).
18. Holmes, I., Harris, K. & Quince, C. Dirichlet multinomial mixtures: generative models for microbial metagenomics. PLoS ONE 7, e30126 (2012).
19. Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
20. Zeller, G. et al. Potential of fecal microbiota for early-stage detection of colorectal cancer. Mol. Syst. Biol. 10, 766 (2014).
21. Tibshirani, R. Regression selection and shrinkage via the lasso. J. Royal Stat. Soc. B 58, 267–288 (1994).
22. Friedman, J. & Alm, E. J. Inferring correlation networks from genomic survey data. PLoS Comput. Biol. 8, e1002687 (2012).
23. Boleij, A. et al. The Bacteroides fragilis toxin gene is prevalent in the colon mucosa of colorectal cancer patients. Clin. Infect. Dis. 60, 208–215 (2015).
24. Dejea, C. M. et al. Microbiota organization is a distinct feature of proximal colorectal cancers. Proc. Natl Acad. Sci. USA 111, 18321–18326 (2014).
25. Johnson, C. H. et al. Metabolism links bacterial biofilms and colon carcinogenesis. Cell Metab. 21, 891–897 (2015).
26. Liu, J. et al. Metabolic co-dependence gives rise to collective oscillations within biofilms. Nature 523, 550–554 (2015).
27. Gnagnarella, P., Gandini, S., La Vecchia, C. & Maisonneuve, P. Glycemic index, glycemic load, and cancer risk: a meta-analysis. Am. J. Clin. Nutr. 87, 1793–1801 (2008).
28. Meyerhardt, J. A. et al. Dietary glycemic load and cancer recurrence and survival in patients with stage III colon cancer: findings from CALGB 89803. J. Natl Cancer Inst. 104, 1702–1711 (2012).
29. Arthur, J. C. et al. Intestinal inflammation targets cancer-inducing activity of the microbiota. Science 338, 120–123 (2012).
In equation (1), $TI_{ij}$ (or $TD_{ij}$) is the abundance of a marker taxon i increased (or decreased) in either case of carcinoma, carcinoma-adjacent, adenoma or adenoma-adjacent, which are denoted by C, C′, A and A′, respectively.
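The marker selection and the polarization index can be sketched in Python. This is a hedged illustration rather than the authors' code: it assumes that the index is a log10 ratio of summed marker abundances in the spirit of the MDI, that a "decreased" marker is one whose control-to-case ratio reaches the same 1.5-fold threshold, and that a small pseudocount is acceptable to avoid taking the logarithm of zero. The function names, the pseudocount and the toy abundances are all hypothetical.

```python
import math
from typing import Dict, Iterable, Set, Tuple


def select_markers(mean_case: Dict[str, float], mean_control: Dict[str, float],
                   min_fold: float = 1.5) -> Tuple[Set[str], Set[str]]:
    """Split taxa into increased and decreased markers relative to controls."""
    increased, decreased = set(), set()
    for taxon, control in mean_control.items():
        case = mean_case.get(taxon, 0.0)
        if control > 0 and case / control >= min_fold:
            increased.add(taxon)
        elif case > 0 and control / case >= min_fold:
            decreased.add(taxon)
    return increased, decreased


def polarization_index(sample: Dict[str, float],
                       numerator_groups: Iterable[Set[str]],
                       denominator_groups: Iterable[Set[str]],
                       pseudocount: float = 1e-6) -> float:
    """log10 ratio of summed marker abundances; the pseudocount avoids log(0)."""
    num = sum(sample.get(t, 0.0) for group in numerator_groups for t in group)
    den = sum(sample.get(t, 0.0) for group in denominator_groups for t in group)
    return math.log10((num + pseudocount) / (den + pseudocount))


# Invented relative abundances for one sample and invented marker groups.
sample = {"taxon_1": 0.30, "taxon_2": 0.10, "taxon_3": 0.04}
score = polarization_index(sample, [{"taxon_1"}, {"taxon_2"}], [{"taxon_3"}])
print(round(score, 3))  # 1.0
```

Under these assumptions a positive score means that the markers placed in the numerator dominate the sample, a negative score means the opposite, and a score near zero indicates no clear polarization.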
Details of statistical methods. Differential abundance analyses were performed using the LEfSe algorithm to identify significant gene markers that consistently differentiated at least one (or multiple) feature(s) in comparison with the others54. The biomarker relevance was ranked according to bootstrapped (n = 30) logarithmic linear discriminant analysis scores of at least 2. Using the R implementation of Random Forests with tenfold cross-validations and 100 iterations19, we selected a minimum set of bacterial taxa that maximally discriminated each mucosal phenotype; the variable importance of a microbial taxon was determined by 100 iterations of the algorithm with 3,000 trees and the default mtry of $p^{1/2}$, where p is the number of input phylotypes. To evaluate the performance of markers that typify metacommunities against those that are selected by supervised classification on mucosal phenotypes, we constructed LASSO logistic regression models with tenfold repeated internal cross-validations to mitigate the risks of over-fitting train-sets when predicting each test-set55,56. The data set was partitioned in such a way that each sample was selected exactly once by test-sets for which the prediction scores were generated for use with receiver operating characteristic analysis (Supplementary Fig. 2). Similarly, we subjected our discovery cohort data set to LASSO model training for five-way prediction of metacommunities in independent cohorts (Supplementary Data 5). Furthermore, we performed Kolmogorov–Smirnov tests to assess whether the observed differences in taxonomic relationships are statistically significant between disease states (Supplementary Fig. 4). For associations with categorical and continuous clinical metadata, and confounding factor analyses of microbiome metrics and relative abundance data of metacommunity markers, we applied Fisher's exact tests, Mann–Whitney U-tests, and multinomial logistic regression models, where appropriate. Statistical significances of multiple comparisons were corrected by the Benjamini–Hochberg step-up procedure.
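The multiple-comparison correction named at the end of this section can be written out in a few lines. The sketch below is a generic implementation of the Benjamini–Hochberg step-up adjustment, not the authors' code; the function name and the example P values are invented.

```python
from typing import List


def benjamini_hochberg(p_values: List[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


p_values = [0.01, 0.04, 0.03, 0.20]
print([round(q, 4) for q in benjamini_hochberg(p_values)])
# [0.04, 0.0533, 0.0533, 0.2]
```

Each raw P value is scaled by the number of tests divided by its rank, and the running minimum keeps a smaller raw P value from receiving a larger adjusted value than a bigger one.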
30. Grivennikov, S. I. et al. Adenoma-linked barrier defects and microbial products drive IL-23/IL-17-mediated tumour growth. Nature 491, 254–258 (2012).
31. Huber, S. et al. IL-22BP is regulated by the inflammasome and modulates tumorigenesis in the intestine. Nature 491, 259–263 (2012).
32. Couturier-Maillard, A. et al. NOD2-mediated dysbiosis predisposes mice to transmissible colitis and colorectal cancer. J. Clin. Invest. 123, 700–711 (2013).
33. Cuevas-Ramos, G. et al. Escherichia coli induces DNA damage in vivo and triggers genomic instability in mammalian cells. Proc. Natl Acad. Sci. USA 107, 11537–11542 (2010).
34. Riley, D. R. et al. Bacteria-human somatic cell lateral gene transfer is enriched in cancer samples. PLoS Comput. Biol. 9, e1003107 (2013).
35. Maurice, C. F., Haiser, H. J. & Turnbaugh, P. J. Xenobiotics shape the physiology and gene expression of the active human gut microbiome. Cell 152, 39–50 (2013).
36. Shah, P. & Swiatlo, E. A multifaceted role for polyamines in bacterial pathogens. Mol. Microbiol. 68, 4–16 (2008).
37. Murray, I. A., Patterson, A. D. & Perdew, G. H. Aryl hydrocarbon receptor ligands in cancer: friend and foe. Nat. Rev. Cancer 14, 801–814 (2014).
38. Royet, J., Gupta, D. & Dziarski, R. Peptidoglycan recognition proteins: modulators of the microbiome and inflammation. Nat. Rev. Immunol. 11, 837–851 (2011).
39. Wu, S. et al. A human colonic commensal promotes colon tumorigenesis via activation of T helper type 17 T cell responses. Nat. Med. 15, 1016–1022 (2009).
40. Farrell, J. J. et al. Variations of oral microbiota are associated with pancreatic diseases including pancreatic cancer. Gut 61, 582–588 (2012).
41. Harrell, L. et al. Standard colonic lavage alters the natural state of mucosal-associated microbiota in the human colon. PLoS ONE 7, e32545 (2012).
42. Wong, M. C. et al. A comparison of the acceptance of immunochemical faecal occult blood test and colonoscopy in colorectal cancer screening: a prospective study among Chinese. Aliment. Pharmacol. Ther. 32, 74–82 (2010).
43. Wong, M. C. et al. A validated tool to predict colorectal neoplasia and inform screening choice for asymptomatic subjects. Gut 63, 1130–1136 (2014).
44. Yeoh, K.-G. et al. The Asia-Pacific Colorectal Screening score: a validated tool that stratifies risk for colorectal advanced neoplasia in asymptomatic Asian subjects. Gut 60, 1236–1241 (2011).
45. Yuan, S., Cohen, D. B., Ravel, J., Abdo, Z. & Forney, L. J. Evaluation of methods for the extraction and purification of DNA from the human microbiome. PLoS ONE 7, e33865 (2012).
46. Quince, C., Lanzen, A., Davenport, R. J. & Turnbaugh, P. J. Removing noise from pyrosequenced amplicons. BMC Bioinformatics 12, 38 (2011).
47. Schloss, P. D. A high-throughput DNA sequence aligner for microbial ecology studies. PLoS ONE 4, e8230 (2009).
48. Edgar, R. C., Haas, B. J., Clemente, J. C., Quince, C. & Knight, R. UCHIME improves sensitivity and speed of chimera detection. Bioinformatics 27, 2194–2200 (2011).
49. Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 73, 5261–5267 (2007).
50. Koren, O. et al. A guide to enterotypes across the human body: meta-analysis of microbial community structures in human microbiome datasets. PLoS Comput. Biol. 9, e1002863 (2013).
51. Gobet, A., Quince, C. & Ramette, A. Multivariate cutoff level analysis (MultiCoLA) of large community data sets. Nucleic Acids Res. 38, e155 (2010).
52. Langille, M. G. et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat. Biotechnol. 31, 814–821 (2013).
53. Abubucker, S. et al. Metabolic reconstruction for metagenomic data and its application to the human microbiome. PLoS Comput. Biol. 8, e1002358 (2012).
54. Gevers, D. et al. The treatment-naive microbiome in new-onset Crohn's disease. Cell Host Microbe 15, 382–392 (2014).
55. Segata, N. et al. Metagenomic biomarker discovery and explanation. Genome Biol. 12, R60 (2011).
56. Smialowski, P., Frishman, D. & Kramer, S. Pitfalls of supervised feature selection. Bioinformatics 26, 440–443 (2009).
Acknowledgements
We thank the patients who provided samples for this study and participating clinicians who obtained informed consents and performed colonoscopies. This project was supported by China 973 Program (2013CB531401), China 863 program (2012AA02A506), China NSFC 81272194, SHHO foundation Hong Kong, Shenzhen Technology and Innovation Project Fund, Shenzhen (JSGG20130412171021059), and Shenzhen Virtual University Park Support Scheme to CUHK Shenzhen Research Institute. S.H.W. was supported by the Croucher Foundation.
Author contributions
G.N., X.L., H.Z., J.S., S.H.W. and W.K.K.W. contributed equally to this work. G.N., H.T., Y.D., L.C., K.W., J.Z. and Q.L. performed the experiments. J.S., S.C.N., N.Z., Y.H. and Q.K. collected the data. G.N., X.L. and H.Z. analysed the data. S.H.W., W.K.K.W., Q.L., J.Y. and J.J.Y.S. conceived, designed and supervised the project. G.N., X.L., H.Z., S.C.N., H.T., S.H.W., W.K.K.W., J.Y. and J.J.Y.S. wrote and edited the manuscript. All authors discussed the results and commented on the manuscript.
Additional information
Accession codes: 16S rRNA gene sequences analysed in this study have been deposited in the NCBI SRA database under the BioProject ID: PRJNA280026.
Supplementary Information accompanies this paper at http://www.nature.com/naturecommunications
Competing financial interests: The authors declare no competing financial interests.
Reprints and permission information is available online at http://npg.nature.com/reprintsandpermissions/
How to cite this article: Nakatsu, G. et al. Gut mucosal microbiome across stages of colorectal carcinogenesis. Nat. Commun. 6:8727 doi: 10.1038/ncomms9727 (2015).
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Copyright Nature Publishing Group Oct 2015