Scholarly article on topic 'Decoding the complete arsenal for cellulose and hemicellulose deconstruction in the highly efficient cellulose decomposer Paenibacillus O199'

Decoding the complete arsenal for cellulose and hemicellulose deconstruction in the highly efficient cellulose decomposer Paenibacillus O199 Academic research paper on "Biological sciences"

Share paper
Academic journal
Biotechnol Biofuels
OECD Field of science

Academic research paper on topic "Decoding the complete arsenal for cellulose and hemicellulose deconstruction in the highly efficient cellulose decomposer Paenibacillus O199"

López-Mondéjar et al. Biotechnol Biofuels (2016) 9:104 DOI 10.1186/s13068-016-0518-x

Biotechnology for Biofuels

RESEARCH Open Access

Decoding the complete arsenal for cellulose and hemicellulose deconstruction in the highly efficient cellulose decomposer Paenibacillus O199

Rubén López-Mondéjar1*, Daniela Zühlke2, Tomás Vëtrovsky1, Dörte Becher2, Katharina Riedel2 and Petr Baldrian1


Background: The search for new enzymes and microbial strains to degrade plant biomass is one of the most important strategies for improving the conversion processes in the production of environment-friendly chemicals and biofuels. In this study, we report a new Paenibacillus isolate, O199, which showed the highest efficiency for cellulose deconstruction in a screen of environmental isolates. Here, we provide a detailed description of the complex multi-component O199 enzymatic system involved in the degradation of lignocellulose.

Results: We examined the genome and the proteome of O199 grown on complex lignocellulose (wheat straw) and on microcrystalline cellulose. The genome contained 476 genes with domains assigned to carbohydrate-active enzyme (CAZyme) families, including 100 genes coding for glycosyl hydrolases (GHs) putatively involved in cellulose and hemicellulose degradation. Moreover, 31 % of these CAZymes were expressed on cellulose and 29 % on wheat straw. Proteomic analyses also revealed a complex and complete set of enzymes for deconstruction of cellulose (at least 22 proteins, including 4 endocellulases, 2 exocellulases, 2 cellobiohydrolases and 2 p-glucosidases) and hemicellulose (at least 28 proteins, including 5 endoxylanases, 1 p-xylosidase, 2 xyloglucanases, 2 endomannanases, 2 licheninases and 1 endo-p-1,3(4)-glucanase). Most of these proteins were secreted extracellularly and had numerous carbohydrate-binding domains (CBMs). In addition, O199 also secreted a high number of substrate-binding proteins (SBPs), including at least 42 proteins binding carbohydrates. Interestingly, both plant lignocellulose and crystalline cellulose triggered the production of a wide array of hydrolytic proteins, including cellulases, hemicellulases, and other GHs.

Conclusions: Our data provide an in-depth analysis of the complex and complete set of enzymes and accessory non-catalytic proteins—GHs, CBMs, transporters, and SBPs—implicated in the high cellulolytic capacity shown by this bacterial strain. The large diversity of hydrolytic enzymes and the extracellular secretion of most of them supports the use of Paenibacillus O199 as a candidate for second-generation technologies using paper or lignocellulosic agricultural wastes.

Keywords: Cellulose, Hemicellulose, Paenibacillus, Glycosyl hydrolase, CAZyme, Plant biomass

'Correspondence:; mondejar@

1 Laboratory of Environmental Microbiology, Institute of Microbiology of the CAS, v. v. i., Prumyslova 595, 252 42 Vestec, Czech Republic Full list of author information is available at the end of the article

O© 2016 The Author(s). This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.


Concerns about the non-renewable nature of fossil fuels and their rapid consumption together with their effects on the global climate have driven the search for alternative sources of energy [1]. In this context, plant biomass represents a renewable and abundant source for the production of environmentally friendly chemicals and biofuels [2]. Plant biomass is composed of lignocellulose, a highly organized and interlinked mix of different polymers containing mainly cellulose (35-50 %), hemicellu-loses (20-50 %) and lignin (10-35 %) [1]. Plant biomass represents a recalcitrant structure whose decomposition is limited by several factors, such as (1) the complex structure of lignin, (2) the crystalline organization of cellulose fibrer and (3) the diversity of hemicellulases; its degradation into simple compounds, such as mono-saccharides, may represent a challenging process [3, 4]. Although the chemical hydrolysis of lignocellulosic biomass may theoretically overcome these limitations, some secondary products formed in this process may inhibit fermentation, which represents the following step in the production of biofuels. The use of microbial enzymes and enzymatic hydrolysis can overcome these issues and represents a 'greener' technology [2, 5]. Cellulases are the most important enzymes in this process, cleaving the p-1,4 bond in the cellulose chain and are traditionally classified as endocellulases (cleavage inside the cellulose chain), exocellulases or cellobiohydrolases (acting on the ends of the chain) and p-glucosidases (converting cellobiose to glucose monomers). Most enzymes with cellulolytic activity belong to one of the above groups of hydrolases (glycoside hydrolases, GHs), a subgroup of the carbohydrate-active enzymes (CAZymes) [6]. Enzymes with cellulolytic activity are mainly found in the families GH1, GH3, GH5, GH6, GH7, GH8 GH9, GH12, GH45, and GH48. Similar to cellulases, hemicellulases cleave the various bonds within hemicellulose and are classified based on their mode of action and substrate preference into endoxylanases, xylosidases, xyloglucanases, endomannanases, mannosidases, fucosidases, arabino-furanosidases, and others. Hemicellulolytic enzymes are principally found in families GH2, GH10, GH11, GH16, GH26, GH30, GH31, GH36, GH43, GH51, GH74 and GH95. Importantly, members of the same GH family may catalyse different reactions, and their family membership may not sufficiently indicate the targets of their activity. Therefore, the comparison of protein sequences with biochemically characterized CAZymes is essential for an accurate classification [7]. Together with GHs, other CAZymes may also be involved in the degradation of cellulose and hemicellulose, such as xylan esterases (CE), lyases (PL) and lytic polysaccharide monooxygenases (LPMO) from families AA9 and AA10 [8]. Moreover,

both cellulases and hemicellulases may contain carbohydrate-binding modules (CBM) that bind cellulose or hemicelluloses and are essential for effective hydrolysis [1].

The exploration of environments where lignocellulose represents an important resource is a promising strategy for discovering new microorganisms and enzymes useful for biomass conversion and biofuel production [9]. In these natural habitats, plant biomass is mainly decomposed by fungi, bacteria and protozoa. Several fungi are well known as powerful producers of cellulolytic and hemicellulolytic enzymes for plant biomass conversion to sugars [10]; however, studies exploring diverse and versatile bacteria have led to the discovery of novel cellulases, some of which exhibit better properties than fungal ones and can thus reduce the economic costs of the conversion process [11]. In this context, the biomass-rich soils from forests are considered a "gold mine" for identifying new bacterial strains and novel enzymatic systems, which are extremely resistant to environmental stresses occurring in these ecosystems and may be able to survive the harsh conditions in biomass conversion and biofuel production [12, 13]. Several strains have been isolated from soil, and numerous cellulases and hemicellulases have been individually studied and characterized [2].

The genus Paenibacillus belonging to Firmicutes is known to include strains able to produce enzymes for industrial and agricultural applications, and numerous strains have recently been described as cellulolytic or hemicellulolytic [5, 14, 15]. In addition, cellulases and xylanases have been purified and described from the members of this genus [5, 16, 17]. However, the whole enzymatic complement of Paenibacillus spp. has not been systematically explored, although this is necessary for a complete understanding of the biodegradative potential of this genus, considering the synergistic mode of action of the enzymes. Although some recent works were focused on the study of multienzyme complexes [18, 19], the studies limited to the analysis of catalytic efficiency of individual enzymes and their combinations are insufficient for the complete evaluation of the potential of Paenibacillus spp. in the degradation of lignocel-lulose [14].

New molecular methods are useful for exploring the potential of bacterial strains as plant biomass decomposers [20]. The sequencing and analysis of bacterial genomes have revealed differences in the potential mechanisms of cellulose degradation and have been used repeatedly for the prediction of the cellulose and hemi-cellulose degradation potential of bacterial taxa based on the presence of specific CAZyme families [21]. However, the presence of cellulose- and hemicellulase-encoding genes in a genome does not necessarily imply that the

bacterial strain is able to degrade plant biomass [22], and proteomic approaches on cellulose-grown cells are therefore required to provide the link between the genomic potential and the actual expression [23, 24].

The aim of this study was to explore the cellulolytic and hemicellulolytic abilities of the bacterium Paenibacillus sp. O199. This novel strain was isolated from forest soil, where it exhibited the highest efficiency of cellulose deconstruction among the screened isolates. Whole-genome sequencing and annotation were combined with GeLC-MS/MS to characterize its extracellular proteome during growth on plant biomass and on microcrystalline cellulose. Our results reveal the presence of a complex multi-component enzymatic system that is expressed during the degradation of cellulose and complex lignocel-lulose, indicating that this strain may have a high potential for biotechnology applications.


Identification and cellulolytic activity of the Paenibacillus O199 strain

The bacterial isolate O199 degraded carboxymethylcel-lulose (CMC) during incubation on agar plates. More importantly, it was highly efficient at degrading cellulosic filter paper during growth in liquid media, degrading it in less than one week, which was faster than any other isolate screened from forest soil (Additional file 1: Figure S1A). The measurement of enzymatic activities after incubation showed the production of several enzymes involved in the deconstruction of plant polysaccharides (Additional file 1: Figure S1B).

The comparison of the 16S rRNA gene of the strain O199 with the reference 16S rRNA gene sequences of the type strains in the EzTaxon server showed the closest matches to P. tundrae (with a pairwise similarity of 99.86 %), P. tylopili (99.65 %), P. xylanexedens (98.65 %), and P. amylolyticus (99.43 %), all of which also clustered together in the phylogenetic tree (Additional file 2: Figure S2). The isolate was named Paenibacillus sp. O199.

Genomic features of Paenibacillus O199

The draft genome assembly indicated a genome size of 7,193,447 bases. Annotation predicted 6507 protein-coding sequences, including 72 RNA genes and 453 predicted SEED subsystem features. Among the predicted proteins, 476 (7.3 %) had one or more domains assigned to CAZyme families, including 231 GHs, 82 CBMs, 10 AAs, 79 CEs, and 13 PLs (Additional file 3: Table S1). The putative genes encoding GHs belonged to 61 different families with gene counts ranging between 1 and 28. The total content of GHs was among the highest so far recorded in the genomes of Paenibacillus species (Fig. 1).

Numerous genes assigned to GH families involved in cellulose and hemicellulose deconstruction were detected. For example, genes belonging to the cellulolytic families GH6, GH9, GH12 or GH48 were found in single copies in the genome, and genes from families GH1, GH2, GH3, GH5, GH16, GH30 and GH43 encoding putative p-glucosidases, p-xylanases and other hemicellulases were more abundant. The number of genes belonging to these families was high (100) in comparison with most of the sequenced strains of Paenibacillus but similar to those strains isolated from soil and the rhizosphere, such as P. borealis (101), P. riograndensis (105) or P. terrae (82) (Fig. 1). Furthermore, the genome was also rich in proteins with CBM domains involved in cellulose and hemi-cellulose binding, such as CBM3, CBM6, CBM9, CBM35 and CBM46. Multiple carbohydrate esterases (CE) were also detected in the genome, including families encoding enzymes that act on xylan, such as CE1, CE3, CE4, and CE7, among others. A gene encoding a lytic polysac-charide monooxygenase from the family AA10 was also identified.

In general, genes encoding potential cellulases and hemicellulases were randomly distributed in the genome, with the exception of some genes organized in short clusters. CAZymes were flanked by numerous genes coding for proteins involved in nutrient transport and by transcriptional regulators. The main transporters were annotated as ATP-binding cassette (ABC transporters) systems and phosphotransferase systems. In addition, numerous genes encoding transcriptional regulators (such as AraC, TetR, DeoR, LysR, and GntR, among others), two-component regulatory systems, and sigma factors were also frequently found flanking the CAZymes.

Expression of proteins during growth on crystalline cellulose and wheat straw

Analysis by GeLC-MS/MS allowed us to identify 961 proteins in the extracellular fraction of Paenibacillus O199 (Additional file 4: File SF1). Approximately 68 % of the proteins were expressed in both cellulose- and wheat straw-supplemented cultures, whereas 228 proteins (23.7 %) were exclusively expressed in cellulose, and 77 (around the 8 %) were found only with wheat straw (Additional file 5: Figure S3A). A higher number of expressed proteins showed no functional prediction or were not classified in any functional categories (Additional file 5: Figure S3B). However, many proteins involved in energy metabolism and in transport and binding of nutrients were found in the exoproteome. Even if some of these proteins showed higher abundance during growth on cellulose or on straw (e.g., in the case of proteins involved in sugar metabolism or in transport of

Paenibacillus sp. 0199

Paenibacillus pini JCM 16418 Paenibacillus terrae HPL-003 Paenibacillus stelifer DSM 14472 Paenibacillus sp. IHBB 10380 Paenibacillus sp. JDR-2 Paenibacillus sabinae T27 Paenibacillus riograndensis SBR5

Paenibacillus polymyxa CR1 Paenibacillus polymyxa SQR-21 Paenibacillus peoriae HS311 Paenibacillus odorifer DSM 15391 Paenibacillus mucilaginosus KNP414 Paenibacillus mucilaginosus K02 Paenibacillus lautus Y412MC10 Paenibacillus larvae DSM 25430 Paenibacillus graminis DSM 15220 Paenibacillus durus ATCC 35681 Paenibacillus borealis DSM 13188 Paenibacillus beijingensis DSM 24997

300 200

I Total number of GHs


■ GH3 □GH5 GH36 ■ GH39




GH10 GH51

GH12 GH16 GH74 ■ GH95

Fig. 1 Predicted numbers of glycosyl hydrolases (GH) in the genome of Paenibacillus O199 and the genomes of other species of Paenibacillus. On the left total number of GHs found in the genome; on the right gene content in GH families containing enzymes involved in the degradation of cellulose and hemicelluloses

carbohydrates, organic alcohols and acids), most proteins showed similar expression in both carbon sources (Additional file 5: Figure S3C).

Carbon source-dependent expression of carbohydrate-active enzymes

The percentages of expressed CAZymes were slightly higher on cultures growing on crystalline cellulose (31 %) than on wheat straw (29 %), with the exception of CEs and PLs, which showed higher diversity on straw. Twenty-two GH families containing proteins involved in the degradation of cellulose and hemicellulose were detected in the proteome of Paenibacillus O199 (Fig. 2, Additional file 4: File SF1). The strain expressed four endoglucanases, two exocellulases, two p-glucosidases and two cellobiohydrolases, although one of the endocel-lulases (ID2796) and one of the exocellulases (ID6170) were exclusively found on cellulose. Paenibacillus O199 also produced at least twenty-eight proteins degrading xylan (five endoxylanases and one p-xylosidase), xylo-glucan (including two xyloglucanases), glucomannan (two endomannanases) and mixed-linkage glucans (two licheninases and one endo-p-1,3(4)-glucanase), which acted on numerous residues of hemicellulose chains, and three of the proteins were only found in straw. Many proteins showed multiple domains, including one or more CBMs. The endoxylanase ID161 and the endomannanase ID3189 each contained four different domains, and the

licheninase ID3888 contained six. ID3888, ID161 and the a-galactosidase ID5332 also contained three copies of a surface-layer homology (SLH) motif for cell wall anchoring. Most of the proteins degrading cellulose and hemi-celluloses were predicted to be extracellularly located, with the exception of enzymes hydrolysing disaccharides and other oligosaccharides (p-glucosidases, p-xylanase or P-mannanases), which had cytoplasmic locations (Fig. 2). The summary of the expressed proteins involved in cellulose and hemicellulose degradation and their different targets within plant biopolymers is schematically shown in Fig. 3.

Other CAZymes involved in the cleavage of arabinans (GH43), galactans (GH2, GH30, GH42, GH43 and GH53) and rhamnogalacturonans (GH28 and GH105) from pectin chains were also recorded (Additional file 4: File SF1). Additionally, enzymes involved in chitin or peptidogly-can (GH18) and other glucan degradation (GH13, GH16) and GHs involved in general bacterial metabolism, such as glycan biosynthesis and catabolism (N-acetylgalac-tosaminidases) (GH109), were highly expressed. Interestingly, some proteins from GH and CBM families not involved in cellulose degradation, such as GH129, GH130 and CBM35, were exclusively found during growth on crystalline cellulose. Moreover, proteins from families CE1, CE3, CE7 and CE12, containing putative acetyl xylan esterases, and from families PL1, PL3, PL4, PL9, PL11, involved in pectin degradation, were expressed

ST CE ID CAZy domains1 SP2 Location3 Putative role4 EC number Closest hit5 Ident6 Action7

4321 GH1 No Cytoplasmic P-glucosidase BAE48718 99 C

1958 GH2 No Cytoplasmic P-mannosidase AAO75565 43 HC

5225 GH2 No Cytoplasmic p-galactosidase AAA23216 53 GA

5308 GH2 No Cytoplasmic P-mannosidase CAD33708 33 HC

3919 GH3 No Cytoplasmic P-glucosidase BAJ09393 64 C

5271 GH3 No Cytoplasmic Exo-p-1,4-glucanase 3.2.1.l4 AAB36835 41 C

5910 GH3 No Unknown Glucan 1,3-p-glucosidase EGC01435 46 GLU

6170 GH3-CBM6 Yes Extracellular Exo-P-1,4-glucanase 3.2.1.l4 AAB36835 44 C

2171 GH5 Yes Unknown Endo-P-1,4-mannanase 3.2.1.l8 BAD99527 62 HC

2182 GH5-CBM46-CBM46 Yes Extracellular Endo-P-1,4-glucanase AFC68970 81 C

6177 GH5 Yes Unknown Endo-P-1,4-glucanase AAA23230 28 C

1242 GH6-CBM3 Yes Extracellular Cellobiohydrolase (non-reducing end) AAC36898 51 C

2796 GH9-CBM3-CBM37-CBM3 Yes Extracellular Endo-P-1,4-glucanase CAB38941 80 C

161 SLH-SLH-SLH-CBM9-GH10-CBM22-CBM22-CBM22 Yes Extracellular Endo-p-1,4-xylanase BAC45001 87 HC

3757 GH10 Yes Extracellular Endo-P-1,4-xylanase AEO12683 87 HC

4543 GH10 Yes Extracellular Endo-P-1,4-xylanase CBA13561 86 HC

1530 GH11 Yes Extracellular Endo-P-1,4-xylanase ABD66557 98 HC

II 2027 GH12 Yes Extracellular Endo-P-1,4-glucanase AAP44491 53 C

1510 GH16 Yes Extracellular Endo-p-1,3-glucanase AAA22474 99 GLU

2298 GH16-CBM13 Yes Extracellular Endo-p-1,3-glucanase BAC06195 80 GLU

3825 GH16 Yes Extracellular Licheninase 3.2.1.l3 AAG53947 78 HC

3881 GH16 Yes Extracellular Endo-p-1,3(4)-glucanase ABD94065 92 HC

3888 SLH-SLH-SLH-CBM54-GH16-CBM4-CBM4-CBM61-CBM4-CBM4 Yes Cell Wall Licheninase 3.2.1.l3 ABN54008 37 HC

3189 CBM35-GH26-CBM46-CBM46-CBM3 Yes Extracellular Endo-P-1,4-mannanase 3.2.1.l8 ADL52314 57 HC

16l0 GH30 Yes Unknown P-glucuronidase BAO73305 29 HC

2291 GH30 No Unknown Unknown

4290 GH30 Yes Extracellular Endo-P-1,4-xylanase BAL45490 76 HC

4544 GH30 No Cytoplasmic P-xylosidase ABX45137 50 HC

l03 GH30 No Cytoplasmic Endo-p-1,6-galactanase BAC72917 34 GA

l08 GH30-CBM13-CBM13 Yes Extracellular p-fucosidase EFI72736 27 FU

5273 GH31 No Unknown a-xylosidase BAE77637 55 HC

2599 GH36 No Cytoplasmic a-galactosidase AF130985 66 HC

5332 SLH-SLH-SLH-GH36-CBM35-CBM35-CBM35 Yes Cell Wall a-galactosidase AAC38607 24 HC

5142 GH42 No Unknown p-galactosidase ADA67652 29 GA

5282 GH42 No Cytoplasmic p-galactosidase AAA22260 95 GA

1021 GH43-CBM13 Yes Extracellular Galactan |-1,3-galactosidase BAC69820 53 GA

1547 GH43-CBM66-CBM6-CBM66 Yes Extracellular a-L-arabinofuranosidase ACX75362 24 HC

1719 GH43-CBM35 Yes Extracellular Exo-a-1,5-L-arabinofuranosidase 3.2.1.- BAC68753 42 HC

2773 GH43 Yes Unknown Endo-a-1,5-L-arabinanase AHA50062 67 AR

3758 GH43 No Unknown P-1,4-xylosidase/a-L-arabinofuranosidase ACE82692 56 HC

4272 GH43-CBM6 No Unknown a-L-arabinofuranosidase ABP67152 54 HC

4531 GH43-CBM42-CBM32 Yes Multiple Galactan 1,3-|-galactosidase BAJ67504 30 GA

2795 GH48-CBM3 Yes Extracellular Cellobiohydrolase (reducing end) CAD32945 85 C

2792 GH51 No Cytoplasmic a-L-arabinofuranosidase AF159625 74 HC

4446 GH51 No Cytoplasmic a-L-arabinofuranosidase ACE73681 65 HC

3997 GH52 No Extracellular P-xylosidase BAA74507 88 HC

5231 GH74-GH74-GH74-GH74-CBM46-CBM3 Yes Extracellular Xyloglucanase BAE44527 81 HC

1047 GH74-GH74-GH74-GH74 Yes Extracellular Xyloglucanase BAE44527 81 HC

4445 GH74 Yes Extracellular Unknown

4320 GH95 No Cytoplasmic a-1,2-L-fucosidase ACJ53393 35 HC

1552 SLH-SLH-SLH-CBM9 No Cell wall and Extracellular Bindingto cellulose CAI79477 33 C

4287 CBM13 Yes Unknown Binding toxylan HC

_ 3208 CBM35 Yes Unknown Binding to xylan and mannan HC

4890 CBM35 Yes Unknown Binding to xylan and mannan HC

Abundance (NSAF)

1.CAZYmes wereannotated with dbCAN.

2. Presence orabsence ofa signal peptide, predicted with SignaIP4.0 (Petersen etal.,2011).

3. Protein location,predicted with PSORTb3.0.2 (Yu etal.,2010).

4. Putative role after sequence comparison with characterized proteinsfrom CAZy database.

5. Closest hitfromthe characterized sequencesin that GH family, usingBlastpfrom NCBI.

6. Identity(%) between the ID protein andthe best hit.

7. Protein involved in the degradation of cellulose (C), hemicelluloses (HC), galactans (GA), p-1,3-glucans (GLU),arabinans (AR) orfucans(FU).

Fig. 2 Expression of Paenibacillus O199 proteins involved in cellulose and hemicellulose degradation during the growth on wheat straw (ST) and cellulose (CE). Protein abundance is colour-coded, increasing from yellow to red, and CAZymes not found are in white. Proteins with putative role as cellulases and hemicellulases are highlighted in bold




i321911 1—^Lrx,-. ^ "fin:


ooooooooooooooooooo ooooooooooooooooooo


Ç^ D-Glucose (1,4) ^^ D-Glucose (1,3) D-Mannose D-Xylose L-Arabinose ^^ D-Glucuronic Acid Q D-Galactose /\ L-Fucose

Fig. 3 Simplified schematic overview of all the proteins expressed by PaenibaciHus O199, their role in the hydrolysis of cellulose and hemicellulose (xylan, glucomannan, xyloglucan and mixed-linkage glucans), and their location. Only proteins characterized as cellulases and hemicellulases are shown. Proteins marked with an asterisk were only expressed on cellulose, and proteins marked with a double asterisk were only expressed on wheat straw. Question marks indicate unclear location. Polysaccharides structures are based on Burton et al. [65]

GH10 ID4543

not only on straw but also on cellulose, where their substrates were not present. Lastly, an LMPO from the AA10 containing a CBM12 domain and putatively involved in chitin degradation was also produced on both carbon sources (Additional file 4: File SF1).

Carbon source-dependent expression of other proteins involved in binding and uptake of nutrients

Interestingly, numerous proteins involved in the transport and binding of nutrients were found in the exo-proteome of Paenibacillus O199 under both analysed

conditions (Additional file 5: Figure S3B and C). Most of them belonged to the subunit substrate-binding protein (SBP) from the ABC transporters, encompassing approximately 9.5 % of the total detected proteins. SBPs have been defined as key determinants for the specificity and affinity of ABC uptake systems in bacteria [25]. In our study, almost ninety different SBPs binding multiple monosaccharides (such as xylose, galactose or rhamnose), polysaccharides, oligopeptides, vitamins and microelements were produced in the bacterial cultures. Among them, at least forty-two proteins were involved in the binding of carbohydrates and fourteen in the binding of amino acids and peptides (Additional file 6: Table S2). Ten of them were specific for growth on straw and twelve for growth on cellulose (Additional file 4: File SF1).


The search for new bacterial strains and enzymes to efficiently degrade plant biomass represents one of the most promising approaches to optimize biofuel production [26]. In this study, a new strain isolated from forest soil showed high cellulolytic efficiency, being able to degrade filter paper strips in 7 days, which was much faster than any other cellulolytic isolate tested. The isolate was identified as a member of the genus Paenibacillus. Several strains of this globally abundant genus of soil bacteria have been already described as cellulolytic or hemicel-lulolytic (Figure S2). Due to the promising characteristics of these strains in biomass conversion, several genomes from cellulolytic and hemicellulolytic strains of Paeni-bacillus have recently been published [27-30]. However, no in-depth analyses have been carried out on them so it is unclear to what extent their genomes are expressed. Here, the genome of Paenibacillus O199 was sequenced, assembled and analysed to obtain complete information about the potential polysaccharide hydrolysis. The published genomes of four strains of Paenibacillus poly-myxa were analysed and compared recently [31], not only focusing mainly on the genes related to plant growth promotion but also noting a large arsenal of hydrolytic enzymes for plant biomass degradation. The genome of Paenibacillus O199 contained numerous genes encoding CAZymes putatively involved in plant polymer degradation, which together represented a higher percentage of its genome than other well-known cellulolytic bacteria, such as Clostridium thermocellum, C. japonicus or Strep-tomyces sp. ActE [24]. The correlation between a high number of CAZymes in the genome and plant biomass degradation has been suggested by several authors [32, 33]. Moreover, the number of genes belonging to GH families involved in cellulose and hemicellulose degradation in O199 was comparable to other described plant biomass-degrading strains, such as Paenibacillus sp.

JDR-2 (Fig. 1). In particular, some of these specific GH families (especially GH5, GH6, GH9, GH12 and GH48) have been identified as the main mediators of cellulose degradation by Gram-positive bacteria [7]. Furthermore, a large number of other GHs involved in the degradation of different biopolymers, such as chitin, peptidoglycan, starch or pectin, were found in the genome of Paeniba-cillus O199. The presence of diverse gene sets involved in the degradation of various biopolymers has also been reported from the genomes of other Paenibacillus strains isolated from soil [29, 31].

Unfortunately, genome analyses only indicate the functional potential of bacteria and not their real activity [22, 31]. Here, we have demonstrated that the potential cellu-lases and hemicellulases encoded in the genome of Paenibacillus O199 were produced on crystalline cellulose and on a complex lignocellulosic substrate, but the expressed proteins represented only a small fraction (approximately 30 %) of those predicted by genome sequencing. Similar results have been described in other cellulolytic bacteria, where only a fraction of predicted CAZymes were expressed [24]. Despite the absence of some predicted CAZymes, the expressed proteins in O199 still represented a complete system for cellulose deconstruction, with multiple endoglucanases, p-glucosidases, exoglu-canases and cellobiohydrolases (Figs. 2, 3), responsible for the powerful activity shown by this strain. Interestingly, enzymes such as endoglucanases, exoglucanases and p-glucosidases were redundant, showing different variants of secreted enzymes. The redundancy of hydro-lytic enzymes has been typically explained by synergis-tic effects observed during biopolymer degradation by enzyme mixtures and has been found in many other bacterial strains [15, 34]. For example, Gastelum-Arellanez et al. [14] observed that the endoglucanase activity of a cellulolytic strain of P polymyxa was due to at least fourteen different expressed proteins. Similar results were found in Paenibacillus O199, which expressed at least four different endoglucanases, one of which (ID2796) exhibited the typical characteristics of processive endo-glucanases [35]: the GH9 and CBM3 domains. Moreover, this enzyme seemed to act synergistically with a GH48 cellobiohydrolase (ID2795) encoded by a contiguous gene for degrading crystalline cellulose. Similar structures with contiguous genes encoding GH9 and GH48 cellu-lases have been reported in other strains of Paenibacillus sp. and in C. thermocellum [36, 37]. Redundancy was also found in proteins involved in hemicellulose degradation, such as endoxylanases, xyloglucanases, mannanases and licheninases, which also displayed diverse structures with GH domains from different families within a single protein and with multiple CBMs (Figs. 2, 3). As in the case of cellulases, endoxylanases from the GH10 and GH11

families have been previously described to act synergis-tically, cleaving different substituted or unsubstituted regions in the polymer chain [38].

Most of the hydrolytic enzymes expressed by O199 showed identifiable signal peptides and were secreted extracellularly to the media. Extracellular cellulases and hemicellulases are notable from an industrial perspective because they considerably reduce the costs of the extraction procedures [3]. Moreover, many of secreted enzymes showed that CBM domains involved in cellulose and hemicellulose binding (Figs. 2, 3), which allow a strong interaction between the free enzymes and the substrates for efficient hydrolysis of cellulose and hemicellulose. Additionally, four proteins—an endoxylanase (ID161), a licheninase (ID3888), an a-galactosidase (ID5332) and a cellulose-binding protein (ID1552)—contained SLH domains, which mediate the binding of the protein to the cell surface [2]. SLH domains in hydrolytic enzymes contribute to efficient plant polysaccharide degradation, binding the enzymes to the cell surface and allowing the oligomers released in the proximity of the membrane to be immediately transported into the cell [39]. The fact that the SLH-containing proteins also contain CBMs binding cellulose (ID1552) or hemicelluloses (ID161, ID3888, ID5332) indicates that O199 cells are associated with lignocellulose, which highly increases the efficiency of decomposition. In this way, similar proteins containing SLH domains have been also described in other Pae-nibacillus strains. For example, a multimodular protein containing SLH domains and five CBMs—homologue of ID3888—have been recently described to be also involved in binding glucans through the CBM domains and in sequestering the polysaccharides to the cell surface for allowing the rapid transport of oligosaccharides released into the cell [40]. An homologue of ID161—a GH10 xylanase containing SLH and CBM9 and CBM22 domains—has been also defined as essential for xylan utilization in one strain of Paenibacillus and was also primarily involved in the generation of oligomers acting as inducers for the rest of the xylanase genes [41].

Despite the fact that some cellulases and hemicellu-lases were only expressed on one of the tested carbon sources, most of the hydrolytic enzymes were found on both substrates (Figs. 2, 3). In Paenibacillus, it has been found that both xylanases and cellulases were induced by cellulose, xylan, or complex lignocellulosic substances [42]. In this study, crystalline cellulose was able to induce hemicellulolytic enzymes even when hemicellulose was not present in the media. Because cellulose is always closely associated with other polymers, such as hemicel-luloses and pectin, in plant material, the sole presence of cellulose or cellodextrins in the media may theoretically act as a general inducer of the whole enzymatic system

for plant biomass degradation. This general induction may be more efficient than complex regulatory systems, allowing the bacteria to use a wide variety of plant polymers present in the soil environment. However, the presence of numerous transcriptional regulators, two-component regulatory systems, and sigma factors surrounding the genes coding for CAZymes—also reported in other Firmicutes cellulolytic strains [43, 44]—suggests that lignocellulose degradation by O199 is likely under complex regulation. Although it is known that oligomers from cellulose and hemicellulose degradation can act as inducers of cytoplasmic or membrane-associated accessory enzymes [4], the regulatory systems that respond to the presence of cellulose degradation products remain unknown in most bacteria [45]. In the cellulolytic fungi Trichoderma reseei, the presence of ABC transporters in cell membranes has been related to the induction of cel-lulases [10]. ABC transporters were abundantly found in O199 and in the genomes of other Paenibacillus species [31]. Their proximity to genes encoding CAZymes has also been suggested as an indicator of their involvement in the transport of sugars released by hydrolytic enzymes [33]. In bacteria, Xu et al. [46] showed that soluble sac-charides are captured by SBPs from ABC transporters in C. cellulolyticum, inducing the activation of more transporters and CAZyme genes. In this study, at least forty-two putative SBPs involved in the binding of sugars were expressed during growth on straw and cellulose. Recent studies in other cellulolytic bacteria, such as Caldicel-lulosiruptor bescii, have reported that these secreted noncatalytic proteins are capable of binding a variety of plant cell wall soluble and insoluble saccharides, including microcrystalline cellulose, amorphous cellulose, xylo-glucan, xylan and mannan, among others [47]. Therefore, the high amount of SBPs found in the proteomes of O199 may also explain the high efficiency shown by this isolate in filter paper degradation because substrate binding is an important prerequisite for the degradation of insoluble polysaccharides. However, unlike the CBMs, the role of these proteins secreted by cellulolytic bacteria is still poorly understood [47]. The fact that a high percentage of the proteins expressed on plant biomass and on cellulose were annotated as hypothetical or showed no functional prediction (Figure S3B and S3C) reflects the present lack of understanding.


Methods for developing enzymatic cocktails for more efficient conversion of plant biomass into "green" energy are mainly based on improving the knowledge of all the players taking part in this process and in understanding the characteristics, dynamics and synergies between these enzymes and other involved proteins. The search

for new cellulolytic isolates and analysis of their hydro-lytic arsenal through the study of their genomes and pro-teomes are revealed as a promising strategy for ultimately enhancing the biomass conversion process. Here, we show that a newly described cellulolytic strain of Paenibacillus produced a rich and complex set of proteins for the complete deconstruction of cellulose and hemicellu-lose. Its high efficiency for cellulose hydrolysis is due to multiple and diverse enzymes showing different specificities, together with the presence of carbohydrate-binding domains and other nutrient-binding proteins, through the synergistic action of all of them. Interestingly, we found that most of the cellulolytic and hemicellulolytic enzymes were induced not only by complex plant biomass but also by cellulose.

The results support the use of the strain Paenibacillus O199 as a candidate for second-generation technologies using paper or lignocellulosic agricultural wastes (like wheat straw) as an inexpensive and sustainable alternative for the production of value-added chemicals and biofuels.


Strain isolation, identification and enzymatic activity

Paenibacillus O199 was isolated from the forest floor of a temperate sessile oak forest (Quercus petraea) in the Czech Republic. Litter chemistry and decomposition processes were studied in the selected area previously [48, 49], as well as the composition of bacterial communities in the litter and soil [50]. Strain isolation was performed by plating the forest floor material extracted with Ringer solution (100 mL g-1) on CMC agar medium (2 g L-1 yeast extract, 5 g L-1 carboxymethylcellulose (CMC), 50 mg L-1 of cycloheximide, pH 7.0) and incubated at 25 °C. After 7 days, agar plates were stained with 0.1 % Congo Red, and cellulose-degrading bacterial colonies showed clear halos indicating CMC degradation.

The ability of bacterial isolates to decompose cellulose was tested during growth in minimal medium with cellulose (filter paper strips of 20 mg weight) as the sole carbon source. The isolate O199 exhibited the fastest cellulose decomposition among all isolated strains. Bacteria were grown for 7 days at 25 °C on an orbital shaker. After that time, the integrity of the filter paper in the media was observed, and the activity of enzymes involved in the degradation of plant polymers was measured according to Valaskova et al. [51]. Briefly, the activities of cello-biohydrolase (exocellulase), p-glucosidase, p-xylosidase, a-arabinosidase, glucuronidase, p-galactosidase, a-glucosidase and p-mannosidase were measured using methylumbelliferyl (MUF)-based substrates on a microplate reader (Infinite, TECAN, Groedig, Austria), with an excitation wavelength of 355 nm and an emission

wavelength of 460 nm. Calibration of product development was based on standard curves with a range of MUF concentrations added to the sample.

The bacterial 16S rRNA gene of the strain O199 was sequenced using the primers 27F and 1492R [52]. The EzTaxon server (

[53] was used for isolate identification, using the EzTaxon database that contains 16S rDNA sequences of the type strains of species. The sequence of the 16S ribosomal RNA gene was deposited in the GenBank database under the accession number KR181834.

DNA extraction, whole-genome sequencing and genome analysis

Total genomic DNA was extracted from bacteria grown in GYM media (4 g L-1 glucose; 4 g L-1 yeast extract; 10 g L-1 malt extract; pH 7.0) with the UltraClean Micro-bial DNA Isolation Kit (MoBio Laboratories, Carlsbad, CA, USA), and sequencing was performed on the Illumina MiSeq platform in a paired-end 2 x 250 bp run. The sequence data were assembled using Velvet 1.2.10

[54], and a draft genome was obtained. Draft genome sequences were deposited in GenBank under the accession number LGRP00000000. Gene annotation was performed using Rapid Annotations Subsystems Technology (RAST) 4.0 [55, 56]. To identify the CAZymes, translated proteins from the predicted open reading frames were analysed with dbCAN [57]. Information about the carbohydrate-active enzyme content in the complete genomes of closely related bacteria was obtained from the CAZy database [6].

Protein expression and GeLC-MS/MS analysis of extracellular proteins

For the analysis of protein expression, bacteria were pre-grown in GYM media for 24 h, and 1 mL of culture was inoculated in triplicate in 1 L of MM containing 0.5 % w/v of either microcrystalline cellulose (Serva, Heidelberg, Germany) or wheat straw that was finely milled and repeatedly washed with hot water to remove low-molecular mass compounds while retaining plant cell wall biopolymers. Cultures were incubated for 7 days at 25 °C in an orbital shaker. After harvest, cultures were centrifuged, and proteins in the supernatant were precipitated with 10 % w/v trichloroacetic acid and resuspended in 8 M urea/2 M thiourea buffer. Protein concentration was measured in every sample with Roti®-Nanoquant (Carl Roth, GmbH, Germany), and 25 |g was separated by 1D-SDS-PAGE using Criterion™ TGX™ Precast Gels (Bio-Rad Laboratories, Hercules, CA, USA). Lanes were cut in ten equidistant pieces and subjected to trypsin digestion as previously described [58]. Peptide mixtures were separated by RP chromatography using

a nanoACQUITY™ UPLC™ system (Waters, Milford, MA, USA). Peptides were loaded onto a trap column and separated on the analytical column using a binary 80 min gradient of buffer B (99.9 % ACN, 0.1 % acetic acid) at a constant flow rate of 400 nL min-1. The UPLC system was coupled to an LTQ Orbitrap mass spectrometer (Thermo Fisher Scientific, Waltham, MA, USA). Full survey scans were recorded in the Orbitrap (m/z range from 300 to 2000) with a resolution of 30,000 and lock mass option enabled. MS/MS experiments in the LTQ XL were performed for the five most abundant precursor ions (CID), excluding unassigned charge states and singly charged ions. Dynamic exclusion was enabled after 30 s. For protein identification, spectra were searched against a database of Paenibacillus O199 containing sequences of all predicted proteins from its genome, including reverse sequences and sequences of common laboratory contaminants (13,098 entries). Database searches were performed using Sorcerer SEQUEST (Version v. 27 rev. 11, Thermo Scientific) and Scaffold 4.0.5 (Proteome Software, Portland, OR, USA) with the following search parameters: parent ion tolerance: 10 ppm, fragment ion mass tolerance: 1.00 Da, up to two missed cleavages were allowed and methionine oxidation (+15.99492 Da) was set as variable modification. For protein identification, a stringent SEQUEST filter for peptides was used (Xcorr versus charge state: 2.2, 3.3 and 3.8 for doubly, triply and quadruply charged peptides, respectively, and deltaCn value greater than 0.10), and at least two pep-tides per protein were required. Protein quantification was based on the normalized spectrum abundance factor (NSAF), which is calculated as the number of spectral counts (SpC) identifying a protein, divided by protein length (L), divided by the sum of SpC/L for all proteins in the experiment [59]. Statistical analysis was performed using MeV v4.8.1 [60]. Student's t test was performed with the following settings: unequal group variances were assumed (Welch approximation), P values based on all permutations with P = 0.01, significance determined by adjusted Bonferroni correction. Functional prediction and classification of proteins were performed by the in-house developed analysis pipeline Prophane 2.0 (http:// [61] and the RAST annotation server. Voronoi treemaps were generated using Paver (Decodon, Greifswald, Germany; Sequences of the identified proteins and those previously predicted to be GHs with dbCAN were compared with the characterized sequences deposited in the CAZy database for the GH families found in the proteome for functional prediction. For this purpose, sequences were aligned with MUSCLE 3.7 [62], and trees based on the maximum likelihood were constructed with PhyML 3.0 [63]. Where possible, the putative role for the identified

GHs was directly assigned based on the closest sequences in the family tree whose functional roles were known. Identity between similar proteins was calculated using the Basic Local Alignment Search Tool for proteins (BLASTP, The mass spec-trometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE [64] partner repository with the dataset identifier PXD003970.

Additional files

Additional file 1: Figure S1. Cellulolytic ability of Paenibacillus O199. A) Growth in minimal medium containing filter paper as the sole carbon source. The deconstruction of the paper by bacterial enzymes was complete after 7 days (on the right), in comparison with the control (on the left). B) Activity of enzymes involved in the degradation of polysaccharides in the culture. "-": no activity detected.

Additional file 2: Figure S2. Phylogenetic tree of Paenibacillus species based on complete 16S rRNA gene sequences. Strains described as cellulolytic (*) and hemicellulolytic f) are marked. The tree was constructed via the maximum likelihood method (ML) using a Kimura 2-parameter model (K2) and a discrete gamma distribution with invariant sites (G + I) (bootstrap confidence levels determined by 500 bootstrap replications are shown as percentages of nodes) with the software package MEGA 5.1. Sequences were obtained from EzTaxon.

Additional file 3: Table S1. Genome content and protein expression on wheat straw and on crystalline cellulose of carbohydrate-active enzymes of Paenibacillus O199.

Additional file 4: File SF1. Proteomic analysis of Paenibacillus O199.

Additional file 5: Figure S3. Identification of proteins in the proteomes of Paenibacillus O199. A) Venn diagrams depicting identified proteins during growth on cellulose (red) and wheat straw (blue). B) Voronoi treemap visualization of protein expression pattern of Paenibacillus O199 during growth on cellulose (red) and wheat straw (blue). Functional classification of proteins was carried out by Prophane2.0 and RAST and is based on TIGRFAM classification. Each cell represents a protein; proteins are clustered according to their function. Functional classes are separated by thicker black lines.

Additional file 6: Table S2. Summary of proteins annotated as substrate-binding proteins (SBP) from ATP-binding cassette (ABC) transporters detected in the proteomes of Paenibacillus O199. Annotation was performed with RAST.


CAZyme: carbohydrate-active enzyme; GH: glycosyl hydrolase; CBM: carbon-binding domain; CE: carbohydrate esterase; PL: polysaccharide lyase; AA: auxiliary activities; LPMO: lytic polysaccharide monooxygenase; CMC: carboxymethylcellulose; ABC: ATP-binding cassette; PTS: phosphotransferase system; SLH: surface-layer homology; SBP: substrate-binding protein; NSAF: normalized spectrum abundance factor; SpC: spectral counts; GeLC-MS/MS: one-dimensional polyacrylamide gel electrophoresis liquid chromatography tandem mass spectrometry.

Authors' contributions

RLM, DZ, KR, PB designed research; RLM, DZ, TV performed research; RLM, DZ, DB, PB analysed data; RLM, PB wrote the paper. All authors read and approved the final manuscript.

Author details

1 Laboratory of Environmental Microbiology, Institute of Microbiology of the CAS, v. v. i., Prumyslova 595, 252 42 Vestec, Czech Republic. 2 Institute of Microbiology, Ernst-Moritz-Arndt-University of Greifswald, Friedrich-Ludwig-Jahnstrasse 15, 17487 Greifswald, Germany.


We are grateful to Sabryna Junker for her help with protein identification by mass spectrometry.

Competing interests

The authors declare that they have no competing interests Availability of supporting data

The datasets supporting the conclusions of this article are available in the Genbank database, accession number LGRP00000000, and in the PRIDE repository with the dataset identifier PXD003970.


This work was supported by the BIOCEV—Biotechnology and Biomedi-cine Centre of the Academy of Sciences and Charles University project no. CZ.1.05/1.1.00/02.0109 from the European Regional Development Fund in the Czech Republic, by the Ministry of Education, Youth and Sports of the Czech Republic project no. CZ.1.07/2.3.00/30.0003, LM2015055, LD15086 and by the research concept of the Institute of Microbiology of the ASCR (RVO61388971).

Received: 18 February 2016 Accepted: 4 May 2016 Published online: 14 May 2016


1. Van Dyk JS, Pletschke BI. A review of lignocellulose bioconversion using enzymatic hydrolysis and synergistic cooperation between enzymes-factors affecting enzymes, conversion and synergy. Biotechnol Adv. 2012;30:1458-80.

2. Himmel ME, Xu Q, Luo Y, Ding S, Lamed R, Bayer EA. Microbial enzyme systems for biomass conversion: emerging paradigms. Biofuels. 2010;1:323-41.

3. Zhou Y, Pope PB, Li S, Wen B, Tan F, Cheng S, Chen J, Yang J, Liu F, Lei X,

et al. Omics-based interpretation of synergism in a soil-derived cellulose-degrading microbial community. Sci Rep. 2014;4:5288.

4. Rakotoarivonina H, Hermant B, Monthe N, Rémond C. The hemicel-lulolytic enzyme arsenal of Thermobacillusxylanilyticus depends on the composition of biomass used for growth. Microb Cell Fact. 2012;11:159.

5. Song HY, Lim HK, Kim DR, Lee KI, Hwang IT. A new bi-modular endo-beta-1,4-xylanase KRICT PX-3 from whole genome sequence of Paenibacillus terrae HPL-003. Enzyme Microb Technol. 2014;54:1-7.

6. Lombard V. Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42:D490-5.

7. Brumm PJ. Bacterial genomes: what they teach us about cellulose degradation. Biofuels. 2013;4:669-81.

8. Koeck DE, Pechtl A, Zverlov VV, Schwarz WH. Genomics of cellulolytic bacteria. Curr Opin Biotechnol. 2014;29:171-83.

9. Sukharnikov LO, Cantwell BJ, Podar M, Zhulin IB. Cellulases: ambiguous nonhomologous enzymes in a genomic perspective. Trends Biotechnol. 2011;29:473-9.

10. Dos Santos Castro L, Ramos Pedersoli W, Campos Antoniêto AC, Stecca Steindorff A, Silva-Rocha R, Martinez-Rossi NM, Rossi A, Brown NA, Goldman GH, Faça VM, et al. Comparative metabolism of cellulose, sophorose and glucose in Trichoderma reesei using high-throughput genomic and proteomic analyses. Biotechnol Biofuels. 2014;7:41.

11. Mori T, Kamei I, Hirai H, Kondo R. Identification of novel glycosyl hydrolases with cellulolytic activity against crystalline cellulose from metagenomic libraries constructed from bacterial enrichment cultures. SpringerPlus. 2014;3:365.

12. Maki M, Leung KT, Qin W. The prospects of cellulase-producing bacteria for the bioconversion of lignocellulosic biomass. Int J Biol Sci. 2009;5:500-16.

13. Yang JK, Zhang JJ, Yu HY, Cheng JW, Miao LH. Community composition and cellulase activity of cellulolytic bacteria from forest soils planted with broad-leaved deciduous and evergreen trees. Appl Microbiol Biotechnol. 2014;98:1449-58.

14. Gastelum-Arellanez A, Paredes-Lopez O, Olalde-Portugal V. Extracellular endoglucanase activity from Paenibacilluspolymyxa BEb-40: production,

optimization and enzymatic characterization. World J Microbiol Biotechnol. 2014;30:2953-65.

15. Mihajlovski KR, Carevic MB, Devic ML, Siler-Marinkovic S, Rajilic-Stojanovic MD, Dimitrijevic-Brankovic S. Lignocellulosic waste material as substrate for Avicelase production by a new strain of Paenibacillus chitinolyticus CKS1. Inter Biodeter Biodegr. 2015;104:426-34.

16. Hwang IT, Lim HK, Song HY, Cho SJ, Chang JS, Park NJ. Cloning and characterization of a xylanase, KRICT PX1 from the strain Paenibacillus sp. HPL-001. Biotechnol Adv. 2010;28:594-601.

17. Park IH, Chang J, Lee YS, Fang SJ, Choi YL. Gene cloning of endoglucanase Cel5A from cellulose-degrading Paenibacillus xylanilyticus KJ-03 and purification and characterization of the recombinant enzyme. Protein J. 2012;31:238-45.

18. Pason P, Kosugi A, Waeonukul R, Tachaapaikoon C, Ratanakhanokchai K, Arai T, Murata Y, Nakajima J, Mori Y. Purification and characterization of a multienzyme complex produced by Paenibacilluscurdlanolyticus B-6. Appl Microbiol Biotechnol. 2010;85:573-80.

19. van Dyk JS, Sakka M, Sakka K, Pletschke BI. Identification of endoglu-canases, xylanases, pectinases and mannanases in the multi-enzyme complex of Bacillus licheniformis SVD1. Enzyme Microb Technol. 2010;47:112-8.

20. Baldrian P, Lopez-Mondejar R. Microbial genomics, transcriptomics and proteomics: new discoveries in decomposition research using complementary methods. Appl Microbiol Biotechnol. 2014;98:1531-7.

21. Berlemont R, Martiny AC. Phylogenetic distribution of potential cellulases in bacteria. Appl Environ Microbiol. 2013;79:1545-54.

22. Mba Medie F, Davies GJ, Drancourt M, Henrissat B. Genome analyses highlight the different biological roles of cellulases. Nat Rev Microbiol. 2012;10:227-34.

23. Wilson DB. Processive and nonprocessive cellulases for biofuel production—lessons from bacterial genomes and structural analysis. Appl Microbiol Biotechnol. 2012;93:497-502.

24. Takasuka TE, Book AJ, Lewin GR, Currie CR, Fox BG. Aerobic deconstruc-tion of cellulosic biomass by an insect-associated Streptomyces. Sci Rep. 1030;2013:3.

25. Maqbool A, Horler RS, Muller A, Wilkinson AJ, Wilson KS, Thomas GH. The substrate-binding protein in bacterial ABC transporters: dissecting roles in the evolution of substrate specificity. Biochem Soc Trans. 2015;43:1011-7.

26. Harris PV, Xu F, Kreel NE, Kang C, Fukuyama S. New enzyme insights drive advances in commercial ethanol production. Curr Opin Chem Biol. 2014;19:162-70.

27. Chua P, Yoo H-S, Gan HM, Lee S-M. Draft genome sequences of two cellulolytic Paenibacillus sp. strains, MAEPY1 and MAEPY2, from Malaysian landfill leachate. Genome Announc. 2014;2:e00065-14.

28. Dhar H, Swarnkar MK, Gulati A, Singh AK, Kasana RC. Draft genome sequence of a cellulase-producing psychrotrophic Paenibacillus strain, IHB B 3415, isolated from the cold environment of the Western Himalayas, India. Genome Announc. 2015;3:e01581 -14.

29. Yuki M, Oshima K, Suda W, Oshida Y, Kitamura K, Iida T, Hattori M, Ohkuma M. Draft genome sequence of Paenibacillus pini JCM 16418T, isolated from the rhizosphere of pine tree. Genome Announc. 2014;2:e00210-4.

30. Shin SH, Kim S, Kim JY, Song HY, Cho SJ, Kim DR, Lee KI, Lim HK, Park NJ, Hwang IT, Yang KS. Genome sequence of Paenibacillus terrae HPL-003, a xylanase-producing bacterium isolated from soil found in forest residue. J Bacteriol. 2012;194:1266.

31. Eastman AW, Heinrichs DE, Yuan Z. Comparative and genetic analysis of the four sequenced Paenibacillus polymyxa genomes reveals a diverse metabolism and conservation of genes relevant to plant-growth promotion and competitiveness. BMC Genom. 2014;15:851.

32. Adams AS, Jordan MS, Adams SM, Suen G, Goodwin LA, Davenport KW, Currie CR, Raffa KF. Cellulose-degrading bacteria associated with the invasive woodwasp Sirexnoctilio. ISME J. 2011;5:1323-31.

33. Dam P, Kataeva I, Yang SJ, Zhou F, Yin Y, Chou W, Poole FL 2nd, West-pheling J, Hettich R, Giannone R, et al. Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725. Nucleic Acids Res. 2011;39:3240-54.

34. Berlemont R, Allison SD, Weihe C, Lu Y, Brodie EL, Martiny JB, Martiny AC. Cellulolytic potential under environmental changes in microbial communities from grassland litter. Front Microbiol. 2014;5:639.

35. Wilson DB. Microbial diversity of cellulose hydrolysis. Curr Opin Microbiol. 2011;14:259-63.

36. Sanchez MM, Irwin DC, Pastor FI, Wilson DB, Diaz P. Synergistic activity of Paenibacillus sp. BP-23 cellobiohydrolase Cel48C in association with the contiguous endoglucanase Cel9B and with endo- or exo-acting glucanases from Thermobifida fusca. Biotechnol Bioeng. 2004;87:161-9.

37. Berger E, Zhang D, Zverlov VV, Schwarz WH. Two noncellulosomal cellu-lases of Clostridium thermocellum, Cel9I and Cel48Y, hydrolyse crystalline cellulose synergistically. FEMS Microbiol Lett. 2007;268:194-201.

38. Vardakou M, Katapodis P, Topakas E, Kekos D, Macris BJ, Christakopoulos P Synergy between enzymes involved in the degradation of insoluble wheat flour arabinoxylan. Innov Food Sci Emerg. 2004;5:107-12.

39. Özdemir I, Blumer-Schuette SE, Kelly RM. S-layer homology domain proteins Csac_0678 and Csac_2722 are implicated in plant polysaccharide deconstruction by the extremely thermophilic bacterium Caldicellulo-siruptorsaccharolyticus. Appl Environ Microbiol. 2012;78:768-77.

40. Chow V, Kim YS, Rhee MS, Sawhney N. St John FJ, Nong G, Rice JD, Preston JF: A 1,3-1,4-ß-Glucan Utilization Regulon in Paenibacillus sp. Strain JDR-2. Appl Environ Microbiol. 2016;82:1789-98.

41. Fukuda M, Watanabe S, Yoshida S, Itoh H, Itoh Y, Kamio Y, Kaneko J. Cell surface xylanases of the glycoside hydrolase family 10 are essential for xylan utilization by Paenibacillus sp. W-61 as generators of xylo-oligosac-charide inducers for the xylanase genes. J Bacteriol. 2010;192:2210-9.

42. Waeonukul R, Kyu KL, Sakka K, Ratanakhanokchai K. Effect of Carbon Sources on the Induction of Xylanolytic-Cellulolytic Multienzyme Complexes in Paenibacillus curdlanolyticus Strain B-6. Biosci Biotechnol Biochem. 2008;72:321-8.

43. Wegmann U, Louis P, Goesmann A, Henrissat B, Duncan SH, Flint HJ. Complete genome of a new Firmicutes species belonging to the dominant human colonic microbiota (Ruminococcusbicirculans1) reveals two chromosomes and a selective capacity to utilize plant glucans. Environ Microbiol. 2014;16:2879-90.

44. NatafY, Bahari L, Kahel-Raifer H, Borovok I, Lamed R, Bayer EA, Sonenshein AL, Shoham Y. Clostridium thermocellum cellulosomal genes are regulated by extracytoplasmic polysaccharides via alternative sigma factors. Proc Natl Acad Sci USA. 2010;107:18646-51.

45. Zhang H, Hutcheson SW. Complex expression of the cellulolytic transcriptome of Saccharophagus degradans. Appl Environ Microbiol. 2011;77:5591-6.

46. Xu C, Huang R, Teng L, Wang D, Hemme CL, Borovok I, He Q, Lamed R, Bayer EA, Zhou J, Xu J. Structure and regulation of the cellulose degradome in Clostridium cellulolyticum. Biotechnol Biofuels. 2013;6:73.

47. Yokoyama H, Yamashita T, Morioka R, Ohmori H. Extracellular secretion of noncatalytic plant cell wall-binding proteins by the cellulolytic thermo-phile Caldicellulosiruptorbescii. J Bacteriol. 2014;196:3784-92.

48. Snajdr J, Cajthaml T, Valásková V, Merhautová V, Petranková M, Spetz P, Leppanen K, Baldrian P. Transformation of Quercuspetraea litter: successive changes in litter chemistry are reflected in differential enzyme activity and changes in the microbial community composition. FEMS Microbiol Ecol. 2011;75:291-303.

49. Snajdr J, Valásková V. Merhautová Vr, Herinková J, Cajthaml T, Baldrian P: Spatial variability of enzyme activities and microbial biomass in the upper layers of Quercus petraea forest soil. Soil Biol Biochem. 2008;40:2068-75.

50. López-Mondéjar R, Vorísková J, Vetrovsky T, Baldrian P The bacterial community inhabiting temperate deciduous forests is vertically stratified and undergoes seasonal dynamics. Soil Biol Biochem. 2015;87:43-50.

51. Valásková V, Snajdr J, Bittner B, Cajthaml T, Merhautová V, Hofrichter M, Baldrian P. Production of lignocellulose-degrading enzymes and degradation of leaf litter by saprotrophic basidiomycetes isolated from a Quercus petraea forest. Soil Biol Biochem. 2007;39:2651-60.

52. Lane DJ. 16S/23S rRNA sequencing. In: Stackebrandt E, Goodfellow M, editors. Nucleic acids techniques in bacterial systematics. Chichester: Wiley; 1991. p. 115-47.

53. Kim OS, Cho YJ, Lee K, Yoon SH, Kim M, Na H, Park SC, Jeon YS, Lee JH, Yi H, et al. Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species. I nt J Syst Evol Microbiol. 2012;62:716-21.

54. Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821-9.

55. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, et al. The RAST server: rapid annotations using subsystems technology. BMC Genom. 2008;9:75.

56. Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, et al. The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST). Nucleic Acids Res. 2014;42:D206-14.

57. Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;40:W445-51.

58. Grube M, Cernava T, Soh J, Fuchs S, Aschenbrenner I, Lassek C, Wegner U, Becher D, Riedel K, Sensen CW, Berg G. Exploring functional contexts of symbiotic sustain within lichen-associated bacteria by comparative omics. ISME J. 2015;9:412-24.

59. Zybailov B, Mosley AL, Sardiu ME, Coleman MK, Florens L, Washburn M. Statistical analysis of membrane proteome expression changes in Sac-charomyces cerevisiae. J Proteome Res. 2006;5:2339-47.

60. Saeed A, Sharov V, White J, Li J, Liang W, Bhagabati N, Braisted J, Klapa M, Currier T, Thiagarajan M, et al. TM4: a free, open-source system for micro-array data management and analysis. Biotechniques. 2003;34:374-8.

61. Schneider T, Schmid E, de Castro JV Jr, Cardinale M, Eberl L, Grube M, Berg G, Riedel K. Structure and function of the symbiosis partners of the lung lichen (Lobariapulmonaria L. Hoffm.) analyzed by metaproteomics. Proteomics. 2011;11:2752-6.

62. Edgar RC. MUSCLE: multiple sequence alignment with high accurance and high throughput. Nucleic Acids Res. 2004;32:1792-7.

63. Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307-21.

64. Vizcaíno JA, Deutsch EW, Wang R, Csordas A, Reisinger F, Ríos D, Dianes JA, Sun Z, Farrah T, Bandeira N, et al. ProteomeXchange provides globally co-ordinated proteomics data submission and dissemination. Nat Biotechnol. 2014;30:223-6.

65. Burton RA, Gidley MJ, Fincher GB. Heterogeneity in the chemistry, structure and function of plant cell walls. Nat Chem Biol. 2010;6:724-32.

Submit your next manuscript to BioMed Central and we will help you at every step:

• We accept pre-submission inquiries

• Our selector tool helps you to find the most relevant journal

• We provide round the clock customer support

• Convenient online submission

• Thorough peer review

• Inclusion in PubMed and all major indexing services

• Maximum visibility for your research

Submit your manuscript at