Skip to main content


We’d like to understand how you use our websites in order to improve them. Register your interest.

Optimizing the composition of a synthetic cellulosome complex for the hydrolysis of softwood pulp: identification of the enzymatic core functions and biochemical complex characterization



The development of efficient cellulase blends is a key factor for cost-effectively valorizing biomass in a new bio-economy. Today, the enzymatic hydrolysis of plant-derived polysaccharides is mainly accomplished with fungal cellulases, whereas potentially equally effective cellulose-degrading systems from bacteria have not been developed. Particularly, a thermostable multi-enzyme cellulase complex, the cellulosome from the anaerobic cellulolytic bacterium Clostridium thermocellum is promising of being applied as cellulolytic nano-machinery for the production of fermentable sugars from cellulosic biomass.


In this study, 60 cellulosomal components were recombinantly produced in E. coli and systematically permuted in synthetic complexes to study the function–activity relationship of all available enzymes on Kraft pulp from pine wood as the substrate. Starting from a basic exo/endoglucanase complex, we were able to identify additional functional classes such as mannanase and xylanase for optimal activity on the substrate. Based on these results, we predicted a synthetic cellulosome complex consisting of seven single components (including the scaffoldin protein and a β-glucosidase) and characterized it biochemically. We obtained a highly thermostable complex with optimal activity around 60–65 °C and an optimal pH in agreement with the optimum of the native cellulosome (pH 5.8). Remarkably, a fully synthetic complex containing 47 single cellulosomal components showed comparable activity with a commercially available fungal enzyme cocktail on the softwood pulp substrate.


Our results show that synthetic bacterial multi-enzyme complexes based on the cellulosome of C. thermocellum can be applied as a versatile platform for the quick adaptation and efficient degradation of a substrate of interest.


Cellulose and hemicellulose from plants are the most abundant carbohydrates on earth and a ubiquitous and regenerative resource for the generation of second-generation biofuels. Substrate depolymerization into fermentable sugars is one of the limiting steps within the value chain of the biorefinery process [1, 2]. Due to the recalcitrant nature of this substrate, effective and cost-competitive enzyme mixtures for the hydrolysis of cellulose are highly demanded.

The extracellular multi-enzyme complex of the anaerobic bacterium Clostridium thermocellum is an effective cellulase nano-machinery to hydrolyze crystalline cellulose from plant-derived biomass [3,4,5]. Its effectivity is due to the co-localization of many different enzymatic functions needed to act synergistically on the highly complex matrix of polysaccharides for most efficient breakdown to sugars [6].

For the development of a competitive synthetic cellulase complex, a higher effectiveness of the cellulosomes than existing enzyme cocktails is needed [7]. When the components are separately produced recombinantly, one of the major advantages over fungal enzyme cocktails is the possibility to quickly adapt the composition of synthetic cellulase complexes by selectively adding new enzymatic functions or to change the stoichiometry of components added. Another advantage of the bacterial components from thermophiles is their higher temperature optimum compared to the fungal enzymes, a key feature to increase solubility of substrate and by-products, to increase diffusion rates, and to decrease viscosity. A higher process stability due to reduced microbial contamination risks is a further benefit [8]. However, despite many decades of research in this field, the commercial use of these native enzyme complexes is mainly hampered by the low production yield from anaerobic bacteria [9].

The C. thermocellum cellulosome is characterized by the binding of over 70 catalytic and non-catalytic protein components on a scaffoldin protein CipA [10]. This binding is mediated through a very strong protein–protein interaction between the dockerins located on each cellulosomal component, and one of the nine cohesin modules of CipA. Native and recombinantly produced cellulosomal enzymes have been combined in vitro on a scaffoldin and form complexes stoichiometrically in statistical distribution [7]. There are numerous studies that show the influence of different enzymatic functions on the complex effectivity, such as the presence of auxiliary enzymes [11], enzyme additives [12], enzymatic processivity modes [13], enzymatic diversity and stoichiometry [14,15,16]. However, to the best of our knowledge, synthetic cellulosomal cellulases have so far been unsuccessful in reaching the activity of commercial cellulase blends.

In this study, we show the rapid adaptation of a fully synthetic cellulosome complex on an industrial substrate based on delignified softwood from Kraft pulp process and present a screening strategy to identify enzymatic functions necessary within the cellulosome complex to enhance substrate degradation. To employ this strategy, over 60 cellulosomal proteins from C. thermocellum containing a dockerin module were cloned and successfully expressed, including cellulases, hemicellulases, structural proteins and proteins with unknown function. Mixtures of these enzymes were bound to a recombinant scaffoldin and systematically tested for substrate degradation efficiency.

Our approach underscores the versatility and advantage of a simple and fast adaptation strategy using fully recombinant cellulosome complexes. This strategy reduces the complexity of random combinations and may help to develop cost-effective and efficient bacterial cellulase mixtures in the future.


Strains and media

Clostridium thermocellum (also referred to as Ruminiclostridium thermocellum) strains DSM1313 and mutant strain SM901 (strain devoid of cipA scaffoldin-encoding gene, also referred to as SM1 [17]) were grown at 60 °C in prereduced GS-2 [18] medium for liquid cultures containing 0.5% (w/v) cellobiose, Whatman filter paper (both purchased at Sigma-Aldrich, St. Louis, USA) or softwood pulp. Bleached and delignified Kraft pulp from pine (softwood) was a generous gift from Michael Duetsch from UPM-Kymmene Oy (Finland). Strains Escherichia coli DH10B and DH5α were used for cloning. E. coli strains for protein expression were BL21 Star (DE3) (Invitrogen, Carlsbad, USA), Arctic Express (DE3), BL21 Codon Plus (DE3) RIPL (Agilent Technologies, Santa Clara, USA) and Rosetta-gami B (DE3) (Novagen–Merck, Darmstadt, Germany). Cells were grown in lysogeny broth (LB) containing 100 µg/mL ampicillin for pET21a(+) plasmids and 50 µg/mL kanamycin for pET24(+) plasmids.

DNA cloning

DNA fragments were assembled with Gibson Assembly Master Mix (NEB, Ipswich, USA). QIAprep Spin Miniprep kit and PCR purification kit (Qiagen, Hilden, Germany) were used for the purification of recombinant plasmids and PCR products. DNA sequences encoding recombinant protein constructs were PCR amplified with Phusion DNA polymerase (NEB) and cloned without the predicted N-terminal signal peptides as identified using the SignalP 4.0 server [19]. Oligonucleotides are listed in Additional file 1. The amplicons were digested and ligated in frame into the multiple cloning site of plasmids pET21a(+) or pET24(+). The genes encoding for Cel9-44J (Clo1313_1604), Cel124A (Clo1313_1786), Cel9K (Clo1313_1809), and Cel48S (Clo1313_2747) were optimized in E. coli codon usage by Eurofins (Ebersberg, Germany). The cellulosomal scaffoldin protein CipA8 was synthesized in optimized E. coli codon usage and DNA sequence, including eight cohesins, the carbohydrate-binding module CBM3 and the C-terminal X-module from C. thermocellum WP_020458017.1 lacking Coh6 and dockerin type II [13]. Correct cloning was verified by sequencing (MWG-Eurofins, Ebersberg, Germany).

Protein purification

For protein expression, E. coli cells were grown at 37 °C, room temperature (RT) or lower temperatures in LB medium containing chloramphenicol (34 µg/mL for BL21 Codon Plus), gentamycin (20 µg/mL for Arctic Express) and kanamycin (25 µg/mL for Rosetta-gami B). Heterologous protein expression was induced by the addition of 1 mM isopropyl-β-d-thiogalactopyranoside (IPTG) to an exponentially growing culture. After further growth for 4 h (or overnight incubation with Arctic Express) the cells were harvested by centrifugation at 3440×g for 10 min at 4 °C. Heterologously expressed proteins and the native cellulosome from C. thermocellum were prepared as previously described [13]. Before cell lysis, Roche cOmplete Mini EDTA-free protease inhibitor cocktail tablets (purchased from Sigma-Aldrich) were added. The cells were resuspended in 20 mL lysis buffer (50 mM MOPS, pH 7.3, 100 mM NaCl, 10 mM CaCl2, 20 mM imidazole) with the addition of 10 mg/mL lysozyme (AppliChem, Darmstadt, Germany) and Roche cOmplete Mini EDTA-free protease inhibitor cocktail tablets (purchased from Sigma-Aldrich). After incubation for 30 min on ice, the cells were sonified twice with Sonifier UP 200S (Hielscher, Teltow, Germany) set at amplitude 60%, interval 0.25 for 4 min. After centrifugation (18,000 rpm, 20 min, 4 °C) the supernatant was loaded onto an immobilized metal HisTrap affinity column (IMAC) (GE Healthcare, Munich, Germany) and eluted with 0.5 M imidazole, 50 mM MOPS, pH 7.3, 100 mM NaCl, and 10 mM CaCl2. All enzyme preparations were heat treated for 15 min at 60 °C and precipitates were separated from the supernatant by centrifugation (13,000 rpm for 10 min at RT). The proteins were examined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and stained with Coomassie brilliant blue R-250. Protein concentrations were determined using Pierce BCA protein quantification kit (Thermo Fisher Scientific).

Complex assembly

Synthetic cellulosome complexes were assembled in complex assembly buffer for 1 h at RT with a fixed concentration of the scaffoldin protein CipA8 comprising eight type I cohesins and an amount of single enzymes equimolar with the available cohesins. Following complex stoichiometries were used: 0.87 nmol of the CipA8 corresponds to free cohesin concentrations of 6.75 nmol in a standard complexation reaction of 0.55 mL. For the fully loaded SKL complex, 2.25 nmol of each component Cel48S, Cel9K and Cel5L was mixed (each representing 33.3% stoichiometric binding). A not fully loaded SKL complex is stochastically populated by 75% of the available cohesins with an equimolar mix of Cel48S, Cel9K and Cel5L (1.69 nmol of each). For the SKLM or SKLY complexes, 1.97 nmol of Cel48S, Cel9K and Cel5L (in sum binding 87.5% of all available cohesins) were mixed with 0.84 nmol of Man26A or Xyn10Y (equals 12.5% loading). Pentavalent SKLMY was assembled with 1.69 nmol of the cellulases (Cel48S, Cel9K, Cel5L, each binding 25%) and 0.84 nmol of each additional protein (Man26A, Xyn10Y, each binding 12.5%). Fully recombinant complexes were purified from non-complexed proteins by size-exclusion chromatography on a Superdex 200 10/300 GL column (GE Healthcare, Little Chalfont, UK) equilibrated with a buffer containing 50 mM MOPS, pH 7.3, 0.5 M NaCl, and 20 mM CaCl2. Size-exclusion chromatography was carried out in an ÄKTA Purifier (GE Healthcare, Munich, Germany). The column was developed with the same buffer at a flow rate of 0.5 mL/min. Fractions of 1 mL were collected and concentrated with Vivaspin 500 columns with a cutoff of 50 kDa. Protein concentrations were determined by the BCA method using BSA as the standard. The complexation reaction in 20 µL final volume was visualized on 6% native PAGE as described elsewhere [13]. The influence of the substrate on complex formation was studied as follows: as complexation master mix, 20 µg of CipA8 was bound on 1.25 mg substrate, followed by the addition of 200 µg native enzyme extract (from scaffoldin protein-devoid mutant SM901 [17]). Cellulosomes from unbound cellulosomal components on synthetic CipA8 were assembled as described by Krauss et al. [7].

Substrate binding of CipA8

For binding analysis of a recombinant CipA8 on the insoluble substrate, 20 µg of the scaffoldin protein was mixed with 1.25 mg softwood pulp in 250 µL 0.1 M MOPS buffer (pH 6.5), containing 50 mM NaCl and 20 mM CaCl2. After 5 min of binding at RT, the reaction was spun down and the pellet washed three times with buffer. The reaction mixture was finally mixed with 20 µL of 4× concentrated denaturing protein loading dye and the supernatant was completely loaded on a SDS gel.

Enzymatic assays

Enzymatic reactions using cellulosome complexes were performed under standard reaction conditions at 60 °C in a total volume of 0.5 mL. The reaction buffer contained 0.1 M MOPS, pH 6.5, 50 mM NaCl, and 10 mM CaCl2. Cellic CTec2 (Novozymes A/S, Bagsværd, Denmark; from Sigma-Aldrich, St. Louis, USA) was incubated at 50 °C in 0.1 M MES and 50 mM CaCl2 buffered at pH 5. The activity of synthetic cellulosome complexes was measured on 0.25–0.5% (w/v) microcrystalline cellulose (Avicel, from Sigma-Aldrich) and micronized softwood pulp (UPM-Kymmene, Finland). Softwood was treated with an Ultra Turrax homogenizer (Ika, Staufen, Germany) until the homogeneously micronized substrate could be pipetted using wide-bored tips. To avoid inhibition of the complexed cellulases by cellobiose, β-glucosidase BglT (TT_P0042) from Thermus thermophilus [20] or CglT (Q60026_THEBR) from Thermoanaerobacter brockii [21] was added to a final concentration of 6 µg/mL. The presence of d-glucose in reaction mixtures was determined with the d-glucose HK assay kit (Megazyme, Wicklow, Ireland). Reducing sugars released from the substrates were quantified using the 3,5-dinitrosalicylic acid method [22]. One enzymatic unit liberates 1 µmol of glucose equivalent per minute.

Two-step acid substrate hydrolysis

Substrate analysis with a two-step sulfuric acid hydrolysis was carried out as follows: 100 mg of substrate was hydrolyzed by adding 7 mL of 2% sulfuric acid, incubated for 1 h at 30 °C and mixed by homogenization every 15 min. Then, the mixture was incubated at 121 °C for 1 h. After cooling and centrifugation, the supernatant was stored at 8 °C. The pellet was dried overnight and hydrolyzed by adding 600 µL of 72% sulfuric acid, incubated at 35 °C for 1 h, mixed with 8 mL water, and autoclaved at 121 °C for 1 h. The mixture was centrifuged; the second pellet (acid-insoluble lignin and inorganic constituents) was weighed. The supernatant (approx. 8.6 mL) was mixed with 7 mL from the first hydrolysis and filled to 50 mL final volume. For neutralization of the acidic reaction mixture, calcium carbonate was added until the pH was > 5. The amount of acid-hydrolyzed sugar monomers was determined by the glucose detection kit and DNSA assay. Each hydrolysis was carried out in duplicates.


Clostridium thermocellum was able to grow on Kraft softwood fibers over several days under anaerobic conditions (data not shown). Cellulosomes from C. thermocellum cultures were purified and their ability to degrade softwood fibers was verified by visual inspection and quantification of hydrolytic products (Fig. 1a, b). According to the results from two-step acid hydrolysis, softwood polysaccharides from kraft pulp contain approx. 80% β-d-glucose and 20% other reducing sugars (Fig. 1b). Within 24 h, kraft softwood fibers were completely hydrolysed to soluble sugars by 25 µg native cellulosome complex including β-glucosidase per 1.25 mg substrate (substrate to enzyme ratio of 1:50).

Fig. 1

Preliminary assessment of enzymatic hydrolysis of softwood pulp. a Hydrolysis reaction of native cellulosome preparation on 0.25% (w/v) softwood after 24 h at 60 °C (5, 25 and 125 µg of enzyme per 1.25 mg softwood in 0.5 mL). b Measurement of released glucose (black bar) and reducing sugar ends (as determined with the DNSA assay, gray bars) from the substrate at different enzyme loadings (average values from triplicate measurements). The softwood composition was determined using a two-step protocol using sulfuric acid as hydrolysis agent

In a previous study, a nonavalent synthetic cellulosome complex (nine different single cellulase components on recombinant CipA8 scaffoldin protein) showed half of the activity of the native cellulosome complex from C. thermocellum on microcrystalline cellulose Avicel [13]. The substrate softwood pulp contains about 88% of cellulose and CipA8 was shown to bind to the substrate as efficiently as on Avicel, but an identically composed nonavalent synthetic complex failed to degrade significant amounts of this substrate (data not shown).

Cloning and screening of cellulosomal proteins on softwood pulp

In total, 73 dockerin type I-containing polypeptide sequences were predicted from in silico genome analysis of the C. thermocellum DSM 1313 genome and targeted for cloning and subsequent expression (Table 1). The proteins are either identical (100% sequence identity) or share very high amino acid sequence identity (99%) with the type strain C. thermocellum ATCC 27405 cellulosome components, of which many individual components are fully characterized. Furthermore, 16 dockerin-containing proteins were predicted that cannot directly be linked to carbohydrate hydrolysis (serpin, protease), contain only glycoside hydrolase (GH)-associated modules (fibronectin), have no predictable function at all (unknown modules) or are very small polypeptides only encoding dockerin type I modules (MW ≤ 15 kDa). We expressed 60 of the 73 polypeptides and purified them in soluble form, whereas 57 could be obtained as full-length protein only (see Additional file 2 summarizing all purified proteins). To analyze the impact of additional functions on complex activity, a stoichiometrically not fully loaded, minimized three component synthetic complex [SKL: Cel48S (Clo1313_2747), Cel9K (Clo1313_1809) and Cel5L (Clo1313_1816)] was mixed with 38 different single proteins and screened for higher substrate conversion efficiency (see “Methods” section for exact stoichiometries). After 2 days of incubation at 60 °C mainly (endo-)xylanase (Xyn11A: Clo1313_0521; Xyn10D: Clo1313_0177; Xyn10Z: Clo1313_2635) and β-mannanase functions (Man26A: Clo1313_2202; Man5A: Clo1313_1398; Cel5-26H: Clo1313_2234) were found to stimulate the complex activity on this substrate. The involved glycoside hydrolase families were GH10, GH11, GH5, and GH26, respectively (Fig. 2a).

Table 1 All cellulosomal proteins containing a dockerin type I module
Fig. 2

Screening of recombinantly expressed cellulosomal proteins on softwood. A minimized SKL complex was incubated with single-enzyme supplementations on 0.25% softwood, and soluble reducing sugars were measured after 2 days at 60 °C. a 38 proteins were supplemented to the SKL complex in identical molar stoichiometry and tested in duplicates. Activities are shown as heat map representations of reducing sugars released and quantified by DNS assay. Relative activity is depicted as follows: 100% relative activity equals black depiction, 0% no color. b A minimized SKL complex (relative activity = 100%) was incubated with single-enzyme combinations of Man26A and Xyn10Y. Soluble reducing sugars released from 0.25% (w/v) Avicel (dark gray) and softwood (light gray) were measured after 2 days at 60 °C. Each supplementation was added to the complex in identical stoichiometry (one supplement per cohesin). Data are shown as average values from at least duplicate (n = 2) measurements

We raised the question, if an adapted complex would benefit from the addition of two additional functions, if these functions were synergistic, and if the selection of these additions would depend on the substrate composition. Based on the screening results (Fig. 2a) and the availability of recombinant cellulosomal proteins (Table 1), a set of component combinations was designed incorporating each enzymatic function necessary for the degradation of softwood: a reducing end and a non-reducing end exo-acting cellobiohydrolases (Cel48S and Cel9K), one cellobiose-releasing processive endoglucanase (Cel5L), whereby each cellulase is added in equimolar ratios and optionally the supplemental functions consisting of a mannanase function (Man26A, stochastically one cohesin per complex, equally to 12.5%) and a multi-modular xylanase (Xyn10Y, 12.5%), respectively (see “Methods” section for exact stoichiometric loading of each component). As a result, the presence of both enzymatic functions, the mannan-degrading (e.g. Man26A) and the xylanolytic (Xyn10Y) function (resulting in the SKLMY complex, Fig. 2b) led to a 3.5-fold increase in activity relative to the minimized SKL cellulase complex on 0.25% (w/v) of the substrate softwood; the addition of the multifunctional xylanase Xyn10Y alone had the highest impact (3.1× activity), followed by Man26A (2.1× activity). On microcrystalline cellulose the effect was far less pronounced, with only 65% higher activity of the SKLMY complex (Fig. 2b).

Complex assembly and biochemical characterization of the optimized SKLMY complex

The dockerin-containing single SKLMY components (Cel48S, Cel9K, Cel5L, Man26A, Xyn10Y) were assembled with the scaffoldin protein CipA8 via dockerin–cohesin interaction (schematic representation in Fig. 3a). Upon mixing the single enzymes in desired stoichiometric ratios (Cel48S: 25%, Cel9K: 25%, Cel5L: 25%, Man26A: 12.5%, Xyn10Y: 12.5%), binding occurs in a random fashion on the eight available cohesins of the CipA8 molecule. This approach is different to designer cellulosomes, where the order of the components is controlled by selectively binding each component to the corresponding binding module at a fixed position on the scaffoldin molecule [68, 69]. The approach of randomly assembling protein mixtures has been successfully applied for testing native and recombinant cellulosomal components from C. thermocellum [7], [13] and is assayed by complex purification using size-exclusion chromatography (see “Methods” section) and electrophoretic mobility shift analysis (EMSA) of the complexes visualized by native PAGE (Fig. 3c, d). After complexation, the five single enzymes and the scaffoldin protein were up-shifted indicating assemblage of the single components into a higher molecular weight protein complex.

Fig. 3

Assembly process of the SKLMY complex. a Schematic representation of the recombinant cellulosomal components Cel48S, Cel9K, Cel5L, Man26A and Xyn10Y, containing dockerin type 1-binding modules. The scaffoldin protein CipA8 comprises eight cohesin type I modules, enabling stoichiometric binding of eight dockerin-containing components via specific protein–protein interaction. b The assembly of the single components results in random combinations of macromolecular complexes, termed SKLMY. The order of components bound is arbitrary. c SDS-PAGE control of the assembly. CipA8 (3.8 µg in lane 1) and eight-time molar excess of single and unbound SKLMY components (15.3 µg loaded in lane 2) is mixed for the complex assembly reaction (19.1 µg in lane 3). d Native PAGE of single CipA8 (lane 1), unbound components (lane 2) and electrophoretic mobility up-shift upon SKLMY complex formation (lane 3)

The established pentavalent complex termed SKLMY was further characterized for enzyme kinetics and biochemical properties. All single enzymatic components (Cel48S, Cel9K, Cel5L, Man26A, Xyn10Y) were able to bind to all cohesin modules of CipA8. The synthetic SKLMY complex showed temperature and pH optima between 60 and 65 °C at around pH 5.8, whereas at higher temperature (70 °C) it was completely inactivated (Fig. 4a). The pentavalent synthetic complex displayed a high thermal stability over 2 days, retaining approximately 60–70% of its initial activity even at its temperature optimum. Noteworthy, inactivation of the complex strongly depended on the incubation time rather than the temperatures applied (Fig. 4b). The influence of common by-products such as purification agents (imidazole, ammonium sulfate) and cryo-preservatives (glycerol) was studied (Additional file 3). Imidazole is the most potent inhibitor of the recombinant cellulosome complex, which at concentrations as low as 5–10 mM resulted in a significant activity reduction (data not shown). Glycerol and ammonium sulfate above 10% saturation (w/v) (used for protein complex precipitation and purification) were also shown to be important inhibitors of hydrolysis that resulted in reduced activity (reduction by 25–50%). Sucrose used as another cryo-preservative showed comparable inhibition results (data not shown). The presence of bivalent metal ions (1 mM CoCl2, 1 mM MnCl2, 10 mM MgSO4) did not result in significant changes of the recombinant complex activity.

Fig. 4

Biochemical properties of the optimized SKLMY enzyme complex on softwood pulp. a pH optima at three different temperatures around the temperature optimum of 60–65 °C after 36 h of incubation. b Thermo-inactivation kinetics of the SKLMY complex during incubation at different temperatures. 10 µg of complex was incubated on 0.25% (w/v) softwood and the concentration of liberated glucose was measured

The enzyme efficiency of the established pentavalent SKLMY complex was assessed and compared with the fungal enzyme mixture Cellic CTec2 (Fig. 5). This complex supplemented with 47 additional recombinant enzymes (see Additional file 4 for the exact protein composition) or SM901 native free cellulosomal enzyme mixture showed comparable results with the existing commercial preparation (0.25% w/v softwood). The SKLMY complex showed approx. 30% of the activity of the native cellulosome on softwood, whereas supplementation of additional enzymes to the complex resulted in 50–60% of the overall activity of the native cellulosome.

Fig. 5

Comparison of commercial fungal cellulase with SKLMY complex on 0.25% (w/v) softwood pulp as substrate. The soluble fraction of reducing sugar ends was quantified after 2 days of incubation at optimal reaction conditions (60 °C for cellulosomal native and synthetic complexes at pH 5.8, and 50 °C at pH 5.0 for fungal enzyme preparation, respectively). The SKLMY complex was mixed with varying amounts (% of all cohesin-binding positions on CipA8) of native components (SM901 mutant extract) or an equimolar ratio of recombinant enzymes (n = 47, Additional file 4 for enzymatic composition). Enzyme loadings were as follows: Cellic CTec2, 7.6 µg per reaction; synthetic cellulosome complexes (each 1 µg); non-complexed enzyme control (− CipA: 13.2 µg); cellulosome complexes contain 3 µg of β-glucosidase as additive in the reaction mixture. Substrate loading was 1.25 mg per reaction. Bars represent average values ± standard deviation from three independent enzyme reactions


Enzymatic degradation of cellulosic biomass is one of the most cost-intensive key reactions in the biomass-to-liquid process. Consequently, there is a huge demand in further optimization of cellulases. Advantageous properties are amongst other factors: (a) higher process temperature during hydrolysis reaction to avoid the contamination risk, (b) re-use of enzymes by resistance to thermally or chemically induced denaturation, (c) an enhanced hydrolytic efficiency through higher enzymatic activity, (d) reduced inhibition by high concentrations of oligo- or monosaccharides, (e) high yields of cellulases in their production and (f) adaptability of enzyme composition within the mixture depending on the substrate [8]. Fungal enzyme cocktails are regularly used today, as they can be economically produced in large amounts. They are still optimized further, for instance, by including accessory enzymes such as lytic polysaccharide monooxygenase (LPMOs). Advances have been made to optimize specific biochemical properties and by selecting special features when screening recombinant proteins, either by applying site-directed or random mutagenesis, exchange/deletion of peptide signatures and re-arrangement of functional modules, or by directed protein engineering and modeling [55, 56].

Although many sophisticated molecular biological tools are available for the engineering of fungal proteins [57], one major drawback in development is the labor-intensive adaptation of the enzyme composition when optimizing for different substrates (depending on the polysaccharide composition, pre-treatment conditions, amongst others). As another implication, each alteration of the cellulase system may again interfere with complementary/synergistic enzymatic functions of the whole enzyme mixture. Thereby the complexity of the polysaccharide matrices dictates the complexity of the cellulase formulation and accessory enzymes.

Saccharolytic clostridia are known to produce a battery of extracellular glycoside hydrolases to degrade a diversity of polysaccharides [58]. Hemicelluloses are depolymerized by a diverse set of xylan-, mannan-, arabinogalactan-, xyloglucan-, and pectin-degrading enzymes. Side chains and oligosaccharides can be further depolymerized by the action of arabinofuranosidases and β-xylosidases. The native cellulosomal enzyme complex from C. thermocellum, containing all of these functions in addition to numerous cellulases, is regarded as one of the most efficient cellulose-degrading systems known to date. Since the first description of this supra-molecular complex in 1983 [3], it became clear that its industrial use is mainly hampered by low production yield from the anaerobic C. thermocellum cultures, and the immense complexity of over 70 known components.

The adaptation of the cellulosome enzymatic composition strongly depends on the nature of the substrate to be degraded, as was shown by transcriptomic and proteomic data [10, 14, 59, 60]. Consequently, we reasoned that a fully synthetic cellulosome complex would allow for the fast adaptation on a substrate while reducing the number of enzymatic components when single enzymes were expressed and added separately. Further advantages are the higher temperature stability of the cellulosomal proteins with optimal incubation temperatures around 60–65 °C compared to 50 °C of most fungal enzymes.

By attempting to express and purify all genetically encoded cellulosomal components of C. thermocellum, 57 of all 73 dockerin-containing components (78%) could be obtained in full length and soluble form. The reasons for the failure to obtain 16 of the components recombinantly are manifold and maybe connected to the limitations of heterologous protein expression encoded by the genomic inserts [61,62,63]. In some cases, enhanced expression levels were only observed with special E. coli expression systems that function via co-expression of cold-adapted chaperonins using Arctic Express (for Clo1313_2747 and Clo1313_2859) or via overcoming tRNA pool depletion by the co-expression of genes for rare tRNAs using BL21 Codon Plus (for Clo1313_2858 and Clo1313_0685) or Rosetta-gami B (for Clo1313_2795 and Clo1313_0501). Three proteins (Clo1313_0177, Clo1313_0420) and Clo1313_2861) could only be obtained in truncated or mixed forms, most likely due to proteolytic cleavage at flexible and exposed linkers between two protein modules.

Starting from a stoichiometrically not fully loaded and minimized cellulase complex (SKL), we supplemented single enzymes to study their effect on the complex activity. Interestingly, we were able to identify seven single cellulosomal enzymes belonging to two major functional groups which we regard as the core enzymatic requirements:

The activity-boosting enzymes (Man5A, Man26A, Man26B, Cel5-26H) are known for activity on mannans (β-mannanase, exo-β-1,4-mannobiohydrolase; β-1,3-xylanase (EC; lichenase/endo-β-1,3-1,4-glucanase; mannobiose-producing exo-β-mannanase). Especially galactomannans are highly abundant in softwoods (20-25% of the dry mass), whereby its backbone consists of β-(1,4)-linked β-d-glucopyranose and β-d-mannopyranose residues. Further acetyl groups and α-(1,6)-d-galactopyranose are present as partial substituents [64]. As a second enhancing group, xylanases of the families GH10 and GH11 (Xyn11A, Xyn10C, Xyn10D, Xyn10Y, Xyn10Z) and the xylanolytic enzyme Xyn141E with high sequence similarity to the recently described family GH141 [65] could be identified. These enzymes hydrolyze xylans, highly abundant and variable hemicelluloses in nature that share a backbone of β-(1,4)-d-xylose units with diverse substitutions.

Due to these results, we reasoned that these two general accessory functions, mannanolytic and xylanolytic activities, are needed to boost the activity of the basal cellulase complex for this specific substrate. Verification of this assumption was possible by assembling in vitro combinations of complex compositions. By incorporating only two additional enzymes, Xyn10Y and Man26A, an approximately threefold higher enzymatic complex activity could be obtained. We suggest that the pentavalent SKLMY complex must contain a minimum number of five single components (with at least five distinct functions, respectively) bound on the carrier protein CipA8 to effectively hydrolyze softwood pulp as the substrate: one processive endoglucanase producing cellobiose as main hydrolysis product (Cel5L), two cellobiohydrolases (CBHs, Cel48S and Cel9K) with specificity from the reducing and non-reducing end of the polysaccharide chain, respectively, a mannanase of the GH26 family (Man26A, [66]) and a xylan-specific multifunctional feruloyl-esterase containing xylanase (Xyn10Y [67]). In addition, β-glucosidase was added to relieve the inhibitory effect of cellobiose on the CBH enzymes. Other studies also focused on the incorporation of single xylanolytic functions into designer cellulosomes for higher enzymatic efficiencies, as successfully shown for xylanases of Thermobifida fusca on wheat straw [68, 69]. Noteworthy, the complex optimization is not finished at this stage of complex development. Another initial complex combination than SKL may result in additional synergies that might have been missed in our approach. By establishing the synthetic complex containing almost all residual recombinant enzymes, a significant boost in enzyme activity was observed in our study. This in turn may be due to hidden enzyme synergies that we could not yet uncover. Due to the indefinite number of possible enzyme and stoichiometric combinations, more advanced, automated and high-throughput screening approaches will have to be applied.

Our minimalized but fully synthetic enzyme mix achieved almost 60% of the activity of the commercially available fungal cellulase blend Cellic CTec2 on softwood pulp as the substrate. Further addition of over 40 fully synthetic components of known and unknown functions, or alternatively the native protein mixture SM901 purified from a CipA-deficient mutant (from mutant SM1, [17]) led to enzymatic efficiencies comparable to a commercial fungal cellulase preparation. An identical enzyme complex composition did not result in comparable hydrolytic efficiency when tested on other cellulosic substrates (e.g. Avicel), as the SKLMY complex was optimized for softwood pulp degradation. Depending on the substrate constituents, further complex optimization will be necessary as important enzymatic functions may still be missing. To the very best of our knowledge, the superior hydrolytic efficiency of the cellulosome on more complex substrates and under process-relevant conditions has still not been proven so far, but should be reachable in the near future by engineering synthetic cellulosome analogs or designer cellulosomes.

An important aspect that could not be addressed within this study was the role of the stoichiometric loading of diverse enzymatic functions and ratios between components within the cellulosomal multi-enzyme complex. Numerous studies tried to answer this question by employing transcriptomic and proteomic analyses to understand the complex adaptation on different substrates. However, recent results indicate that the enzymatic complexity of the cellulosome is a key feature for its high hydrolytic efficiency on cellulosic substrates [13, 14, 16]. Furthermore, this principle seems to hold true also for other cellulosomal multi-enzyme systems from other cellulolytic bacteria such as Acetivibrio cellulolyticus and Ruminococcus flavefaciens [70, 71]. As a consequence, a high-throughput screening strategy is needed to understand the interplay between single enzymatic activities and synergies between the functional groups and proximity of single components, and to build up computational models. This knowledge may help to predict and adapt fully synthetic complexes to virtually any kind of polysaccharide from plant-derived biomass, in dependence of the substrate composition requirements.


Inspired by the supra-modular extracellular cellulase complex from C. thermocellum, we designed fully synthetic cellulosome complexes for enhanced degradation of softwood pulp as cellulose-based substrate. To this end, we expressed and purified 60 single enzymatic components to systematically study the core enzymatic modalities needed to hydrolyze softwood pulp. Two major function classes, xylanase and mannanase enzymes, were incorporated into a pentavalent recombinant cellulase complex that was characterized biochemically. In direct comparison, the enzymatic efficiency of a fully synthetic cellulosome is, even without stoichiometric optimization, comparable with the commercial fungal enzyme cocktail Cellic CTec2. This study underscores the prospect to use synthetic cellulosome complexes for a fast and versatile adaptation of single enzymatic functions to achieve high activity on cellulosic substrates.





carbohydrate-binding module


carbohydrate esterase


CotH spore coat protein kinase module


electrophoretic mobility shift assay


fibronectin module


glycoside hydrolase


glycoside hydrolase-associated immunoglobulin module




lytic polysaccharide monooxygenase


leucin-rich repeat


lamin tail domain


polysaccharide lyase


regulator of chromosome condensation


room temperature


unknown module or module with unknown function


  1. 1.

    Chandel AK, Chandrasekhar G, Silva MB, Silvério da Silva S. The realm of cellulases in biorefinery development. Crit Rev Biotechnol. 2012;32:187–202.

    Article  CAS  PubMed  Google Scholar 

  2. 2.

    Arantes V, Saddler JN. Access to cellulose limits the efficiency of enzymatic hydrolysis: the role of amorphogenesis. Biotechnol Biofuels. 2010;3:4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. 3.

    Bayer EA, Kenig R, Lamed R. Adherence of Clostridium thermocellum to cellulose. J Bacteriol. 1983;156:818–27.

    CAS  PubMed  PubMed Central  Google Scholar 

  4. 4.

    Bayer EA, Belaich J-P, Shoham Y, Lamed R. The cellulosomes: multienzyme machines for degradation of plant cell wall polysaccharides. Annu Rev Microbiol. 2004;58:521–54.

    Article  CAS  Google Scholar 

  5. 5.

    Zverlov VV, Schwarz WH. Bacterial cellulose hydrolysis in anaerobic environmental subsystems—Clostridium thermocellum and Clostridium stercorarium, thermophilic plant-fiber degraders. Ann NY Acad Sci. 2008;1125:298–307.

    Article  CAS  PubMed  Google Scholar 

  6. 6.

    Fierobe H-P, Bayer EA, Tardif C, Czjzek M, Mechaly A, Bélaïch A, et al. Degradation of cellulose substrates by cellulosome chimeras. Substrate targeting versus proximity of enzyme components. J Biol Chem. 2002;277:49621–30.

    Article  CAS  Google Scholar 

  7. 7.

    Krauss J, Zverlov VV, Schwarz WH. In vitro reconstitution of the complete Clostridium thermocellum cellulosome and synergistic activity on crystalline cellulose. Appl Environ Microbiol. 2012;78:4301–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. 8.

    Yeoman CJ, Han Y, Dodd D, Schroeder CM, Mackie RI, Cann IKO. Thermostable enzymes as biocatalysts in the biofuel industry. New York: Elsevier; 2010. p. 1–55.

    Google Scholar 

  9. 9.

    Lynd LR, Weimer PJ, van Zyl WH, Pretorius IS. Microbial cellulose utilization: fundamentals and biotechnology. Microbiol Mol Biol Rev. 2002;66:506–77.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Zverlov VV, Kellermann J, Schwarz WH. Functional subgenomics of Clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. Proteomics. 2005;5:3646–53.

    Article  CAS  PubMed  Google Scholar 

  11. 11.

    Arfi Y, Shamshoum M, Rogachev I, Peleg Y, Bayer EA. Integration of bacterial lytic polysaccharide monooxygenases into designer cellulosomes promotes enhanced cellulose degradation. Proc Natl Acad Sci USA. 2014;111:9109–14.

    Article  CAS  PubMed  Google Scholar 

  12. 12.

    Gefen G, Anbar M, Morag E, Lamed R, Bayer EA. Enhanced cellulose degradation by targeted integration of a cohesin-fused β-glucosidase into the Clostridium thermocellum cellulosome. Proc Natl Acad Sci USA. 2012;109:10298–303.

    Article  PubMed  Google Scholar 

  13. 13.

    Leis B, Held C, Bergkemper F, Dennemarck K, Steinbauer R, Reiter A, et al. Comparative characterization of all cellulosomal cellulases from Clostridium thermocellum reveals high diversity in endoglucanase product formation essential for complex activity. Biotechnol Biofuels. 2017;10:240.

    Article  PubMed  PubMed Central  Google Scholar 

  14. 14.

    Xu Q, Resch MG, Podkaminer K, Yang S, Baker JO, Donohoe BS, et al. Dramatic performance of Clostridium thermocellum explained by its wide range of cellulase modalities. Sci Adv. 2016;2:e1501254.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Hirano K, Nihei S, Hasegawa H, Haruki M, Hirano N. Stoichiometric assembly of the cellulosome generates maximum synergy for the degradation of crystalline cellulose, as revealed by in vitro reconstitution of the Clostridium thermocellum cellulosome. Appl Environ Microbiol. 2015;81:4756–66.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Hirano K, Kurosaki M, Nihei S, Hasegawa H, Shinoda S, Haruki M, Hirano N. Enzymatic diversity of the Clostridium thermocellum cellulosome is crucial for the degradation of crystalline cellulose and plant biomass. Sci Rep. 2016;6:35709.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. 17.

    Zverlov VV, Klupp M, Krauss J, Schwarz WH. Mutations in the scaffoldin gene, cipA, of Clostridium thermocellum with impaired cellulosome formation and cellulose hydrolysis: insertions of a new transposable element, IS1447, and implications for cellulase synergism on crystalline cellulose. J Bacteriol. 2008;190:4321–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. 18.

    Johnson EA, Sakajoh M, Halliwell G, Madia A, Demain AL. Saccharification of complex cellulosic substrates by the cellulase system from Clostridium thermocellum. Appl Environ Microbiol. 1982;43:1125–32.

    CAS  PubMed  PubMed Central  Google Scholar 

  19. 19.

    Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8:785–6.

    Article  CAS  Google Scholar 

  20. 20.

    Ohta T, Tokishita S-I, Imazuka R, Mori I, Okamura J, Yamagata H. beta-Glucosidase as a reporter for the gene expression studies in Thermus thermophilus and constitutive expression of DNA repair genes. Mutagenesis. 2006;21:255–60.

    Article  CAS  PubMed  Google Scholar 

  21. 21.

    Breves R, Bronnenmeier K, Wild N, Lottspeich F, Staudenbauer WL, Hofemeister J. Genes encoding two different beta-glucosidases of Thermoanaerobacter brockii are clustered in a common operon. Appl Environ Microbiol. 1997;63:3902–10.

    CAS  PubMed  PubMed Central  Google Scholar 

  22. 22.

    Wood TM, Bhat KM. Methods for measuring cellulase activities. In: Biomass part A: cellulose and hemicellulose. New York: Elsevier; 1988. p. 87–112.

  23. 23.

    Zverlov VV, Velikodvorskaya GA, Schwarz WH. Two new cellulosome components encoded downstream of celI in the genome of Clostridium thermocellum: the non-processive endoglucanase CelN and the possibly structural protein CseP. Microbiology (Reading, Engl). 2003;149:515–24.

    Article  CAS  Google Scholar 

  24. 24.

    Kang S, Barak Y, Lamed R, Bayer EA, Morrison M. The functional repertoire of prokaryote cellulosomes includes the serpin superfamily of serine proteinase inhibitors. Mol Microbiol. 2006;60:1344–54.

    Article  CAS  PubMed  Google Scholar 

  25. 25.

    Zverlov VV, Fuchs KP, Schwarz WH, Velikodvorskaya GA. Purification and cellulosomal localization of Clostridium thermocellum mixed linkage?-glucanase LicB (1,3-1,4-β-d-glucanase). Biotechnol Lett. 1994;16:29–34.

    Article  CAS  Google Scholar 

  26. 26.

    Schwarz WH, Gräbnitz F, Staudenbauer WL. Properties of a Clostridium thermocellum endoglucanase produced in Escherichia coli. Appl Environ Microbiol. 1986;51:1293–9.

    CAS  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Zverlov VV, Fuchs K-P, Schwarz WH. Chi18A, the endochitinase in the cellulosome of the thermophilic, cellulolytic bacterium Clostridium thermocellum. Appl Environ Microbiol. 2002;68:3176–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. 28.

    Zverlov VV, Velikodvorskaya GA, Schwarz WH, Kellermann J, Staudenbauer WL. Duplicated Clostridium thermocellum cellobiohydrolase gene encoding cellulosomal subunits S3 and S5. Appl Microbiol Biotechnol. 1999;51:852–9.

    Article  CAS  PubMed  Google Scholar 

  29. 29.

    Zverlov VV, Velikodvorskaya GV, Schwarz WH, Bronnenmeier K, Kellermann J, Staudenbauer WL. Multidomain structure and cellulosomal localization of the Clostridium thermocellum cellobiohydrolase CbhA. J Bacteriol. 1998;180:3091–9.

    CAS  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Brás JLA, Cartmell A, Carvalho ALM, Verzé G, Bayer EA, Vazana Y, et al. Structural insights into a unique cellulase fold and mechanism of cellulose hydrolysis. Proc Natl Acad Sci USA. 2011;108:5237–42.

    Article  PubMed  Google Scholar 

  31. 31.

    Grépinet O, Béguin P. Sequence of the cellulase gene of Clostridium thermocellum coding for endoglucanase B. Nucleic Acids Res. 1986;14:1791–9.

    Article  PubMed  PubMed Central  Google Scholar 

  32. 32.

    Navarro A, Chebrou MC, Béguin P, Aubert JP. Nucleotide sequence of the cellulase gene celF of Clostridium thermocellum. Res Microbiol. 1991;142:927–36.

    Article  CAS  PubMed  Google Scholar 

  33. 33.

    Zverlov VV, Schantz N, Schwarz WH. A major new component in the cellulosome of Clostridium thermocellum is a processive endo-beta-1,4-glucanase producing cellotetraose. FEMS Microbiol Lett. 2005;249:353–8.

    Article  CAS  PubMed  Google Scholar 

  34. 34.

    Ahsan MM, Kimura T, Karita S, Sakka K, Ohmiya K. Cloning, DNA sequencing, and expression of the gene encoding Clostridium thermocellum cellulase CelJ, the largest catalytic component of the cellulosome. J Bacteriol. 1996;178:5732–40.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. 35.

    Arai T, Ohara H, Karita S, Kimura T, Sakka K, Ohmiya K. Sequence of celQ and properties of celQ, a component of the Clostridium thermocellum cellulosome. Appl Microbiol Biotechnol. 2001;57:660–6.

    Article  CAS  PubMed  Google Scholar 

  36. 36.

    Ichinose H, Kuno A, Kotake T, Yoshida M, Sakka K, Hirabayashi J, et al. Characterization of an exo-beta-1,3-galactanase from Clostridium thermocellum. Appl Environ Microbiol. 2006;72:3515–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. 37.

    Hazlewood GP, Davidson K, Clarke JH, Durrant AJ, Hall J, Gilbert HJ. Endoglucanase E, produced at high level in Escherichia coli as a lacZ’ fusion protein, is part of the Clostridium thermocellum cellulosome. Enzyme Microb Technol. 1990;12:656–62.

    Article  CAS  PubMed  Google Scholar 

  38. 38.

    Hall J, Hazlewood GP, Barker PJ, Gilbert HJ. Conserved reiterated domains in Clostridium thermocellum endoglucanases are not essential for catalytic activity. Gene. 1988;69:29–38.

    Article  CAS  PubMed  Google Scholar 

  39. 39.

    Correia MAS, Prates JAM, Brás J, Fontes CMGA, Newman JA, Lewis RJ, et al. Crystal structure of a cellulosomal family 3 carbohydrate esterase from Clostridium thermocellum provides insights into the mechanism of substrate recognition. J Mol Biol. 2008;379:64–72.

    Article  CAS  PubMed  Google Scholar 

  40. 40.

    Mizutani K, Fernandes VO, Karita S, Luís AS, Sakka M, Kimura T, et al. Influence of a mannan binding family 32 carbohydrate binding module on the activity of the appended mannanase. Appl Environ Microbiol. 2012;78:4781–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. 41.

    Joliff G, Béguin P, Aubert JP. Nucleotide sequence of the cellulase gene celD encoding endoglucanase D of Clostridium thermocellum. Nucleic Acids Res. 1986;14:8605–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. 42.

    Fontes CM, Hazlewood GP, Morag E, Hall J, Hirst BH, Gilbert HJ. Evidence for a general role for non-catalytic thermostabilizing domains in xylanases from thermophilic bacteria. Biochem J. 1995;307:151–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. 43.

    Zverlov VV, Schantz N, Schmitt-Kopplin P, Schwarz WH. Two new major subunits in the cellulosome of Clostridium thermocellum: xyloglucanase Xgh74A and endoxylanase Xyn10D. Microbiology (Reading, Engl). 2005;151:3395–401.

    Article  CAS  Google Scholar 

  44. 44.

    Yagüe E, Béguin P, Aubert JP. Nucleotide sequence and deletion analysis of the cellulase-encoding gene celH of Clostridium thermocellum. Gene. 1990;89:61–7.

    Article  PubMed  Google Scholar 

  45. 45.

    Hayashi H, Takagi KI, Fukumura M, Kimura T, Karita S, Sakka K, Ohmiya K. Sequence of xynC and properties of XynC, a major component of the Clostridium thermocellum cellulosome. J Bacteriol. 1997;179:4246–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. 46.

    Grépinet O, Chebrou MC, Béguin P. Nucleotide sequence and deletion analysis of the xylanase gene (xynZ) of Clostridium thermocellum. J Bacteriol. 1988;170:4582–8.

    Article  PubMed  PubMed Central  Google Scholar 

  47. 47.

    Kruus K, Wang WK, Ching J, Wu JH. Exoglucanase activities of the recombinant Clostridium thermocellum CelS, a major cellulosome component. J Bacteriol. 1995;177:1641–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. 48.

    Zverlov VV, Velikodvorskaya GA, Schwarz WH. A newly described cellulosomal cellobiohydrolase, CelO, from Clostridium thermocellum: investigation of the exo-mode of hydrolysis, and binding capacity to crystalline cellulose. Microbiology (Reading, Engl). 2002;148:247–55.

    Article  CAS  Google Scholar 

  49. 49.

    Montanier CY, Correia MAS, Flint JE, Zhu Y, Baslé A, McKee LS, et al. A novel, noncatalytic carbohydrate-binding module displays specificity for galactose-containing polysaccharides through calcium-mediated oligomerization. J Biol Chem. 2011;286:22499–509.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Kurokawa J, Hemjinda E, Arai T, Kimura T, Sakka K, Ohmiya K. Clostridium thermocellum cellulase CelT, a family 9 endoglucanase without an Ig-like domain or family 3c carbohydrate-binding module. Appl Microbiol Biotechnol. 2002;59:455–61.

    Article  CAS  PubMed  Google Scholar 

  51. 51.

    Lemaire M, Béguin P. Nucleotide sequence of the celG gene of Clostridium thermocellum and characterization of its product, endoglucanase CelG. J Bacteriol. 1993;175:3353–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. 52.

    Hayashi H, Takehara M, Hattori T, Kimura T, Karita S, Sakka K, Ohmiya K. Nucleotide sequences of two contiguous and highly homologous xylanase genes xynA and xynB and characterization of XynA from Clostridium thermocellum. Appl Microbiol Biotechnol. 1999;51:348–57.

    Article  CAS  PubMed  Google Scholar 

  53. 53.

    Schwarz WH, Zverlov VV. Protease inhibitors in bacteria: an emerging concept for the regulation of bacterial protein complexes? Mol Microbiol. 2006;60:1323–6.

    Article  CAS  PubMed  Google Scholar 

  54. 54.

    Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42:D490–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. 55.

    Schülein M. Protein engineering of cellulases. Biochim Biophys Acta. 2000;1543:239–52.

    Article  PubMed  Google Scholar 

  56. 56.

    Bommarius AS, Sohn M, Kang Y, Lee JH, Realff MJ. Protein engineering of cellulases. Curr Opin Biotechnol. 2014;29:139–45.

    Article  CAS  PubMed  Google Scholar 

  57. 57.

    Druzhinina IS, Kubicek CP. Genetic engineering of Trichoderma reesei cellulases and their production. Microb Biotechnol. 2017;10:1485–99.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. 58.

    Schwarz WH, Zverlov VV, Bahl H. Extracellular glycosyl hydrolases from clostridia. New York: Elsevier; 2004. p. 215–61.

    Google Scholar 

  59. 59.

    Yoav S, Barak Y, Shamshoum M, Borovok I, Lamed R, Dassa B, et al. How does cellulosome composition influence deconstruction of lignocellulosic substrates in Clostridium (Ruminiclostridium) thermocellum DSM 1313? Biotechnol Biofuels. 2017;10:222.

    Article  PubMed  PubMed Central  Google Scholar 

  60. 60.

    Raman B, Pan C, Hurst GB, Rodriguez M, McKeown CK, Lankford PK, et al. Impact of pretreated Switchgrass and biomass carbohydrates on Clostridium thermocellum ATCC 27405 cellulosome composition: a quantitative proteomic analysis. PLoS ONE. 2009;4:e5271.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. 61.

    Leis B, Angelov A, Liebl W. Screening and expression of genes from metagenomes. Adv Appl Microbiol. 2013;83:1–68.

    Article  CAS  PubMed  Google Scholar 

  62. 62.

    Kimelman A, Levy A, Sberro H, Kidron S, Leavitt A, Amitai G, et al. A vast collection of microbial genes that are toxic to bacteria. Genome Res. 2012;22:802–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. 63.

    Sørensen HP, Mortensen KK. Advanced genetic strategies for recombinant protein expression in Escherichia coli. J Biotechnol. 2005;115:113–28.

    Article  CAS  PubMed  Google Scholar 

  64. 64.

    Moreira LRS, Filho EXF. An overview of mannan structure and mannan-degrading enzyme systems. Appl Microbiol Biotechnol. 2008;79:165–78.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  65. 65.

    Heinze S, Mechelke M, Kornberger P, Liebl W, Schwarz WH, Zverlov VV. Identification of endoxylanase XynE from Clostridium thermocellum as the first xylanase of glycoside hydrolase family GH141. Sci Rep. 2017;7:11178.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. 66.

    Halstead JR, Vercoe PE, Gilbert HJ, Davidson K, Hazlewood GP. A family 26 mannanase produced by Clostridium thermocellum as a component of the cellulosome contains a domain which is conserved in mannanases from anaerobic fungi. Microbiology (Reading, Engl). 1999;145(Pt 11):3101–8.

    Article  CAS  Google Scholar 

  67. 67.

    Blum DL, Kataeva IA, Li X-L, Ljungdahl LG. Feruloyl esterase activity of the Clostridium thermocellum cellulosome can be attributed to previously unknown domains of XynY and XynZ. J Bacteriol. 2000;182:1346–51.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. 68.

    Moraïs S, Barak Y, Caspi J, Hadar Y, Lamed R, Shoham Y, et al. Contribution of a xylan-binding module to the degradation of a complex cellulosic substrate by designer cellulosomes. Appl Environ Microbiol. 2010;76:3787–96.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. 69.

    Moraïs S, Barak Y, Hadar Y, Wilson DB, Shoham Y, Lamed R, Bayer EA. Assembly of xylanases into designer cellulosomes promotes efficient hydrolysis of the xylan component of a natural recalcitrant cellulosic substrate. MBio. 2011.

    Article  PubMed  PubMed Central  Google Scholar 

  70. 70.

    Venditto I, Luis AS, Rydahl M, Schückel J, Fernandes VO, Vidal-Melgosa S, et al. Complexity of the Ruminococcus flavefaciens cellulosome reflects an expansion in glycan recognition. Proc Natl Acad Sci USA. 2016;113:7136–41.

    Article  CAS  PubMed  Google Scholar 

  71. 71.

    Hamberg Y, Ruimy-Israeli V, Dassa B, Barak Y, Lamed R, Cameron K, et al. Elaborate cellulosome architecture of Acetivibrio cellulolyticus revealed by selective screening of cohesin–dockerin interactions. PeerJ. 2014;2:e636.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Authors’ contributions

VVZ, WHS, LPS, SG, and BL planned and designed the research. BL, CH, and BA performed the experiments. BL, CH, FB, and VVZ analyzed the data. BL, VVZ, WHS, and WL wrote the manuscript. All authors read and approved the final manuscript.


The authors thank Sabrina Sigl and Patricia Krähe for excellent technical assistance. Provision of Kraft process pretreated softwood from UPM-Kymmene by Michael Duetsch is acknowledged.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its additional files.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.


This work was supported by the German Federal Ministry of Education and Research (BMBF, Research Grant number 0316147) and the German Federal Ministry for Economic Affairs and Energy (BMWi, Grant number 03EFIBY149). Publication of this work was supported by the German Research Foundation (DFG) and the Technische Universität München within the funding program Open Access Publishing.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information



Corresponding author

Correspondence to Vladimir V. Zverlov.

Additional files

Additional file 1. Oligonucleotides used in this study.

Additional file 2. SDS-PAGE summary of all recombinantly expressed dockerin type I containing proteins used in this study (60 different components of C. thermocellum cellulosome). Each protein is shown after the purification process, including the final heat precipitation step. Enzyme names and Clo1313 numbers are shown above the corresponding SDS-PAGE picture. Molecular weight standards are depicted in kDa. Proteins marked with asterisks are truncated versions of the original protein.

Additional file 3. Effect of inhibitors on the cellulosomalcomplexes. The glucose concentration was measured using the Glu-HK determination kit (Megazyme). The residual complex activity was assessed using 0.5 mL standard reaction mixture containing 0.25% (w/v) of the substrate for one to 2 days of incubation at 60 °C. The initial activity at time point 0 corresponds to 100% relative activity.

Additional file 4. Overview of 47 recombinant proteins. All proteins were mixed in equimolar amounts before adding them to the pentavalent SKLMY complex.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Leis, B., Held, C., Andreeßen, B. et al. Optimizing the composition of a synthetic cellulosome complex for the hydrolysis of softwood pulp: identification of the enzymatic core functions and biochemical complex characterization. Biotechnol Biofuels 11, 220 (2018).

Download citation


  • Clostridium thermocellum
  • Cellulosome
  • Screening
  • Synthetic cellulase complex
  • Softwood
  • Cellulose