
Hypocrea jecorina CEL6A protein engineering


The complex technology of converting lignocellulose to fuels such as ethanol has advanced rapidly over the past few years, and enzymes are a critical component of this technology. The production of effective enzyme systems at cost structures that facilitate commercial processes has been the focus of research for many years. Towards this end, the H. jecorina cellobiohydrolases, CEL7A and CEL6A, have been the subject of protein engineering at Genencor. Our first rounds of cellobiohydrolase engineering were directed towards improving the thermostability of both of these enzymes and produced variants of CEL7A and CEL6A with apparent melting temperatures above 70°C, placing their stability on par with that of H. jecorina CEL5A (EG2) and CEL3A (BGL1). We have now moved towards improving CEL6A- and CEL7A-specific performance in the context of a complete enzyme system under industrially relevant conditions. Achievement of these goals required development of new screening strategies and tools. We discuss these advances along with some results, focusing mainly on engineering of CEL6A.


Hypocrea jecorina (anamorph Trichoderma reesei) is an industrially important producer of cellulases for an applications portfolio that includes pulp and paper processing, food and feed processing, textile manufacturing and modification, and detergents/cleaners formulation. Cellulases were employed in early 'waste to transportation fuel' research in, for example, the U.S. Army Natick program in the 1970s; however, it was not until 2007 that a cellulase formulation for biomass saccharification was developed and marketed to the nascent biomass-to-ethanol industry (Accellerase®1000; Genencor Division, Danisco USA Inc.).


To understand the challenges of protein engineering biomass enzymes, it is important to understand current concepts of their structure-function relationship. Early concepts of cellulase mode of action were published by Reese et al. in 1950 [1], followed by a review article in 1964 [2]. Since then, a great deal of work has been devoted to elucidating the mechanism of enzymatic cellulose degradation. These studies have covered a wide range of substrates from pure cellulose to pretreated lignocellulose from various sources [3-8]. Yet, there is still much to learn regarding the fundamental mechanism of cellulases.

Two of the most important cellulase glycosyl hydrolase families are GH6 and GH7. The first structure of a cellobiohydrolase to be solved was the catalytic module of H. jecorina CEL6A (previously known as CBH 2), which was published by Rouvinen et al. in 1990 [9]. The first structure of a fungal cellulose binding module, the CBM1 of H. jecorina CEL7A (previously known as CBH 1), was published by Kraulis et al. in 1989 [10], and a few years later the structure of the H. jecorina CEL7A catalytic module was published by Divne et al. in 1994 [11].

The catalytic mechanism of glycosyl hydrolase family 6 enzymes, including both exo- and endoglucanases, is net inversion (versus retention in family 7) of the anomeric configuration (Carbohydrate Active Enzymes database; Cantarel [12]). Although there is no overall sequence homology between the catalytic domains of the H. jecorina cellobiohydrolases CEL6A and CEL7A, there is homology between CEL6A and GH6 endoglucanases and between CEL7A and GH7 endoglucanases. It is the differences, particularly in the loop regions that form the active site tunnel in CEL6A and CEL7A, that affect the type of enzyme activity. These loops are missing in the endoglucanases, leaving a more open active site cleft [9, 11]. Indeed, deletion of the C-proximal loop that covers the active site of Cellulomonas fimi cellobiohydrolase A increased its endoglucanase activity [13]. Furthermore, Divne et al. [11] concluded that dissimilarities in the CEL6A and CEL7A tunnels (length, number of tryptophans, and asymmetric distribution of glucose binding subsites) could explain observations of distribution differences on cellulose fibrils and synergy between CEL7A and CEL6A.

Advanced modeling [14, 15] and microscopy techniques [16, 17] are being used to study cellobiohydrolase-substrate interactions. These techniques hold promise for demonstrating how these enzymes bind substrate and move and how these events are related to hydrolysis. Studies of the H. insolens CEL6A [18] provided a detailed description of solvent-mediated carbohydrate-protein interactions involved in ligand movement through the active site tunnel during hydrolysis using crystal structures to support the authors' model. Electron microscopy studies with enzymes from Humicola insolens confirmed that the GH6 and GH7 cellobiohydrolases act processively from opposite ends of the cellulose chain, GH6 from the nonreducing end and GH7 from the reducing end [19]. These studies also revealed that CEL6A had substantial endoglucanase activity and lower processivity than CEL7A, at least on the Valonia cellulose crystals studied. However, on a complex biomass substrate, in a natural mix of cellulases, it is not known to what extent CEL6A relies on its own endoglucanase activity and to what extent it starts from chain ends created by other endoglucanase enzymes. The Walker lab at Cornell University uses single-molecule detection methods, such as quantitative fluorescence microscopy, to study cellulase adsorption and hydrolysis [20]. Recently, researchers have observed cellulases sliding along a cellulose fibril in real time and measured the velocity of sliding [21]. While the basic chemical mechanisms of glycosyl hydrolases are well understood and their combined roles in the hydrolysis of pure crystalline cellulose are reasonably well elucidated, hydrolysis of lignocellulosic substrates is more complicated and is the subject of continuing research. Much of the applied research is now directed towards discovering the minimum number of enzyme components that provide optimal performance.

Optimal cellulase mixture for biomass hydrolysis

To date, optimal mixtures have generally been developed by purely empirical approaches [22, 23] and may vary relative to the substrate being used. The CEL7A and CEL6A cellobiohydrolases are the two most abundant enzymes in H. jecorina [23], indicating their key role in the cellulase enzyme system. The importance of CEL7A and CEL6A was demonstrated in 1980 by Reese and Mandels [24] when they showed that cellobiohydrolase activity was limiting cellulose hydrolysis. They also demonstrated that the cellobiohydrolase component was less stable under standard process conditions (pH 4.8, 50°C, 24 hr) than most of the other enzymes in the native H. jecorina cellulase milieu. That the cellobiohydrolases are indeed key players was further supported by individual deletion of the four major cellulase genes cbh1, cbh2, egl1 and egl2 (coding for CEL7A, CEL6A, CEL7B and CEL5A) in the industrial H. jecorina strain VTT-D-79125, where the mutants without CEL7A or CEL6A showed a 70% and 33% activity loss, respectively, on filter paper relative to the parent strain [25]. This study, and that of Reese and Mandels [24], was conducted with pure cellulose substrates (filter paper and Avicel, respectively). Rosgaard et al. [23], using two different hot water-pretreated barley samples, demonstrated optimal performance when the CEL6A:CEL7A ratio was 2:1, again showing that cellobiohydrolase activity exceeding that of native H. jecorina preparations was beneficial.

The issues of inhibition (substrate and product) and substrate accessibility in these complex mixtures have been recognized for some time [26-29]. It was also accepted early on that efficient cellulose-digesting enzymes would need to be produced with acceptable economics. Much effort has been made to improve the productivity of strains descending from the original T. reesei QM6a [30]. Programs such as the NTG mutagenesis and selection conducted at Rutgers University [31] and Lehigh University [32] were quite successful and resulted in strains that are still in industrial use, such as RutC30 and RL-P37, and their descendants. Strain improvement has continued even with the relatively high productivity of these industrial strains. To date, enzyme cost reduction has been mainly accomplished by strain improvement rather than by protein engineering.

Protein engineering

The generally low turnover rates of cellulases, for example, one to four per second for CEL7A [33], present a challenge in applications where cheap sugars are the desired product. Faster rates may be accomplished by raising the reaction temperature and/or by increasing the specific activity of the enzyme. CEL6A has been the target of several protein engineering efforts; some were described in Schülein's 2000 review [34] of cellulase engineering, where he acknowledged that most of the work was devoted to understanding the catalytic mechanism. At VTT, Koivula et al. [35] identified the CEL6A catalytic domain surface residue W272 as essential for degradation of crystalline cellulose, but not soluble or amorphous cellulose. Wohlfahrt et al. [36] identified carboxyl-carboxylate pair mutations (to the corresponding amide-carboxylate pair) to be useful in stabilizing H. jecorina CEL6A, particularly with respect to pH. More site-directed mutations pointed to a role for Y169 in kinetics and binding [37]. The Y169F mutation resulted in little change in the crystal structure relative to wild-type CEL6A, but the association constants for cellotriose and cellotetraose increased fourfold while the activity decreased to about one-fourth its original level. The authors speculate that the Y169 residue imposes a distortion of glucose to a more reactive conformation in the active site tunnel. Mutations of two carboxylic acid residues, D175 and D221, at the catalytic centre of CEL6A, supported the proposed role of D221 as responsible for protonation of the glycosidic oxygen, while molecular dynamics simulations indicated that D175 indirectly fulfills the role of a catalytic base [38]. However, in most of the structures of GH6 enzymes, D221 (or the equivalent residue) makes a hydrogen bond to and shares a proton with D175; therefore, D221 cannot protonate the glycosidic oxygen.
Formation of a catalytically competent configuration seems to be associated with closing of a flexible loop in association with the substrate, which isolates two water molecules at the catalytic centre [39]. D175 is in contact with and may accept a proton from one of the water molecules, which may in turn accept a proton from the second water that is in position for nucleophilic attack on the anomeric carbon of the sugar. But the closing of the loop also restricts the passage between successive glucose binding subsites within the active site, suggesting that processive action along a cellulose chain requires a cyclic closing-opening sequence for each hydrolytic event, in conformity with molecular dynamics [38]. Flexibility in this loop may thus be an important factor for the specific activity of the enzyme.

Wilson and colleagues [40-42] have worked extensively with Thermobifida fusca CEL6B exocellulase and have characterized the catalytic mechanism using enzyme variants. With CEL6B variants expressed in S. lividans or E. coli, they demonstrated that lower activity with insoluble substrates was linked to reduced processivity and that adding disulfide bonds across the loops forming the active site tunnel reduced ligand binding, processivity and activity. In addition, they identified noncatalytic CEL6B mutations in which single and double mutants (G234S, G284P) demonstrated higher activity on swollen cellulose and filter paper, but these improved variants did not increase synergism with the T. fusca endoglucanase Cel5A [40]. Mutations near the substrate binding site were found to reduce cellobiose inhibition, but in most cases (except G234S) the mutations also resulted in reduced thermal stability. The effect of cysteine residue mutations on expression and thermostability was related to the position of the residue(s) and whether it led to aberrant disulfide bond formation, improper folding, and sometimes proteolysis [41]. Wilson described these and other complexities of engineering cellulases for enhanced activity in a 2009 review article [42]. The review cautions that (1) there is a dearth of demonstrated cellulase activity improvement, (2) improving a single activity may be irrelevant if performance of the synergistic mixture is not improved, (3) protein engineering using only the catalytic module does not guarantee the same performance in the full length protein and (4) activity improvement demonstrated on one substrate does not guarantee the same results on a different substrate.

Recently, Heinzelman and colleagues [43] demonstrated that CEL6A chimeras, which included sequences from H. jecorina CEL6A, CEL6A from the thermophilic fungus Humicola insolens, and/or sequences from three other fungi, expressed by S. cerevisiae, resulted in molecules with greater thermostability than either parent. The chimeras demonstrated a broader pH range than H. jecorina CEL6A. Of the chimeras described, none exceeded CEL6A specific activity on phosphoric acid swollen cellulose (PASC). In a related study [44], they identified a single mutation that significantly enhanced H. jecorina CEL6A thermal stability with PASC hydrolysis activity similar to wild type.

Yet, after all these years, cellulosic ethanol is still expensive relative to starch ethanol. The saccharifying enzymes compose a significant cost component, so it is our objective to reduce the dose required to convert pretreated biomass to simple sugars for fermentation. One way to optimize performance is through protein engineering. Engineered cellulases have been used in textile applications, but few examples exist in the field of biomass saccharification. Unlike textile enzyme products, which often are monocomponent, lignocellulosic biomass conversion requires a complex mixture of enzymes. This presents several challenges to a protein engineering approach to increasing performance. For example, it is not efficient to engineer all of the required enzymes. However, within the enzyme complex used for lignocellulosic conversion, some activity is always going to be either limiting or required in abundance. The preferred protein target(s) for improvement is an enzyme that is limiting and has the potential for making a large impact on the specific performance of the system. This improvement could be the result of one or a combination of improvements such as increased specific activity, reduced product inhibition, reduced nonproductive binding, or enhanced stability and longevity under process conditions. Studies using pure cellulose and traditional methods for detecting sugar release [45-49] have provided insight into enzyme mode of action, but pure cellulose does not necessarily predict enzyme performance on pretreated lignocellulose. Such improvements should be measured under process-relevant conditions, making development of the appropriate screening assay critical. Screens conducted in the complex biomass milieu will enable the detection of variants that are less susceptible to inhibition and inactivation from components found in the biomass of interest.

Results and Discussion

Mixture studies

We demonstrated that the performance optima of a cellulase mixture also favoured more cellobiohydrolase activity using a process-relevant substrate, dilute acid-pretreated sugarcane bagasse, in a computer-designed mixture experiment (Design Expert Dx5). In this experiment, purified H. jecorina CEL7A and CEL6A were combined in various ratios with an RL-P37 background sample from which the cel7A and cel6A genes had been deleted (delta delta P37). Samples were dosed such that the final cellulose concentration of the mixture was 6% (wt/wt). In the three component mixtures, the amounts of CEL7A, CEL6A and delta delta P37 proteins ranged from 5 to 85 weight percent of the total mixture protein. Five percent (wt/wt) H. jecorina Cel3A (beta glucosidase 1) was included in all of the mixtures to mitigate the impact of cellobiose inhibition by converting most of the soluble cellooligosaccharides to glucose, which was analyzed by high-pressure liquid chromatography (HPLC). At both 50°C and 60°C, the performance of the mixture favoured more CEL7A and CEL6A activity and in approximately equal proportions, whereas in the native preparation CEL7A is three- to fourfold more abundant than CEL6A (Figure 1). The cellobiohydrolases were more limiting at 60°C than at 50°C, possibly due to their relatively lower thermostability. Enhanced activity and thermal stability of cellobiohydrolases are thus target properties to enhance for improvement of biomass conversion performance.
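The composition constraints of this mixture experiment can be enumerated as candidate design points. The following is a minimal sketch, not the design generated by Design Expert: it assumes a simple lattice with a hypothetical 20-point step, fixes Cel3A at 5 wt%, and lets CEL7A, CEL6A and the delta delta P37 background each range from 5 to 85 wt% so that the three together make up the remaining 95 wt%.

```python
from itertools import product

BGL_PERCENT = 5  # Cel3A fixed at 5 wt% in every mixture (from the text)


def mixture_points(low=5, high=85, step=20):
    """Enumerate (CEL7A, CEL6A, background) wt% triples on a lattice.

    The lattice step is an illustrative assumption; the actual design
    points were chosen by the design-of-experiments software.
    """
    total = 100 - BGL_PERCENT  # the three variable components share 95 wt%
    levels = range(low, high + 1, step)
    points = []
    for cel7a, cel6a in product(levels, repeat=2):
        background = total - cel7a - cel6a
        if low <= background <= high:
            points.append((cel7a, cel6a, background))
    return points


if __name__ == "__main__":
    for point in mixture_points():
        print(point)
```

Each triple is one candidate blend; a finer step gives more points at the cost of more saccharification runs.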

Figure 1

Mixture experiment. Three component design of experiment shows limitation and thermolability of cellobiohydrolase activity compared to the cellulase background of delta-CEL7A delta-CEL6A RL-P37. Shown are the experimental design points (left), contour plots for 50°C saccharification (middle) and 60°C saccharification (right). Red dots represent actual data points. Duplicate data points are indicated by the label (2). Contour labels indicate percent glucan conversion.

Cellulase engineering for thermostability

From 2000-2004, Genencor worked with the National Renewable Energy Laboratory (NREL) under the auspices of the Department of Energy Office of the Biomass Program and improved the thermal stability of CEL6A and CEL7A through protein engineering.

Differential scanning calorimetry (DSC) was used to determine the thermal midpoint (Tm) values of the thermal unfolding process for the most abundant H. jecorina cellulases and β-glucosidase (Table 1). Excess heat capacity curves were measured using an ultrasensitive scanning high-throughput microcalorimeter, VP-Cap DSC (MicroCal, Inc., Northampton, MA). H. jecorina enzymes (500 μl of 0.5 mg/ml) were scanned over a 30-90°C temperature range in 10 mM sodium acetate buffer, pH 5.0. A 200°C/hr scan rate was used. The Tm of the DSC curves was used as an indicator of the thermal stability and calculated using the Origin Lab 7.0 software. Native CEL6A and CEL7A are more thermolabile than CEL5A and CEL3A, which may affect whole cellulase performance in extended saccharification reactions at >50°C. With the screening tools available in 2000, protein engineering was undertaken to improve the thermal stability of the two H. jecorina cellobiohydrolases.
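As a rough illustration of how an apparent Tm can be read from a DSC trace, the sketch below takes the temperature at the maximum of the excess heat capacity curve. This is an assumption made for illustration; the Tm values reported here were calculated with the Origin software, whose fitting procedure is not described in the text. The trace is synthetic and its 62°C transition is made up.

```python
import math


def apparent_tm(temps_c, excess_cp):
    """Return the temperature at which excess heat capacity is largest."""
    peak_index = max(range(len(excess_cp)), key=lambda i: excess_cp[i])
    return temps_c[peak_index]


def scan_minutes(start_c, end_c, rate_c_per_hr):
    """Duration of a linear temperature scan, in minutes."""
    return (end_c - start_c) * 60 / rate_c_per_hr


# Synthetic, made-up trace: one Gaussian-shaped transition centred at 62.0 C.
temps = [30 + 0.5 * k for k in range(121)]  # 30 to 90 C in 0.5 C steps
trace = [math.exp(-((t - 62.0) / 3.0) ** 2) for t in temps]

if __name__ == "__main__":
    print(apparent_tm(temps, trace))
    print(scan_minutes(30, 90, 200))
```

With the scan settings given above (30-90°C at 200°C/hr), `scan_minutes` shows that one scan takes 18 minutes.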

Table 1 H. jecorina cellulases.

Different mutagenesis and screening approaches were used for CEL7A and CEL6A Tm improvement. The melting point of CEL7A was increased through a combination of random and site-directed mutagenesis and screening (US 2007/0173431). A limited number of sites with potential involvement in stability were selected on the basis of structure and a 42-member CEL7A sequence alignment of Hypocrea and Trichoderma family members. Site saturation mutagenesis was performed on these sites. CEL7A variants containing from 1 to 19 mutations were expressed in A. niger for screening, and approximately 100,000 clones were assayed for improved stability. Stability was determined by the difference in 4-methylumbelliferyl-lactoside (Sigma Chemicals, M2405) activity before and after a heat challenge. Select A. niger-expressed variants were purified by hydrophobic interaction chromatography, and thermal stability was determined by circular dichroism spectrophotometry (CD) [50-52].

Point mutations were then combined to obtain CEL7A variants with substantially higher thermal stability (Goedegebuur et al., manuscript in preparation). Eighteen sites were combined to produce one CEL7A variant, which when expressed in T. reesei had a Tm increase of 14.8°C (Tm 76.0°C) as determined by DSC. While screening was performed on A. niger-expressed proteins, lead molecules were then expressed in T. reesei for validation of performance, expression, and Tm determination.

CEL6A variants were also expressed in A. niger for screening. A limited number of sites were selected for mutagenesis through a consensus approach (US 2006/0205042). A sequence alignment of H. jecorina CEL6A and eight GH6 family members was used to construct a consensus sequence. Single and multiple amino acid mutations were designed and made by site mutagenesis. Nonconserved positions were examined in the crystal structure, and mutations were selected that were different in CEL6A from the consensus and which fit the structure without disturbance. Conserved sites were not changed. More than 5000 clones were screened on PASC (prepared from Avicel PH101 [53]) for remaining activity after heat inactivation for 1 hr at 61°C or 65°C at pH 4.85. Combinations of mutations were made and ultimately resulted in an H. jecorina-expressed CEL6A variant with a Tm increase of 6.9°C (as determined by DSC). This thermostable variant was shown to have similar activity to the CEL6A wild type in a reconstituted whole cellulase in dilute acid pretreated corn stover (PCS [54]) hydrolysis. PCS (7% wt/wt cellulose) specific performance was tested at 53°C for 20 hr by adding CEL6A variants to a CEL6A-deleted cellulase strain product (US 2006/0205042). These two protein engineering projects resulted in apparent melting temperatures above 70°C for both CEL7A and CEL6A, placing their stability on par with that of CEL5A and CEL3A.
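The consensus approach described above can be sketched as follows. This is an illustrative reduction, not the procedure used in the cited patent application: it builds a majority-rule consensus from an alignment and lists positions where the target differs from it. The sequences are short, made-up strings, positions are alignment columns rather than true residue numbers, and the structural check ("fit the structure without disturbance") is not modeled.

```python
from collections import Counter


def consensus(alignment):
    """Most common residue in each column of equal-length aligned sequences."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*alignment))


def candidate_mutations(target, homologs):
    """List substitutions that move the target toward the consensus.

    Fully conserved columns never appear, because there the target
    already matches the consensus. Gap columns ('-') are skipped.
    """
    cons = consensus([target] + homologs)
    return [
        f"{wt}{pos}{new}"
        for pos, (wt, new) in enumerate(zip(target, cons), start=1)
        if wt != new and wt != "-" and new != "-"
    ]


# Hypothetical toy alignment: one target and four homologs.
toy_target = "MKAVL"
toy_homologs = ["MRSVL", "MRSVL", "MRAIL", "MRSVL"]

if __name__ == "__main__":
    print(consensus([toy_target] + toy_homologs))
    print(candidate_mutations(toy_target, toy_homologs))
```

In practice each candidate would still be inspected in the crystal structure before being made, as the text describes.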

Cellulase engineering for performance

With this foundation of knowledge and experience, and with new tools in hand, we tackled the challenge of improving the specific activity of H. jecorina CEL6A. The products of this research will contribute to the cost reduction of enzymes in biomass-to-ethanol processes. One example of the application of the improved enzymes is the demonstration plant that DuPont Danisco Cellulosic Ethanol, LLC (DDCE) began operating in early 2010.

Since our initial protein engineering work with NREL, we have developed T. reesei as a screening host for protein engineering and made other technical advancements on the basis of the lessons learned during those early years. Because T. reesei is a preferred host for enzyme production, we developed it as the host for screening to ensure that expression and performance of the selected variants would not be lost when expressed in the production host. The requirements for an effective high throughput screening strain include high frequency of transformation, reliable gene expression, reproducible growth in microtiter plate (MTP) format, sufficient and reproducible protein production, and low secretion of background proteins. The latter was especially important because the candidates for protein engineering were overexpressed using a cellulase induction system that had the potential to induce production of confounding background activities. The strain basis for screening was a fourfold deletion variant of T. reesei in which the genes for cel7A, cel6A, cel7B, and cel5A had been removed.

The biomass saccharification assay was miniaturized from shake flasks to 96-well MTP. Although miniaturized, the assays incorporated process-relevant conditions. Either washed or unwashed pretreated biomass was used as the substrate. The substrate was delivered to the MTP as a slurry in pH 5 sodium acetate buffer, with a consistency like cake batter. We demonstrated that the MTP scale assay was predictive of shake flask scale results (Figure 2). We also found that the MTP scale assay was predictive of larger-scale performance (not shown). MTP and shake flask saccharification assays were incubated for 3-5 days at 50°C, with shaking, using washed PCS at 13% wt/wt final solids (7% wt/wt cellulose). The correlation was shown by comparison of MTP scale results with shake flask scale results from NREL using the same materials and conditions. Each scale presented different challenges in delivery, mixing, and sampling. It is critical that the small-scale assays predict large-scale results. Screens performed with pure cellulosic substrates are often not predictive and do not allow for a mechanistic understanding of complex substrates. In fact, the results of screening with a particular complex substrate may not accurately predict performance on a different complex substrate. This is illustrated in the results of a performance comparison of 62 independent samples of T. reesei whole cellulase in saccharification assays with four different substrates: dilute acid pretreated sugarcane bagasse, PCS, Avicel, and PASC. The dilute acid PCS and bagasse were produced and provided by NREL [54]. The composition of lignocellulosic materials was determined using the assays detailed in the NREL protocols for Standard Biomass Analytical Procedures.

Figure 2

Saccharification assays. The miniaturized saccharification assay with dilute acid pretreated corn stover is predictive of shake flask scale performance. The shake flask assay was conducted by NREL according to their Laboratory Analytical Procedure (LAP) "Enzymatic Saccharification of Lignocellulosic Biomass".

The 62 cellulase samples represented material from various Genencor T. reesei strains, production lots, protein production conditions, and formulations, collected over several years. In the performance assays, the cellulases were dosed at 20 mg total protein per gram of cellulose. Total protein was determined by an automated Biuret method (Pointe Scientific T7528). The substrate loading of the PCS, bagasse, and Avicel was 7% (wt/wt) cellulose. Substrates were incubated with the cellulases for 3 days at 50°C, pH 5, and 200 rpm shaking. The cellulases were incubated with 1% (wt/wt) PASC at 50°C, pH 5, with shaking for 1 hr. Cellulose hydrolysis was measured either by a reducing sugar release assay (e.g., PAHBAH assay [49]), or by HPLC. Although clean cellulose such as Avicel and PASC generally correlated with lignocellulose conversion, there were exceptions (Figure 3). For example, sample 35 showed overall good performance on all four substrates; however, sample 57 performed well on PASC and Avicel and poorly on bagasse and PCS. Overall, enzyme performance was greater on bagasse than on PCS. There was little correlation between performance and enzyme production process or formulation or sample age. On the basis of previous experience, all samples were assumed to be stable during storage.
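The dosing arithmetic implied by these conditions can be written out as a small sketch. The 7% (wt/wt) cellulose loading and the 20 mg protein per gram of cellulose dose come from the text; the slurry mass and the enzyme stock concentration used in the example are hypothetical.

```python
def cellulose_mass_g(slurry_mass_g, cellulose_fraction=0.07):
    """Grams of cellulose in a slurry at the given wt/wt cellulose loading."""
    return slurry_mass_g * cellulose_fraction


def enzyme_dose_mg(slurry_mass_g, cellulose_fraction=0.07, dose_mg_per_g=20.0):
    """Total protein (mg) needed to dose at mg protein per g cellulose."""
    return cellulose_mass_g(slurry_mass_g, cellulose_fraction) * dose_mg_per_g


def stock_volume_ml(dose_mg, stock_mg_per_ml):
    """Volume of enzyme stock that delivers the requested protein mass."""
    return dose_mg / stock_mg_per_ml


if __name__ == "__main__":
    # Hypothetical 100 g reaction and 10 mg/ml enzyme stock.
    dose = enzyme_dose_mg(100.0)
    print(dose, stock_volume_ml(dose, 10.0))
```

For a hypothetical 100 g reaction this gives about 7 g of cellulose and about 140 mg of total protein.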

Figure 3

H. jecorina cellulase performance. Comparison of saccharification performance (glucan conversion) on biomass and model cellulosic substrates.

Because of these results, our screening assays for protein engineering were developed to bring them closer to actual use conditions. This required the development of biomass performance screens in which the activity of a specific enzyme or variant could be queried within a cellulase background. The challenge of screening for the target cellulase activity in a background of other cellulases is not trivial due to the synergistic nature of the enzymes. Another advance was using pretreated lignocellulosic substrate at high solids. Screening for CEL6A specific activity improvements required development of two high-throughput assays: one that showed dose dependence with respect to CEL6A concentration and one to accurately determine the concentration of expressed CEL6A. Variants were screened in a reconstituted cellulase background lacking cellobiohydrolase activity and including sufficient β-glucosidase activity to produce primarily glucose, which was measured by reducing sugar analysis using the PAHBAH method [49]. The substrate was washed PCS. Specific activity screening became possible with HPLC determination of protein concentrations in a 96-well MTP format. Although specific activity was the target property for improvement, stability using PASC was also measured.

Our protein engineering approach was to create Site Evaluation Libraries (SELs) that contained all 19 amino acid substitutions (including recreation of wild type). The libraries were generated in E. coli, variants were sequenced, and plasmid DNA was transformed into T. reesei for expression and screening. Each variant was screened for multiple properties to ensure that important properties, such as thermal stability and performance, were not lost.
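To make the library size concrete, the sketch below enumerates the 19 possible substitutions at one site using the usual wild-type/position/new-residue notation. The site `A100` is a hypothetical placeholder, not a position reported in this study.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def site_library(wild_type_residue, position):
    """All 19 single substitutions at one site, e.g. 'A100C' ... 'A100Y'."""
    return [
        f"{wild_type_residue}{position}{aa}"
        for aa in AMINO_ACIDS
        if aa != wild_type_residue
    ]


if __name__ == "__main__":
    library = site_library("A", 100)  # hypothetical site
    print(len(library), library[:3])
```

At 19 substitutions per site, a campaign covering 100 sites corresponds to 1,900 distinct single-site variants before replicate and wild-type controls are counted.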

Selection of the CEL6A sites for engineering was based on knowledge of the enzyme structure and guided by sequence alignments. More than 100 nonconserved CEL6A residues were selected for mutagenesis. They covered about 30% of the molecule concentrating on catalytic domain surface residues, but also including sites in the linker region and the carbohydrate binding module. Active site residues were not targeted in this study.

Although it was tempting to use a tagged molecule for ease of separation of the protein of interest from the cellulase background, we demonstrated that a C-terminal His tag on CEL6A caused reduced performance in biomass assays. In contrast, PASC assay performance of CEL6A was not affected by the tag, which emphasized again the need to use process-relevant conditions in screens (Figure 4). Instead, we developed proprietary high-throughput assays for HPLC determination of CEL6A and variant protein concentration to enable calculation of specific activities and dose-dependent biomass performance of CEL6A.

Figure 4

Relative activity of CEL6A molecules. Purified CEL6A and purified CEL6A-His6 were compared in two cellulose hydrolysis activity assays. CEL6A and CEL6A-His6 exhibited similar activity in PASC hydrolysis (top graph). PASC activity was determined by measurement of reducing sugars by PAHBAH following 1-hr incubation at 50°C in 1% (wt/wt) PASC. However, CEL6A-His6 activity was compromised, compared to native CEL6A, in hydrolysis of dilute acid pretreated corn stover (bottom graph), PCS miniaturized assay.

MTP-scale saccharification assays using PCS and PASC were used to screen the CEL6A variants. Serial dilutions of the CEL6A variants were added to the PCS saccharification assay such that a dose-response curve could be generated. Cellulose hydrolysis was determined by measurement of the increase in reducing sugars (PAHBAH). A performance index (PI) was calculated for each variant. The performance index is the ratio of performance of the variant to the wild-type protein. Generally an improved variant would have a PI >1, as shown in Figure 5. Although saccharification data is not linear with respect to CEL6A concentration and requires a curve fit, improved variants can be detected with this assay and analysis. CEL6A controls were included in each MTP in two formats. Plate-to-plate reproducibility of wild-type CEL6A was compared for PASC performance and found to be acceptable for detecting winners. Each growth plate contained recreated wild types in the SELs as well as wild-type controls. In Figure 6, each graph shows the activity of the wild-types on each PASC assay plate plotted against the isotherm fit for the activity of the wild-types from all 34 plates (blue line). The same comparison was made for PCS and found to be acceptable. In addition to the PCS and PASC saccharification assays, CEL6A variants were also screened for ethanol stability and heat stability (PASC activity before and after heating at a challenge temperature).
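One way to compute a performance index from nonlinear dose-response data is sketched below. The text states only that a curve fit is required; the saturation form signal = a·dose/(b + dose), the double-reciprocal least-squares fit, and all numbers are assumptions made for illustration, not the authors' method. The PI is taken as the variant's signal divided by the signal the fitted wild-type curve predicts at the same CEL6A concentration.

```python
def fit_saturation(doses, signals):
    """Fit signal = a * dose / (b + dose) by least squares on reciprocals.

    Uses 1/signal = (b/a) * (1/dose) + 1/a, so slope = b/a, intercept = 1/a.
    """
    xs = [1.0 / d for d in doses]
    ys = [1.0 / s for s in signals]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    a = 1.0 / intercept
    b = slope * a
    return a, b


def performance_index(variant_dose, variant_signal, a, b):
    """Variant signal relative to the fitted wild-type signal at equal dose."""
    wild_type_signal = a * variant_dose / (b + variant_dose)
    return variant_signal / wild_type_signal


# Made-up wild-type control data generated from a = 10, b = 2.
wt_doses = [0.5, 1.0, 2.0, 4.0, 8.0]
wt_signals = [10.0 * d / (2.0 + d) for d in wt_doses]

if __name__ == "__main__":
    a, b = fit_saturation(wt_doses, wt_signals)
    print(performance_index(1.0, 5.0, a, b))
```

With real, noisy plate data a direct nonlinear fit would usually be preferable, because the reciprocal transform overweights the low-dose points.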

Figure 5

Performance index. The performance index is the ratio of performance of the variant to the wild-type protein. PI >1 is improved over wild-type performance.

Figure 6

PASC assay controls. The sugar detected from the addition of the wild-type control CEL6A on a plate-by-plate basis plotted with a global curve fit (shown in the bottom right graph) to all of the data (second from the right at the bottom) collected during screening.

Two types of wild-type transformants were on each library plate: the controls, which were used to calculate the PI, and the wild types recreated within each site library, which were not used to calculate the PI and instead served to test the curve fit. For each assay, we graphed the natural log of the PI of the recreated wild types. The transformed data follow a Gaussian distribution (Figure 7) centered on zero for each property, as expected.
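The log transformation of wild-type PI values can be sketched as below. The mean and standard deviation of ln(PI) summarize the spread of the recreated wild types; the two-standard-deviation cutoff for flagging a variant is a hypothetical choice, not a threshold given in the text, and the PI values are made up.

```python
import math
from statistics import mean, stdev


def log_pi_summary(wild_type_pis):
    """Mean and sample standard deviation of ln(PI) for wild-type replicates."""
    logs = [math.log(pi) for pi in wild_type_pis]
    return mean(logs), stdev(logs)


def exceeds_wild_type_spread(pi, mu, sigma, n_sigma=2.0):
    """True if ln(PI) lies more than n_sigma deviations above the wild-type mean."""
    return math.log(pi) > mu + n_sigma * sigma


if __name__ == "__main__":
    mu, sigma = log_pi_summary([0.8, 1.0, 1.25])  # made-up replicate PIs
    print(mu, sigma, exceeds_wild_type_spread(2.0, mu, sigma))
```

Working in ln(PI) makes a twofold gain and a twofold loss symmetric about zero, which is why wild-type replicates are expected to center there.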

Figure 7

Normal distribution of CEL6A controls. Activity values of the library-recreated wild-type CEL6A, which were not used for curve fitting, were converted to PI values and then transformed with the natural log to obtain normally distributed data.
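The log transform used for the wild-type check can be illustrated as follows. The PI values are invented for the example and the helper name is hypothetical; the sketch only shows why ln PI is the natural scale for a ratio, namely that a two-fold gain and a two-fold loss sit at equal distances on either side of zero.

```python
import math
from statistics import mean, stdev


def log_pi(pi_values):
    """Natural log of each performance index (PI = 1 maps to 0)."""
    return [math.log(p) for p in pi_values]


# Invented PI values for recreated wild types on one plate (not study data).
recreated_wt_pi = [0.80, 0.90, 1.00, 1.00, 1.10, 1.25]

wt_ln_pi = log_pi(recreated_wt_pi)
center = mean(wt_ln_pi)   # expected to be close to 0 for wild type
spread = stdev(wt_ln_pi)  # sample standard deviation, i.e. the assay noise on the ln scale

# On the ln scale a PI of 2.0 and a PI of 0.5 are symmetric about zero.
gain, loss = math.log(2.0), math.log(0.5)
```

The spread of the recreated wild types on this scale gives a direct sense of how far from zero a variant must lie before it can be distinguished from assay noise.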

A correlation was observed between the two activity assays, PCS and PASC, from the graph of the natural log of the PI for all of the transformants (wild type and variants) (Figure 8). The variants in the upper right quadrant were improved approximately 2.5-fold over wild type. One false-positive wild type was observed in this quadrant. A correlation was also observed between the two stability assays: heat and ethanol (data not shown). There was little correlation between performance in the PASC activity assay and either stability assay (Figure 9).

Figure 8

Activity assay results. All of the wild-type and CEL6A variant activity data from the PASC and PCS assays are plotted. A correlation is observed between the two activity assays. Variants improved in both assays appear in the upper right quadrant. Wild type is shown as blue diamonds.

Figure 9

Stability assay results. All wild-type and CEL6A variant activity data from the PASC assay and both stability assays are plotted. Little correlation is observed between activity and stability. Variants improved in both properties appear in the upper right quadrant. Wild type is shown as blue diamonds.
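The comparisons in Figures 8 and 9 amount to plotting ln PI from one assay against ln PI from another, measuring how well they correlate, and reading off the upper right quadrant. A minimal sketch with invented PI values is given below. The Pearson coefficient is used here as one common measure of correlation; the text does not say which statistic, if any, the authors computed.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def in_upper_right(ln_pi_x, ln_pi_y):
    """True when a transformant is better than wild type in both assays."""
    return ln_pi_x > 0 and ln_pi_y > 0


# (name, PASC PI, PCS PI); all values are invented for illustration.
records = [
    ("wild_type", 1.0, 1.0),
    ("variant_1", 2.5, 2.4),
    ("variant_2", 0.5, 0.6),
    ("variant_3", 1.3, 1.2),
    ("variant_4", 0.8, 0.9),
]

ln_pasc = [math.log(pasc) for _, pasc, _ in records]
ln_pcs = [math.log(pcs) for _, _, pcs in records]

r = pearson_r(ln_pasc, ln_pcs)
improved_in_both = [
    name
    for (name, _, _), x, y in zip(records, ln_pasc, ln_pcs)
    if in_upper_right(x, y)
]
```

A wild-type transformant landing in the upper right quadrant, as reported for one case in the text, would show up in `improved_in_both` in the same way as a variant, which is why the wild types are plotted alongside the variants.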

The performance data can be sorted in a variety of ways, depending on the query objective. The most stringent case would be selection of mutations that resulted in improvements over wild type in all four assays (ethanol stability, heat stability, PASC activity and PCS activity). A less stringent selection would be for wild-type performance in some assays and improved performance in others. Several sites identified in previous CEL6A engineering efforts were included in the libraries and were identified again from the screen results.
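The two selection strategies described above can be written as simple filters over the four PI values of each variant. The sketch below uses invented data. The tolerance that defines "wild-type performance" (PI of at least 0.9) is a hypothetical choice made for the example, since the text gives no cutoff.

```python
ASSAYS = ("ethanol_stability", "heat_stability", "pasc_activity", "pcs_activity")


def improved_in_all(pi_by_assay, threshold=1.0):
    """Most stringent case: PI above wild type in every assay."""
    return all(pi_by_assay[assay] > threshold for assay in ASSAYS)


def no_loss_with_gain(pi_by_assay, floor=0.9, threshold=1.0):
    """Less stringent case: roughly wild-type or better everywhere, improved somewhere.

    `floor` is an assumed tolerance for calling a PI "wild-type performance".
    """
    values = [pi_by_assay[assay] for assay in ASSAYS]
    return all(v >= floor for v in values) and any(v > threshold for v in values)


# Invented PI values for three hypothetical variants.
variants = {
    "variant_A": {"ethanol_stability": 1.20, "heat_stability": 1.10,
                  "pasc_activity": 1.40, "pcs_activity": 1.30},
    "variant_B": {"ethanol_stability": 1.00, "heat_stability": 0.95,
                  "pasc_activity": 1.50, "pcs_activity": 1.40},
    "variant_C": {"ethanol_stability": 0.60, "heat_stability": 0.70,
                  "pasc_activity": 1.60, "pcs_activity": 1.50},
}

strict_hits = sorted(name for name, pi in variants.items() if improved_in_all(pi))
relaxed_hits = sorted(name for name, pi in variants.items() if no_loss_with_gain(pi))
```

Here `variant_C` is rejected by both filters despite having the highest activity, because its gain comes at the cost of stability. This is the kind of trade-off the multi-assay query is meant to expose.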


Biomass-to-ethanol plants are being constructed and operated today, but the foundation was laid by Mandels, Reese and others in the 1950s and 1960s. In fact, Reese credits Mandels with changing the focus of cellulase research at the U.S. Army Natick Laboratory from 'prevention of decomposition to promotion of decomposition' [55]. Reese and Mandels [24] demonstrated that cellobiohydrolase activity limits cellulose hydrolysis, but 30 years later there are few reported successes in improving cellobiohydrolase-specific activity. There are many technical challenges to increasing cellulase-specific performance, including development of representative and predictive screens, expression of variants in an appropriate host, measurement of specific activity (which requires high-throughput determination of the specific protein), and the ability to query cellulase activity in a background of confounding activities. In this work, T. reesei was demonstrated to be an effective high-throughput screening host for protein engineering. Specific activity screens were developed and shown to detect improvements in CEL6A activity in a T. reesei background with real biomass substrates at intermediate solids loadings. We improved the thermal stability of the two H. jecorina cellobiohydrolases to the same level as CEL5A and CEL3A. CEL6A variants were identified with higher activity than wild type and without loss of thermal stability. Because the methods used reflect process-relevant conditions, they will help to translate screening success quickly into industrial success, without the potential pitfalls of changing substrate, solids loading, expression host, or protein background. While cost reductions will be achieved through process optimization, improved cellulases and cellulase preparations will be needed to further reduce the cost of delivering cheap sugars to the biofuels and biochemicals industries.





DDCE: DuPont Danisco Cellulosic Ethanol, LLC

DSC: Differential Scanning Calorimetry

MTP: Microtiter plate

NREL: National Renewable Energy Laboratory

PASC: Phosphoric acid swollen cellulose

PCS: Dilute acid pretreated corn stover

PI: Performance Index

SEL: Site Evaluation Library


  1. Reese ET, Siu RGH, Levinson HS: The biological degradation of soluble cellulose derivatives and its relationship to the mechanism of cellulose hydrolysis. J Bacteriol 1950, 59: 485-488.

  2. Mandels M, Reese ET: Fungal cellulases and the microbial decomposition of cellulosic fabric. Devel Industrial Microbiol 1964, 5: 5-20.

  3. Caminal GJ, Sola LC: Kinetic modeling of the enzymic hydrolysis of pretreated cellulose. Biotechnol Bioeng 1985, 27: 1282-1290. 10.1002/bit.260270903

  4. Eriksson T, Karlsson J, Tjerneld F: A model explaining declining rate in hydrolysis of lignocellulose substrates with cellobiohydrolase I (cel7A) and endoglucanase I (cel7B) of Trichoderma reesei. Appl Biochem Biotechnol 2002, 101: 41-60. 10.1385/ABAB:101:1:41

  5. Gonzalez GCG, De Mas C, Lopez-Santin J: A kinetic model for pretreated wheat straw saccharification by cellulase. J Chem Technol Biotechnol 1989, 44: 275-288. 10.1002/jctb.280440404

  6. Eriksson T: Mechanism of surfactant effect in enzymatic hydrolysis of lignocellulose. Enzyme Microb Technol 2002, 31: 353-364. 10.1016/S0141-0229(02)00134-5

  7. Gan Q, Allen SJ, Taylor G: Kinetic dynamics in heterogeneous enzymatic hydrolysis of cellulose: an overview, an experimental study and mathematical modeling. Process Biochem 2003, 38: 1003-1018. 10.1016/S0032-9592(02)00220-0

  8. Kadam KL, Rydholm EC, Knutsen JS, McMillan JD: Development and validation of a kinetic model for enzymatic saccharification of lignocellulosic biomass. Biotechnol Prog 2004, 20(3): 698-705. 10.1021/bp034316x

  9. Rouvinen J, Bergfors T, Teeri T, Knowles JKC, Jones TA: Three-dimensional structure of cellobiohydrolase II from Trichoderma reesei. Science 1990, 249: 380-386. 10.1126/science.2377893

  10. Kraulis PJ, Clore GM, Nilges M, Jones TA, Pettersson G, Knowles J, Gronenborn AM: Determination of the three-dimensional solution structure of the carboxyl-terminal domain of cellobiohydrolase I from Trichoderma reesei: a study using NMR and hybrid distance geometry-dynamical simulated annealing. Biochemistry 1989, 28: 7241-7257. 10.1021/bi00444a016

  11. Divne C, Ståhlberg J, Reinikainen T, Ruohonen L, Pettersson G, Knowles JKC, Teeri TT, Jones TA: The three-dimensional crystal structure of the catalytic core of cellobiohydrolase I from Trichoderma reesei. Science 1994, 265: 524-528. 10.1126/science.8036495

  12. Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B: The Carbohydrate-Active Enzymes Database (CAZy): an expert resource for glycogenomics. Nucleic Acids Res 2009, 37: D233-D238. 10.1093/nar/gkn663

  13. Meinke A, Damude HG, Tomme P, Kwan E, Kilburn DG, Miller RC Jr, Warren RAJ, Gilkes NR: Enhancement of the endo-β-1,4-glucanase activity of an exocellobiohydrolase by deletion of a surface loop. J Biol Chem 1995, 270: 4383-4386. 10.1074/jbc.270.9.4383

  14. Bu L, Beckham GT, Crowley MF, Chang CH, Matthews JF, Bomble YJ, Adney WS, Himmel ME, Nimlos MR: The energy landscape for the interaction of the family 1 carbohydrate-binding module and the cellulose surface is altered by hydrolyzed glycosidic bonds. J Phys Chem B 2009, 113: 10994-11002. 10.1021/jp904003z

  15. Zhong L, Matthews JF, Hansen PI, Crowley MF, Cleary JM, Walker RC, Nimlos MR, Brooks CL, Adney WS, Himmel ME, Brady JW: Computational simulations of the Trichoderma reesei cellobiohydrolase I acting on microcrystalline cellulose Iβ: the enzyme-substrate complex. Carbohydr Res 2009, 344: 1984-1992. 10.1016/j.carres.2009.07.005

  16. Igarashi K, Koivula A, Wada M, Kimura S, Penttilä M, Samejima M: High speed atomic force microscopy visualizes processive movement of Trichoderma reesei cellobiohydrolase I on crystalline cellulose. J Biol Chem 2009, 284: 36186-36190. 10.1074/jbc.M109.034611

  17. Imai T, Boisset C, Samejima M, Igarashi K, Sugiyama J: Unidirectional processive action of cellobiohydrolase Cel7A on Valonia cellulose microcrystals. FEBS Lett 1998, 432(3): 113-116. 10.1016/S0014-5793(98)00845-X

  18. Varrot A, Frandsen TP, von Ossowski I, Boyer V, Cottaz S, Driguez H, Schülein M, Davies GJ: Structural basis for ligand binding and processivity in cellobiohydrolase Cel6A from Humicola insolens. Structure 2003, 11: 855-864. 10.1016/S0969-2126(03)00124-2

  19. Boisset C, Fraschini C, Schülein M, Henrissat B, Chanzy H: Imaging the enzymatic digestion of bacterial cellulose ribbons reveals the endo character of the cellobiohydrolase Cel6A from Humicola insolens and its mode of synergy with cellobiohydrolase Cel7A. Appl Environ Microbiol 2000, 66: 1444-1452. 10.1128/AEM.66.4.1444-1452.2000

  20. Moran-Mirabal JM, Santhanam N, Corgie SC, Craighead HG, Walker LP: Immobilization of cellulose fibrils on solid substrates for cellulase binding studies through quantitative fluorescence microscopy. Biotechnol Bioeng 2008, 101: 1129-1141. 10.1002/bit.21990

  21. Igarashi K, Uchihashi T, Koivula A, Wada M, Penttilä M, Ando T, Samejima M: Single molecular observations of processive glycosidases on crystalline substrates. 14th Annual SBNet Meeting June 11-14, 2010 abstract, Tällberg, Sweden

  22. Baker JO, Ehrman CI, Adney WS, Thomas SR, Himmel ME: Hydrolysis of cellulose using ternary mixtures of purified cellulases. Appl Biochem Biotechnol 1998, 70-72: 395-403. 10.1007/BF02920154

  23. Rosgaard L, Pedersen S, Langston J, Akerhielm D, Cherry JR, Meyer AS: Evaluation of minimal Trichoderma reesei cellulase mixtures on differently pretreated barley straw substrates. Biotechnol Prog 2007, 23: 1270-1276. 10.1021/bp070329p

  24. Reese ET, Mandels M: Stability of the cellulase of Trichoderma reesei under use conditions. Biotechnol Bioeng 1980, 22(2): 323-335. 10.1002/bit.260220207

  25. Suominen PL, Mäntylä AL, Karhunen T, Hakola S, Nevalainen H: High frequency one-step gene replacement in Trichoderma reesei. II. Effects of deletions of individual cellulase genes. Mol Gen Genet 1993, 241(5-6): 523-530. 10.1007/BF00279894

  26. Converse AO, Matsuno R, Tanaka M, Taniguchi M: A model of enzyme adsorption and hydrolysis of microcrystalline cellulose with slow deactivation of the adsorbed enzyme. Biotechnol Bioeng 1988, 32: 38-45. 10.1002/bit.260320107

  27. Gusakov AV, Sinitsyn AP: A theoretical analysis of cellulase product inhibition: effect of cellulase binding constant, enzyme/substrate ratio, and β-glucosidase activity on the inhibition pattern. Biotechnol Bioeng 1992, 40: 663-671. 10.1002/bit.260400604

  28. Holtzapple M, Cognata M, Shu Y, Hendrickson C: Inhibition of Trichoderma reesei cellulase by sugars and solvents. Biotechnol Bioeng 1990, 36: 275-287. 10.1002/bit.260360310

  29. Jeoh T, Ishizawa CI, Davis MF, Himmel ME, Adney WS, Johnson DK: Cellulase digestibility of pretreated biomass is limited by cellulose accessibility. Biotechnol Bioeng 2007, 98(1): 112-122. 10.1002/bit.21408

  30. Mandels M, Weber J, Parizek R: Enhanced cellulase production by a mutant of Trichoderma viride. Appl Environ Microbiol 1971, 21(1): 152-154.

  31. Schimenti J, Garrett T, Montenecourt BS, Eveleigh DE: Selection of hypercellulolytic mutants of Trichoderma reesei based on resistance to nystatin. Mycologia 1983, 75: 876-880. 10.2307/3792779

  32. Sheir-Neiss G, Montenecourt BS: Characterization of the secreted cellulases of Trichoderma reesei wild type and mutants during controlled fermentations. Appl Microbiol Biotechnol 1984, 20: 46-53. 10.1007/BF00254645

  33. Jalak J, Väljamäe P: Mechanism of initial rapid rate retardation in cellobiohydrolase catalyzed cellulose hydrolysis. Biotechnol Bioeng 2010, 106(6): 871-883. 10.1002/bit.22779

  34. Schülein M: Protein engineering of cellulases. Biochim Biophys Acta 2000, 1543(2): 239-252.

  35. Koivula AT, Kinnari T, Harjunpää V, Ruohonen L, Teleman A, Drakenberg T, Rouvinen J, Jones TA, Teeri TT: Tryptophan 272: an essential determinant of crystalline cellulose degradation by Trichoderma reesei cellobiohydrolase Cel6A. FEBS Lett 1998, 429(3): 341-346. 10.1016/S0014-5793(98)00596-1

  36. Wohlfahrt G, Pellikka T, Boer H, Teeri TT, Koivula AT: Probing pH-dependent functional elements in proteins: modification of carboxylic acid pairs in Trichoderma reesei cellobiohydrolase Cel6A. Biochemistry 2003, 42: 10095-10103. 10.1021/bi034954o

  37. Koivula A, Reinikainen T, Ruohonen L, Valkeajärvi A, Claeyssens M, Teleman O, Kleywegt GJ, Szardenings M, Rouvinen J, Jones TA, Teeri TT: The active site of Trichoderma reesei cellobiohydrolase II: the role of tyrosine 169. Protein Eng 1996, 9: 691-699. 10.1093/protein/9.8.691

  38. Koivula A, Ruohonen L, Wohlfahrt G, Reinikainen T, Teeri TT, Piens K, Claeyssens M, Weber M, Vasella A, Becker D, Sinnott ML, Zou J-Y, Kleywegt GJ, Szardenings M, Ståhlberg J, Jones TA: The active site of cellobiohydrolase Cel6A from Trichoderma reesei: the roles of aspartic acids D221 and D175. J Am Chem Soc 2002, 124: 10015-10024. 10.1021/ja012659q

  39. Zou J-Y, Kleywegt GJ, Ståhlberg J, Driguez H, Nerinckx W, Claeyssens M, Koivula A, Teeri TT, Jones TA: Crystallographic evidence for substrate ring distortion and protein conformational changes during catalysis in cellobiohydrolase Cel6A from Trichoderma reesei. Structure 1999, 7(9): 1035-1044. 10.1016/S0969-2126(99)80171-3

  40. Zhang S, Irwin DC, Wilson DB: Site-directed mutation of noncatalytic residues of Thermobifida fusca exocellulase Cel6B. Eur J Biochem 2000, 267: 3101-3115. 10.1046/j.1432-1327.2000.01315.x

  41. Ai Y-C, Zhang S, Wilson DB: Positional expression effects of cysteine mutations in the Thermobifida fusca cellulase Cel6A and Cel6B catalytic domains. Enzyme Microb Technol 2003, 32: 331-336. 10.1016/S0141-0229(02)00276-4

  42. Wilson DB: Cellulases and biofuels. Curr Opin Biotechnol 2009, 20: 295-299. 10.1016/j.copbio.2009.05.007

  43. Heinzelman P, Snow CD, Wu I, Nguyen C, Villalobos A, Govindarajan S, Minshull J, Arnold FH: A family of thermostable fungal cellulases created by structure-guided recombination. Proc Natl Acad Sci USA 2009, 106: 5610-5615. 10.1073/pnas.0901417106

  44. Heinzelman P, Snow CD, Smith MA, Yu X, Kannan A, Boulware K, Villalobos A, Govindarajan S, Minshull J, Arnold FH: SCHEMA recombination of a fungal cellulase uncovers a single mutation that contributes markedly to stability. J Biol Chem 2009, 284: 26229-26233. 10.1074/jbc.C109.034058

  45. Mandels M, Andreotti R, Roche C: Measurement of saccharifying cellulase. Biotechnol Bioeng Symp 1976, 6: 21-33.

  46. Doner LW, Irwin PL: Assay of reducing end-groups in oligosaccharide homologues with 2,2'-bicinchoninate. Anal Biochem 1992, 202: 50-53. 10.1016/0003-2697(92)90204-K

  47. Miller GL: Use of dinitrosalicylic acid reagent for determination of reducing sugar. Anal Chem 1959, 31: 426-428. 10.1021/ac60147a030

  48. Ghose TK: Measurement of cellulase activities. Pure Appl Chem 1987, 59(2): 257-268. 10.1351/pac198759020257

  49. Lever MA: New reaction for colorimetric determination of carbohydrates. Anal Biochem 1972, 47(1): 273-279. 10.1016/0003-2697(72)90301-6

  50. Kuwajima K: Circular dichroism. Methods Mol Biol 1995, 40: 115-135.

  51. Woody RW: Circular dichroism. Methods Enzymol 1995, 246: 34-71.

  52. Kelly SM, Price NC: The application of circular dichroism to studies of protein folding and unfolding. Biochim Biophys Acta 1997, 1338(2): 161-185.

  53. Wood T: Biomass part A: cellulose and hemicellulose. In Methods in Enzymology. Edited by: Wood W, Kellogg S. San Diego, Academic Press; 1988: 19-25.

  54. Schell DJ, Farmer J, Newman M, McMillan JD: Dilute-sulfuric acid pretreatment of corn stover in pilot-scale reactor: investigation of yields, kinetics, and enzymatic digestibilities of solids. Appl Biochem Biotechnol 2003, 105(1-3): 69-85. 10.1385/ABAB:105:1-3:69

  55. Reese ET: History of the cellulase program at the U.S. Army Natick Development Center. Biotechnol Bioeng Symp 1976, 6: 9-20.


Portions of this work were funded by subcontract no. ZCO-0-30017-01 with the National Renewable Energy Laboratory under prime contract no. DE-AC36-99G10337 with the U.S. Department of Energy or under award no: DE-FC36-08GO18078 awarded by the U.S. Department of Energy. Accordingly, the U.S. government may have certain rights. The authors would like to extend their appreciation to Dan Schell, Nancy Dowe-Farmer, and Mike Himmel at NREL; and to Roopa Ghirnikar and the Genencor project team in Palo Alto and Leiden.

Author information


Corresponding author

Correspondence to Suzanne E Lantz.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

SEL conducted cellulase performance assays and mixture experiments; FG developed assays and selected protein engineering sites; BRK developed assays and conducted mixture experiments; TK conducted His-tag comparison and selected engineering sites; CM selected engineering sites and provided technical leadership and coordination; RH coordinated and performed high throughput screening; LW performed DSC analysis and selected engineering sites; JS solved protein structures and selected engineering sites; and EAL developed assays and conducted mixture experiments. All authors read and approved the final manuscript.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Lantz, S.E., Goedegebuur, F., Hommes, R. et al. Hypocrea jecorina CEL6A protein engineering. Biotechnol Biofuels 3, 20 (2010).
