- Open Access
A precise and consistent assay for major wall polymer features that distinctively determine biomass saccharification in transgenic rice by near-infrared spectroscopy
Biotechnology for Biofuels volume 10, Article number: 294 (2017)
The genetic modification of plant cell walls has been considered to reduce lignocellulose recalcitrance in bioenergy crops. As a result, it is important to develop a precise and rapid assay for the major wall polymer features that affect biomass saccharification in a large population of transgenic plants. In this study, we collected a total of 246 transgenic rice plants that, respectively, over-expressed and RNAi silenced 12 genes of the OsGH9 and OsGH10 family that are closely associated with cellulose and hemicellulose modification. We examined the wall polymer features and biomass saccharification among 246 transgenic plants and one wild-type plant. The samples presented a normal distribution applicable for statistical analysis and NIRS modeling.
Among the 246 transgenic rice plants, we found widely varied wall polymer features and biomass enzymatic saccharification after alkali pretreatment of rice straw, particularly for the fermentable hexoses, which ranged from 52.8 to 95.9%. Correlation analysis indicated that crystalline cellulose and lignin levels negatively affected the hexose and total sugar yields released from pretreatment and enzymatic hydrolysis in the transgenic rice plants, whereas the arabinose levels and the degree of arabinose substitution (the inverse of the xylose/arabinose ratio) had positive impacts on the hexose and total sugar yields. Notably, near-infrared spectroscopy (NIRS) was applied to obtain ten equations for predicting biomass enzymatic saccharification and seven equations for predicting major wall polymer features. Most of the equations exhibited high R²/R²cv/R²ev and RPD values, indicating a high predictive capacity.
Due to large generated populations of transgenic rice lines, this study has not only examined the key wall polymer features that distinctively affect biomass enzymatic saccharification in rice but has also established optimal NIRS models for a rapid and precise screening of major wall polymer features and lignocellulose saccharification in biomass samples. Importantly, this study has briefly explored the potential roles of a total of 12 OsGH9 and OsGH10 genes in cellulose and hemicellulose modification and cell wall remodeling in transgenic rice lines. Hence, it provides a strategy for genetic modification of plant cell walls by expressing the desired OsGH9 and OsGH10 genes that could greatly improve biomass enzymatic digestibility in rice.
Lignocellulose represents an enormous biomass resource for biofuels and chemical products. Food crops not only produce grains for human beings, but also provide large amounts of lignocellulose residues [1, 2]. In principle, biomass conversion involves three major steps: initial physical and chemical pretreatment for lignocellulose destruction, subsequent enzymatic hydrolysis to release fermentable sugars, and, finally, yeast fermentation to produce ethanol. However, lignocellulose recalcitrance leads to an unacceptably high cost for biofuel production. To reduce recalcitrance, the genetic modification of plant cell walls has been proposed as a promising solution by selecting transgenic plants that over-express the key genes associated with cell wall biosynthesis and modification [4, 5]. It thus becomes essential to distinguish, among the transgenic plants, the major wall polymer features that largely determine biomass enzymatic saccharification.
Plant cell walls are mainly composed of cellulose, hemicellulose, and lignin with small amounts of pectin and wall proteins. Cellulose is a crystalline polymer composed of β-1,4-glucan chains, and its crystallinity has been characterized as the key feature that negatively affects enzymatic biomass saccharification in the plant species examined [5,6,7]. To reduce cellulose crystallinity, some researchers have considered selecting transgenic plants that over-express GH9 family genes, which encode glycoside hydrolase enzymes specific for β-1,4-glucan modification [2, 8, 9]. As xylan is the major hemicellulose in grass plants, the acetylation of xylan has been reported to negatively affect biomass saccharification and biofuel productivity by hindering the access of cellulase enzymes to the cellulose surface and by producing acetic acid compounds that inhibit yeast fermentation [10, 11]. Furthermore, the degree of arabinose substitution of xylan has been shown to positively impact biomass enzymatic digestibility by reducing the cellulose crystalline index. Since GH10 enzymes are involved in the modification of xylans and other hemicelluloses, over-expressing GH10 genes may alter the xylan structure to promote biomass saccharification in bioenergy crops.
Lignin is a phenylpropane wall polymer consisting of three major monomers: p-hydroxyphenyl (H), guaiacyl (G), and syringyl (S). Lignin is tightly associated with hemicelluloses to maintain plant mechanical strength and biomass recalcitrance, so it is thought to play a negative role in biomass saccharification. More recently, lignin has been reported to play dual roles in biomass enzymatic hydrolysis, probably due to the distinctive proportions of the three monolignols in genetic mutants and transgenic plants.
Rice (Oryza sativa L.) is one of the most important cereal crops around the world, and it also produces approximately 800 million metric tons of lignocellulose-based straw annually for potential bioethanol production. To reduce lignocellulose recalcitrance, we have selected a large population of transgenic rice plants that, respectively, over-expressed seven OsGH9 and five OsGH10 family genes [2, 9, 16]. Since plant cell walls have complicated structures and dynamic networks, developing a precise and rapid approach to identify the key wall polymer features that could greatly enhance biomass saccharification among a large population of biomass samples remains a technical challenge. Moreover, classic laboratory methods for cell wall analysis are time-consuming and costly.
Near-infrared spectroscopy (NIRS) has been applied as a non-destructive and rapid analytical tool to predict sample properties and component compositions. It is very efficient for high-throughput screening of a large population of samples at both qualitative and semi-quantitative levels. For instance, NIRS has been used for high-throughput phenotyping of multiple traits in crop breeding [17,18,19] and has also been applied to predict plant cell wall composition and biomass digestibility in different plant species [20,21,22,23,24,25,26,27,28]. However, due to limited variation in the normal cultivated species, little has been reported about the application of NIRS in a precise assay for both key wall polymer features and biomass enzymatic digestibility in rice.
In this work, we collected hundreds of transgenic rice samples in which a total of 12 typical OsGH9 and OsGH10 genes were over-expressed or knocked down by RNAi. Because those transgenic plants exhibited large variations in cell wall compositions and biomass saccharification, we established optimal equations for NIRS prediction of the major wall polymer features and biomass saccharification. Hence, our study provides a strategy for the genetic modification of plant cell walls by over-expressing or RNAi knocking-down OsGH9 and OsGH10 genes and demonstrates a precise and rapid NIRS assay, which may be applicable for large-scale screening of target traits in bioenergy crops and beyond.
Large variations of cell wall composition in transgenic rice straws
In this study, we collected and determined the cell wall compositions (cellulose, hemicelluloses, and lignin) of a total of 246 transgenic lines and one wild-type plant (Additional file 1: Table S1). As a result, a total of 247 rice straw samples exhibited large variations in three major wall polymer levels (Fig. 1). The cellulose levels varied from 16.6 to 39.9% (Fig. 1a), whereas the levels of the two major monosaccharides (Ara and Xyl) of hemicelluloses showed normal distributions (Fig. 1b). As Xyl/Ara (X/A) is the key parameter that inversely correlates with the degree of Ara substitution of xylan in rice, we also examined X/A and found a normal distribution across the 247 samples. Furthermore, we detected that acid-insoluble lignin (AIL) showed much more variation than acid-soluble lignin (ASL), and the total lignin levels varied from 6.7 to 28.7% (Fig. 1c). Hence, these 246 transgenic rice plants presented a well-suited sample population for the analysis of wall polymer features.
Diverse biomass saccharification in transgenic rice straws
Biomass saccharification (digestibility) has been defined by measuring the yields of pentoses and hexoses released from physical and chemical pretreatment and subsequent enzymatic hydrolysis. In this work, we detected diverse biomass saccharification in the 246 transgenic rice plants and one wild-type plant (Fig. 2). The transgenic biomass samples showed large variations in the yields of pentoses and hexoses released either from 1% NaOH pretreatment or from enzymatic hydrolysis (Fig. 2a). Furthermore, the total yields of pentoses ranged from 11.7 to 21.9% after both pretreatment and enzymatic hydrolysis, whereas the yields of hexoses varied from 20.9 to 42.1%, leading to yields of total sugars (hexoses and pentoses) ranging from 37.2 to 58.4% among 247 samples (Fig. 2b). Because only the hexoses released from enzymatic hydrolysis are fermentable by yeast for ethanol production, we also examined the yields of fermentable hexoses, which showed a normal distribution ranging from 52.8 to 95.9% among the 247 biomass samples (Fig. 2c). Compared with the wild-type plant, several transgenic plants showed much higher biomass digestibility, especially for the fermentable hexoses, which reached as high as 95.9%. This also suggests that over-expressing or RNAi knocking-down OsGH9 and OsGH10 genes may be a potential genetic strategy for greatly improving biomass enzymatic saccharification in rice.
Correlation between wall polymer features and biomass saccharification
Correlation analysis has been shown to account well for the impact of wall polymer features on biomass enzymatic saccharification in different plant species [8, 12, 29]. In this study, we performed a Spearman correlation analysis among three major wall polymer features and the yields of sugars (pentoses, hexoses) released from pretreatment and enzymatic hydrolysis (Fig. 3). The cellulose levels showed a positive correlation with the yield of pentoses, but a negative correlation with the yields of hexoses and total sugars (Fig. 3a). By comparison, the Ara levels and the degree of Ara substitution (inverse X/A) exhibited significant positive correlations with the yields of hexoses and total sugars at p < 0.01 levels in the 246 transgenic rice plants and one wild-type plant (Fig. 3b), consistent with the previous reports in rice mutants. However, both acid-soluble lignin (ASL) and acid-insoluble lignin (AIL) correlated negatively with the yield of hexoses (Fig. 3c), which differs from the previous findings in rice mutants. Overall, all three wall polymer features showed significant correlations with the yields of pentoses, hexoses, or total sugars released by the pretreatment and subsequent enzymatic hydrolysis, suggesting that the polymer features could be employed to predict the biomass saccharification of transgenic rice plants.
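As a minimal sketch of the Spearman analysis described above, the coefficient can be computed as the Pearson correlation of rank-transformed values. The function names and the numbers below are hypothetical illustrations, not data or code from this study, and the sketch does not compute the p values reported in Fig. 3.

```python
from statistics import mean

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman rho: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Hypothetical values for illustration only (not measurements from this work).
lignin_pct = [12.1, 15.4, 18.9, 21.3, 25.0]
hexose_yield_pct = [41.0, 38.2, 33.5, 30.1, 24.7]
rho = spearman(lignin_pct, hexose_yield_pct)  # strictly decreasing, so rho is about -1
```

Working on ranks makes the coefficient insensitive to the non-normal distributions noted for some lignin traits, which is presumably one reason a rank-based test suits this dataset.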
NIRS data in the transgenic rice population
NIRS data were collected in triplicate by recording the reflectance independently using an XDS Rapid Content™ Analyzer (FOSS, Co. LLC, Denmark). The 247 averaged spectra were generated from the 246 transgenic rice plants and one wild-type plant (Fig. 4a). A principal component analysis (PCA) was carried out to eliminate the anomalous samples and reconstruct the spectral population (Fig. 4b). During the PCA process, the dimensionality of the spectral data was reduced by a linear transformation of the original spectral data to generate new variables (principal components), which were orthogonal and uncorrelated to each other. In this study, we generated a total of 63 new variables (principal components) that covered all of the variations in the original spectral population (Fig. 4c). The detailed variances of the spectral population on each principal component are presented in Fig. 4d. The first few components explained most of the variation, and components beyond the 30th contributed little. Finally, only the first 18 of the 63 components were selected to measure the global H (GH); these explained 99.81% of the variation that characterizes the spectral population (Fig. 4e, f). As a result, seven samples were eliminated as GH outliers, leading to a narrower spectral distribution of the remaining 240 samples (Fig. 4b).
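The outlier screening above can be sketched as follows. This is an illustration under stated assumptions, not the WinISI computation: GH is taken here as the squared Mahalanobis distance of a sample from the population centre in principal-component space, divided by the number of retained components, and the cutoff of 3.0 is the one given in the methods. The score values are hypothetical.

```python
from statistics import mean, variance

def global_h(scores):
    """GH per sample from principal-component scores (samples x components).

    Assumption: GH = squared Mahalanobis distance to the population centre
    divided by the number of components. Because PC scores are uncorrelated,
    the distance reduces to a sum of variance-scaled squared deviations.
    """
    n_components = len(scores[0])
    columns = list(zip(*scores))
    centres = [mean(col) for col in columns]
    variances = [variance(col) for col in columns]  # sample variance (n - 1)
    gh_values = []
    for sample in scores:
        d2 = sum((s - c) ** 2 / v for s, c, v in zip(sample, centres, variances))
        gh_values.append(d2 / n_components)
    return gh_values

def split_outliers(scores, cutoff=3.0):
    """Return (kept, dropped) sample indices for a given GH cutoff."""
    gh_values = global_h(scores)
    kept = [i for i, h in enumerate(gh_values) if h <= cutoff]
    dropped = [i for i, h in enumerate(gh_values) if h > cutoff]
    return kept, dropped

# Hypothetical two-component scores; the last sample is deliberately extreme.
scores = [
    [-1.0, -1.0], [1.0, -1.0], [-1.0, 1.0], [1.0, 1.0],
    [-0.5, 0.5], [0.5, 0.5], [-0.5, -0.5], [0.5, -0.5],
    [0.0, 0.0], [0.0, 0.0], [0.0, 0.0],
    [9.0, 0.0],
]
kept, dropped = split_outliers(scores)
```

In this toy population only the last sample exceeds the cutoff; in the study, the same kind of screen over 18 components removed seven of the 247 spectra.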
Calibration and validation sets for the NIRS modeling
A total of 240 samples were selected for modeling, based on the PCA analysis described above. Before calibration, 93 samples were randomly selected to form the external validation sets, and the remaining 147 samples were used for the calibration sets. A frequency distribution was obtained for the chemical values of the calibration and validation sets (Fig. 5). Most of them exhibited a normal distribution (except for AIL and total lignin), which was suitable for statistical analysis and the subsequent calibration. In addition, the calibration and validation sets were compared in terms of the mean, minimum, maximum, and standard deviation values (Additional file 2: Table S2). Therefore, the data demonstrated that the calibration and validation sets were comparable and reliable.
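A random split of this kind, followed by a comparison of summary statistics between the two sets, might look like the sketch below. The sample IDs, the seed, and the helper names are assumptions made for illustration; the text does not describe how the random selection was implemented.

```python
import random
from statistics import mean, stdev

def split_sets(sample_ids, n_validation, seed=0):
    """Randomly hold out n_validation samples; the rest form the calibration set."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    validation = sorted(rng.sample(sample_ids, n_validation))
    held_out = set(validation)
    calibration = [s for s in sample_ids if s not in held_out]
    return calibration, validation

def summarize(values):
    """Mean, minimum, maximum, and standard deviation of one trait."""
    return {"mean": mean(values), "min": min(values),
            "max": max(values), "sd": stdev(values)}

# Placeholder IDs standing in for the 240 samples retained after PCA.
sample_ids = list(range(240))
calibration, validation = split_sets(sample_ids, n_validation=93)
```

Calling `summarize` on a trait's values in each set gives the four statistics that are compared between the calibration and validation sets in Additional file 2: Table S2.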
NIRS modeling for wall polymer features
A principal component analysis (PCA) was further performed for the 147 samples in the calibration sets, and 15 components were obtained that accounted for 99.69% of the variation. The modified partial least squares (MPLS) method implemented in the WinISI III software was applied for the calibration, which yielded the standard error of calibration (SEC) and the coefficient of determination of the calibration (R²). Furthermore, cross validations were performed to evaluate the calibration. During the cross validations, samples from the calibration sets were randomly assigned to four cross-validation groups, yielding the standard error of the cross validations (SECV) and the coefficient of determination of the cross validations (R²cv). Moreover, the ratio performance deviation (RPD) was used to evaluate the predictive capacity of the equation. Hence, the optimal equation could be selected on the basis of high R²cv, R², and RPD values and low SEC and SECV values.
Notably, all the equations for the prediction of wall polymer features exhibited high R²cv, R², and RPD values; in particular, the RPD values ranged from 1.73 to 2.79, indicating a high predictive capacity (Table 1). In detail, the equation for the acid-soluble lignin (ASL) showed the best predictive capacity among the calibrations of lignin content, based on its highest R²cv, R², and RPD values. The crystalline cellulose also showed a strong calibration equation, with an R²cv of 0.87 and an RPD of 2.79. Even though the major hemicellulosic monosaccharides (Xyl, Ara) showed relatively low R², R²cv, and RPD values, the X/A ratio had an RPD value of 1.98, which is sufficient for reasonable predictive capacity.
Furthermore, the 93 randomly selected samples described above were used to evaluate the calibration equations by independent external validation. During the external validation, the coefficient of determination of external validation (R²ev) and the RPD remained the main factors for evaluating the calibration equation (Fig. 6). The equation for crystalline cellulose showed the highest R²ev value, 0.84, with an external-validation RPD of 2.51 (Fig. 6a), consistent with the calibration and cross-validation data. Notably, the equations for X/A and acid-soluble lignin (ASL), two major features for hemicelluloses and lignin, retained high R²ev and RPD values (Fig. 6b, c), suggesting that all the equations may be applicable for the prediction of wall polymer features.
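The evaluation statistics named above can be illustrated with a short sketch. It assumes that R² is the squared Pearson correlation between reference and predicted values, that the cross-validation error is the root-mean-square of the residuals, and that RPD is the standard deviation of the reference values divided by that error; the exact formulas used by WinISI are not given in the text. The MPLS fit itself is not reproduced, and all numbers are hypothetical.

```python
import random
from statistics import mean, stdev

def r_squared(reference, predicted):
    """Squared Pearson correlation between reference and predicted values."""
    mr, mp = mean(reference), mean(predicted)
    cov = sum((r - mr) * (p - mp) for r, p in zip(reference, predicted))
    ss_r = sum((r - mr) ** 2 for r in reference)
    ss_p = sum((p - mp) ** 2 for p in predicted)
    return cov ** 2 / (ss_r * ss_p)

def rms_error(reference, predicted):
    """Root-mean-square prediction error (stand-in for SEC or SECV)."""
    n = len(reference)
    return (sum((p - r) ** 2 for r, p in zip(reference, predicted)) / n) ** 0.5

def rpd(reference, error):
    """Ratio performance deviation: SD of the reference values over the error."""
    return stdev(reference) / error

def cv_groups(n_samples, n_groups=4, seed=0):
    """Randomly assign each sample index to one of n_groups of near-equal size."""
    order = list(range(n_samples))
    random.Random(seed).shuffle(order)
    groups = [0] * n_samples
    for position, index in enumerate(order):
        groups[index] = position % n_groups
    return groups

# Hypothetical reference values and cross-validated predictions (% cellulose).
reference = [30.0, 32.0, 34.0, 36.0, 38.0]
predicted = [31.0, 31.0, 35.0, 35.0, 39.0]
secv = rms_error(reference, predicted)
r2_cv = r_squared(reference, predicted)
rpd_cv = rpd(reference, secv)
groups = cv_groups(147)  # four groups over the 147 calibration samples
```

Because RPD scales the error by the spread of the reference data, a population with wider trait variation can reach a higher RPD at the same absolute error, which is relevant to the comparison with less varied germplasm made in the discussion.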
NIRS modeling for biomass saccharification
A similar calibration analysis was performed for the prediction of biomass saccharification in the transgenic rice plants. In general, the equations calibrated for biomass saccharification showed very high R² values, ranging from 0.89 to 0.98 (Table 2), much higher than those of the wall polymer features described above. The equations also showed high R²cv values, ranging from 0.85 to 0.97 during cross validation, consistent with the calibration data. Notably, the RPD values of the cross validation ranged from 2.56 to 5.89 (Table 2), indicating an excellent predictive capacity.
In addition, external validations were carried out to confirm the predictive capacity of the equations calibrated for biomass saccharification. All the equations showed a high correlation between the predicted values and the reference values measured by conventional analysis methods (Fig. 7). In particular, the yield of hexoses released from pretreatment showed the best R²ev value (Fig. 7b) compared to the yields of pentoses and total sugars released from both the pretreatment and the subsequent enzymatic hydrolysis (Fig. 7a, c). This was consistent with the calibration and cross-validation data, which gave the highest R² and R²cv values at 0.98 and 0.97, respectively (Table 2). Notably, the yield of fermentable hexoses obtained from enzymatic hydrolysis also showed very high values for R² (0.97), R²cv (0.95), and R²ev (0.95) (Table 2 and Fig. 7d).
In conclusion, all the RPD values exceeded 2.0 in both the cross validation and the independent external validation, with a highest RPD value of 6.19, indicating that NIRS is a precise and consistent assay for predicting biomass saccharification in the transgenic rice plants.
Integrative calibrations for both wall polymers and biomass saccharification
An integrative calibration was performed by combining the calibration and validation sets as a new calibration set. As expected, the integrative calibrations were highly correlated with the previous calibrations, and some equations even showed much better parameters (Tables 1, 3). For instance, the equation for Ara (%) showed higher R², R²cv, and RPD values. However, some equations showed worse parameters, such as the equation for the yield of pentoses released from pretreatment, perhaps because many samples had close neighborhood H (NH) values, so that redundant variation negatively affected the calibration. Furthermore, the integrative calibration showed RPD values that ranged from 1.79 to 2.94 for wall polymer features, and retained high RPD values, from 2.39 to 6.63, for the prediction of biomass saccharification. Hence, the NIRS predictions should be applicable for both wall polymer features and biomass saccharification in transgenic rice plants and beyond.
The genetic modification of plant cell walls has been considered to be a promising solution to addressing lignocellulose recalcitrance in bioenergy crops [4, 5]. In general, wall modification involves three strategies: altering the lignin polymer network, increasing the cellulose accessibility, and reducing inhibitors for biomass processing. The genes of at least three groups of enzymes (RWA, AXY9, and TBR/TBL) and one BADH family have been identified for wall polysaccharide modification, in particular for xylan de-acetylation or de-feruloylation. Hence, these wall modifications could lead to either improving cellulase enzyme accessibility to enhance biomass saccharification or reducing the formation of inhibitors for increased ethanol fermentation by yeast [32,33,34,35,36]. In addition, other cell wall modification attempts have been made to enhance the yield of fermentable hexoses by carbon partitioning or glycosyl-transferase engineering [37,38,39,40]. However, because hundreds of genes have been reported to be involved in plant cell wall biosynthesis and modification, it remains difficult to identify the desired genes for transgenic plant selection towards enhancing biomass saccharification with little impact on plant growth in bioenergy crops [2, 41]. Recently, we have identified major wall polymer features that significantly affect biomass enzymatic digestibility using genetic mutants and natural germplasm accessions in rice, wheat, sweet sorghum, Miscanthus, and other grass plants [6,7,8, 12, 42]. Although both OsGH9 and OsGH10 family genes have been shown to play a major role in cellulose and hemicellulose modification, little is yet known about their effects on biomass saccharification in transgenic rice plants and other crops, probably due to the difficulty of selecting the desired genes for genetic manipulation.
More recently, we have selected a large population of transgenic rice plants by, respectively, over-expressing and RNAi silencing 12 representative genes from the OsGH9 and OsGH10 families (Additional file 1: Table S1). Using a total of 246 transgenic rice samples, this study has determined the significant variations in cell wall polymer features and in biomass saccharification, especially for fermentable hexoses, which even reached a highest yield of 95.9%. This suggests that over-expressing or RNAi knocking-down OsGH9 and OsGH10 genes may be a powerful genetic approach to greatly enhance biomass enzymatic saccharification in transgenic rice plants. Furthermore, using those greatly varied transgenic rice samples, we may be able to explore the biological functions and roles of the OsGH9 and OsGH10 families in cellulose and hemicellulose modification and cell wall remodeling in the future. More importantly, we were able to identify the desired OsGH9 and OsGH10 family genes for optimal genetic wall modification by screening out the transgenic rice lines that produce the highest biomass saccharification.
In addition, this work has examined a significant correlation between three wall polymer features and biomass saccharification (Fig. 3), indicating that the sample population of transgenic rice plants is sufficient to identify the key wall polymer features that determine biomass enzymatic hydrolysis. Surprisingly, even though the OsGH9 and OsGH10 families are not directly involved in lignin metabolism, this study showed that the lignin levels, including acid-soluble and acid-insoluble lignin, significantly varied, and found that the lignin levels were significantly correlated with biomass saccharification. This confirms that plant cell walls are dynamic networks and any small wall modification may lead to major wall polymer feature alteration. Therefore, it is important to develop a precise and rapid approach to screen for the desired genes that could greatly enhance biomass saccharification and slightly alter the wall polymer features in transgenic plants.
As a high-throughput screening method, NIRS has been applied to screen for “invisible” phenotypes (wall composition) in maize and to predict cell wall compositions and biomass enzymatic digestibility in Miscanthus germplasm accessions and sweet sorghum mutants [20, 25, 27]. It has also been applied for the analysis of the chemical composition of rice straw, the quantification of the cell wall composition and monosaccharide content [22, 23], and the prediction of lignin syringyl/guaiacyl content. However, the application of NIRS is critically limited by the calibration models, which in principle require large populations of varied samples to be reliable.
Rice (O. sativa L.) is one of the most important cereal crops around the world, but the variations in rice samples are restricted by the currently available rice cultivars. Hence, this study initially attempted to collect rice samples that significantly varied by generating a total of 246 transgenic rice lines that, respectively, over-expressed and RNAi silenced a total of 12 OsGH9 and OsGH10 family genes. Due to the close association of the OsGH9 and OsGH10 families with cellulose and hemicellulose modifications [2, 9], we have observed large variations in the wall polymer features and biomass enzymatic saccharification among the transgenic rice samples.
Using this large population of varied transgenic rice samples, we have performed an NIRS assay for both biomass saccharification and three wall polymer features. Notably, most of the equations established based on the transgenic plants showed much higher R²/R²cv/R²ev and RPD values than those of the Miscanthus germplasm accessions and sweet sorghum genetic mutants. This indicates a precise NIRS assay in transgenic rice plants, probably because the transgenic biomass samples presented more variation in cell wall compositions and biomass saccharification than the Miscanthus accessions and sweet sorghum mutants. Furthermore, this study not only established equations for ten parameters that account for biomass saccharification, but also generated equations for the prediction of seven wall polymer features, in particular the X/A ratio, a key wall polymer feature that determines the biomass saccharification in the grass plants examined. Hence, this study has established a precise and consistent NIRS assay for wall polymer features and biomass saccharification, which may be applicable for large-scale screening of transgenic rice plants.
A total of 246 transgenic rice plants were selected by overexpressing and RNAi silencing 12 representative genes for cellulose and hemicellulose modification from the OsGH9 and OsGH10 families. Large variations in the cell wall compositions and the biomass saccharification were observed in the transgenic rice plants. In particular, the fermentable hexoses ranged from 52.8 to 95.9%, suggesting a potential wall modification strategy to greatly enhance biomass saccharification. Notably, the transgenic rice plants presented a normal distribution of biomass samples, which was suitable for the NIRS analysis. Ten equations were ultimately generated for the prediction of biomass saccharification, and seven equations were generated for the key wall polymer feature analysis. Given the high R²/R²cv/R²ev and RPD values in most of the equations, this study has demonstrated a precise and consistent NIRS assay for the large-scale screening of transgenic and other bioenergy crops.
A total of seven OsGH9 and five OsGH10 genes were transformed into Nipponbare (NPB) using overexpressing and RNAi silencing constructs (Additional file 1: Table S1). A total of 246 transgenic lines and one wild-type plant (NPB) were harvested from the experimental field of Huazhong Agriculture University (Wuhan), and the mature stem tissues were collected and dried at 50 °C. The dried tissues were ground through 40 mesh and stored in a dry container until use.
Plant cell wall polymer determination
The dried biomass samples were extracted twice with acetic-nitric acid–water (8:1:2) at 100 °C for 1 h. After centrifugation at 3000×g for 10 min, the remaining residues were washed five times with distilled water and collected as the cellulose fraction. The cellulose samples were dissolved in 72% sulfuric acid (w/w) and the total hexoses were determined using the anthrone/H2SO4 method described below. Three biological replicates were performed for each sample.
Lignin was assayed using a two-step acid hydrolysis method. The samples were hydrolyzed with 72% (w/w) sulfuric acid at 30 °C for 90 min with gentle shaking at 115 rpm, and subsequently diluted to 3.97% (w/w) with distilled water and heated at 115 °C for 60 min. The supernatant liquids were measured at 205 nm for acid-soluble lignin (ASL), and the remaining residues were placed in a muffle furnace at 575 ± 25 °C for 4 h for the acid-insoluble lignin (AIL) assay. For the determination of hemicellulosic monosaccharides, the acid-soluble supernatant from the acid-soluble lignin determination described above was analyzed by GC–MS as described by Li et al.
Determination of biomass saccharification
The chemical pretreatments and the subsequent enzymatic hydrolysis were performed as previously described by Huang et al. with minor modifications. The biomass sample was added to 6 mL of 1% NaOH (w/v). The sample tube was shaken at 150 rpm for 2 h at 50 °C, and centrifuged at 3000×g for 5 min. The supernatant was collected for determination of the total sugars released from the alkali pretreatment, and the pellet was washed five times with 10 mL distilled water and washed once with 10 mL of mixed cellulase reaction buffer (0.2 M acetic acid–sodium acetate, pH 4.8). Then, the samples were added to 6 mL of 0.16% (w/v) mixed-cellulases (containing ≥ 6 × 10⁴ U of β-glucanase, ≥ 600 U of cellulase, and ≥ 10 × 10⁴ U of xylanase from Imperial Jade Biotechnology Co., Ltd. Ningxia 750002, China). During the enzymatic hydrolysis, the samples were shaken at 150 rpm for 48 h at 50 °C. After centrifugation at 3000×g for 10 min, the supernatant was collected for the total sugar determination.
Colorimetric assay of total hexoses and pentoses
A UV/VIS spectrometer (Shanghai MAPADA Instruments Co., Ltd. V-1100D) was used for the absorbance reading. Hexoses were detected by the anthrone/H2SO4 method and pentoses were detected by the orcinol/HCl method. Anthrone was purchased from Sigma-Aldrich Co. LLC. Ferric chloride and orcinol were purchased from Sinopharm Chemical Reagent Co., Ltd. The standard curves for hexoses and pentoses were obtained using D-glucose and D-xylose (purchased from Sinopharm Chemical Reagent Co., Ltd.), respectively.
NIRS calibration and data acquisition
The near-infrared spectral data collection was performed using an XDS Rapid Content™ Analyzer (FOSS, Co. LLC, Denmark) as described by Huang et al. The WinISI III software package (Version 1.50e, Infrasoft International LLC) was used for the calibration and the acquisition of data as described by Huang et al. with minor modifications.
Briefly, a principal component analysis (PCA) was carried out to structure and assess the variability of the spectral population before calibration. The GH outlier (GH > 3.0) samples were eliminated after the PCA. The modified partial least squares (MPLS) method was used to generate a prediction equation. In MPLS, the near-infrared spectral residuals at each wavelength, obtained after calculating each factor, were standardized (divided by the standard deviation of the residuals at that wavelength) before calculating the next factor. During calibration, eight derivative treatments were used: “0,0,1,1”, “1,4,4,1”, “2,4,4,1”, “2,4,4,2”, “2,5,5,2”, “2,8,8,1”, “2,8,8,2”, and “2,10,10,2”, where the first digit is the order of the derivative, the second is the gap over which the derivative is calculated, the third is the number of data points in the first smoothing, and the fourth is the number of data points in the second smoothing. Five scatter correction methods were applied to remove artifacts and imperfections from the data, namely, standard normal variate (SNV), detrend only (DET), standard multiple scatter correction (MSC), a combination of SNV and DET (SNVD), and weighted multiple scatter correction (WMSC). Three wavelength ranges (408–2492, 780–2492, and 1108–2492 nm) were tested to obtain the best calibration equation.
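Two of the preprocessing ideas above lend themselves to a brief sketch: reading a four-number derivative treatment such as “2,4,4,1”, and applying standard normal variate (SNV) correction to a single spectrum. The class and function names are hypothetical, the SNV sketch uses the sample standard deviation as an assumption, and the WinISI derivative and smoothing routines themselves are not reproduced.

```python
from statistics import mean, stdev
from typing import NamedTuple

class MathTreatment(NamedTuple):
    """Fields of a treatment code, in the order described in the text."""
    derivative: int  # order of the derivative
    gap: int         # gap over which the derivative is calculated
    smooth1: int     # first smoothing
    smooth2: int     # second smoothing

def parse_treatment(code):
    """Parse a code such as '2,4,4,1' into its four named fields."""
    parts = [int(p) for p in code.split(",")]
    if len(parts) != 4:
        raise ValueError(f"expected four comma-separated integers, got {code!r}")
    return MathTreatment(*parts)

def snv(spectrum):
    """Standard normal variate: centre and scale one spectrum by its own
    mean and (sample) standard deviation across wavelengths."""
    m = mean(spectrum)
    s = stdev(spectrum)
    return [(value - m) / s for value in spectrum]

# The eight treatment codes listed in the text.
codes = ["0,0,1,1", "1,4,4,1", "2,4,4,1", "2,4,4,2",
         "2,5,5,2", "2,8,8,1", "2,8,8,2", "2,10,10,2"]
treatments = [parse_treatment(c) for c in codes]

# Hypothetical three-point "spectrum" to show the effect of SNV.
corrected = snv([1.0, 2.0, 3.0])  # [-1.0, 0.0, 1.0]
```

SNV operates on each spectrum independently, so it needs no reference spectrum, whereas MSC-type corrections are usually computed relative to a mean spectrum of the population.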
- PCA: principal component analysis
- SEC: standard error of calibration
- R²: coefficient of determination of the calibration
- SECV: standard error of cross-validation
- R²cv: coefficient of determination of cross-validation
- RPD: ratio performance deviation
- R²ev: coefficient of determination of external validation
- SNV: standard normal variate
- MSC: standard multiple scatter correction
- SNVD: a combination of standard normal variate and detrend
- WMSC: weighted multiple scatter correction
Service RF. Biofuel researchers prepare to reap a new harvest. Science. 2007;315:1488–91.
Wang Y, Fan C, Hu H, Li Y, Sun D, Wang YM, Peng L. Genetic modification of plant cell walls to enhance biomass yield and biofuel production in bioenergy crops. Biotechnol Adv. 2016;34:997–1017.
Taherzadeh MJ, Karimi K. Pretreatment of lignocellulosic wastes to improve ethanol and biogas production: a review. Int J Mol Sci. 2008;9:1621–51.
Xie G, Peng L. Genetic engineering of energy crops: a strategy for biofuel production in China. J Integr Plant Biol. 2011;53:143–50.
Loqué D, Scheller HV, Pauly M. Engineering of plant cell walls for enhanced biofuel production. Curr Opin Plant Biol. 2015;25:151–61.
Xu N, Zhang W, Ren S, Liu F, Zhao C, Liao H, Xu Z, Huang J, Li Q, Tu Y, Yu B, Wang Y, Jiang J, Qin J, Peng L. Hemicelluloses negatively affect lignocellulose crystallinity for high biomass digestibility under NaOH and H2SO4 pretreatments in Miscanthus. Biotechnol Biofuels. 2012;5:58.
Wu Z, Zhang M, Wang L, Tu Y, Zhang J, Xie G, Zhou W, Li F, Guo K, Li Q, Gao C, Peng L. Biomass digestibility is predominantly affected by three factors of wall polymer features distinctive in wheat accessions and rice mutants. Biotechnol Biofuels. 2013;6:183.
Zhang W, Yi Z, Huang J, Li F, Hao B, Li M, Hong S, Lv Y, Sun W, Ragauskas A, Hu F, Peng J, Peng L. Three lignocellulose features that distinctively affect biomass enzymatic digestibility under NaOH and H2SO4 pretreatments in Miscanthus. Bioresour Technol. 2013;130:30–7.
Xie G, Yang B, Xu Z, Li F, Guo K, Zhang M, Wang L, Zou W, Wang Y, Peng L. Global identification of multiple OsGH9 family members and their involvement in cellulose crystallinity modification in rice. PLoS ONE. 2013;8:e50171.
Chen X, Shekiro J, Elander R, Tucker M. Improved xylan hydrolysis of corn stover by deacetylation with high solids dilute acid pretreatment. Ind Eng Chem Res. 2012;51:70–6.
de Carvalho DM, Sevastyanova O, Penna LS, da Silva BP, Lindstrom ME, Colodette JL. Assessment of chemical transformations in eucalyptus, sugarcane bagasse and straw during hydrothermal, dilute acid, and alkaline pretreatments. Ind Crops Prod. 2015;73:118–26.
Li F, Ren S, Zhang W, Xu Z, Xie G, Chen Y, Tu Y, Li Q, Zhou S, Li Y, Tu F, Liu L, Wang Y, Jiang J, Qin J, Li S, Li Q, Jing HC, Zhou F, Gutterson N, Peng L. Arabinose substitution degree in xylan positively affects lignocellulose enzymatic digestibility after various NaOH/H2SO4 pretreatments in Miscanthus. Bioresour Technol. 2013;130:629–37.
Ding SY, Liu YS, Zeng Y, Himmel ME, Baker JO, Bayer EA. How does plant cell wall nanoscale architecture correlate with enzymatic digestibility? Science. 2012;338:1055–60.
Li Z, Zhao C, Zha Y, Wan C, Si S, Liu F, Zhang R, Li F, Yu B, Yi Z, Xu N, Peng L, Li Q. The minor wall-networks between monolignols and interlinked-phenolics predominantly affect biomass enzymatic digestibility in Miscanthus. PLoS ONE. 2014;9:e105115.
Domínguez-Escribà L, Porcar M. Rice straw management: the big waste. Biofuels Bioprod Biorefin. 2009;4:154–9.
Li F, Zhang M, Guo K, Hu Z, Zhang R, Feng Y, Yi X, Zou W, Wang L, Wu C, Tian J, Lu T, Xie G, Peng L. High-level hemicellulosic arabinose predominately affects lignocellulose crystallinity for genetically enhancing both plant lodging resistance and biomass enzymatic digestibility in rice mutants. Plant Biotechnol J. 2015;13:514–25.
Cabrera-Bosquet L, Crossa J, von Zitzewitz J, Serret MD, Araus JL. High-throughput phenotyping and genomic selection: the frontiers of crop breeding converge. J Integr Plant Biol. 2012;54:312–20.
Jasinski S, Lécureuil A, Durandet M, Bernard-Moulin P, Guerche P. Arabidopsis seed content QTL mapping using high-throughput phenotyping: the assets of near infrared spectroscopy. Front Plant Sci. 2016;7:1682.
Penning BW, Hunter CT 3rd, Tayengwa R, Eveland AL, Dugard CK, Olek AT, Vermerris W, Koch KE, McCarty DR, Davis MF, Thomas SR, McCann MC, Carpita NC. Genetic resources for maize cell wall biology. Plant Physiol. 2009;151:1703–28.
Slavov G, Allison G, Bosch M. Advances in the genetic dissection of plant cell walls: tools and resources available in Miscanthus. Front Plant Sci. 2013;4:217.
Jin S, Chen H. Near-infrared analysis of the chemical composition of rice straw. Ind Crops Prod. 2007;26:207–11.
Lomborg CJ, Thomsen MH, Jensen ES, Esbensen KH. Power plant intake quantification of wheat straw composition for 2nd generation bioethanol optimization—a near infrared spectroscopy (NIRS) feasibility study. Bioresour Technol. 2010;101:1199–205.
Smith-Moritz AM, Chern M, Lao J, Sze-To WH, Heazlewood JL, Ronald PC, Vega-Sánchez ME. Combining multivariate analysis and monosaccharide composition modeling to identify plant cell wall variations by Fourier transform near infrared spectroscopy. Plant Methods. 2011;7:26.
Horikawa Y, Imai T, Takada R, Watanabe T, Takabe K, Kobayashi Y, Sugiyama J. Near-infrared chemometric approach to exhaustive analysis of rice straw pretreated for bioethanol conversion. Appl Biochem Biotechnol. 2011;164:194–203.
Huang J, Xia T, Li A, Yu B, Li Q, Tu Y, Zhang W, Yi Z, Peng L. A rapid and consistent near infrared spectroscopic assay for biomass enzymatic digestibility upon various physical and chemical pretreatments in Miscanthus. Bioresour Technol. 2012;121:274–81.
Lupoi JS, Singh S, Davis M, Lee DJ, Shepherd M, Simmons BA, Henry RJ. High-throughput prediction of eucalypt lignin syringyl/guaiacyl content using multivariate analysis: a comparison between mid-infrared, near-infrared, and Raman spectroscopies for model development. Biotechnol Biofuels. 2014;7:93.
Wu L, Li M, Huang J, Zhang H, Zou W, Hu S, Li Y, Fan C, Zhang R, Jing H, Peng L, Feng S. A near infrared spectroscopic assay for stalk soluble sugars, bagasse enzymatic saccharification and wall polymers in sweet sorghum. Bioresour Technol. 2015;177:118–24.
Payne CE, Wolfrum EJ. Rapid analysis of composition and reactivity in cellulosic biomass feedstocks with near-infrared spectroscopy. Biotechnol Biofuels. 2015;8:43.
Li M, Feng SQ, Wu LM, Li Y, Fan CF, Zhang R, Zou W, Tu Y, Jing H, Li S, Peng L. Sugar-rich sweet sorghum is distinctively affected by wall polymer features for biomass digestibility and ethanol fermentation in bagasse. Bioresour Technol. 2014;167:14–23.
Wold S, Geladi P, Esbensen K. Multiway principal components and PLS analysis. J Chemom. 1987;1:41–56.
Williams PC, Sobering DC. How do we do it: a brief summary of the methods we use in developing near infrared calibration. In: Davies AMC, Williams P, editors. Near infrared spectroscopy: the future waves. Chichester: NIR Publications; 1996. p. 185–8.
Manabe Y, Nafisi M, Verhertbruggen Y, Orfila C, Gille S, Rautengarten C, Cherk C, Marcus SE, Somerville S, Pauly M, Knox JP, Sakuragi Y, Scheller HV. Loss-of-function mutation of reduced wall acetylation2 in Arabidopsis leads to reduced cell wall acetylation and increased resistance to Botrytis cinerea. Plant Physiol. 2011;155:1068–78.
Schultink A, Naylor D, Dama M, Pauly M. The role of the plant-specific ALTERED XYLOGLUCAN9 protein in Arabidopsis cell wall polysaccharide O-acetylation. Plant Physiol. 2015;167:1271–83.
Xiong G, Cheng K, Pauly M. Xylan O-acetylation impacts xylem development and enzymatic recalcitrance as indicated by the Arabidopsis mutant tbl29. Mol Plant. 2013;6:1373–5.
Zhang B, Zhang L, Li F, Zhang D, Liu X, Wang H, Xu Z, Chu C, Zhou Y. Control of secondary cell wall patterning involves xylan deacetylation by a GDSL esterase. Nat Plants. 2017;3:17017.
Bartley LE, Peck ML, Kim SR, Ebert B, Manisseri C, Chiniquy DM, Sykes R, Gao L, Rautengarten C, Vega-Sánchez ME, Benke PI, Canlas PE, Cao P, Brewer S, Lin F, Smith WL, Zhang X, Keasling JD, Jentoff RE, Foster SB, Zhou J, Ziebell A, An G, Scheller HV, Ronald PC. Overexpression of a BAHD acyltransferase, OsAt10, alters rice cell wall hydroxycinnamic acid content and saccharification. Plant Physiol. 2013;161:1615–33.
Coleman HD, Yan J, Mansfield SD. Sucrose synthase affects carbon partitioning to increase cellulose production and altered cell wall ultrastructure. Proc Natl Acad Sci. 2009;106:13118–23.
Sahoo DK, Stork J, DeBolt S, Maiti IB. Manipulating cellulose biosynthesis by expression of mutant Arabidopsis proM24:CESA3(ixr1-2) gene in transgenic tobacco. Plant Biotechnol J. 2013;11:362–72.
Li F, Xie G, Huang J, Zhang R, Li Y, Zhang M, Wang Y, Li A, Li X, Xia T, Qu C, Hu F, Ragauskas AJ, Peng L. OsCESA9 conserved-site mutation leads to largely enhanced plant lodging resistance and biomass enzymatic saccharification by reducing cellulose DP and crystallinity in rice. Plant Biotechnol J. 2017;15:1093–104.
Fan C, Feng S, Huang J, Wang Y, Wu L, Li X, Wang L, Tu Y, Xia T, Li J, Cai X, Peng L. AtCesA8-driven OsSUS3 expression leads to largely enhanced biomass saccharification and lodging resistance by distinctively altering lignocellulose features in rice. Biotechnol Biofuels. 2017;10:221.
Guo K, Zou W, Feng Y, Zhang M, Zhang J, Tu F, Xie G, Wang L, Wang Y, Klie S, Persson S, Peng L. An integrated genomic and metabolomic framework for cell wall biology in rice. BMC Genom. 2014;15:596.
Sun D, Alam A, Tu Y, Zhou S, Wang Y, Xia T, Huang J, Li Y, Wei X, Hao B, Peng L. Steam-exploded biomass saccharification is predominately affected by lignocellulose porosity and largely enhanced by Tween-80 in Miscanthus. Bioresour Technol. 2017;239:74–81.
Sluiter A, Hames B, Ruiz R, Scarlata C, Sluiter J, Templeton D, Crocker D. Determination of structural carbohydrates and lignin in biomass. Tech. Rep. NREL/TP-510-42618, NREL, Golden, Co. 2008.
Fry SC. The growing plant cell wall: chemical and metabolic analysis. London: Longman; 1988.
Dische Z. Color reactions of carbohydrates. In: Whistler RL, Wolfrom ML, editors. Methods in carbohydrate chemistry. New York: Academic Press; 1962. p. 477–512.
Shenk JS, Westerhaus MO. Population structuring of near infrared spectra and modified partial least squares regression. Crop Sci. 1991;31:1548–55.
JH completed the major experiments and wrote the manuscript. YL, YC, YW, and ML participated in transgenic plant selection and biomass digestibility analysis. YW, RZ, SZ, and JL participated in the determination of cell wall components. YT and BH analyzed the data. LP and TX designed the project, supervised the experiments, interpreted the data, and finalized the manuscript. All authors read and approved the final manuscript.
This work was supported in part by grants from the National Science Foundation of China (31670296; 31571721), Fundamental Research Funds for the Central Universities of China (2662015PY018), the National 111 Project (B08032), the Earmarked Fund for China Agriculture Research System (CARS-31-02), and the National Transgenic Project (2009ZX08009-119B).
The authors declare that they have no competing interests.