Comparative transcriptome analysis to investigate the high starch accumulation of duckweed (Landoltia punctata) under nutrient starvation
- Xiang Tao†1, 2,
- Yang Fang†1,
- Yao Xiao1,
- Yan-ling Jin1,
- Xin-rong Ma1,
- Yun Zhao2,
- Kai-ze He1,
- Hai Zhao1Email author and
- Hai-yan Wang2Email author
© Tao et al.; licensee BioMed Central Ltd. 2013
Received: 1 February 2013
Accepted: 1 May 2013
Published: 8 May 2013
Duckweed can thrive on anthropogenic wastewater and produce tremendous biomass production. Due to its relatively high starch and low lignin percentage, duckweed is a good candidate for bioethanol fermentation. Previous studies have observed that water devoid of nutrients is good for starch accumulation, but its molecular mechanism remains unrevealed.
This study globally analyzed the response to nutrient starvation in order to investigate the starch accumulation in duckweed (Landoltia punctata). L. punctata was transferred from nutrient-rich solution to distilled water and sampled at different time points. Physiological measurements demonstrated that the activity of ADP-glucose pyrophosphorylase, the key enzyme of starch synthesis, as well as the starch percentage in duckweed, increased continuously under nutrient starvation. Samples collected at 0 h, 2 h and 24 h time points respectively were used for comparative gene expression analysis using RNA-Seq. A comprehensive transcriptome, comprising of 74,797 contigs, was constructed by a de novo assembly of the RNA-Seq reads. Gene expression profiling results showed that the expression of some transcripts encoding key enzymes involved in starch biosynthesis was up-regulated, while the expression of transcripts encoding enzymes involved in starch consumption were down-regulated, the expression of some photosynthesis-related transcripts were down-regulated during the first 24 h, and the expression of some transporter transcripts were up-regulated within the first 2 h. Very interestingly, most transcripts encoding key enzymes involved in flavonoid biosynthesis were highly expressed regardless of starvation, while transcripts encoding laccase, the last rate-limiting enzyme of lignifications, exhibited very low expression abundance in all three samples.
Our study provides a comprehensive expression profiling of L. punctata under nutrient starvation, which indicates that nutrient starvation down-regulated the global metabolic status, redirects metabolic flux of fixed CO2 into starch synthesis branch resulting in starch accumulation in L. punctata.
KeywordsDuckweed Transcriptome Bioethanol Nutrient starvation Starch accumulation Metabolic flux
Liquid biofuels, such as bioethanol, converted from biomass are considered as a promising alternative for traditional fossil fuels. Biofuels development can reduce greenhouse gas emission and meet the world's rapidly growing demand for energy. Currently, bioethanol is mainly produced from feedstocks with relatively high starch or sugar percentage, such as corn, sugarcane, sweet potato and cassava [1–3]. However, these bioethanol production modes have some inherent problems, including the adverse impacts on food security, environment and insufficient agricultural land [4, 5]. Although lignocellulosic sources are also considered as a promising feedstock for bioethanol production, there are several obstacles, such as the lack of an efficient, economical and environment friendly pretreatment process, that still needed to be overcome . Therefore, developing sustainable feedstocks and processing protocols for biofuel production is becoming more and more urgent.
One alternative feedstock is duckweed (Lemnacecae family), a small flowering plant that has a global adaptability across a broad range of climates [7, 8] and can be easily found in quiescent or slowly flowing and polluted water bodies worldwide . Duckweed has a longer yearly production period than most of other plants, and even grows year-round in some areas with a warm climate , which make it a potential sustainable feedstock for industrial application. Previous studies indicated that this plant produces biomass faster than any other flowering plant . With near-exponential growth rates and the shortest doubling times [12–14], duckweed can achieve a biomass of 0.5 to 1.5 metric tons/hectare/day fresh weight or 13 to 38 metric tons/hectare/year dry weight . This accompanies with the tremendous amount of CO2 sequestered from the atmosphere and the natural ability of duckweed to thrive on eutrophic wastewater, and recover polluting nutrients [14, 16–23], suggesting that growing duckweed for biomass can have large beneficial environmental impacts. In warm seasons, duckweed can remove up to 85% of total Kjehldahl nitrogen (TKN) and 78% of total phosphorous (TP) . Importantly, duckweed biomass exhibits good characteristics for bioethanol production due to its relatively high starch and low lignin percentage [24–27]. High starch accumulation is the most important trait of this crop. Depending on the duckweed species and growing conditions, starch percentage of duckweed ranges from 3% to 75% (Dry weight, DW) . Under nutrient-rich growth conditions, duckweed has a relatively low starch percentage. But by manipulating growing conditions, including the adjustment of pH, phosphate concentration, and nutrient status, starch percentage of duckweed can be significantly increased [26, 27, 29]. It is encouraging that published studies have demonstrated that carbohydrate derived from duckweed could be converted to ethanol and butanol efficiently [26, 27, 30–32]. As a novel feedstock for bioenergy production, duckweed has recently gained interest from researchers and governments. In 2009, the United States Department of Energy supported a project to sequence the genome of Spirodela polyrhiza (CSP2009, CSP_LOI_793583). Supported by the Minister of Science and Technology, China launched a project to produce liquid biofuel from duckweed biomass, and the first international conference on duckweed application and research was held in Chengdu, China in October, 2011 .
Duckweed was once an important model system for plant biology before the days of Arabidopsis[33–36] and facilitated important advances in plant biology, such as contributions to our understanding of photoperiod control of flowering [34, 37, 38], sulfur metabolic pathways and auxin biosynthesis . With the advent of modern genetics and genomics, the status of duckweed as model plant was replaced by other plants, such as Arabidopsis and rice, mainly because the exceedingly tiny size and infrequent flowers made genetics studies and breeding in duckweed difficult. With its new value rediscovered, molecular research on duckweed has been carried out again. Several whole chloroplast genome data of duckweed were released [39, 40], nuclear genome size were measured by flow cytometry (FCM) , and DNA barcoding technology was developed to distinguish different species in Lemnacecae family [42–44]. However, as a potential energy crop, functional genomics and transcriptomics data for duckweed are urgently needed.
Next-generation sequencing (NGS) provides a novel method to uncover transcriptomics data [45, 46]. This technology shows major advantages in robustness, resolution and inter-lab portability over several microarray platforms . These NGS platforms can detect millions of transcripts and can be used for new gene discovery and expression profiling independent of a reference genome [48–50]. In this study, in order to construct a comprehensive transcriptome and investigate the molecular mechanism behind the starch accumulation in L. punctata 0202, samples collected at 0 h, 2 h and 24 time points respectively after fronds were shifted from Hoagland nutrient solution to distilled water were used for a high throughput RNA-Seq analysis. Paired-end (PE) reads from the RNA-Seq were then de novo assembled to build a duckweed transcriptome, which was further used for comparative analysis to reveal the expression patterns of this gene set. This analysis gives a preliminary but global insight into the possible molecular mechanism of starch accumulation, and provides large amount of information for future basic research in duckweed.
Effect of nutrient starvation on L. punctata growth and starch accumulation
Sequencing, de novo assembly and functional annotation of L. punctata transcriptome
Assembly quality statistics of the L. punctata transcriptome
PE read number
Contig ≥2000 bp
Contig ≥1000 bp
Average length (bp)
Max length (bp)
N50 length (bp)
Sequence similarity search against the Non-redundant protein database (NR) of NCBI [http://www.ncbi.nlm.nih.gov/] was conducted by a locally installed blast program to investigate functional annotation of each contig. Of the 74,797 contigs, 51,968 (69.5%) had significant BLASTX hits and matched 25,581 unique protein accessions (Additional file 1: Table S1). For contigs with length ≥300 bp, 80.5% of them had BLASTX hits, and for contigs with length ≥600 bp, this ratio was 93.8% (Figure 2). These results indicated that most of these contigs were protein-encoding transcripts. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) annotation were combined with the BLASTX results to give comprehensive functional information for each transcript. In total, we got 105,616 GO annotations for 40,618 transcripts, and 911 unique Enzyme Codes (ECs) number for 8,697 transcripts covering 134 pathways.
Global molecular characterization of L. punctata transcriptome
All PE reads were pooled together for GC percentage statistics by EMBOSS  on the Galaxy website [http://main.g2.bx.psu.edu/] [55, 56]. It demonstrated that GC percentage of reads is 56.6%, while the transcriptomic GC percentage calculated from the 74,797 contigs is 52.3%. Scanning Open Reading Frames (ORFs) of all contigs, we found there were 40,767 ORFs with length ≥600 bp, and GC percentage of coding regions is 55.2%. For the first position of codon, GC accounts 58.2%. The highest GC percentage presents in the third position, while GC of the second position accounts only 45.0%. We also analyzed the codon usage of these 40,767 ORFs (Additional file 2: Table S2). Results indicated that the most frequently used stop codon in L. punctata is TGA, which presents in 55.0% of all ORFs, the second one is TAG (23.9%) and TAA is the least one (21.0%). According to the codon usage analysis, we found that the most abundant amino acids in L. punctata are non-polar amino acids (40.5%), and then the uncharged polar amino acids (33.4%), while the acidic and basic amino acids account for 11.7% and 14.5%, respectively (Additional file 2: Table S2).
SSR identification of L. punctata
Library size (bp)
SSR density (/Mbp)
SSR containing contig
SSR density (/contig)
Contig containing more than 1 SSR
Compound formation SSR
Differential expression between the samples and qRT-PCR validation
To validate the expression patterns obtained from the comparative RNA-Seq studies, 50 transcripts were selected randomly from the annotated transcripts set for qRT-PCR analysis (Additional file 4: Table S6). The results showed that the expression patterns of 45 of them were consistent with the RNA-Seq analysis, confirming that different expression analysis based on high-throughput RNA sequencing gave reliable expression data in this study.
Functional classification of DETs
Functional enrichment of differentially expressed transcripts among three samples
0 vs 2
primary metabolic process
interspecies interaction between organisms
0 vs 24
primary metabolic process
cellular metabolic process
2 vs 24
primary metabolic process
cellular metabolic process
Expression analysis of photosynthesis-related transcripts
Expression analysis of transporters-encoding transcripts
Expression pattern analysis showed that nutrient starvation resulted in increased expression of most transporters-encoding transcripts, especially some high-affinity transporters-encoding transcripts (Additional file 4: Table S4). Among the 2,558 DETs identified between 0 h and 2 h, 126 DETs encode transporter, including two phosphate transporters, six nitrate transporters, two potassium transporters, six sulfate transporters, five magnesium transporters and three amino acid transporters. For example, two high affinity nitrate transporter-encoding transcripts (comp17450_c0_seq1; comp34106_c0_seq2) had expression levels of 158 FPKM and 93 FPKM at 0 h, but increased to 812 FPKM and 430 FPKM at 2 h. Expression level of a high affinity inorganic phosphate transporter-encoding transcript, comp27235_c1_seq1, was significantly up-regulated from the initial 2 FPKM to 13 FPKM at 2 h. Although most of these transporter-encoding transcripts were up-regulated in 2 h, expression level of two highly-expressed amino acid transporters-encoding transcripts (comp28314_c0_seq3, comp28314_c0_seq4) were down-regulated at least 4 times. Among the 5,399 DETs identified between 0 h and 24 h, 246 transcripts were annotated as transporters. These results suggested that in response to nutrient starvation, L. punctata promoted the expression of some ion transporters, especially several high affinity transporters, to adapt itself to extremely low mineral concentrations.
Expression analysis of transcript encoding key enzymes involved in starch accumulation
Expression analysis of transcript-encoding key enzymes involved in respiration
Alpha- and beta-amylase are enzymes that can degrade starch to convert it into monosaccharide. Lower expression of these two types of amylase may indirectly decrease the utilization of starch and affect downstream metabolism, such as respiration and some other sugar dependent processes. To investigate whether nutrient stress resulted in a suppression of the starch utilization metabolism, we further studied expression patterns of regulatory enzymes involved in glycolysis and tricarboxylic acid cycle. Interestingly, no significant difference were observed for all of these regulators except the pyruvate kinase (EC: 126.96.36.199; PK; comp17122_c0_seq1) (Additional file 1: Table S1), for which the expression level increased from 53 FPKM to 109 FPKM by 24 h. However, ATP synthase (EC 188.8.131.52; ATPase), which is the energy currency maker that uses the proton gradient generated by the degradation of glucose and electron transfer process to drive the synthesis of ATP, was impacted by nutrient starvation. Usually, ATPase is consisted of two linked multi-subunit complexes, the soluble catalytic core F1, and the membrane-spanning component Fo. The regulatory and catalytic core of F1, alpha and beta subunits, were both down-regulated in L. punctata in this study (Figure 4). We identified four alpha subunits and four beta subunits in the transcriptome. All of the four alpha subunits (comp26885_c0_seq1; comp26885_c0_seq2; comp26885_c0_seq3; comp31538_c0_seq1) were down-regulated at least 2 times, while just one beta subunit (comp34876_c0_seq1) was down-regulated. However, the changed beta subunit had much higher expression abundance than the other three. These changes of the energy currency maker may form a bottleneck and affect some ATP dependent metabolic reactions. We also measured the respiration rate of L. punctata under nutrient starvation to verify the hypotheses described here. The results showed that the respiration rate of L. punctata decreased to 76.6% and 57.2% at 24 h and 168 h respectively compared to that at 0 (Figure 1).
Lignin and flavonoid biosynthesis of L. punctata
Genomics and transcriptomics research of duckweed
Published genomic studies of duckweed mainly focus on chloroplast genome and genome size of different duckweed species [39–41]. However, the sequence of nuclear genome and transcriptome of duckweed is still unrevealed, resulting in the shortage of genetic information, which is still one of the largest obstacles for the development of this crop.
NGS is a cost effective technology to identify millions of transcripts expressed under specific growth conditions and has been widely used since its inception at 2005 [69, 70]. In this study, we constructed a comprehensive 87.2 Mbp transcriptome of L. punctata by assembling 10.7 Gbp RNA-Seq PE read data. As 95.4% of these PE reads can be aligned back to the transcriptome, the corresponding coverage of RNA-Seq PE reads was 115.2. Based on the findings that the genome size of L. punctata ranges from 372 to 397 Mbp , 10.7 Gbp PE reads used in this study was sufficient enough to cover almost all of the transcripts expressed in L. punctata. Using the de novo assembly strategy, a database with 74,797 contigs was established, and 51,968 (69.5%) of them received annotation information from NCBI, KEGG and GO databases. It turned out to be a reliable transcriptome after being assessed by length distribution statistic, map-based and CDS-prediction method. The accuracy of the expression profiling data was verified by real-time quantitative RT-PCR. Furthermore, we identified and characterized 14,250 cSSRs to offer useful molecular markers for future genetic analysis and molecular breeding.
Transcriptomic changes of L. punctata during nutrient starvation
As almost all “essential mineral nutrients” were in deficiency, distilled water provided a global mineral starvation condition which simulates natural water bodies better than the single nutrient-deficient media used in previous published studies [24, 71, 72]. To cope with the reduced nutrient availability in distilled water, L. punctata immediately triggered the up-regulation of the expression of several high-affinity transporters, aiming to increase nutrient acquisition (Additional file 5: Table S4). But this response failed to change the nutrient status and consequently resulted in a series of starvation responses. Under nutritional stress, many aspects of morphology, physiology and metabolism are altered in plants, including adjusting their growth to increase nutrient acquisition [73–78], remobilizing nutrient from inorganic and organic resources , and using alternative metabolic pathways that require lower amounts of a limiting nutrient [80, 81].
Previous studies have observed that on nitrogen- or phosphate-deficient medium, vegetative growth of duckweed was quickly reduced and eventually ceased. The photosynthesis and respiration rate both fell gradually and roots elongated. At the molecular level, nutrient starvation stimulated starch accumulation while protein or nucleotide percentage decreased [24, 26, 27, 32, 72, 73]. As “essential mineral nutrients” play crucial roles in cell metabolism and physiology, such as constituents of metabolites and macromolecules, enzyme cofactors, and so on , distilled water regulated the metabolic status and induced almost all of the metabolic changes observed previously. For examples, RbcS and RCA of L. punctata exhibited very high expression levels in standard Hoagland nutrient solution, but were significantly down-regulated within the first 24 h in distilled water (Additional file 4: Table S4). This observation is consistent with the decrease of photosynthetic rate (Figure 1). The significant down-regulation of the expression level of the alpha and beta subunits of ATPase F1 (Additional file 4: Table S4), together with the decreased respiration rate (Figure 1), reflect the significant suppression of respiration in L. punctata under nutrient starvation.
With the shortage of mineral nutrients, especially nitrogen and phosphorus, the biosynthesis of protein and nucleotide in L. punctata were restricted and total amount of these two macromolecules were mostly dependent on the initial storage of nitrogen and phosphorus respectively. At the later stage of nutrient starvation, the free nitrogen and phosphorus in L. punctata would inevitably decrease due to the majority being fixed in cell skeleton and genomic DNA. This will consequently suppress the total transcription and translation levels. The stable protein content presented in Figure 1 strongly supported this hypothesis. Although the protein percentage decreased from the initial 29.6% (DW) to the final 11.3% (DW) by 168 h, the total protein amount kept at a stable level of 0.015 g per flask. Beside this, deficiency of magnesium, calcium, zinc, iron, manganese and some other minor minerals that play basic functions as enzyme cofactors, widely affected the metabolic reactions in cell, and globally reduced the rate of the metabolisms and energy consumption. The down-regulation of the expression level of the regulatory and catalytic core of ATPase resulted in a decrease of ATP/ADP ratio and then produced a negative feedback regulation of oxidative phosphorylation to decrease the degradation of glucose [83, 84]. This indirectly supported the reduction of global metabolism in nutrient starvation treated L. punctata.
There are disadvantages in using distilled water as culture media. The obtained data are not enough to explain all the starvation responses since physiological and molecular responses observed in this study might be related to a specific ions deficiency or a mixed stress by the lack of all ions. However, it still provides an alternative method to produce high-starch duckweed. This study provides a preliminary investigation concerning the transcriptomic responses induced by nutrient-stress. Intensive investigations on this topic are needed in the future.
Starch accumulation of L. punctata under nutrient starvation
Although previous studies focused on the starch accumulation of duckweed have made a lot of progresses, no comprehensive assay on this trait has been published and the mechanism of it remains largely unknown. We used a genome-wide transcriptomic analysis method to investigate the expression patterns of some key enzymes involved in starch metabolism.
We integrated the transcriptomics data, the enzymatic assay and starch percentage variation to elucidate the mechanisms of starch accumulation under nutrient starvation. The data from three different levels were analyzed and compared. Starch composition investigation showed that starch percentage of L. punctata accumulated rapidly. In the first 24 h of nutrient starvation, starch percentage increased from the original 3.0% (DW) to 18.3% (DW). At 168 h, starch percentage upped to 45.4% (DW) (Figure 1). Calculated according to the dry weight of biomass, the starch net weight increased from 1.5 mg to 63.5 mg per flask, it meant that the starch quantity increased 42 times. Meanwhile, the enzymatic activity of AGP, the regulator of the first committed step in biosynthesis of starch, increased by 30.0% and 52.8% at 24 h and 168 h respectively compared to that of 0 h (Figure 1). More importantly, the expression pattern of key enzyme-encoding transcript involved in starch biosynthesis and degradation further supported these results described above. Findings showed that the expression levels of transcripts encoding AGP-LS were up-regulated significantly, while those of AGP-SS were up-regulated slightly at the first 24 h of starvation, which indicated that the starch accumulation may have originally resulted from the high expression levels of these enzyme-encoding transcripts. On the other hand, nutrient starvation down-regulated expression level of starch degradation related enzymes, such as alpha- and beta-amylase. Additionally, the expression level of regulatory enzymes involved in some competitive bypasses, including trehalose-6-phosphate synthase, sucrose synthase, trehalose 6-phosphate phosphatase, and so on, were also down-regulated (Figure 5). Coulped with the up-regulation of starch biosynthesis related key enzyme-encoding transcripts, these down-regulations finally redirected alpha-D-Glucose-1P and UDP-Glucose to the starch biosynthesis branch.
Taking all of the transcriptomic changes into account, we deduced that the suppression of global metabolic status and cellular respiration decreased the consumption of glucose and redirected the flux toward the biosynthesis of starch. Although several previous studies suggest that starch accumulation is a general response to nutrient deficiency in some plants [85, 86], the imbalance between photosynthesis and carbohydrate usage caused by nutrient-deficient is still be less revealed. The findings of this study provides transcriptomics data about this stress response and suggests that the continuous intake of carbon, hydrogen, oxygen and light energy, together with the suppression of many energy-required metabolisms, redirected metabolic flux and resulted in the final starch accumulation.
Transcriptomics data also showed that transcripts encoding laccase, the last key enzyme involved in the biosynthesis of lignin, had a low expression level, while key enzyme-encoding transcripts involved in flavonoid biosynthesis had much higher expression level regaedless of nutrient starvation (Figure 6). This may be the reason that duckweed has low lignin percentage but high flavonoid content. The characteristics of high starch and low lignin percentage, together with high biomass production, make duckweed a promising energy crop.
This study provides the first comprehensive transcriptome and a genome-wide gene expression profiling of duckweed L. punctata exposed to nutrient starvation. Global expression pattern analysis revealed that starvation suppressed most metabolisms due to the extreme shortage of essential mineral nutrients, and then redirected metabolic flux to direct more fixed CO2 into starch synthesis pathway, resulting in starch accumulation. These results described here provide valuable genomic resource for duckweed and pave the way for the further molecular biological studies and the application of duckweed as a bioenergy crop.
Materials and methods
Duckweed cultivation and growth conditions
L. punctata 0202, originally collected from Sichuan, China, was cultivated in standard Hoagland nutrient solution  for 14 days under a 16/8 h day/night cycle, with a light intensity of 130 μmol/m2/s, and a temperature of 25°C/15°C at day/night. Then 0.5 g fronds were transferred into 50 mL distilled water in 250 mL culture flask for further cultivation over a period of 7 days. A total of 150 culture flasks were used for L. punctata cultivation. Eleven different time points, including 0 h, 0.5 h, 2 h, 5 h, 24 h, 48 h, 72 h, 96 h, 120 h, 144 h and 168 h after fronds being transferred into distilled water, were chose for composition and enzymatic activity assay. For each time point, fronds were collected from three culture flasks. Moreover, samples collected at 0 h, 2 h and 24 h were snap-frozen immediately in liquid nitrogen immediately for the further RNA-Seq study.
Material composition and enzymatic activity assay
For each time point, fronds collected from three culture flasks were dried respectively and used for dry matter, protein and starch percentage measurement. The starch percentage of was measured using a total starch kit (Megazyme, Ireland) according to the manufacturer’s instructions. The crude protein in the biomass was determined as Kjeldahl nitrogen×6.25. Activities of AGP were measured following the introduction of Nakamura, Y. et al.. Accordingly, one enzymatic unit of AGP was defined as the amount of enzyme that causes the increase of 0.01 OD at 340 nm of the finally reacted solution per minute. The AGP units were then divided by the total protein amount.
RNA extraction and library construction
Total RNA were extracted by using OMEGATM Plant DNA/RNA kit (OMEGA, USA) following the manufacturer’s instruction and genomic DNA were removed by DNase I (Fermentas, USA). The purity, concentration and RNA integrity number (RIN) were measured by SMA3000 and/or Agilent 2100 Bioanalyzer. Qualified total RNAs were then submitted to Beijing Genomics Institute (BGI)-Shenzhen, Shenzhen, China [http://www.genomics.cn] for RNA sequencing.
To get a comprehensive transcriptome and investigate the transcriptomic response to nutrient starvation, fronds harvested at 0 h, 2 h and 24 h were used for RNA-Seq study. More than 20 μg qualified total RNA extracted from each sample were used for RNA sequencing by Hiseq 2000. The poly (A)+ RNAs were purified using poly-T oligo-attached magnetic beads and eluted with Tris-HCl under heating condition. mRNAs was mixed with fragmentation media and then fragmented. Fragmented mRNAs were copied into first strand cDNA using reverse transcriptase and random primers. This is followed by second strand cDNA synthesis using DNA Polymerase I and RNaseH. The ends of these dscDNAs were repaired by adding a single ‘A’ base and then Illumina adapters ligated to the repaired ends. 200 bp cDNAs fragment were purified from a gel and used for further templates enrichment by PCR with two primers that anneal to the ends of the adapters to construct a fragmented cDNAs library. Quality control analysis was performed by Agilent 2100 Bioanalyzer.
RNA sequencing and de novo assembly
The validated 200 bp fragments cDNAs library constructed above was submitted to Illumina Hiseq 2000 to perform transcriptome sequencing. Then the Illumina sequencing-by-synthesis, image analysis and base-calling procedures were used to obtain PE read sequence information and base-calling quality values.
Sequencing quality were assessed by fastQC [http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/] and PE reads were de novo assembled by using Trinity (v2012-06-08)  under default parameters choice. To investigate the assembly quality, all of the PE reads were aligned back to these contigs by using Bowtie2 (v2.0.0-beta5) program , and aligned rate were calculated. Assembly quality was also assessed by length distribution analysis by common perl scripts. N50 number, average length, max length and contig number during different length interval were all been calculated. Moreover, we scanned the best candidate Coding sequence (CDS) for each contig and got the ratios of long-CDS containing transcripts to corresponding length contigs.
Sequence composition analysis
Before de novo assembly, the GC percentage was calculated basing on the PE reads. After sequence assembly, the GC percentage and codon usage bias of the transcriptome were analyzed using EMBOSS  on the Galaxy website [http://main.g2.bx.psu.edu/] [55, 56]. We also searched cDNA-derived simple sequence repeats (SSRs) with at least 18 bp in length using a perl script known as MIcroSAtellite identification tool [MISA, http://pgrc.ipk-gatersleben.de/misa/misa.html].
Functional annotation and cluster
All of the contigs produced by Trinity (v2012-06-08)  were used for similarity search against the NR database downloaded from Genebank [http://www.ncbi.nlm.nih.gov/] by using local blast program. For BLASTX search, the threshold was set to E-value <103. The blast results were imported into the Blast2GO [87, 88] and performed further functional annotation. Gene ontology (GO) classification  was achieved using WEGO [http://wego.genomics.org.cn/cgi-bin/wego/index.pl] . Enzyme codes were extracted and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were retrieved from KEGG web server [http://www.genome.jp/kegg/]. KEGG and GO function cluster was conducted by common perl scripts.
Expression profiling and different expression calling
To investigate the express level of each contig in different samples, all PE reads for each sample were aligned back to the final assembly by using perl scripts in Trinity (v2012-06-08) package  under default parameters option. The alignment produced a digital expression levels for each contig and then these were normalized by RESM-based algorithm by using perl scripts in Trinity (v2012-06-08) package  so as to get FPKM values.
Based on the expression levels, the effect and bias introduced by library size and/or RNA composition were eliminated by edgeR (the Empirical analysis of Digital Gene Expression in R) , then significant DETs among different samples were identified with pvalue ≤0.05 and log2 fold-change (log2FC) ≥1. The cluster of the DETs was performed by using the common perl and R scripts.
Expression levels verification
To verify the reliability of the NGS-based expression levels, 50 transcripts were chose randomly from the annotated transcripts set for primer design (Additional file 7: Table S6) and real-time quantitative RT-PCR analysis. First strand cDNA was synthesized from 500 ng assessed total RNA using oligo (dT) and random hexamers as primers using Moloney murine leukemia virus (M-MLV) reverse transcriptase (Invitrogen, CA, USA) according to the manufacturer’s instructions. Real-time PCR were performed using Ssofast Evagreen supermix (BIO-RAD, CA, USA) on CFX Connect Real-time PCR detection system (BIO-RAD, CA, USA). The quantitative RT-PCR was implemented under the following program, 5 min at 95°C, followed by 40 cycles of application with 10 s of denaturation at 95°C, 30 s of annealing at 60°C and 30s of extension at 72°C. Three biological replications were used and amplicons were used for meltcurve analysis to check the amplification specificity. The relative expression levels were calculated using Vandesompele method by CFX Manager 2.1 of the amplifier.
Non-redundant protein database
Open reading frame
Fragments Per Kilobase of transcripts per Million mapped fragments
Differentially expressed transcript
Kyoto Encyclopedia of Genes and Genomes
This work was supported by the National Key Technology R&D Program of China (No.2011BAD22B03), the Major Projects of Knowledge Innovation Program of Chinese Academy of Sciences (KSCX2-EW-J-22), and West Light Foundation of The Chinese Academy of Sciences (Y2C5021100). We thank Mr. Lasse Bech and Dr. Wei-zao Huang for revising the manuscript.
- Sanchez OJ, Cardona CA: Trends in biotechnological production of fuel ethanol from different feedstocks. Bioresource Technol 2008, 99: 5270-5295. 10.1016/j.biortech.2007.11.013View Article
- Papong S, Malakul P: Life-cycle energy and environmental analysis of bioethanol production from cassava in Thailand. Bioresource Technol 2010, 101: S112-S118. 10.1016/j.biortech.2009.09.006View Article
- Zhang LA, Chen QA, Jin YL, Xue HL, Guan JF, Wang ZY, Zhao H: Energy-saving direct ethanol production from viscosity reduction mash of sweet potato at very high gravity (VHG). Fuel Process Technol 2010, 91: 1845-1850. 10.1016/j.fuproc.2010.08.009View Article
- Ge LL, Wang P, Mou HJ: Study on saccharification techniques of seaweed wastes for the transformation of ethanol. Renew Energ 2011, 36: 84-89. 10.1016/j.renene.2010.06.001View Article
- Crutzen P, Mosier A, Smith K, Winiwarter W: N2O release from agro-biofuel production negates global warming reduction by replacing fossil fuels. Atmos Chem Phys 2007, 7: 11191-11205. 10.5194/acpd-7-11191-2007View Article
- Sarkar N, Ghosh SK, Bannerjee S, Aikat K: Bioethanol production from agricultural wastes: An overview. Renew Energ 2012, 37: 19-27. 10.1016/j.renene.2011.06.045View Article
- Porath D, Hepher B, Koton A: Duckweed as an aquatic crop: Evaluation of clones for aquaculture. Aquat Bot 1979, 7: 273-278.View Article
- Zhao H, Appenroth K, Landesman L, Salmean AA, Lam E: Duckweed rising at Chengdu: summary of the 1st International Conference on Duckweed Application and Research. Plant Mol Biol 2012, 78: 627-632. 10.1007/s11103-012-9889-yView Article
- Landolt E: Biosystematic investigations in the family of duckweed (Lemnaceae) Volume 2. Stiftung Rubel, Zúrich, Suiza: Geobotanischen Inst ETH; 1986.
- Chaiprapat S, Cheng JJ, Classen JJ, Liehr SK: Role of internal nutrient storage in duckweed growth for swine wastewater treatment. T Asae 2005, 48: 2247-2258.View Article
- Hillman WS, Culley DD Jr: The Uses of Duckweed: The rapid growth, nutritional value, and high biomass productivity of these floating plants suggest their use in water treatment, as feed crops, and in energy-efficient farming. Am Sci 1978, 66: 442-451.
- Datko AH, Mudd SH, Giovanelli J: Lemna paucicostata Hegelm. 6746: development of standardized growth conditions suitable for biochemical experimentation. Plant Physiol 1980, 65: 906-912. 10.1104/pp.65.5.906View Article
- Seth PN, Venkataraman R, Maheshwari SC: Studies on the growth and flowering of a short-day plant, Wolffia microscopica. Planta 1970, 90: 349-359. 10.1007/BF00386387View Article
- Korner S, Vermaat JE, Veenstra S: The capacity of duckweed to treat wastewater: ecological considerations for a sound design. J Environ Qual 2003, 32: 1583-1590. 10.2134/jeq2003.1583View Article
- Skillicorn P, Spira W, Journey W: Duckweed aquaculture: a new aquatic farming system for developing countries. Banco Mundial; 1993.
- Oron G, de-Vegt A, Porath D: Nitrogen removal and conversion by duckweed grown on waste-water. Water Res 1988, 22: 179-184. 10.1016/0043-1354(88)90076-0View Article
- Cheng J, Bergmann BA, Classen JJ, Stomp AM, Howard JW: Nutrient recovery from swine lagoon water by Spirodela punctata. Bioresource Technol 2002, 81: 81-85. 10.1016/S0960-8524(01)00098-0View Article
- Cheng JJ, Stomp AM: Growing Duckweed to recover nutrients from wastewaters and for production of fuel ethanol and animal feed. Clean-Soil Air Water 2009, 37: 17-26. 10.1002/clen.200800210View Article
- El-Shafai SA, El-Gohary FA, Nasr FA, Peter Van Der Steen N, Gijzen HJ: Nutrient recovery from domestic wastewater using a UASB-duckweed ponds system. Bioresource Technology 2007, 98: 798-807. 10.1016/j.biortech.2006.03.011View Article
- Xu J, Cheng JJ, Stomp AM: Nutrient removal from swine wastewater by growing Duckweed: A pilot study. T Asabe 2012, 55: 181-185.View Article
- Xu J, Shen G: Growing duckweed in swine wastewater for nutrient recovery and biomass production. Bioresource Technol 2011, 102: 848-853. 10.1016/j.biortech.2010.09.003View Article
- Crites RW, Middlebrooks JE, Reed SC: Natural Wastewater Treatment Systems. United States: CRC Press; 2006.
- Oron G: Duckweed culture for wastewater renovation and biomass production. Agr Water Manage 1994, 26: 27-40. 10.1016/0378-3774(94)90022-1View Article
- Reid MS, Bieleski RL: Response of Spirodela oligorrhiza to Phosphorus Deficiency. Plant Physiol 1970, 46: 609-613. 10.1104/pp.46.4.609View Article
- Blazey EB, McClure JW: The distribution and taxonomic significance of lignin in the Lemnaceae. Am J Bot 1968, 55: 1240-1245. 10.2307/2440747View Article
- Cui WH, Xu JL, Cheng JJ, Stomp AM: Starch accumulation in duckweed for bioethanol production. Biol Eng 2011, 3: 187-197.View Article
- Xu JL, Cui WH, Cheng JJ, Stomp AM: Production of high-starch duckweed and its conversion to bioethanol. Biosyst Eng 2011, 110: 67-72. 10.1016/j.biosystemseng.2011.06.007View Article
- Landolt E, Kandeler R: Biosystematics investigation in the family of duckweeds (lemnacea). The family of the Lemnacea: a monographic study. Vol. 2. Zurich: Veroff Geobot Inst ETH; 1987.
- Cui W, Xu J, Cheng J, Stomp A: Growing duckweed for bioethanol production. 2010 ASABE Annual Meeting Paper 2010.
- Chen Q, Jin YL, Zhang GH, Fang Y, Xiao Y, Zhao H: Improving production of bioethanol from Duckweed (Landoltia punctata) by pectinase pretreatment. Energies 2012, 5: 3019-3032. 10.3390/en5083019View Article
- Li X, Jin Y, Gao X, Zhang G, Zhao H: Fermentation method of high ratios of biobutanol with Landoltia punctata(in Chinese, with English abstract). China Brewing 2012, 31: 85-88.
- Xu J, Zhao H, Stomp A-M, Cheng JJ: The production of duckweed as a source of biofuels. Biofuels 2012, 3: 589-601. 10.4155/bfs.12.31View Article
- Posner HB: Characteristics of X-ray-induced aberrants of Lemna perpusilla 6746. Plant Cell Physiol 1962, 3: 275-284.
- Hillman WS: Photoperiodism: An effect of darkness during the light period on critical night length. Science 1963, 140: 1397-1398. 10.1126/science.140.3574.1397View Article
- Datko AH, Mudd SH, Giovanelli J, Macnicol PK: Sulfur-containing compounds in Lemna perpusilla 6746 grown at a range of sulfate concentrations. Plant Physiol 1978, 62: 629-635. 10.1104/pp.62.4.629View Article
- Baldi BG, Maher BR, Slovin JP, Cohen JD: Stable isotope labeling, in vivo, of d- and l-Tryptophan pools in Lemna gibba and the low incorporation of label into indole-3-acetic acid. Plant Physiol 1991, 95: 1203-1208. 10.1104/pp.95.4.1203View Article
- Kandeler R: Uber die Bliitenbildung bei Lemna gibba LI Kulturbedingungen und Tageslangenabhangigkeit. Z Bot 1955, 43: 61-71.
- Tobin EM: White light effects on the mRNA for the light-harvesting Chlorophyll a/b-protein in Lemna gibba L. G-3. Plant Physiol 1981, 67: 1078-1083. 10.1104/pp.67.6.1078View Article
- Wang W, Messing J: High-throughput sequencing of three Lemnoideae (duckweeds) chloroplast genomes from total DNA. PLoS One 2011, 6: e24670. 10.1371/journal.pone.0024670View Article
- Mardanov AV, Ravin NV, Kuznetsov BB, Samigullin TH, Antonov AS, Kolganova TV, Skyabin KG: Complete sequence of the duckweed (Lemna minor) chloroplast genome: structural organization and phylogenetic relationships to other angiosperms. J Mol Evol 2008, 66: 555-564. 10.1007/s00239-008-9091-7View Article
- Wang W, Randall AK, Todd PM: Evolution of genome size in Duckweeds (Lemnaceae). J Bot 2011, 2011: 9. Article ID 570319 10.1155/2011/570319
- Wang W, Wu Y, Yan Y, Ermakova M, Kerstetter R, Messing J: DNA barcoding of the Lemnaceae, a family of aquatic monocots. BMC Plant Biol 2010, 10: 205. 10.1186/1471-2229-10-205View Article
- Bog M, Schneider P, Hellwig F, Sachse S, Kochieva EZ, Martyrosian E, Landolt E, Appenroth K-J: Genetic characterization and barcoding of taxa in the genus Wolffia Horkel ex Schleid.(Lemnaceae) as revealed by two plastidic markers and amplified fragment length polymorphism (AFLP). Planta 2013, 237: 1-13. 10.1007/s00425-012-1777-9View Article
- Bog M, Baumbach H, Schween U, Hellwig F, Landolt E, Appenroth KJ: Genetic structure of the genus Lemna L. (Lemnaceae) as revealed by amplified fragment length polymorphism. Planta 2010, 232: 609-619. 10.1007/s00425-010-1201-2View Article
- Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 2009, 10: 57-63. 10.1038/nrg2484View Article
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, et al.: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 2011, 29: 644-652. 10.1038/nbt.1883View Article
- AC't Hoen P, Ariyurek Y, Thygesen HH, Vreugdenhil E, Vossen RH, de Menezes RX, Boer JM, van Ommen G-JB, den Dunnen JT: Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms. Nucleic Acids Res 2008, 36: e141. 10.1093/nar/gkn705View Article
- Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, Marden JH: Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol 2008, 17: 1636-1647. 10.1111/j.1365-294X.2008.03666.xView Article
- Meyer E, Aglyamova GV, Wang S, Buchanan-Carter J, Abrego D, Colbourne JK, Willis BL, Matz MV: Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlx. BMC Genomics 2009, 10: 219. 10.1186/1471-2164-10-219View Article
- Shi CY, Yang H, Wei CL, Yu O, Zhang ZZ, Jiang CJ, Sun J, Li YY, Chen Q, Xia T: Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds. BMC Genomics 2011, 12: 131. 10.1186/1471-2164-12-131View Article
- Hoagland DR, Arnon DI: The water-culture method for growing plants without soil. Circular California Agricultural Experiment Station 1950., 347:
- Nakamura Y, Yuki K, Park S-Y, Ohya T: Carbohydrate metabolism in the developing endosperm of rice grains. Plant Cell Physiol 1989, 30: 833-839.
- Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 2009, 10: R25. 10.1186/gb-2009-10-3-r25View Article
- Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2View Article
- Goecks J, Nekrutenko A, Taylor J: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 2010, 11: R86. 10.1186/gb-2010-11-8-r86View Article
- Blankenberg D, Von Kuster G, Coraor N, Ananda G, Lazarus R, Mangan M, Nekrutenko A, Taylor J: Galaxy: a web-based genome analysis tool for experimentalists. In Current protocols in molecular biology 19th edition. Edited by: Ausubel FM. 2010, 11-21. Chapter 19: Unit 19 10
- Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26: 139-140. 10.1093/bioinformatics/btp616View Article
- VENNY: An interactive tool for comparing lists with Venn diagrams.
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al.: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25: 25-29. 10.1038/75556View Article
- Calvin M: Photosynthesis as a resource for energy and materials. Photochem Photobiol 1976, 23: 425-444. 10.1111/j.1751-1097.1976.tb07276.xView Article
- Ragauskas AJ, Williams CK, Davison BH, Britovsek G, Cairney J, Eckert CA, Frederick WJ, Hallett JP, Leak DJ, Liotta CL, et al.: The path forward for biofuels and biomaterials. Science 2006, 311: 484-489. 10.1126/science.1114736View Article
- Studer MH, DeMartini JD, Davis MF, Sykes RW, Davison B, Keller M, Tuskan GA, Wyman CE: Lignin content in natural Populus variants affects sugar release. Proc Natl Acad Sci USA 2011, 108: 6300-6305. 10.1073/pnas.1009252108View Article
- Folts JD: Potential health benefits from the flavonoids in grape products on vascular disease. Adv Exp Med Biol 2002, 505: 95-111. 10.1007/978-1-4757-5235-9_9View Article
- Patel JM: A review of potential health benefits of flavonoids. Lethbridge Undergraduate Research Journal 2008., 3:
- Van Dyke JM, Sutton DL: Digestion of duckweed (Lemna spp.) by the grass carp (Ctenopharyngodon idella). J Fish Biol 1977, 11: 273-278. 10.1111/j.1095-8649.1977.tb04120.xView Article
- Besseau S, Hoffmann L, Geoffroy P, Lapierre C, Pollet B, Legrand M: Flavonoid accumulation in Arabidopsis repressed in lignin synthesis affects auxin transport and plant growth. Plant Cell 2007, 19: 148-162. 10.1105/tpc.106.044495View Article
- Ma QH, Wang C, Zhu HH: TaMYB4 cloned from wheat regulates lignin biosynthesis through negatively controlling the transcripts of both cinnamyl alcohol dehydrogenase and cinnamoyl-CoA reductase genes. Biochimie 2011, 93: 1179-1186. 10.1016/j.biochi.2011.04.012View Article
- Hahlbrock K, Grisebach H: Enzymic controls in the biosynthesis of lignin and flavonoids. Annu Rev Plant Physiol 1979, 30: 105-130. 10.1146/annurev.pp.30.060179.000541View Article
- Wang L, Li P, Brutnell TP: Exploring plant transcriptomes using ultra high-throughput sequencing. Brief Funct Genomics 2010, 9: 118-128. 10.1093/bfgp/elp057View Article
- Egan AN, Schlueter J, Spooner DM: Applications of next-generation sequencing in plant biology. Am J Bot 2012, 99: 175-185. 10.3732/ajb.1200020View Article
- Thorsteinsson B, Tillberg J-E: Carbohydrate partitioning, photosynthesis and growth in Lemna gibba G3. II. Effects of phosphorus limitation. Physiol Plantarum 1987, 71: 271-276. 10.1111/j.1399-3054.1987.tb04341.xView Article
- White HL, Templeman WG: The interaction of factors in the growth of Lemna: X. the interaction of nitrogen and light intensity in relation to respiration. Ann Bot-London 1937, 1: 191-204.
- Lopez-Bucio J, Cruz-Ramirez A, Herrera-Estrella L: The role of nutrient availability in regulating root architecture. Curr Opin Plant Biol 2003, 6: 280-287. 10.1016/S1369-5266(03)00035-9View Article
- Hermans C, Hammond JP, White PJ, Verbruggen N: How do plants respond to nutrient shortage by biomass allocation? Trends Plant Sci 2006, 11: 610-617. 10.1016/j.tplants.2006.10.007View Article
- Reymond M, Svistoonoff S, Loudet O, Nussaume L, Desnos T: Identification of QTL controlling root growth response to phosphate starvation in Arabidopsis thaliana. Plant Cell Environ 2006, 29: 115-125. 10.1111/j.1365-3040.2005.01405.xView Article
- Svistoonoff S, Creff A, Reymond M, Sigoillot-Claude C, Ricaud L, Blanchet A, Nussaume L, Desnos T: Root tip contact with low-phosphate media reprograms plant root architecture. Nat Genet 2007, 39: 792-796. 10.1038/ng2041View Article
- Landsberg EC: Hormonal regulation of iron-stress response in sunflower roots: A morphological and cytological investigation. Protoplasma 1996, 194: 69-80. 10.1007/BF01273169View Article
- Bates TR, Lynch JP: Stimulation of root hair elongation in Arabidopsis thaliana by low phosphorus availability. Plant Cell Environ 1996, 19: 529-538. 10.1111/j.1365-3040.1996.tb00386.xView Article
- Schachtman DP, Shin R: Nutrient sensing and signaling: NPKS. Annu Rev Plant Biol 2007, 58: 47-69. 10.1146/annurev.arplant.58.032806.103750View Article
- M H: Mineral Nutrition of Higher Plants. 2nd edition. London: Academic Press; 1995.
- Duff SM, Moorhead GB, Lefebvre DD, Plaxton WC: Phosphate starvation inducible; bypasses' of adenylate and phosphate dependent glycolytic enzymes in Brassica nigra suspension cells. Plant Physiol 1989, 90: 1275-1278. 10.1104/pp.90.4.1275View Article
- Rubio V, Bustos R, Irigoyen ML, Cardona-Lopez X, Rojas-Triana M, Paz-Ares J: Plant hormones and nutrient signaling. Plant Mol Biol 2009, 69: 361-373. 10.1007/s11103-008-9380-yView Article
- Korzeniewski B: Regulation of ATP supply during muscle contraction: theoretical studies. Biochem J 1998,330(Pt 3):1189-1195.View Article
- Huang YC, Colman RF: Affinity labeling of the allosteric ADP activation site of NAD-dependent isocitrate dehydrogenase by 6-(4-bromo-2,3-dioxobutyl)thioadenosine 5'-diphosphate. J Biol Chem 1984, 259: 12481-12488.
- Lunde C, Zygadlo A, Simonsen HT, Nielsen PL, Blennow A, Haldrup A: Sulfur starvation in rice: the effect on photosynthesis, carbohydrate metabolism, and oxidative stress protective pathways. Physiol Plantarum 2008, 134: 508-521. 10.1111/j.1399-3054.2008.01159.xView Article
- Paul M, Driscoll S: Sugar repression of photosynthesis: the role of carbohydrates in signalling nitrogen deficiency through source: sink imbalance. Plant Cell Environ 1997, 20: 110-116. 10.1046/j.1365-3040.1997.d01-17.xView Article
- Conesa A, Gotz S: Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int J Plant Genomics 2008, 2008: 619832.View Article
- Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M: Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 2005, 21: 3674-3676. 10.1093/bioinformatics/bti610View Article
- Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, Wang J, Li S, Li R, Bolund L: WEGO: a web tool for plotting GO annotations. Nucleic Acids Res 2006, 34: W293-297. 10.1093/nar/gkl031View Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.