Skip to main content

Function of ORFC of the polyketide synthase gene cluster on fatty acid accumulation in Schizochytrium limacinum SR21



As a potential source of polyunsaturated fatty acids (PUFA), Schizochytrium sp. has been widely used in industry for PUFA production. Polyketide synthase (PKS) cluster is supposed to be the primary way of PUFA synthesis in Schizochytrium sp. As one of three open reading frames (ORF) in the PKS cluster, ORFC plays an essential role in fatty acid biosynthesis. However, the function of domains in ORFC in the fatty acid synthesis of Schizochytrium sp. remained unclear.


In this study, heterologous expression and overexpression were carried out to study the role of ORFC and its domains in fatty acid accumulation. Firstly, ORFC was heterologously expressed in yeast which increased the PUFA content significantly. Then, the dehydratase (DH) and enoyl reductase (ER) domains located on ORFC were overexpressed in Schizochytrium limacinum SR21, respectively. Fatty acids profile analysis showed that the contents of PUFA and saturated fatty acid were increased in the DH and ER overexpression strains, respectively. This indicated that the DH and ER domains played distinct roles in lipid accumulation. Metabolic and transcriptomic analysis revealed that the pentose phosphate pathway and triacylglycerol biosynthesis were enhanced, while the tricarboxylic acid cycle and fatty acids oxidation were weakened in DH-overexpression strain. However, the opposite effect was found in the ER-overexpression strain.


Therefore, ORFC was required for the biosynthesis of fatty acid. The DH domain played a crucial role in PUFA synthesis, whereas the ER domain might be related to saturated fatty acids (SFA) synthesis in Schizochytrium limacinum SR21. This research explored the role of ORFC in the PKS gene cluster in Schizochytrium limacinum and provided potential genetic modification strategies for improving lipid production and regulating PUFA and SFA content.


Polyunsaturated fatty acids (PUFA), especially docosahexaenoic acid (DHA, 22:6Δ4,7,10,13,16,19) and eicosapentaenoic acid (EPA, 20:5Δ5,8,11,14,17) are essential for human health [1,2,3,4]. Industrial production on a large scale is currently being launched via biotechnological methods based on marine algae [5]. It is known that the polyketide synthase (PKS) pathway is the primary way to synthesize PUFA in Schizochytrium sp. [6]. Fatty acid synthases, which are large enzyme complexes with multiple catalytic domains, synthesize PUFA from acetyl units via the PKS pathway, as demonstrated by microalgal DHA production [6].

Three genes encoding PKS were found in Schizochytrium sp. by Metz in 2001 [6]. The PKS pathway is composed of three open reading frames (ORFA, ORFB, and ORFC) that possess domain structures similar to fatty synthase (FAS), including dehydratase (DH), acyltransferase (AT), malonyl-CoA transacylase (MAT), β-ketoacyl synthase (KS), acyl carrier protein (ACP), ketoacyl reductase (KR), and enoyl reductase (ER) domains [7]. As illustrated in Fig. 1, the putative domains and active sites of the PKS gene cluster were identified by several online bioinformatic analysis tools, including BLAST (NCBI), SMART [8] and PKS/NRPS analysis [9]. Recently, research on the synthetic ability of PKS gene cluster to improve microbial strains and characterize their synthetic pathways has become a research hotspot [10,11,12]. Hayashi et al. investigated the function of ACPs and found that the number of activated ACPs increased the productivity of PUFA [13]. They also investigated the role of the KS domains in PUFA synthases by in vivo and in vitro analysis, and found that that two KS domains in subunit A and subunit C (KSA and KSC) were verified to catalyze the condensation from C18-C20 and C20-C22, respectively [14].

Fig. 1

The domain organization of the PKS gene cluster. The numbers indicate the positions of the amino acids. ACP acyl carrier protein, KS 3-ketoacyl-ACP synthase, CLF chain length factor, KR 3-hydroxyacyl-ACP reductase, DH dehydratase; ER enoyl-ACP reductase, MAT malonyl acyl transferase. In this study, the first DH and ER domains of ORFC was investigated

DH domain, functional as FabA in E. coli, catalyze the dehydration of β-hydroxyacyl-ACP [15]. According to Xie et al., the DH domain of ORFC could catalyze the transformation of 3-hydroxyacyl-ACP to trans-2-decenoyl-ACP and then isomerize this product to generate 3-cis-decenoyl-ACP, which is a critical step in the PUFA biosynthesis [16]. To study the detailed mechanism for the production of PUFA, Hayashi et al. discovered that depending on the carbon chain length, the DH domains in subunit A and subunit C introduced either saturation or cis double bonds to growing acyl chains [17]. Similarly, in our previous work, the content of PUFA was decreased in Schizochytrium sp. by deleting the DH domain, indicating the DH domain was required for PUFA synthesis [18]. The ER domain is similar to FabI in E. coli and is capable of catalyzing double bond reduction during EPA biosynthesis in Shewanella via the PKS pathway [6]. Ling et al. found that the ORFB-ER played a key role in PUFA synthesis, and the ORFC-ER might be related to saturated fatty acids (SFA) synthesis by knocking out the ER domains on ORFB and ORFC, respectively [19]. However, the function of DH and ER domains still remains unknown.

Recently, metabolomic and transcriptomic analysis have been widely used in microalgae research. Lim et al. suggested that the increase in lipid accumulation was due to the decrease in lipid catabolism in the early stable phase in Tetraselmis sp. M8 with nitrogen source deficiency by transcriptomic and metabolomic analysis [20]. By performing ultraviolet mutagenesis on Aurantiochytrium sp., Liu et al. obtained strains with high yields of lipids and DHA, and further comparative transcriptomics analysis revealed that coenzyme A transferase, AT, ER, DH and methyltransferase might be the key genes responsible for increased DHA production in Aurantiochytrium [21]. These studies laid a foundation for us to explore the synthesis mechanism of PUFA in Schizochytrium sp. by using the omics method.

ORFC, which consists of two DH and one ER domains, is considered one of the critical regulators for enhancing fatty acid biosynthesis [22]. However, the function of ORFC in fatty acid synthesis of Schizochytrium sp. remains unclear. In this study, to investigate the role of ORFC in fatty acid accumulation, we constructed an ORFC heterologous expression strain in yeast and overexpressed these domains (located on ORFC) in Schizochytrium limacinum SR21. Gas chromatography/mass spectrometry (GC–MS) based metabolomic analysis and an RNA-seq analysis were conducted to explain the change in fatty acid composition in different strains. This study could provide more explanations and demonstrations for the role of ORFC in fatty acid synthesis.


Effects of ORFC heterologous expression on fatty acid analysis in S. cerevisiae

To study the effects of ORFC on fatty acid accumulation, heterologous expression of ORFC was conducted in S. cerevisiae YSG50. As shown in Additional file 1: Fig. S2, single clones grew in the Ura-depletion plate, and ORFC gene was detected in the engineered strain while not in the wild-type strain by genomic PCR analysis, revealed that ORFC was successfully heterologously expressed in S. cerevisiae YSG50. As shown in Table 1, the total lipids (TL) yield had a slightly increase in the engineered strain compared to the wild-type strain. However, both the SFA and PUFA contents significantly increased by 33.24% and 125% in the engineered strain compared with the wild-type strain. C18:1 showed a significant decrease compared with the wild-type strain. Moreover, dihomo-gamma linolenic acid (DGLA), docosapentaenoic acid (DPA) and DHA were found in the engineered strain while none was produced in the wild-type strain. A higher ratio of PUFAs/SFAs was found in the engineered strain. These results indicated that the introduction of ORFC accelerated the conversion of C18 to C20, which in turn enables the synthesis of PUFA with more than 20 carbons in the yeast, such as DPA and DHA.

Table 1 Lipid profile of YSG50 (wild-type strain) and YSG50-C (engineered strain) after 3 days of fermentation

Construction of the DH/ER overexpressing strain

As shown in Additional file 1: Fig. S3C, a zeocin resistance gene was successfully detected but this gene was not detected in the wild-type strain. Moreover, qPCR analysis revealed that the transcription levels of DH and ER genes were significantly increased compared to the wild-type strain (Additional file 1: Fig. S4). These results indicated that DH and ER domains in Schizochytrium limacinum SR21 were successfully overexpressed, respectively.

Effects of DH/ER domain overexpression on biomass and lipids accumulation

As shown in Fig. 2A, the cell growth of both engineered strains was slower than that of the wild-type strain. The biomass of the DH-overexpression strain and ER-overexpression strain were 14.3% and 5.82% lower than that of the wild-type strain, respectively (p < 0.05). In addition, the TL content of the ER-overexpression strain was the lowest during the whole fermentation period among the strains (Fig. 2B). However, compared with the wild-type strain, the TL content of the DH-overexpression strain showed an increase of 9.7% (p < 0.05). These results suggested that overexpression of the ER domain had suppressive effects on biomass and lipid production. Both of engineered strains resulted in similar biomass inhibition, but the DH-overexpression strain could cause an increase in lipid production.

Fig. 2

Biomass, lipid accumulation, and ROS levels of Schizochytrium limacinum SR21. A Biomass. B Percentage of the total fatty acid content to biomass. C Fatty acid composition of the total fatty acid at 5rd of fermentation. D Percentage of the primary fatty acids to total lipids after 5 days of fermentation. The different lowercase letters in each bar indicate a significant difference (p < 0.05) between different strains. Spearman pairwise correlation globe among the different fatty acids. Each line represents a Spearman's correlation coefficient between two different fatty acids. Positive correlations are indicated in red, and the negative correlations are in green. F Reactive oxygen species (ROS) levels. Standard bars represent the standard deviation of triplicate samples. WT, ER+, and DH+ represent the wild-type strain, DH-overexpression strain, and ER-overexpression strain, respectively

Effects of DH/ER domain overexpression on fatty acid composition

Cells were collected to analyze fatty acid compositions on the final day of fermentation (the 5th day) when the lipid accumulation reached its highest level (Fig. 2B). As shown in Fig. 2C, D, overexpression of the ER and DH domains had a different and significant influence on fatty acid compositions. Compared to the wild-type strain, the DH-overexpression strain showed a 20.5% decrease in EPA content (p < 0.05), while the levels of DPA、DHA and PUFA increased by 14.8%, 9.8%, and 10.2% (p < 0.05), respectively. Conversely, in the ER-overexpression strain, EPA content increased by 66% (p < 0.05), while the levels of DPA, DHA and PUFA decreased by 10.1%, 18.6%, and 16.5% (p < 0.05), respectively.

Then, the correlations among the fatty acids were evaluated by using Spearman pairwise analysis. As shown in Fig. 2E, six fatty acids were divided into two categories. Interestingly, we found that there were significant positive correlations among C16:0, EPA and SFA. DHA, DPA and PUFA were also found to be positively correlated. However, any two fatty acids of the two categories were negatively correlated. As a result, we speculated that EPA synthesis might be related to SFA accumulation. However, additional research is required to confirm these findings.

Effects of cellular reactive oxygen species (ROS) in DH/ER overexpression strains

The contents of intracellular ROS in Schizochytrium limacinum SR21 at 1-day intervals were determined during the entire fermentation process. As shown in Fig. 2F, cells of the three strains maintained low ROS levels during the initial stage (< 3 days), which might be related to the nutrient-rich medium [23]. From 3 days to the end of fermentation, the ROS levels of all strains increased significantly with the depletion of nutrients and the rapid accumulation of fatty acids. Additionally, the ROS level of the DH-overexpression strain remained low in comparison to the other strains throughout the fermentation period, reaching 118.7 at the end point of fermentation (5th day). Nonetheless, ROS levels remained higher in the ER-overexpression strain than in the other strains for the entire 5-day fermentation period.

Metabolite profile analysis of DH/ER overexpression strains

To uncover the intracellular fatty acid metabolism of all the engineered strains, GC–MS was adopted along with multivariate analysis. More than 51 putative intracellular metabolites were detected, identified and quantified in all samples on the 3rd day of fermentation, including fatty acids, organic acids, phosphorylated compounds, amino acids, and sugars. A principal component analysis (PCA) score plot was constructed to reveal the obvious separation of metabolites among the wild-type and engineered strains (Fig. 3A). Orthogonal partial least-squares discriminant analysis (OPLS-DA) pairwise comparison was performed to illustrate the remarkable diversity of the metabolic profiles between the wild-type strain and the engineered strains (Fig. 3B, C). Two OPLS-DA models had high Q2 values and low p values (less than 0.01) from CV-ANOVA. The permutation tests also suggested that the two OPLS-DA models had high predictive capability (Additional file 1: Fig. S5).

Fig. 3

Metabolite analysis of the wild-type and engineered strains on the 3rd day of the fermentation process. A PCA score plot among the DH+, ER+ and WT strains. OPLS-DA score plots obtained from B WT vs. DH+ strains and C WT vs. ER+ strains. D Heatmap of normalized concentrations of 33 differential metabolites based on OPLS-DA results. Each column represents an individual. The normalized abundance values are depicted from blue to red, where red and blue indicate an increase and decrease in the metabolites, respectively. Yellow or purple entries indicate metabolites that are less or more abundant between different pairwise groups, respectively. WT, ER+, and DH+ represent the wild-type strain, the DH-overexpression strain, and the ER-overexpression strain, respectively

Differential metabolites were determined with variable importance in the projection (VIP) values greater than 1, and the p-values of less than 0.05 as statistically significant [24]. Based on these results, 26 and 25 differential metabolites were characterized between the wild-type strain and DH-overexpression strain and ER-overexpression strain, respectively. Differential metabolites in any two groups were all visually displayed in a heatmap plot (Fig. 3D).

To obtain a biochemical overview of differential metabolites, two metabolomics networks (Additional file 1: Fig. S6) were constructed and visually displayed using MetaMapp based on chemical similarity and enzymatic transformation. The resulting metabolic network mainly consisted of 8 distinctive clusters (TCA cycle, fatty acid metabolism, amino sugar and nucleotide sugar metabolism, glycolysis/pentose phosphate pathway (PPP pathway), galactose metabolism, amino acid metabolism, steroid biosynthesis, and nucleic acid metabolism). Moreover, the changing trends of some metabolites (DHA, EPA, palmitic acid (C16:0), pentadecanoic acid, myristic acid (C14:0), and 1-monopalmitin) were found to be different between the two engineered strains.

Comparative transcriptomic analysis of engineered strains

For a comprehensive understanding of the molecular mechanism underlying fatty acid changes in different domain overexpressing strains, RNA-seq analysis was conducted among the wild-type and engineered strains. In these strains, a total of 15,034 genes were identified corresponding to the reference genome of Aurantiochytrium limacinum ATCC MYA1381 [25]. A total of 5469 differentially expressed genes (DEGs) were identified among the wild-type and engineered strains under two criteria (|log2 (fold change)| > 1 and false discovery rate (FDR) < 0.01). There were 4175, 3929 and 1578 DEGs among the WT-DH+, WT-ER+ and DH+-ER+ groups, respectively, of which 3421, 2974 and 360 genes were upregulated and 754, 955 and 1218 genes were downregulated, respectively (Fig. 4A, B and Additional file 6: Table S7). There were 418 DEGs in commons in all 3 groups (Additional file 2: Table S3). Among these, 55 genes were downregulated in the DH-overexpression strains and upregulated in the ER-overexpression strains. In addition, 579 DEGs were observed only in the WT-DH+ and DH+-ER+ groups (Additional file 3: Table S4), and 268 DEGs were found only in the WT-ER+ and DH+-ER+ groups (Additional file 4: Table S5). The heatmap was illustrated to highlight these specific DEGs. As illustrated in Fig. 4C, the up- and down-regulation of these DEGs varied significantly among groups.

Fig. 4

Transcriptomic analysis in the wild-type strain and engineered strains. A Venn diagram of the differentially expressed genes among different strains. B The number of up/down-regulated genes. C Heatmap of differentially expressed genes. D KEGG pathway enrichment analysis of differentially expressed genes among the different groups. WT, ER+, and DH+ represent the wild-type strain, the DH-overexpression strain, and the ER-overexpression strain, respectively

The Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis was used to determine the significant enrichment pathways for DEGs (|log2 (fold change)| > 1 and p < 0.05). Figure 4D shows that 94 DEGs were significantly enriched in 12 pathways in the C-DH+ and DH+-ER+ groups, 28 of which were related to metabolism; 69 DEGs were enriched in 10 pathways in the C-ER+ and DH+-ER+ groups, 46 of which were related to metabolism; and 53 DEGs were enriched in 12 pathways in the three groups, 34 of which were related to metabolism. The pathways related to lipid metabolism included TAG biosynthesis and lipid oxidation. The pathways related to carbohydrate metabolism included the TCA cycle, glycolysis and inositol phosphate metabolism. The differentially expressed genes are shown as a heatmap in Additional file 1: Fig. S7, and the specific data are shown in Additional file 5: Table S6.

The differential metabolites and several key DEGs were further mapped to the major metabolic pathways (Fig. 5), which include the PPP, glycolysis, the mevalonate (MVA) pathway, the TCA cycle, TAG biosynthesis, amino acid metabolism, fatty acid biosynthesis and oxidation.

Fig. 5

The specific metabolites and genes in the main metabolic pathways in the DH and ER overexpression strains. ER+, and DH+ represent the DH-overexpression strain and ER-overexpression strain, respectively. The red rectangle indicates key enzymes. Rectangle: DH-overexpression strain; oval: ER-overexpression strain. Red: increase (p < 0.05), green or blue: decrease (p < 0.05), white: no significant change (p > 0.05)


Schizochytrium limacinum is rich in PUFA, an alternative commercial lipid source. Bi et al. suggested that ORFC, one of the ORFs in the PKS cluster, played an essential role in PUFA (especially DHA) production [22]. In our study, heterologous expression of ORFC significantly increased TL and PUFA contents in S. cerevisiae, which suggested that ORFC played a vital role in PUFA biosynthesis. To investigate the function of ORFC for fatty acid biosynthesis, each domain of ORFC was overexpressed in Schizochytrium limacinum SR21. However, since a strain overexpressing the second DH domain exhibited no significant changes in fatty acid composition in comparison with the wild-type strain (p < 0.05, Additional file 1: Table S2), the first DH domain (named DH domain in this study) was deep investigated. Lipid profile analysis suggested that the PUFA (mainly DHA) content increased in the DH-overexpressed strain, whereas the SFA (mainly C16:0) content increased in the ER-overexpression strain. Our previous study indicated that deleting the DH domain and ER domain of ORFC could result in a decrease in the PUFA and SFA contents, respectively [19]. The DH domain of ORFC was also essential for the biosynthesis of PUFA through heterologous expression and site-mutagenesis in E. coli and S. cerevisiae [16, 26]. These results suggested that the DH and ER domains might exert distinct regulatory effects on the accumulation of PUFA and SFA. The DH-overexpression strain had a high PUFA content, whereas the ER-overexpression strain had a decreased PUFA content and an increased SFA content.

Effects of acetyl-CoA, NADPH and TAG accumulation changes on fatty acids biosynthesis in engineered strains

Acetyl-CoA is an essential precursor for many pathways, such as the TCA cycle, amino acid biosynthesis, steroid biosynthesis, and fatty acid biosynthesis [27]. In the TCA cycle, the contents of 2-oxoglutaric acid, malic acid, succinic acid and citric acid were decreased, whereas ARA and DHA concentrations were increased in the DH-overexpression strain. This result suggested that the carbon flux could be redirected away from the TCA cycle and toward PUFA synthesis. Similar results were reported in which an enhanced acetyl-CoA supply could improve lipid production [28, 29]. However, the expression level of citrate synthase (CS), which catalyzes the conversion acetyl-CoA to citric acid in the TCA cycle, was upregulated significantly in the ER-overexpression strain. The same results were observed for a number of TCA cycle-related genes, including isocitrate dehydrogenase (ICDH), 2-oxoglutarate dehydrogenase E1 component (sucA), and the 2-oxoglutarate dehydrogenase E2 component (sucB). These findings might imply that acetyl-CoA was used to synthesize citrate acid rather than fatty acid biosynthesis, leading to a lower TL content in the ER-overexpression strain. Similarly, Deng et al. found that overexpression of the CS gene decreased the fatty acid level in Chlamydomonas [30].

Along with acetyl-CoA, the biosynthesis of fatty acids also requires a high concentration of NADPH [31, 32]. PPP was previously thought to be the primary source of NADPH in microalgae [33]. The expression level of glucose-6-phosphate dehydratase (G6PDH), which was the limiting enzyme in the PPP [34], showed a 1.73-fold increase in the DH-overexpressed strain. The results indicated that overexpression of the DH domain could elevate the NADPH supply, thereby promoting fatty acid biosynthesis.

In general, the accumulated lipids, including SFA and PUFA, are mainly in the form of triacylglycerols (TAG) [35]. TAG are primary lipid storage in microalgae and occur mainly via the TAG pathway [36]. Therefore, the increase in TAG content also represents an increase in fatty acid content. Phosphatidic acid phosphatase (PAP), lysophosphatidic acid acyltransferase (LPAAT), glycerol-3-phosphate acyltransferase (GPAT), and diacylglycerol acyltransferase (DGAT) are the key enzymes that catalyze the acylation of glycerol-3-phosphate (G3P) and lead to the production of TAGs [37,38,39,40]. Comparing the wild-type strain with the ER-overexpression strain, no significant difference in gene expression or metabolites related to TAG biosynthesis was observed. Nevertheless, the expression levels of critical genes involved in TAG biosynthesis, such as GPAT, LPAAT and PAP, were significantly upregulated in the DH-overexpression strain compared to the wild-type strain (Fig. 5). Increased expression of GPAT, LPAAT and DGAT resulted in a higher PUFA yield in Phaeodactylum tricornutum [41]. Additionally, the cellular metabolites involved in TAG biosynthesis (3-phosphoglyceric acid and 1-monopalmitin) were upregulated. Thus, the significantly increased expression levels of GPAT, LPAAT and PAP resulted in an increase in the TL and PUFA contents in the DH-overexpression strain.

Effects of ROS changes on fatty acids biosynthesis in engineered strains

ROS can cause damage to DNA, proteins, lipids, and other biological molecules, leading to the loss of protein function and even cell death [42]. Lipids, mostly PUFAs, can be oxidized by ROS formed in aerobic environments. Conversely, lipid peroxidation can also lead to the accumulation of high levels of ROS [43]. By determining ROS levels, the time profile levels of ROS during fermentation appeared first decrease and then increase gradually. However, different changes in the intracellular ROS were found among the three strains during the fermentation process. Compared to the wild-type strain, the ROS level remained low in the DH-overexpression strain, which indicated that antioxidant (perhaps PUFAs) contents increased to induce the reduction of oxygen free radicals in the DH-overexpression strain. The expression level of genes encoding superoxide dismutase (SOD) was also upregulated in the DH-overexpression strain. SOD is one of the key enzymes in fatty acid oxidation and forms the first defense line against ROS by reducing it to H2O2 [44]. Moreover, the transcription levels of genes encoding fatty acid oxidation (such as gala and aslA) were significantly downregulated in the DH-overexpression strain. Combined with the lower concentration of ROS, these results indicated that the DH domain enhanced PUFA accumulation, probably by reducing fatty acid oxidation.

While the opposite changes were found in the ER-overexpression strain, a higher ROS level was exhibited in this strain than the wild-type strain. With the consumption of nutrition and prolonged stress, lipid peroxidation, particularly PUFA peroxidization, could result in a significant accumulation of ROS [45]. Combined with the upregulated genes involved in fatty acid oxidation (galA and aslA), these results might indicate that overexpression of the ER domain enhances lipid peroxidation, thereby reducing the content of TL and PUFA.

Effects of other changes on fatty acid biosynthesis in engineered strains

Surprisingly, the contents of metabolites involved in lipid biosynthesis (such as palmitic acid, myristic acid and stearic acid) increased significantly in the two engineered strains, especially he ER-overexpression strain. Lipid profile analysis revealed a positive correlation between the accumulation of EPA and SFA. According to researchers, SFA and PUFA were synthesized via the conventional desaturase/elongase pathway (FAS) and PKS pathways, respectively [6]. Transcriptomic analysis revealed that the genes involved in the FAS pathway, such as 3-oxoacyl-[acyl-carrier-protein] synthase (fabF), elongation of very-long-chain fatty acids protein 6 (ELO6) and elongation of very-long-chain fatty acids protein 9 (ELO9) were upregulated by 1.41, 1.36 and 2.15-fold in the ER-overexpression strain, respectively. In the DH-overexpression strain, however, no such differences were observed. Our previous research demonstrated that after three days of fermentation, the proportion of DHA in total lipids remained constant, whereas the EPA content increased with the increased fermentation time [46]. Due to the absence of certain desaturases, Song et al. hypothesized that a third pathway involving both the FAS and PKS pathways in PUFA biosynthesis in Thraustochytrids [25]. These results might suggest that EPA accumulation is related to SFA biosynthesis, but further research is necessary [17].


In the present study, to investigate the role of ORFC of the PKS gene cluster in fatty acid accumulation, ORFC was expressed heterologously in yeast. The significantly increased content of TL and PUFA indicated that ORFC was required for fatty acid biosynthesis. Subsequently, the DH and ER domains located on ORFC were overexpressed in Schizochytrium limacinum SR21, respectively. Lipid profile showed a significant increase in the PUFA content in the DH-overexpression strain. In the ER-overexpression strain, on the other hand, the PUFA content decreased while the SFA content increased. Additionally, metabolomic and transcriptomic analysis revealed that overexpression of the DH domain increased PPP and TAG biosynthesis while decreasing TCA and fatty acid oxidation, thereby enhancing the biosynthesis of fatty acids. However, overexpression of the ER domain appeared to enhance the TCA cycle and fatty acid oxidation during the lipid accumulation phase. As a result, the DH and ER domains of ORFC might have distinct roles in lipid accumulation. The DH domain was required for PUFA (mainly DHA) synthesis, whereas the ER domain inhibited PUFA synthesis, which might be related to the biosynthesis of SFA in Schizochytrium limacinum SR21. This work provides an experimental basis for clarifying the role of ORFC in lipid accumulation and theoretical support for engineering strains with high PUFA yields.


Strains and plasmids

The primers, plasmids and strains used in this study are listed in Additional file 1: Table S1 and Table 2. All fragments obtained by polymerase chain reaction (PCR) or overlap extension PCR were gel purified using a kit (Takara; Japan) before cloning. Fragment assembly was performed using the Gibson method [47].

Table 2 Strains and plasmids used in this study

Media and culture conditions

The fermentation and seed broth of Schizochytrium limacinum SR21 was the same as that used in our previous study [48]. All media were autoclaved at 121 °C for 20 min before use. The seed medium was inoculated at 2% (v/v) and cultured in a shaker at 200 rpm and 28 °C. The cells were grown in 100 mL flasks with 20 mL of seed medium and cultivated for 48 h. After two generations of cultivation, the seed culture (4% v/v) was then transferred to 500 mL flasks with 100 mL of fermentation medium. Three parallel samples were performed.

Overexpression of the DH/ER domain in ORFC in Schizochytrium limacinum SR21

Homologous recombination was used to generate overexpression strains. The DH and ER domains were amplified from the cDNA of Schizochytrium limacinum SR21 and ligated to the promoter TEF1p by overlap extension PCR. After digestion with BamHI and SpeI, the resulting DNA fragments were ligated into pBlueZeo-MAT which was linearized by BamHI and SpeI. Zeocin was used as the selection marker. The resulting plasmids and primers used are listed in Additional file 1: Fig. S1 and Table S1.

The overexpression plasmids were linearized by ApaI and NotI, and then transformed into Schizochytrium limacinum SR21 through electro-transformation according to our previous study [48]. Specifically, 100 μL of competent cells and ~ 1 μg of linearized plasmids were added to a 0.2 cm gap cuvette (Bio-Rad, California, USA) for electroporation. After electroporation, the cells were recovered at 28 °C in 200 rpm in a recovery medium (seed medium with 1 M sorbitol), and then 100 μL of cells were plated on a solid selection medium containing 2% agar and 50 μg/mL zeocin. After 3–5 days of incubation, the resulting transformants were transformed into the seed medium containing 50 μg/mL zeocin at 28 °C and 200 rpm. Genomic DNA and RNA were extracted and used for PCR and qPCR analysis, respectively.

Quantitative real-time PCR (qPCR) analysis

Total RNA was isolated from 1 mL of Schizochytrium limacinum SR21 cells in exponential growth phase using a ZR Fungal/ Bacterial RNA MicroPrep kit (ZYMO, California, USA). cDNA was synthesized by HiScript III RT Supermix for qPCR (+ gDNA wiper) (Vazyme, Nanjing, China) according to the manufacturer’s protocol. qPCR was performed using ChamQ Universal SYBR qPCR Master mix (Vazyme, Nanjing, China). QTower 3G (Analytik Jena, Germany) was used to detect the expression of target genes. The ACT (actin) gene was used as a reference gene in the calculations.

Determination of intracellular ROS

The ROS level in cells was measured using the commercialized probe 2',7'-dichlorodihydrofluorescein diacetate (DCFH-DA, Beyotime Biotechnology, China) according to the optimized manufacturer's instructions. Briefly, an aliquot of the culture was collected, centrifuged and resuspended to 1–2 × 106 cells/mL in 20 mM PBS (pH 7.0). The cell suspension was mixed with diluted probes of DCFH-DA at the final concentration of 10 mM, incubated at 37 °C in the dark for 30 min, and then washed three times with 10 mM PBS. The fluorescence intensity was measured at an excitation wavelength of 488 nm and an emission wavelength of 525 nm using SpectraMax M5 (Molecular Devices, San Jose, USA).

Determination of biomass, TL and fatty acid composition

One milliliter of fermentation broth was collected every day during the entire fermentation period by centrifugation at 8000 g for 5 min. The cell pellet was washed with 0.7% saline solution and dried using a vacuum freezer dryer to obtain the total biomass.

TL content and fatty acid composition were determined from 5 mL of culture according to our previous report [46]. Briefly, 5 mL of fermentation broth was mixed with 5 mL of HCl (12 mol/L) and incubated at 65 °C for 30 min. After transesterification, the mixture was extracted 5 times with 3 mL of n-hexane and evaporated to dryness by nitrogen flow. The samples were then redissolved in 5 mL of 0.5 M KOH–CH3OH and 5 mL of 30% BF3-ether and applied to a gas chromatograph (Agilent GC 7890, USA) equipped with a 100 m × 0.25 mm capillary column (SP-2560, USA). Deuterated myristic acid (Sigma, Burlington, USA) was used as an internal standard.

GC–MS performance and metabolomics data analysis

Sample preparation was performed as described previously [46]. In brief, samples on the 3rd day of fermentation were quenched with 80% cold methanol (− 40 °C, v/v) and then centrifuged at 8000 g and 4 °C for 10 min. Samples were ground under liquid nitrogen, extracted with prechilled methanol (− 40 °C) and freeze-dried. The resulting extracts were used for further analysis. The methyl ester of heptadecanoic acid was used as an internal standard for quantification.

GC–MS analysis was carried out with a variation on the two-stage technique as described previously [48]. Briefly, samples were derivatized in 50 μL of 20 mg/mL methoxyamine hydrochloride in pyridine for 2 h at 37 °C. Then 60 μL of N-methyl-N-(trimethylsilyl) trifluoroacetamide (MSTFA, Sigma, Burlington, USA) was added followed by incubation for another 2 h at 37 °C. Finally, the samples were centrifuged at the maximum speed for 10 min at 4 °C. The resulting sample was analyzed on an Agilent 7890-5975C GC–MS solution system (Agilent, Sacramento, USA). First, the GC oven temperature was maintained at 85 °C for 5 min, increased to 270 °C at a rate of 15 °C/min, and held for 5 min. Electron impulse ionization was applied at 70 eV. Helium was used as the carrier gas, and the flow rate was maintained at 1 mL min−1. The working range of the mass spectrometer was m/z 50−600.

GC–MS data were processed based on Li’s method [46]. Each peak was determined by alignment with the mass spectra in the NIST 2.2 library (National Institute of Standards and Technology, USA). The data of the identified metabolites were normalized and analyzed using SIMCA 14.1 (Umetrics, Umeå, Sweden) for multivariate analysis. PCA and OPLS-DA were used to identify the differential metabolites of the wild-type and overexpression strains. Differential metabolites were identified with VIP value greater than 1, and p < 0.05 was considered as statistically significant. A heat map was generated by the pheatmap package in R [49]. Six replicates were used in this experiment. MetaMapp analysis was performed in the MetaMapp and Cytoscape software packages [50].

RNA extraction, sequencing, and data analysis

Samples were collected at 3rd day and immediately centrifuged for 5 min at 8000 g and 4 ℃. The resulting pellets were frozen in liquid nitrogen and then stored at − 80 ℃ until use. Three biological replicates were prepared. Following the manufacturer's protocol, total RNA was extracted from liquid-ground cells using a TRIzol reagent kit (Invitrogen, Carlsbad, CA, USA). An Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA) was used to determine RNA quality. Oligo(dT) beads were used to enrich poly(A) mRNA from total RNA. cDNA was prepared by DNA polymerase I, RNase H, dNTPs and buffer. After purification, cDNA fragments were end-repaired, poly(A) was added followed by ligation to Illumina sequencing adapters. Finally, the cDNA libraries were sequenced with Illumina HiSeq2500 by Gene Denovo Biotechnology Co. (Guangzhou, China).

Clean reads obtained from RNA-seq were aligned to the reference genome using HISAT2.2.4. Fragments per kilobase of transcript per million reads mapped (FPKM) was used to calculate the abundance and variations of assembled transcripts using Stringtie V1.3.1 [51, 52]. EdgeR was used to analyze DEGs with the criteria of |log2 (fold change)| > 1 in expression level and FDR value below 0.05 [53]. GO terms and KEGG pathway enrichment analysis of the DEGs were performed using the online OmicShare tools ( All expressed genes were used as the background.

Heterologous expression of ORFC in Saccharomyces cerevisiae

To prepare the ORFC heterologous plasmid, yeast promoter ADH1p (397 bp) and terminator CYC1t (248 bp) were amplified from S. cerevisiae YSG50 genomic DNA. The target gene ORFC was cloned from the cDNA of Schizochytrium limacinum SR21. These individual fragments were assembled by overlap extension PCR [54]. After purification, the target gene was ligated with BamHI linearized pRS426 (5726 bp) (Additional file 1: Fig. S1). Then, the successfully constructed plasmid was electro-transformed in S. cerevisiae according to a previous method [54]. The recombinant strain was transferred to fermentation medium for 3 days at 30 ℃ and 200 rpm, and the fermentation broth was used for further fatty acid analysis (Additional file 6).

Data analysis

Different parameters, such as total biomass and lipid profile, were statistically analyzed using R ( with several publicly available packages. Figures were generated with several R packages, such as pheatmap, circlize, and EasyStat. Analysis of variance (ANOVA) was used to determine the significant differences between different strains. Tukey’s multiple comparisons test was used for post hoc analysis to compare individual means. Spearman correlation analysis was used to explore the correlation among the fatty acids. Each experiment was conducted in triplicates. Values are expressed as means ± standard deviation (SD) and p < 0.05 was used to determine statistical significance.

Availability of data and materials

All data generated or analyzed during this study are included in the article and its Additional files.



Open reading frame


Polyketide synthases


Polyunsaturated fatty acids


Saturated fatty acids


Total lipids


Docosahexaenoic acid


Docosapentaenoic acid


Eicosapentaenoic acid


Arachidonic acid


Dihomo-gamma linolenic acid




Enoyl reductase


Reactive oxygen species


Principal component analysis


Orthogonal partial least-squares discriminant analysis


Variable importance in the projection

PPP pathway:

Pentose phosphate pathway




Fatty acid methyl esters


False discovery rate




Glycerol-3-phosphate acyltransferase


Lysophosphatidic acid acyltransferase


Phosphatidic acid phosphatase


Diacylglycerol acyltransferase




Superoxide dismutase


Pyruvate dehydrogenase


Citrate synthase


2-Oxoglutarate dehydrogenase E1 component


2-Oxoglutarate dehydrogenase E2 component


Glucose-6-phosphate dehydratase


3-Oxoacyl-[acyl-carrier-protein] synthase


Elongation of very-long-chain fatty acids protein 6


Elongation of very-long-chain fatty acids protein 9


  1. 1.

    Zárate R, El Jaber-Vazdekis N, Tejera N, Pérez JA, Rodríguez C. Significance of long chain polyunsaturated fatty acids in human health. Clin Transl Med. 2017;6(1):25.

    PubMed  PubMed Central  Article  Google Scholar 

  2. 2.

    Shan Z, Rehm CD, Rogers G, Ruan M, Wang DD, Hu FB, Mozaffarian D, Zhang FF, Bhupathiraju SN. Trends in dietary carbohydrate, protein, and fat intake and diet quality among US adults, 1999–2016. JAMA. 2019;322(12):1178–87.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  3. 3.

    Jiao J, Liu G, Shin H, Hu F, Rimm E, Rexrode K, Manson J, Zong G, Sun Q. Dietary fats and mortality among patients with type 2 diabetes: analysis in two population based cohort studies. BMJ. 2019;366:4009.

    Article  Google Scholar 

  4. 4.

    Lapillonne A, Moltu SJ. Long-chain polyunsaturated fatty acids and clinical outcomes of preterm infants. Ann Nutr Metab. 2016;69(suppl 1):35–44.

    PubMed  Article  Google Scholar 

  5. 5.

    Adarme-Vega TC, Lim DK, Timmins M, Vernen F, Li Y, Schenk PM. Microalgal biofactories: a promising approach towards sustainable omega-3 fatty acid production. Microb Cell Fact. 2012;11:96.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  6. 6.

    Metz JG, Roessler P, Facciotti D, Levering C, Dittrich F, Lassner M, Valentine R, Lardizabal K, Domergue F, Yamada A. Production of polyunsaturated fatty acids by polyketide synthases in both prokaryotes and eukaryotes. Science. 2001;293(5528):290–3.

    CAS  PubMed  Article  Google Scholar 

  7. 7.

    Jeong Y-S, Song S-K, Lee S-J, Hur B-K. The growth and EPA synthesis of Shewanella oneidensis MR-1 and expectation of EPA biosynthetic pathway. Biotechnol Bioproc E. 2006;11(2):127–33.

    CAS  Article  Google Scholar 

  8. 8.

    Letunic I, Bork P. 20 years of the SMART protein domain annotation resource. Nucleic Acids Res. 2018;46(D1):D493–6.

    CAS  PubMed  Article  Google Scholar 

  9. 9.

    Bachmann BO, Ravel J. Methods for in silico prediction of microbial polyketide and nonribosomal peptide biosynthetic pathways from DNA sequence data. Methods Enzymol. 2009;458:181–217.

    CAS  PubMed  Article  Google Scholar 

  10. 10.

    AmiriJami M, LaPointe G, Griffiths MW. Engineering of EPA/DHA omega-3 fatty acid production by Lactococcus lactis subsp cremoris MG1363. Appl Microbiol Biotechnol. 2014;98(7):3071–3080.

    CAS  Article  Google Scholar 

  11. 11.

    Hauvermale A, Kuner J, Rosenzweig B, Guerra D, Diltz S, Metz J. Fatty acid production in Schizochytrium sp: involvement of a polyunsaturated fatty acid synthase and a type I fatty acid synthase. Lipids. 2006;41(8):739–747.

    CAS  PubMed  Article  Google Scholar 

  12. 12.

    Liu Z, Zang X, Cao X, Wang Z, Liu C, Sun D, Guo Y, Zhang F, Yang Q, Hou P. Cloning of the pks3 gene of Aurantiochytrium limacinum and functional study of the 3-ketoacyl-ACP reductase and dehydratase enzyme domains. PloS ONE. 2018;13(12):e0208853.

    PubMed  PubMed Central  Article  Google Scholar 

  13. 13.

    Hayashi S, Satoh Y, Ujihara T, Takata Y, Dairi T. Enhanced production of polyunsaturated fatty acids by enzyme engineering of tandem acyl carrier proteins. Sci Rep. 2016;6(1):1–10.

    Article  CAS  Google Scholar 

  14. 14.

    Dairi T, Hayashi S, Naka M, Ikeuchi K, Ohtsuka M, Kobayashi K, Satoh Y, Ogasawara Y, Maruyama C, Hamano Y, Ujihara T. Control mechanism for carbon chain length in polyunsaturated fatty acid synthases. Angew Chem Int Edit. 2019;131(20):6677–82.

    Article  Google Scholar 

  15. 15.

    Finzel K, Nguyen C, Jackson DR, Gupta A, Tsai SC, Burkart MD. Probing the substrate specificity and protein-protein interactions of the E. coli fatty acid dehydratase, FabA. Chem Biol. 2015;22(11):1453–1460.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  16. 16.

    Xie X, Meesapyodsuk D, Qiu X. Functional analysis of the dehydratase domains of a PUFA synthase from Thraustochytrium in Escherichia coli. Appl Microbiol Biot. 2018;102(2):847–56.

    CAS  Article  Google Scholar 

  17. 17.

    Hayashi S, Satoh Y, Ogasawara Y, Maruyama C, Hamano Y, Ujihara T, Dairi T. Control mechanism for cis double-bond formation by polyunsaturated fatty-acid synthases. Angew Chem Int Edit. 2019;58(8):2326–30.

    CAS  Article  Google Scholar 

  18. 18.

    Li Z, Chen X, Li J, Meng T, Wang L, Chen Z, Shi Y, Ling X, Luo W, Liang D, Lu Y, Li Q, He N. Functions of PKS genes in lipid synthesis of Schizochytrium sp. by gene disruption and metabolomics analysis. Mar Biotechnol. 2018;20(6):792–802.

    CAS  Article  Google Scholar 

  19. 19.

    Ling X, Zhou H, Yang Q, Yu S, Li J, Li Z, He N, Chen C, Lu Y. Functions of enyolreductase (ER) domains of PKS cluster in lipid synthesis and enhancement of PUFAs accumulation in Schizochytrium limacinum SR21 using triclosan as a regulator of ER. Microorganisms. 2020;8(2):300.

    CAS  PubMed Central  Article  PubMed  Google Scholar 

  20. 20.

    Lim DKY, Schuhmann H, Thomas-Hall SR, Chan KCK, Wass TJ, Aguilera F, Adarme-Vega TC, Dal'Molin CGO, Thorpe GJ, Batley J, Edwards D, Schenk PM. RNA-Seq and metabolic flux analysis of Tetraselmis sp. M8 during nitrogen starvation reveals a two-stage lipid accumulation mechanism. Bioresource Technol. 2017;244:1281–1293.

    CAS  Article  Google Scholar 

  21. 21.

    Liu L, Hu Z, Li S, Yang H, Li S, Lv C, Zaynab M, Cheng CHK, Chen H, Yang X. Comparative transcriptomic analysis uncovers genes responsible for the dha enhancement in the mutant Aurantiochytrium sp. Microorganisms. 2020;8(4):529.

    CAS  PubMed Central  Article  PubMed  Google Scholar 

  22. 22.

    Bi Z-Q, Ren L-J, Hu X-C, Sun X-M, Zhu S-Y, Ji X-J, Huang H. Transcriptome and gene expression analysis of docosahexaenoic acid producer Schizochytrium sp. under different oxygen supply conditions. Biotechnol Biofuels. 2018;11(1):249.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  23. 23.

    Ren LJ, Sun XM, Ji XJ, Chen SL, Guo DS, Huang H. Enhancement of docosahexaenoic acid synthesis by manipulation of antioxidant capacity and prevention of oxidative damage in Schizochytrium sp. Bioresource Technol. 2017;223:141–8.

    CAS  Article  Google Scholar 

  24. 24.

    Farrés M, Platikanov S, Tsakovski S, Tauler R. Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation. J Chemometr. 2015;29(10):528–36.

    Article  CAS  Google Scholar 

  25. 25.

    Song Z, Stajich JE, Xie Y, Liu X, He Y, Chen J, Hicks GR, Wang G. Comparative analysis reveals unexpected genome features of newly isolated Thraustochytrids strains: on ecological function and PUFAs biosynthesis. BMC Genomics. 2018;19(1):1–16.

    Article  CAS  Google Scholar 

  26. 26.

    Xie X, Sun K, Meesapyodsuk D, Miao Y, Qiu X. Distinct functions of two FabA-like dehydratase domains of polyunsaturated fatty acid synthase in the biosynthesis of very long chain polyunsaturated fatty acids. Environ Microbiol. 2020;22(9):3772–83.

    CAS  PubMed  Article  Google Scholar 

  27. 27.

    Yan J, Cheng R, Lin X, You S, Li K, Rong H, Ma Y. Overexpression of acetyl-CoA synthetase increased the biomass and fatty acid proportion in microalga Schizochytrium. Appl Microbiol Biotechnol. 2013;97(5):1933–9.

    CAS  PubMed  Article  Google Scholar 

  28. 28.

    Ren L-J, Huang H, Xiao A-H, Lian M, Jin L-J, Ji X-J. Enhanced docosahexaenoic acid production by reinforcing acetyl-CoA and NADPH supply in Schizochytrium sp. HX-308. Bioproc Biosyst Eng. 2009;32(6):837–843.

    CAS  Article  Google Scholar 

  29. 29.

    Han X, Zhao Z, Wen Y, Chen Z. Enhancement of docosahexaenoic acid production by overexpression of ATP-citrate lyase and acetyl-CoA carboxylase in Schizochytrium sp. Biotechnol Biofuels. 2020;13(1):1–13.

    Article  CAS  Google Scholar 

  30. 30.

    Deng X, Cai J, Fei X. Effect of the expression and knockdown of citrate synthase gene on carbon flux during triacylglycerol biosynthesis by green algae Chlamydomonas reinhardtii. BMC Biochem. 2013;14(1):1–11.

    Article  CAS  Google Scholar 

  31. 31.

    Liu B, Liu J, Sun P, Ma X, Jiang Y, Chen F. Sesamol enhances cell growth and the biosynthesis and accumulation of docosahexaenoic acid in the microalga Crypthecodinium cohnii. J Agr Food Chem. 2015;63(23):5640–5.

    CAS  Article  Google Scholar 

  32. 32.

    Hao G, Chen H, Gu Z, Zhang H, Chen W, Chen YQ. Metabolic engineering of Mortierella alpina for enhanced arachidonic acid production through the NADPH-supplying strategy. Appl Microbiol Biotechnol. 2016;82(11):3280–8.

    CAS  Google Scholar 

  33. 33.

    Xue J, Chen T-T, Zheng J-W, Balamurugan S, Cai J-X, Liu Y-H, Yang W-D, Liu J-S, Li H-Y. The role of diatom glucose-6-phosphate dehydrogenase on lipogenic NADPH supply in green microalgae through plastidial oxidative pentose phosphate pathway. Appl Microbiol Biotechnol. 2018;102(24):10803–15.

    CAS  PubMed  Article  Google Scholar 

  34. 34.

    Wasylenko TM, Ahn WS, Stephanopoulos G. The oxidative pentose phosphate pathway is the primary source of NADPH for lipid overproduction from glucose in Yarrowia lipolytica. Metab Eng. 2015;30:27–39.

    CAS  PubMed  Article  Google Scholar 

  35. 35.

    Imamura S, Kawase Y, Kobayashi I, Sone T, Era A, Miyagishima S-y, Shimojima M, Ohta H, Tanaka K. Target of rapamycin (TOR) plays a critical role in triacylglycerol accumulation in microalgae. Plant Mol Biol. 2015;89(3):309–318.

    CAS  PubMed  Article  Google Scholar 

  36. 36.

    Liu J, Mao X, Zhou W, Guarnieri MT. Simultaneous production of triacylglycerol and high-value carotenoids by the astaxanthin-producing oleaginous green microalga Chlorella zofingiensis. Bioresource Technol. 2016;214:319–27.

    CAS  Article  Google Scholar 

  37. 37.

    Vanhercke T, El Tahchy A, Shrestha P, Zhou X-R, Singh SP, Petrie JR. Synergistic effect of WRI1 and DGAT1 coexpression on triacylglycerol biosynthesis in plants. FEBS Lett. 2013;587(4):364–9.

    CAS  PubMed  Article  Google Scholar 

  38. 38.

    Niu Y-F, Wang X, Hu D-X, Balamurugan S, Li D-W, Yang W-D, Liu J-S, Li H-Y. Molecular characterization of a glycerol-3-phosphate acyltransferase reveals key features essential for triacylglycerol production in Phaeodactylum tricornutum. Biotechnol Biofuels. 2016;9(1):1–11.

    Article  CAS  Google Scholar 

  39. 39.

    Chungjatupornchai W, Fa-aroonsawat S. Enhanced triacylglycerol production in oleaginous microalga Neochloris oleoabundans by co-overexpression of lipogenic genes: Plastidial LPAAT1 and ER-located DGAT2. J Biosci Bioeng. 2020;131(2):124–30.

    PubMed  Article  CAS  Google Scholar 

  40. 40.

    Jin H-H, Jiang J-G. Phosphatidic acid phosphatase and diacylglycerol acyltransferase: potential targets for metabolic engineering of microorganism oil. J Agr Food Chem. 2015;63(12):3067–77.

    CAS  Article  Google Scholar 

  41. 41.

    Wang X, Luo S-W, Luo W, Yang W-D, Liu J-S, Li H-Y. Adaptive evolution of microalgal strains empowered by fulvic acid for enhanced polyunsaturated fatty acid production. Biotechnol Biofuels. 2019;277:204–10.

    CAS  Google Scholar 

  42. 42.

    Ruenwai R, Neiss A, Laoteng K, Vongsangnak W, Dalfard AB, Cheevadhanarak S, Petranovic D, Nielsen J. Heterologous production of polyunsaturated fatty acids in Saccharomyces cerevisiae causes a global transcriptional response resulting in reduced proteasomal activity and increased oxidative stress. Biotechnol J. 2011;6(3):343–56.

    CAS  PubMed  Article  Google Scholar 

  43. 43.

    Johansson M, Chen X, Milanova S, Santos C, Petranovic D. PUFA-induced cell death is mediated by Yca1p-dependent and -independent pathways, and is reduced by vitamin C in yeast. FEMS Yeast Res. 2016; 16(2):fow007.

  44. 44.

    Li T, Huang X, Zhou R, Liu Y, Li B, Nomura C, Zhao J. Differential expression and localization of Mn and Fe superoxide dismutases in the heterocystous cyanobacterium Anabaena sp. strain PCC 7120. J Bacteriol. 2002;184(18):5096–5103.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  45. 45.

    Sun XM, Geng LJ, Ren LJ, Ji XJ, Hao N, Chen KQ, Huang H. Influence of oxygen on the biosynthesis of polyunsaturated fatty acids in microalgae. Bioresource Technol. 2018;250:868–76.

    CAS  Article  Google Scholar 

  46. 46.

    Li Z, Ling X, Zhou H, Meng T, Zeng J, Hang W, Shi Y, He N. Screening chemical modulators of benzoic acid derivatives to improve lipid accumulation in Schizochytrium limacinum SR21 with metabolomics analysis. Biotechnol Biofuels. 2019;12:1–11.

    Article  Google Scholar 

  47. 47.

    Gibson DG, Young L, Chuang R-Y, Venter JC, Hutchison CA III, Smith HO. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods. 2009;6(5):343-U341.

    CAS  PubMed  Article  Google Scholar 

  48. 48.

    Li Z, Meng T, Ling X-P, Li J, Zheng C, Shi Y, Chen Z, Li Z, Li Q, Lu Y. Overexpression of malonyl-CoA: ACP transacylase in Schizochytrium sp. to improve polyunsaturated fatty acids production. J Agr Food Chem. 2018;66(21):5382–5391.

    CAS  Article  Google Scholar 

  49. 49.

    Silva TS, Richard N. Visualization and differential analysis of protein expression data using R. Methods Mol Biol. 2016;1362:105–18.

    CAS  PubMed  Article  Google Scholar 

  50. 50.

    Barupal DK, Haldiya PK, Wohlgemuth G, Kind T, Kothari SL, Pinkerton KE, Fiehn O. MetaMapp: mapping and visualizing metabolomic data by integrating information from biochemical pathways and chemical and mass spectral similarity. BMC Bioinformatics. 2012;13(1):99.

    PubMed  PubMed Central  Article  Google Scholar 

  51. 51.

    Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290–5.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  52. 52.

    Pertea M, Kim D, Pertea GM, Leek JT, Salzberg SL. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc. 2016;11(9):1650.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  53. 53.

    Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.

    CAS  PubMed  Article  Google Scholar 

  54. 54.

    Shao Z, Zhao H, Zhao H. DNA assembler, an in vivo genetic method for rapid construction of biochemical pathways. Nucleic Acids Res. 2009; 37(2):e16.

Download references


We thank Prof. Yingjun Yuan for providing us with Saccharomyces cerevisiae YSG50.


This work was financially supported by the National Natural Science Foundation of China (31871779).

Author information




YYS and NH developed the idea for the study, YYS performed the research and data analysis, and prepared the manuscript. ZC, YXL, XYC, LJY, YYX and ZPL helped to revise the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ning He.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All the authors consent to publication.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Primers used in this experiment. Fig. S1. Plasmids constructed in this study. A DH overexpression plasmid. B ER overexpression plasmid. C ORFC heterologous plasmid. Fig. S2. ORFC heterologous expression in S. cerevisia YSG50. A The ORFC heterologous expression strain was selected by the Ura-depletion plate. B Genomic PCR analysis of ORFC. WT indicated the wild-type strain; YSG50-C indicated the ORFC heterologous strain. Fig. S3. A Genomic PCR products of DH and ER domains in the wild-type strain. B Plasmids construction validation by NotI and ApaI digestion. p-ER: ER-overexpression plasmid; p-DH: DH-overexpression plasmid. C Genomic PCR of Zeo expression cassette. P indicates positive control, N indicates the wild-type strain, DH+ indicates DH-overexpression strain, ER+ indicates ER-overexpression strain. Fig. S4 Gene copies of A DH and B ER domain in the wild-type and engineered strain, respectively, by qPCR analysis. All data are expressed as mean ± SD of three independent experiments. Fig. S5. Permutation test for the OPLS-DA model: A wild-type strain and DH-overexpressed strain and B wild-type strain and ER-overexpressed strain. Fig. S6. Metabolomics profiling by GC-MS reveals divergent metabolic phenotypes. A DH-overexpression strain compared with the wild-type strain. B ER-overexpression strain compared with the wild-type strain. Each node denotes an identified metabolite (red, up-regulated; blue, down-regulated; p < 0.05 by a two-tailed Student’s t-test). Node size reflects median fold change. Fig. S7. Heatmap of significant genes (log2 (fold change)) and their enriched pathways compared to the wild-type strain. Left frame: DH-overexpression strain; right frame: ER-overexpression strain. Table S2. Fatty acids composition analysis of domains located in ORFC overexpression strains

Additional file 2: Table S3.

DEGs between WT vs DH+ and DH+ vs ER+. WT, DH+ and ER+ indicate the wild-type strain, DH-overexpression strain and ER-overexpressed strain, respectively.

Additional file 3: Table S4.

DEGs among WT vs DH+, WT vs ER+ and DH+ vs ER+. WT, DH+ and ER+ indicate the wild-type strain, DH-overexpression strain and ER-overexpressed strain, respectively.

Additional file 4: Table S5.

DEGs between WT vs DH+ and WT vs ER+. WT, DH+ and ER+ indicate the wild-type strain, DH-overexpression strain and ER-overexpressed strain, respectively.

Additional file 5: Table S6.

The specific values of significant genes (log2 (fold change)) and their enriched pathways compared to the wild-type strain. WT, DH+ and ER+ indicate the wild-type strain, DH-overexpression strain and ER-overexpressed strain, respectively. FC means fold change.

Additional file 6: Table S7.

Gene expression of DEGs in WT vs DH+, WT vs ER+ and DH+ vs ER+, respectively. WT, DH+ and ER+ indicate the wild-type strain, DH-overexpression strain and ER-overexpressed strain, respectively.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Shi, Y., Chen, Z., Li, Y. et al. Function of ORFC of the polyketide synthase gene cluster on fatty acid accumulation in Schizochytrium limacinum SR21. Biotechnol Biofuels 14, 163 (2021).

Download citation


  • Polyunsaturated fatty acids
  • Dehydratase
  • Enoyl reductase
  • Transcriptomics
  • Metabolomics
  • Schizochytrium limacinum