Uncovering the molecular mechanisms of lignocellulose digestion in shipworms
- Federico Sabbadin†1,
- Giovanna Pesante†1,
- Luisa Elias1,
- Katrin Besser1,
- Yi Li1,
- Clare Steele-King1,
- Meg Stark2,
- Deborah A. Rathbone3,
- Adam A. Dowle2,
- Rachel Bates2,
- J. Reuben Shipway4,
- Simon M. Cragg5,
- Neil C. Bruce1 and
- Simon J. McQueen-Mason1Email author
© The Author(s) 2018
Received: 2 December 2017
Accepted: 21 February 2018
Published: 7 March 2018
Lignocellulose forms the structural framework of woody plant biomass and represents the most abundant carbon source in the biosphere. Turnover of woody biomass is a critical component of the global carbon cycle, and the enzymes involved are of increasing industrial importance as industry moves away from fossil fuels to renewable carbon resources. Shipworms are marine bivalve molluscs that digest wood and play a key role in global carbon cycling by processing plant biomass in the oceans. Previous studies suggest that wood digestion in shipworms is dominated by enzymes produced by endosymbiotic bacteria found in the animal’s gills, while little is known about the identity and function of endogenous enzymes produced by shipworms. Using a combination of meta-transcriptomic, proteomic, imaging and biochemical analyses, we reveal a complex digestive system dominated by uncharacterized enzymes that are secreted by a specialized digestive gland and that accumulate in the cecum, where wood digestion occurs. Using a combination of transcriptomics, proteomics, and microscopy, we show that the digestive proteome of the shipworm Lyrodus pedicellatus is mostly composed of enzymes produced by the animal itself, with a small but significant contribution from symbiotic bacteria. The digestive proteome is dominated by a novel 300 kDa multi-domain glycoside hydrolase that functions in the hydrolysis of β-1,4-glucans, the most abundant polymers in wood. These studies allow an unprecedented level of insight into an unusual and ecologically important process for wood recycling in the marine environment, and open up new biotechnological opportunities in the mobilization of sugars from lignocellulosic biomass.
Large amounts of wood from terrestrial plants enter the marine environment and support complex ecosystems. Tropical mangrove swamps, for example, provide safe nurseries that support fisheries  and are among the most productive ecosystems on the planet. It has been shown that around 70% of dead wood in mangroves is processed by the action of wood-boring bivalve molluscs known as shipworms [2, 3]. Shipworms acquired their name due to their devastating effects on wooden ships prior to the advent of copper-bottoming, a process developed to protect wooden vessels from their attack. Shipworms continue to destroy timber structures and docks around the world but, despite their ecological, historical and economic importance, the process by which these animals digest wood remains poorly understood.
Meta-transcriptomic analysis of L. pedicellatus and its endosymbionts
In vitro activity assays carried out with cecum fluids of L. pedicellatus against a panel of substrates revealed a complex enzymatic cocktail with activity against many polysaccharides associated with lignocellulosic biomass (Fig. 1b). Compositional analysis of wood and shipworm feces (frass) revealed that more than 40% of the cellulose content and lesser amounts of hemicellulose are removed while passing through the shipworm digestive system (Fig. 1c). While compelling evidence has shown that gill bacteriocytes are the main site for the production of bacterial CAZymes in shipworms, several authors have also hypothesized that the digestive gland might be responsible for the synthesis of endogenous enzymes, while the cecum appears to be the site of wood breakdown and could potentially be involved in sugar uptake [2, 4]. In order to identify the key genes involved in wood digestion and absorption of breakdown products in L. pedicellatus, we performed meta-transcriptome sequencing from the main organs putatively involved in wood digestion (digestive gland, cecum, and gills) (Fig. 1a, Additional file 1: Table S1) using healthy adult L. pedicellatus growing in blocks of Scots pine submerged in sea water.
Our gene expression analysis reveals that the shipworm digestive gland is the major site of transcription of endogenous lignocellulolytic enzymes in L. pedicellatus, all carrying a predicted signal peptide for secretion (identified using SingalP). BlastX and functional domain annotation show that the most highly transcribed CAZyme genes in the digestive gland encode putative glycoside hydrolases (GHs) belonging to GH9, GH45, GH1, GH13, GH2, GH18, GH31, GH5, GH10, and GH38 families (Fig. 1d, Additional file 1: Table S2).
Very few bacterial transcripts were detected in the cecum samples, confirming previous reports of the virtual absence of live bacteria in this organ . Only two sequences sharing similarity to putative GH30s from Bacillus species were found to be expressed at relatively high levels in the cecum (Fig. 1e, Additional file 1: Table S3). However, manual sequence alignment revealed that the two contigs are actually part of one unique transcript featuring a putative polyadenylation (polyA) tail at the 3′ terminus, and have orthologues in the annotated genomes of model bivalve molluscs (XP_011456351.1 from Crassostrea gigas, XP_021348230.1 from Mizuhopecten yessoensis), whose intron-exon structure strongly suggests an endogenous nature. This GH30 gene might thus be the result of an ancient horizontal gene transfer (HGT) from bacteria, a phenomenon previously observed for several GH families in multiple invertebrate genomes . The shipworm cecum also features high expression of endogenous GH1s, GH13s, and GH2s (Fig. 1e, Additional file 1: Table S3).
RNA-Seq data show that virtually all bacterial genes found in the shipworm meta-transcriptome are transcribed in the gills, with the most abundant being from GH family 6, 11, 5 and auxiliary activity (AA) family 10 lytic polysaccharide monooxygenases (LPMOs) (Fig. 1f, Additional file 1: Table S4). All these bacterial CAZymes carry a putative N-terminal signal peptide likely involved in protein translocation through the periplasm for secretion and have best BlastX matches to sequences from gammaproteobacteria of the family Alteromonadaceae (mainly Teredinibacter and Saccharophagus species). The identification of the gills as the main site of expression of bacterial CAZymes is in line with previous work from the shipworm B. setacea, where GH5s and GH6s were found to be the dominant CAZymes produced by gill bacteria .
Comparison of the digestive gland transcriptome from L. pedicellatus and Crassostrea gigas
Proteomic analysis of the shipworm cecum content
Interestingly, although GH9s and GH1s are abundant in both transcriptome and proteome, the relative abundance of GH45s is high in the transcriptomic data but lower in the proteome, where it accounts for only 1.5% in molar percentage of total CAZymes identified. The major bacterial contributions to the proteome are provided by GH11, 10 and 5 proteins, which typically function as xylanases, mannanases and endo-glucanases. In B. setacea, bacterial GH5s and GH6 were reported to account for over 30% of the total protein content of the cecum , but here only make up 0.5% of the total proteome (less than 2% of all CAZymes).
Isolation and characterization of the multi-domain GH1 from L. pedicellatus (LpMDGH1)
A modular protein (CjCel1A) comprising 2 sequential GH1s, reminiscent of the mammalian LPH, was shown to be produced in the digestive gland of the clam Corbicula japonica . Based on amino acid sequence, it was hypothesized that the anterior part (first GH1 domain) of the protein from Corbicula is not active as a glycoside hydrolase and might instead work as a chaperone, in a similar fashion to the mammalian LPH . Alignment of the six putative GH1 modules of the L. pedicellatus protein reveals that domains 2, 4, 5 and 6 possess the required amino acids for hydrolytic activity (regions “NE” and “TENG” in the protein alignment), while domains 1 and 3 lack these residues, are unlikely to have GH activity, and might thus be involved in protein folding or perhaps substrate interactions (Additional file 1: Figure S3).
A BlastP search of the full-length LpMDGH1 against non-redundant (nr) databases finds best matches among molluscs (e.g., Lottia gigantea) and reveals that LPH-like sequences are also present in insects, reptiles, birds, amphibians, mammals and fish (Fig. 4b), but not in bacteria, fungi and some animal taxa (e.g., crustaceans). The multi-domain GH1s from vertebrates typically feature a putative C-terminal transmembrane region (Additional file 1: Figure S4), likely involved in anchoring the mature protein to the outer face of the cell plasma membrane. Although our analysis indicates that the lack of this transmembrane region is a common trait among multi-domain GH1s from invertebrates (Additional file 1: Figure S4), the six-domain architecture appears unique to the shipworm protein and might, therefore, represent a specific adaptation towards wood digestion.
Despite our best efforts, we could not obtain soluble LpMDGH1 (nor any of its GH1 domains separately) in the heterologous expression systems we tested. However, we carried out size-exclusion chromatography of soluble cecum extracts and successfully isolated the mature LpMDGH1 to high purity. Zymograms and in vitro assays with the purified enzyme showed that it is the major β-glucosidase in L. pedicellatus cecum (Fig. 4c). Interestingly, the enzyme showed activity towards both short-chain gluco-oligosaccharides and long-chain glucans (glucomannan, β-glucan, and lichenan), with preference towards β-1,4 linkages, suggesting roles in the digestion of cellulose and hemicelluloses (Fig. 4d). Such a release of sugars from complex glucans was not seen in assays with a commercially sourced single-module GH1 from Agrobacteria (data not shown) and might be an unusual feature of the multi-modular protein. MALDI-TOF MS analysis of the reaction products revealed that LpMDGH1 releases medium- and long-chain oligosaccharides from glucans, suggesting that this enzyme might also have endo-glucanase activity (Additional file 1: Figure S5). Interestingly, a soluble 210 kDa enzyme isolated from the digestive fluids of the sea hare Aplysia kurodai was shown to share high similarity with the human lactase phlorizin hydrolase (based on N-terminal protein sequencing) . Although the authors did not manage to isolate the coding sequence nor localize the organ where it was produced, they showed that the purified enzyme can hydrolyze gluco-oligosaccharides as well as complex polysaccharides (lichenan, laminarin, and cardran), thus suggesting a key role in the digestion of sea lettuce by the sea hare . The protein size, in vitro activity, and sequence similarity to mammalian LPH suggest that the enzyme from the sea hare has four GH1 domains and plays a similar role to the MDGH1 from L. pedicellatus.
Anatomy of the digestive system
Although the cecum contains most of the ingested wood particles, the digestive gland has long been hypothesized to be involved in production of digestive enzymes in shipworms and other molluscs [15–22], and our meta-transcriptomic and proteomics data confirm this is the case for Lyrodus. The gland has a lobular structure reminiscent of secretory organs in other animals and is directly connected to the stomach by ducts [4, 22]. Our high-resolution microscopic analyses reveal that the gland contains secretory cells, as previous observed in other molluscs [19–21]. These specialized cells occupy the crypts of the tubule, are mostly pyramidal, and have a well-developed granular endoplasmic reticulum (Fig. 5c) and an extensive Golgi apparatus, producing numerous vesicle-like structures (Fig. 5d) of variable sizes potentially containing glycoside hydrolases and other secretory enzymes identified in our studies. The digestive gland also features abundant amoeboid cells (phagocytes, ~20 μm in diameter) with pseudopodia and internalized wood particles of variable dimensions (Fig. 5e, f). EM images show numerous vesicles budding from the Golgi apparatus and apparently fusing with the cell wall of the cavities containing the internalized wood fragments (Fig. 5f), suggesting the formation of lysosomes that might be responsible for intracellular wood breakdown. Although wood phagocytes have been previously reported in shipworms with hand drawings , this is the first high-resolution image of these cells and provides new evidence of their role in lignocellulose digestion. This suggests that, while the cecum provides the major site of wood digestion, it may be supplemented by intracellular digestion. Indeed, the cecum appears to be a specific adaptation of shipworms and has not been reported in other bivalves, where most digestive processes are restricted to the gland and intestinal systems.
Lignocellulose represents the most abundant and ubiquitous organic material in nature. The breakdown of this recalcitrant polymer plays a critical role in the global carbon cycle, and is attracting growing interest from a biotechnological perspective. As society moves away from the use of net greenhouse gas-emitting fossil resources, the use of surplus woody biomass to provision fuels, chemicals, and materials is becoming imperative. Understanding the digestive systems of major wood-digesting animals provides insights and enzymes that could help towards the cost-effective breakdown of lignocellulosic biomass into simple sugars and other building blocks. We have undertaken a detailed and multifaceted study of the digestive system of L. pedicellatus, and revealed the complex molecular mechanisms of lignocellulose digestion in shipworms. Our work shows that the digestive gland of L. pedicellatus produces a complex enzymatic mixture containing most of the activities required for the digestion of the plant cell wall, including cellulases (endo-β-1,4-glucanases, β-glucosidases) and hemicellulases (β-mannanases/mannosidases, β-xylanases/xylosidases). Although the meta-transcriptome of L. pedicellatus lacks endogenous cellobiohydrolases (CBHs), it contains bacterial GH6s (which can act as CBHs) expressed in the gills, as previously observed in B. setacea . Similarly, we also detected expression of bacterial lytic polysaccharide monooxygenases (LPMOs), which likely synergize endogenous as well as bacterial glucanases. Indeed, previous work has shown that LPMOs can insert breaks into highly crystalline polysaccharides and, by doing so, boost the activity of glycoside hydrolases by several orders of magnitude [23–25]. The surprisingly low levels of both GH6 and LPMO mature proteins in the shipworm cecum, however, suggest that the corresponding transcripts might not be translated efficiently, or that the mature enzymes are unstable in the shipworm digestive tract. In support of this, our cecum proteomics analysis revealed abundant endogenous proteases, which might reduce the half-life of some bacterial enzymes.
Our work in L. pedicellatus, and previous studies on wood-feeding cockroaches, beetles, and termites [26–28], suggest that a combination of endogenous and symbiotic enzymes is optimal for efficient plant cell wall digestion in invertebrates. Interestingly, the genomes of insects, crustaceans, annelids and molluscs encode numerous enzymes involved in plant cell wall digestion, implying that some of these genes were present in the last common ancestor of bilaterian animals , before bacterial symbioses developed, and likely represent an ancestral mechanism for lignocellulose digestion. Our analysis of the digestive gland transcriptome from shipworm and oyster confirms that molluscs share a complex array of endogenous lignocellulolytic enzymes, and that their expression levels are adapted to their specific diets. C. gigas (oyster) is characterized by high expression of glycoside hydrolases and sulfatases involved in the deconstruction of sulfated polysaccharides that are abundant in marine algae (as an adaptation to highly ionic environment)  but not in fresh water algae and terrestrial plants. Our data show that L. pedicellatus relies mostly on GH9s, GH45s, and GH1s to breakdown terrestrial woody plants, and the same is likely true for most shipworm species, which typically feed on submerged wood. There are, however, some notable exceptions. For example, Zachsia zenkewitschi feeds on the rhizomes of seagrasses such as Zostera, one of the few examples of marine angiosperms to have regained the ability to produce sulfated polysaccharides . Even more puzzling is the giant mud-boring teredinid Kuphus polythalamia, where sulfur-oxidizing bacteria have replaced the ancestral cellulolytic symbionts in the gills . Further work is needed to elucidate the function of the digestive gland in this enigmatic chemoautotrophic bivalve, which represents a unique example of a shipworm that has entirely lost the ability to digest plant biomass.
Previous studies in the shipworm B. setacea suggested that glycoside hydrolases produced by endosymbiotic bacteria located in the gills dominate the shipworm digestive system . The apparent discrepancies in results between our study and that by O’Connor et al.  may be in part due to differences between species. Both belong to the family Teredinidae, rely on a diet of wood, and share some key anatomical features, including digestive glands, extended cecum, and the presence of analogous bacterial populations in the gill bacteriocytes (mostly gammaproteobacteria related to Saccharophagus degradans). Therefore, it would be of interest to determine what factors underpin the differences between our results and those reported by O’Connor et al. . It is worth noting that in the study by O’Connor et al.  the cecum proteomic data were reported to be searched specifically against bacterial DNA extracted from the gills, which would have precluded identification of proteins produced by the animal itself. Indeed, our proteomics studies reveal that the bulk of the CAZymes found in the cecum of L. pedicellatus is secreted by a specialized digestive gland, while bacterial enzymes coming from the gills play a supporting role. The size of the cecum and abundance of GHs found there suggest that this is the major site of wood digestion in shipworms. Yet, our ultrastructural studies indicate a potential role in wood digestion for ameboid cells in the digestive gland. Although intracellular wood digestion is unusual, it has also been inferred from microscopic analysis of commensal ciliates found in the digestive system of lower termites, which appear to engulf wood particles .
By investigating the cecum meta-proteome, we have discovered that the most abundant enzyme in the L. pedicellatus digestive tract is an unusual multi-modular GH1 with sequence similarity to the lactase phlorizin hydrolase found in mammals. In contrast to the mature peptide in the mammalian LPH, which has two GH1 domains linked by a short peptide and is mostly active as a β-glucosidase, the L. pedicellatus MDGH1 has six domains, and our studies have revealed its specificity against β-linked glucosides as well as complex glucans, suggesting the ability to cleave these linkages in cellulose and glucomannans during wood digestion. Based on the presence of key amino acid residues predicted to be involved in substrate attack, we would expect four of the six GH1 domains in the LpMDGH1 to be active glycoside hydrolases. Future work is needed to clarify if the endo- and exo-activities observed in LpMDGH1 depend on the distinct GH1 domains or rather on the fusion nature of the polyprotein, and whether the two putatively inactive GH1 domains play any role in the two mechanisms of action.
The sustainability of biorefineries hinges on the identification of the most effective enzymatic cocktails for the saccharification of plant biomass. Our investigation into the digestive system of L. pedicellatus uncovers a wide range of new glycoside hydrolases attacking major fractions of lignocellulose (cellulose, xylans, and mannans) and unprecedented information regarding their relative abundance, which could help engineer an optimal enzymatic cocktail for the breakdown of lignocellulose. In this context, the identification of LpMDGH1 as the major enzyme in the shipworm’s cecum is particularly interesting, as one of the major bottlenecks in the industrial breakdown of plant biomass is the ability to prevent the accumulation of cellobiose, a potent inhibitor of endo-glucanases and cellobiohydrolases . Shipworms seem to have overcome this issue by mass producing a unique multi-domain hydrolase with dual activity towards long-chain glucans and cellobiose, an elegant evolutionary solution that may help simplify the enzymatic cocktails used in cellulosic biorefineries.
This work is the first comprehensive investigation of the complex molecular and physiological processes by which shipworms extract nutrients from wood and play a fundamental role in the global carbon recycling. The identification of the key enzymes produced by the shipworm L. pedicellatus, and the in vitro characterization of the most abundant glycoside hydrolase found in its digestive system, may open up new opportunities in the biotechnological deconstruction of lignocellulose in support of a sustainable bio-economy.
Phosphoric acid swollen cellulose (PASC) was prepared as follows. 5 g of Avicel ® PH-101 was moistened with water and treated with 150 mL ice cold 85% phosphoric acid, stirred on an ice bath for 1 h. Then 500 mL cold acetone was added while stirring. The swollen cellulose was filtered on a glass-filter funnel and washed 3 times with 100 mL ice cold acetone and subsequently twice with 500 mL water. PASC was then suspended in 500 mL water and blended to homogeneity.
High-purity pachyman (β-d-1,3-glucan), barley β-glucan (β-d-1,3-1,4-glucan), lichenan (from Icelandic moss, β-d-1,3-1,4-glucan), mannan (borohydride reduced), konjac glucomannan (β-d-1,4), carob galactomannan, larch arabinogalactan, wheat arabinoxylan, cellotriose, cellotetraose, cellopentaose, cellohexaose, mannobiose, and xylobiose were purchased from Megazyme. Locust bean gum, carboxymethyl cellulose (CMC), beechwood xylan, and cellobiose were purchased from Sigma. 5-bromo-4-chloro-3-indolyl-β-d-cellobioside was purchased from Santa Cruz Biotechnology.
Adult Lyrodus pedicellatus, matching the sequences of cytochrome c oxidase subunit I and small subunit rRNA 18S of the Atlantic population of Lyrodus as per Borges et al. , were obtained from an infested pier composed of greenheart wood (Chlorocardium rodiei) at Portsmouth, UK. Adults were harvested and dissected in order to yield the larvae. Subsequent cultures were reared in aquaria at the Institute of Marine Sciences, University of Portsmouth. Seawater was taken directly from Langstone Harbour, maintained at a temperature between 15–18 °C and a salinity of 33 PSU, and kept aerated throughout. Tanks were regularly provided with small panels of Scot pine wood for larval settlement.
mRNA extraction, preparation, and sequencing
Digestive gland, cecum and gills were dissected from three healthy adult L. pedicellatus and total RNA was extracted using TRIzol® Reagent (Thermo Fisher Scientific). Samples were DNase treated using Turbo DNA-free (Ambion) before quantification using a Qubit Fluorometer (Thermo Fisher Scientific). Ribosomal RNA depletion was carried out with a RiboZero™ Magnetic Gold Kit (Epidemiology) (Epicentre). mRNA was then concentrated using RNA Clean & Concentrator™-5 (Zymo Research). RNA-Seq libraries were prepared from each mRNA sample as per the Ion Total RNA-Seq kit v2 (Thermo Fisher Scientific), using an RNaseIII treatment time of 2.5–3 min. Samples were barcoded using the Ion Xpress RNA-Seq Barcode kit (Thermo Fisher Scientific). Yields and library sizes were assessed using the High Sensitivity D1K screentapes and reagents on a 2200 TapeStation Nucleic Acids System (Agilent Technologies). Appropriately diluted library aliquots were combined in pairs in equimolar amounts and used for template preparation using the Ion OneTouch 200 Template Kit v2 DL on a OneTouch system (Thermo Fisher Scientific) prior to loading onto a 318 chip and sequenced on an Ion Torrent PGM™ prepared as per the manufacturer’s instructions (IonPGM200Kit; Thermo Fisher Scientific). All raw sequence data were deposited in NCBI under BioProject PRJNA412369 (SRA files: SRR6106265, SRR6106266, SRR6106267, SRR6106268, SRR6106269, SRR6106270, SRR6106271, SRR6106272, SRR6106273). Assembled contigs are available from the authors upon request.
Transcriptome assembly, sequence annotation, and identification of putative CAZymes
After removing the primer sequences and low-quality reads from raw EST sequencing reads, the EST sequences from three tissues (digestive gland, cecum, and gills) from three healthy adults were assembled into unigene contigs using Trinity . The contigs from the three animals were then assembled into supercontigs with the CAP3 DNA Sequence Assembly Program . Raw reads were mapped onto the transcriptomes of the three molluscs and normalized expression values (TPM = Transcripts per kilobase Million) were calculated for each transcript using Salmon (part of the Galaxy toolshed) . Although all three animals provided raw reads with high quality and could be used to assemble a reference transcriptome, TPM values from one animal were found to be poorly correlated with the other two (possibly as a result of illness or distress) and were therefore excluded from the following analysis. Average TPM values for each contig in the three tissues (digestive gland, cecum, and gills) were thus calculated from the two healthy animals. The top 10,000 assembled contigs (ranked according to the cumulative number of mapped ESTs from all organs) were annotated with the BLASTx algorithm  to search against non-redundant (nr) peptide database downloaded from the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). CAZy annotation was carried out using the CAZYmes Analysis Toolkit (CAT) on the BioEnergy Science Center website (http://mothra.ornl.gov/cgi-bin/cat/cat.cgi). Sequences annotated as glycosyltransferases (GTs) and carboxyl esterases, mostly involved in intracellular processes not relevant to lignocellulose digestion, were excluded from the analysis. Among auxiliary activity (AA) families, only those clearly involved in polysaccharide degradation were considered (LPMO families AA9, AA10, AA11, and AA13).
Contigs were converted to putative ORFs using the online tool Emboss (http://www.bioinformatics.nl/cgi-bin/emboss/getorf), putative N-terminal signal peptides were predicted with SingalP (http://www.cbs.dtu.dk/services/SignalP/), and putative transmembrane regions were predicted using TMHMM (http://www.cbs.dtu.dk/services/TMHMM/).
Raw transcriptome sequencing data for the digestive gland of a wildly caught C. gigas (Japanese oyster) were retrieved from the EBI portal (run Accession SRR334213) . The published transcriptome of C. gigas (based on the annotated genome) was retrieved from NCBI (Accession PRJNA276446). Raw reads were mapped onto the transcriptome of the mollusc and normalized expression values were calculated for each transcript using Salmon (part of the Galaxy toolshed) . The identity and relative abundance of the CAZyme families were then compared to those obtained from the digestive gland of L. pedicellatus. CAZy annotation was carried out using the CAZYmes Analysis Toolkit (CAT) on the BioEnergy Science Center website (http://mothra.ornl.gov/cgi-bin/cat/cat.cgi).
The ceca from five animals grown on Scots pine were dissected in 50 mM sodium phosphate buffer pH 7 and the content (food particles and enzymes) was collected, pooled together, added with 1% SDS, beta-mercapto ethanol, DTT, boiled for 10 min, centrifuged, and the supernatant run into a 10% polyacrylamide gel to a depth of 1 cm, before staining with Coomassie.
In-gel tryptic digestion was performed post reduction with DTE and S-carbamidomethylation with iodoacetamide. Resulting peptides were analyzed by label-free LC-MS/MS over a 125-min gradient using a Waters nanoAcquity UPLC interfaced to a Bruker maXis HD mass spectrometer as detailed in . Protein identification was performed by searching tandem mass spectra against the assembled transcriptome of L. pedicellatus using the Mascot search program. Matches were passed through Mascot percolator to achieve a false discovery rate of < 1% and further filtered to accept only peptides with expect scores of 0.05 or better. Molar percentages were calculated from Mascot emPAI values by expressing individual values as a percentage of the sum of all emPAI values in the sample . Proteins identified in the proteomics analysis were annotated via Blastx versus non-redundant NCBI databases. CAZy annotation was carried out using the CAZYmes Analysis Toolkit (CAT) on the BioEnergy Science Center website (http://mothra.ornl.gov/cgi-bin/cat/cat.cgi).
One aliquot of the cecum extract (content only) and the purified LpMDGH1 was added with 1% SDS, beta-mercapto ethanol, DTT, boiled for 10 min, centrifuged, and the supernatant run in a 4–20% gradient polyacrylamide gel. After Coomassie staining, the most abundant band (with an approximate molecular weight of 300 kDa) was excised and in-gel digested as described for LC-MS/MS samples.
A 1 μL aliquot of peptide mixture was applied directly to a ground steel MALDI target plate and overlaid with an equal volume of a 5 mg/mL 4-hydroxy-α-cyano-cinnamic acid in 50% aqueous (v:v) acetonitrile containing 0.1% trifluoroacetic acid (v:v). Positive-ion MALDI mass spectra were obtained using a Bruker Ultraflex III in reflectron mode, equipped with a Nd:YAG smart beam laser. MS spectra were acquired over a range of m/z 800–4000. The ten most intense precursors with S/N greater than 30 were selected for MS/MS fragmentation in LIFT mode without collision gas. The default calibration was used for MS/MS spectra, which were baseline-subtracted and smoothed (Savitsky–Golay, width 0.15 m/z, cycles 4); monoisotopic peak detection used a SNAP averaging algorithm (C 4.9384, N 1.3577, O 1.4773, S 0.0417, H 7.7583) with a minimum S/N of 6. Bruker flexAnalysis software (version 3.3) was used for spectral processing and peak list generation. Tandem mass spectral data were submitted to database searching using a locally running copy of the Mascot program (Matrix Science Ltd., version 2.5), through the Bruker ProteinScape interface (version 2.1). Search criteria included enzyme, trypsin; fixed modifications, carbamidomethyl (C); variable modifications, oxidation (M), deamidated (N, Q); peptide tolerance, 100 ppm; MS/MS tolerance, 0.5 Da; Instrument, MALDI-TOF–TOF. Peptide matches were filtered to require expect scores of 0.05 or better.
Cloning the LpMDGH1 cDNA
The native sequence (from start to stop codon) for LpMDGH1 was cloned from cDNA generated from RNA extracted from the digestive gland of L. pedicellatus using external oligonucleotide primers designed on the assembled contig from the transcriptome. Total RNA was extracted from one animal using the TRIzol ® Reagent (Thermo Fisher Scientific) and cDNA was generated with an oligodT primer using SuperScript ® II reverse transcriptase (Thermo Fisher Scientific). PCR reactions were then set up using Phusion® High-Fidelity DNA Polymerase (Thermo Fisher Scientific) and the amplicon was cloned into an auxiliary plasmid using the StrataClone Blunt PCR Cloning Kit (Stratagene) and the correct sequence was verified with the Sanger method using internal primers. Open reading frames (ORFs) were calculated using the online EXPASY tool “Translate” and confirmed that the cloned sequence codes for a unique polypeptide of 2752 amino acid residues (without internal stop codons). The sequence has been deposited in GenBank with Accession No. MG013499.
Phylogeny and sequence analysis of LpMDGH1
The LpMDGH1 protein sequence was searched via BlastP against NCBI non-redundant databases and orthologues from molluscs, insects, fish, amphibians, reptiles, birds, and mammals were retrieved. The resulting amino acid sequences were aligned using Muscle , operating with default parameters. A distance matrix was made with Mega6  using a Jones–Taylor–Thornton matrix and a phylogenetic tree was then calculated by the maximum likelihood algorithm and standard parameters. The resulting tree was visualized using Dendroscope .
Purification of LpMDGH1
The ceca of twenty L. pedicellatus specimens were dissected in 50 mM sodium phosphate buffer pH 7 and the content (wood particles and enzymes) was collected and centrifuged. The supernatant was then filtered with 0.22-μm syringe filters and applied to a Superose™ 6 Increase 10/300 GL size-exclusion chromatography column (GE Healthcare) pre-equilibrated with 20 mM Tris-HCl pH 8 plus 100 mM NaCl. Eluted fractions were analyzed by denaturing SDS-PAGE and those corresponding to the LpMDGH1 were pooled, concentrated using Microsep™ Advance Centrifugal Filters (Pall Laboratory, 100 kDa cut-off), and re-applied to the same column pre-equilibrated with 20 mM sodium phosphate buffer pH 7 plus 100 mM NaCl. Protein purity was again assessed via SDS-PAGE analysis.
Enzymatic assays and zymograms
Activity of the cecum fluids on a panel of polysaccharides and oligosaccharides was determined by DNS reducing sugar assay . Briefly, ten ceca were dissected in 50 mM sodium phosphate buffer pH 7 and the content fully re-suspended by pipetting. After centrifugation, the soluble portion (supernatant) was filtered through 0.22-μm porous membranes, quantified with the Bradford  reagent and used for assays. 50 μL reactions were carried out in 96-well plates in 50 mM sodium phosphate buffer pH 6 with either 1187 ng of total soluble cecum protein or 237 ng of purified GH1, and 1 mg mL−1 polysaccharide or 2.5 mM oligosaccharide. All reactions, including controls, were performed in triplicate. The micro-plate was incubated at 28 °C shaking at 320 rpm for 24 h, and then 100 μL of DNS reagent was added to each reaction before heating at 100 °C for 5 min. Absorbance at 540 nm was measured with a micro-plate reader and nanomoles of reducing sugars released were determined based on absorbance obtained with glucose standards. The DNS reagent was prepared by mixing 0.75 g of dinitrosalicylic acid, 1.4 g NaOH, 21.6 g sodium potassium tartrate tetrahydrate, 0.53 mL phenol, and 0.59 g sodium metabisulfite in 100 mL pure water.
Zymograms were performed as follows. 1.9 μg of total protein from the cecum fluids and purified LpMDGH1 were run in a non-denaturing SDS-PAGE gel (4–20% Mini-PROTEAN® TGX™ Precast Protein Gel, Biorad). The gel was then incubated in 20 mL of 20 mM sodium phosphate buffer pH 6 plus 2.5% Triton X100 for 30 min, and then washed twice in 20 mL of 20 mM sodium phosphate buffer pH 6 for 10 min. The gel was then incubated for 4 h in 20 mL of 20 mM sodium phosphate buffer pH 6 with 0.01% (w/v) of 5-bromo-4-chloro-3-indolyl-β-d-cellobioside to allow formation of the insoluble dye.
Compositional analysis of wood and frass
Untreated Scots Pine (powdered) and dried frass (obtained from L. pedicellatus grown on submerged panels of Scots Pine) were analyzed for cellulose, hemicellulose, and lignin content in five technical replicates each, using the methods reported in Marriot et al. .
Transmission electron microscopy
Dissected shipworm tissue (cecum, digestive gland) was fixed for 1–2 h at ambient temperature in primary fixative (4% formaldehyde (w/v), 2.5% (w/v) glutaraldehyde in 100 mM sodium phosphate buffer pH 7.2), and then washed (3 × 10 min) in 100 mM sodium phosphate buffer pH 7.2 before incubation in secondary fixative for 1 h on ice (1% osmium tetroxide in 100 mM sodium phosphate buffer pH 7.2). Samples were dehydrated through a graded ethanol series (15 min each), followed by two washes (5 min each) in epoxy propane. Samples were infiltrated with a series of epoxy propane/Epon araldite (25%, 50%, 75% Epon Araldite with a minimum of 1 h at each stage, all at 30°C) concluding with a minimum of two changes of Epon araldite resin over 24 h at 30 °C, and polymerized at 60 °C for 48 h in flat embedding molds. Pale gold (70–90 nm) ultra-thin sections were cut with a Diatome diamond knife, using a Leica Ultracut UCT microtome, and mounted on hexagonal 200-mesh nickel grids. Sections were post-stained with 2% (w/v) aqueous uranyl acetate (10 min), then with lead citrate (5 min)  in a carbon dioxide-free chamber and viewed using a FEI Tecnai 12 BioTWIN G2 TEM operating at 120 kV. Images were captured using AnalySIS software and a Megaview III CCD camera.
FS, GP, JRS, SMC, NCB, and SJM designed research. FS, GP, LE, and KB performed research. CS and MS contributed to TEM sample preparations and imaging; AAD and RB contributed to proteomic sample preparation and data analysis; FS, DAR, and YL analyzed transcriptomic data; FS, NCB, and SJM wrote the paper. All authors reviewed and commented on the manuscript. All authors read and approved the final manuscript.
This work is funded by the UK Biotechnology and Biological Sciences Research Council (Grant Number BB/L001926/1).
The authors declare that they have no competing interests.
The datasets supporting the conclusions of this article are available in NCBI repository, under BioProject PRJNA412369 (SRA files: SRR6106265, SRR6106266, SRR6106267, SRR6106268, SRR6106269, SRR6106270, SRR6106271, SRR6106272, SRR6106273). The LpMDGH1 sequence can be accessed through GenBank Accession No. MG013499.
Ethics approval and consent to participate
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Feller IC, Lovelock CE, Berger U, McKee KL, Joye SB, Ball MC. Biocomplexity in mangrove ecosystems. Ann Rev Mar Sci. 2010;2:395–417.View ArticleGoogle Scholar
- Turner RD. A survey and illustrated catalogue of the Teredinidae (Mollusca: Bivalvia). Cambridge: The Museum of Comparative Zoology, Harvard University; 1966.View ArticleGoogle Scholar
- Cragg SM. Marine wood boring invertebrates of New Guinea and its surrounding waters. In: Beehler BM, Marshall AJ, editors. The ecology of Papua. Singapore: Periplus; 2007. p. 539–63.Google Scholar
- Morton B. The functional anatomy of the organs of feeding and digestion of Teredo navalis Linnaeus and Lyrodus pedicellatus (Quatrefages). Proc Malac Soc Lond. 1970;39:151–67.Google Scholar
- Betcher MA, Fung JM, Han AW, O’Connor R, Seronay R, Concepcion GP, Distel DL, Haygood MG. Microbial distribution and abundance in the digestive system of five shipworm species (Bivalvia: Teredinidae). PLoS ONE. 2012;7:e45309.View ArticleGoogle Scholar
- O’Connor RM, Fung JM, Sharp KH, Benner JS, McClung C, Cushing S, Lamkin LR, Fomenkov AI, Henrissat B, Londer YY, Scholz MB, Posfai J, Malfatti S, Tringe SG, Woyke T, Malmstrom RR, Coleman-Derr D, Altamia MA, Dedrick S, Kaluziak ST, Haygood MG, Distel DL. Gill bacteria enable a novel digestive strategy in a wood-feeding mollusk. Proc Natl Acad Sci USA. 2014;111:E5096–104.View ArticleGoogle Scholar
- Wybouw N, Pauchet Y, Heckel DG, Van Leeuwen T. Horizontal gene transfer contributes to the evolution of arthropod herbivory. Genome Biol Evol. 2016;8:1785–801.View ArticleGoogle Scholar
- Miller GL. Use of dinitrosalicylic acid reagent for determination of reducing sugar. Anal Chem. 1959;31:426–8.View ArticleGoogle Scholar
- Pomin VH, Mourão PA. Structure, biology, evolution, and medical importance of sulfated fucans and galactans. Glycobiology. 2008;18:1016–27.View ArticleGoogle Scholar
- Ponis E, Probert I, Veron B, Mathieu M, Robert R. New microalgae for the Pacific oyster Crassostrea gigas larvae. Aquaculture. 2006;253:618–27.View ArticleGoogle Scholar
- Naim HY, Naim H. Dimerization of lactase-phlorizin hydrolase occurs in the endoplasmic reticulum, involves the putative membrane spanning domain and is required for an efficient transport of the enzyme to the cell surface. Eur J Cell Biol. 1996;70:198–208.Google Scholar
- Jacob R, Peters K, Naim HY. The prosequence of human lactase-phlorizin hydrolase modulates the folding of the mature enzyme. J Biol Chem. 2002;277:8217–25.View ArticleGoogle Scholar
- Sakamoto K, Uji S, Kurokawa T, Toyohara H. Molecular cloning of endogenous beta-glucosidase from common Japanese brackish water clam Corbicula japonica. Gene. 2009;435:72–9.View ArticleGoogle Scholar
- Tsuji A, Tominaga K, Nishiyama N, Yuasa K. Comprehensive enzymatic analysis of the cellulolytic system in digestive fluid of the Sea Hare Aplysia kurodai. Efficient glucose release from sea lettuce by synergistic action of 45 kDa endoglucanase and 210 kDa ß-glucosidase. PLoS ONE. 2013;8:e65418.View ArticleGoogle Scholar
- Sakamoto K, Touhata K, Yamashita M, Kasai A, Toyohara H. Cellulose digestion by common Japanese freshwater clam Corbicula japonica. Fish Sci. 2007;73:675–83.View ArticleGoogle Scholar
- Sakamoto K, Toyohara H. Putative endogenous xylanase from brackish-water clam Corbicula japonica. Comp Biochem Physiol Part B Biochem Mol Biol. 2009;154:85–92.View ArticleGoogle Scholar
- Sakamoto K, Uji S, Kurokawa T, Toyohara H. Immunohistochemical, in situ hybridization and biochemical studies on endogenous cellulase of Corbicula japonica. Comp Biochem Physiol Part B Biochem Mol Biol. 2008;150:216–21.View ArticleGoogle Scholar
- Owen G. The fine structure of the digestive tubules of the marine bivalve Cardium edule. Phil Trans Royal Soc Lond. 1970;258:245–60.View ArticleGoogle Scholar
- Pal SG. The fine structure of the digestive tubules of Mya arenaria L: I. Basiphil cell. Proc Malac Soc Lond. 1971;39:303–9.Google Scholar
- Pal SG. The fine structure of the digestive tubules of Mya arenaria L. II. Digestive cell. Proc Malac Soc Lond. 1972;40:161–70.Google Scholar
- Taїeb N. Distribution of digestive tubules and fine structure of digestive cells of Aplysia punctate (Cuvier, 1803). J Molluscan Stud. 2000;67:169–82.View ArticleGoogle Scholar
- Potts FA. The structure and function of the liver of Teredo, the shipworm (biological sciences). Proc Camb Phil Soc. 1923;1:1–17.Google Scholar
- Vaaje-Kolstad G, Westereng B, Horn SJ, Liu Z, Zhai H, Sørlie M, Eijsink VG. An oxidative enzyme boosting the enzymatic conversion of recalcitrant polysaccharides. Science. 2010;330:219–22.View ArticleGoogle Scholar
- Quinlan RJ, Sweeney MD, Lo Leggio L, Otten H, Poulsen JC, Johansen KS, Krogh KB, Jørgensen CI, Tovborg M, Anthonsen A, Tryfona T, Walter CP, Dupree P, Xu F, Davies GJ, Walton PH. Insights into the oxidative degradation of cellulose by a copper metalloenzyme that exploits biomass components. Proc Natl Acad Sci USA. 2011;108:15079–84.View ArticleGoogle Scholar
- Leggio LL, Simmons TJ, Poulsen JC, Frandsen KE, Hemsworth GR, Stringer MA, von Freiesleben P, Tovborg M, Johansen KS, De Maria L, Harris PV, Soong CL, Dupree P, Tryfona T, Lenfant N, Henrissat B, Davies GJ, Walton PH. Structure and boosting activity of a starch-degrading lytic polysaccharide monooxygenase. Nat Commun. 2015;6:5961. https://doi.org/10.1038/ncomms6961.View ArticleGoogle Scholar
- Watanabe H, Tokuda G. Cellulolytic systems in insects. Annu Rev Entomol. 2010;55:609–32.View ArticleGoogle Scholar
- Brune A. Symbiotic digestion of lignocellulose in termite guts. Nat Rev Microbiol. 2014;12:168–80.View ArticleGoogle Scholar
- Franco Cairo JPL, Carazzolle MF, Leonardo FC, Mofatto LS, Brenelli LB, Gonçalves TA, Uchima CA, Domingues RR, Alvarez TM, Tramontina R, Vidal RO, Costa FF, Costa-Leonardo AM, Paes Leme AF, Pereira GAG, Squina FM. Expanding the knowledge on lignocellulolytic and redox enzymes of worker and soldier castes from the lower termite Coptotermes gestroi. Front Microbiol. 2016;7:1518. https://doi.org/10.3389/fmicb.2016.01518.View ArticleGoogle Scholar
- Calderón-Cortés N, Quesada M, Watanabe H, Cano-Camacho H, Oyama K. Endogenous plant cell wall digestion: a key mechanism in insect evolution. Annu Rev Ecol Evol Syst. 2012;43:45–71.View ArticleGoogle Scholar
- Popper ZA, Michel G, Hervé C, Domozych DS, Willats WG, Tuohy MG, Kloareg B, Stengel DB. Evolution and diversity of plant cell walls: from algae to flowering plants. Annu Rev Plant Biol. 2011;62:567–90.View ArticleGoogle Scholar
- Shipway JR, O’Connor R, Stein D, Cragg SM, Korshunova T, Martynov A, Haga T, Distel DL. Zachsia zenkewitschi (Teredinidae), a rare and unusual seagrass boring bivalve revisited and redescribed. PLoS ONE. 2016. https://doi.org/10.1371/journal.pone.0155269.Google Scholar
- Distel DL, Altamia MA, Lin Z, Shipway JR, Han A, Forteza I, Antemano R, Limbaco MGJP, Tebo AG, Dechavez R, Albano J, Rosenberg G, Concepcion GP2, Schmidt EW, Haygood MG. Discovery of chemoautotrophic symbiosis in the giant shipworm Kuphus polythalamia extends wooden-steps theory. Proc Natl Acad Sci USA. 2017;114:E3652–8.View ArticleGoogle Scholar
- Kiuchi I, Moriya S, Kudo T. Two different size-distributions of engulfment-related vesicles among symbiotic protists of the lower termite, Reticulitermes speratus. Microbes Environ. 2004;19:211–4.View ArticleGoogle Scholar
- Sørensen A, Lübeck M, Lübeck PS, Ahring BK. Fungal beta-glucosidases: a bottleneck in industrial use of lignocellulosic materials. Biomolecules. 2013;3:612–31.View ArticleGoogle Scholar
- Borges LMS, Sivrikaya H, le Roux CA, Shipway JR, Cragg SM, Costa FO. Investigating the taxonomy and systematics of marine wood borers (Bivalvia: Teredinidae) combining evidence from morphology, DNA barcodes and nuclear locus sequences. Invertebr Syst. 2012;26:572–82.View ArticleGoogle Scholar
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat Biotechnol. 2011;29:644–52.View ArticleGoogle Scholar
- Huang X, Madan A. CAP3: a DNA sequence assembly program. Genome Res. 1999;9:868–77.View ArticleGoogle Scholar
- Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14:417–9.View ArticleGoogle Scholar
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.View ArticleGoogle Scholar
- Zhang G, Fang X, Guo X, Li L, Luo R, Xu F, Yang P, Zhang L, Wang X, Qi H, Xiong Z, Que H, Xie Y, Holland PW, Paps J, Zhu Y, Wu F, Chen Y, Wang J, Peng C, Meng J, Yang L, Liu J, Wen B, Zhang N, Huang Z, Zhu Q, Feng Y, Mount A, Hedgecock D, Xu Z, Liu Y, Domazet-Lošo T, Du Y, Sun X, Zhang S, Liu B, Cheng P, Jiang X, Li J, Fan D, Wang W, Fu W, Wang T, Wang B, Zhang J, Peng Z, Li Y, Li N, Wang J, Chen M, He Y, Tan F, Song X, Zheng Q, Huang R, Yang H, Du X, Chen L, Yang M, Gaffney PM, Wang S, Luo L, She Z, Ming Y, Huang W, Zhang S, Huang B, Zhang Y, Qu T, Ni P, Miao G, Wang J, Wang Q, Steinberg CE, Wang H, Li N, Qian L, Zhang G, Li Y, Yang H, Liu X, Wang J, Yin Y, Wang J. The oyster genome reveals stress adaptation and complexity of shell formation. Nature. 2012;490:49–54.View ArticleGoogle Scholar
- Dowle AA, Wilson J, Thomas JR. Comparing the diagnostic classification accuracy of iTRAQ, peak-area, spectral-counting, and emPAI methods for relative quantification in expression proteomics. J Proteome Res. 2016;10:3550–62.View ArticleGoogle Scholar
- Ishihama Y, Oda Y, Tabata T, Sato T, Nagasu T, Rappsilber J, Mann M. Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein. Mol Cell Proteom. 2005;4:1265–72.View ArticleGoogle Scholar
- Notredame C, Higgins DG, Heringa J. T-Coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000;302:205–17.View ArticleGoogle Scholar
- Clamp M, Cuff J, Searle SM, Barton GJ. The jalview java alignment editor. Bioinformatics. 2004;20:426–7.View ArticleGoogle Scholar
- Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.View ArticleGoogle Scholar
- Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.View ArticleGoogle Scholar
- Huson DH, Scornavacca C. Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst Biol. 2012;61:1061–7.View ArticleGoogle Scholar
- Bradford MM. A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal Biochem. 1976;72:248–54.View ArticleGoogle Scholar
- Marriott PE, Sibout R, Lapierre C, Fangel JU, Willats WG, Hofte H, Gómez LD, McQueen-Mason SJ. Range of cell-wall alterations enhance saccharification in Brachypodium distachyon mutants. Proc Natl Acad Sci USA. 2014;111:14601–6.View ArticleGoogle Scholar
- Reynolds ES. The use of lead citrate at high pH as an electron opaque stain in electron microscopy. J Cell Biol. 1963;17:208–12.View ArticleGoogle Scholar