Table 1 Summary of bagasse fosmid pyrosequencing data

From: Comparative analysis of sugarcane bagasse metagenome reveals unique and conserved biomass-degrading enzymes among lignocellulolytic microbial communities

Raw reads

| Dataset | Number of sequences | Number of nucleotides | Sequence length: Average | SD | Minimum | Maximum |
|---|---|---|---|---|---|---|
| 1. Raw reads | 1,038,205 | 591,656,071 | 569.9 | 173.3 | 40 | 1,595 |
| 2. Read screen repeats | 982,383 | 569,556,388 | 579.8 | 164.7 | 40 | 1,595 |
| 3. Read screen repeats and trim vector | 726,980 | 421,491,438 | 579.8 | 166.0 | 40 | 1,595 |

Assembled sequences

| Dataset | Number of sequences | Number of nucleotides | Sequence length: Average | SD | Minimum | Maximum |
|---|---|---|---|---|---|---|
| 1. Contigs | 17,829 | 32,867,905 | 1,843.5 | 2,394.6 | 100 | 46,577 |
| 2. Singletons (non-redundant) | 185,543 | 109,290,202 | 589.0 | 163.5 | 40 | 1,595 |

  1. The bagasse fosmid library was sequenced on one full lane of the 454 GS-FLX Titanium, yielding approximately one million raw reads. Reads containing contaminating vector or host genome sequences were removed before contig assembly and removal of redundant sequences.
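
The summary columns in Table 1 are related by simple arithmetic: the average sequence length is the number of nucleotides divided by the number of sequences. The following is a minimal Python sketch that recomputes those averages from the reported totals and the fraction of raw reads remaining after screening. The dictionary and function names (`TABLE_1`, `mean_length`, `retained_fraction`) are hypothetical and chosen for illustration; this is not the authors' pipeline, only a consistency check on the published numbers.

```python
# Values copied from Table 1:
# dataset -> (number of sequences, number of nucleotides, reported average length)
TABLE_1 = {
    "Raw reads": (1_038_205, 591_656_071, 569.9),
    "Read screen repeats": (982_383, 569_556_388, 579.8),
    "Read screen repeats and trim vector": (726_980, 421_491_438, 579.8),
    "Contigs": (17_829, 32_867_905, 1_843.5),
    "Singletons (non-redundant)": (185_543, 109_290_202, 589.0),
}


def mean_length(n_sequences: int, n_nucleotides: int) -> float:
    """Average sequence length = total nucleotides / number of sequences."""
    return n_nucleotides / n_sequences


def retained_fraction(n_sequences: int, n_raw: int) -> float:
    """Fraction of the raw reads still present in a screened dataset."""
    return n_sequences / n_raw


if __name__ == "__main__":
    n_raw = TABLE_1["Raw reads"][0]
    for name, (n_seq, n_nt, reported) in TABLE_1.items():
        computed = mean_length(n_seq, n_nt)
        print(f"{name}: computed {computed:.1f}, reported {reported}")

    # Only the read-level datasets are compared against the raw read count.
    for name in ("Read screen repeats", "Read screen repeats and trim vector"):
        frac = retained_fraction(TABLE_1[name][0], n_raw)
        print(f"{name}: {frac:.1%} of raw reads retained")
```

Each recomputed average agrees with the reported value to one decimal place, and roughly 70% of the raw reads remain after both screening steps.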