Skip to main content

Table 1 Genomic information of the five Anaerolinea genomes retrieved from thermophilic cellulose-degrading metagenomes

From: Cellular adhesiveness and cellulolytic capacity in Anaerolineae revealed by omics-based genome interpretation

General information

TCF-2

TCF-5

TCF-12

TCF-8

TCF-13

A. thermophila UNI-1

IMG Genome ID

2561511051

2561511052

2561511055

2561511056

2561511053

N/A

Total length (Mb)

3.8

3.0

3.7

4.1

4.0

3.5

Scaffolds (n)

55

27

69

51

153

1

Average sequence length (bp)

71,706

11,837

56,472

84,553

27,370

N/A

GC (%)

54

55

53

64

65

54

Estimated completeness based on ESCGa (%)

99.1

98.1

100.0

99.1

100.0

99.1

Estimated redundancy of ESCGs (%)

3.8

1.9

3.7

2.8

3.7

2.8

Genes

3602

3005

3580

3681

3659

3224

Protein-coding genes

3550

2949

3523

3634

3609

3166

rRNA genes (5S/16S/23S)

2/2/0

1/1/1

1/1/1

3/1/0

1/1/0

2/2/2

tRNA gene (n)

43

45

48

41

43

50

Genes with functional prediction (%)

81.4

83.2

80.4

80.8

80.5

53.7

Genes with transcription (%)

27.8

35.4

18.1

1.8

2.8

N/A

Duplicated AAb (n/ %)

814 (22.9 %)

734 (24.9 %)

921 (26.1 %)

932 (10.8 %)

893 (24.7 %)

757 (23.9 %)

  1. N/A data not available
  2. a Completeness estimation based on 107 conserved single-copy genes, named as essential single-copy genes (ESCGs) of >95 % complete bacterial genomes [13]
  3. b Putative duplication among amino acid (AA) sequences within each chromosome (based on BLASTP bitscore ≥70, similarity ≥30 over at least 70 % of the query length [66])