1 Department of Applied and Environmental Microbiology, Research Institute of Agro-Environmental Sciences, National Agricultural Research and Innovation Centre, H-1022 Budapest, Herman Ottó út 15. Hungary
| 2 Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, H-6726 Szeged, Temesvári krt. 62. Hungary
| 4 Laboratory of Microbiology; Agricultural Biotechnology Institute, National Agricultural Research and Innovation Centre, H-2100 Gödöllő, Szent-Györgyi Albert utca 4. Hungary
| 5 Department of Environmental Safety and Ecotoxicology, Institute of Aquaculture and Environmental Safety, Faculty of Agricultural and Environmental Sciences, Szent István University, H-2100 Gödöllő, Páter Károly utca 1. Hungary
| 6 Department of Molecular Structural Biology, Max Planck Institute of Biochemistry, Am Klopferspitz 18., D-82152 Martinsried. Germany
Thermobifidas are thermophilic, aerobic, lignocellulose decomposing actinomycetes. The Thermobifida genus includes four species: T. fusca, T. alba, T. cellulosilytica, and T. halotolerans. T. fusca YX is the far best characterized strain of this taxon and several cellulases and hemicellulases have been cloned from it for industrial purposes targeting paper industry, biofuel, and feed applications. Unfortunately, sequence data of such enzymes are almost exclusively restricted to this single species; however, we demonstrated earlier by zymography that other T. alba and T. cellulosilytica strains encode the same enzyme sets. Recently, the advances in whole genome sequencing by the use of next generation genomics platforms accelerated the selection process of valuable hydrolases from uncharacterized bacterial species for cloning purposes. For this purpose T. cellulosilytica TB100T type strain was chosen for de novo genome sequencing. We have assembled the genome of T. cellulosilytica strain TB100T into 168 contigs and 19 scaffolds, with reference length of 4 327 869 bps, 3 589 putative coding sequences, 53 tRNAs, and 4 rRNAs. The analysis of the annotated genome revealed the existence of 27 putative hydrolases belonging to 14 different glycoside hydrolase (GH) families. The investigation of identified, cloned, and heterologously multiple cellulases, mannanases, xylanases, and amylases may result in industrial applications beside gaining useful basic research related information.