Genome sequence of Ensifer sp. TW10; a Tephrosia wallichii (Biyani) microsymbiont native to the Indian Thar Desert
- Nisha Tak1,
- Hukam S. Gehlot1,
- Muskan Kaushik1,
- Sunil Choudhary1,
- Ravi Tiwari2,
- Rui Tian2,
- Yvette Hill2,
- Lambert Bräu3,
- Lynne Goodwin4,
- James Han5,
- Konstantinos Liolios5,
- Marcel Huntemann5,
- Krishna Palaniappan6,
- Amrita Pati5,
- Konstantinos Mavromatis5,
- Natalia Ivanova5,
- Victor Markowitz6,
- Tanja Woyke5,
- Nikos Kyrpides5 and
- Wayne Reeve2Email author
© The Author(s) 2013
Published: 20 December 2013
Ensifer sp. TW10 is a novel N2-fixing bacterium isolated from a root nodule of the perennial legume Tephrosia wallichii Graham (known locally as Biyani) found in the Great Indian (or Thar) desert, a large arid region in the northwestern part of the Indian subcontinent. Strain TW10 is a Gram-negative, rod shaped, aerobic, motile, non-spore forming, species of root nodule bacteria (RNB) that promiscuously nodulates legumes in Thar Desert alkaline soil. It is fast growing, acid-producing, and tolerates up to 2% NaCl and capable of growth at 40oC. In this report we describe for the first time the primary features of this Thar Desert soil saprophyte together with genome sequence information and annotation. The 6,802,256 bp genome has a GC content of 62% and is arranged into 57 scaffolds containing 6,470 protein-coding genes, 73 RNA genes and a single rRNA operon. This genome is one of 100 RNB genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.
The Great Indian (or Thar) Desert is a large, hot, arid region in the northwestern part of the Indian subcontinent. It is the 18th largest desert in the world covering 200,000 square km with 61% of its landmass occupying Western Rajasthan. The landscape occurs at low altitude (<1500 m above sea level) and extends from India into the neighboring country of Pakistan . The Thar Desert region is characterized by low annual precipitation (50 to 300 mm), high thermal load and alkaline soils that are poor in texture and fertility . Despite these harsh conditions, the Thar Desert has very rich plant diversity in comparison to other desert landscapes . Approximately a quarter of the plants in the Thar Desert are used to provide animal fodder or food, fuel, medicine or shelter for local inhabitants .
The Indian Thar desert harbors several native and exotic plants of the Leguminoseae family  including native legume members of the sub-families Caesalpinioideae, Mimosoideae and Papilionoideae that have adapted to the harsh Thar desert environment . The Papilionoid genus Tephrosia can be found throughout this semi-arid to arid environment and these plants are among the first to grow after monsoonal rains. The generic name is derived from the Greek word “tephros” meaning “ash-gray” since dense trichomes on the leaves provide a greyish tint to the plant. Many species within this genus produce the potent toxin rotenone, which historically has been used to poison fish. It is a perennial shrub that has adapted to the harsh desert conditions by producing a long tap root system and dormant auxillary shoot buds.
Recently, the root nodule bacteria (RNB) microsymbionts capable of fixing nitrogen in symbiotic associations with Tephrosia have been characterized . Both Bradyrhizobium and Ensifer were present within nodules, but a particularly high incidence of Ensifer was noted . Ensifer was found to occupy the nodules of all four species of Tephrosia examined . Here we present a preliminary description of the general features of the T. wallichii (Biyani) microsymbiont Ensifer sp. TW10 together with its genome sequence and annotation.
Classification and general features of Ensifer sp. TW10 according to the MIGS recommendations 
Species Ensifer sp.
Soil, root nodule, on host
Free living, symbiotic
Root nodule of Tephrosia wallichii
Jodhpur, Indian Thar Desert
Soil collection date
Classification and general features
Compatibility of Ensifer sp. TW10 with different wild and cultivated legume species
Tephrosia falciformis Ramaswami
Tephrosia purpurea(L.) Pers. sub sp.leptostachya DC.
Tephrosia purpurea (L.) Pers. sub sp.purpurea (L.) Pers
Tephrosia villosa (Linn.) Pres.
Prosopis cineraria(Linn.) Druce.
Mimosa hamata Willd.
M. himalayana Gamble
Vigna radiata (L.) Wilczek
Vigna aconitifolia(Jacq.) Marechal
Vigna unguiculata(L.) Walp.
Macroptilium atropurpureum(DC.) Urb.
Genome sequencing and annotation
Genome project history
Genome sequencing project information for Ensifer sp. strain TW10.
1× Illumina library
Allpaths, LG version r42328, Velvet 1.1.04
Gene calling methods
Genbank Date of Release
NCBI project ID
Symbiotic N2 fixation, agriculture
Growth conditions and DNA isolation
Ensifer sp. TW10 was cultured to mid logarithmic phase in 60 ml of TY rich medium  on a gyratory shaker at 28°C. DNA was isolated from the cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method .
Genome sequencing and assembly
The genome of Ensifer sp. TW10 was generated at the Joint Genome Institute (JGI) using Illumina  technology. An Illumina std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 14,938,244 reads totaling 2,241 Mbp.
All general aspects of library construction and sequencing performed at the JGI can be found at the JGI website . All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland, A, and Han, J, unpublished).
The following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet  (version 1.1.04), (2) 1–3 kb simulated paired end reads were created from Velvet contigs using wgsim (https://github.com/lh3/wgsim), and (3) Illumina reads were assembled with simulated read pairs using Allpaths-LG (version r42328) . Parameters for assembly steps were: 1) Velvet (velveth: 63 -shortPaired and velvetg: -veryclean yes -exportFiltered yes -mincontiglgth 500 -scaffolding no-covcutoff 10) 2) wgsim (-e 0 -1 100 -2 100 -r 0 -R 0 -X 0) 3) Allpaths-LG (PrepareAllpathsInputs:PHRED64=1 PLOIDY=1 FRAGCOVERAGE=125 JUMPCOVERAGE=25 LONGJUMPCOV=50, RunAllpath-sLG: THREADS=8 RUN=stdshredpairs TARGETS=standard VAPIWARNONLY=True OVERWRITE=True). The final draft assembly contained 57 contigs in 57 scaffolds. The total size of the genome is 6.8 Mbp and the final assembly is based on 2241Mbp of Illumina data, which provides an average 330× coverage of the genome.
Genes were identified using Prodigal  as part of the DOE-JGI annotation pipeline . The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. The tRNAScanSE tool  was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA . Other non-coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL . Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes (IMG) platform) [34,35].
Genome Statistics for Ensifer sp. TW10
% of Total
Genome size (bp)
DNA coding region (bp)
DNA G+C content (bp)
Number of scaffolds
Number of contigs
Genes with function prediction
Genes assigned to COGs
Genes assigned Pfam domains
Genes with signal peptides
Genes with transmembrane helices
Number of protein coding genes of Ensifer sp. TW10 associated with the general COG functional categories.
Translation, ribosomal structure and biogenesis
RNA processing and modification
Replication, recombination and repair
Chromatin structure and dynamics
Cell cycle control, mitosis and meiosis
Signal transduction mechanisms
Cell wall/membrane biogenesis
Intracellular trafficking and secretion
Posttranslational modification, protein turnover, chaperones
Energy production conversion
Carbohydrate transport and metabolism
Amino acid transport metabolism
Nucleotide transport and metabolism
Coenzyme transport and metabolism
Lipid transport and metabolism
Inorganic ion transport and metabolism
Secondary metabolite biosynthesis, transport and catabolism
General function prediction only
Not in COGS
This work was performed under the auspices of the US Department of Energy’s Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under contract No. DE-AC02-05CH11231, Lawrence Livermore National Laboratory under Contract No. DE-AC52-07NA27344, and Los Alamos National Laboratory under contract No. DE-AC02-06NA25396. We gratefully acknowledge funding received from the Murdoch University Strategic Research Fund through the Crop and Plant Research Institute (CaPRI), the GRDC National Rhizobium Program (UMU00032), the Council of Scientific and Industrial Research (CSIR) for a fellowship for Nisha Tak, the Department of Biotechnology (India) for a research grant (BT/PR11461/AGR/21/270/2008) and the Commonwealth of Australia for an Australia India Senior Visiting Fellowship for Ravi Tiwari.
- Sprent JI, Gehlot HS. Nodulated legumes in arid and semi-arid environments: are they important? Plant Ecol Divers 2010; 3:211–219. http://dx.doi.org/10.1080/17550874.2010.538740View ArticleGoogle Scholar
- Bhandari MM. Flora of the Indian desert. Jodhpur: MPS Repros; 1990. 435 p.Google Scholar
- Mohammed S, Kasera PK, Shukla JK. Unexploited plants of potential medicinal value from the Indian Thar Desert. Natural Product Radiance 2004; 3:69–74.Google Scholar
- Sen DN. Non-conventional food and some medicinal plant resources of Indian Desert. In: Purkayashtha RP, editor. Economic plants and microbes: Today and Tomorrow’s Printers and Publishers, New Delhi; 1991. p 67–76.Google Scholar
- Gehlot HS, Panwar D, Tak N, Tak A, Sankhla IS, Poonar N, Parihar R, Shekhawat NS, Kuma M, Tiwari R, et al. Nodulation of legumes from the Thar Desert of India and molecular characterization of their rhizobia. Plant Soil 2012; 357:227–243. http://dx.doi.org/10.1007/s11104-012-1143-5View ArticleGoogle Scholar
- Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen M, Angiuoli SV, et al. Towards a richer description of our complete collection of genomes and metagenomes “Minimum Information about a Genome Sequence” (MIGS) specification. Nat Biotechnol 2008; 26:541–547. PubMed http://dx.doi.org/10.1038/nbt1360PubMed CentralView ArticlePubMedGoogle Scholar
- Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA 1990; 87:4576–4579. PubMed http://dx.doi.org/10.1073/pnas.87.12.4576PubMed CentralView ArticlePubMedGoogle Scholar
- Garrity GM, Bell JA, Lilburn T. Phylum XIV. Proteobacteria phyl. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 2, Part B, Springer, New York, 2005, p. 1.View ArticleGoogle Scholar
- Garrity GM, Bell JA, Lilburn T. Class I. Alphaproteobacteria class. nov. In: Garrity GM, Brenner DJ, Krieg NR, Staley JT (eds), Bergey’s Manual of Systematic Bacteriology, Second Edition, Volume 2, Part C, Springer, New York, 2005, p. 1.View ArticleGoogle Scholar
- Validation List No. 107. List of new names and new combinations previously effectively, but not validly, published. Int J Syst Evol Microbiol 2006; 56:1–6. PubMed http://dx.doi.org/10.1099/ijs.0.64188-0
- Kuykendall LD. Order VI. Rhizobiales ord. nov. In: Garrity GM, Brenner DJ, Kreig NR, Staley JT, editors. Bergey’s Manual of Systematic Bacteriology. Second ed: New York: Springer-Verlag; 2005. p 324.Google Scholar
- Skerman VBD, McGowan V, Sneath PHA. Approved Lists of Bacterial Names. Int J Syst Bacteriol 1980; 30:225–420. http://dx.doi.org/10.1099/00207713-30-1-225View ArticleGoogle Scholar
- Conn HJ. Taxonomic relationships of certain non-sporeforming rods in soil. J Bacteriol 1938; 36:320–321.Google Scholar
- Casida LE. Ensifer adhaerens gen. nov., sp. nov.: a bacterial predator of bacteria in soil. Int J Syst Bacteriol 1982; 32:339–345. http://dx.doi.org/10.1099/00207713-32-3-339View ArticleGoogle Scholar
- Young JM. The genus name Ensifer Casida 1982 takes priority over Sinorhizobium Chen et al. 1988, and Sinorhizobium morelense Wang et al. 2002 is a later synonym of Ensifer adhaerens Casida 1982. Is the combination Sinorhizobium adhaerens (Casida 1982) Willems et al. 2003 legitimate? Request for an Opinion. Int J Syst Evol Microbiol 2003; 53:2107–2110. PubMed http://dx.doi.org/10.1099/ijs.0.02665-0View ArticlePubMedGoogle Scholar
- Judicial Commission of the International Committee on Systematics of Prokaryotes. The genus name Sinorhizobium Chen et al. 1988 is a later synonym of Ensifer Casida 1982 and is not conserved over the latter genus name, and the species name ‘Sinorhizobium adhaerens’ is not validly published. Opinion 84. Int J Syst Evol Microbiol 2008; 58:1973. PubMed http://dx.doi.org/10.1099/ijs.0.2008/005991-0View ArticleGoogle Scholar
- Agents B. Technical rules for biological agents. TRBA (http://www.baua.de):466.
- Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 25:25–29. PubMed http://dx.doi.org/10.1038/75556PubMed CentralView ArticlePubMedGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol 2011; 28:2731–2739. PubMed http://dx.doi.org/10.1093/molbev/msr121PubMed CentralView ArticlePubMedGoogle Scholar
- Nei M, Kumar S. Molecular Evolution and Phylogenetics. New York: Oxford University Press; 2000.Google Scholar
- Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 1985; 39:783–791. http://dx.doi.org/10.2307/2408678View ArticleGoogle Scholar
- Liolios K, Mavromatis K, Tavernarakis N, Kyrpides NC. The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res 2008; 36:D475–D479. PubMed http://dx.doi.org/10.1093/nar/gkm884PubMed CentralView ArticlePubMedGoogle Scholar
- Vincent JM. A manual for the practical study of the root-nodule bacteria. International Biological Programme. UK: Blackwell Scientific Publications, Oxford; 1970.Google Scholar
- Gehlot HS, Tak N, Kaushik M, Mitra S, Chen WM, Poweleit N, Panwar D, Poonar N, Parihar R, Tak A, et al. An invasive Mimosa in India does not adopt the symbionts of its native relatives. Ann Bot (Lond) 2013; 112:179–196. PubMed http://dx.doi.org/10.1093/aob/mct112View ArticleGoogle Scholar
- Reeve WG, Tiwari RP, Worsley PS, Dilworth MJ, Glenn AR, Howieson JG. Constructs for insertional mutagenesis, transcriptional signal localization and gene regulation studies in root nodule and other bacteria. Microbiology 1999; 145:1307–1316. PubMed http://dx.doi.org/10.1099/13500872-145-6-1307View ArticlePubMedGoogle Scholar
- DOE Joint Genome Institute user home.http://my.jgi.doe.gov/general/index.html
- Bennett S. Solexa Ltd. Pharmacogenomics 2004; 5:433–438. PubMed http://dx.doi.org/10.1517/146224188.8.131.523View ArticlePubMedGoogle Scholar
- Zerbino DR. Using the Velvet de novo assembler for short-read sequencing technologies. Current Protocols in Bioinformatics 2010; Chapter 11:Unit 11 5.Google Scholar
- Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, Sharpe T, Hall G, Shea TP, Sykes S, et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci USA 2011; 108:1513–1518. PubMed http://dx.doi.org/10.1073/pnas.1017351108PubMed CentralView ArticlePubMedGoogle Scholar
- Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 2010; 11:119. PubMed http://dx.doi.org/10.1186/1471-2105-11-119PubMed CentralView ArticlePubMedGoogle Scholar
- Mavromatis K, Ivanova NN, Chen IM, Szeto E, Markowitz VM, Kyrpides NC. The DOE-JGI Standard operating procedure for the annotations of microbial genomes. Stand Genomic Sci 2009; 1:63–67. PubMed http://dx.doi.org/10.4056/sigs.632PubMed CentralView ArticlePubMedGoogle Scholar
- Pruesse E, Quast C, Knittel K. Fuchs BdM, Ludwig W, Peplies J, Glöckner FO. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res 2007; 35:7188–7196. PubMed http://dx.doi.org/10.1093/nar/gkm864PubMed CentralView ArticlePubMedGoogle Scholar
- INFERNAL. http://infernal.janelia.org
- Markowitz VM, Mavromatis K, Ivanova NN, Chen IM, Chu K, Kyrpides NC. IMG ER: a system for microbial genome annotation expert review and curation. Bioinformatics 2009; 25:2271–2278. PubMed http://dx.doi.org/10.1093/bioinformatics/btp393View ArticlePubMedGoogle Scholar
- DOE Joint Genome Institute. (http://img.jgi.doe.gov/er)