SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at...

6
SPOUT: a Class of Methyltransferases that Includes spoU and trmD RNA Methylase Superfamilies, and Novel Superfamilies of Predicted Prokaryotic RNA Methylases Vivek Anantharaman, Eugene V. Koonin and L. Aravind* National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA Nucleotide modification is a major source of structural and biochemical diversity of RNAs, particularly of rRNAs and tRNAs (Rozenski et al. , 1999). Base- specific methylation is one of the prevalent modes of post-transcriptional modification along with pseudo- uridylation and thio-uridylation. The catalytic domains of the majority of RNA methylases belong to the vast large superclass of Rossman-fold enzymes (Lo Conte et al., 2000). These Rossman-fold methy- lases (RFM) bind S-adenosyl-L-methionine (SAM) via the nucleotide-binding loop typical of this fold and catalyze the transfer of its methyl group to substrates such as DNA, RNA, proteins or small molecules. However, two families of RNA methylases, typified, respectively, by Escherichia coli proteins SpoU (TrmH) and TrmD, appear to be unrelated to the RFMs. SpoU family proteins have been characterized as tRNA (G 18 ) methyltransferases in E. coli and S. cerevisiae (Cavaille et al. , 1999; Koonin et al., 1993; Persson et al., 1997), whereas TrmD is responsible for the bacteria- specific tRNA methylation at the G 37 position (Bjork et al. , 1989; Li et al., 1999). Here, we provide evidence for a common evolutionary origin of the TrmD and SpoU methylase superfamilies that were previously considered unrelated. We show that the SpoU superfamily encompasses a greater diversity of (predicted) methylases than previously appreciated and contains two previously undetected families that are specific to the archaea and thermophilic bacteria. Additionally, we identify two previously unnoticed superfamilies of proteins from archaea and bacteria, respectively, that are predicted to be methylases related to both the TrmD and SpoU superfamilies. These four superfamilies together define a new structural scaffold for RNA methylase activity. As part of a systematic survey of the enzymes involved in RNA metabolism (VA, EVK and LA, unpublished), we investigated the SpoU and TrmD families by iterative searches of the Non-redundant database (National Center for Biotechnology Informa- tion, NIH, Bethesda) using the PSI-BLAST program (Altschul et al., 1997). Such a search (profile inclusion expectation (E) value threshold of 0.01) seeded with the E. coli SpoU sequence detected not only the obvious SpoU orthologs and previously described homologs (Koonin, 1996) from bacteria, archaea, and eukaryotes, but also several uncharacterized proteins from archaea (typified by MJ1385) and the thermophilic bacteria Aquifex and Thermotoga (typified by aq_054). These proteins showed significant E-values (10 3 10 7 ) on first detection and retained the conservation pattern typical of the SpoU proteins (Figure 1), suggesting that they were bona fide members of this superfamily. Additionally, in these searches we detected several uncharacterized archaeal proteins, typified by AF2226, and the TrmD methylases with marginally significant E-values (.06-.1). A reciprocal search started with the AF2226 sequence recovered the TrmD methylases and SpoU superfamily members with similar borderline E-values. Another confirmatory PSI- BLAST search initiated with the TrmD ortholog from Aquifex aeolicus (2983865) detected not only the AF2226-like archaeal proteins, but also members of a highly conserved superfamily of predominantly bac- terial proteins, typified by YbeA from E. coli , with significant E-values (<0.001). The TrmD methylases, the SpoU methylases, and the newly detected AF2226- like, and YbeA-like proteins showed striking conserva- tion in the region predicted to be the SAM-binding loop in the SpoU superfamily (Koonin, 1996) (Figure 1). Furthermore, in the archaeal AF2226-like proteins, the putative methylase domain is fused with a recently described RNA-binding domain, THUMP, that has also been found fused to classic, Rossmann-fold RNA methylases, thiouridine synthases and pseudouridine synthases (Aravind et al., 2001). These observations suggested that the SpoU superfamily, the TrmD superfamily, and the newly detected AF2226-like and YbeA-like superfamilies, belong to a class of methyltransferases with a common structural fold that does not include any structurally characterized proteins and is distinct from the classic Rossmann fold. To investigate this relationship further, we searched the entire set of members of the SpoU, TrmD, YbeA and AF2226 superfamilies for potential conserved motifs using the Gibbs sampling procedure (Neuwald et al., 1997). Two motifs were detected, one corresponding to the predicted SAM-binding loop and the other located directly C-terminal of it, with the *For correspondence. Email. [email protected] J. Mol. Microbiol. Biotechnol. (2002) 4(1): 71–75. JMMB Communication # 2002 Horizon Scientific Press

Transcript of SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at...

Page 1: SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at the G37 position (Bjork et al.,1989; Li et al., 1999). Here, we provide ... Current

SPOUT: a Class of Methyltransferases that IncludesspoU and trmD RNA Methylase Superfamilies,and Novel Superfamilies of Predicted ProkaryoticRNA Methylases

Vivek Anantharaman, Eugene V. Kooninand L. Aravind*

National Center for Biotechnology Information,National Library of Medicine, National Institutes ofHealth, Bethesda, MD 20894, USA

Nucleotide modification is a major source of structuraland biochemical diversity of RNAs, particularly ofrRNAs and tRNAs (Rozenski et al., 1999). Base-specific methylation is one of the prevalent modes ofpost-transcriptional modification along with pseudo-uridylation and thio-uridylation. The catalytic domainsof the majority of RNA methylases belong to the vastla rge superc lass o f Rossman- fo ld enzymes(Lo Conte et al., 2000). These Rossman-fold methy-lases (RFM) bind S-adenosyl-L-methionine (SAM) viathe nucleotide-binding loop typical of this fold andcatalyze the transfer of its methyl group to substratessuch as DNA, RNA, proteins or small molecules.However, two families of RNA methylases, typified,respectively, by Escherichia coli proteins SpoU (TrmH)and TrmD, appear to be unrelated to the RFMs. SpoUfamily proteins have been characterized as tRNA(G18) methyltransferases in E. coli and S. cerevisiae(Cavaille et al., 1999; Koonin et al., 1993; Perssonet al., 1997), whereas TrmD is responsible for thebacteria- specific tRNA methylation at the G37 position(Bjork et al., 1989; Li et al., 1999). Here, we provideevidence for a common evolutionary origin of theTrmD and SpoU methylase superfamilies that werepreviously considered unrelated. We show that theSpoU superfamily encompasses a greater diversity of(predicted) methylases than previously appreciatedand contains two previously undetected families thatare specific to the archaea and thermophilic bacteria.Additionally, we identify two previously unnoticedsuperfamilies of proteins from archaea and bacteria,respectively, that are predicted to be methylasesrelated to both the TrmD and SpoU superfamilies.These four superfamilies together define a newstructural scaffold for RNA methylase activity.

As part of a systematic survey of the enzymesinvolved in RNA metabolism (VA, EVK and LA,unpublished), we investigated the SpoU and TrmD

families by iterative searches of the Non-redundantdatabase (National Center for Biotechnology Informa-tion, NIH, Bethesda) using the PSI-BLAST program(Altschul et al., 1997). Such a search (profile inclusionexpectation (E) value threshold of 0.01) seeded with theE. coli SpoU sequence detected not only the obviousSpoU orthologs and previously described homologs(Koonin, 1996) from bacteria, archaea, and eukaryotes,but also several uncharacterized proteins from archaea(typified by MJ1385) and the thermophilic bacteriaAquifex and Thermotoga (typified by aq_054). Theseproteins showed significant E-values (10!3!10!7) onfirst detection and retained the conservation patterntypical of the SpoU proteins (Figure 1), suggesting thatthey were bona fide members of this superfamily.

Additionally, in these searches we detectedseveral uncharacterized archaeal proteins, typified byAF2226, and the TrmD methylases with marginallysignificant E-values (.06-.1). A reciprocal searchstarted with the AF2226 sequence recovered the TrmDmethylases and SpoU superfamily members withsimilar borderline E-values. Another confirmatory PSI-BLAST search initiated with the TrmD ortholog fromAquifex aeolicus (2983865) detected not only theAF2226-like archaeal proteins, but also members ofa highly conserved superfamily of predominantly bac-terial proteins, typified by YbeA from E. coli, withsignificant E-values (<0.001). The TrmD methylases,the SpoU methylases, and the newly detected AF2226-like, and YbeA-like proteins showed striking conserva-tion in the region predicted to be the SAM-binding loopin the SpoU superfamily (Koonin, 1996) (Figure 1).Furthermore, in the archaeal AF2226-like proteins, theputative methylase domain is fused with a recentlydescribed RNA-binding domain, THUMP, that has alsobeen found fused to classic, Rossmann-fold RNAmethylases, thiouridine synthases and pseudouridinesynthases (Aravind et al., 2001).

These observations suggested that the SpoUsuperfamily, the TrmD superfamily, and the newlydetected AF2226-like and YbeA-like superfamilies,belong to a class of methyltransferases with a commonstructural fold that does not include any structurallycharacterized proteins and is distinct from the classicRossmann fold. To investigate this relationship further,we searched the entire set of members of the SpoU,TrmD, YbeA and AF2226 superfamilies for potentialconserved motifs using the Gibbs sampling procedure(Neuwald et al., 1997). Two motifs were detected, onecorresponding to the predicted SAM-binding loop andthe other located directly C-terminal of it, with the*For correspondence. Email. [email protected]

J. Mol. Microbiol. Biotechnol. (2002) 4(1): 71–75. JMMB Communication

# 2002 Horizon Scientific Press

Page 2: SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at the G37 position (Bjork et al.,1989; Li et al., 1999). Here, we provide ... Current

• MALDI-TOF Mass Spectrometry in Microbiology

Edited by: M Kostrzewa, S Schubert (2016) www.caister.com/malditof

• Aspergillus and Penicillium in the Post-genomic Era

Edited by: RP Vries, IB Gelber, MR Andersen (2016) www.caister.com/aspergillus2

• The Bacteriocins: Current Knowledge and Future Prospects

Edited by: RL Dorit, SM Roy, MA Riley (2016) www.caister.com/bacteriocins

• Omics in Plant Disease Resistance

Edited by: V Bhadauria (2016) www.caister.com/opdr

• Acidophiles: Life in Extremely Acidic Environments

Edited by: R Quatrini, DB Johnson (2016) www.caister.com/acidophiles

• Climate Change and Microbial Ecology: Current Research and Future Trends

Edited by: J Marxsen (2016) www.caister.com/climate

• Biofilms in Bioremediation: Current Research and Emerging Technologies

Edited by: G Lear (2016) www.caister.com/biorem

• Microalgae: Current Research and Applications

Edited by: MN Tsaloglou (2016) www.caister.com/microalgae

• Gas Plasma Sterilization in Microbiology: Theory, Applications, Pitfalls and New Perspectives

Edited by: H Shintani, A Sakudo (2016) www.caister.com/gasplasma

• Virus Evolution: Current Research and Future Directions

Edited by: SC Weaver, M Denison, M Roossinck, et al. (2016) www.caister.com/virusevol

• Arboviruses: Molecular Biology, Evolution and Control

Edited by: N Vasilakis, DJ Gubler (2016) www.caister.com/arbo

• Shigella: Molecular and Cellular Biology

Edited by: WD Picking, WL Picking (2016) www.caister.com/shigella

• Aquatic Biofilms: Ecology, Water Quality and Wastewater Treatment

Edited by: AM Romaní, H Guasch, MD Balaguer (2016) www.caister.com/aquaticbiofilms

• Alphaviruses: Current Biology

Edited by: S Mahalingam, L Herrero, B Herring (2016) www.caister.com/alpha

• Thermophilic Microorganisms

Edited by: F Li (2015) www.caister.com/thermophile

• Flow Cytometry in Microbiology: Technology and Applications

Edited by: MG Wilkinson (2015) www.caister.com/flow

• Probiotics and Prebiotics: Current Research and Future Trends

Edited by: K Venema, AP Carmo (2015) www.caister.com/probiotics

• Epigenetics: Current Research and Emerging Trends

Edited by: BP Chadwick (2015) www.caister.com/epigenetics2015

• Corynebacterium glutamicum: From Systems Biology to Biotechnological Applications

Edited by: A Burkovski (2015) www.caister.com/cory2

• Advanced Vaccine Research Methods for the Decade of Vaccines

Edited by: F Bagnoli, R Rappuoli (2015) www.caister.com/vaccines

• Antifungals: From Genomics to Resistance and the Development of Novel Agents

Edited by: AT Coste, P Vandeputte (2015) www.caister.com/antifungals

• Bacteria-Plant Interactions: Advanced Research and Future Trends

Edited by: J Murillo, BA Vinatzer, RW Jackson, et al. (2015) www.caister.com/bacteria-plant

• Aeromonas

Edited by: J Graf (2015) www.caister.com/aeromonas

• Antibiotics: Current Innovations and Future Trends

Edited by: S Sánchez, AL Demain (2015) www.caister.com/antibiotics

• Leishmania: Current Biology and Control

Edited by: S Adak, R Datta (2015) www.caister.com/leish2

• Acanthamoeba: Biology and Pathogenesis (2nd edition)

Author: NA Khan (2015) www.caister.com/acanthamoeba2

• Microarrays: Current Technology, Innovations and Applications

Edited by: Z He (2014) www.caister.com/microarrays2

• Metagenomics of the Microbial Nitrogen Cycle: Theory, Methods and Applications

Edited by: D Marco (2014) www.caister.com/n2

Caister Academic Press is a leading academic publisher of advanced texts in microbiology, molecular biology and medical research. Full details of all our publications at caister.com

Further Reading

Order from caister.com/order

Page 3: SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at the G37 position (Bjork et al.,1989; Li et al., 1999). Here, we provide ... Current

72 Anantharaman et al.

Page 4: SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at the G37 position (Bjork et al.,1989; Li et al., 1999). Here, we provide ... Current

SPOUT 73

Page 5: SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at the G37 position (Bjork et al.,1989; Li et al., 1999). Here, we provide ... Current

probability of occurring by chance <10!14. A multiplealignment was constructed for each of these fourgroups using the T-Coffee program (Notredame et al.,2000) and the secondary structure was predictedseparately for each group using the JPRED program(Cuff et al., 1999). The strong correlation of thestructural elements in the individually predictedsecondary structure elements for these four groups(Figure 1) also supported the prediction of a commonstructural fold for all these proteins. Hereinafter werefer to this monophyletic assemblage of proteins asthe SPOUT (SpoU-TrmD) class.

An overall multiple alignment of the SPOUTproteins was constructed using the T-Coffee programand adjusted on the basis of the PSI-BLAST align-ments and individual secondary structure predictions(Figure 1). Sequence conservation concentrates in theC-terminal region of the SPOUT domain where thepredicted SAM-binding loop is located. This loop ispredicted to have a strand-loop-helix configuration thatresembles the SAM-binding loop in RFMs, but theSPOUT fold differs from the classic Rossmann fold inthat the conserved loop is located near the C-terminusrather than at the N-terminus of the methylase domain.The overall secondary structure prediction for theSPOUT domain reveals a roughly alternating a-helix,b-strand pattern (Figure 1). Consistent with this,sequence-structure threading performed using thehybrid fold prediction method (Fischer, 2000) gavegeneric, moderate scoring hits to several 3-layered a/bproteins (data not shown). Thus, the SPOUT domain ispredicted to have a 3-layered a/b fold with a centralsheet of 5-6 strands sandwiched between helices oneither side, with the SAM-binding site located in theC-terminal part of domain.

On the basis of sequence conservation, theSPOUT class of (predicted) methylases is divided intofour distinct superfamilies, namely SpoU, TrmD, YbeA,and AF2226 (Figure 1, Table 1). Of these, TrmD ispresent only in bacteria, whereas the AF2226 super-family is archaea-specific and all its members arefused to a N-terminal THUMP domain. Thus, byanalogy to TrmD that catalyzes bacteria-specific tRNAmethylation, it may be predicted that the AF2226-likeproteins catalyze an archaea-specific methylation.The YbeA superfamily includes highly conservedproteins that are present in a single copy in diversebacteria (with the exception of several lineages suchas Aquifex, cyanobacteria, actinomycetes and spiro-chaetes) and Arabidopsis thaliana.

The SpoU superfamily was divided into severalancient conserved families using clustering bysequence similarity with the BLASTCLUST program(I. Dondoshansky, YI Wolf, and EVK, unpublished)(Table 1). Each of these families has a distinct phyleticpattern (Table 1). The majority of typical SpoU-likemethylases belong to family 1, which consists ofseveral orthologous groups such as spoU/Trm3, YfiF,YjfH, and YsgA, (Cavaille et al., 1999; Koonin, 1996;Persson et al., 1997), most of which show bacterio-eukaryotic distr ibution, with a single archaealrepresentative in A. fulgidus. In contrast, family 2 ofSpoU-like methylases shows an archaeo-bacterialdistribution. Thus, unlike many RNA-modifying andprocessing enzymes that are either archaeo-eukar-yote-specific or bacterial-specific, the SpoU methy-lases show the opposite phyletic trend (Aravind et al.,1999; Makarova et al., 1999). The underlying evolu-tionary scenario might involve acquisition of Family 1SpoU methylases by eukaryotes from bacteria, whichcould have been accompanied by displacement ofSpoU family members (probably of family 2) that theeukaryotes inherited from their common ancestor withthe archaea. Several families of the SpoU superfamilyshow restricted phyletic distribution. Specifically,Family 3 is found only in the bacteria, Family 4 onlyin the archaea, and family 5 only in thermophilicbacteria (Table 1). It appears that members of thissuperfamily substantially contribute to the diversity oflineage-specific methylation patterns. The SpoU family4 is one of the most divergent families of this super-family and its representative from Thermoplasma isfused to a HD phosphoesterase domain (Aravind et al.,1998) that is highly conserved as a standalone proteinthroughout the archaea. Conceivably, the methylasesof this family function together with HD hydrolases(possibly, RNAses) (Aravind et al., 1998) in a novel,archaea-specific RNA processing pathway.

The most parsimonious explanation of thephyletic distribution and evolutionary affinities of theSPOUT domains is that the last common ancestor of alllife forms encoded a SpoU-like tRNA methylase.The TrmD, YbeA, and AF2226 superfamilies weresubsequently derived through independent duplica-tions; the former two in the bacterial and the latter in thearchaeo-eukaryotic lineages. The YbeA-like proteinfrom A. thaliana is particularly closely related to itsorthologs in Gram-positive bacteria, which suggests ahorizontal transfer from this bacterial lineage to plants.The SpoU superfamily members underwent further

Figure 1. Multiple alignment of the SPOUT methylases. The alignment was constructed by parsing high-scoring pairs from PSI-BLAST search resultsand realigning the sequences using T-Coffee (Notredame et al., 2000). The secondary structure shown above the alignment is derived from theoutput of the prediction program JPRED (Cuff et al., 1999). The 90% consensus shown below the alignment was derived using the following aminoacid classes: polar (p: KRHEDQNST); positive (+: HKR); hydrophobic (h: ALICVMYFW); the aliphatic subset of these are (l; ALIVMC); small(s: ACDGNPSTV) and tiny (u: GAS). The limits of the domains are indicated by the position numbers on each side of the alignment. The numberswithin the alignment are inserts that are not shown. The different families discussed in the text are indicated on the extreme right of the figure. Thesequences are denoted by their gene name followed by the species abbreviation and GenBank Identifier (gi). Species abbreviations: Ap – Aeropyrumpernix, Af – Archaeoglobus fulgidus, Mta – Methanothermobacter thermoautotrophicus, Mj – Methanococcus jannashii, Ph – Pyrococcus horikoshii,Hsp – Halobacterium species, Ta – Thermoplasma acidophilum, Aae – Aquifex aeolicus, Bs – Bacillus subtilis, Cj – Campylobacter jejuni, Ec –Escherichia coli, Hi – Haemophilus influenzae, Hp – Helicobacter pylori, Mlo – Mesorhizobium loti, Mt – Mycobacterium tuberculosis, Nm – Neisseriameningitides, Pa – Pseudomonas aeruginosa, Rp – Rickettsia prowazekii, Ssp – Synechocystis species, Tm – Thermotoga maritima, Tp –Treponema pallidum, Xf – Xylella fastidiosa, At – Arabidopsis thaliana, Ce – Caenorhabditis elegans, Dm – Drosophila melanogaster, Sc –Saccharomyces cerevisiae, Sp – Schizosaccharomyces pombe and Hs – Homo sapiens.

74 Anantharaman et al.

Page 6: SPOUT: a Class of Methyltransferases that Includes spoU ...bacteria- specific tRNA methylation at the G37 position (Bjork et al.,1989; Li et al., 1999). Here, we provide ... Current

independent duplication in archaea and bacteria,giving rise to lineage-specific methyltransferase fa-milies. The eukaryotes appear to have acquired theirSpoU family members horizontally from bacteria,which probably resulted in displacement of the ances-tral archaeo-eukaryotic version.

The unification of the TrmD and SpoU super-families into the SPOUT class points to a majorradiation of RNA-specific methyltransferases indepen-dent of the RFMs. The previously unnoticed SPOUTmethylase families are expected to contribute sub-stantially to the diversity of lineage-specific RNAmodifications that might be necessary for particularmicrobial adaptations, for example, thermophily. Theprediction of these novel methylases may help inexperimental studies that will advance our under-standing of these modifications.

References

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z.,Miller, W. and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: anew generation of protein database search programs. Nucleic AcidsRes. 25: 3389–3402.

Aravind, L. and Koonin, E.V. 1998. The HD domain defines a new super-family of metal-dependent phosphohydrolases. Trends Biochem Sci 23:469–472.

Aravind, L. and Koonin, E.V. 1999. Novel predicted RNA-bindingdomains associated with the translation machinery. J. Mol. Evol. 48:291–302.

Aravind, L. and Koonin, E.V. 2001. THUMP – a predicted RNA-bindingdomain shared by 4-thiouridine and pseudouridine synthases andRNA methylases. Trends in Biochem. Sci. 26: 215–217.

Bjork, G.R., Wikstrom, P.M. and Bystrom, A.S. 1989. Prevention oftranslational frameshifting by the modified nucleoside 1-methyl-guanosine. Science 244: 986–989.

Cavaille, J., Chetouani, F. and Bachellerie, J.P. 1999. The yeastSaccharomyces cerevisiae YDL112w ORF encodes the putative2’-O-ribose methyltransferase catalyzing the formation of Gm18 intRNAs. RNA 5: 66–81.

Cuff, J.A. and Barton, G.J. 1999. Evaluation and Improvement ofMultiple Sequence Methods for Protein Secondary StructurePrediction. Proteins 34: 508–519.

Fischer, D. 2000. Hybrid fold recognition: combining sequencederived properties with evolutionary information. Pac. Symp.Biocomput.119–130.

Koonin, E.V. 1996. Pseudouridine synthases: four families of enzymescontaining a putative uridine-binding motif also conserved indUTPases and dCTP deaminases. Nucleic Acids Res. 24:2411–2415.

Koonin, E.V. and Rudd, K.E. 1993. SpoU protein of Escherichia colibelongs to a new family of putative rRNA methylases. Nucleic AcidsRes. 21: 5519.

Li, J.N. and Bjork, G.R. 1999. Structural alterations of thetRNA(m1G37) methyltransferase from Salmonella typhimuriumaffect tRNA substrate specificity. Rna 5: 395–408.

Lo Conte, L., Ailey, B., Hubbard, T.J., Brenner, S.E., Murzin, A.G. andChothia, C. 2000. SCOP: a structural classification of proteinsdatabase. Nucleic Acids Res 28: 257–259.

Makarova, K.S., Aravind, L., Galperin, M.Y., Grishin, N.V., Tatusov,R.L., Wolf, Y.I. and Koonin, E.V. 1999. Comparative genomics of theArchaea (Euryarchaeota): evolution of conserved protein families,the stable core, and the variable shell. Genome Res 9: 608–628.

Neuwald, A.F., Liu, J.S., Lipman, D.J. and Lawrence, C.E. 1997.Extracting protein alignment models from the sequence database.Nucleic Acids Res. 25: 1665–1677.

Notredame, C., Higgins, D.G. and Heringa, J. 2000. T-Coffee: A novelmethod for fast and accurate multiple sequence alignment. J. Mol.Biol. 302: 205–217.

Persson, B.C., Jager, G. and Gustafsson, C. 1997. The spoU gene ofEscherichia coli, the fourth gene of the spoT operon, is essential fortRNA (Gm18) 20-O-methyltransferase activity. Nucleic Acids Res.25: 4093–4097.

Rozenski, J., Crain, P.F. and McCloskey, J.A. 1999. The RNAModification Database: 1999 update. Nucleic Acids Res 27:196–197.

Table 1. Sequence-based classification and phyletic distribution of SPOUT methyltransferases*

Bacteria Archaea Eukaryotes

SpoU superfamilyFamily 1 (Bacterial-Eukaryotic)

Orthologous sets

SpoU/Trm3p Bb,Ec,Mt,Dr,Aae – At(2),Ce,Hs,ScYjfH All (except Ct) Af At,Ce,Hs,Sc,SpYfiF Ec, Hi –YsgA Ct,Nm,Mlo(2),Ssp,Mt(2),Dr,Uu,Bs – At(2),Dm,Hs

Family 2 (Archaeo-Bacterial)LAST/MJ1476 Ec(2),Hi,Pa,Xf,Nm,Mlo,Ssp,Dr Alla

Family 3 (Bacterial)YibK Ct,Cj,Ec,Hi,Pa,Nm,Mlo,Ssp,Mt,Dr,Uu,Bs – At

Family 4 (Archaeal)MJ1385 – All

Ta (SPOUT+HD)–

Family 5 (Bacterial)aq_054 Aae,Tm – –

AF2226- like Archaeal superfamilyAF2226 – Af, Ph,Mj,Ape

Thump+SPOUT–

TrmD superfamilyTrmD All bacteria – –

YbeA superfamilyYbeA Cj,Hp,Ec,Hi,Pa,Xf,Nm,Mlo,Dr,Bs,Uu,Tm – At

*The species abbreviations are as in Figure 1.aPyrococcus sp. has a member but horikoshii and abysii do not.

SPOUT 75