1 Genomic and phylogenetic characterization of Brazilian Yellow ...

33
1 Genomic and phylogenetic characterization of Brazilian Yellow Fever virus strains 1 2 Marcio R. T, Nunes 1* , Gustavo Palacios 2#* , Jedson Ferreira Cardoso 1 , Livia C. Martins 3 , 3 Edivaldo C. Sousa Junior 1 , Clayton Pereira Silva de Lima 1 , Daniele B. A. Medeiros 3 , 4 Nazir Savji 2$ , Aaloki Desai 2 , Sueli G. Rodrigues 3 , Valeria L. Carvalho 3 , W. Ian Lipkin 2& 5 and Pedro F. C. Vasconcelos 3, 4& . 6 7 1 Center for Technological Innovation, Instituto Evandro Chagas, Ananindeua, Brazil. 8 2 Columbia University, New York, New York, USA 9 3 Departamento de Arbovirologia e Febres Hemorrágicas, Instituto Evandro Chagas, 10 Ananindeua, Brazil. 11 4 Universidade do Estado do Pará, Belém, Brazil. 12 # Current Address: United States Army Medical Institute for Infectious Diseases, Fort 13 Detrick, Maryland, USA 14 $ Current Address: New York University School of Medicine, New York, New York, 15 USA 16 *Both authors contributed equally 17 & Both senior authors contributed equally 18 19 Running Title: Brazilian Yellow Fever virus genomes 20 21 Key words: Yellow fever virus, full-length sequencing, genetic characterization, 22 secondary structure, phylogeny, phylogeography. 23 Copyright © 2012, American Society for Microbiology. All Rights Reserved. J. Virol. doi:10.1128/JVI.00565-12 JVI Accepts, published online ahead of print on 26 September 2012 on March 14, 2018 by guest http://jvi.asm.org/ Downloaded from

Transcript of 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

Page 1: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

1

Genomic and phylogenetic characterization of Brazilian Yellow Fever virus strains 1

2

Marcio R. T, Nunes1*, Gustavo Palacios2#*, Jedson Ferreira Cardoso1, Livia C. Martins3, 3

Edivaldo C. Sousa Junior1, Clayton Pereira Silva de Lima1, Daniele B. A. Medeiros3, 4

Nazir Savji2$, Aaloki Desai2, Sueli G. Rodrigues3, Valeria L. Carvalho3, W. Ian Lipkin2& 5

and Pedro F. C. Vasconcelos3, 4&. 6

7

1Center for Technological Innovation, Instituto Evandro Chagas, Ananindeua, Brazil. 8

2Columbia University, New York, New York, USA 9

3Departamento de Arbovirologia e Febres Hemorrágicas, Instituto Evandro Chagas, 10

Ananindeua, Brazil. 11

4Universidade do Estado do Pará, Belém, Brazil. 12

#Current Address: United States Army Medical Institute for Infectious Diseases, Fort 13

Detrick, Maryland, USA 14

$Current Address: New York University School of Medicine, New York, New York, 15

USA 16

*Both authors contributed equally 17

&Both senior authors contributed equally 18

19

Running Title: Brazilian Yellow Fever virus genomes 20

21

Key words: Yellow fever virus, full-length sequencing, genetic characterization, 22

secondary structure, phylogeny, phylogeography. 23

Copyright © 2012, American Society for Microbiology. All Rights Reserved.J. Virol. doi:10.1128/JVI.00565-12 JVI Accepts, published online ahead of print on 26 September 2012

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 2: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

2

24

Word count: 3244 25

26

Correspondence to: 27

Pedro F. C. Vasconcelos, MD, PhD 28

Instituto Evandro Chagas, Seção de Arbovirologia e Febres Hemorrágicas, 29

Rod. BR 316, Km 7, Levilandia, 67030-000, Ananindeua, Brazil 30

Tel: +55 91 3214-2271; Fax: +55 91 3214-2299 31

32

Marcio R.T. Nunes, PhD 33

Center for Technological Innovation, Instituto Evandro Chagas, Ananindeua, Brazil 34

Rod. BR 316, Km 7, Levilandia, 67030-000, Ananindeua, Brazil 35

Tel: +55 91 3214-2283; Fax: +55 91 3226-5262 36

[email protected] 37

38

39

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 3: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

3

Abstract 40

41

Globally, yellow fever virus infects nearly 200,000 people leading to 30,000 deaths 42

annually. Although the virus is endemic to Latin America, only a single genome from this 43

region has been sequenced. Here we report 12 Brazilian yellow fever virus complete 44

genomes, their genetic traits, phylogenetic characterization, and phylogeographic 45

dynamics. Variable 3’ non-coding region (NCR) patterns and specific mutations 46

throughout the open reading frame altered predicted secondary structures. Our findings 47

suggest that whereas the introduction of yellow fever virus in Brazil led to genotype I one 48

predominant dispersal throughout South and Central Americas, genotype II remained 49

confined to Bolivia, Peru and the western Brazilian Amazon. 50

51

52 on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 4: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

4

Introduction 53

Yellow fever is an infectious disease transmitted by Culicidae mosquitoes 54

carrying Yellow fever virus (YFV). This virus, the prototype species of the family 55

Flaviviridae, genus Flavivirus, has a positive-sense, single strand genome, composed of 56

approximately 11,000 nucleotides (nt), encoding ten viral proteins: three structural 57

proteins (Capsid-C; Pre-Membrane-PrM; and the Envelope-E), and seven non-structural 58

(NS) proteins (NS1-NS2A-NS2B-NS3-NS4a-NS4b-NS5-3’) (10). 59

YFV is endemic to tropical areas of Africa and South America where, according 60

to the World Health Organization (WHO), approximately 200,000 infections result in 61

30,000 deaths annually (31). YF is a zoonotic disease affecting non-human primates in 62

the forest canopy, the primary vertebrate hosts; humans are infected upon intrusion of the 63

eco-system. In Brazil, the virus is responsible for jungle Yellow fever outbreaks with 64

high case-fatality rates. It is transmitted to humans by mosquito bites of the genera 65

Haemagogus (primary) and Sabethes (secondary) in forested and surrounding areas (37). 66

The two main species vectors are Haemagogus janthinomys and Haemagogus 67

leucocelaenus (9,37,38). 68

Previous molecular epidemiology studies of YFV have provided 3’ non-coding 69

region (NCR) heterogeneity analysis and phylogenetic history and viral dispersal based 70

on partial E, NS5 and 3’ NCR sequences, suggesting two genotypes in the Americas: 71

genotype I, found in Brazil, Colombia, Ecuador, Venezuela, and the Caribbean, and 72

genotype II found in Bolivia, Peru, Brazil, and Ecuador (7,8,39). Although YFV is 73

considered a serious public health treat in Latin America, only a single Trinidadian strain 74

has been completely sequenced (2). 75

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 5: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

5

Here, we describe the full-length sequencing, genetic traits, phylogenetic analysis, 76

and phylogeographic dynamics of 12 Brazilian YFV isolates recovered between 1980 and 77

2002. 78

79

Material and methods 80

Virus strains. 81

The YFV strains analyzed in this study are listed in Table 1 and corresponded to 82

low-passage isolates obtained from the World Health Organization Collaborating Center 83

for Arbovirus Reference and Research at the Department of Arbovirology and 84

Hemorrhagic Fevers, Instituto Evandro Chagas, Brazilian Ministry of Health. 85

86

Viral culture, nuclease treatment and RNA extraction. 87

Vero cells were used for virus propagation. Virus stocks were clarified by 88

centrifugation at 3,000 rpm at 4º C, and treated with nucleases to minimize contaminants 89

Full-length genome sequencing 90

Complete genomes were obtained using high-throughput sequencing on the 91

Genome Sequencer FLX (Roche Life Sciences, Branford, CT, USA), as previously 92

discussed (12,23) alongside 5’ and 3’ RACE kits (Invitrogen, Calrsbad, CA, USA) using 93

specific internal primers YFV 5’SP1 (5’-AKCCCTGTCTTGGGTCCAGC-3’), 94

YFV5’SP2 (5’- TTGAACGCCTCTTGAAGGTC-3’), YFV5’SP3 95

(CTCTTGAAGGTCCAGGTCTA), YFV3’SP1 (5’-ATATCTGAATGGCAGCCATC-96

3’), YFV 3’SP2 (5’-TCACTGGCTGTTTCTTCTGCT-3’) targeting the termini. Products 97

were cloned and sequenced in both directions with ABI Prism BigDye Terminator 1.1 98

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 6: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

6

cycle sequencing kits on ABI Prism 3130 DNA analyzers (Applied Biosystems, Foster 99

City, CA, USA). 100

101

Genome assembly 102

Pyrosequencing reads were trimmed to remove specific adaptors and host 103

sequences using Newbler software (v.2.5.3; Data Processing Software Manual 454 Life 104

Science, http://www.454.com/), and BlastX (BLAST database and stand-alone sequence 105

comparison software v. 2.2.25), respectively. The 5`and 3` termini generated by Sanger 106

sequencing were assembled to the corresponding YFV ORF sequence using Geneious 5.4 107

(Biomatters, Auckland, New Zealand). 108

109

Genome characterization 110

Genomes obtained for the Brazilian YFV strains were compared with all full-111

length sequences publicly available including isolates from Africa, Trinidad & Tobago, 112

the 17DD vaccine substrain and vaccination-related strains (Table 1). Potential cleavage 113

sites for polyproteins were predicted using the proteolytic processing cascade pattern for 114

flavivirus ORFs developed by Rice & Strauss (33). Potential cleavage scores obtained by 115

SignalP (http://www.cbs.dtu.dk/services/) determined host signalase cleavage sites (27). 116

Glycosylation sites and cysteine residues were identified using NetNGlyc (v.1.0) 117

(http://www.cbs.dtu.dk/services/NetNGlyc/ ) and Geneious 5.4, respectively. Highly 118

conserved motifs and mutations were also identified. 119

120

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 7: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

7

Secondary and cyclization RNA structure prediction 121

Secondary and RNA cyclization structures involving the 5’ NCR, the first 200 nt 122

of the C protein, and 3’-NCR were assessed using Geneious and mfold, as previously 123

described (45). Termini patterns were selected according to lowest fold energy and 124

presence of conserved secondary structures, such as 3’ and 5’ stem looping (3`SL, 5`SL), 125

annealing of 5’ and 3’ cyclization regions (5’-3’ CYCs, 5’-3` upstream AUG region (5’-126

3’UAR) and the start codon at the 5’Cap Hairpin sequence (HS). 127

128

Three-dimensional protein modeling 129

Three-dimensional (3D) protein structures were rendered using the protein 130

homology method by initial search and template selection using the universal protein 131

database (templates) (http://www.rcsb.org/pdb) and applying the YFV envelope protein 132

sequence (query sequence) as the basic parameter. The template selected (Dengue virus 133

type 3, PDB: 1UZG) was used to construct the protein model with the Swiss model server 134

(1,34). The predicted 3D models were run through PROCHECK (20) and ANOLEA (26) 135

to evaluate stereochemistry parameters, ligation lengths, angle of ligation, peptidic 136

ligation, lateral rings alignment, and rotational angle for the main and lateral chains. The 137

final 3D structures were visualized with VMD software (17). 138

139

Similarity analysis 140

Similarity among the 29 YFV genomes was calculated with the plotting similarity 141

method in the Bootscan program in the Simplot v.3.5.1 142

(http://sray.med.som.jhmi.edu/SCRsoftware/simplot/). Phylogenetic trees were 143

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 8: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

8

constructed using the neighbor-joining method with bootstrap resampling 1000 replicates 144

using PHYLIP 3.69 (15). The Kimura 2-parameter was used as a nucleotide (nt) 145

substitution model. The phylogenetic permutation values (ppt) expressed on the y-axis 146

were calculated from bootstrap values using a 50% threshold. The X-axis represents the 147

YFV genome positions. Bootscanning was conducted in a slide window of 200nt with 148

20nt steps. The Trinidadian strain TVP 11767 (HM582851) was used as reference (2) 149

150

Phylogenetic and phylgeography analyses 151

Phylogenetic analysis was conducted for 20 complete YFV genomes retrieved from the 152

GenBank. From these analyses, vaccine and vaccine related strains were removed from 153

the dataset to avoid bias of geographic position. Before phylogenetic analysis, possible 154

recombination events were identified using both a genetic algorithm for recombination 155

detection (GARD) (19) and the Phi-test (6) available in SplitsTree v. 4 (18). Sequences 156

were aligned using the muscle algorithm implemented in Geneious 5.4. Maximum 157

likelihood (ML) phylogenies were constructed with the MrBayes-Multi v.3.1.2 (32) 158

applying the General Time Reversible nt substitution model (GTR+4Γ) and optimizing 159

across site rate variation. The parameters of a full probabilistic model, including timed 160

sequence evolution and spatial-temporal dispersal were estimated using a Bayesian 161

statistical inference approach implemented in BEAST (13,14,21). The Bayesian skyline 162

coalescent prior (36) was used to model fluctuations of the effective population size over 163

time. Bayesian phylogenetic inference was also applied GTR+4Γ to account for among-164

site rate variation. Isolation dates in years were available from Genbank and laboratory 165

records. The spatial-temporal dispersal of YFV between Africa and America was 166

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 9: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

9

reconstructed using the discrete model, and within the Americas using the continuous 167

probabilistic model of viral diffusion, both implemented in the BEAST package. For 168

each alignment, three Markov-chain Monte Carlo (MCMC) analyses were run for 50 169

million cycles sampling every 10,000th state until adequate mixing was achieved. 170

Bayesian phylogenetic analyses were computed using the BEAGLE library (5) to 171

augment the computational speed. After removing 10% of the burn-in, runs were 172

combined with LogCombiner (http://tree.bio.ed.ac.uk/software). Maximum clade 173

credibility (MCC) trees were summarized with Tree Annotator and visualized using Fig 174

Tree. The SPREAD (spatial phylogenetic reconstruction of evolutionary dynamics) 175

application (5) was used to visualize and convert the estimated divergence times and 176

spatial estimates annotated in the MCC trees to a keyhole markup language file (KML) 177

for visualization in the GoogleEarth virtual software (www.google.com/earth). All 178

evolutionary parameters are reported as posterior means along with the correspondent 179

95% Bayesian Credible intervals (95% BCI). 180

181

Results 182

Genome characterization: 183

The Brazilian YFV genomes ranged in size from 10,795 to 11,008 nt. The 184

predicted ORFs were 10,236 nt (3,411 aa) flanked by a conserved 5’ NCR 118 nt, and a 185

highly variable 3` termini ranging from 441 to 654 nt in length (Table 2). Genetic 186

divergence ranged from 2% for 5`NCRs to 12% for the M gene and 2K signal peptide, 187

with 10% divergence across the entire genome. Among South American strains, 4.8% of 188

genetic variability was observed , while between South American and African strains it 189

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 10: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

10

was 16% (Supplementary Table 1). By genetic similarity plot method, four distinct 190

groups were depicted (permutation values over 95% within groups and >75% between 191

groups). Groups I to IV are shown in Supplementary Figure 1, and unique mutations for 192

YFV South American strains in Supplementary Figure 2. 193

194

Potential cleavage sites, cysteine residues and glycosylation sites 195

Potential cleavage sites were predicted for each gene junction across the ORF of 196

each YFV strain. Highly conserved motifs representing potential sites for protease 197

cleavage activity were identified (Supplementary Table 2). Cysteine residues were 198

present throughout the viral proteins as follows: AncC: 0, M: 6, E:12, NS1: 12, NS2A: 3, 199

NS2B: 0, NS3: 10, NS4A/2k: 1, NS4B: 3, NS5:17. Glycosylation sites were found in 200

three viral proteins: M (residues M134, M150 and M266); E (residues E554, E594); and 201

NS1 (NS1-908, NS1-986). The number and position of cysteine residues and 202

glycosylation sites were conserved among all YFV strains. 203

204

Analysis of the 5’and 3’ends 205

Alignment of the 5’ and 3’ ends revealed 5’ NCR conservation and 3’ NCR 206

variability. 207

In the 5’ NCR, minor nt mutations were observed in comparison to other YFV 208

strains (A80G; C108T; T/C118A). Unique to BeH 413820 and BeH 422973 are nt 209

substitutions at position 117 (A→C) and 109 (T→A), respectively. 210

In the 3’ NCR, size heterogeneity was observed as previously described (8). Long 211

deletions were found compared to African and vaccine/vaccine derived strains hereafter 212

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 11: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

11

denominated South American deleted motifs YFVSADM1 that represent the YFV 213

repetition motifs RYF1 and RYF2 ,and specific sequences for Brazilian YFV strains 214

namely YFV conserved motifs YFVSACM1 and YFVSACM2 . Three YFV strains (Be 215

H6554117, BeH 622493 and BeH 422973) contained a unique motif (YFVSAUM). The 216

Brazilian strains BeH655417, BeH622493, BeH422973, BeH423602 do not have 217

YFVSADM2 motif in domain III within the 3’ SL (Table 3). 218

The absence of YFV repetition motifs RYF1 and RYF2 (YFVSADM1), and 219

presence of RYF3, conserved sequence 2 (CS2) , 3` upstream AUG region (UAR) and 3’ 220

cycling (CYC) sequences were observed (8,24). Furthermore, the pentanucleotides 221

3’CACAG observed in the 3’ SL of flaviviruses were absent fom strains with the 222

YFVSADM2 deletion. 223

Except for strain BeH 413820, the 3`CYC imperfect terminal (3’CYC imp) 224

sequence was identified in all Brazilian strains. Five strains (BeH 655417, BeH 622493, 225

BeH 422973, BeH 423602, and BeAR 378600) had the 3`CYC motif deleted within the 226

YFVSACM1 motif. In general, a single nt change (A→G) was observed, however for 227

strain BeH 394880 another mutation (T→G) was also observed. 228

In addition, five distinct 3’ NCR patterns (II to VI) were predicted in Brazilian 229

YFVs: 3’NCR/ deleted YFVSACM1 and YFVSACM2, represented by genotype II (II), 230

Long 3’NCR Uniq motif-plus(YFVSAUM) / deleted YFVSADM2 (III), Long 3’NCR / 231

deleted YFVSADM2 (IV), Long 3’NCR/ YFVSADM1 (V), and extra long 3’NCR 232

(YFVSADM1/ YFVSADM2) (VI). Pattern I correspond to the African strains exhibiting 233

the RYF1, RYF2, RYF3, CS2, and 3’CYC motifs (Figure 1). 234

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 12: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

12

In all patterns the 5’ secondary structure, represented by a long SL with a lateral 235

loop, secondary hairpin and start codon hairpin were conserved. The 3’ secondary 236

structure was not conserved in strains represented by patterns III and IV (Figure 1). 237

238

3D structure prediction of Yellow fever virus envelope protein 239

The analysis of the 12 YFV E protein sequences revealed point mutations 240

compared to both, ASIBI and 17D/Titan vaccine strains across the three E protein 241

domains (DI, DII and DIII). 242

In DI, amino acid changes were observed at positions E62 (N→S), E67 (H→N; 243

except for strain BeH 413820), E83 (A→E; excepted to the strain BeH 413820), E177 244

(K→R for strains BeH 422973, BeAR 513008, BeH 622205, BeH 622493, BeAR 245

646536 and BeH 655417), E191 (G→S), E198 (M→R; for strain BeH 463676), E207 246

(R→K, except to BeH 413820), E243 (R→K), E268 (T→A; strain BeH 413820, 247

tripeptide 270-272 (DNN→DSK or GSK for strain BeH 413820), and E282 (S→A; 248

strains BeH 394880 and BeH 413820). 249

For DII, amino acid substitutions were found at E120 (T→A; strain BeH 422973), 250

E147 (K→V, strain BeAR 646536), and E154 (T→A, strain BeH 413820). 251

The DIII of the YFV strains revealed mutations at positions E318 (V→A), E331 252

(K→R), E335 (I→M), E344 (I→V) and E360 (D→E, strain BeH 413820). 253

The DII fusion peptide (DRGWGNGCGLFGK), as well the DIII TGD motifs 254

were conserved in all YFV strains. Mutations across the E protein domains and 3D 255

structure are presented in the Supplementary Figure 2. 256

257

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 13: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

13

Phylogeographic analysis 258

The Bayesian MCC complete genome analysis demonstrated the presence of two 259

major groups (I and II) where group I represented East/Central African strains, while II 260

included Western Africa, Trinidadian and Brazilian strains. The time of emergence for 261

the most recent common ancestor (TMRCA) for the South American YFV strains was 262

estimated to be 332 years ago (95% Bayesian credibility interval: 210-519) from a 263

Western African lineage ancestor. The predicted substitution rates for the genomes were 264

1.12E-3 substitution/site/year (East African strains), 3.1E-4 substitution/site/year (West 265

African strains), 3.04 E-4 substitution/site/year (Brazilian strains), and 9.3 E-4 266

substitution/site/year (Trinidadian strain) (Figure 2a). 267

The phylogeographic model demonstrated the introduction of YFV from Africa to 268

Americas (Figure 2b), and the continuous model indicates that YFV dispersion in South 269

America is relatively recent. The virus was likely introduced in Northern/Northeast Brazil 270

before dispersing to other Brazilian regions (Western/Central and Southern/Southeast 271

regions) and Trinidad & Tobago at a dispersion speed of 24.95 Km/year (95% Bayesian 272

Credibility Interval (13.6-37.4) (Figures 2a, b and c). 273

274

Discussion 275

Despite Brazil being the largest country where YF is endemic with disease 276

emerging in local regions that were previously considered YF-free for almost 50 years 277

(8,9,38), no Brazilian YFV complete genomes have been reported (2,3,26,30,43). 278

Previous genomic, phylogenetic and evolutionary analyses were based on partial nt 279

sequences restricted to the prM/M– E and NS5/3’NCR junctions (7,8,11,22,39). 280

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 14: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

14

Although sequence analysis of YFV isolates collected from various hosts in distinct 281

geographic locations at different times, have yielded insights into 3´NCR heterogeneity 282

and evolutionary history (Vasconcelos et al., 2004; Bryant et al., 2005), full genome 283

sequencing reported here, has the potential to enhance diagnostics and provide a more 284

comprehensive view of the phylodynamics of YFV. 285

The 12 Brazilian wild-type YFV strains represented here showed high levels of 286

genetic divergence and size heterogeneity (ranging from 10,795 nt to 11,008 nt in length 287

(Table 2) as previously described (8,29). 288

Gene analysis revealed conserved motifs among all strains. Interestingly, many 289

genetic signatures were found in Brazilian YFV strains compared to African, Trinidadian 290

and vaccine-associated strains (Supplementary Figure 2). These unique motifs should be 291

useful for development of molecular probes and PCR derived methods (e.g Real time 292

PCR, Mass tag-PCR) capable of differentiating wild-type YFV cases from suspected 293

severe adverse events following vaccination mainly when epidemiological history is 294

poorly understood (e.g lack of vaccine shot information, period of vaccination, response 295

to vaccine). Potential cleavage sites for generation of YFV proteins, N-glycosylation sites 296

and cysteines were conserved in all Brazilian YFV genomes suggesting their functional 297

roles, are conserved (4). 298

Genetic variability among the Brazilian YFV strains was observed along the 299

entire genome (4.8%,); however greater divergence was observed in the 3’NCR region 300

(6%) as previously described suggesting high size heterogeneity in these regions (8,39). 301

Five distinct 3’NCR patterns were described (Figure 1), three of them (III, IV and 302

V) presented a long conserved motif (YFVSACM1), and two (III and IV) revealed a 37 303

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 15: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

15

nt deletion corresponding to the YFVSADM2 motif. Pattern VI presented an extra long 304

motif YFVSACM2. Despite the lack of 3’ CYC sequences in YFV strains belonging to 305

the 3’ NCR patterns III, IV and V, the RNA cyclization between 5’ and 3’ NCR termini 306

were made using the 3’ CYCimp. In case pattern IV strains where the 3’ CYC regular 307

motif is preserved, the imperfect 3’ cyclization sequence was not associated with 308

secondary structure. 309

The 37 nt deletion (YFVSADM2) appears to modify the secondary structure of 310

the 3’SL in pattern III and IV, including 3’ CACAG pentanucleotide. All flavivirus 311

genomes contain a 3’SL secondary structure constituted by the most downstream 100 nts 312

of the viral RNA genome and 3’ CACAG pentanucleotide at the top of 3’ SL structure. 313

These structures are required for virus replication and binding to both virus-coded and 314

cellular proteins (44), and modifications in 3’ SL or 3’ CACAG pentanucleotides may 315

result in viral attenuation (35). Further studies are needed to investigate all 3’motifs to 316

determine whether the YFVSADM2 deletion affects replication competency. 317

Mutations observed in the amino acid positions N62S, G191S, R243K, the 318

tripeptide 270-272, S282A, V318A, I335M, compared to the Asibi strain are genetic 319

signatures of strains circulating in the Americas (41). Although the H67N, A83E, T154A, 320

K177R, and R207K mutations are not considered genetic signatures, they are commonly 321

found in American strains. Furthermore, the K331R mutation was observed in the 322

Brazilian YFV strains and is suggested to be associated with viscerotropism in hamsters 323

(24,25), while the D360E substitution is associated with virulence rescue in attenuated 324

strains (40). The role of other mutations (T120A, A147V, M198R and T268A) remains 325

uncharacterized. 326

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 16: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

16

Genetic analyses of variability among complete YFV genomes, and phylogenetic 327

analysis support previous observations regarding the existence of distinct genetic groups 328

in Africa and in the Americas (11,43), and the existence of at least two distinct genetic 329

lineages or genotypes (I and II) co-circulating in Brazil. The complete genome 330

phylogenetic analysis indicates that the Brazilian strains are closely related to Western 331

African strains suggesting an African origin (Figure 2a) approximately 332 years ago 332

(95% HPD 210-519) in the year of 1677. This time is compatible with previous 333

publications based on E gene analyzes, and also is in accordance with the first report of a 334

Yellow fever outbreak in Brazil in 1685, when the virus was possibly introduced in 335

Recife, capital of Pernambuco state, Northeast Brazil during slave transportation from 336

Sao Tome, Africa to Brazil, with one stop in Santo Domingo, Antilleans, where the 337

disease was devastating the population (16). 338

The phylogeographic models (discrete and continuous) confirmed the 339

introduction of the YFV in Brazil from West Africa, and suggest that American 340

dispersion is relatively recent (Figure 2a). Previous geographical dispersion analysis was 341

performed using partial sequences (2), while the current results, using complete genomes, 342

suggest that YFV entered the Americas in Northern/Northeast Brazil before dispersing to 343

other regions in Brazil and Trinidad at 24.95 Km/year (95% Bayesian Credible Interval: 344

13.6, 37.47) with a mutation rate of 3.04E-4 (1.924E-4, 4.24E-4) (Figure 2c, d, e), 345

similar to West African strains and approximately 10 times lesser than Easter African 346

strains ( 1.12 x 10E3). It likely became endemic in areas where the virus could be 347

maintained among vertebrate and invertebrate reservoirs, such as the Brazilian Amazon 348

and Central-western region of Brazil (9). These rates (dispersion and mutation rates) are 349

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 17: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

17

within the confidence interval for rates estimated for the envelope and NS5 genes of YFV 350

and other flaviviruses (Briant et al., 2005; Gount et al., 2001; Gould et al., 2001) and 351

correspond to the first estimations using entire genomes of YFV 352

In conclusion, this study supports the existence of previously reported genotypes 353

circulating in Brazil (genotypes I and II), and provides informative data regarding the 354

virus’s African origin and dispersion throughout the Americas. Further studies are needed 355

to complete genomes from other endemic countries to elucidate the dynamics of 356

geographic dispersion of YFV throughout the world. The availability of, complete 357

genomic YFV sequence may facilitate further phenotypic evaluation of viral differences 358

particularly the potential significance of variable regions in the C, NS5 and 3’NCR by 359

using infectious clones. It may also enable improvements in YFV diagnostics. 360

361

Acknowledgements 362

We thank Keley Nascimento Barbosa Nunes (Center for Technological 363

Innovation) for the technical assistance with the GS FLX 454 and Dr. Ana Cecilia 364

Ribeiro Cruz (Department of Arbovirus and Hemorrhagic Fevers) for sequencing part of 365

the envelope gene of the Brazilian YFV strains. We also thank Nuno Rodrigues Faria 366

(Department of Microbiology and Immunology, Katholieke Universiteit Leuven, Leuven, 367

Belgium) for his helpful discussions about phylogeography. 368

369

Funding: This study was supported by CNPq grants (302987/2008-8 and 301641/2010-370

2); CNPq/CAPES/FAPESPA (573739/2008-0); awards from the National Institutes of 371

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 18: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

18

Health (AI57158, NBC-Lipkin), USAID Predict, and the Defense Threat Reduction 372

Agency. 373

374

Conflicts of Interest: Authors declared no conflict of interest 375

376

References 377

1. Arnold K, Bordoli L, Kopp J, Schwede T. 2006. The SWISS-MODEL workspace: a 378 web-based environment for protein structure homology modelling. Bioinformatics. 22: 379 195-201. 380 381 2. Auguste AJ, Lemey P, Pybus OG, Suchard MA, Salas RA, Adesiyun AA, Barrett 382 AD, Tesh RB, Weaver SC, Carrington CV. 2010. Yellow fever virus maintenance in 383 Trinidad and its dispersal throughout the Americas. J. Virol. 84: 9967-9977 384 385 3. Bae HG, Drosten C, Emmerich P, Colebunders R, Hantson P, Pest S, Parent M, 386 Schmitz H, Warnat MA, Niedrig M. 2005. Analysis of two imported cases of yellow 387 fever infection from Ivory Coast and The Gambia to Germany and Belgium. J. Clin. 388 Virol. 33: 274-280. 389 390 4. Beasley DW, Whiteman MC, Zhang S, Huang CY, Schneider BS, Smith DR, 391 Gromowski GD, Higgs S, Kinney RM, Barrett AD. 2005. Envelope protein 392 glycosylation status influences mouse neuroinvasion phenotype of genetic lineage 1 West 393 Nile virus strains. J. Virol. 79: 8339-8847. 394 395 5. Bielejec F, Rambaut A, Suchard MA, Lemey P. 2011. SPREAD: spatial 396 phylogenetic reconstruction of evolutionary dynamics. Bioinformatics. 15: 2910-2912. 397 398 6. Bruen TC, Philippe H, Bryant D. 2006. A simple and robust statistical test for 399 detecting the presence of recombination. Genetics. 172: 2665-2681. 400 401 7. Bryant JE, Holmes EC, Barrett AD. 2007. Out of Afric: a molecular perspective on 402 the introduction of yellow fever virus into the Americas. PLoS Pathog. 18;3(5):e75. 403 404 8. Bryant JE, Vasconcelos PF, Rijnbrand RC, Mutebi JP, Higgs S, Barrett AD. 405 2005. Size heterogeneity in the 3' noncoding region of South American isolates of yellow 406 fever virus. J. Virol. 79:3807-3821. 407 408 9. Cardoso JC, Almeida MA, Santos E, Fonseca DF, Sallum MA, Noll CA, Monteiro 409 HA, Cruz AC, Carvalho VL, Pinto EV, Castro FC, Nunes-Neto JP, Segura MN, 410

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 19: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

19

Vasconcelos PF. 2010. Yellow fever virus in Haemagogus leucocelaenus and Aedes 411 serratus mosquitoes, southern Brazil, 2008. Emerg. Infect. Dis. 16:1918-24. 412 413 10. Chambers TJ, Hahn CS, Galler R, Rice CM. 1990. Flavivirus genome 414 organization, expression, and replication. Ann. Rev. Microbiol. 44: 649-88. 415 416 11. Chang G-JJ, Cropp CB, Kinney RM, Trent DW, Gubler DJ. 1995. Nucleotide 417 sequence variation of the envelope protein gene identifies two distinct genotypes of 418 yellow fever virus. J. Virol. 69: 5773-5780. 419 420 12. Darling AC, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of 421 conserved genomic sequence with rearrangements. Genome Res. 14: 394-403. 422 423 13. Drummond AJ, Rambaut A, Shapiro B, Pybus OG. 2005. Bayesian coalescent 424 inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 22: 425 1185-1192. 426 427 14. Drummond AJ, Rambaut A. 2007. BEAST: Bayesian evolutionary analysis by 428 sampling trees. BMC Evolutionary Biol. 7:214. 429 430 15. Felsenstein J. 1989. PHYLIP - Phylogeny Interference Package (version 3.2). 431 Cladistics. 5: 164-166. 432 433 16. Franco O. 1969. [History of the Yellow fever in Brazil]. Rev. Bras. Malariol. D. 434 Trop. 21: 315-512. 435 436 17. Humphrey W, Dalke A, Schulten K. 1996. VMD: visual molecular dynamics. J. 437 Mol. Graphics. 14: 33-38. 438 439 18. Huson DH, Bryant D. 2006. Application of phylogenetic networks in evolutionary 440 studies. Mol Biol Evol. 23: 254-267. 441 442 19. Kosakovsky W, Pond SL, Posada D, Gravenor MB, Woelk CH, Frost SD. 2006. 443 Automated phylogenetic detection of recombination using a genetic algorithm. Mol. Biol. 444 Evol. 10: 1891-1901. 445 446 20. Laskowiski R, Macarthur M, Moss D, Thornton J. 1993. PROCHECK: a program 447 to check the stereochemical quality of protein structures. J. Applied. Crystallog. 26: 283-448 291. 449 450 21. Lemey P, Rambaut A, Drummond AJ, Suchard MA. 2009. Bayesian 451 phylogeography finds its roots. PLoS Computat. Biol. 5(9), e1000520. 452 453 22. Lepiniec L, Dalgarno L, Huong VT, Monath TP, Digoutte JP, Deubel V. 1994. 454 Geographic distribution and evolution of yellow fever viruses based on direct sequencing 455 of genomic cDNA fragments. J. Gen. Virol. 75: 417-423. 456

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 20: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

20

457 23. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, 458 Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, 459 Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie 460 TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, 461 Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, 462 Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons 463 JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, 464 Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM. 2006. Genome 465 sequencing in microfabricated high-density picolitre reactors. Nature. 437: 376-380 466 467 24. McArthur MA, Suderman MT, Mutebi J-P, Xiao SY, Barrett AD. 2003. 468 Molecular Characterization of a Hamster Viscerotropic Strain of Yellow Fever Virus. J. 469 Virol. 77: 1462–1468. 470 471 25. McArthur MA, Xiao SY, Barrett AD. 2005. Phenotypic and molecular 472 characterization of a non-lethal, hamster-viscerotropic strain of yellow fever virus. Virus 473 Res. 110: 65-71. 474 475 26. McElroy KL, Tsetsarkin KA, Vanlandingham DL, Higgs S. 2005. 476 Characterization of an infectious clone of the wild-type yellow fever virus Asibi strain 477 that is able to infect and disseminate in mosquitoes. J. Gen. Virol. 86: 1747-1751. 478 479 27. Medeiros DB, Nunes MR, Vasconcelos PF, Chang GJ, Kuno G. 2007. Complete 480 genome characterization of Rocio virus (Flavivirus: Flaviviridae), a Brazilian flavivirus 481 isolated from a fatal case of encephalitis during an epidemic in Sao Paulo state. J. Gen. 482 Virol. 88:2237-2246. 483 484 28. Melo F, Feytmans E. 1998. Assessing protein structures with a non-local atomic 485 interaction energy. J. Mol. Biol. 277: 1141-1152. 486 487 29. Mutebi JP, Wang H, Li L, Bryant JE, Barrett AD. 2001. Phylogenetic and 488 evolutionary relationships among yellow fever virus isolates in Africa. J. Virol. 75: 489 6999-7008. 490 491 30. Pisano MR, Mercier V, Deubel V, Tolou H. 1999. Complete nucleotide sequence 492 and phylogeny of an American strain of yellow fever virus, TRINID79A. Arch. Virol. 493 9:1837-1843. 494 495 31. Robertson SE, Hull BP, Tomori O, Bele O, LeDuc JW, Esteves K. 2003. Yellow 496 fever: a decade of reemergence. JAMA. 14: 1157-62. 497 498 32. Ronquist F, Huelsenbeck JP. 2003. MrBayes 3: Bayesian phylogenetic inference 499 under mixed models. Bioinformatics. 19: 1572-1574. 500 501

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 21: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

21

33. Rice C, Strauss J. 1990. Production of flavivirus polypeptides by proteolytic 502 provessing. Semin. Virol. 1: 357-367. 503 504 34. Schwede T, Kopp J, Guex N, Peitsch MC. 2003. SWISS-MODEL: An automated 505 protein homology-modeling server. Nucleic Acids Res. 31: 3381-3385. 506 507 35. Silva PA, Molenkamp R, Dalebout TJ, Charlier N, Neyts JH, Spaan WJ, 508 Bredenbeek PJ. 2007. Conservation of the pentanucleotide motif at the top of the yellow 509 fever virus 17D 3' stem-loop structure is not required for replication. J. Gen. Virol. 88: 510 1738-1747. 511 512 36. Suchard MA, Rambaut A. 2009. Many-core algorithms for statistical phylogenetics. 513 Bioinformatics. 25: 1370-1376. 514 515 37. Vasconcelos PF. 2003. [Yellow Fever]. Rev. Soc. Bras. Med. Trop. 36: 275-293. 516 517 38. Vasconcelos PF. 2010. Yellow fever in Brazil: thoughts and hypotheses on the 518 emergence in previously free areas. Rev. Saude Publ. 44: 1144-1149. 519 520 39. Vasconcelos PF, Bryant JE, Travassos da Rosa APA, Tesh RB, Rodrigues SG, 521 Barrett AD. 2004. Genetic divergence and dispersal of yellow fever virus, Brazil. 522 Emerg. Infect. Dis. 10: 1578-1584. 523 524 40. Vlaycheva LA, Chambers TJ. 2002. Neuroblastoma cell-adapted yellow fever 17D 525 virus: characterization of a viral variant associated with persistent infection and decreased 526 virus spread. J. Virol. 76: 6172-6184. 527 528 41. Wang E, Ryman KD, Jennings AD, Wood DJ, Taffs F, Minor PD, Sanders PG, 529 Barrett AD. 1995. Comparison of the genomes of the wild-type French viscerotropic 530 strain of yellow fever virus with its vaccine derivative French neurotropic vaccine. J. 531 Gen. Virol. 76: 2749-2755. 532 533 42. Wang H, Jennings AD, Ryman KD, Late CM, Wang E, Ni H, Minor PD, Barrett 534 AD. 1997. Genetic variation among strains of wild-type yellow fever virus from Senegal. 535 J. Gen. Virol. 78: 1349-1355. 536 537 43. Whittembury A, Ramirez G, Hernández H, Ropero AM, Waterman S, Ticona 538 M, Brinton M, Uchuya J, Gershman M, Toledo W, Staples E, Campos C, Martínez 539 M, Chang GJ, Cabezas C, Lanciotti R, Zaki S, Montgomery JM, Monath TP, Hayes 540 E. 2009.Viscerotropic disease following yellow fever vaccination in Peru. Vaccine. 27: 541 5974-5981. 542 543 44. Yu L, Markoff L. 2005. The topology of bulges in the long stem of the flavivirus 3' 544 stem-loop is a major determinant of RNA replication competence. J. Virol. 79: 2309-545 2324. 546 547

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 22: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

22

45. Zuker M, Mathews D, Turner D. 1999. Algorithms and thermodynamics for RNA 548 secondary structure prediction: a practical guide. In: Barciszewski J, Clark B, editors. 549 RNA Biochemistry and Biotechnology. Amsterdam: Kluwer Academic Publishers; p. 11-550 43. 551 552 553 554

555

556

557

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 23: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

23

Legends of figures: 558

Figure 1 – (a): Brazilian YFV strains according to the genotype and cyclization patterns. 559

(b): Schematic representation of the five distinct patterns found for 3’ NCR of twelve 560

YFV Brazilian isolates. (II): 3’NCR/ deleted YFVSACM1 and YFVSACM2; (III) Long 561

3’NCR Uniq motif-plus (YFVSAUM) / deleted YFVSADM2; (IV) Long 3’NCR / 562

deleted YFVSADM2; (V): Long 3’NCR/ YFVSADM1; and (VI) extra long 3’NCR 563

(YFVSACM1/ YFVSACM2). Pattern I corresponds to African strains exhibiting the 564

RYF1, RYF2, RYF3, CS2, and 3’CYC motifs (I-a; West African strains) or lack of RYF2 565

repetition sequence(I-b; East and Central African strains); (c) Predicted secondary 566

structures generated by the m-fold program for the five distinct patterns (II to VI) 567

described for YFV strains isolated in Brazil. Conserved structures are indicated by 568

arrows, brackets, colors or boxes. Abbreviations: CS, conserved sequences; RCS, 569

repeated conserved sequence; SmLs, small loop; LSH, long stable hairpin; 3’-CYC, 570

cycling sequence within CS1 of the 3’-NCR; 5’-CYC, cycling sequence within capsid 571

(Cap) gene; HS, hairpin sequence; UAR, upstream AUG region, and NCR, non-coding 572

region. 573

574

Figure 2 – a) Amino acid substitutions observed across the entire E protein and it 575

domains (I, II and III). E protein domains and sub domains are limited by traced-arrowed 576

lines. Epitopes within the red boxes represents neutralizing epitopes. Fusion peptide is 577

represented within the green box. Virulence related epitopes are inside of light-blue 578

boxes. TGD conserved motif is highlighted in grey. B) 3D structure predicted for the 579

YFV E protein. (B1) The protein domains I, II and III are showed inside Yellow, green 580

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 24: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

24

and blue circles, respectively. Neutralyzing epitopes and fusion peptide are represented in 581

the 3D structure. (Bi) Comparison between ASIBI and YFV 25 BeH 413820 showing 582

conformational changes due to amino acid substitution (K331R) at the Domain III; 583

Conformational changes between African, Trinidad and vaccine strains (Bii.1) and YFV 584

25 BeH 413820 strain (Bii.2). Note the hydrogen bound between the amino acids N 585

(epitope E359) and D (epitope E360) that are related to virulence. The presence of 586

hydrogen bound is associated to viral persistence. The number 4.81 corresponds to the 587

distance in Angstroms. 588

589

Figure 3- (a) Bayesian Maximum Clade Credibility tree demonstrating the phylogenetic 590

relationships among the YFV complete genomes. Major groups are indicated (I and II). 591

Sub groups according to geographic location (West Africa, East Africa, Trinidad, and 592

Brazil) are colored. Numbers under the bar are representing the timing scaled for the most 593

recent common ancestor. Node ages and substitution rates (s/s/year) are placed over each 594

main node in the tree. Numbers within parenthesis and brackets represents the 95% HPD 595

Bayesian posterior probabilities and dispersion rates expressed in percentage, 596

respectively. (b) Spatiotemporal diffusion of YFV throughout Africa to Americas (c). 597

Temporarily framed snapshots of the dispersal patterns and respective 95% Bayesian 598

Credibility Regions (BCR) are indicated (light green areas) for the years 1940 (d), 1970 599

(e), and 2000 (f). 600

601

602

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 25: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

25

Table 1 – Yellow fever virus strains isolated in Brazil, Trinidad, Africa, vaccine and 603 Vaccine-associated adverse event cases used for genetic characterization and 604 phylogeography analyses. 605 606

Strain Source of isolation

Passage History*

Date of isolation Place of isolation

Coordinates GenBank

BeAR378600 Haemagogus sp. #1 1980 Uruaçu, GO -14.52 -49.15 JF912179 BeH394880 human #1 1981 C. do Araguaia-PA -8.25 -49.26 JF912180 BeH413820 human #1 1983 Porto Velho, RO -8.76 -63.9 JF912181 BeH422973 human #1 1984 Monte Alegre, PA -1.99 -54.08 JF912182 BeH423602 human #1 1984 São Domingos do Capim,

PA -1.67 -47.76 JF912183

BeH463676 human #1 1987 Breves, PA -1.68 -50.48 JF912184 BeAR513008 Sabethes sp #1 1992 Sidrolândia, MS -20.93 -54.93 JF912185 BeH526722 human #1 1994 Arinos, MG -15.9 -46.1 JF912186 BeH622205 human #1 2000 Goiás, GO -15.92 -53.13 JF912187 BeH622493 human #1 2000 Alto Paraíso, GO -14.27 -47.5 JF912188 BeAR646536 Hg.

leucocelaenus #1 2001 S.A. Missões, RS -28.51 -52.23 JF912189

BeH655417 human #1 2002 Alto Alegre, RR 2.89 -61.49 JF912190 TVP11767 Alouatta

seniculus - 2009 Trinidad_and_Tobago 10.69 -61.22 HM582851

Angola 71 Human - 1971 Angola (Central/East Africa) -11.2 17.87 AY968064 Uganda 48 Human - 1948 Uganda(Central/East

Africa) 1.37 32.29 AY968065

Couma-Ethiopia 61

Human - 1961 Couma_Ethiopia (Central/East Africa)

9.14 40.48 DQ235229

Ivory_Coast82 Human - 1982 Ivory_Coast (West Africa) 7.53 -5.54 YFU54798 Asibi Human - 1927 Ghana(West Africa) 7.94 -1.02 AY640589 Gambia 01 Human - 2001 Gambia (West Africa) 13.44 -15.31 AY572535 Ivory_Coast99 Human - 1999 Ivory_Coast (West Africa) 7.53 -5.54 AY603338 French viscerotropic

Human - - - - YFV21056

17D Vaccine strain - - - - X03700 YFV-AVD2791-93

Vaccine strain - - - - DQ118157

17-D 213 Vaccine strain - - - - 17D-Titan Vaccine strain - - - - F1654700 17DD Vaccine strain - - - - DQ100292 French viscerotropic

Vaccine strain - - - - YFV21056

French neurotropic

Vaccine strain - - - - YFV 21055

YFV case 1 Vaccine adverse event

- - Peru - GQ379162

YFV case 2 Vaccine adverse event

- - Peru - GQ379163

607 GO: Goias state; PA: Para state; RO: Rondonia state; RR: Roraima state; MG: Minas 608 Gerais state; MS: Mato Grosso do Sul state; RS: Rio Grande do Sul state.Ar: arthropod; 609 H: human; (-) data not provided. * Virus passage performed in VERO cells. 610

611

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 26: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

26

Table 2 – Genome size of YFV Brazilian strains including the 5`NCR, ORF and 3`NCR. 612 613 614 Strain 5NCR ORF 3NCR Total BeAr 378600 118 10.236 535 10.889 Be H 394880 118 10.236 654 11.008 BEH 413820 118 10.236 441 10.795 BeH 422973 118 10.236 506 10.860 Be H423602 118 10.236 479 10.852 BeH 463676 118 10.236 650 11.004 BeAr 513008 118 10.236 654 11.008 BeH 526722 118 10.236 650 11.004 BeH 622205 118 10.236 654 11.008 BeH622493 118 10.236 463 10.817 BeAR 646536 118 10.236 654 11.008 BeH655417 118 10.236 505 10.859

Ar: arthropod; H: human 615 616

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 27: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

27

Table 3 – 3’ Non coding region motifs observed in South American YFV strains 617 618

3'NCR Motif Sequence (5’→3’) YFVSADM1

GGGATACAAACCACGGGTGGAGAACCGGACTCCCCACAACCTGAAACCGGGATAT AAACCACGGCTG

YFVSACM1

TGTCGGCCCAGAAACCCGCGTGGGTTCTGCCACTGCTAAGCTGTGAGGCAGTGCAGGCTGGGA TAGCCGCCTTCCATGTTGCGAAAAACATGGTTTCTGAGACCTCCC

YFVSACM2

A[G]CCTCAGAGTAAAACCAAATGGAGCCTCCGCTACCACCTCCCCACGTGGTGGTAGAAAGACGGGGTCTAGAGGTTAGAGGAGACCCTCCAGGGAACAAATAGTGGGACCATATTG

YFVSAUM ACCTCAGAGTAAAACCAAATGGAGCCTCCGCTACCACCTCCCCACGTGG

619 620 621

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 28: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

3’

I5’

IIIIBEH655417IIIIBEH413820

Cyclization PatternGenotypeStrains

IIIIBEH655417IIIIBEH413820

Cyclization PatternGenotypeStrains(a)(I-a)(I-b)

Figure 1

II3’

5’

III 3’

5’VIBEAR378600IVIBEH423602IIIIBEH422973IIIIBEH622493IIIIBEH655417

VIBEAR378600IVIBEH423602IIIIBEH422973IIIIBEH622493IIIIBEH655417

IV3’5’

V 5’

VIIBEH394880VIIBEH463676VIIBEH526722VIIBEAR646536VIIBEH622205

VIIBEH394880VIIBEH463676VIIBEH526722VIIBEAR646536VIIBEH622205

VI 5’

YFVSACM1 YFVSACM2RYF1 RYF2 RYF3 CS23’UAR

3’

VIIBEAR513008VIIBEH394880VIIBEAR513008VIIBEH394880

3’CYC

3’CACAG

3’CYC imp

YFVSAUM

YFVSADM2YFVSADM1

(b)

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 29: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

IIIII III

(c)

dG= -225.15II dG= -199.97

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 30: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

IV

dG= -221.40

V (c) VI

dG= -246.99 dG= -327.20

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 31: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

Figure 2

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 32: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

I(449-1167)[1,12E-4]

Figure 3

(1088-2854)

(466-939)

(210-448)

[2,26E-4]

[3,1E-4]

Western African strains

II(210-519)[3,04 E-4] [9,93 E-4]

Western African strains

Eastern African strains

Brazilian strains

T i id di t iTrinidadian strain

200917591509125912091209759509

(a)

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from

Page 33: 1 Genomic and phylogenetic characterization of Brazilian Yellow ...

(c)(b)

(d) (e)

on March 14, 2018 by guest

http://jvi.asm.org/

Dow

nloaded from