Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic...

46
A peer-reviewed version of this preprint was published in PeerJ on 13 August 2019. View the peer-reviewed version (peerj.com/articles/7314), which is the preferred citable publication unless you specifically need to cite this preprint. Amorim A, Fernandes T, Taveira N. 2019. Mitochondrial DNA in human identification: a review. PeerJ 7:e7314 https://doi.org/10.7717/peerj.7314

Transcript of Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic...

Page 1: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

A peer-reviewed version of this preprint was published in PeerJ on 13August 2019.

View the peer-reviewed version (peerj.com/articles/7314), which is thepreferred citable publication unless you specifically need to cite this preprint.

Amorim A, Fernandes T, Taveira N. 2019. Mitochondrial DNA in humanidentification: a review. PeerJ 7:e7314 https://doi.org/10.7717/peerj.7314

Page 2: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Mitochondrial DNA in human identification: a review

António Amorim 1 , Teresa Fernandes 2 , Nuno Taveira Corresp. 3, 4

1 Instituto Nacional de Medicina Legal e Ciências Forenses, Lisbon, Portugal2 Biologia, Universidade de Évora, Évora, Portugal3 Instituto Universitário Egas Moniz (IUEM), Almada, Portugal4 Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa, Lisbon, Portugal

Corresponding Author: Nuno TaveiraEmail address: [email protected]

Mitochondrial DNA (mtDNA) presents several characteristics useful for forensic studies,

especially related to the lack of recombination, to a high copy number, and to matrilineal

inheritance. mtDNA typing based on sequences of the control region or full genomic

sequences analysis is used to analyze a variety of forensicsamples such as old bones,

teeth and hair, as well as other biological samples where the DNA content is low.

Evaluation and reporting of the results requires careful consideration of biological issues as

well as other issues such as nomenclature and reference population databases. In this

work we review mitochondrial DNA profiling methods used for human identification and

present their use in the main cases of human identification focusing on the most relevant

issues for the forensic and medico-legal areas.

PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.27500v1 | CC BY 4.0 Open Access | rec: 24 Jan 2019, publ: 24 Jan 2019

Page 3: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

1 Mitochondrial DNA in Human Identification: a review

2 Authors

3 António Amorim1, Teresa Fernandes2, Nuno Taveira3,4*

4

5 Affiliations

6 1Instituto Nacional de Medicina Legal e Ciências Forenses, Lisboa, Portugal

7 2Universidade de Évora, Évora, Portugal

8 3Instituto Universitário Egas Moniz (IUEM), Almada, Portugal

9 4Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de

10 Lisboa, Portugal

11

12 * Corresponding author:

13 Nuno Taveira

14 Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa,

15 Av. Prof. Gama Pinto, 1649-003 Lisbon, Portugal

16 [email protected]

17

18

Page 4: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

19

20 Abstract

21 Mitochondrial DNA (mtDNA) presents several characteristics useful for forensic studies,

22 especially related to the lack of recombination, to a high copy number, and to matrilineal

23 inheritance. mtDNA typing based on sequences of the control region or full genomic sequences

24 analysis is used to analyze a variety of forensic samples such as old bones, teeth and hair, as well

25 as other biological samples where the DNA content is low. Evaluation and reporting of the

26 results requires careful consideration of biological issues as well as other issues such as

27 nomenclature and reference population databases. In this work we review mitochondrial DNA

28 profiling methods used for human identification and present their use in the main cases of human

29 identification focusing on the most relevant issues for the forensic and medico-legal areas.

30

Page 5: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

31

32 Introduction

33 Human genetic identification for medico-legal or forensic purposes is achieved through the

34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the

35 phenotypic description of a set of genomic loci that are specific to that individual. In accordance

36 with international recommendations, particularly with recommendations of the European DNA

37 Profiling Group (EDNAP), currently, only genetic profiles obtained from autosomal short

38 tandem repeats (STR) should be used for genetic fingerprinting. However, in a considerable

39 number of situations of medico-legal and forensic identification, autosomal DNA is highly

40 degraded or isn’t available at all. In these cases the study of mitochondrial DNA (mtDNA) for

41 human identification has become routine (1–5).

42 Mitochondrial DNA (mtDNA) presents several characteristics useful for forensic studies,

43 especially related to the lack of recombination, to a high copy number, and to matrilineal

44 inheritance. mtDNA typing based on sequences of the control region or full genomic sequences

45 analysis is used to analyze a variety of forensic samples such as old bones, teeth and hair, as well

46 as other biological samples where the DNA content is low. Evaluation and reporting of the

47 results requires careful consideration of biological issues as well as other issues such as

48 nomenclature and reference population databases. In this work we review mitochondrial DNA

49 profiling methods used for human identification and present their use in the main cases of human

50 identification focusing on the most relevant issues for the forensic and medico-legal areas.

51

52 Survey methodology

Page 6: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

53 We systematically searched in PubMed for papers in English describing 1) mitochondrial DNA

54 biology and genetics, 2) mitochondrial DNA typing guidelines, 3) mitochondrial DNA

55 nomenclature, 4) mitochondrial DNA sequencing methodologies, 5) mitochondrial DNA

56 population data and databases and 6) mitochondrial DNA in human identification and forensics.

57 Our search wasn’t refined by publishing date, journal or impact factor of the journal, authors or

58 authors affiliations. In addition, we used Guideline documents from the International Society for

59 Forensic Genetics available at https://www.isfg.org/.

60

61 Mitochondrial DNA biology and genetics

62 Mitochondria are cellular organelles that contain an extrachromosomal genome, which is both

63 different and separate from the nuclear genome. The mitochondrial DNA (mtDNA) was first

64 identified and isolated by Margit Nass and Sylvan Nass in 1963, who studied some

65 mitochondrial fibers that according to their fixation, stabilization and staining behavior, appeared

66 to be DNA related (6). However, the complete sequence of the first mtDNA was only published

67 and established as the mtDNA Cambridge Reference Sequence (CRS) eighteen years later, in

68 1981 (7).

69 Essentially, the mtDNA is a 5 mm histone-free circular double-stranded DNA molecule, with

70 around 16,569 base-pairs and weighting 107 Daltons (8). mtDNA strands have different densities

71 due to different G+T base composition. The heavy (H) strand encodes more information, with

72 genes for two rRNAs (12S and 16S), twelve polypeptides and fourteen tRNAs, while the light

73 (L) strand encodes eight tRNAs and one polypeptide. All the 13 protein products are part of the

74 enzyme complexes that constitute the oxidative phosphorylation system. Other characteristic

Page 7: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

75 features of the mtDNA are the intronless genes and the limited, or even absent, intergenic

76 sequences, except in one regulatory region.

77 The mitochondrial D-loop is a triple-stranded region found in the major non-coding region

78 (NCR) of many mitochondrial genomes, and is formed by stable incorporation of a third 680

79 bases DNA strand known as 7S DNA (9). The origin of replication is located at the non-coding

80 or D-loop region, a 1121 base pairs segment that is located between positions 16,024 and 516,

81 according to the CRS numeration (7) (Figure 1). The D-loop region, also comprehends two

82 transcription promotors, one for each strand. Nucleotide positions in the mtDNA genome are

83 numbered according to the convention presented by Anderson et al. (7), which was slightly

84 modified by Andrews et al. (10). More precisely, the numerical designation of each base pair is

85 initiated at an arbitrary position on the H strand, which continues thereafter and around the

86 molecule for approximately 16,569 base pairs.

87 The apparent lack of mtDNA repair mechanisms and the low fidelity of the mtDNA polymerase

88 lead to a significant higher mutation rate in the mitochondrial genome, when compared to the

89 nuclear genome. For example, Sigurğardóttir and collaborators, estimated the mutation rate in

90 the human mtDNA control region to be 0.32x10-6/site/year (10) which compares to

91 0.5 × 10−9/site/year in the nuclear genome (12). Most of the sequence variation between

92 individuals is found in two specific segments of the control region, namely in the hypervariable

93 region 1 (HV1, positions 16,024 to 16,365) and in the hypervariable region 2 (HV2, positions 73

94 to 340) (13). A third hypervariable region (HV3, positions 438 to 574), with additional

95 polymorphic positions can be useful in the resolution of indistinguishable HV1/HV2 samples

96 (14). The small size and relatively high inter-person variability of the HV regions are very useful

97 features for forensic testing purposes.

Page 8: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

98 A mitochondrion contains 2 to 10 copies of mtDNA and each somatic cell can have up to 1,000

99 mitochondria (15,16). Hence, when the amount of the extracted DNA is quite small or degraded,

100 it is more likely that a DNA typing result can be obtained by typing the mtDNA than by typing

101 polymorphic regions that are found in nuclear DNA.

102 Contrarily to the nuclear DNA, the mtDNA is exclusively maternally inherited, which justifies

103 the fact that, apart from mutation, mtDNA sequence of siblings and all maternal relatives is

104 identical (17–19). This specific characteristic can be very helpful in forensic cases, such as in the

105 analysis of the remains of a missing person, where the known maternal relatives can provide

106 some reference samples for a direct comparison to the mtDNA type. Due to the lack of

107 recombination, maternal relatives from several generations apart from the source of evidence (or

108 biological material) can be used for reference samples (17–19).

109 The haploid and monoclonal nature of the mtDNA in most individuals simplifies the process of

110 interpretation of the DNA sequencing results. Still, it is possible to find heteroplasmy at

111 occasional cases (20–25). A person is considered as heteroplasmic if she/he carries more than

112 one detectable mtDNA type. There are two classes of heteroplasmy, related to length

113 polymorphisms and to point substitutions. Only the latter is important for forensic human

114 identification. Most forensic laboratories worldwide do not report length polymorphisms and the

115 guidelines on human identification with mtDNA do not point them as mandatory information

116 (2,4). Furthermore, the information of length polymorphisms has no impact in haplogroups’

117 definition.

118 Heteroplasmy manifests itself in diverse ways (26). An individual may show more than one

119 mtDNA type in a single tissue. An individual may be heteroplasmic in one tissue sample and

120 homoplasmic in another one. Finally, an individual may exhibit one mtDNA type in one tissue

Page 9: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

121 and a different type in another tissue. Of the three possible scenarios, the last one is the least

122 likely to occur. When heteroplasmy is found in the mtDNA of an individual, it usually differs at

123 a single base, in HV1 or HV2.

124 Heteroplasmy was observed at position 16,169 of the mtDNA control region in the putative

125 remains of Tsar Nicholas II of Russia and his brother, the Grand Duke of Russia Georgij

126 Romanov (22,23). Comas et al. (21), in turn, detected heteroplasmy at two distinct positions,

127 16,293 and 16,311, in the mtDNA of an anonymous donor’s plucked hair. Wilson et al. (24)

128 found a family constituted by a mother and two children carrying a heteroplasmic mtDNA at

129 position 16,355 both in blood and buccal swab samples.

130 The existence of heteroplasmic individuals and the limited knowledge about both the mechanism

131 and the rate of heteroplasmy can be issues raised in an attempt to exclude mtDNA evidence from

132 forensic investigations. In the context of forensic analysis, both mtDNA sequences of a reference

133 sample and an evidence sample(s) are compared. If the mtDNA sequences are identical, the

134 samples can’t be excluded since they must have the same origin or derive from the same

135 maternal lineage. Similarly, samples can’t be excluded when heteroplasmy is observed at the

136 same nucleotide positions in both samples. Finally, when one sample is heteroplasmic and the

137 other is homoplasmic but they both share at least one mtDNA species, the samples can’t be

138 excluded since they may have the same origin. Several authors have suggested that samples with

139 mtDNA with one-base difference should be further evaluated, mainly regarding their rate of

140 mutation (1,27–29). When two or more nucleotide differences exist between the two sequences,

141 the overall interpretation is exclusion (4).

142 Heteroplasmy at one nucleotide position is more frequently observed in hair samples, mainly due

143 to genetic drift and to bottlenecks which occur due to the hair follicle’s semiclonal nature (5,30–

Page 10: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

144 32). Hence, if an evidentiary hair sample contains one of the two heteroplasmic lineages that are

145 observed in a reference sample, or vice versa, then the interpretation of exclusion may be

146 incorrect. In this case, typing additional hairs may be required to solve the problem (5).

147 As it was previously pointed out, the mitochondrial genome is maternally inherited. Even though

148 the sperm contains a few mitochondria in the neck and in the tail region, the male mitochondrial

149 genome is destroyed either during or shortly after the fertilization. More precisely, sperm

150 mitochondria disappear in the early embryogenesis, namely by selective destruction, inactivation

151 or dilution (33–35). Nevertheless, there are a few examples of paternal inheritance of the

152 mitochondrial genome in animals (36), and by that reason and despite the limited evidence for

153 paternal inheritance of the mitochondrial genome in humans, the courtroom can use such

154 possibility to exclude the use of mtDNA evidence.

155

156 Mitochondrial DNA Nomenclature

157 Even though the process of naming mtDNA sequences seems simple and obvious, it is crucial to

158 properly consider the nomenclatures, since complications might arise. Considering that listing

159 more than 600 bases in order to describe the results from a new HV1 and HV2 sequence would

160 be unpractical, an alternative approach was developed which essentially identifies and reports the

161 differences relative to a reference sequence (Cambridge Reference Sequence or CRS sequence)

162 (7). For example, if the bases beyond the position 16,192 were out of the register by one base

163 due to the insertion of a C the mutation is designated as 16,192.1 C. If two Cs were inserted, they

164 would be designated as 16,192.1 C and 16,192.2 C. Regarding deletions, these are recorded by

165 the number of the base(s) that is missing, with respect to the CRS (i.e., 249 D or 249-). The bases

166 that cannot be unambiguously determined are coded by using an N. The guidelines that are used

Page 11: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

167 to record all the sequence’s differences have been described by several authors (1,2,4,29), and

168 are used by the forensic community in general.

169 In forensics, the origin of the evidentiary sample is unknown. Consequently, and to be fair to the

170 defendant, it is important to list any sequences in a reference mtDNA database that are identical

171 to the evidentiary mtDNA sequence, especially when estimating how rare or common the

172 evidentiary mtDNA sequence is. Currently, most ambiguities in the alignment/nomenclature

173 arise due to insertions and/or deletions (indels). Hence, Wilson et al. (37) developed an approach

174 to attempt to standardize all mtDNA alignments which is based on a phylogenetic context, and

175 gives differential weighting to transversions, transitions, insertions and deletions.

176 The nomenclature approach mentioned above aims the standardization of reference mtDNA

177 databases, which is why recommendations are provided to address several scenarios (1,2,4,29).

178 The recommendations are as follows: a) characterize the variant(s), namely by using the least

179 number of differences (i.e., insertions, substitutions, deletions) from the reference sequence; b) if

180 there is more than one alignment, each one having the same number of differences when it

181 comes to the reference sequence, indels must be prioritized; c) indels must be placed at 3’ end,

182 especially with respect to the light strand (if possible, deletions and/or insertions must be

183 combined); and d) gaps are combined only when they can be placed at the 3’ end, while

184 maintaining the exact same number of differences from the reference sequence. The bases that

185 are designated with an N should not affect these recommendations. These recommendations are

186 hierarchical, which is why the first recommendation takes precedence, being immediately

187 followed by the remaining ones, that are, and to some degree, arbitrary. The last recommendation

188 is a clarification, since it aims to facilitate the alignment of all interpretations. Lastly, it is

Page 12: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

189 relevant to point out that several examples have been identified, where alternative alignment

190 strategies are possible, being adequately described elsewhere in the literature (37).

191

192 Mitochondrial DNA Typing Guidelines

193 In 2014 the DNA Commission of the International Society of Forensic Genetics (ISFG)

194 published updated guidelines and recommendations concerning mitochondrial DNA typing.

195 These guidelines referred to good laboratory practices, targeted region, amplification and

196 sequencing ranges, reference sequence, alignment and notation, heteroplasmy, haplogrouping of

197 mtDNA sequences, and databases and database searches. In Table 2 we present the 16

198 recommendations of ISFG. Overall, these are the main guidelines concerning the application of

199 mtDNA polymorphisms in human identification, which are regularly revised and published by

200 the International Society of Forensic Genetics (1,2,4,29).

201

202 Mitochondrial DNA Sequencing Methodologies

203 In 1977 Sanger presented the first DNA sequencing technology (38), also called the chain

204 termination method and now known as first generation sequencing. The incorporation of ddNTPs

205 in newly synthesized DNA strands results in termination of the elongation process and

206 correspondent knowledge about the specific nucleotide present at the sequence at each position.

207 Sanger sequencing method can produce reads from 25 up to 1200 nucleotide positions, allowing

208 the read of a maximum of 96 kb nucleotides in 2 hours.

209 Since 2005 a wide number and variety of new sequencing methods have been developed and

210 launched on the market (39). Second generation sequencing methods do not allow to read the

211 complete sequence of an individual genome at one-time step but only small DNA fragments each

Page 13: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

212 time, from 35 up to 75 nucleotide positions using the SOLiD technology (39–41), to 100 up to

213 1,000 nucleotide positions using the 454 Pyrosequencing technology (39,42). Depending on the

214 technology, these methods allow sequence reads from 60 to 80 million nucleotide positions in 2

215 hours or 6 billion nucleotide positions in 1- 2 weeks as (43,44).

216 Third generation sequencing technologies were introduced in the market in 2010 and allow

217 massive parallel analysis combined with reading in real time (39). With these methods it is

218 possible to read between 8,000 to 200,000 nucleotide positions at one-time, and they allow

219 sequence reads from 100,000 up to 4 Tb nucleotide positions in less than 48 hours (45,46). Many

220 different methodologies designed to attend to different priorities had been proposed. Some

221 methodologies are focused on the number of reads, other’s give priority to results accuracy and

222 other’s attend to the range of the reads. Pacific Biosciences (PacBio) introduced equipment’s

223 designed to produce low number of reads with high accuracy (45,47), such as PacBio RSII (48),

224 while Oxford Nanopore Technologies (ONT) designed methodologies to produce high number

225 of reads with lower accuracy, such as ONT MinION (46), and Illumina gives priority to short-

226 read with high accuracy technologies and present’s a ‘post-light’ methodology with directly

227 sensing non-optical sequencing, such as Illumina HiSeq (49,50). These massive parallel

228 sequencing (MPS) technologies have been quickly applied in a wide range of areas where

229 forensics is included (39). Personal Genome Machine (PGM) first introduced the Ion Torrent

230 methodology for sequence complete mtGenomes in forensic context (51). Nevertheless,

231 regarding to mtDNA analysis for forensic human identification, and according to current

232 international guidelines (2,4), Sanger sequencing still continues to be an adequate method and is

233 used in most casework laboratories worldwide (52). However, if guidelines on human

234 identification with mtDNA come to establish as mandatory the inclusion of information of the

Page 14: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

235 coding region, the benefits of MPS could be explored to type whole mtDNA genomes. In 2015,

236 Magalhães and collaborators presented the results of Ion Torrent PGM NGS technology applied

237 to whole mtDNA molecule sequencing. Although it proved to be sensitive and accurate at

238 detecting and quantifying mixture and heteroplasmy, there were some problems in the coverage

239 of the mtDNA genome with some regions presenting extreme strand bias, and presenting false

240 positives mostly generated by alignment problems in the analysis algorithms (53). Woerner and

241 collaborators presented, in 2018, the evaluation of the Precision ID mtDNA Whole Genome

242 Panel on two massively parallel sequencing commercial systems - Ion S5 System (Thermo

243 Fisher Scientific) and MiSeq FGx Desktop Sequencer (Illumina) (54). According to their

244 conclusions, Ion and MiSeq methodologies provide consistent mtDNA haplotypes estimation.

245 Beyond this study many other studies on the use of MPS technologies for forensic genetics and

246 mtDNA analysis have been published (55,56,65–67,57–64). However, further validation studies

247 and specialized software functionality tailored to forensic practice should be produced in order to

248 facilitate the incorporation of NGS processing into standard casework applications (68,69).

249

250 Mitochondrial DNA Population Data and Databases

251 When two mtDNA sequences, one from an evidence sample and another from a reference

252 sample, cannot be excluded as being originated from the exact same source, it is necessary to

253 convey some information concerning the rarity of the mtDNA profile. The current practice is to

254 count how many times a specific sequence is observed within a population database(s) (70).

255 Overall, the population databases that are used in forensics comprehend several convenience

256 samples, representing the major population groups of the potential contributors in terms of

257 evidence.

Page 15: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

258 Phylogenetic methods have been used in order to identify the main human haplogroups, as well

259 as the most important SNPs that define such groups. Regarding the construction of a

260 phylogenetic tree of mtDNA sequences, there are several alternative methods, which include

261 neighbor-joining, maximum likelihood, minimum spanning networks, maximum parsimony and

262 Bayesian methods (71). Overall, the large majority of individuals from African populations, and

263 specially from sub-Saharan African populations, are categorized into one of the main haplogroup

264 lineages that diverged from macro-haplogroup L - L0, L1, L2, L3, L4, L5 and L6 - (72–79). On

265 the other hand, more than 90% of the individuals of the European and USA Caucasian

266 populations are categorized into 10 main haplogroup lineages - H, I, J, K, M, T, U, V, W and X -

267 (5,76,77,80,81). Concerning to African-American populations, the most commonly observed

268 haplogroups are L2a, L1c, L1b and L3b (75). The main haplogroups found in individuals from

269 Asian populations are haplogroups M and N (82,83).

270 The first database for human mtDNA was Mitomap in 1995 (84). In 1996, this database

271 developed into an online database, www.mitomap.org, containing published human mtDNA

272 variation along with geographic and disease specific variants. Currently, Mitomap is manually

273 curated, frequently updated and a functionally rich resource, presenting high-quality human

274 mtDNA data for clinicians, investigators and geneticists (84). Mitomap has three main categories

275 for usage. It contains some background information regarding the human mitochondrial DNA,

276 such as the general representation of mtDNA, haplogroups and their frequencies and illustrations

277 of mtDNA, among others. Furthermore, users can also find information about other mtDNA-

278 specific databases, tools and useful resources.

279 Mitomap stores the annotated listing of the mtDNA variants from both healthy individuals and

280 patients. The frequencies of the variants are calculated from human mitogenomes retrieved from

Page 16: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

281 the GenBank. Therefore, users can retrieve information about the loci, the nucleotide change, the

282 codon position and the number, among others, and download the most important data in different

283 file formats.

284 Mitomap contains the Mitomaster analysis tool, currently providing the Application

285 Programming Interface for it. The main function of this tool is to allow the identification of

286 polymorphic positions, the calculation of variant statistics and the assignment of haplogroups to

287 complete or partial mitogenomes. Such query might be performed by recurring to mtDNA

288 sequences, to GenBank identifiers or to single nucleotide variants (85).

289 Another database for human mtDNA is the EDNAP Mitochondrial DNA Population Database

290 (EMPOP, www.empop.org) (86). In its early stages, EMPOP was designed and envisioned to

291 serve as a reference population database, specifically to be used in the evaluation of the mtDNA

292 evidence around the world, aiming to provide the highest quality mtDNA data. The architecture

293 of this online database and its analysis tools, which are also provided via the website, have

294 evolved over the last few years, even though the main emphasis of the EMPOP database remains

295 to be mtDNA data quality. Therefore, and as a direct consequence, EMPOP not only serves as a

296 reference population database, but also as a quality-control tool for scientists in forensic genetics,

297 as well as in other disciplines. Finally, and even though there is a significant number of high-

298 quality reference population databases for forensic comparisons, EMPOP is the most

299 comprehensive resource, especially from the standpoint of the populations that are represented in

300 such database (4).

301 EMPOP uses SAM, a string-based search algorithm that converts query and database sequences

302 into alignment-free nucleotide strings and thus guarantees that a haplotype is found in a database

303 query regardless of its alignment. SAM-E, an updated version of SAM that considers block

Page 17: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

304 InDels as phylogenetic events, is used currently. At EMPOP, the tool haplogroup browser

305 represents all the established Phylotree haplogroups in convenient searchable format and

306 provides the number of EMPOP sequences assigned to the respective haplogroups by estimating

307 mitochondrial DNA haplogroups using a maximum likelihood approach EMMA (87). For

308 multiple possible haplogroups, most recent common ancestor (MRCA) haplogroups are

309 provided.

310 In order to facilitate a better use of known mtDNA variation, van Oven & Kayser, in 2008, have

311 constructed an updated comprehensive phylogeny of global human mtDNA variation -

312 PhyloTree -, based on both coding and control region mutations (78). The complete mtDNA tree

313 includes previously published as well as newly identified haplogroups, is continuously and

314 regularly updated, and is available online at http://www.phylotree.org. In figure 2 we present the

315 basic structure of the PhyloTree, which is divided into 25 subtrees accessible through

316 http://www.phylotree.org. At EMPOP the geographical haplogroup patterns are provided via

317 maps to visualize and better understand their geographical distribution (Figure 3).

Page 18: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

318 From another perspective, ethical and legal problems may arise in the implementation of mtDNA

319 databases. The informative potential which the analysis of mtDNA entails can generate privacy

320 questions (88,89). Mitochondrial diseases affect between 1 in 4,000 and 1 in 5,000 people. In

321 most people, primary mitochondrial disease is a genetic condition that can be inherited.

322 Information about the mitochondrial genome composition may therefore enable the identification

323 of the current or future state of health of an individual. For this reason, the analysis of mtDNA

324 must be carried out only on non-coding regions, which have not been associated with any kind of

325 disease or phenotypical information.

326

327 Mitochondrial DNA in Medico-Legal Human Identification

328 At this section we present some selected published cases of human identification with mtDNA.

329 Table 1 summarizes the selected published cases.

330 In 1991, Stoneking and collaborators presented the first report of successful application of the

331 mtDNA typing to a case that involved the individual identification of skeletal remains (90). This

332 was the case of a 3-year-old child disappeared from her parents’ house in October of 1984. In

333 March of 1986, the skeletal remains of a human child were found in the desert, 2 miles away

334 from the parents’ residence. Using hybridization with 23 sequence-specific oligonucleotide

335 probes (SSO) targeting nine regions of HV1 and HV2 on the control region, they found that the

336 skeletal sample and the mother shared the same mtDNA types, corroborating that those skeletal

337 remains were of the missing child. Moreover, they anticipated that the mtDNA typing would be

338 valuable not only in linking biological remains to missing individuals, but also in the analysis of

339 material in sexual assault cases.

Page 19: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

340 In July of 1990, the body of a female, in a quite advanced state of decomposition, was discovered

341 in an open field. Despite being impossible to identify the remains by analyzing the individual’s

342 clothes and fingerprints, her dentition was consistent with old dental records of a missing person

343 from the same region. Some fragments of the heel bone and fibula, plus samples of the hair and

344 skin, were provided for the DNA analysis, as well as a blood sample from a putative sister of the

345 deceased. In 1992, Sullivan and collaborators attempted the identification of the highly

346 decomposed remains of the corpse, amplifying and directly sequencing 2 hypervariable segments

347 within HV1 and HV2 in the mtDNA (91). No statistical value was given to the evidence, since

348 no database of the British population sequences were available at that time. Still, no differences

349 were found between both sequences, the blood of the putative sister and the bone of the corpse,

350 indicating they were sisters.

351 Perhaps the most well-known lineage study using mtDNA sequencing is related to the

352 identification of Tsar Nicholas II’s bones. Gill and collaborators, in 1994 (22), and Ivanov and

353 collaborators, in 1996 (23), compared the sequences of HV1 and HV2 fragments of the mtDNA

354 obtained from the putative bones of the Tsar with those of Tsar living maternal relatives,

355 Countess Xenia Cheremeteff-Sfiri and the Duke of Fife. It was found that the sequences were

356 very similar, corroborating the hypothesis that the bone remains were of Tsar Nicholas II.

357 In a distinct scenario, Deng et al. (92) used direct sequencing of the HV1 and HV2 fragments of

358 the mtDNA control region to identify Tsunami victims in Thailand in 2004. This tsunami killed

359 nearly 5,400 people in Southern Thailand, including foreign tourists and local residents. They

360 succeeded in obtaining fully informative results for mtDNA markers (HV1 and HV2) from 258

361 tooth samples with a success rate of 51% (258/507).

Page 20: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

362 More recently, in 2010, Ríos and collaborators (93) used direct sequencing of the HV1 and HV2

363 fragments of the mtDNA control region to identify human skeletal remains that were exhumed

364 from a mass grave from the Spanish Civil War (1936-1939). There was a match between the

365 mtDNA profiles of the biologically youngest skeleton and the sister of the youngest person that

366 was presumptively known to be buried in the grave, allowing the identification of that person.

367 Also in 2010, Piccinini and collaborators (94) attempted to identify the remains of a famous

368 World War One Italian soldier that was killed in a battle along the Italian front in 1915. Like

369 previous studies, they used the direct sequencing of the HV1 and HV2 fragments of the mtDNA

370 control region to define single mtDNA haplotypes. The availability of the offspring maternal

371 lineage allowed the mtDNA analysis, which presented a clear exclusion scenario: the remains did

372 not belong to the supposed war hero.

373 In 2012, a skeleton was excavated at the site of the Grey Friars friary, in Leicester, which is the

374 last-known resting place of King Richard III (95). To determine if the remains belonged to King

375 Richard III, the HV1, HV2 and HV3 regions of the mtDNA of the skeletal remains and of the

376 living relatives of King Richard III were sequenced and compared. There was a perfect match

377 between the sequences indicating that the remains belong to King Richard III.

378 The communist period in Poland during 1944-1956 resulted in the death of more than 50,000

379 people, who were buried in secret. One mass grave was found at the cemetery Powazki Military,

380 in Warsaw, Poland. In 2016, Ossowski and collaborators (96) identified 50 victims, specifically

381 by using autosomal, Y-STR and direct sequencing of the HV1 and HV2 fragments of the

382 mtDNA control region.

383 In 2016, among the first studies on human identification with mtDNA using massive paralell

384 sequencing, Park and collaborators proposed a protocol that includes the study of ten regions of

Page 21: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

385 mtDNA for the identification of historical human remains with forensic genetic markers (97).

386 They studied a 140-year-old human skeletal remains discovered at a historical site in Deadwood,

387 South Dakota, United States. The remains were in an unmarked grave and there were no records

388 available regarding the identity of the individual. The mtDNA profiles of the unidentified

389 skeletal remains obtained with their method were consistent with H1 haplogroup. This

390 haplogroup is the most common in Western Europe. The ancestry-informative nuclear SNPs also

391 studied in this case indicated a European background. These genetic results are consistent with

392 the findings of previous anthropological report which determined that the Deadwood

393 unidentified skeletal remains belong to a male of European ancestry.

394 In 2017, the victims’ remains from the World Trade Center terrorism act, which occurred in

395 September 11 of 2001, were still being identified by using the mtDNA sequencing technology,

396 among other techniques, with protocols and guidelines as recommended by the International

397 Society for Forensic Genetics (1,2,4,29,98).

398

399 Conclusions

400 Over the last 25 years, mtDNA typing has been widely used around the world to solve several

401 human identifications related issues in violent crimes, lesser crimes, acts of terrorism, mass

402 disasters and missing persons’ cases.

403 Forensic DNA methods are constantly questioned in terms of their admissibility for several years

404 now, and we foresee that this scenario is likely to continue in the future. Some of the most well-

405 known challenges to the mtDNA analysis are focused on the issue of the heteroplasmy as well as

406 on recombination and paternal leakage.

Page 22: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

407 Over the last decades progress in mtDNA typing was overwhelming, going from the examination

408 of small fragment in a matter of days to sequencing multiple mtDNA genomes in a couple of

409 hours using a point of care sequencing machine.

410 An individual’s mtDNA genome can tell us much about his/her ancestors. Even though many

411 would readily accept that there are good reasons for researchers to determine information about

412 an unknown suspect’s potential ancestral background, many still might find the potential to

413 determine genetic dispositions to certain disorders as being unacceptable. Hence, new

414 technologies must be wisely used and for the reasons that they are intended, considering their

415 specific focus and contribution within the field of forensic and medico-legal human

416 identification.

417

418 Acknowledgements

419 None to report.420

421

422 References

423 1. Bär W, Brinkmann B, Budowle B, Carracedo A, Gill P, Holland M, et al. Guidelines for

424 mitochondrial DNA typing. DNA Commission of the International Society for Forensic

425 Genetics. Vox Sang. 2000;79(2):121–5.

426 2. Prinz M, Carracedo A, Mayr WR, Morling N, Parsons TJ, Sajantila A, et al. DNA

427 Commission of the International Society for Forensic Genetics (ISFG): Recommendations

428 regarding the role of forensic genetics for disaster victim identification (DVI). Forensic

429 Sci Int Genet. 2007;1(1):3–12.

430 3. Parson W, Bandelt HJ. Extended guidelines for mtDNA typing of population data in

431 forensic science. Forensic Sci Int Genet. 2007;1(1):13–9.

432 4. Parson W, Gusmão L, Hares DRR, Irwin JAA, Mayr WRR, Morling N, et al. DNA

Page 23: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

433 Commission of the International Society for Forensic Genetics: Revised and extended

434 guidelines for mitochondrial DNA typing. Forensic Sci Int Genet. 2014;13:134–42.

435 5. Budowle B, Allard MW, Wilson MR, Chakraborty R. FORENSICS AND

436 MITOCHONDRIAL DNA: Applications, Debates, and Foundations*. Annu Rev

437 Genomics Hum Genet. 2003;4(1):119–41.

438 6. Nass MMK, Nass S. Intramitochondrial fibers with DNA characteristics. I. Fixation and

439 electron staining reactions. J Cell Biol. 1963;19:593–611.

440 7. Anderson S, Bankier AT, Barrell BG, de Bruijn MHL, Coulson AR, Drouin J, et al.

441 Sequence and organization of the human mitochondrial genome. Nature.

442 1981;290(5806):457–65.

443 8. Taanman J-W. The mitochondrial genome: structure, transcription, translation and

444 replication. Biochim Biophys Acta - Bioenerg. 1999;1410(2):103–23.

445 9. ATIG RK-B, HSOUNA S, BERAUD-COLOMB E, ABDELHAK S. ADN mitochondrial:

446 propriétés et applications. Arch Inst Pasteur Tunis. 2009;86(1–4):3–14.

447 10. Sigurðardóttir S, Helgason A, Gulcher JR, Stefansson K, Donnelly P. The Mutation Rate

448 in the Human mtDNA Control Region. Am J Hum Genet. 2000;66:1599–609.

449 11. Shokolenko IN. Aging: A mitochondrial DNA perspective, critical analysis and an update.

450 World J Exp Med. 2014;4(4):46.

451 12. Scally A. The mutation rate in human evolution and demographic inference. Curr Opin

452 Genet Dev. 2016;41:36–43.

453 13. Greenberg BD, Newbold JE, Sugino A. Intraspecific nucleotide sequence variability

454 surrounding the origin of replication in human mitochondrial DNA. Gene. 1983 Jan

455 1;21(1–2):33–49.

Page 24: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

456 14. Lutz S, Wittig H, Weisser HJ, Heizmann J, Junge A, Dimo-Simonin N, et al. Is it possible

457 to differentiate mtDNA by means of HVIII in samples that cannot be distinguished by

458 sequencing the HVI and HVII regions? Forensic Sci Int. 2000;113(1–3):97–101.

459 15. Elson JL, Samuels DC, Turnbull DM, Chinnery PF. Random Intracellular Drift Explains

460 the Clonal Expansion of Mitochondrial DNA Mutations with Age. Am J Hum Genet.

461 2001;68(3):802–6.

462 16. Wei W, Keogh MJ, Wilson I, Coxhead J, Ryan S, Rollinson S, et al. Mitochondrial DNA

463 point mutations and relative copy number in 1363 disease and control human brains. Acta

464 Neuropathol Commun. 2017;5(1):13.

465 17. Case J, Wallace D. Maternal inheritance of mitochondrial DNA polymorphisms in

466 cultured human fibroblasts. Somatic Cell Genet. 1981;7(1):103–8.

467 18. Giles RE, Blanc H, Cann HM, Wallace DC. Maternal inheritance of human mitochondrial

468 DNA. Proc Natl Acad Sci U S A. 1980;77(11):6715–9.

469 19. Hutchison CA, Newbold JE, Potter SS, Edgell MH. Maternal inheritance of mammalian

470 mitochondrial DNA. Nature. 1974;251(October 1974):536–8.

471 20. Bendall KE, Macaulay VA, Sykes BC. Variable levels of a heteroplasmic point mutation

472 in individual hair roots. Am J Hum Genet. 1997;61(6):1303–8.

473 21. Comas D, Paabo S, Bertranpetit J. Heteroplasmy in the control region of human

474 mitochondrial DNA. Genome Res. 1995;5(1):89–90.

475 22. Gill P, Ivanov PL, Kimpton C, Piercy R, Benson N, Tully G, et al. Identification of the

476 remains of the Romanov family by DNA analysis. Nat Genet. 1994 Feb 1;6(2):130–5.

477 23. Ivanov PL, Wadhams MJ, Roby RK, Holland MM, Weedn VW, Parsons TJ, et al.

478 Mitochondrial DNA sequence heteroplasmy in the Grand Duke of Russia Georgij

Page 25: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

479 Romanov establishes the authenticity of the remains of Tsar Nicholas II. Nat Genet. 1996

480 Apr 1;12(12):417–20.

481 24. Wilson MR, Polanskey D, Replogle J, DiZinno JA, Budowle B. A family exhibiting

482 heteroplasmy in the human mitochondrial DNA control region reveals both somatic

483 mosaicism and pronounced segregation of mitotypes. Hum Genet. 1997;100(2):167–71.

484 25. Bendall KE, Sykes BC. Length heteroplasmy in the first hypervariable segment of the

485 human mtDNA control region. Am J Hum Genet. 1995;57(2):248–56.

486 26. Stewart JE, Fisher CL, Aagaard PJ, Wilson MR, Isenberg AR, Polanskey D, et al. Length

487 variation in HV2 of the human mitochondrial DNA control region. J Forensic Sci.

488 2001;46(4):862–70.

489 27. Alonso A, Salas A, Albarrán C, Arroyo E, Castro A, Crespillo M, et al. Results of the

490 1999-2000 collaborative exercise and proficiency testing program on mitochondrial DNA

491 of the GEP-ISFG: An inter-laboratory study of the observed variability in the

492 heteroplasmy level of hair from the same donor. Forensic Sci Int. 2002;125(1):1–7.

493 28. Holland MM, Parsons TJ. Mitochondrial DNA Sequence Analysis - Validation and Use

494 for Forensic Casework. Forensic Sci Rev. 1999;11(1):21–50.

495 29. Tully G, Bär W, Brinkmann B, Carracedo A, Gill P, Morling N, et al. Considerations by

496 the European DNA profiling (EDNAP) group on the working practices, nomenclature and

497 interpretation of mitochondrial DNA profiles. Forensic Sci Int. 2001;124(1):83–91.

498 30. Buffoli B, Rinaldi F, Labanca M, Sorbellini E, Trink A, Guanziroli E, et al. The human

499 hair: from anatomy to physiology. Int J Dermatol. 2013;(November 2015):1–11.

500 31. Rogers GE. Hair follicle differentiation and regulation. Int J Dev Biol. 2004;48(2–3):163–

501 70.

Page 26: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

502 32. Paus R. Principles of hair cycle control. J Dermatol. 1998;25(12):793–802.

503 33. Cummins JM, Wakayama T, Yanagimachi R. Fate of microinjected sperm components in

504 the mouse oocyte and embryo. Zygote. 1997 Nov 26;5(4):301–8.

505 34. Shitara H, Hayashi JI, Takahama S, Kaneda H, Yonekawa H. Maternal inheritance of

506 mouse mtDNA in interspecific hybrids: Segregation of the leaked paternal mtDNA

507 followed by the prevention of subsequent paternal leakage. Genetics. 1998;148(2):851–7.

508 35. Shitara H, Kaneda H, Sato A, Inoue K, Ogura A, Yonekawa H, et al. Selective and

509 continuous elimination of mitochondria microinjected into mouse eggs from spermatids,

510 but not from liver cells, occurs throughout emborogenesis. Genetics. 2000;156(3):1277–

511 84.

512 36. Gyllensten U, Wharton D, Josefsson A, Wilson AC. Paternal inheritance of mitochondrial

513 DNA in mice. Nature. 1991 Jul 18;352(6332):255–7.

514 37. Wilson MR, Allard MW, Monson K, Miller KWP, Budowle B. Recommendations for

515 consistent treatment of length variants in the human mitochondrial DNA control region.

516 Forensic Sci Int. 2002;129(1):35–42.

517 38. Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors.

518 Proc Natl Acad Sci. 1977;74(12):5463–7.

519 39. Bruijns B, Tiggelaar RM, Gardeniers HJGE. Massively parallel sequencing techniques for

520 forensics: A review. Electrophoresis. 2018;1–13.

521 40. Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM, et al.

522 Molecular biology: Accurate multiplex polony sequencing of an evolved bacterial

523 genome. Science (80- ). 2005 Sep 9;309(5741):1728–32.

524 41. Ondov BD, Cochran C, Landers M, Meredith GD, Dudas M, Bergman NH. An alignment

Page 27: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

525 algorithm for bisulfite sequencing using the Applied Biosystems SOLiD System.

526 Bioinformatics. 2010;26(15):1901–2.

527 42. Dames S, Durtschi J, Geiersbach K, Stephens J, Voelkerding K V. Comparison of the

528 illumina genome analyzer and roche 454 GS FLX for resequencing of hypertrophic

529 cardiomyopathy-associated genes. J Biomol Tech. 2010;21(2):73–80.

530 43. Mascher M, Wu S, St. Amand P, Stein N, Poland J. Application of Genotyping-by-

531 Sequencing on Semiconductor Sequencing Platforms: A Comparison of Genetic and

532 Reference-Based Marker Ordering in Barley. PLoS One. 2013;8(10):1–11.

533 44. Pukk L, Ahmad F, Hasan S, Kisand V, Gross R, Vasemägi A. Less is more: Extreme

534 genome complexity reduction with ddRAD using Ion Torrent semiconductor technology.

535 Mol Ecol Resour. 2015;15(5):1145–52.

536 45. Volden R, Palmer T, Byrne A, Cole C, Schmitz RJ, Green RE, et al. R2C2: Improving

537 nanopore read accuracy enables the sequencing of highly-multiplexed full-length single-

538 cell cDNA. bioRxiv. 2018;115(39):338020.

539 46. Oikonomopoulos S, Wang YC, Djambazian H, Badescu D, Ragoussis J. Benchmarking of

540 the Oxford Nanopore MinION sequencing for quantitative and qualitative assessment of

541 cDNA populations. Sci Rep. 2016;6(August):1–13.

542 47. Lindberg MR, Schmedes SE, Hewitt FC, Haas JL, Ternus L, Kadavy DR, et al. A

543 Comparison and Integration of MiSeq and MinION Platforms for Sequencing Single

544 Source and Mixed Mitochondrial Genomes. 2016;1–16.

545 48. Vossen RHAM, Buermans HPJ. Full-Length Mitochondrial-DNA Sequencing on the

546 PacBio RSII. In Humana Press, New York, NY; 2017. p. 179–84.

547 49. Rothberg JM, Hinz W, Rearick TM, Schultz J, Mileski W, Davey M, et al. An integrated

Page 28: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

548 semiconductor device enabling non-optical genome sequencing. Nature.

549 2011;475(7356):348–52.

550 50. Mcelhoe JA, Holland MM, Makova KD, Su MS, Paul M, Baker CH, et al. HHS Public

551 Access. 2015;20–9.

552 51. Parson W, Strobl C, Huber G, Zimmermann B, Gomes SM, Souto L, et al. Forensic

553 Science International : Genetics Evaluation of next generation mtGenome sequencing

554 using the Ion Torrent Personal Genome Machine ( PGM ). 2013;7:543–9.

555 52. Ballard D. Analysis of Mitochondrial Control Region Using Sanger Sequencing. In

556 Humana Press, New York, NY; 2016. p. 143–55.

557 53. Magalhães S, Marques SL, Alves C, Amorim A, Alvarez L, Goios A. Evaluation of

558 heteroplasmy detection in the Ion Torrent PGM. Forensic Sci Int Genet Suppl Ser.

559 2015;5:e13–5.

560 54. Woerner AE, Ambers A, Wendt FR, King JL, Moura-Neto RS, Silva R, et al. Evaluation

561 of the precision ID mtDNA whole genome panel on two massively parallel sequencing

562 systems. Forensic Sci Int Genet. 2018;36(March):213–24.

563 55. Ovchinnikov I V., Malek MJ, Kjelland K, Drees K. Whole Human Mitochondrial DNA

564 Sequencing. In Humana Press, New York, NY; 2016. p. 157–71.

565 56. Churchill JD, Stoljarova M, King JL, Budowle B. Massively parallel sequencing-enabled

566 mixture analysis of mitochondrial DNA samples. Int J Legal Med. 2018;132(5):1263–72.

567 57. Templeton JEL, Brotherton PM, Llamas B, Soubrier J, Haak W, Cooper A, et al. DNA

568 capture and next-generation sequencing can recover whole mitochondrial genomes from

569 highly degraded samples for human identification. Investig Genet. 2013;4(1):1–13.

570 58. Just RS, Scheible MK, Fast SA, Sturk-Andreaggi K, Higginbotham JL, Lyons EA, et al.

Page 29: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

571 Development of forensic-quality full mtGenome haplotypes: Success rates with low

572 template specimens. Forensic Sci Int Genet. 2014;10(1):73–9.

573 59. Just RS, Scheible MK, Fast SA, Sturk-Andreaggi K, Röck AW, Bush JM, et al. Full

574 mtGenome reference data: Development and characterization of 588 forensic-quality

575 haplotypes representing three U.S. populations. Forensic Sci Int Genet. 2014;14:141–55.

576 60. Just RS, Irwin JA, Parson W. Mitochondrial DNA heteroplasmy in the emerging field of

577 massively parallel sequencing. Forensic Sci Int Genet. 2015;18:131–9.

578 61. Chaitanya L, Ralf A, van Oven M, Kupiec T, Chang J, Lagacé R, et al. Simultaneous

579 Whole Mitochondrial Genome Sequencing with Short Overlapping Amplicons Suitable

580 for Degraded DNA Using the Ion Torrent Personal Genome Machine. Hum Mutat.

581 2015;36(12):1236–47.

582 62. Hollard C, Keyser C, Delabarde T, Gonzalez A, Vilela Lamego C, Zvénigorosky V, et al.

583 Case report: on the use of the HID-Ion AmpliSeqTM Ancestry Panel in a real forensic case.

584 Int J Legal Med. 2017;131(2):351–8.

585 63. Lopopolo M, Børsting C, Pereira V, Morling N. A study of the peopling of Greenland

586 using next generation sequencing of complete mitochondrial genomes. Am J Phys

587 Anthropol. 2016;161(4):698–704.

588 64. Park S, Cho S, Seo HJ, Lee JH, Kim MY, Lee SD. Entire mitochondrial DNA sequencing

589 on massively parallel sequencing for the Korean population. J Korean Med Sci.

590 2017;32(4):587–92.

591 65. Young B, King JL, Budowle B, Armogida L. A technique for setting analytical thresholds

592 in massively parallel sequencing-based forensic DNA analysis. PLoS One. 2017;12(5):1–

593 15.

Page 30: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

594 66. Marshall C, Sturk-Andreaggi K, Daniels-Higginbotham J, Oliver RS, Barritt-Ross S,

595 McMahon TP. Performance evaluation of a mitogenome capture and Illumina sequencing

596 protocol using non-probative, case-type skeletal samples: Implications for the use of a

597 positive control in a next-generation sequencing procedure. Forensic Sci Int Genet.

598 2017;31(May):198–206.

599 67. Ma K, Zhao X, Li H, Cao Y, Li W, Ouyang J, et al. Massive parallel sequencing of

600 mitochondrial DNA genomes from mother-child pairs using the ion torrent personal

601 genome machine (PGM). Forensic Sci Int Genet. 2018;32(November 2017):88–93.

602 68. Peck MA, Brandhagen MD, Marshall C, Diegoli TM, Irwin JA, Sturk-Andreaggi K.

603 Concordance and reproducibility of a next generation mtGenome sequencing method for

604 high-quality samples using the Illumina MiSeq. Forensic Sci Int Genet. 2016;24:103–11.

605 69. Amorim A, Pinto N. Big data in forensic genetics. Forensic Sci Int Genet.

606 2018;37(August):102–5.

607 70. Budowle B, Wilson MR, DiZinno JA, Stauffer C, Fasano MA, Holland MM, et al.

608 Mitochondrial DNA regions HVI and HVII population data. Forensic Sci Int.

609 1999;103(1):23–35.

610 71. Bianchi L, Liò P. Forensic DNA and bioinformatics. Brief Bioinform. 2007;8(2):117–28.

611 72. Bandelt HJ, Alves-Silva J, Guimarães PEM, Santos MS, Brehm A, Pereira L, et al.

612 Phylogeography of the human mitochondrial haplogroup L3e: A snapshot of African

613 prehistory and Atlantic slave trade. Ann Hum Genet. 2001;65(6):549–63.

614 73. Chen Y-S, Olckers A, Schurr TG, Kogelnik AM, Huoponen K, Wallace DC. mtDNA

615 Variation in the South African Kung and Khwe—and Their Genetic Relationships to

616 Other African Populations. Am J Hum Genet. 2000;66(4):1362–83.

Page 31: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

617 74. Rosa A, Brehm A, Kivisild T, Metspalu E, Villems R. MtDNA profile of West Africa

618 Guineans: Towards a better understanding of the Senegambia region. Ann Hum Genet.

619 2004;68(4):340–52.

620 75. Allard MW, Polanskey D, Miller K, Wilson MR, Monson KL, Budowle B.

621 Characterization of human control region sequences of the African American SWGDAM

622 forensic mtDNA data set. Forensic Sci Int. 2005;148(2–3):169–79.

623 76. Behar DM, Villems R, Soodyall H, Blue-Smith J, Pereira L, Metspalu E, et al. The dawn

624 of human matrilineal diversity. Am J Hum Genet. 2008;82(5):1130–40.

625 77. Pakendorf B, Stoneking M. Mitochondrial Dna and Human Evolution. Annu Rev

626 Genomics Hum Genet. 2005;6(1):165–83.

627 78. van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human

628 mitochondrial DNA variation. Hum Mutat. 2008;30(September):386–94.

629 79. Gonder MK, Mortensen HM, Reed FA, De Sousa A, Tishkoff SA. Whole-mtDNA

630 genome sequence analysis of ancient african lineages. Mol Biol Evol. 2007;24(3):757–68.

631 80. Torroni A, Huoponen K, Francalacci P, Petrozzi M, Morelli L, Scozzari R, et al.

632 Classification of european mtDNAs from an analysis of three European populations.

633 Genetics. 1996;144(4):1835–50.

634 81. Allard MW, Miller K, Wilson M, Monson K, Budowle B. Characterization of the

635 Caucasian haplogroups present in the SWGDAM forensic mtDNA dataset for 1771

636 human control region sequences. Scientific Working Group on DNA Analysis Methods. J

637 Forensic Sci. 2002 Nov;47(6):1215–23.

638 82. Allard MW, Wilson MR, Monson KL, Budowle B. Control region sequences for East

639 Asian individuals in the Scientific Working Group on DNA Analysis Methods forensic

Page 32: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

640 mtDNA data set. Leg Med. 2004;6(1):11–24.

641 83. Kivisild T. Maternal ancestry and population history from whole mitochondrial genomes.

642 Investig Genet. 2015;6(1):1–10.

643 84. Ruiz-Pesini E, Lott MT, Procaccio V, Poole JC, Brandon MC, Mishmar D, et al. An

644 enhanced MITOMAP with a global mtDNA mutational phylogeny. Nucleic Acids Res.

645 2007;35(SUPPL. 1):823–8.

646 85. Brandon MC, Ruiz-pesini E, Mishmar D, Procaccio V, Marie T, Nguyen KC, et al.

647 MITOMASTER - A Bioinformatics Tool For the Analysis of Mitochondrial DNA

648 Sequences. Hum Mutat. 2009;30(1):1–6.

649 86. Parson W, Dür A. EMPOP—A forensic mtDNA database. Forensic Sci Int Genet.

650 2007;1(2):88–92.

651 87. Röck AW, Dür A, Van Oven M, Parson W. Concept for estimating mitochondrial DNA

652 haplogroups using a maximum likelihood approach (EMMA). Forensic Sci Int Genet.

653 2013;7(6):601–9.

654 88. Guillen M, Lareu M V., Pestoni C, Salas A, Carracedo A. Ethical-legal problems of DNA

655 databases in criminal investigation. J Med Ethics. 2000;26(4):266–71.

656 89. Wallace HM, Jackson AR, Gruber J, Thibedeau AD. Forensic DNA databases-Ethical and

657 legal standards: A global review. Egypt J Forensic Sci. 2014;4(3):57–63.

658 90. Stoneking M, Hedgecock D, Higuchi RG, Vigilant L, Erlich HA. Population variation of

659 human mtDNA control region sequences detected by enzymatic amplification and

660 sequence-specific oligonucleotide probes. Am J Hum Genet. 1991;48(2):370–82.

661 91. Sullivan KM, Hopgood R, Gill P. Identification of human remains by amplification and

662 automated sequencing of mitochondrial DNA. Int J Legal Med. 1992;105(2):83–6.

Page 33: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

663 92. Deng YJ, Li YZ, Yu XG, Li L, Wu DY, Zhou J, et al. Preliminary DNA identification for

664 the tsunami victims in Thailand. Genomics, Proteomics Bioinforma. 2005;3(3):143–57.

665 93. Ríos L, García-Rubio A, Martínez B, Alonso A, Puente J. Identification process in mass

666 graves from the Spanish Civil War II. Forensic Sci Int. 2010;219(1–3).

667 94. Piccinini A, Coco S, Parson W, Cattaneo C, Gaudio D, Barbazza R, et al. World war one

668 Italian and Austrian soldier identification project: DNA results of the first case. Forensic

669 Sci Int Genet. 2010;4(5):329–33.

670 95. King TE, Fortes GG, Balaresque P, Thomas MG, Balding D, Delser PM, et al.

671 Identification of the remains of King Richard III. Nat Commun. 2014;5:1–8.

672 96. Ossowski A, Diepenbroek M, Kupiec T, Bykowska-Witowska M, Zielińska G,

673 Dembińska T, et al. Genetic Identification of Communist Crimes’ Victims (1944–1956)

674 Based on the Analysis of One of Many Mass Graves Discovered on the Powazki Military

675 Cemetery in Warsaw, Poland. J Forensic Sci. 2016;61(6):1450–5.

676 97. Ambers AD, Churchill JD, King JL, Stoljarova M, Gill-King H, Assidi M, et al. More

677 comprehensive forensic genetic marker analyses for accurate human remains

678 identification using massively parallel DNA sequencing. BMC Genomics. 2016;17(Suppl

679 9).

680 98. Goodwin WH. The use of forensic DNA analysis in humanitarian forensic action: The

681 development of a set of international standards. Forensic Sci Int. 2017;278:221–7.

682

683

684

Page 34: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

685 Figure and Table legends

686 Figure 1 – The human mitochondrial DNA genome with genes and control regions labeled. Adapted from

687 Shokolenko et al., 2014 (11)

688

689 Figure 2 - Phylogenetic tree of global human mitochondrial DNA variation. mtDNA tree Build 17 (18

690 Feb 2016). Available at http://www.phylotree.org/tree/index.htm. mtDNA-MRCA- Most recent common

691 ancestor of mtDNA.

692

693 Figure 3 - Representation of the geographical origin of mtDNA haplogroups and main mutations that are

694 at the origin of each haplogroup. Adapted from Kivisild et al., 2015 (83).

695

696 Table 1 - Selected published cases of human identification with mtDNA.

697

698 Table 2 - Guidelines of the DNA Commission of the International Society of Forensic Genetics,

699 2014

Page 35: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Table 1(on next page)

Selected cases of human identification with mtDNA

Selected published cases of human identification with mtDNA

Page 36: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

1 Table 1 - Selected published cases of human identification with mtDNA

Reference/Year Studied samples

mtDNA studied regions

Used methodologies

Reference samples Results

Stoneking M, Hedgecock D, Higuchi RG, Vigilant L, Erlich HA. Population variation of human mtDNA control region sequences detected by enzymatic amplification and sequence-specific oligonucleotide probes. Am J Hum Genet. 1991;48(2):370–82.

Skeletal remains of a human child, found in 1986

HVI, HVII PCR for amplification

Hybridization with oligonucleotide probes for sequence determination

Parents of a 3-year-old child disappeared from home in 1984

Identical mtDNA sequence in skeletal remains and sample of the 3-year-old child mother

Positive ID

Sullivan KM, Hopgood R, Gill P. Identification of human remains by amplification and automated sequencing of mitochondrial DNA. Int J Legal Med. 1992;105(2):83–6.

Body of a female, in an advanced state of decomposition discovered in 1990

HVI, HVII PCR for amplification

Sanger sequencing

Blood sample from a sister of a deceased female at the same region

No differences were observed between the corpse and blood from the putative sister

Positive ID

Gill P, Ivanov PL, Kimpton C, Piercy R, Benson N, Tully G, et al. Identification of the remains of the Romanov family by DNA analysis. Nat Genet. 1994;6(2):130–5.

Nine skeletons found in a grave in Ekaterinburg, Russia, 1991

HVI, HVII PCR for amplification

Sanger sequencing

Blood sample from Gt. Gt. Grandson of Louise of Hesse-Cassel and from Gt. Gt. Gt. Granddaughter of Louise of Hesse-Cassel

Exact sequence between putative Tsarina Alexandra and putative three children. Exact mtDNA results between putative Tsar Nicholas II and two living maternal relatives of the Tsar

Ivanov PL, Wadhams MJ, Roby RK, Holland MM WV& PT. Mitochondrial DNA sequence heteroplasmy in the Grand Duke of Russia Georgij Romanov establishes the authenticity of the remains of Tsar Nicholas II. Nat Genet. 1996;(12):417–20.

Skeleton of putative Tsar Nicholas II

HVI, HVII PCR for amplification

Sanger sequencing

Skeleton of Grand Duke of Russia Georgij Romanov (Tsar’s brother)Blood sample from Countess Xenia Cheremeteff-Sfiri (maternal Tsar’s relative)

Establishment of the authenticity of the remains of Tsar Nicholas II

Deng YJ, Li YZ, Yu XG, Li L, Wu DY, Zhou J, et al. Preliminary DNA identification for the tsunami victims in Thailand. Genomics, Proteomics Bioinforma. 2005;3(3):143–57.

258 tooth samples from killed people at the 2004 Southeast Asia Thailand

HVI, HVII PCR for amplification

Sanger sequencing

200 relatives of the tsunami victims

200 tsunami victims have been identified, including both Thai nationals and foreign tourists from several nations

Page 37: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Tsunami

Ríos L, García-Rubio A, Martínez B, Alonso A, Puente J. Identification process in mass graves from the Spanish Civil War II. Forensic Sci Int. 2010;219(1–3).

Skeletal remains exhumed from a mass grave from the Spanish Civil War (1936–1939)

HVI, HVII PCR for amplification

Sanger sequencing

Sister of the youngest person presumptively known to be buried in the grave

Match between mtDNA profiles of the biologically youngest skeleton and the sister of the youngest person presumptively known to be buried in the grave

Piccinini A, Coco S, Parson W, Cattaneo C, Gaudio D, Barbazza R, et al. World war one Italian and Austrian soldier identification project: DNA results of the first case. Forensic Sci Int Genet. 2010;4(5):329–33.

Remains of missing soldiers occasionally found during excavations

HVI, HVII PCR for amplification

Sanger sequencing

Offspring of the italian soldier Libero Zugni Tauro

Both mtDNA and Y-STR data showed clear exclusion scenariosbetween the human remains and the reference samples

King TE, Fortes GG, Balaresque P, Thomas MG, Balding D, Delser PM, et al. Identification of the remains of King Richard III. Nat Commun. 2014;5:1–8.

Skeleton excavated at the presumed site of the Grey Friars friary in Leicester, 2012

Whole mitochondrial genome

PCR for amplification

Massive parallel sequencing

Saliva samples of the modern relatives of Richard III

Positive mtDNA match between the only known female-line of Richard III and studied modern relatives of Richard III

Ossowski A, Diepenbroek M, Kupiec T, Bykowska-Witowska M, Zielińska G, Dembińska T, et al. Genetic Identification of Communist Crimes’ Victims (1944–1956) Based on the Analysis of One of Many Mass Graves Discovered on the Powazki Military Cemetery in Warsaw, Poland. J Forensic Sci. 2016;61(6):1450–5.

Remains of eight people buried in one of many mass graves, which were found at the cemetery Powazzki Military in Warsaw, Poland

HVI, HVII PCR for amplification

Sanger sequencing

Reference material was collected from the closest living relatives of Communist Crimes’ Victims (1944–1956)

Positive mtDNA match between 6 putative victims and 6 living relatives

2

Page 38: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Table 2(on next page)

Table 2

Guidelines of the DNA Commission of the International Society of Forensic Genetics, 2014

Page 39: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

1 Table 2 - Guidelines of the DNA Commission of the International Society of Forensic Genetics, 2014

Addressement Recommendation Statement

Recommendation #1

Good laboratory practice and specific protocols for work with mtDNA must be followed in accordance with previous guidelines

Recommendation #2

Negative and positive controls as well as extraction reagent blanks must be carried through the entire laboratory process

Recommendation #3

Reported consensus sequences must be based on redundant sequence information, using forward and reverse sequencing reactions whenever practical

Recommendation #4

Manual transcription of data should be avoided and independent confirmation of consensus haplotypes by two scientists must be performed

General recommendations/ good laboratory practice

Recommendation #5

Laboratories using mtDNA typing in forensic casework shall participate regularly in suitable proficiency testing programs

Targeted region, amplification and sequencing ranges

Recommendation #6

In population genetic studies for forensic databasing purposes, the entire mitochondrial DNA control region should be sequenced.

Reference sequence Recommendation #7

MtDNA sequences should be aligned and reported relative to the revised Cambridge Reference Sequence (rCRS, NC001807), and should include the interpretation range (excluding primer sequence information)

Recommendation #8

IUPAC conventions using capital letters shall be used to describe differences to the rCRS and (point heteroplasmic) mixtures. Lower case letters should be used to indicate mixtures between deleted and non-deleted (inserted and non-inserted) bases. N-designations should only be used when all four bases are observed at a single position (or if no base call can be made at a given position). For the representation of deletions, ‘‘DEL’’, ‘‘del’’ or ‘‘S’’ shall be used

Alignment and notation

Recommendation #9

The alignment and notation of mtDNA sequences should be performed in agreement with the mitochondrial phylogeny (established patterns of mutations). Tools to assist with the notation of mtDNA sequences are available at http:// empop.org/

Recommendation #10

In forensic casework, laboratories must establish their own interpretation and reporting guidelines for observed length and point heteroplasmy. The evaluation of heteroplasmy depends on the limitations of the technology and the quality of the sequencing reactions as well as the experience of the laboratory. Differences in both PHP and LHP do not constitute evidence for excluding two otherwise identical haplotypes as deriving from the same source or same maternal lineage

Heteroplasmy

Recommendation #11

For population database samples, length heteroplasmy in homopolymeric sequence stretches should be interpreted by calling the dominant variant, which can be determined by identifying the position with the highest representation of a non-repetitive peak downstream of the affected stretch

Haplogrouping of mtDNA sequences Recommendation #12

MtDNA population data should be subjected to analytical software tools that facilitate phylogenetic checks for data quality control. A comprehensive suite of QC tools is provided by EMPOP

Recommendation #13

The entire database of available sequences should be searched with respect to the sequencing (interpretation) range to avoid biased query results

Recommendation #14

Laboratories must be able to justify the choice of database(s) and statistical approach used in reporting

Recommendation #15

Laboratories must establish statistical guidelines for use in reporting an mtDNA match between two samples

Databases and database searches

Recommendation #16

Highly variable positions such as length variants in homopolymeric stretches should be disregarded from searches for determining frequency estimates. Heteroplasmic calls should be queried in a manner that

Page 40: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

does not exclude any of the heteroplasmic variants

2

3

Page 41: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Figure 1(on next page)

Genes and control regions labeled of the human mitochondrial DNA genome

The human mitochondrial DNA genome with genes and control regions labeled. Adapted from

Shokolenko et al., 2014 (11)

Page 42: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial
Page 43: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Figure 2(on next page)

Phylogenetic tree of global human mitochondrial DNA variation

Phylogenetic tree of global human mitochondrial DNA variation. mtDNA tree Build 17 (18 Feb 2016).

Available at http://www.phylotree.org/tree/index.htm . mtDNA-MRCA- Most recent common ancestor of

mtDNA.

Page 44: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

mtDNA-MRCA

Page 45: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial

Figure 3(on next page)

Geographical origin of mtDNA haplogroups and main mutations that are at the origin of

each haplogroup

Representation of the geographical origin of mtDNA haplogroups and main mutations that are at the origin

of each haplogroup. Adapted from Kivisild et al., 2015 (83) .

Page 46: Mitochondrial DNA in human identification: a review · 34 definition of genetic profiles. A genetic profile or the genetic fingerprint of an individual is the ... 77 The mitochondrial