Genetic Characterization of ChrX MiniSTRs in Pakistani ...

149
Genetic Characterization of ChrX MiniSTRs in Pakistani Population MUHAMMAD ISRAR National Centre of Excellence in Molecular Biology University of the Punjab, Lahore Pakistan (2012)

Transcript of Genetic Characterization of ChrX MiniSTRs in Pakistani ...

Genetic Characterization of ChrX MiniSTRs in Pakistani Population

MUHAMMAD ISRAR

National Centre of Excellence in Molecular Biology

University of the Punjab, Lahore Pakistan (2012)

Genetic Characterization of ChrX MiniSTRs in Pakistani Population

A THESIS SUBMITTED TO

UNIVERSITY OF THE PUNJAB

IN FULFILLMENT OF THE REQUIREMENTS

FOR THE DEGREE OF

DOCTOR OF PHILOSOPHY

IN

MOLECULAR BIOLOGY

BY

MUHAMMAD ISRAR

SUPERVISORS: DR. AHMAD ALI SHAHID DR. ZIAUR RAHMAN

NATIONAL CENTRE OF EXCELLENCE IN MOLECULAR BIOLOGY UNIVERSITY OF THE PUNJAB LAHORE

PAKISTAN

(October, 2012)

IN THE NAME OF ALLAH, THE MOST BENEFICENT, THE MOST MERCIFUL

AL QURAN

O mankind, indeed We have created you from male and female and made you peoples and tribes that you may know one another. Indeed, the most noble of you in the sight of Allah is the most righteous of you. Indeed, Allah is Knowing and Acquainted.

(Quran 49:13)

DEDICATED

To

My Mother

A strong and gentle soul who taught me to trust in

Allah, believe in hard work and that so much

could be done with little

My Grandmother

For being my first teacher

My Father

For earning an honest living for us and for

supporting and encouraging me to believe in

myself

My Uncle

For being my guardian during my educational

career

CERTIFICATE

It is certified that the research work described in this thesis is the original work of the

author Mr. Muhammad Israr and has been carried out under our direct supervision. We

have personally gone through all the data reported in the manuscript and certify their

correctness/authenticity. It is further certified that the material included in this thesis has

not been used in part or full in a manuscript already submitted or in the process of

submission in partial/complete fulfillment of the award of any other degree from any

other institution. It is also certified that the thesis has been prepared under our supervision

according to the prescribed format and we endorse its evaluation for the award of Ph.D.

degree through the official procedures of the University.

In accordance with the rules of the Centre, data books # M157, M178, 942 and 998 are

declared as unexpendable document that will be kept in the registry of the Centre for a

minimum of three years from the date of the thesis defense examination.

Signature of the Supervisor: __________________

Name: Dr. Ahmad Ali Shahid

Assistant Professor

Signature of the Supervisor: ___________________

Name: Dr. Ziaur Rahman

Assistant Professor

iii

Summary

Short tandem repeats (STRs) are genetic loci containing tandemly repeated

sequences of 2–6 base pairs in length. High polymorphism as well as the ease in

genotyping has made STR DNA Profiling the most popular method of human

identification. As compared with minisatellites, the small amplified fragment length of

STRs facilitates its utility in the analysis of degraded DNA samples.

However, in situations where DNA is highly degraded, poor amplification of the

larger sized loci (300–500 base pairs) in standard multiplex typing kits is common. DNA

template can become highly fragmented as the sample decomposes, and the yield of

complete target fragments is greatly reduced. In this case, the larger amplicons often have

lower sensitivity and fall below the detection threshold which can result in a partial

genetic profile. To solve this problem, the primers were developed to positions as close as

possible to the ends of the repeat reducing the amplified product size. Amplification of

compromised DNA using these reduced sized primer sets in a single reaction was called

Miniplex system.

X-chromosomal short tandem repeats (X-STRs) have been proved to be the best

complement of autosomal markers in complex kinship cases such as deficiency paternity

testing in which disputed child is a female when mother and any paternal relative is

available for testing. X-STRs can also be used to test true sisterhood without father’s

DNA and to identify the female DNA in mixed stains. The experience of using autosomal

short-length amplicon STRs or miniSTRs in profiling of degraded DNA and mass disaster

victims is extended into the realm of X-chromosomal (ChrX) STR miniaturization. About

half of the total X-STRs are now short length amplicons and the focus is shifting to

completely use the mini versions of all of them. Multiplexing of these can be used for

iv

solving complex paternity cases and association of mass disaster victims with their

families. This technology may herald a new dimension into research of population

genetics and evolution.

This study presents development and optimization of a new ChrX-miniSTR

multiplex system providing short-amplicon (<200bp) fragments. This multiplex includes

11 ChrX miniSTRs (DXS101, DXS6789, DXS6793, DXS7132, DXS7423, DXS7424,

DXS8378, DXS9902, GATA31E08, GATA172D05 and HPRTB) as well as gender

determining locus Amelogenin. Developmental validation studies included sensitivity and

specificity studies, cycle number, annealing temperature, primer concentration, peak

balance and degraded DNA studies. Allele frequency distribution of 11 X-STRs was

determined in a large group of individuals including males and females from 4 provinces

(Baluchistan =100, KP = 120, Punjab = 250, Sindh = 100) of Pakistan. A total of 5–11

alleles were observed for each locus and altogether 79 alleles for all 11 X-STR loci.

Heterozygosity in females ranged from 0.5351 to 0.8332. No significant deviation was

observed from Hardy–Weinberg equilibrium for all 11 microsatellites. The power of

discrimination using all the 11 X-STRs was 1.28 x 10-11 in females and 2.64 x 10-7 in

males, which fulfills demands for paternity testing.

This study also includes haplotype analysis of DXS7424-DXS101 in a sample of

149 males from Baluchi, KP and Sindhi population. In this cluster, 9 alleles for marker

DXS101 were identified while 8 alleles were identified for DXS7424 and total of 46

different haplotypes were identified through genotyping.

The data can be used as reference database for Pakistani Population along with the

current battery of autosomal STR for forensic case work to increase the discrimination

capacity and strengthen the existing system.

v

Acknowledgements

All acclamations and appreciations are for ALIMIGHTY ALLAH, the Omnipotent, the

Omnipresent, the Compassionate, the Beneficent and the source of all knowledge and wisdom, who

bestowed upon me the intellectual ability, courage and strength to complete this humble contribution

towards knowledge. I am proud of being a follower of the Holy Prophet Hazrat Muhammad

(PBUH), the most perfect and exalted among and of ever born on the surface of earth, who declared it

to be an obligatory duty of every man and woman to seek and acquire knowledge.

I feel highly privileged to take this opportunity to wish my profound gratitude with a deep

sense of obligation to my doctoral research supervisors, Dr. Ahmad Ali Shahid and Dr. Ziaur

Rahman, Assistant Professors, National Centre of Excellence in Molecular Biology, for their personal

interest, inspiring guidance, helping attitude, and above all for providing necessary laboratory facilities

during the whole span of this research work. Thank you for your skillful guidance, invaluable

suggestions and sincere attitude throughout the course of my research. Thank you for teaching me the

noble arts of science, project management, technical writing, manuscripts preparation and

publications.

My wholehearted thanks go to the worthy Director, Dr. Tayyab Husnain for providing all the

necessary facilities for my research at the CEMB. He has been helpful in every facet of my studies and

research. He was an excellent advisor in many of the meetings that we had. Furthermore, I am greatly

indebted to Dr. S. Riazuddin, National Professor and founding director of CEMB and a researcher par

excellence, for his valuable suggestion, sympathetic attitude and cordial co-operation throughout the

progress of this research. I would like to thank Dr. Shaheen N. Khan for facilitating my research

work by extending her cordial support and guidance during my stay at the Centre.

Nothing happens all of a sudden but is the outcome of something preceding and this thesis is

no exception. I would like to thank Dr. Farhat Zaheer, Dr. G.A. Niazi, Dr. Idrees Nasir, Dr.

Muhammad Idrees, Dr. Sajida Hassan and Mr. Khalid Masood, all my teachers who taught me

during my stay at the Centre. My cordial thanks go to Dr. M. Saqib Shahzad and Dr. Obaid Ullah

for their help and immense support during every stage of my research and the write up. I wish to

extend my thanks to members of Forensic Research Lab., Genetic Diseases Lab., Genotyping Lab. and

Stem Cell Research Lab., especially, Dr. Shahid Yar Khan, Mr. Akram Tariq, Dr. Mohsin

Shehzad, Dr. Faizan Cheema, Dr. Farooq Sabir, Ms. Farheena Iqbal and Ms. Samra Kausar for

their guidance and cooperation whenever needed. I thank my Lab fellows, Mr. Ghulam Murtaza,

Ms. Shahla Nargis Mir, Dr. Usman Ali Ashfaq, Maida, Sana, Tariq Javed, Sidra Rehman, Sana,

Ms. Farah Naz, Rahat, Dr. Muhammad Ansar and Rabia Faridi for their help and support.

vi

Moreover I would like to thank Dr. Bushra Rashid, Dr. Sadia Mohsin, Dr. Azra, Dr. A. Qayyum

Roa, Mr. Kamran Bajwa, Mr. Fazal-ur-Rehman, Ms. Bushra Ijaz, Mr. Waqar, Mr. Sajid Iqbal,

Mr. Imran, Mr. Atif Anwar and Mr. Zulfiqar. Like a Movie where hundreds and thousands of

people contribute but we only see the on-screen characters, similarly a lot of people contributed and I

would like to thank all the Scientific, Para scientific and Administrative staff of CEMB who had been

directly or indirectly instrumental in my research work.

I thank my primary and high school teachers at Govt. Centennial Model School, Daggar

Buner. I thank my college lecturers at Excelsior College Swat as well as my university lecturers at

Centre of Biotechnology, University of Peshawar for their untiring devotion and dedication to

mould us into better humans and achieve something which I would not be able to do without their

enthusiasm and teachings.

I would like to thank my friends Sulaiman, Ilyas, Haji Akbar and Mian Sahib Zar for being

such wonderful friends during my stay at CEMB. Special thanks to Adnan for his help, thanks to all

the TPs and to all BBTians. Also thanks to Ibrahim and his cousin Ali, who hosted me in Baluchistan

for sampling, and to all the people who participated in this study by giving their blood samples. I

would also like to thank, Shakeel, Hayat, Fawad, Khitab, Niaz, Faidad, Masaud, Irshad, Tahir,

Liaqat, Ali, Abrar, Islam and Kazim, just a few of all the fantastic “CEMBIANS” at the CEMB,

thanks for your friendship and providing such a great research environment. I thank Asma, Sadie,

Fatima and Mona for their help and support in my research and write up. I am grateful to you all for

helping me get through the difficult times, and for all the emotional support, camaraderie,

entertainment, and caring you provided.

No words can express and no deeds can return the love, affection, amiable attitude, sacrifices,

advices, unceasing prayers, support, and inspiration that my mother, my sister Aapa Begum and

brothers Shehriyar and Bashar infused in me during my whole academic career. I thank my uncles

especially Omar and Ali and all my aunts and cousins particularly Dr. Khizar, Dr. Omar, Zeeshan

and Asif Ali, for their utmost support during my career.

Last but not the least; I would like to acknowledge Higher Education Commission (HEC) of

Pakistan for awarding me Indigenous Ph.D. fellowship and supporting a part of this study.

Muhammad Israr

vii

Abbreviations and Symbols

°C Degrees Celsius

ABI Applied Biosystems Incorporated

BLAT BLAST Like Alignment Tool

bp Basepair

CE Capillary Electrophoresis

cM CentiMorgan

CODIS Combined DNA Index System

dH2O Distilled water

DNA Deoxy Ribonucleic Acid

dNTPs DeoxyNucleoside TriPhosphates

EDNAP European DNA Profiling Group

EDTA Ethylene Diamine Tetra Acetic acid

HCl Hydrochloric Acid

ISFG International Society of Forensic Genetics

Kb Kilobases

KP Khyber Pakhtunkhwa

Mb Megabases

MgCl2 Magnesium Chloride

min Minute

mL Milliliter

mM Millimolar

miniSTRs Mini-Short Tandem Repeats

NaCl Sodium Chloride

NCBI National Center for Biotechnology Information

ng Nanogram

p Short Arm of Chromosome

PCI Phenol-Chloroform-Isoamylalcohol

PCR Polymerase Chain Reaction

pg Picogram

pmole Picomole

viii

q Long Arm of Chromosome

RFLP Restriction Fragment Length Polymorphism

RFU Relative Fluorescence Unit

RNA RiboNucleic Acid

rpm Revolution per minute

SDS Sodium Dodecyl Sulphate

s or sec Second

SNP Single Nucleotide Polymorphism

STR Short Tandem Repeat

Taq Thermus aquaticus

TE Tris-EDTA

UV Ultraviolet

Tm Melting Temperature

VNTR Variable Number of Tandem Repeats

X-STR X-chromosome Short Tandem Repeat

Y-STR Y-chromosome Short Tandem Repeat

μg Microgram

μL Microlitre

μM Micromole

ix

Table of Contents

Summary....................................................................................................... iii Acknowledgements....................................................................................... v Abbreviations and Symbols......................................................................... vii List of Tables................................................................................................ xi List of Figures............................................................................................... xii

Chapter 1 1. Introduction............................................................................................. 1

Chapter 2 2. Review of Literature............................................................................... 6

2.1 DNA Polymorphisms and Short Tandem Repeats (STRs).................................... 6 2.2 Techniques Used for Typing of STR Markers.......................................................

2.2.1 Polymerase Chain Reaction (PCR) and Multiplexing.......................................... 2.2.2 Capillary Electrophoresis (CE).............................................................................

9 9 11

2.3 X-Chromosomal STRs: Identity Testing and Beyond.......................................... 12

2.4 Evolution of MiniSTRs.......................................................................................... 17 2.5 Evidence Items Benefitting from MiniSTR Analysis............................................ 21 2.6 Miniaturization of ChrX STRs: Problems and Prospects....................................... 21

Chapter 3 3. Materials and Methods.......................................................................... 26

3.1 Blood Samples Collection..................................................................................... 26 3.2 DNA Extraction from Blood Samples................................................................... 27 3.3 DNA Quantitation.................................................................................................

3.3.1 Spectrophotometery.............................................................................................. 3.3.2 Gel Electrophoresis...............................................................................................

29 29 30

3.4 Multiplex Design Strategy..................................................................................... 3.4.1 Selection of Loci, Primer Design and Labelling................................................... 3.4.2 Size Determination................................................................................................ 3.4.3 Multiplex Schematics............................................................................................ 3.4.4 Primer Concentration............................................................................................

30 30 32 33 34

3.5 Polymerase Chain Reaction (PCR)........................................................................ 3.5.1 PCR Setup............................................................................................................. 3.5.2 Thermal Cycling Parameters................................................................................. 3.5.3 Cycle Number....................................................................................................... 3.5.4 Peak Balance......................................................................................................... 3.5.5 Sensitivity Studies................................................................................................. 3.5.6 Nonhuman DNA and Specificity Studies............................................................. 3.5.7 Degraded DNA Studies.........................................................................................

3.5.7.1 Enzymatic Degradation............................................................................ 3.5.7.2 Mechanical Degradation..........................................................................

3.5.8 Gel Extraction Protocol......................................................................................... 3.5.8.1 Gel Dissociation...................................................................................... 3.5.8.2 DNA Binding.......................................................................................... 3.5.8.3 Wash......................................................................................................... 3.5.8.4 DNA Elution............................................................................................

34 35 36 37 37 37 38 38 38 38 39 39 39 40 40

3.6 Detection and DNA Analysis................................................................................. 3.6.1 Sample Preparation for ABI 3130 Genetic Analyzer............................................ 3.6.2 Sample Electrophoresis.........................................................................................

41 41 41

x

3.6.3 Absorption/Emission Spectra of Fluorescent Dyes Used for Labelling............... 3.6.4 Software Programs for Data Analysis................................................................... 3.6.5 Statistical Analysis and Formulae.........................................................................

42 42 44

Chapter 4 4. Results...................................................................................................... 46

4.1 Development of Multiplexes.................................................................................. 4.1.1 Screening of Primers............................................................................................. 4.1.2 Development of 7-Plex......................................................................................... 4.1.3 Development of 12-Plex.......................................................................................

4.1.3.1 Sensitivity Studies.................................................................................... 4.1.3.2 Primer Concentration............................................................................... 4.1.3.3 Colour Balance......................................................................................... 4.1.3.4 Cycle Number.......................................................................................... 4.1.3.5 Annealing Temperature. .......................................................................... 4.1.3.6 MgCl2 Titration....................................................................................... 4.1.3.7 Specificity................................................................................................ 4.1.3.8 Efficiency of Multiplex Assay.................................................................

46 46 47 49 49 50 50 51 52 53 54 55

4.2 Degraded DNA Studies.......................................................................................... 4.2.1 Enzymatic Degradation......................................................................................... 4.2.2 Mechanical Degradation.......................................................................................

57 57 58

4.3 Population Genetics................................................................................................ 4.3.1 Baluchistan............................................................................................................ 4.3.2 Khyber Pakhtunkhwa (KP)................................................................................... 4.3.3 The Punjab............................................................................................................ 4.3.4 Sindh.....................................................................................................................

60 62 66 70 74

4.4 Linkage Disequilibrium.......................................................................................... 78 4.5 Haplotype Analysis................................................................................................

4.5.1 Allele Frequency for DXS7133............................................................................ 83 87

4.6 Allelic Ladder......................................................................................................... 88 4.7 Phylogenetic Analysis............................................................................................ 91

Chapter 5 5. Discussion................................................................................................. 94

5.1 Comparison of Multiplex PCR Systems................................................................ 94 5.2 Allele Distribution and Population Comparisons.................................................. 95 5.3 Linkage Disequilibrium and Haplotype Analysis.................................................. 99 5.4 Phylogenetic Studies.............................................................................................. 100 5.5 Conclusions............................................................................................................ 102

Chapter 6 6. References................................................................................................ 104

Publication............................................................................................... 127

xi

List of Tables

Table No. Page Table 2.1 X-Chromosomal miniSTRs……………………………………………… 23 Table 3.1 Markers along with their linkage groups and mapping information…….. 31 Table 3.2 Genotyping Primers Sequences and Dye Labelling................................... 32 Table 3.3a Primers concentration and ingredients for 7-plex master mix………… 35 Table 3.3b Primers concentration and ingredients for 12-plex master mix…………. 35 Table 3.3c Primers concentration and ingredients for 4-plex master mix………… 36 Table 3.4 Fluorescence Spectral Parameters……………………………………….. 42 Table 4.1 Primers interactions of 12 primer pairs with alignment score 6 or greater 47 Table 4.2 Population Specific F Statistics…………………………………………. 60 Table 4.3 Allele Frequency Distribution in Baluchi Population…………………… 64 Table 4.4 Forensic Efficiency Parameters for Baluchi Population………………… 65 Table 4.5 Allele Frequency Distribution in KP Population………………………... 67 Table 4.6 Forensic Efficiency Parameters for KP Population……………………... 68 Table 4.7 Allele Frequency Distribution in Punjabi Population…………………… 71 Table 4.8 Forensic Efficiency Parameters for Punjabi Population………………… 72 Table 4.9 Allele Frequency Distribution in Sindhi Population…………………….. 75 Table 4.10 Forensic Efficiency Parameters for Sindhi Population………………….. 76 Table 4.11 Linkage Disequilibrium Matrix Baluchistan…………………………….. 79 Table 4.12 Linkage Disequilibrium Matrix KP...…………………………………… 80 Table 4.13 Linkage Disequilibrium Matrix Punjab…………………………………. 81 Table 4.14 Linkage Disequilibrium Matrix Sindh…………………………………... 82 Table 4.15 Haplotype analysis of DXS7424-DXS101 in Baluchi Population………. 84 Table 4.16 Haplotype analysis of DXS7424-DXS101 in KP Population……...……. 85 Table 4.17 Haplotype analysis of DXS7424-DXS101 in Sindhi Population………... 86 Table 4.18 Allele Frequency for DXS7133…………………………………………. 87 Table 4.19 Observed alleles, fragment lengths and repeat size of 12-X miniSTRs…. 88 Table 4.20 Alleles of 12 X-STRs for allelic ladder…………………………………. 89

xii

List of Figures

Figure No. Page Figure 2.1 Polymorphic Regions in the Human Genome……………………………. 7 Figure 2.2 Localization of STRs on X-Chromosome………………………………... 14 Figure 2.3 MiniSTRs vs. conventional STRs………………………………………... 18 Figure 2.4 Discordance Between miniSTRs and Conventional STRs………………. 18 Figure 2.5 Evolution of miniSTRs – a Timeline…………………………………….. 20 Figure 3.1 Sampling Area and Number of Samples collected………………………. 26 Figure 3.2a Multiplex Schematics for 7-Plex…………………………………………. 33 Figure 3.2b Multiplex Schematics for 4-Plex…………………………………………. 33 Figure 3.2c Multiplex Schematics for 12-Plex………………………………………... 34 Figure 3.3a PCR Profile for 7-Plex…………………………………………………… 36 Figure 3.3b PCR Profile for 12-Plex………………………………………………….. 37 Figure 3.4 Emission Spectra…………………………………………………………. 42 Figure 4.1 Female DNA Profile of 7 X-STRs……………………………………….. 48 Figure 4.2 Sensitivity of 12-Plex…………………………………………………….. 49 Figure 4.3 Average amplification in each dye lane………………………………….. 50 Figure 4.4 Amplification at different number of PCR cycles ………………………. 51 Figure 4.5 Amplification at different annealing temperatures………………………. 52 Figure 4.6 MgCl2 Titration for 12-Plex……………………………………………… 53 Figure 4.7a Bovine DNA amplified through 12-Plex………………………………… 54 Figure 4.7b Equine DNA amplified through 12-Plex…………………………………. 55 Figure 4.8 Male DNA Profile of 11 X-STRs plus sex determining locus Amelogenin……. 56 Figure 4.9a DNA Degraded with DNase I……………………………………………. 57 Figure 4.9b DNA Degraded with ultrasonication……………………………………... 58 Figure 4.10 Degraded DNA amplified through 12-Plex……………………………… 59 Figure 4.11 Matrix of pair wise Fst between populations…………………………….. 61 Figure 4.12 Population Assignment Test for Baluchistan against other populations…. 62 Figure 4.13 Number of Alleles at each locus in Baluchi Population………………….. 63 Figure 4.14 Number of Alleles at each locus in KP Population………………………. 66 Figure 4.15 Population Assignment Test for KP against other populations………….. 69 Figure 4.16 Number of Alleles at each locus in Punjabi Population………………….. 70 Figure 4.17 Population Assignment Test for Punjab against other populations………. 73 Figure 4.18 Number of Alleles at each locus in Sindhi Population…………………… 74 Figure 4.19 Population Assignment Test for Sindhi against other populations………. 77 Figure 4.20 Tetraplex Profile of Male DNA………………………………………….. 83 Figure 4.21 Allelic Ladder for 12 X-STRs……………………………………………. 90 Figure 4.22 Phylogenetic Tree of 61 populations around the world …………………. 92

Chapter 1

The world is full of obvious things which nobody by any chance ever

observes.

Sherlock Holmes - The Hound of the Baskervilles

1

1. Introduction

Sherlock Holmes said, “I had come to an entirely erroneous conclusion which

shows how dangerous it always is to reason from insufficient data.” A century later,

forensic science has moved forward a long way and instead of crime psychology, it is the

empirical evidence that is of interest to a forensic scientist. But now it is even more

necessary to have sufficient data for an accurate analysis in a forensic laboratory lest the

faulty evidence be responsible for conviction of an innocent citizen (Caskey and

Hammond, 1992, Coovadia, 2008). As the forensic techniques advance, so do the

techniques of the perpetrators of the crimes. Moreover, the industrialization of our world

further confounds the evidence found at a crime scene or the samples left on a mass

disaster site. Hence, there is a constant pressure to develop novel and precise methods to

arrest the guilty but to exonerate the innocent or to accurately identify a missing relative.

From fingerprinting to DNA profiling, many techniques were standardized and discarded

until the data basing of core Short Tandem Repeats (STR) loci. Since then, millions of

profiles are generated and it is very likely that STRs will be the workhorses for the

foreseeable future (Gill et al., 2004, Muller et al., 2007).

Forensic science and the ability to solve crimes are revolutionized by DNA

typing. The invisible DNA molecule is the silent biological witness at crime scene. In

forensic casework we often have to face with samples which are not in the best of

conditions and can have highly degraded DNA. This includes burnt items, bones and

teeth. If exposed to heat and humidity, DNA molecules start breaking down to smaller

fragments. Nucleases from within the cell attack the DNA as soon as the cell dies, leading

to its degradation (El-Harouny et al., 2009). Biochemical, microbial or oxidative process

can also lead to DNA degradation. In favourable environment, microbes feed on the

2

hydrocarbons of DNA and render it highly fragmented. Bacteria are the main agents on

land while fungi are responsible for oceanic degradation (Leahy and Colwell, 1990,

Butler, 2006b).

The human genome consists of about 3 billion base pairs as estimated during the

human genome project (Lander et al., 2001, Venter et al., 2001). This includes coding

and non-coding regions. Around 2% of this DNA is responsible for coding genes for

polypeptides or RNA and is called coding region. However, the bulk of DNA, that is,

98% is called non-coding region. Moreover, it is estimated that 99.5% of human DNA is

similar between all individuals and only 0.5% varies from individual to individual. These

variations in DNA provide the base for human identification purposes (Feuk et al., 2006,

Levy et al., 2007).

The length polymorphisms include mini-satellites or Variable Number Tandem

Repeats (VNTR) and microsatellites or Short Tandem Repeats (STRs). VNTRs are the

markers containing repeat unit of 10 to 100 base pairs long that are repeated in tandem

over and over again (Butler, 2005). STRs, which are also called as Simple Sequence

Repeats (SSRs), as defined by Butler, “are accordion-like stretches of DNA containing

core repeat unit of between two and seven nucleotides in length that are tandemly

repeated from approximately a half dozen to several dozen times” (Butler, 2007).

Thousands of STRs have been identified in human DNA. According to an estimate, STR

marker occur every 10,000 nucleotides in the human genome (Butler, 2007). For human

identification in forensic science, STRs are the method of choice for DNA profiling

(Crow et al., 2000, Butler et al., 2007).

The DNA typing has played a pivotal role to establish the paternity of child which

is utmost priority for support, inheritance right and other social benefits of a child. Short

Tandem Repeats (STRs) located on X chromosome are powerful marker for complex

3

kinship testing such as deficiency paternity testing when the disputed child is a female

(Asamura et al., 2006, Gomes et al., 2007c, Butler, 2011). Now more than 40 X-STRs

have been characterized and validated for forensic use (Szibor et al., 2006). Although,

some of X-STRs like HPRTB (Hearne and Todd, 1991, Edwards et al., 1992), HumARA

(Edwards et al., 1992, Desmarais et al., 1998) and DXS981 (Mahtani and Willard, 1993),

were established early but the use of X-STRs in paternity testing was limited due to the

lack of their genetic characterization (Gomes et al., 2007c). Since X-Chromosome

recombines in females only, therefore, X-STRs show stronger Linkage Disequilibrium

compared with autosomal STRs (Hering et al., 2006) and the mutation rates of X-STRs

are not different to autosomal STRs. Therefore, it is worth-while including X-STRs with

autosomal markers in paternity testing (Szibor et al., 2006, Szibor, 2007, Machado and

Medina-Acosta, 2009, Jedrzejczyk et al., 2010).

X-STRs are routinely used in parentage analysis and relationship investigations

such as avuncular and first cousin relationships now a day. In addition to X-STRs, stable

haplotype of closely associated X-chromosome markers have proven to be a powerful

tool in kinship analysis (Szibor et al., 2005a) especially for cases when father/daughter

relationships are to be tested. It has been reported that a set of 12 X-STRs provides more

information than a set of 38 highly informative biallelic autosomal markers for cases

involving father–daughter duos (Gomes et al., 2012). X-STRs have also advantage over

autosomal STRs for paternity cases involving close blood relatives as alternative putative

fathers and in deficiency paternity cases, i.e. when the DNA sample from putative father

is not available and DNA from paternal relative has to be analyzed instead (Barbaro and

Cormaci, 2006). Further, X-linked STRs can be used to solve sibling ship status, without

using father’s DNA, of two females having the same biological father (Toni et al., 2003,

Toni et al., 2006). X-STRs can determine the relationship of grandmother/granddaughter

4

as granddaughter theoretically has to carry at least one allele in common with the

grandmother (Edelmann et al., 2004). In forensic analysis of mixed stains, X-STRs are

helpful to identify the female DNA (Szibor et al., 2000, Shin et al., 2005). X-STR

markers are low size markers and can efficiently be used for degraded DNA analysis.

Slightly degraded samples can be typed by traditional STRs, but may yield

negative results as the fragmentation increases (Holland et al., 2003, Grubwieser et al.,

2006, Parsons et al., 2007). Conventional STRs have a size range of 100-400bp most of

which consists of flanking sequences on both sides of the repeat region. To alleviate the

problems associated with analyzing DNA from degraded samples a new set of STR

primers known as Miniplexes were designed by moving the primers closer to the repeat

region leaving the extra sequences out (Butler et al., 2003, Coble and Butler, 2005).

Using shorter amplicons in polymerase chain reaction (PCR), improvement has been

reported in obtaining results from forensic evidence or a mass disaster site having

degraded specimens (Drabek et al., 2004).

Although, STRs are highly polymorphic and are used in combination to obtain a

DNA profile of an individual or a sample, yet the possibility that, any other person in the

population can have the exact same DNA profile cannot be ruled out. Therefore, it is of

paramount importance to determine the frequency and other statistical parameters for a

particular DNA profile in a particular population and that is presented in the court of law.

For this purpose, population databases are essential in forensic casework which provide

the information about what is the statistical probability of a particular profile to occur in a

population at random. Random match probability is the estimated frequency at which a

particular STR profile would be expected to occur in a population (Butler, 2005, Koehler

et al., 2009).

5

A sample size of about 100 individuals is reported to be sufficient for studying

genetic loci in a population provided that any allele with a frequency less than 1% should

not be included in forensic calculations (Chakraborty, 1992). However, it is customary to

assign a minimum allele frequency value to any such alleles, instead of exclusion. This

minimum value is usually very conservative, higher than the probable frequency of the

alleles in the actual population (Butler, 2011).

This study reports the successful optimization and testing of a multiplex system,

miniplex, which is capable of parallel amplification of 11 ChrX miniSTRs. The markers

included in multiplex have been proved as highly polymorphic. The markers included are:

DXS101, DXS6789, DXS6793, DXS7132, DXS7423, DXS7424, DXS8378, DXS9902,

GATA172D05, GATA31E08 and HPRTB as well as the gender determining locus,

Amelogenin. For forensic evaluation of a multiplex population data are crucial, therefore,

we analyzed Baluchi, KP, Punjabi and Sindhi population, using an ABI PRISM 3130

Genetic Analyzer (Applied Biosystems) with the miniplex system. Sampling was done

from all the 4 provinces of Pakistan totalling 570 individuals. A total of 5–11 alleles were

observed for each locus and altogether 79 alleles for all 11 X-STR loci. Heterozygosity in

females ranged from 0.5351 to 0.8332. No significant deviation was observed from

Hardy–Weinberg equilibrium for all 11 microsatellites. The power of discrimination

using all the 11 X-STRs was 1.28 x 10-11 in females and 2.64 x 10-7 in males.

This study also includes haplotype analysis of DXS7424-DXS101 in a sample of

149 males from Baluchi, KP and Sindhi population. In this cluster, 9 alleles for marker

DXS101 were identified while 8 alleles were identified for DXS7424. Therefore, 72

different combinations are possible but we identified only 46 different haplotypes through

genotyping.

Chapter 2

It has long been an axiom of mine that the little things are infinitely

the most important.

Sherlock Holmes - A Case of Identity

6

2. Review of Literature

2.1 DNA Polymorphisms and Short Tandem Repeats (STRs)

Human Genome consists of 23 pairs of chromosomes having 3 billion base pairs

of DNA. However, only about 5% of the total genome is coding sequences, the remaining

being non-coding as shown in figure 2.1. The non-coding region is highly repetitive and

although not very significant in expression studies but variations in this region are used

for identification purposes (Kashyap et al., 2004). There are mainly two types of DNA

polymorphisms in the human genome: Length polymorphism, which is the variation in

length of DNA at a particular locus in different individuals and sequence polymorphism

which is due to different recognition sequences in the DNA. Variation in fragment length

of DNA as a result of digestion by restriction enzyme is called RFLP (Restriction

Fragment Length Polymorphisms). The first polymorphic locus was discovered in 1980

(Wyman and White, 1980). RFLP, the method of choice in the 1980s is no longer used in

forensic casework due to its requirement for high quality DNA (Schneider, 2003).

The first breakthrough came in with the discovery of multi-locus probes (Jeffreys

et al., 1985a). The study found many dispersed, highly polymorphic, tandem-repetitive

'mini-satellite' regions. A probe based on a tandem-repeat of the core sequence could

detect many highly variable loci simultaneously. The multi-locus probe analysis has been

used to produce individual specific fingerprints of human DNA (Jeffreys et al., 1985c,

Jeffreys et al., 1985b) but they may also cross react with genomes from other species

(Jeffreys et al., 1987). Variable Number Tandem Repeats (VNTRs) typing became the

first generally accepted forensic DNA technology (Nakamura et al., 1987).

7

Figure 2.1: Polymorhpic Regions in the Human Genome (adapted from: Kashyap et al., 2004)

Human Genome

Mitochondrial Genome (16.5

kbp)

Nuclear Genome (3billion bp)

Coding Sequences (~5% of total genome)

Non-Coding Sequences (90-

95% of total genome)

Non-Repetitive Sequences (70-

75% of genome)

Flanking, leader, trailer

sequences, introns,

enhancers

Pseudogenes

Repetitive DNA (25-30% of

genome)

Intersperesed Repetitive DNA

(10-15%)

Tandem Repetitive DNA

(10%)

Tandem Repeat Genes (Coding

Sequences)

Satellite (~5%) Repeat Unit = 1-

1000s

Macrosatellite (Repeat Unit

=1000s) Cryptic, Midi

satellite

Microsatellite (Simple

Sequences/STR) 2-6bp, 50, 000

loci

Minisatellite (Repeat Unit = 100bp, 1500

loci)

8

VNTRs are also called single-locus probes. The locus D1S80 is one of the best-

known polymorphic loci, showing a variable number of tandem repeats (Kasai et al.,

1990). As interpretation of VNTRs was complicated by problems related to fragment

match criteria and population genetics, search for new markers of human identity testing

continued (Schneider, 2003).

Short Tandem Repeats (STR) is another class of length polymorphisms, which

occurs when a pattern of two or more nucleotides are repeated and the repeated sequences

are directly adjacent to each other. The position of STR on a chromosome is called an

STR locus. STR analysis was the real breakthrough in the field of forensic DNA testing.

The first reported STR markers were di-repeats (Litt and Luty, 1989, Weber and May,

1989) which can be typed using the polymerase chain reaction. Later on, tri-meric and

tetra-meric STRs markers were found to be more polymorphic (Edwards et al., 1991).

Compared to tri- tetra- and penta-repeats, the di-repeats produce more stutter bands due to

strand slippage during amplification (Hauge and Litt, 1993). Therefore, tri, tetra and

penta-repeat markers are more suitable for forensic purposes (Edwards et al., 1991).

For personal identification purposes like parentage testing, forensic identification

and medical applications, STRs are widely used. The presence of micro-variant further

increases the power of discrimination of STR (Butler et al., 1999). Discrimination value

is directly proportional to the increase in number of STRs used. The more STRs used,

lesser are the chances that an individual profile will match with any other in the

population at random (Panneerchelvam and Norazmi, 2003).

9

2.2 Techniques Used for Typing of STR Markers

The widespread use of STR as forensic markers was made possible through the

use of PCR. Later on with the advent of Capillary Electrophoresis (CE), the concept of

multiplexing was introduced. Till date, many multiplexes have been reported which are

capable of amplifying as many as 20 different set of primers in a single reaction.

Following is brief account of both PCR and CE.

2.2.1 Polymerase Chain Reaction (PCR) and Multiplexing

PCR has revolutionized molecular biology since its introduction and has become

the ideal choice for the analysis of STR (Saiki et al., 1985, Mullis et al., 1986). During

PCR, primers which are pair of synthetic oligonucleotides complementary to flanking

sequence around the region of interest, along with thermostable polymerase enzyme and

other reagents amplify particular segment of DNA into a large number of copies. The

process broadly consists of 3 main steps: denaturation of double-stranded DNA,

annealing of primers to their target and extension of the new sequence (Maniatis et al.,

1982).

Each cycle results in doubling the number of fragments and hence, exponential

increase is observed. Due to thermostable properties of Taq polymerase, isolated from

Thermus aquaticus (Chien et al., 1976, Lawyer et al., 1989), target amplification at

higher temperature has been made possible which results in high specificity, yield and

sensitivity (Saiki et al., 1988). There are many different types in which PCR can be set up

to achieve this specific goal. The hot start PCR is introduced by chemically inactivating

or physically separating the PCR components (Kebelmann-Betzing et al., 1998, Kaboev

et al., 2000, Kong et al., 2004), to suppress mis-priming artifacts and increase the specific

yield (D'Aquila et al., 1991, Chou et al., 1992). Other types of PCR include nested PCR

10

to increase the specificity (Haqqi et al., 1988), quantitative PCR to assess the number of

starting molecules, long range PCR to amplify the long DNA fragments (McDonald et

al., 2002), allele specific PCR to investigate SNPs and Asymmetric PCR to get single

strand amplification (Innis et al., 1988).

The ideal technique for forensic DNA typing is multiplex PCR because as the

number of polymorphic loci examined increases, the probability of identical alleles being

present in two different individuals decreases (Edwards and Gibbs, 1994). Multiplex PCR

is defined as parallel amplification of multiple loci of DNA in a single tube by adding two

or more than two primer pairs. The concept of multiplexing was initially reported in 1988

(Chamberlain et al., 1988). One of the most critical parameter for the Multiplex PCR is

the careful designing of primers as multiple primers have to anneal at similar temperature

without interfering with each other and no non specific binding. Primers designed should

have similar melting temperatures so that annealing events should occur at similar

temperature. The stringent initial primer selection reduces the costly process of

optimization of multiplex PCR (Schoske et al., 2003). There are some drawbacks to

multiplex PCR. For instance, multiplex PCR results in biased amplification due to

different length of PCR product as the shortest fragments show preferential amplification

in comparison to the longest ones. However, this effect can be reduced, either by

initiating PCR with the long amplicon primers and by adding the primer for the shorter

amplicon some cycles later (Bourque et al., 1993), or by using a low concentration of the

short amplicon primer. One of the critical issues of multiplexing is increase in number of

spurious fragments by increasing the number of loci to be amplified simultaneously

which is due to interaction of primers with each other or with non specific sequences.

Therefore, multiplexing capability is restricted to approximately 20 fragments by

11

traditional PCR. However, 1000-plex PCR is developed successfully by the careful

designing of primers (Wang et al., 2005).

The other ideas have been introducing to circumvent the affect of primer

interaction and to increase the multiplexing capability by, using chimeric primers (Shuber

et al., 1995), Molecular Inversion Probe (Hardenbol et al., 2003) and Golden Gate assay

(Fan et al., 2003). The approach of simultaneous analysis of STRs has been proved to be

worthwhile in minimizing labour, material and analysis time for forensic case work in

DNA testing laboratories. The traditional way of multiplexing is common for STR

analysis (Krenke et al., 2002, Schoske et al., 2003, Vallone et al., 2008).

2.2.2 Capillary Electrophoresis (CE)

The concept of capillary electrophoresis was introduced much earlier (Hjerten,

1967), but Jorgenson and Lukacs were the first to demonstrate the power of capillary

zone electrophoresis (Jorgenson and Lukacs, 1981, Jorgenson and Lukacs, 1983). Further

developments in the CE technique rendered it the method of choice for DNA typing

(Grossman and Soane, 1991, Williams et al., 1994, Butler et al., 1995).

Currently, STR measurement is based on electrophoretic technique, which

requires dye labelled primers and very careful analysis of results because of technology

artefacts (Butler et al., 2004). The system is composed of buffer solution containing a

water soluble polymer instead of rigid physical gel. Polyethylene oxide, hydroxyethyl

cellulose or linear polyacrylamide is used as polymers (Budowle et al., 1990) and is

pumped in to narrow 50-70µm diameter capillary, and replaced at the end of each sample

analysis. This polymer act as sieving medium and separation of DNA fragments occurs

due to retardation of molecules in polymer (Barron and Blanch, 1995). To reduce analysis

time, high voltage up to 15,000 volts can be used. The end of capillary has window for

12

purpose of detection by UV or laser-induced fluorescence. To detect the DNA,

fluorescent dye is attached to 5’end of PCR primer which becomes part of PCR product

during amplification. Different dyes have been used to label the PCR product in the case

of STR analysis (Butler, 2005). The length polymorphisms amplified by PCR can be

efficiently analyzed by capillary electrophoresis (Williams et al., 1994). The two

instruments have been calibrated for STR analysis which includes the ABI PRISM® 3730

Genetic Analyzer and ABI PRISM® 3100 Genetic Analyzer 16 Capillary System

(Applied Biosystems). These instruments provide a semi-automated and accurate

analytical system for STR typing (Lazaruk et al., 1998, Moretti et al., 2001).

2.3 X-Chromosomal STRs: Identity Testing and Beyond

The DNA typing has played a pivotal role to establish the paternity of child which

is utmost priority for support, inheritance right and other social benefits of a child.

Normal males possess one X chromosome and one Y chromosome, whereas females

possess two X chromosomes (Creighton, 1999, Baker et al., 2008, Butler, 2011). Due to

its unique inheritance pattern, the X chromosome is a potential candidate for forensic and

human identity testing applications (Butler, 2011). Short Tandem Repeats (STRs) located

on X chromosome are powerful markers for complex kinship testing such as deficiency

paternity testing when the disputed child is a female (Asamura et al., 2006, Gomes et al.,

2007b, Butler, 2011). The use of X-STRs for paternity testing was limited due to lacking

of genetically characterized markers (Gomes et al., 2007b). Although, some of X-STRs

like HPRTB (Hearne and Todd, 1991, Edwards et al., 1992), ARA (Edwards et al., 1992,

Desmarais et al., 1998) and DXS981 (Mahtani and Willard, 1993), were established early

but intentions to use these markers for forensic purposes developed late. As shown in

Figure 2.2, more than 40 X-STRs have been established as forensic markers. (Szibor et

13

al., 2006, Szibor, 2007, Machado and Medina-Acosta, 2009, Jedrzejczyk et al., 2010).

For investigation of avuncular and first cousin relationship as well as parentage, X-STRs

are routinely used.

Stable haplotyping of closely linked X-STRs are powerful tools in kinship

analysis (Hering et al., 2006). Positions of STR on X-Chromosome are given in Figure

2.2. X-STR show stronger Linkage Disequilibrium (LD) compared with autosomal STR

since X-chromosome recombines only in females and the mutation rates of X-STRs are

not different to that of autosomal STRs (Hering et al., 2006). Majority of X-STRs can be

used routinely and there are no peculiarities in term of their usage except HumARA, one

of the established STR markers. It is recommended recently that HumARA should not be

used as forensic marker (Szibor et al., 2005b) due to its association with bulbar muscular

atrophy (La Spada et al., 1991).

Inclusion of X-STRs with autosomal markers in cases when father/daughter

relationships are to be tested is worthwhile. X-STR haplotyping can be of particular help

in cases where a DNA sample from one of the parents is not available for testing. For

example, if a father/daughter parentage relationship is in question, X-STRs may be

helpful (Silveira et al., 2007, Reid et al., 2008, Aquino et al., 2009, Butler, 2009). X-

STRs have advantage over autosomal STRs for paternity cases involving close blood

relatives as alternative putative fathers.

The major advantage of X-STRs is proven in cases when the DNA sample from

putative father is not available and DNA from paternal relative has to be analyzed instead

(Barbaro and Cormaci, 2006). Further, X-linked STRs can be used to solve sibling ship

status, without father’s DNA, of two females having the same biological father (Toni et

al., 2003, Toni et al., 2006).

14

Figure 2.2: Localization of Short Tandem Repeats (STRs) on X Chromosome (from: Szibor et al., 2006)

15

X-STRs can determine the relationship of grandmother/granddaughter as

granddaughter theoretically has to carry at least one allele in common with the

grandmother (Edelmann et al., 2004).

Usually ChrX markers are less powerful in stain analyses than autosomal markers

and are not suitable for use in testing male traces where there is female contamination.

But these are more powerful than autosomal markers in identification of female traces in

male contamination (Szibor et al., 2003). In forensic analysis of mixed stains, X-STRs are

helpful to identify the female DNA (Szibor et al., 2000, Shin et al., 2005).

There are several measures like variation in allele size and numbers,

heterozygosity, genetic variability and discontinuous allele distribution, which are useful

in population genetics. These X-STRs may be powerful in the study of population

genetics and can be used to detect changes due to mutations or genetic drift among

populations (Calafell et al., 1998). Being lineage markers, these can also be used as

ancestry informative markers to deduce individual ancestry information from admixture

populations (Schaffner, 2004). The X chromosome, along with Y-chromosome and

mitochondrial DNA, has tremendous potential for the reconstruction of phylogenetic trees

to unravel the complexities of the history of populations (Santos-Lopes et al., 2007, Pinto

et al., 2011).

Similar to autosomal STRs, different multiplex systems have been developed to

use the X-STRs efficiently for paternity testing. The first multiplex study with respect to

X-STRs analysed nine loci in three different multiplexes which include duplex PCR

(DXS6789 and DXS6795), Triplex PCR (DXS7133, DXS9895 and DXS9898) and

Quadruplex PCR (DXS6803, DXS8378, GATA164A09 and DXS7132) (Son et al.,

2002). In very short period of time, other studies came up with parallel amplification of

16

three (Wiegand et al., 2003), four (Son et al., 2002, Lee et al., 2004), five (Zarrabeitia et

al., 2002, Poetsch et al., 2005), six (Robino et al., 2006), seven (Bini et al., 2005), eight

(Nakamura and Minaguchi, 2010), ten (Gomes et al., 2007b), eleven (Ribeiro Rodrigues

et al., 2008), twelve (Turrina et al., 2007), thirteen (Tariq et al., 2008, Hwa et al., 2009)

and fifteen (Liu et al., 2012) X-STR markers. Multiplex with greater number of markers

are being developed to obtain a high degree of discrimination.

Commercially, there has been little focus on developing multiplex kits for ChrX

applications. The first such effort is Mentype Argus X-UL PCR amplification kit by

Biotype AG. Four ChrX markers, HPRTB, DXS7132, DXS7423 and DXS8378 as well as

gender determining loucs Amelogenin, are included in this kit. One marker per linkage

group can be analyzed using this kit (Castella et al., 2006, Gehrig and Teyssier, 2006).

In April 2005, Biotype introduced another kit called Mentype® Argus X-8 PCR

Amplification Kit. This kit can analyze eight markers DXS8378, DXS10135, DXS7132,

DXS10134, HPRTB, DXS10101, DXS7423 and DXS10074 as well as Amelogenin for

gender determination. Two markers per linkage group have been chosen [group 1 (Xp22):

DXS8378/DXS10135, group 2 (Xq11): DXS7132/DXS10134, group 3 (Xq26):

HPRTB/DXS10101, group 4 (Xq28): DXS7423/DXS10074]. Thus, two markers of each

group have to be handled as haplotype for genotyping. The primers are labelled with 6-

FAM or HEX (Branicki et al., 2008).

Qiagen (Qiagen, Hilden, Germany), in collaboration with Biotype, developed

Investigator Argus X-12 PCR Amplification Kit which amplifies simultaneously the 12

markers DXS8378, DXS10135, DXS10148, DXS7132, DXS10134, DXS10079, HPRTB,

DXS10101, DXS10103, DXS7423, DXS10074 and DXS10146 as well as Amelogenin

for gender determination. Three markers per linkage group

17

have been chosen:

Group 1 (Xp22): DXS8378/DXS10135/DXS10148,

Group 2 (Xq11): DXS7132/DXS10134/DXS10079,

Group 3 (Xq26): HPRTB/DXS10101/SXS10103,

Group 4 (Xq28): DXS7423/DXS100749/DXS10146].

Thus, three markers of each group have to be handled as haplotype for genotyping

(Edelmann et al., 2012).

2.4 Evolution of MiniSTRs

In forensic casework we often have to face with DNA which is not in ideal

condition (El-Harouny et al., 2009). When DNA molecules are exposed to water and/or

heat, they start breaking down into smaller fragments. Different biochemical, microbial

and oxidative processes are responsible for this degradation (Leahy and Colwell, 1990,

Butler, 2006b). The commercial kits based on STR do not fare well with these

compromised samples. Hence, to improve the results obtained from such samples, we

need to look beyond the commercial kits (Butler, 2011).

One way is to use smaller PCR products, the so called miniSTRs (Wiegand and

Kleiber, 2001, Graham, 2005). As shown in Figure2.3, this is done by moving the primers

closer to the repeat region of the STR. It has been proved that short amplicons perfrom

better with degraded samples and can recover information when larger STR amplicons

produce partial or no profiles (Butler et al., 2003).

18

Figure 2.3 (from Butler, 2009): (a) MiniSTRs created by designing PCR primers (Butler et al., 2003) that anneal closer to the repat region than conventional STR kit primers. (b) PCR product sizes, such as demonstrated here with D16S539, can be reduced by 150bp relative to conventional tests.

As miniSTRs and commercial STRs have primers in different locations, there is a

possibility of mutation in the flanking sequences on both sides of the repeat region. And

as a result the PCR products generated could be of different sizes due to allele dropout

and size shifts. As shown in Figure 2.4, an apparent discordance would result due to this

(Drabek et al., 2004).

Figure 2.4: Discordance between PCR Product of MiniSTR primers and commercial Kit primers (as illustrated by: Butler, 2012)

19

In the criminal justice system, it is not uncommon that due to faulty evidence, a

person is indicted and sent to jail but years later proved to be innocent by using DNA

typing (Coovadia, 2008). Since, evidence is not always kept in a very ‘DNA-friendly’

environment; degradation is one of the most common hindrances in such cases.

MiniSTRs have the ability to extract reliable profiles from old samples (Evison et al.,

1997) and hence may be used to set the innocent free. But we need many loci for this kind

of work. Miniaturization of X-STRs used with autosomal STRs can increase the chances

of getting a complete profile from these kind of samples (Jobling and Gill, 2004).

Another avenue where miniSTRs are potential candidates for use is the study of

ancient samples (Eliasova et al., 2010) such as those found inside and around the

Pyramids of Giza, Egypt or the leftovers of Mayan civilization in Latin America. Since

the DNA found in sites like these is expected to be highly degraded, only short templates

DNA of up to 300bp is present (Haack et al., 2000) making it impossible for any

meaningful analysis with traditional STRs. These ‘minis’ can prove essential where other

anthropological methods cannot be applied for sex typing, phylogenies (Eliasova et al.,

2010) and to explore the gender differences in the past populations (Faerman et al., 1995,

Cipollaro et al., 1998). Although SNPs are potential candidates for use with degraded

samples, problems like low polymorphism and difficulty in mixture interpretation renders

their widespread use, outside the research setting, impractical (Butler et al., 2007).

The timeline below (Figure 2.5) shows the progress of forensic markers VNTRs,

STRs, and ultimately miniSTRs of autosomal and X-Chromosomes over the years.

20

Figure 2.5: Evolution of miniSTRs – a Timeline

PCR = Polymerase Chain Reaction, VNTR = Variable Number Tandem Repeats, STR = Short Tandem Repeats, WTC KADAP = World Trade Centre Kinship and DNA Analysis Panel, CODIS = COmbined DNA Indexing System, EDNAP = European DNA Profiling group, ChrX = X-Chromosome

DNA Sex Test (Sullivan

et al., 1993)

First mass disaster

case (Waco,

Texas) (Lygo et

al., 1994)

(Whitaker et al.,

1995) (Clayton et

al., 1995)

Fluorescent STR

work reported

(Edwards et al.,

1991) (Caskey and

Hammond, 1992)

Alec Jeffery discovers

VNTR (Jeffreys et

al., 1985a, Jeffreys et

al., 1985b)

1ST PCR (Saiki et al., 1985)

1985 1991 1993 1995 2001 2003 2004 2005 2006 2009 2010

DNA Databases

starts (McEwen,

1995)

New STR

primers for

CODIS loci

(Butler et al.,

2003) "MiniSTR"

coined

(Holland et

al., 2003)

Typing of

Telogen Hair

(Hellmann et

al., 2001)

WTC

KADAP

(Marchi,

2004)

“Genderplex” (Esteve

Codina et al., 2009)

8-plex and 10-plex

for ChrX miniSTR

(Diegoli and Coble,

2011)

EDNAP finds

miniSTR better

than STR and SNP

on degraded DNA

(Dixon et al., 2006)

ChrX

miniplexes

developed

(Asamura et al.,

2006)

Concordance studies

(Drabek et al., 2004)

STR miniplex for

degraded template

(Chung et al.,

2004) (Muller et

al., 2007)

Chromosome

X Sequence

(Ross et al.,

2005)

21

2.5 Evidence Items Benefitting from MiniSTR Analysis

DNA fragment sizes for the products amplified thorugh miniSTRs are much

smaller as compared with those amplified with traditional STRs. Degraded or inhibited

samples can benefit greatly from miniSTRs analysis. Degraded samples typically include

teeth, bones, items exposed to humidity or heat, burnt items, etc. The DNA in these

samples can be broken down and highly fragmented, therefore, traditional STR analysis

works poorly with these and sometimes even yield negative results (Holland et al., 2003,

Hill et al., 2007, Parsons et al., 2007).

On the other hand samples containing DNA, but also having some substance that

inactivates or slows down the PCR, are called inhibited samples. Inhibitors can be of

different types including but not limited to: heme from blood samples, humic acid present

in soil, melanin in hair or skin samples, certain dyes (such as the indigo dye in denim

fabrics) and tannins from leather (Butler, 2006a, Wiegand et al., 2006).

MiniSTR kits are very sensitive and work best with DNA concentration of 0.25-

0.5ng. These STRs have been shown to work successfully even with even less than this

amount of DNA. This sensitivity allows for obtaining results samples like the items that

have been handled by an individual called “touch” DNA items, and hairs which have very

little root tissue left (Andrade et al., 2008).

2.6 Miniaturization of ChrX STRs: Problems and Prospects

Research on ChrX miniSTRs is very slow as compared to autosomal miniSTRs.

As reported by some groups, amplicon size of miniSTR is below 200bp, above which

chances of getting complete profile from a degraded sample diminish significantly

(Muller et al., 2007). Others have suggested the optimum product size as below 150bp

(Dixon et al., 2006). Only a handful of studies are available on ChrX miniSTRs and most

22

of these are in combination with other larger amplicon size markers hence making it

unsuitable for use in degraded sample studies and casework.

The first mini-only multiplex for ChrX STRs was reported by Asamura and

colleagues. Designing new primers, they devised two 4-plexes consisting of X-STR loci,

DXS7424, DXS101, DXS7423, DXS6789, DXS8378, DXS7133, GATA165B12 and

GATA31E08. The miniplex strategy was shown to be effective for the analysis of

degraded DNA based on the results of tests with these multiplexes. It was concluded that

for personal identification, these multiplex systems offered high effectiveness with

degraded DNA samples (Asamura et al., 2006). But since samples from a mass disaster

site or from an environmentally exposed crime scene are not only highly degraded but

also in very scarce quantity, thus making it difficult for the scientist to perform multiple

PCR analyses (Butler, 2009). In 2009, a “Genderplex” was reported. This included a 4-

plex for ChrX miniSTRs along with 2 gender determining amelogenin loci and an SRY

locus (Esteve Codina et al., 2009). In 2011, two new miniplexes, an 8-plex and 10-plex

were reported which can amplify a total of 15 ChrX-STRs, DXS7132, DXS6803,

HPRTB, DXS6789, DXS7130, DXS9902, DXS101, DXS6795, DXS10147, DXS8378,

DXS7424, DXS7423, GATA165B12, GATA172D05 and GATA31E08 (Diegoli and

Coble, 2011).

As shown in Table 2.1, there has been substantial progress with shortening the

length of many ChrX STRs amplicons during the last decade. But there are significant

problems which hinders the process of miniaturization. Foremost among these is that due

to size limitation, not many STRs can be fit together in a single multiplex and most of the

time only one marker per dye is run to avoid overlap. This approach makes a maximum

of 4 miniSTRs in a single multiplex using the 5-dye chemistry (Butler, 2005).

23

Table 2.1: X Chromosomal miniSTRs Marker Size Range (in basepairs) Reference

DXS101 142–169 (Asamura et al., 2006)

DXS6789 122–162 (Asamura et al., 2006)

DXS7423 99–115 (Asamura et al., 2006)

GATA31E08 101–133 (Asamura et al., 2006)

DXS8378 95–111 (Asamura et al., 2006)

DXS7424 79–100 (Asamura et al., 2006)

GATA165B12 90–110 (Asamura et al., 2006)

DXS6795 90–111 (Son et al., 2002)

DXS10147 165–185 (Edelmann et al., 2008)

DXS7424 147-174 (Edelmann et al., 2001)

DXS7133 106-130 (Edelmann et al., 2002b)

DXS9895 139-161 (Edelmann et al., 2002b)

GATA172D05 108-136 (Edelmann et al., 2002b)

DXS10160 135 – 193 (Hering et al., 2008)

DXS10159 154–190 (Edelmann et al., 2010)

DXS10162 150–186 (Edelmann et al., 2010)

DXS10163 121–171 (Edelmann et al., 2010)

DXS10165 145–173 (Edelmann et al., 2010)

DXS9902 162-186 (Diegoli and Coble, 2011)

DXS7132 131–155 (Edelmann et al., 2002b)

DXS6803 109-128 (Edelmann and Szibor, 2003)

DXS7130 109-128 (Edelmann and Szibor, 2003)

HPRTB 144-176 (Szibor et al., 2000)

DXS6801 113-137 (Edelmann and Szibor, 2005)

DXS10103 160-200 (Bekada et al., 2010)

Amelogenin X=80, Y=83 (Haas-Rochholz and Weiler, 1997)

Amelogenin X=106, Y=112 (Haas-Rochholz and Weiler, 1997)

Amelogenin X=55, Y=58 (Esteve Codina et al., 2009)

24

Secondly, most human identity testing applications requires a high degree of

polymorphism and hence, a higher allele frequency distributions. In multiplexes, these

markers consume precious electrophoretic space. However, in some parentage testing

situations, less polymorphic loci having lower mutation rates are more useful. For

example, mutation rates are not significant when evidence is compared directly with a

suspect (Butler, 2006a). But in parentage and kinship testing where comparison between

relatives is done, as for identification of mass disaster victims, mutational events become

important (Leclair et al., 2004). STRs having long homogenous repeat structures are

prone to instability such as DXS8377 and DXS10011 (Szibor, 2007) and are, therefore,

not suitable for use (Hering et al., 2004). Hence, STR loci possessing a smaller size range

and sufficiently polymorphic are being characterized (Coble and Butler, 2005).

Thirdly, the locations of miniSTR primers and conventional STR primers are in

different locations, the flanking regions on both sides of the repeat regions may have

insertion or deletion. Discordant results may be produced in amplicons generated form

one of the primer sets due to size shifts and allele drop out (Boutrand et al., 2001, Rolf et

al., 2011).

Fourthly, there are analytical issues attached to the miniaturization approach.

When the size is reduced to less than 150bp, the biggest problem comes from dye blobs.

These may be left over from the oligonucleotide synthesis process or may result from

dyes falling off the primer during the heating and cooling steps of PCR (Edelmann et al.,

2010).

Lastly, smaller PCR products have a problem in the manner in which sizing is

performed using an internal lane standard. Local Southern, the default method used by the

GeneScan software, which requires two peaks from the internal size standard to be

25

present on either side of a peak being defined. Hence, both 35 and 50bp peaks from the

GS500 Liz size standard must be defined for measuring any allele below 75 bp (Butler et

al., 2003). Due to increased concentration of primer dimers, sizing peaks may become

ambiguous. Therefore, it is recommended that Global Southern, where a regression line

generated from all sizing peaks is used to fit the STR alleles, is the method of choice for

small size amplicons (Hartzell et al., 2003).

Chapter 3

You know my method. It is founded upon the observation of trifles.

Sherlock Holmes - The Bascombe Valley Mystery

26

3. Materials and Methods

3.1 Blood Samples Collection

Blood samples were collected from 570 unrelated informed male and female

volunteers from the four provinces of Pakistan (Figure 3.1). According to Population

Census Organization of Pakistan (www.census.gov.pk), the Punjab has the largest

population with 56.2 percent, followed by Khyber-Pakhtunkhwa (KP) with 25.4 percent,

Sindh 13.4 percent and Baluchistan with 5.0 percent being the least populated province

(according to Population Census of 1998) . From each individual, 3mL of blood was

drawn. The drawn blood was mixed with 70µL of 0.5 M EDTA on the spot in a 15 mL

culture tube and shifted to CEMB Forensic DNA Laboratory. Of the collected blood,

700µl –800µL aliquots were made in 1.5 mL centrifuge tubes and properly labelled.

These 2 to 3 aliquots per sample were then frozen at -70°C as backup and to use in future.

Figure 3.1: Sampling Area and Number of samples collected

27

3.2 DNA Extraction from Blood Samples

Phenol chloroform DNA extraction was performed on each of the blood

sample as follows (Maniatis et al., 1982, Signer et al., 1988).

1. From the frozen blood, 300µL was taken out and mixed with 800µL of 20mM

Tris-HCl in a 1.5mL centrifuge tube. Centrifuge tube was tilt-mixed and left for

10 minutes. After this, the aliquot was centrifuged at 14,000 rpm for 1 min. About

1mL of supernatant was pipetted out into another tube and saved as a

precautionary measure, leaving behind a small pallet (400µL -450µL).

2. Pellet was resuspended in 1mL of 20mM Tris-HCl and repeated the same

protocol. The supernatant (about 1200µL) was removed and taken into another

tube, without disturbing the pellet. In this way three washings were made.

3. After the third washing, the pellet was mixed with following:

a) Pelleted WBCs = 25-50μL

b) 0.2 M sodium acetate (pH =5.2) = 375μL

c) 10% SDS = 50μL

d) Proteinase-K (1U/μl; 40 mg/ml) = 10μL

4. That mixture was incubated at 37°C overnight.

5. The next day, the digested cell suspension was treated as:

a) Digested cell suspension = 400-450μL

b) Buffered phenol (pH =8) = 120μL

6. Mixture was gently mixed for 30 seconds and centrifuged at 14000 rpm for 5 min.

7. Aqueous phase was carefully pipetted out with wide mouth tip into a separate

centrifuge tube.

28

8. The separated supernatant was treated as:

a) Aqueous phase from the last step = 400-450μL

b) PCI (Phenol:Chloroform:Iso-amylalcohol

25:24:1)

= 120μL

9. Mixture was gently mixed for 30 seconds and centrifuged at 14000 rpm for 5min.

10. Again the aqueous phase was pipetted out into a separate centrifuge tube.

11. Supernatant was treated as:

a) Upper separated supernatant = 400μL

b) Ice-chilled 95% Ethanol = 1000μL

12. Solution was gently tilt-mixed for 10min and centrifuged at 14000rpm for 10 min.

13. The supernatant was discarded carefully without disturbing the pellet.

14. To the pellet 70% ethanol solution was added, tilt mixed and centrifuged at

14000rpm for 1min.

15. The supernatant was discarded without disturbing the pellet.

16. Then pellet was dissolved in 180μL of TE (pH-8.0) and incubated at 56°C until

DNA pellet was completely dissolved.

17. Following was then added to it:

a) 2M sodium acetate = 20μL

b) Chilled 90% ethanol = 500μL

18. The solution was tilt mixed for 30s and left at room temperature for 15min.

19. Solution was centrifuged at 14,000rpm for 10min and supernatant was decanted

carefully.

29

20. To the pellet, 500µL of 70% ethanol solution was added and centrifuged at 14,000

rpm for 1 min. Supernatant was again decanted.

21. Rim of the centrifuge tube was blot dried and then air dried for 2-3 min.

22. To the dried pellet was added 100μL TE (pH 8.0) and incubated at 56°C for 1hr.

23. Then the prepared DNA solution was stored at 4°C.

3.3 DNA Quantitation

DNA concentration was estimated through spectrophotometery using NanoDrop

and the integrity of DNA was confirmed by gel electrophoresis with 25ng/μL standard

DNA.

3.3.1 Spectrophotometery

NanoDrop ND-1000 (NanoDrop Technologies, Wilmington, DE) was used to

quantify DNA (Desjardins and Conklin, 2001). The following values were measured:

a) 260/280

This value is used to assess purity of DNA. A ratio of ~1.8 was accepted as pure.

b) 260/230

This value is for assessment of DNA purity. A ratio of ~1.8 is considered as pure.

c) ng/μL

This is the final concentration of the sample which is based on absorbance at

260nm and is calculated using Beer’s Law.

30

3.3.2 Gel Electrophoresis

1. For quantification of genomic DNA, 0.8% agrose gel was prepared while for

degraded DNA and PCD product, 2% agarose gel was used.

2. Agarose gel was run in the tank (Hoefer Scientific Inc.) and was completed at

6V/cm and 47mA after 1hour.

3. Gel was visualized and photographed in UV Transilluminator (UMP).

4. The gel image was analyzed using Image J software (Abràmoff et al., 2004).

3.4 Multiplex Design Strategy

The multiplex design strategy was followed as recommended in previous literature

(Schoske et al., 2003). The steps taken are explained briefly.

3.4.1 Selection of Loci, Primer Design and Labelling

The X-STRs characterized in major world populations and highly polymorphic

markers were selected. The markers are given along their mapping information in table

3.1 (Szibor et al., 2006).

There should not be complementary sequences between primers to be used in

multiplex as it is important to prevent the primer dimer formation. All the primer pairs

were compared to one another using Autodimer program to assess the possibility of

primer dimerization in multiplex PCR (Vallone and Butler, 2004).

31

Table 3.1: Markers along with their linkage groups and mapping information

Cytogenetic localization

Marker Linkage group

Physical localisation Rutgers Map v.2 [Mb] [cM Kosambi]

NCBI 36 p 22.31 DXS8378 X1 9.330 20.21 p 22.2 DXS9902 15.234 32.32 Centromere DXS7132 X2 64.572 90.75 q 21.1 DXS6793 80.638 q 21.33 DXS6789 95.336 108.47 q 22.1 DXS7424 100.505 115.25 q 22.1 DXS101 101.300 116.15 q 23 GATA172D05 113.061 124.36 q 26.2 HPRTB X3 133.443 149.66 q 27.1 GATA31E08 140.062 160.54 q 28 DXS7423 X4 149.460 184.19 Amelogenin

Primer for DXS7132 was designed, by using Primer 3 (Rozen and Skaletsky,

2000). Primer sequences for DXS8378, DXS6789, DXS7424, DXS101, GATA31E08 and

DXS7423 as well as DXS7133 which was used in the haplotyping 4-plex were the same

as previously reported by Asamura and colleagues (Asamura et al., 2006) while primer

sequences of GATA172D05, DXS9902 were taken from Ensemble Genome Database

(Edelmann et al., 2001, Edelmann et al., 2002b, Flicek et al., 2012). Primer sequences for

HPRTB and DXS6793 were the same as used by Tariq and colleagues (Tariq et al.,

2008). Primer sequences for the gender determining locus, amelogenin, were used as

reported previously (Haas-Rochholz and Weiler, 1997). The arrangement and labelling of

primers with different dyes was done in a way to ensure maximum number of markers in

the multiplex as well as providing space to each marker in the eletrophoretic system. The

sequences of primers along with their dye labelling and references are shown in table 3.2.

32

Table 3.2: Genotyping Primers Sequences and Dye Labelling Marker Dye Sequence Reference

Amelogenin FAM F CCCTTTGAAGTGGTACCAGAGCA (Haas-Rochholz and Weiler, 1997)

R GCATGCCTAATATTTTCAGGGAATA

DXS101 NED F TCTCCCTTCAAAAACAAAGATAA (Asamura et al., 2006) R GTGCATATTCTGCGCATGT

DXS6789 VIC F CCTCGTGATCATGTAAGTTGG (Asamura et al., 2006) R GCAGAACCAATAGGAGATAGATGGT

DXS6793 VIC F ACACACGTGGTTTAGACCGT (Tariq et al., 2008) R CCAGAGCTACGGGAATATGA

DXS7132 FAM F ATAAATCCCCTCTCATCTATCTGAC This Study R ACTCCTGGTGCCAAACTCTA

DXS7423 FAM F AGATTTCCTCCCCATCCATC (Asamura et al., 2006) R GTTGTCACACAAATAAATGAATGAGT

DXS7424 NED F AAAACAGGAAGACCCCATC (Asamura et al., 2006) R GGCTAAGAAGAATCCCGCACA

DXS8378 VIC F GCTCCTGGCAGGTCACTATC (Asamura et al., 2006) R GCGACAAGAGCGAAACTCCA

DXS9902 PET F TGGAGTCTCTGGGTGAAGAG (Edelmann et al., 2001) R CAGGAGTATGGGATCACCAG

GATA172D05 NED F TAGTGGTGATGGTTGCACAG (Edelmann et al., 2002b) R ATAATTGAAAGCCCGGATTC

GATA31E08 PET F CAGAGCTGGTGATGATAGATGA (Asamura et al., 2006) R GCTCACTTTTATGTGTGTATGTATCTCC

HPRTB FAM F GTCTCTATTTCCATCTCTGTCTCC (Tariq et al., 2008) R TTCTTTCTCTCACCCCTGTCT

3.4.2 Size Determination

PCR fragment size range was determined for each marker. All the alleles found

for each particular marker were considered. For example, in-silico PCR was used to know

fragment size of newly designed primer set for marker DXS7132 (Fujita et al., 2011).

The fragment size for DXS7132 was 140bp for allele 14 with repeat motif (TCTA)x. The

allele range for this marker was 11-17 including all major populations. So, size range of

PCR product determined was 128-152bp. All primer sets specificity were checked by

using Human BLAT (Kent, 2002) ensuring that these only bind to the unique given

location on the human genome.

33

3.4.3 Multiplex Schematics

Multiplex layout schematics were prepared for both 7-plex and 12-plex as shown

in figure 3.2a and 3.2c, respectively and also for the 4-plex which was used for haplotype

analysis of cluster DXS7424-DXS101 and is shown in figure 3.2b. The size ranges of

markers were represented in schematic to check the possible areas of overlap. All the

fragment sizes of markers were set apart in same fluorescent dye to reduce the difficulty

of assigning the observed allele to its correct locus in any case of discovery of new allele.

The primers for some markers as mentioned above were selected or designed by keeping

the fragment length fix as required in multiplex Schematic.

Figure 3.2a: Schematic of PCR product sizes produced with known allele size ranges for the loci in 7-

plex. Yellow colour represents NED, blue for FAM, green for VIC, red for PET while LIZ Size

Standard is shown in orange.

Figure 3.2b: Schematics of PCR product sizes produced with known allele size ranges for the loci in 4-plex. Yellow colour represents NED, blue for FAM, green for VIC and red for PET dye. A= Amelogenin

34

Figure 3.2c: Schematics of PCR product sizes produced with known allele size ranges for the loci in 12-plex. Yellow colour represents NED, blue for FAM, green for VIC and red for PET dye. A= Amelogenin

3.4.4 Primer Concentration

Primer binding and amplification ability in a multiplex reaction is governed by

different factors and a slight variation in concentration of one primer set can affect the

amplification of one or more other primers. The difference in the DNA sequence for each

locus is also responsible for the variable efficiency of primer binding. Thus, for obtaining

the best ratio that would yield good signal intensities and balanced peak height for each

multiplex set, adjusting primer concentrations become necessary. The primer

concentrations were optimized to efficiently amplify DNA samples with low quantities.

3.5 Polymerase Chain Reaction (PCR)

PCR was carried out in ABI2700 Thermal Cycler (Applied Biosystems (ABI),

Foster City, CA), with AmpliTaq Gold polymerase (ABI), quantified genomic DNA and

labelled primers.

35

3.5.1 PCR Setup

PCR master mixes were made for 7-plex, 12-plex and 4-plex as shown in tables

3.3a, 3.3b and 3.3c, respectively.

Table 3.3a: Primers concentration and ingredients for 7-plex master mix Marker (10pm/μL) Forward (μL) Reverse (μL) x50 (μL) DXS101 0.3 0.3 15.0 DXS6789 0.12 0.12 6.0 DXS7132 0.1 0.1 5.0 DXS7423 0.09 0.09 4.5 DXS8378 0.05 0.05 2.5 GATA172D05 0.07 0.07 3.5 GATA31E08 0.09 0.09 4.5 Taq. Polymerase (2U/μL) 0.5 25.0 10X Buffer (NH4SO2) 1.0 50.0 MgCl2 (25mM) 0.6 30.0 dNTPs (25mM) 0.08 4.0 H2O (Injection Water) 10.18 509.0 Total 14.0μL 700.0

Table 3.3b: Primers concentration and ingredients for 12-plex master mix

Marker (10pm/μL) Forward (μL) Reverse (μL) x50 (μL) Amelogenin 0.05 0.05 2.5 DXS101 0.35 0.35 17.5 DXS6789 0.09 0.09 4.5 DXS6793 0.12 0.12 6.0 DXS7132 0.09 0.09 4.5 DXS7423 0.09 0.09 4.5 DXS7424 0.15 0.15 7.5 DXS8378 0.09 0.09 4.5 DXS9902 0.14 0.14 7.0 GATA172D05 0.35 0.35 17.5 GATA31E08 0.13 0.13 6.5 HPRTB 0.20 0.20 10.0 Taq. Polymerase (2U/μL) 0.5 25.0 10X Buffer (NH4SO2) 1.0 50.0 MgCl2 (25mM) 0.8 40.0 dNTPs (25mM) 0.08 4.0 H2O (Injection Water) 2.92 146.0 Total 9.0μL 450.0

36

Table 3.3c: Primers concentration and ingredients for 4-plex master mix Marker (10pm/μL) Forward (μL) Reverse (μL) x50 (μL) Amelogenin 0.05 0.05 2.5 DXS101 0.2 0.2 10.0 DXS7133 0.12 0.12 6.0 DXS7424 0.1 0.1 5.0 Taq. Polymerase (2U/μL) 0.5 25.0 10X Buffer (NH4SO2) 1.0 50.0 MgCl2 (25mM) 0.8 40.0 dNTPs (25mM) 0.08 4.0 H2O (Injection Water) 5.68 284.0 Total 9.0μL 450.0

3.5.2 Thermal Cycling Parameters

The Microsatellite markers (STRs) were amplified by Polymerase chain reaction

(PCR) on ABI GeneAmp® PCR System 2700. The PCR cycling conditions for both 7-

and 12-plex are given in figure 3.3a and 3.3b respectively. For 4-plex, the cycling

conditions used were the same as shown in 3.3b with a slight modification that is 30

cycles were done instead of 33 cycles.

Fig 3.3a: PCR cycle for 7-plex

30 Cycles

72°C

1min

60°C

45min

4°C

95°C

10min

95°C

1min

56°C

1min

2holds 1hold

37

Fig 3.3b: PCR cycle for 12-plex

3.5.3 Cycle Number

One way to improve the sensitivity of any PCR based method is to increase the

number of amplification cycles especially when amount of DNA template is very low. To

optimize the best cycle number for the multiplex, studies were carried out with different

cycle number.

3.5.4 Peak Balance

When the amount of template added to the PCR reaction is extremely low, there

may be stochastic effects which results in the imbalance of heterozygote alleles. Different

concentrations were used to obtain balanced peaks.

3.5.5 Sensitivity Studies

Sensitivity studies of the multiplex were performed on DNA sample using serial

dilution i.e. 5ng, 2.5ng, 1.0ng, 0.5ng and 0.250ng to evaluate minimum quantity required

to obtain the full DNA profile. The GeneScan Analysis threshold for these studies was set

at 100 RFU (relative fluorescence units).

33 Cycles

72°C

1min

66°C

60min

4°C

95°C

11min

94°C

1min

59°C

1min

2holds 1hold

38

3.5.6 Nonhuman DNA and Specificity Studies

Different variety of equine and cattle DNA was amplified to ensure that the

multiplex demonstrates specificity for humans.

3.5.7 Degraded DNA Studies

Studies were performed on artificially degraded DNA. As previously reported

(Asamura et al., 2006), miniSTRs are effective profiling degraded samples which are

otherwise difficult to genotype correctly through conventional STR kits.

3.5.7.1 Enzymatic Degradation

DNase I enzyme was used for digestion of genomic DNA. 2U of DNase (Turbo -

Ambion, Austin) was used to digest 1.2μg of human genomic DNA (100 ng/μL) at 37°C

in 120μL final volume. Aliquots of 10μL were removed predefined intervals (0, 0.5, 1, 2,

5, 10, 30 and 60 minutes). The enzyme was quenched by adding the aliquots to 490μL

preheated (TLE + 20 μg/ml glycogen) buffer at 95°C. DNase I was inactivated by heating

the aliquot at 95°C for 10 min (Grubwieser et al., 2006). The degraded DNA was run on

2% agarose gel stained with ethidium bromide along DNA ladder to assess the extent of

degradation.

3.5.7.2 Mechanical Degradation

DNA was degraded mechanically through an ultrasonic homogenizer Model

300VT (Biologics Inc., VA, USA). 1 mL of DNA solution (0.5 mg/mL) was sonicated on

ice at 180W in intervals of 30 seconds for up to 5 min. The degradation status was

determined empirically as ultrasound cannot be defined in physical units. Aliquots of 4µL

were taken from the sonicated solution after every 30s. Agarose gel of 2%, stained with

ethidium bromide was used for DNA electrophoresis (Bender et al., 2004).

39

3.5.8 Gel Extraction Protocol

To denature enzymes and dissolve agarose gel, chaotropic salt is used. The spin

column has glass fibre matrix to which the DNA fragments in the chaotropic salt bind

(Vogelstein and Gillespie, 1979). The degraded DNA fragments were extracted and

purified from the gel using Gel DNA Fragments Extraction Kit (Geneaid Biotech, Bade

City, Taiwan), as described below. Absolute ethanol was added to the Wash Buffer before

use.

3.5.8.1 Gel Dissociation

The agarose gel slice with the required DNA fragment was excised. To minimize

the gel size, any extra agarose was removed.

1. Gel slice of about 300 mg was transferred to an Eppendorf tube.

2. To this sample, DF Buffer 500 µL was added and mixed by vortex.

3. To ensure that gel slice has been completely dissolved, it was incubated at 55-

60ºC for 10-15 minutes. Every 2-3 minutes, the tube was inverted to ensure

uniform heating. After this, the sample was cooled at room temperature.

3.5.8.2 DNA Binding

1. In a 2mL Collection Tube the DF Column was placed.

2. From the previous step, 800 µL sample mixture was added to the DF Column and

centrifuged for 30 seconds at 14,000 rpm.

3. In the Collection Tube, The DF Column was placed back after discarding the

flow-through.

40

3.5.8.3 Wash

1. W1 Buffer (400 µL) was added to the DF Column and centrifuged for 30 seconds

at 14,000 rpm.

2. After discarding the flow-through, The DF Column was placed back in the

Collection Tube.

3. Eathonol-added Wash Buffer (600 µL) was added into the DF Column and after 1

minute, centrifuged it for 30 seconds at 14,000 rpm.

4. In the Collection Tube, the DF Column was placed back after discarding the flow-

through.

5. To dry the column matrix, it was centrifuged for 3 minutes at 14,000 rpm.

3.5.8.4 DNA Elution

1. DF Column was transferred after drying to a new 1.5mL Eppendorf tube.

2. Into the centre of the column matrix, 20µL was added.

3. It was let stand to ensure the Elution Buffer absorbed by the matrix for 2 minutes

and then centrifuged at 14,000 rpm for 2 minutes for the elution of the purified

DNA.

41

3.6 Detection and DNA Analysis

3.6.1 Sample Preparation for ABI 3130 Genetic Analyzer

For genotyping of samples on ABI Prism 3130, the samples were pooled into the

genotyping plate according to the following protocol (Butler, 2001).

1. Using a 96-well genotyper plate, 1μL of each amplified sample was carefully

pipetted into each well.

2. Mixed 0.2μL GeneScan® 500-LIZ™ size standard with 13.80μL Hi-Di™

Formamide (ABI) (Schumm, 1997).

3. To each well, 14μL of this cocktail mix was added and a clean plate septum was

placed over the 96-well plate and gently pushed septum into wells.

4. The plate was centrifuged for at least 10 seconds at more than 500 rpm to mix the

sample, remove bubbles, and spin the liquid to the bottom of each well.

5. The plate was put in a thermal cycler at 95°C for about 5 minutes to denature the

double stranded DNA into single stranded.

6. The plate was immediately put on ice for 5 minutes to stop DNA from renaturing.

7. The plate was again centrifuged for at least 10 seconds at 500rpm to mix the

sample, remove bubbles, and spin the liquid to the bottom of each well before

placing it in the genotyping machine.

3.6.2 Sample Electrophoresis

Using a 3130xl Genetic Analyzer (ABI), at 3kV the samples were injected for 10

seconds. In a Performance Optimized Polymer (POP-4), electrophoresis was carried out

for 25minutes at 15 kV and run temperature of 60°C. ABI 3130xl Data Collection

Software application v3.0 was used for collection of data.

42

3.6.3 Absorption/Emission Spectra of Fluorescent Dyes used

for Labelling

The absorption and emission wavelengths of the 5-dye system of ABI which was

used for multiplexing (Butler, 2005) is given below in table 3.4 as well as graphically

shown in figure 3.4.

Table 3.4: Fluorescence Spectral Parameters

Fluorescein FAM VIC NED PET LIZ

Absorption Wavelength (nm) 493 538 546 558 638

Emission Wavelength (nm) 522 554 575 595 655

Figure 3.4: Emission Spectra

3.6.4 Software Programs for Data Analysis

The results from electrophoresis were then analyzed by using GeneMapper

Software v4.0. A threshold of 100 relative fluorescence units (RFUs) was kept to interpret

allele peaks. Alleles were assigned as recommended by ISFG (International Society of

Forensic Genetics) through comparison with standard DNA 9947A (Butler et al., 2004,

Szibor et al., 2006).

43

Once STR genotypes had been generated from population samples the data were

typically evaluated through statistical tests to ensure that the data would be a useful one

when applied to human identity testing (Butler, 2005). A set of inter-population

comparisons of allele frequencies from this study was performed. Statistical tests on

genetic data have been greatly aided by the availability of computer processing power and

a number of available computer programs were used to perform various tests.

RFU values of all the peaks in a dye colour were averaged to calculate the tetra-

colour peak balance. For homozygous peaks, the heights are divided by two and averaged

with the values for heterozygous peaks for each marker in a dye.

After allele scoring, statistical analyses were carried out like Polymorphic

information content (PIC) expected heterozygosity (Het) (Shete et al., 2000), power of

discrimination (PD) in females and males and power of exclusion (PE). Fisher’s Exact

Test was used to check Hardy Weinberg Equilibrium at all selected STR loci.

Allele frequencies were calculated by simple counting. The minimum allele

frequency was set at 5/2N which is in accordance with the recommendations of National

Research Council for reliable statistical calculations (Bär et al., 1997) . For Hardy-

Weinberg Exact Test (Guo and Thompson, 1992), AMOVA, and Linkage Disequilibrium,

Arlequin v3.1 was used (Excoffier et al., 2005). PowerMarker v3.1 (Liu and Muse, 2005)

was used for Exact Test of population differentiation, Gene Diversity, Polymorphic

Information Content (PIC), and Heterozygosity. The ChrX-STR website was used for

calculation of Paternity Index, Power of Exclusion, Power of Discrimination in males

(PDm) and females (PDf), and Mean Exclusion Chance (MEC) (Szibor et al., 2006).

44

Poptree was used to estimate Nei’s distances based on the frequency distribution

(Takezaki et al., 2010). Unrooted tree based on Neighbour-Joining method was

constructed. MEGA5 was used to visualize the tree (Tamura et al., 2011).

3.6.5 Statistical Analysis and Formulae

Proportion of heterozygous individuals in the population is called Heterozygosity

(H). It is calculated by dividing the number of samples containing heterozygous alleles

into the total number of samples. Higher the heterozygosity, more will be the diversity of

alleles and lesser chance of random sample matching. Gene diversity or expected

heterozygosity, is the probability that two alleles chosen at random from the population

are different. Power of discrimination is equal to 1 minus the sum of the square of the

genotype frequencies. Formulae for all these are given below (Butler, 2005).

Polymorphism Information Content (Botstein et al., 1980)

PIC = 1 − (∑ ������� ) −∑ ∑ 2����

����������� ���.

Mean Exclusion Chance by Kruger (Krüger et al., 1968) for autosomal markers in

trios. This is usable for ChrX markers in deficiency cases in which the paternal

grandmother is investigated instead of the alleged father.

MECKRU = ∑ ���(1 − ��)�� +∑ ��(1 − ��)�� +∑ ����(�� + ��)(1 − �� − ��)����

Mean Exclusion Chance by Kishida (Kishida et al., 1997) for ChrX markers in trios

involving daughters.

MECKISH = ∑ ���(1 − ��)� + ∑ ��(1 − ��)�� + ∑ ����(�� + ��)(1 − �� − ��)���

45

Mean Exclusion Chance by Desmarais (Desmarais et al., 1998) for ChrX markers in

trios involving daughters.

MECDES = 1 −∑ ��� +∑ ��� −�� �∑ ���)��� ��

For ChrX markers in father/daughter duos

MECDES =1 − 2∑ ��� +∑ �����

Power of Discrimination in female (Desmarais et al., 1998)

PDf = 1 − 2(∑ ���� )� +∑ ����

Power of Discrimination in male (Desmarais et al., 1998)

PDm= 1 −∑ ����

Homozygosity (Liu and Muse, 2005)

h = ∑ ��������

Heterozygosity

H = 1 Homozygosity

Gene Diversity (Liu and Muse, 2005)

GD = ���∑ �

������� �

������

��

It is a capital mistake to theorize before you have all the evidence. It

biases the judgment.

Sherlock Holmes - A Study in Scarlet

Chapter 4

46

4. Results

4.1. Development of Multiplexes

4.1.1 Screening of Primers

There were 11 X-STR markers (DXS101, DXS6789, DXS6793, DXS7132,

DXS7423, DXS7424, DXS8378, DXS9902, GATA172D05, GATA31E08 and HPRTB)

included in the multiplex along with sex determining locus amelogenin. Moreover, there

was one other marker namely DXS7133 which was used with DXS7424 and DXS101 in

the haplotyping of male sample. The primer designing is critical parameter for parallel

amplification of STRs (Schoske et al., 2003). Alignment score of 8 or greater in

Autodimer program for any marker has been proved to result in significant primer dimer

formation during PCR amplification (Innis et al., 1999). Therefore, the primers which

showed the alignment score 8 or greater than 8, were eliminated from the study. It is

observed that primer interactions showing alignment score 6 or 7 were not problematic

during the multiplex amplification. Results of all primer interactions of 12 primer pairs

with alignment score 6 or greater are shown in table 4.1.

The similar primer Tm value of all primers is also another important parameter to

amplify multiple primers in a single reaction. Therefore, utmost care was taken to include

primers with similar Tm values. An important factor in multiplex optimization is the

specificity of primers to its target region in the human genome. BLAT was used to

confirm that the selected primers were specific only to a single location on the X

chromosome of humans.

47

Table 4.1: Primer interactions of 12 primer pairs with alignment score 6 or greater

No. of Sequences = 24 No. of Hits = 4

Threshold Score = 6

DXS7423-F AGATTTCCTCCCCATCCATC

versus DXS6789-R GCAGAACCAATAGGAGATAGATGGT

Matches = 8 Score = 6 Match Sequence = NCCATCNATC

3'-TGGTAGATAGAGGATAACCAAGACG-5' X| | || | x||| 5'-AGATTTCCTCCCCATCCATC-3'

DXS7132-R ACTCCTGGTGCCAAACTCTA

versus DXS9902-R CAGGAGTATGGGATCACCAG

Matches = 7 Score = 7 Match Sequence = CAGGAGT

5'-CAGGAGTATGGGATCACCAG-3' | | | || | | 3'-ATCTCAAACCGT G GT C CT C A-5'

Amelogenin-F CCCTTTGAAGTGGTACCAGAGCA

versus Amelogenin-F CCCTTTGAAGTGGTACCAGAGCA

Matches = 12 Score = 6 Match Sequence = TGNNNTGGTACCANNNCA

3'-ACGAGACCATGGTGAAGTTTCCC-5' | | xx x| | ||| | || x xx|| 5'-CCCTTTGAAGTGGTA CCAGAGCA-3'

Amelogenin-R GCATGCCTAATATTTTCAGGGAATA

versus Amelogenin-R GCATGCCTAATATTTTCAGGGAATA

Matches = 6 Score = 6 Match Sequence = GCATGC

5'-GC ATGCCTAATATTTTCAGGGAATA-3' || || | | 3'-ATAAGGGACTTTTATA ATC CGTA CG-5'

4.1.2 Development of 7-Plex

First 7 X-STRs were selected and combined into a multiplex. This system was

based on the primers reported by Asamura and colleagues (Asamura et al., 2006) along

with one newly designed primer set for DXS7132. This multiplex was optimized for

sensitivity, specificity, MgCl2 concentration, peak balance, cycle number, annealing

temperatures and was used for the population studies of Punjabi population (Israr et al.,

2012). A female DNA profile female typed with this multiplex is shown in figure 4.1.

48

Figure 4.1: Female DNA profile of 7 X-STRs

49

4.1.3 Development of 12-Plex

After the successful optimization of the 7-plex system, 4 more X-STRs were added

to the system along with the sex determining locus, amelogenin. The following different

parameters were optimized for this multiplex:

4.1.3.1 Sensitivity Studies

Studies were preformed to evaluate minimum quantity required to obtain the full

DNA profile. The GeneScan Analysis threshold for these studies was set at 100 RFU

(relative fluorescence units) (Hanson and Ballantyne, 2004). As shown in figure 4.2, the

optimal quantity of the template DNA for 12-plex PCR system ranged from 1 to 2 ng.

Figure 4.2: Sensitivity of 12-plex

0

2000

4000

6000

8000

10000

12000

14000

16000Sensitivity

5 ng

2.5 ng

1.0 ng

500 pg

250 pg

50

Each amplification at 1-2 ng DNA produced full 11-locus profile with a >100 RFU

threshold. Full profile was obtained with DNA concentration as low as 250pg. On the

other hand, split peaks were observed with higher concentration (<100ng) of DNA

template.

4.1.3.2 Primer Concentration

The primer concentrations were optimized to efficiently amplify DNA samples

with lower concentrations. Many PCRs were performed with primer concentration ranging

from 0.01 to 0.5µM to reach the optimal quantity and concentration for each individual

primer of the 11 X-STRs as well as Amelogenin.

4.1.3.3 Colour Balance

Different labelling dyes affect the amplification of template. Tetracolor balance

was calculated by averaging amplification peak values in each dye lane. As shown in

figure 4.3, in this multiplex system, the blue dye or FAM-6 labelled markers showed the

highest peaks while the yellow dye or NED showed the lowest peaks.

Figure 4.3: Average amplification in each dye lane

0

1000

2000

3000

4000

5000

6000

RFU

Colours

FAM-6

NED

VIC

PET

51

4.1.3.4 Cycle Number

Studies were carried out to optimize best cycle number for the multiplex. Different

cycle numbers from 27 to 35 were tested for the multiplex system. As shown in figure 4.4,

the multiplex exhibited optimal amplification with 33 cycles.

Figure 4.4: Amplification at different numbers of PCR cycles

0

2000

4000

6000

8000

10000

12000

14000

Cycle Number

30 Cycles

33 Cycles

35 Cycles

52

4.1.3.5 Annealing Temperature

Annealing temperature plays critical role in optimization of a multiplex and it is

essential in achieving the optimum annealing temperature at which all the primers in the

multiplex specifically bind to their respective targets. A range of temperatures (57, 59, 60

and 61°C) were tested. As shown in figure 4.5, it was found that 59°C is the optimal

temperature for this multiplex.

Figure 4.5: Amplification at different annealing temperatures

0

2000

4000

6000

8000

10000

12000

14000

Annealing Temperature

57°C

59°C

60°C

53

4.1.3.6 MgCl2 Titration

MgCl2 is essential component of PCR. DNA polymerase needs magnesium as a

cofactor to function. Too little of it affects the ability of polymerase to perform. While too

much of it results in non-specific amplification. Therefore, it is important to add the

optimum amount of MgCl2. Figure 4.6 shows that the 12-plex performed better with a

MgCl2 concentration of 2.0mM.

Figure 4.6: MgCl2 Titration of 12-plex

0

1000

2000

3000

4000

5000

6000MgCl2 Titration

1.0 mM

1.25 mM

1.5 mM

1.75 mM

2.0 mM

54

4.1.3.7 Specificity

The 12-plex was used to amplify a variety of equine and cattle DNA to

demonstrate its specificity for humans. The results showed that this multiplex is highly

specific to amplify only the human DNA since no amplification was observed in the

equine or cattle DNA and only primer dimers are visible in the eletropherograms in figure

4.7a and 4.7b respectively.

Figure 4.7a: Bovine DNA amplified through 12-plex

55

Figure 4.7b: Equine DNA amplified through 12-plex

4.1.3.8 Efficiency of Multiplex Assay

The multiplex PCR system which uses 11 X-STRs and sex determining locus

amelogenin was developed as shown in figure 4.8. All the markers of multiplex showed

high discriminatory powers ranging from 0.745 to 0.953 in females and 0.551 to 0.835 in

males. We selected 11 X-STR markers on the basis of their genetic localization and high

polymorphism and small size. X-STRs included in this study span the whole X-

chromosome and represent all the 4-linkage groups categorized previously by Szibor and

colleagues (Szibor et al., 2003).

56

Figure 4.8: Male DNA profile showing 11 X-STRs plus sex determining locus amelogenin

57

All the markers of this study are well spaced from each other on the X

chromosome to reduce the chance of LD and no evidence of LD was detected in all pair of

markers. Therefore, all the markers of this study are recommended to be considered as

independent markers for forensic practice in Pakistani population. The total length of

amplified fragments of the multiplex was limited to a maximum of 190 basepairs because

small sized markers have a great advantage over large sized markers during the analysis of

degraded DNA (Grubwieser et al., 2003).

4.2. Degraded DNA Studies

Studies were performed on artificially degraded DNA.

4.2.1. Enzymatic Degradation

DNase I enzyme was used for digestion of genomic DNA. The degraded DNA was

run on 2.0% agarose gel stained with ethidium bromide and a DNA ladder to assess the

extent of degradation as shown in figure 4.9a. DNA was extracted from gel and divided

into two categories: lesser than 500bp and fragments that were larger than 500bp.

Figure 4.9a: DNA degraded with DNase I. Lanes 1= 50ng standard, lane 2 and 14 = 100bp ladder, lane 3 = blank, lanes 4-12 = degraded DNA, lane 13 = 25ng standard, lane 15 = non-degraded DNA. The line shows the 500bp mark.

58

4.2.2. Mechanical Degradation

Ultrasonic homogenizer was used for the mechanical degradation of DNA. The

degradation status was confirmed empirically by running the different aliquots on

2%agarose gel. A 20bp DNA ladder was run with the samples to determine the extent of

degradation as shown in figure 4.9b.

Figure 4.9b: Sonicated fragments on gel. Lane 1 and 15 = control, Lane 2 = 25ng DNA, Lane 3-11= Sonicated DNA from 30s up to 5 minutes of ultrasonication at 30s interval, Lane 12 = 20bp DNA Ladder, Lane 13= blank, Lane 14 = 50ng DNA.

DNA was extracted from gel in the range of 200-300bp and also fragments of

lesser than 200bp. This DNA was then amplified with the multiplex. Full profile was

observed in the fragments of 300bp while there were partial profiles in the less than 200bp

DNA. The smaller size amplicons were observed but the larger amplicons (>150bp)

dropped out with this highly degraded DNA (Bender et al., 2004). Figure 4.10 is

graphical representation of degraded DNA at different stages amplified with the 12-plex

system.

59

Figure 4.10: Degraded DNA amplified with 12-plex.

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

10000

Am

el

DX

S101

DX

S67

89

DX

S67

93

DX

S71

32

DX

S74

23

DX

S74

24

DX

S83

78

DX

S99

02

GA

TA1

72D

05

GA

TA3

1E08

HP

RTB

Degraded DNA Studies

Control >1000bp >700bp >300bp

60

4.3. Population Genetics

Once the multiplex was optimized, it was used to study the population living in the

four provinces of Pakistan, i.e. Baluchistan, Khyber-Pakhtunkhwa (KP), Punjab and

Sindh. Different parameters were determined for each of these populations. Global test of

differentiation was done amongst the 4 population and the exact test found no significant

difference in the allele frequency distribution of these populations (Exact P value =

0.19405 +- 0.07987 (100000 Markov steps done)). Table 4.2 shows the population specific

F statistics (Fst) amongst all the 4 populations while figure 4.11 is the graphical

representation of pair-wise F statistics between the populations.

Table 4.2: Population Specific F.Statistics

Locus Overall Punjab Baluchistan KP Sindh

DXS101 0.0175 0.0076 0.0045 0.0438 0.0278

DXS6789 0.0591 0.1181 0.0170 0.0077 -0.0093

DXS7132 0.0652 0.0431 0.0573 0.1216 0.0571

DXS7423 0.1102 0.1424 0.0873 0.0856 0.0587

DXS7424 0.1309 1.0000 0.1352 0.1185 0.1543

DXS8378 0.1350 0.1147 0.1997 0.1400 0.0778

GATA172D05 0.0199 0.0263 -0.0258 0.0534 0.0091

GATA31E08 0.0062 0.0046 0.0069 0.0003 0.0271

DXS7133 0.3045 1.0000 0.2899 0.3096 0.3115

DXS6793 0.5783 0.5778 1.0000 1.0000 0.5811

DXS9902 0.3154 0.3144 0.3581 1.0000 0.2962

HPRTB 0.0426 0.0468 0.0417 0.0478 0.0083

Overall 0.0999 0.1801 0.1094 0.1489 0.0803

61

Figure 4.11: Matrix of pair wise Fst between populations. PKA = Punjab, PKB = Baluchistan, PKPT = KP, PKS = Sindh

62

4.3.1. Baluchistan

DNA was isolated from the blood of 100 individuals (41 males, 59 females) after

informed consent was taken. Allele frequencies of males and females were pooled together

for all the 150 chromosomes studied. Figure 4.12 shows population assignment test of

Baluchi population against KP, Punjabi and Sindhi populations.

Figure 4.12: Population Assignment Test for Baluchistan against other populations.

PKA = Punjab, PKB = Baluchistan, PKPT = KP, PKS = Sindh

63

A total of 59 alleles were found at all the 9 loci tested. Maximum of 10 alleles

were observed at locus DXS6789 while minimum of 5 alleles were found at both

DXS7423 and DXS8378 as shown in figure 4.13.

Figure 4.13: Number of alleles at each locus

Table 4.3 shows allele frequency distribution while table 4.4 shows forensic

efficiency parameters for the 10 X-STRs in Baluchi population. Minimum gene diversity

of 0.651 was observed at locus DXS8378 while maximum of 0.8272 was observed at

locus DXS101. Locus GATA31E08 was the most heterozygous with 0.8261 while the

least heterozygous was DXS7424. Polymorphic Information Content (PIC) ranged from

0.5884 at locus DXS7423 to 0.8084 at locus DXS101. The combined power of

discrimination in females is 1.16 x 10-10 while in male it is 9.71 x 10-7. The population

deviates from Hardy-Weinberg at locus DXS101, DXS6789 and DXS7424 even after

Bonferroni’s correction for multiple testing.

64

Table 4.3: Allele frequency distribution of 10 X-STRs in Baluchi population (150 chromosomes)

Alleles DXS101 DXS6789 DXS7132 DXS7423 DXS7424 DXS8378 GATA172D05 GATA31E08 HPRTB

6 0.1838

7 0.2482

8 0.1397 0.0146

9 0.0444 0.1029 0.1314

10 0.3333 0.2426 0.2774

11 0.5037 0.2353 0.2628 0.0076

12 0.1022 0.0292 0.0917 0.0741 0.0956 0.0657 0.1212

13 0.2847 0.4161 0.0734 0.0444 0.3485

14 0.1022 0.3869 0.3869 0.1743 0.3182

15 0.0584 0.1825 0.0949 0.2569 0.1364

16 0.0219 0.0365 0.0730 0.2661 0.0682

17 0.0073 0.0826

18 0.0292 0.0550

19 0.2628

20 0.1898

21 0.1261 0.2409

22 0.0420 0.0730

23 0.1933 0.0146

24 0.2857 0.0073

25 0.1008

26 0.1261

27 0.0924

28 0.0336

65

Table 4.4: Different Forensic Efficiency Parameters for Baluchi Population

Marker DXS101 DXS6789 DXS7132 DXS7423 DXS7424 DXS8378 GATA172D05 GATA31E08 HPRTB Mean

GeneDiversity 0.8272 0.798 0.7079 0.6524 0.8223 0.651 0.8169 0.7647 0.734 0.7399

Heterozygosity 0.5 0.6304 0.6304 0.587 0.2121 0.5652 0.6957 0.8261 0.75 0.6397

PIC 0.8084 0.771 0.6592 0.5884 0.8023 0.5892 0.7908 0.7254 0.6896 0.6979

Power of Exclusion (PE):

0.651259 0.629348 0.466675 0.372047 0.616166 0.322944 0.62292 0.545737 0.491744

Paternity Index (PI): 0.086192 0.091926 0.137902 0.168968 0.095416 0.187118 0.093624 0.114679 0.130319 aPD female: 0.949313 0.941976 0.878038 0.824066 0.937615 0.796644 0.938124 0.909584 0.889706 bPD male 0.827617 0.816149 0.724195 0.662065 0.809168 0.625764 0.812753 0.770642 0.739363

MEC Krüger: 0.661871 0.639941 0.485447 0.40141 0.626118 0.366295 0.625898 0.550794 0.509617

MEC Kishida: 0.806646 0.791926 0.678301 0.600331 0.7832 0.56234 0.785824 0.732831 0.697

MEC Desmarais: 0.806646 0.791926 0.678301 0.600331 0.7832 0.562461 0.785938 0.732831 0.697

MEC Desmarais Duo: 0.691712 0.672939 0.536532 0.455305 0.661326 0.416931 0.66371 0.598602 0.557869

Hardy Weinberg 0 0.0008 0.163 0.032 0 0.138 0.192 0.295 0.027 aOnly female’s data is used for the analysis bOnly male data is used for the analysis. MEC Krueger, Mean Exclusion Chance for Autosomal markers typed in trios involving mother, child and putative father MEC Kishida, Mean Exclusion Chance for ChrX markers which covers trios including a daughter MEC Desmarais, Mean Exclusion Chance for ChrX markers in trios involving daughters MEC Desmarais, Mean Exclusion Chance for father/daughter duos lacking maternal genotype information

66

4.3.2. Khyber-Pakhtunkhwa (KP)

DNA samples of a total of 120 individuals (81 males and 39 females) were studied.

Allele frequency distribution and other forensic efficiency parameters for 9 X-STRs were

observed in the KP population. At all of the 9 loci, 61 alleles were found. Maximum of 10

alleles were observed at locus DXS6789 while minimum of 5 alleles were observed at

DXS7423 as shown in figure 4.14.

Figure 4.14: Number of alleles at each locus

Table 4.5 shows allele frequency distribution while table 4.6 shows other forensic

efficiency parameters for KP population. Minimum gene diversity of 0.56 was observed at

locus DXS8378 while maximum of 0.8075 was observed at locus DXS6789. Locus

GATA31E08 was the most heterozygous with 0.80 while the least heterozygous was

DXS7424 with 0.3684. Polymorphic Information Content (PIC) ranged from 0.5202 at

locus DXS8378 to 0.7781 at locus DXS6789. The combined power of discrimination in

females is 9.4 x 10-10 while in male it is 4.02 x 10-6. The population is in perfect Hardy-

Weinberg equilibrium at all loci except DXS7424 and GATA31E08 but after Bonferroni’s

correction, all loci are in equilibrium.

0

2

4

6

8

10

12KP

6

7

Table 4.5: Allele frequency distribution of KP population (159 Chromosomes)

Alleles DXS101 DXS6789 DXS7132 DXS7423 DXS7424 DXS8378 GATA172D05 GATA31E08 HPRTB

6 0.0413

7 0.2586

8 0.1736 0.0172

9 0.0661 0.0826 0.1724

10 0.1983 0.4132 0.2586

11 0.5372 0.1405 0.2414

12 0.0796 0.0833 0.0932 0.1570 0.1488 0.0517 0.0769

13 0.3274 0.4083 0.0678 0.0165 0.2479

14 0.1525 0.4071 0.3917 0.1102 0.0248 0.3846

15 0.0678 0.1239 0.0833 0.2034 0.2308

16 0.0169 0.0531 0.0333 0.3390 0.0342

17 0.1356 0.0256

18 0.0169 0.0088 0.0508

19 0.1864

20 0.0932

21 0.0661 0.2797

22 0.0496 0.1356

23 0.3471 0.0339

24 0.1818 0.0169

25 0.1818

26 0.0826

27 0.0496

28 0.0331

29 0.0083

6

8

Table 4.6: Different forensic efficiency parameters for KP population

Marker DXS101 DXS6789 DXS7132 DXS7423 DXS7424 DXS8378 GATA172D05 GATA31E08 HPRTB Mean

GeneDiversity 0.7738 0.8075 0.7701 0.635 0.6524 0.56 0.7413 0.7788 0.6731 0.6767

Heterozygosity 0.6 0.7895 0.6842 0.7 0.3684 0.6 0.8 0.75 0.5789 0.6371

PIC 0.738 0.7781 0.7349 0.5679 0.5921 0.5202 0.7042 0.7425 0.6101 0.6293

Power of Exclusion (PE):

0.581663 0.665802 0.432129 0.379722 0.579909 0.33363 0.510126 0.554009 0.472357

Paternity Index (PI):

0.104713 0.082429 0.148753 0.166276 0.105193 0.183019 0.124895 0.112356 0.136164

aPD female: 0.931056 0.952826 0.862221 0.82967 0.929146 0.823624 0.905475 0.912239 0.881189 bPD male 0.790573 0.835143 0.702494 0.667449 0.789615 0.633961 0.75021 0.775288 0.727673

MEC Krüger: 0.607703 0.67432 0.46035 0.409694 0.601916 0.402438 0.542572 0.556349 0.491739

MEC Kishida: 0.765374 0.815147 0.653106 0.607709 0.763023 0.59145 0.718081 0.737906 0.683024

MEC Desmarais: 0.765489 0.815147 0.653225 0.607709 0.763023 0.59157 0.718081 0.738023 0.683024

MEC Desmarais Duo:

0.64003 0.703094 0.510109 0.46279 0.636412 0.443062 0.581725 0.604497 0.541901

Hardy-Weinberg 0.183 0.701 0.158 0.802 0.005 1 0.745 0.007 0.368 aOnly female’s data is used for the analysis bOnly male data is used for the analysis. MEC Krueger, Mean Exclusion Chance for Autosomal markers typed in trios involving mother, child and putative father MEC Kishida, Mean Exclusion Chance for ChrX markers which covers trios including a daughter MEC Desmarais, Mean Exclusion Chance for ChrX markers in trios involving daughters MEC Desmarais, Mean Exclusion Chance for father/daughter duos lacking maternal genotype information

69

Figure 4.15 shows population assignment test of KP population against Baluchi,

Punjabi and Sindhi populations.

Figure 4.15: Population Assignment Test for KP against other populations. PKA = Punjab, PKB = Baluchistan, PKPT = KP, PKS = Sindh

70

4.3.3. The Punjab

Total of 250 individuals (116 females and 134 males) were genotyped. Different

forensic efficiency parameters along with allele frequency distribution were determined.

At all of the 10 loci, 67 alleles were found. Maximum of 9 alleles were found each at

locus DXS6789 and DXS101 while minimum of 5 alleles were found each at DXS8378

and DXS9902 as shown in figure 4.16.

Figure 4.16: Number of alleles at different loci

Table 4.7 shows allele frequency distribution while table 4.8 shows other forensic

efficiency parameters for Punjabi population. Minimum gene diversity of 0.5351 was

observed at locus DXS6793 while maximum of 0.8332 was observed at locus DXS101.

Locus DXS6789 was the most heterozygous with 0.8022 while the least heterozygous

was DXS7423 with 0.5604. Polymorphic Information Content (PIC) ranged from 0.4869

at locus DXS6793 to 0.8116 at locus DXS101. The combined power of discrimination in

females is 4.69 x 10-10 while in male it is 2.27 x 10-6. After Bonferroni’s correction, the

population is in perfect Hardy-Weinberg equilibrium at all loci.

0123456789

10The Punjab

71

Table 4.7: Allele Frequency Distribution in Punjabi Population (366 Chromosomes)

Allele DXS101 DXS6789 DXS6793 DXS7132 DXS7423 DXS8378 DXS9902 GATA172D05 GATA31E08 HPRTB

6 0.0898

7 0.064 0.2647

8 0.1575 0.0490

9 0.0073 0.1099 0.0769 0.1176

10 0.6277 0.0138 0.2711 0.3773 0.2157

11 0.0219 0.2630 0.3919 0.1832 0.3039 0.1136

12 0.0036 0.0034 0.0034 0.4291 0.2198 0.0513 0.0392 0.2601

13 0.1861 0.1065 0.0206 0.2630 0.0073 0.0098 0.3700

14 0.1387 0.2784 0.4777 0.0311 0.2051

15 0.1821 0.0146 0.3402 0.3677 0.0476

16 0.0584 0.2096 0.1065 0.0037

17 0.0103 0.0550 0.0241

18 0.0069

19 0.0241

20 0.4055

21 0.0756 0.1924

22 0.1684 0.0790

23 0.2509 0.0447

24 0.2096 0.0034

25 0.1375

26 0.0653

27 0.0653

28 0.0137

29 0.0137

72

Table 4.8: Forensic Efficiency Parameters for Punjabi Population

Marker DXS101 DXS6789 DXS7132 DXS7423 DXS8378 GATA31E08 DXS6793 DXS9902 GATA172D05 HPRTB Mean

GeneDiversity 0.8332 0.7802 0.753 0.613 0.673 0.7763 0.5351 0.6924 0.7591 0.737 0.7152

Heterozygosity 0.7033 0.8022 0.6813 0.5604 0.5824 0.7143 0.5862 0.6552 0.7126 0.6782 0.6676

PIC 0.8116 0.7519 0.7117 0.5367 0.6123 0.7402 0.4869 0.6379 0.7278 0.6939 0.6711

Power of Exclusion (PE):

0.658232 0.514987 0.50703 0.320984 0.392672 0.550264 0.236527 0.447823 0.539432 0.489879

Paternity Index (PI): 0.084383 0.123478 0.125801 0.18788 0.161811 0.113406 0.224319 0.143762 0.116462 0.130875 aPD female: 0.950159 0.907587 0.894864 0.788093 0.834009 0.912916 0.754261 0.86613 0.914101 0.888226 bPD male 0.831233 0.753043 0.748399 0.62424 0.676378 0.773189 0.551363 0.712476 0.767076 0.73825

MEC Krüger: 0.665253 0.549203 0.518715 0.352492 0.40653 0.560068 0.322129 0.460309 0.562254 0.504482

MEC Kishida: 0.809875 0.721501 0.706566 0.553528 0.615118 0.737431 0.50678 0.661276 0.735431 0.694989

MEC Desmarais: 0.809875 0.721618 0.706566 0.553528 0.615118 0.737548 0.506899 0.661276 0.735431 0.694989

MEC Desmarais Duo: 0.695838 0.586721 0.568332 0.408436 0.468179 0.604832 0.359172 0.517013 0.602147 0.55493

Hardy-Weinberg 0.0118 0.05 0.3389 0.0455 0.3352 0.3736 0.234 0.5535 0.6089 0.438 aOnly female’s data is used for the analysis bOnly male data is used for the analysis. MEC Krueger, Mean Exclusion Chance for Autosomal markers typed in trios involving mother, child and putative father MEC Kishida, Mean Exclusion Chance for ChrX markers which covers trios including a daughter MEC Desmarais, Mean Exclusion Chance for ChrX markers in trios involving daughters MEC Desmarais, Mean Exclusion Chance for father/daughter duos lacking maternal genotype information

73

Figure 4.17 shows population assignment test of Punjabi population against

Baluchi, KP and Sindhi populations.

Figure 4.17: Population Assignment Test for Punjab against other populations.

PKA = Punjab, PKB = Baluchistan, PKPT = KP, PKS = Sindh

74

4.3.4. Sindh

Total of 100 individuals (73 females and 27 males) were genotyped. Different

forensic efficiency parameters along with allele frequency distribution were determined.At

all of the 10 loci, 72 alleles were found. Maximum of 10 alleles were found at locus

DXS6789 while minimum of 5 alleles were found each at DXS8378 and DXS9902 and

HPRTB as shown in figure 4.18.

Figure 4.18: Number of alleles at different loci

Table 4.9 shows allele frequency distribution while table 4.10 shows other forensic

efficiency parameters for Sindhi population. Minimum gene diversity of 0.6 was observed

at locus DXS6793 while maximum of 0.8022 was observed at locus DXS6789. Locus

DXS7132 was the most heterozygous with 0.7857 while the least heterozygous was

DXS7424 with 0.4286. Polymorphic Information Content (PIC) ranged from 0.5482 at

locus DXS6793 to 0.7956 at locus DXS7424. The combined power of discrimination in

females is 1.28 x 10-11 while in male it is 2.64 x 10-7. After Bonferroni’s correction, the

population deviates from Hardy-Weinberg equilibrium only at DXS7424. This deviation

observed could potentially be due to population sampling effects.

0

2

4

6

8

10

12Sindh

75

Table 4.9: Allele Frequency Distribution in Sindhi Population (173 Chromosomes)

Alleles DXS101 DXS6789 DXS6793 DXS7132 DXS7423 DXS7424 DXS8378 DXS9902 GATA172D05 GATA31E08 HPRTB

6 0.0536

7 0.0179 0.3455

8 0.0714 0.0727

9 0.0196 0.0370 0.1000 0.1071 0.1273

10 0.5882 0.2593 0.2600 0.3393 0.2545

11 0.0196 0.2778 0.2600 0.1786 0.1818 0.0784

12 0.0196 0.1481 0.0980 0.0566 0.3889 0.3400 0.2321 0.0182 0.2941

13 0.2941 0.1852 0.3725 0.1132 0.0370 0.0400 0.2941

14 0.1579 0.0588 0.4259 0.3922 0.0943 0.2157

15 0.0175 0.2037 0.0980 0.2075 0.1176

16 0.0175 0.0185 0.0196 0.3585

17 0.0185 0.0196 0.0943

18 0.0175 0.0755

19 0.2632

20 0.1930

21 0.0385 0.1579

22 0.0385 0.0351

23 0.2692 0.0877

24 0.2885 0.0526

25 0.1538

26 0.0769

27 0.0769

28 0.0385

29 0.0192

76

Table 4.10: Forensic Efficiency Parameters for Sindhi Population

Marker DXS101 DXS6789 DXS7132 DXS7423 DXS7424 DXS8378 GATA172D05 GATA31E08 DXS6793 DXS9902 HPRTB Mean

GeneDiversity 0.7806 0.8022 0.6837 0.6607 0.8189 0.7044 0.7778 0.78 0.6 0.74 0.7533 0.7277

Heterozygosity 0.5714 0.4667 0.7857 0.6429 0.4286 0.7333 0.5333 0.6 0.6667 0.7333 0.7333 0.6195

PIC 0.7529 0.7753 0.6477 0.6124 0.7956 0.6524 0.7484 0.745 0.5482 0.6937 0.7108 0.6886

Power of Exclusion (PE):

0.606489 0.657787 0.460195 0.409109 0.578637 0.430754 0.561285 0.52875 0.248721 0.488793 0.527914

Paternity Index (PI):

0.097999 0.084498 0.139901 0.156277 0.105541 0.149195 0.110327 0.119507 0.218542 0.1312 0.119746

aPD female: 0.935985 0.950378 0.879704 0.84772 0.929626 0.855281 0.919962 0.905573 0.745111 0.884898 0.902643 bPD male 0.804002 0.831003 0.720198 0.687447 0.788918 0.701609 0.779345 0.760986 0.562916 0.7376 0.760508

MEC Krüger: 0.622908 0.666075 0.487643 0.436914 0.603011 0.44202 0.577966 0.542346 0.309866 0.496883 0.534522

MEC Kishida: 0.778402 0.809828 0.678073 0.632736 0.762985 0.645927 0.747996 0.723686 0.498948 0.691352 0.72039 MEC Desmarais:

0.778402 0.809941 0.678191 0.632856 0.7631 0.645927 0.747996 0.723686 0.499069 0.691352 0.720507

MEC Desmarais Duo:

0.656251 0.696181 0.535717 0.488806 0.636586 0.501012 0.617705 0.588161 0.355001 0.55072 0.584035

Hardy Weinberg

0.038 0.004 0.301 0.073 0.002 0.387 0.112 0.308 0.506 0.662 0.256

aOnly female’s data is used for the analysis bOnly male data is used for the analysis. MEC Krueger, Mean Exclusion Chance for Autosomal markers typed in trios involving mother, child and putative father MEC Kishida, Mean Exclusion Chance for ChrX markers which covers trios including a daughter MEC Desmarais, Mean Exclusion Chance for ChrX markers in trios involving daughters MEC Desmarais, Mean Exclusion Chance for father/daughter duos lacking maternal genotype information

77

Figure 4.19 shows population assignment test of Sindhi population against Baluchi,

KP and Punjabi populations.

Figure 4.19: Population Assignment Test for KP against other populations. PKA = Punjab, PKB = Baluchistan, PKPT = KP, PKS = Sindh

78

4.4. Linkage Disequilibrium (LD)

In female germ line, recombination rate of X-chromosome is lower which render X-

linked markers to have higher LD as compared to Autosomal markers (Pereira et al., 2007).

For all pair of loci, the exact test for LD was performed in Baluchi, KP, Punjabi and Sindhi

populations for all pairs of markers in this study. To reduce the chance of LD, All the

markers of this study are well spaced from each other and no evidence of LD was detected in

all pair of markers. Linkage disequilibrium matrices for Baluchi, KP, Punjabi and Sindhi

populations are given in tables 4.11, 4.12, 4.13 and 4.14, respectively. Any significant

linkage disequilibrium is given in bold font. The differences between populations regarding

LD is due to the fact that LD is not only monotonic function of distance between two

markers (Lonjou et al., 2003) but there are some other factors like selection, mutation,

random drift, population admixture, founder effect and stratification (Chakravarti, 1999). All

markers of this study are to be considered as independent and are recommended for forensic

practice in Baluchi, KP, Punjabi and Sindhi populations. The statistical parameters showed

that these markers are appropriate for individual identification, paternity testing involving a

female child.

79

Table 4.11: Linkage Disequilibrium Matrix Baluchistan

Marker DX

S1

01

DX

S6

789

DX

S7

132

DX

S7

423

DX

S7

424

DX

S8

378

GA

TA

172D05

GA

TA

31E0

8

HP

RT

B

DXS101 0.2279 0.2051 0.1685 0.2988 0.1701 0.2275 0.1701 0.2175

DXS6789 0.2028 0.1713 0.1974 0.1576 0.2399 0.2499 0.2165

DXS7132 0.1446 0.1225 0.1431 0.1817 0.1928 0.1781

DXS7423 0.1621 0.0721 0.1337 0.1840 0.1409

DXS7424 0.2044 0.2814 0.2391 0.1813

DXS8378 0.1365 0.1598 0.1346

GATA172D05 0.2359 0.1766

GATA31E08 0.7020

HPRTB

80

Table 4.12: Linkage Disequilibrium Matrix KP

Marker DX

S101

DX

S678

9

DX

S713

2

DX

S742

3

DX

S742

4

DX

S837

8

GA

TA

172D

05

GA

TA

31E08

HP

RT

B

DXS101 0.3415 0.3514 0.2188 0.2421 0.2237 0.3407 0.2741 0.2222

DXS6789 0.1989 0.2664 0.2431 0.2297 0.2804 0.4680 0.2846

DXS7132 0.2046 0.1829 0.2129 0.3200 0.3193 0.2114

DXS7423 0.0739 0.4439 0.2529 0.2024 0.1998

DXS7424 0.1722 0.1820 0.2554 0.2712

DXS8378 0.2209 0.1986 0.2488

GATA172D05 0.3482 0.2814

GATA31E08 0.1723

HPRTB

81

Table 4.13: Linkage Disequilibrium Matrix the Punjab

Marker DX

S1

01

DX

S6

789

DX

S7

132

DX

S7

423

DX

S8

378

GA

TA

31E0

8

DX

S6

793

DX

S9

902

GA

TA

172D05

HP

RT

B

DXS101 0.2005 0.1163 0.0739 0.1336 0.0959 0.1128 0.1264 0.1554 0.1001

DXS6789 0.1445 0.0884 0.1122 0.1172 0.1217 0.1384 0.1384 0.0949

DXS7132 0.0561 0.0971 0.0884 0.1284 0.0639 0.079 0.0967

DXS7423 0.0307 0.0557 0.0471 0.0898 0.0649 0.0581

DXS8378 0.0684 0.038 0.0732 0.0863 0.0707

GATA31E08 0.1036 0.1012 0.0771 0.1064

DXS6793

0.1114 0.1165 0.0772

DXS9902 0.0699 0.0268

GATA172D05 0.0694

HPRTB

82

Table 4.14: Linkage Disequilibrium Matrix Sindh

Marker DX

S1

01

DX

S6

789

DX

S7

132

DX

S7

423

DX

S7

424

DX

S8

378

GA

TA

172D05

GA

TA

31E0

8

DX

S6

793

DX

S9

902

HP

RT

B

DXS101

0.3961 0.4544 0.2853 0.3741 0.4007 0.4127 0.4655 0.3432 0.3532 0.3327

DXS6789

0.3990 0.1761 0.4138 0.3972 0.4326 0.3282 0.3335 0.3472 0.2547

DXS7132

0.3250 0.2787 0.3455 0.3224 0.3939 0.4542 0.3606 0.3890

DXS7423

0.2125 0.3214 0.3280 0.3888 0.4263 0.3418 0.4634

DXS7424

0.2571 0.4942 0.3971 0.2986 0.3093 0.2980

DXS8378

0.2355 0.2697 0.2535 0.2686 0.1795

GATA172D05

0.4340 0.3381 0.3470 0.2256

GATA31E08

0.3408 0.3148 0.3646

DXS6793

0.2835 0.3148

DXS9902

0.2775

HPRTB

83

4.5. Haplotype Analysis

The haplotypes of X-chromosome cluster (DXS7424-DXS101) was studied in the

Pakistani population. A total of 149 DNA samples from unrelated Pakistani males (KP =

81, Baluchi = 41 and Sindhi = 27) were analyzed in a triplex PCR reaction as described in

materials and methods section. The optimal quantity of the template DNA for triplex PCR

system ranged from 1 to 2 ng using 30 cycles of PCR. The tetraplex profile of male DNA

is shown in figure 4.20.

Figure 4.20: Tetraplex profile of male DNA

84

In 41 males from Baluchi population, we found 23 different haplotypes for the

cluster DXS7424-DXS101. As shown in table 4.15, haplotype 16-23 was the most

prevalent in the studied sample with frequency of 14%.

Table 4.15: Haplotype analysis in 41 Baluchi males of DXS7424-DXS101 cluster

S. No. Haplotype

Frequency S.D. Upper frequency DXS7424 DXS101

1 10 25 0.02439 0.02439 0.04878

2 10 26 0.02439 0.02439 0.04878

3 13 21 0.02439 0.02439 0.04878

4 13 26 0.02439 0.02439 0.04878

5 14 21 0.02439 0.02439 0.04878

6 14 23 0.02439 0.02439 0.04878

7 14 24 0.097561 0.046916 0.144477

8 14 25 0.02439 0.02439 0.04878

9 14 26 0.02439 0.02439 0.04878

10 15 21 0.02439 0.02439 0.04878

11 15 22 0.02439 0.02439 0.04878

12 15 23 0.073171 0.041175 0.114346

13 15 25 0.02439 0.02439 0.04878

14 15 28 0.02439 0.02439 0.04878

15 16 21 0.04878 0.034059 0.082839

16 16 23 0.146341 0.055885 0.202226

17 16 24 0.097561 0.046916 0.144477

18 16 25 0.02439 0.02439 0.04878

19 16 26 0.097561 0.046916 0.144477

20 16 27 0.04878 0.034059 0.082839

21 17 23 0.02439 0.02439 0.04878

22 17 24 0.02439 0.02439 0.04878

23 17 25 0.02439 0.02439 0.04878

85

In 81 males studied from KP population, we found 36 different haplotypes for the

cluster DXS7424-DXS101. As shown in table 4.16, haplotype 16-23 was the most

prevalent with frequency of 11% in the population sample.

Table 4.16: Haplotype analysis in 81 KP males of DXS7424-DXS101 cluster

S. No. Haplotype

Frequency S.D. Upper frequency DXS7424 DXS101

1 11 21 0.012346 0.012346 0.024692

2 11 22 0.012346 0.012346 0.024692

3 11 26 0.012346 0.012346 0.024692

4 12 23 0.037037 0.021114 0.058151

5 12 24 0.012346 0.012346 0.024692

6 12 26 0.012346 0.012346 0.024692

7 13 22 0.012346 0.012346 0.024692

8 13 23 0.037037 0.021114 0.058151

9 13 25 0.012346 0.012346 0.024692

10 13 26 0.024691 0.01735 0.042041

11 13 28 0.024691 0.01735 0.042041

12 14 22 0.012346 0.012346 0.024692

13 14 23 0.024691 0.01735 0.042041

14 14 24 0.049383 0.024224 0.073607

15 14 25 0.037037 0.021114 0.058151

16 14 26 0.024691 0.01735 0.042041

17 14 27 0.012346 0.012346 0.024692

18 15 22 0.037037 0.021114 0.058151

19 15 23 0.098765 0.033356 0.132121

20 15 24 0.037037 0.021114 0.058151

21 15 25 0.024691 0.01735 0.042041

22 15 26 0.024691 0.01735 0.042041

23 15 27 0.012346 0.012346 0.024692

24 16 21 0.037037 0.021114 0.058151

25 16 23 0.111111 0.035136 0.146247

26 16 24 0.012346 0.012346 0.024692

27 16 25 0.061728 0.026907 0.088635

28 16 26 0.012346 0.012346 0.024692

29 17 23 0.037037 0.021114 0.058151

30 17 24 0.012346 0.012346 0.024692

31 18 23 0.037037 0.021114 0.058151

32 18 24 0.012346 0.012346 0.024692

33 18 25 0.012346 0.012346 0.024692

34 18 28 0.012346 0.012346 0.024692

86

In 27 males studied from the Sindhi population, 19 different haplotypes were

found for the cluster DXS7424-DXS101.As shown in table 4.17, haplotype 16-24 was the

most prevalent with frequency of 11% in the population sample.

Table 4.17: Haplotype analysis in 27 Sindhi males of DXS7424-DXS101 cluster

No. Haplotype

Frequency S.D. Upper frequency DXS7424 DXS101

1 12 26 0.037037 0.037037 0.074074

2 13 24 0.074074 0.051361 0.125435

3 13 29 0.037037 0.037037 0.074074

4 13 28 0.037037 0.037037 0.074074

5 14 26 0.037037 0.037037 0.074074

6 14 27 0.037037 0.037037 0.074074

7 15 23 0.074074 0.051361 0.125435

8 15 24 0.074074 0.051361 0.125435

9 15 25 0.037037 0.037037 0.074074

10 16 23 0.074074 0.051361 0.125435

11 16 24 0.111111 0.061633 0.172744

12 16 25 0.074074 0.051361 0.125435

13 16 26 0.037037 0.037037 0.074074

14 16 27 0.074074 0.051361 0.125435

15 16 29 0.037037 0.037037 0.074074

16 17 21 0.037037 0.037037 0.074074

17 18 25 0.037037 0.037037 0.074074

18 18 24 0.037037 0.037037 0.074074

19 17 25 0.037037 0.037037 0.074074

The locus by locus amova result showed average F-Statistics at all loci as FST =

0.01160.

In this cluster, 9 alleles for marker DXS101 were identified while 8 alleles were

identified for DXS7424. Therefore, 72 different combinations are possible but we

identified only 46 different haplotypes through genotyping.

87

4.5.1. Allele Frequency for DXS7133

In the 149 samples analysed for haplotype cluster DXS7424-DXS101. Table 4.18

shows the allele frequencies that were observed for DXS7133 in the 3 populations.

Table 4.18: Allele Frequency of DXS7133 in male sample of 3 populations

Allele Baluchistan (n = 41) KP (n = 81) Sindh (n =27) 7 0.0154 8 0.5538 9 0.1538 0.0377 10 0.5610 0.2769 0.5660 11 0.2195 0.2075 12 0.2195 0.1887 PIC 0.5236 0.5274 0.5769 PD (male) 0.588919 0.59274 0.625123

88

4.6. Allelic Ladder

Overall, 85 different alleles were found for all the 12 loci in the 4 populations

studied. Maximum of 11 alleles were found for locus DXS6789 while minimum of 5

alleles were found for locus DXS9902. Table 4.19 shows alleles found for each locus

along with the range of amplicon length observed in the populations studied. These alleles

were used to construct allelic ladder.

Table 4.19: Observed alleles, fragment lengths and repeat size of 12 X-linked STRs in Baluchi, KP, Sindhi and Punjabi Populations

Locus No. of Alleles Fragment length Repeat

Size Genotypes (9947A)

DXS101 9 142-166 3 24, 26

DXS6789 11 122-162 4 21,22

DXS6793 7 178-196 3 10,10

DXS7132 7 128-152 4 12,12

DXS7133 6 80-100 4 9, 10

DXS7423 6 99-119 4 14,15

DXS7424 7 82-100 3 14, 16

DXS8378 6 95-115 4 10,11

DXS9902 5 169-185 4 11,11

GATA172D05 7 108-132 4 10,10

GATA31E08 7 101-125 4 11,11

HPRTB 7 161-185 4 14,14

Allelic ladder is artificial mixture of all common alleles of particular STR

present in human population (Sajantila et al., 1992). The allelic ladder has been used to

take in to account the run to run size variations of markers due to varying environmental

parameters such as temperature fluctuations (Butler, 2005). Therefore, allelic ladder was

constructed including all the common alleles of 12 X-STRs in this population (Butler,

2005). The exact allele sizes were determined by comparing PCR products with standard

DNA 9947A.

89

Altogether, ladder marker consisted of 81 alleles for 12 X-STRs. The alleles

included in ladder are shown in table 4.20. The resulting position of all alleles of ladder

was obtained in such way that there was no overlapping between two adjacent markers in

the ladder. The minimum difference of 4 base pairs (bp) between two adjacent markers

was maintained. Therefore, overlapping between the markers of multiplex was not

observed in the whole population. Allelic ladder was used for genotyping of few samples

as initial proof-of-concept experiment. The allelic ladder is given in figure 4.21.

Table 4.20: alleles of 12 X-STRs for allelic ladder

Locus Alleles included in allelic ladder

DXS101 21,22,23,24,25,26,27,28,29

DXS6789 14,15,16,17,18,19,20,21,22,23,24

DXS6793 9,10,11,12,13,14,15

DXS7132 12,13,14,15,16,17,18

DXS7133 9,10,11,12

DXS7423 12,13,14,15,16

DXS7424 12,13,14,15,16,17,18

DXS8378 9,10,11,12,13,14

DXS9902 9,10,11,12,13

GATA172D05 6,7,8,9,10,11,12

GATA31E08 7,8,9,10,11,12,13

HPRTB 11,12,13,14,15,16

Ladder was constructed by mixing all the hemizygous male samples having all the

different alleles for a particular locus. This mixture was then amplified in thermocyler

with Polymerase, buffer and the specific primers for that locus.

90

Figure 4.21: Allelic ladder for 12 X-STRs

91

4.7. Phylogenetic Analysis

As STR loci have high mutation rates, and high level of length polymorphism

within and among populations, making it ideal markers for intra-species phylogenetic and

forensic analysis (Rowold and Herrera, 2003). With the appropriate statistical approaches,

even lesser number of forensic STR loci are powerful enough to reconstruct the recent

human phylogenies despite their relatively high mutation rates (Agrawal and Khan,

2005). Therefore, the four populations studied were compared with 57 other populations

around the world on the basis of their X-STR frequency. Only those populations were

selected which had at least frequency reported at 6 of the 9 loci (DXS101, DXS6789,

DXS7132, DXS7424, DXS8378, DXS9902, GATA172D05, GATA31E08 and HPRTB).

The populations were from Morocco and Madagascar (Poetsch et al., 2011), Polish

(Łuczak et al., 2011), Ghana (Poetsch et al., 2009), Columbia (Pico et al., 2008), Angola,

Mozambique and Uganda (Gomes et al., 2007b), Algeria (Bekada et al., 2010), Taiwan,

Vietnamese, Chinese, Filipinos, Thais and Taos (Hwa et al., 2011), Asian-American,

African-American, Caucasian-American and Hispanic-American (Diegoli and Coble,

2011), Ethnic Chinese (Sun et al., 2011), Brazil, Argentina, Costa Rica, Portugal and

Spain (Gusmão et al., 2009), Germany (Edelmann et al., 2001), India (Mukerjee et al.,

2010), Korea (Shin et al., 2005), Japan (Asamura et al., 2006), Italy (Turrina et al., 2007),

Mongol (Liu and Li, 2006), Native Argentina (Toscanini et al., 2009) and Nu-China (Gao

et al., 2007). Poptree was used to estimate Nei’s distances based on the frequency

distribution (Takezaki et al., 2010). Unrooted tree based on Neighbour-Joining method

was constructed. MEGA5 was used to visualize the tree (Tamura et al., 2011). As shown

in figure 4.22, Punjabi population is closer to Sindhi while Baluchi population is closer to

KP. Overall these four populations formed a cluster which is nearer to the populations

from India.

92

93

Figure 4.22: Phylogenetic Tree of 61 populations around the world. Ethnic Chinese (Dongxiang, Tu,

Salar, Bonan), Iberian (Paran-Brazil, BuenosAires-Argentina, Misiones-Argentina, RioNegro-Argentina,

Costa Rica, Cordoba-Argentina, EntresRios-Argentina, SaoPaolo-Brazil, RioJenerio-Brazil, MatoGross-

Brazil, Antioqia-Columbia, Northern Portugal, Central Portugal, Galacia-Spain, Cantabria-Spain), Native

Argentina (Colla and Toba), Pakistan (Punjab, Sindh, Balochistan and KP), India (Balmiki, Sakaldwip

Brahimin, Konkanastha Brahimin, Mahadev Koli, Kurumans, Munda, Tripuri, Riang).

Chapter 5

It was easier to know it than to explain why I know it.

Sherlock Holmes - A Study in Scarlet

94

5. Discussion

5.1. Comparison of Multiplex PCR Systems

A multiplex PCR system has been developed consisting primers for the

amplification of sub-200bp amplicons of 11 X-STRs, DXS101, DXS6789, DXS6793,

DXS7132, DXS7423, DXS7424, DXS8378, DXS9902, GATA172D05, GATA31E08 and

HPRTB along with the gender determining locus, Amelogenin. A lot of PCR multiplex

systems for X chromosomal STRs including from 3 to 15 X-STRs have been optimized

for paternity testing by the forensic community (Zarrabeitia et al., 2002, Athanasiadou et

al., 2003, Bini et al., 2005, Poetsch et al., 2005, Robino et al., 2006, Gomes et al., 2007a,

Pereira et al., 2007, Turrina et al., 2007, Tariq et al., 2008, Hwa et al., 2009, Liu et al.,

2012). Moreover, multiplex PCR assay with 12 X chromosomal STRs is developed for

forensic identification (Turrina et al., 2007). Out of 12 markers of that study, 9 markers

can be typed by using our multiplex with reduced size amplicons. While the 2 markers,

DXS6809 and DXS6807 were replaced in this study by DXS9902 and DXS6793 having

almost similar power of discriminations with markers replaced. The marker DXS7133

was not included in our 12-plex but it was included in the 4-plex used for haplotype

analysis of DXS7424-DXS101.

However, there is very little data available on multiplexing of X-linked miniSTRs.

After 2006, when the first 2 multiplexes consisting of ChrX miniSTRs were published

(Asamura et al., 2006), recently another work was reported about the development of 2

new multiplexes consisting of 8 and 10 miniSTRs (Diegoli and Coble, 2011). The 8-plex

shares 5 markers (DXS6789, GATA31E08, DXS101, DXS7424 and DXS9902) with our

system while the 10-plex shares 6 markers (GATA172D05, DXS8378, DXS7132,

95

HPRTB, DXS7423 and DXS9902) with our system. Owing to reasonable combined

power of discrimination of markers (female = 1.28 x 10-11, male = 2.64 x 10-7) and

number of markers in the multiplex (11 plus amelogenin) may prove it to be worthwhile

tool for deficiency paternity testing.

5.2. Allele Distribution and Population Comparisons

The allele distribution across the populations was evaluated to determine the

presence of population specific pattern to aid the forensic case work as recommended in

previous literature (Kashyap et al., 2006). As reported in the literature, all non-Africans

descended from a single group of humans that left Africa by a coastal route across the

mouth of the Red Sea to South Asia (Macaulay et al., 2005). One migrant group of no

more than a few hundred souls was forced out of its homeland by increasing salinity in

the Red Sea, some 85,000 years ago. All non-Africans today can trace their mitochondrial

DNA to one woman from this group (Oppenheimer, 2004). They rapidly spread around

the Indian Ocean towards the Antipodes, long before a small branch left a South Asian

colony, earlier on the trail, through Turkey, Bulgaria and Hungary to populate Europe

(Oppenheimer, 2003, Oppenheimer, 2012). It is therefore, believed on the basis of genetic

evidence, that all human beings in existence now descended from Africa (Wells and

Read, 2002). Among human populations, diversity of microsatellites is highest in Africa,

which supports the hypothesis of an African origin for humans (Bowcock et al., 1994).

Allelic range of 21-28 for marker DXS101 was observed in Baluchi population

while 21-29 was observed in KP, Punjabi and Sindhi populations. Full range of alleles 14-

34 for marker DXS101 is found only in populations of African descent (Bekada et al.,

2009, Diegoli and Coble, 2011). Populations from Europe lacks allele 14 except Germans

(Edelmann et al., 2001) and Portuguese (Gusmão et al., 2009). Asian populations have

96

allele distribution starting from allele 19 (Shin et al., 2005, Hwa et al., 2009, Diegoli and

Coble, 2011). Interestingly allele 14 is reported in Mongol population (Liu and Li, 2006)

while the rare allele 35 is reported only in the Iyenger population of India. Allele 15

which is absent in Asians but interestingly, Balmiki population which is the lowest caste

in the Indian caste system, has this allele (Mukerjee et al., 2010).

Allele distribution of Marker DXS6789 ranged from 14 to 24. African-American

(Diegoli and Coble, 2011), German (Edelmann et al., 2001), Taiwanese (Hwa et al.,

2009) and Hungarian (Zalan et al., 2007) are the only populations having 14-25 allele

distribution. Allele 18 is missing in populations from African and Latin America

(Gusmão et al., 2009, Martins et al., 2009, Bobillo et al., 2011, Diegoli and Coble, 2011),

European (Aler et al., 2007, Gusmão et al., 2009) and Indian (Mukerjee et al., 2010)

populations. German (Edelmann et al., 2001), Italian (Turrina et al., 2007), Mongolian

(Liu and Li, 2006) and East Asian (Shin et al., 2005, Hwa et al., 2009, Hwa et al., 2011)

populations have this allele.

The marker DXS6793 is only studied in Chinese population in which alleles 14

and 15 are absent (Jia et al., 2004). However, allele 14 and 15 were observed in Punjabi

population which had allele distribution of 9-15. Similarly, allele 14 was present in Sindhi

population which had allele distribution of 9-14.

In the case of DXS7132, allele range 12-18 was found in Punjabi and KP

populations while 12-17 was observed in Sindhi and Baluchi populations. Allele range

10-18 is present in African populations (Poetsch et al., 2009, Diegoli and Coble, 2011).

Apart from African populations, allele 10 is present in Koreans (Shin et al., 2005) and

Asian-Americans (Diegoli and Coble, 2011). Europeans lack allele 10 and 18 (Edelmann

et al., 2001, Aler et al., 2007, Turrina et al., 2007, Zalan et al., 2007, Łuczak et al., 2011).

97

The rare allele 6 have been reported in Hungarians (Zalan et al., 2007), allele 8 in

Taiwanese (Hwa et al., 2009) and US population (Gomes et al., 2007a), allele 9 in

Spanish (Zarrabeitia et al., 2006) and Portuguese populations (Gusmão et al., 2009).

Similarly, microvariant alleles (15.3, 16.3, 17.3 and 18.3) have been observed only in

individuals from Argentina, Brazil and Columbia; therefore, a Native American origin is

postulated for these alleles (Gomes et al., 2009). The rare allele 19 was reported in

Hungarian (Zalan et al., 2007), Taiwanese (Hwa et al., 2009), Spanish (Zarrabeitia et al.,

2006) and Colombian populations (Pico et al., 2008). Allele 13 and 14 are the most

frequent alleles in major populations.

In the case of DXS7423, the allele range 12-16 was observed in KP and Baluchi

populations while allele range 12-17 observed in Punjabi and Sindhi populations. Allele

10 and 11 is only found in Portuguese population (Pereira et al., 2007) and rare allele 19

is reported in Hungarian population only (Zalan et al., 2007). Allele 13, 14 and 15 were

the frequent alleles in major populations. The rare allele 8 at marker DXS7423 has been

observed only in populations with African descent suggesting an African specificity

(Diegoli and Coble, 2011). Allele 9 is observed in Germans (Edelmann et al., 2001) while

allele 18 is only observed in Moroccan population (Poetsch et al., 2011).

For marker DXS7424, allele range 12-18 was observed in Baluchi, KP and Sindhi

populations. African-American and Caucasian-American (Diegoli and Coble, 2011) as

well as German (Edelmann et al., 2001) populations have allele 9. The alleles 15 and 16

were most frequent alleles of this marker in all populations.

For marker DXS8378, allelic range 9-14 was observed in KP and Baluchi

populations, 9-13 for Sindhi population and 10-14 for Punjabi population. This

distribution is similar to Indian (Mukerjee et al., 2010), African (Gomes et al., 2007b)

98

and Costa Rican (Gusmão et al., 2009) populations. Rare allele 7 is present in Spanish

population (Aler et al., 2007). Allele 14 is absent in Asian populations except Taiwanese

(Hwa et al., 2009). Alleles 10, 11 and 12 of DXS8378 were most abundant in all major

populations including Pakistani populations except Korean (Shin et al., 2005) and ethnic

Chinese (Sun et al., 2011) which showed allele 9 as a most frequent allele.

For DXS9902, the allele distribution in Sindhi and Punjabi populations was 9-13

which was similar to that of US Caucasian (Diegoli and Coble, 2011), Germany

(Edelmann et al., 2001), Vietnamese and Taos (Hwa et al., 2011). Allele 14 is present in

Asians (Diegoli and Coble, 2011, Hwa et al., 2011) as well as Brazilian (Martins et al.,

2009) populations. Allele 7 is only present in Korean (Shin et al., 2005) and Taiwanese

(Hwa et al., 2009).

The allele distribution of marker GATA172D05 in Baluchi, KP populations was

similar to Italian (Turrina et al., 2007), Taos and Filipinos (Hwa et al., 2011) and some

Indian (Mukerjee et al., 2010) populations, where allele 7 of this marker is absent. But

the same allele was found in Punjabi and Sindhi along with all other populations like US

(Diegoli and Coble, 2011), Iberian and Latin American (Gusmão et al., 2009), African

(Gomes et al., 2007b), Korean (Shin et al., 2005), Portuguese (Pereira et al., 2007),

Spanish (Zarrabeitia et al., 2006), German (Edelmann et al., 2001) and Colombian

Population (Pico et al., 2008). Allele 5 of GATA172D05 was only reported for German

population (Edelmann et al., 2001). Except Germans (Edelmann et al., 2001) and

Caucasian-Americans (Diegoli and Coble, 2011), allele 13 is present only in Latin

American populations (Gusmão et al., 2009) while allele 14 is reported only in

Columbians (Pico et al., 2008). The allele 10 of GATA172D05 was the most frequent

allele in all the above populations.

99

Allele distribution of GATA31E08 ranged from allele 7 to 12 in Sindhi, KP and

Baluchi populations which is similar to Italy (Turrina et al., 2007) and Korean (Shin et

al., 2005) while it was 7-13 in Punjabi population. Allele 6 is only reported in Koreans

(Shin et al., 2005) while allele 16 is reported only in Asian-Americans and African-

Americans (Diegoli and Coble, 2011).

In the case of marker HPRTB, the allele range 11-16 was observed in Baluchi and

Sindhi populations, which was similar to Asian-American (Diegoli and Coble, 2011),

African (Gomes et al., 2007b) and some Indian populations (Mukerjee et al., 2010).

Allele 6 is observed only in population from Madagascar (Poetsch et al., 2011) while

allele 7 is reported in Ugandans (Gomes et al., 2007b). Allele 18 is observed only in

Columbians (Pico et al., 2008) and Ghanaians (Poetsch et al., 2009). The allele range 12-

17 was observed in KP population while 11-15 was observed in Sindhi population. The

rare allele 18 is found in Colombian population (Pico et al., 2008). Alleles 12 and 13

were the most frequent alleles reported in all studied populations along with Punjabi and

Sindhi populations but alleles 13 and 14 were most frequent in KP and Baluchi

populations.

5.3. Linkage Disequilibrium and Haplotype Analysis

The exact test for LD was performed for all pairs of the 11 markers in female

samples from the Pakistani population. In Baluchi, KP and Sindhi populations, no pair of

markers showed any linkage disequilibrium. But in the Punjabi population, DXS6793

showed linkage with DXS8378 and DXS7423 while DXS7423 showed linkage with

DXS8378. Also DXS9902 showed linkage with HPRTB but after Bonferroni’s

correction, the LD values were not significant. Since these pairs of markers are not

located in close proximity or within the same linkage group on the X chromosome,

100

therefore, it is more likely that it is populations sampling effects or substructure and do

not indicate true linkage.

Linkage between pair of markers DXS7424 and DXS101 in any of the 4

populations that we studied could not be observed which is consistent with Korean

population (Lee et al., 2004) but significant LD was reported in Germans for same pair of

markers (Edelmann et al., 2002a). However, a previous study on generic Pakistani

population did not report any such linkage (Tariq et al., 2009). The linkage between

DXS8378 and DXS7423 has been reported in Waorani population of Ecuador which is an

extremely small and highly isolated population of about 3000 individuals (Baeta et al.,

2012). The differences among populations in terms of LD are due to factors such as

mutation, selection, random drift, the founder effect and population admixture

(Chakravarti, 1999).

In this cluster, 9 alleles for marker DXS101 were identified while 8 alleles were

identified for DXS7424. Therefore, 72 different combinations are possible but we

identified only 46 different haplotypes through genotyping. Haplotype 16-23 was the

most frequent in Baluchi and KP populations. In Sindhi population, haplotype 16-24 was

the most frequent. However, haplotype 15-24 is reported to be the most frequent in

German population (Edelmann et al., 2002a).

5.4. Phylogenetic Studies

Human genetic diversity is shaped by both demographic and biological factors

which is consistent with the hypothesis of a serial founder effect with a single origin in

sub-Saharan Africa. A pattern of ancestral allele frequency distributions that reflects

variation in population dynamics among geographic regions has been reported (Li et al.,

101

2008). In recent years, microsatellites based population genetics studies are on the rise

and these markers may be considered as good neutral Mendelian markers, at least for

intra-specific population studies (Jarne and Lagoda, 1996).

Baluchi, KP, Punjabi and Sindhi populations were compared with 57 other

populations around the world on the basis of their X-STR frequencies. It has been

reported in the literature that polymorphic microsatellites allow trees of human

individuals to be constructed that reflect their geographic origin with remarkable accuracy

(Bowcock et al., 1994). However, the irregular mutation pattern and upper limit of repeat

numbers are some of the problems in using microsatellite loci. Moreover, the long-term

stability of microsatellite loci is affected by the fact that loci that are highly polymorphic

in some species or populations may be monomorphic or less polymorphic in others. The

reason for this polymorphism between different populations is yet unknown. For these

reasons, caution is necessary in extrapolating results from distantly related populations.

Locus to locus mutation rates of microsatellites are variable and may increase the

variance of distance values. However, simultaneous examination of many polymorphic

loci is relatively easy. Therefore, these loci are very useful, at least for the study of

closely related populations (Takezaki and Nei, 1996).

The Baluchi populations clustered together with KP while the Punjabi population

with Sindhi. Together these 4 populations formed a cluster which was closer to the

populations from India. The results concur with a previous pyhlogenetic study of some

Pakistani ethnic populations on the basis of Y-Chromosomal STRs (Qamar et al., 2002).

Another study showed that Pakistani populations tend to be closer to Indian populations

on the basis of Y-Chromosomal (Underhill et al., 2000) and Autosomal STRs (Rakha et

al., 2009).

102

The Indian and Pakistani populations cluster seems closer to Western populations

especially from Latin American origin than to the Eastern populations. The Y-STR based

study also showed similar results (Qamar et al., 2002). The East Asian populations

clustered together with Asian-Americans and then with the other US populations. Similar

results have been reported previously where populations from US were shown closer to

population from Asia (Takezaki and Nei, 2008). This closeness is surprising as Genetic

relationships, as ascertained at least by Y-STRs, are dictated primarily by geographic

proximity rather than by remote linguistic origin (Ayub et al., 2003). Similarly

populations from Brazil, Argentina, Columbia and Costa Rica showed proximity with

populations from Spain and Portugal which are related by history and language but are

situated geographically on two different continents. However, there is no reason to

believe that actual molecular phylogenies would be convergent between different

molecules and would therefore represent populations’ history. In any case, our present

knowledge is obviously insufficient to reconstruct our genetic past, especially on the long

term (Langaney et al., 1992). As more and more data about phylogenetics and distances

of different populations become available, it will be helpful in ascertaining the behaviour

of X-STRs and their role in evolution of humans.

5.5. Conclusions

The proposed 12-plex system is the first in the field of ChrX miniSTRs. There

exist two other publications describing the use of mini-X chromosomal STRs. Indeed, we

adapted some of the markers in our miniplex from this work. However, the first

miniplexes published lacked a marker to represent linkage group 3, while we have

included markers to represent all the 4 linkage groups of X-chromosome. Moreover, the

second set of 2 published miniplexes contained 8 and 10 markers while our system

103

includes 11 highly polymorphic X-STRs. In conclusion, the parallel amplification of 11

informative X-STRs and sex determining locus amelogenin proved this multiplex PCR

assay as reliable tool for paternity testing. This multiplex can be a powerful tool in

forensic case work and can be used in parentage analysis especially in the identification

of female DNA profile in a mixture. It would be cost effective, less laborious and less

time consuming. Population characteristics of human at the genetic level are integral to

both population genetics and forensic biology and to the best of our knowledge, the

multiplex PCR system, consisting of 11 X-Chromosomal miniSTR loci as well as

amelogenin, developed in this study has not been previously reported anywhere in the

world. This multiplex system is highly recommended to be used along with the current

battery of forensic markers especially on compromised samples as this system high

specificity and low DNA requirement to give high quality DNA profiles. Hence, this

system can be used for forensic casework involving the Punjabi, Sindhi, Baluchi and KP

populations.

The present study did not include a comparison of miniSTRs with their larger size

counterparts. In the future, concordance between these miniSTRs and conventional STRs

present in commercial kits, like Argus X12, will be studied. This study demonstrates the

potential of this multiplex for routine forensic casework and inquiries into human

evolutionary history.

Chapter 6

There is nothing new under the sun. It has all been done before.

Sherlock Holmes - A Study in Scarlet

104

6. References

Abràmoff, M.D., Magalhães, P.J. and Ram, S.J. (2004). Image processing with ImageJ.

Biophotonics Int, 11: 36-42.

Agrawal, S. and Khan, F. (2005). Reconstructing recent human phylogenies with forensic

STR loci: A statistical approach. BMC Genetics, 6: 47.

Aler, M., Sánchez-Diz, P., Gomes, I., Gisbert, M., Carracedo, A., Amorim, A. and

Gusmão, L. (2007). Genetic data of 10 X-STRs in a Spanish population sample.

Forensic Sci Int, 173: 193-196.

Andrade, L., Bento, A.M., Serra, A., Carvalho, M., Gamero, J.J., Oliveira, C., Batista, L.,

Lopes, V., Balsa, F., Corte-Real, F. and Anjos, M.J. (2008). AmpFlSTR®

MiniFiler™ PCR amplification kit: The new miniSTR multiplex kit. Forensic Sci

Int Genet (Supp), 1: 89-91.

Aquino, J., Peixe, C., Silva, D., Tavares, C. and de Carvalho, E.F. (2009). A X-

chromosome STR hexaplex as a powerful tool in deficiency paternity cases.

Forensic Sci Int Genet (Supp), 2: 45-46.

Asamura, H., Sakai, H., Kobayashi, K., Ota, M. and Fukushima, H. (2006). MiniX-STR

multiplex system population study in Japan and application to degraded DNA

analysis. Int J Legal Med, 120: 174-181.

Athanasiadou, D., Stradmann-Bellinghausen, B., Rittner, C., Alt, K.W. and Schneider,

P.M. (2003). Development of a quadruplex PCR system for the genetic analysis of

X-chromosomal STR loci. Int Cong Ser, 1239: 311-314.

Ayub, Q., Mansoor, A., Ismail, M., Khaliq, S., Mohyuddin, A., Hameed, A., Mazhar, K.,

Rehman, S., Siddiqi, S. and Papaioannou, M. (2003). Reconstruction of human

evolutionary tree using polymorphic autosomal microsatellites. American J

Physical Antrhopol, 122: 259-268.

Baeta, M., Núñez, C., Aznar, J.M., Sosa, C., Casalod, Y., Bolea, M., González-Andrade,

F., de Pancorbo, M.M. and Martínez-Jarreta, B. (2012). Analysis of 10 X-STRs in

three population groups from Ecuador. Forensic Sci Int Genet.

Baker, T.A., Watson, J.D. and Bell, S.P. (2008). Molecular biology of the gene: Pearson,

Cummings.

Bär, W., Brinkmann, B., Budowle, B., Carracedo, A., Gill, P., Lincoln, P., Mayr, W. and

Olaisen, B. (1997). DNA recommendations. Int J Legal Med, 110: 175-176.

105

Barbaro, A. and Cormaci, P. (2006). X-STR typing for an identification casework. Int

Cong Ser, 1288: 513-515.

Barron, A.E. and Blanch, H.W. (1995). DNA separations by slab gel and capillary

electrophoresis: theory and practice. Separation and Purification Methods, 24: 1-

118.

Bekada, A., Benhamamouch, S., Boudjema, A., Fodil, M., Menegon, S., Torre, C. and

Robino, C. (2010). Analysis of 21 X-chromosomal STRs in an Algerian

population sample. Int J Legal Med, 124: 287-294.

Bekada, A., Benhamamouch, S., Boudjema, A., Fodil, M., Menegon, S., Torre, C. and

Robino, C. (2009). Analysis of 12 X-chromosomal STRs in an Algerian

population sample. Forensic Sci Int Genet (Supp), 2: 400-401.

Bender, K., Farfán, M.J. and Schneider, P.M. (2004). Preparation of degraded human

DNA under controlled conditions. Forensic Sci Int, 139: 135-140.

Bini, C., Ceccardi, S., Ferri, G., Pelotti, S., Alu, M., Roncaglia, E., Beduschi, G.,

Caenazzo, L., Ponzano, E., Tasinato, P., Turchi, C., Buscemi, L., Mazzanti, M.,

Tagliabracci, A., Toni, C., Spinetti, I., Domenici, R. and Presciuttini, S. (2005).

Development of a heptaplex PCR system to analyse X-chromosome STR loci

from five Italian population samples. A collaborative study. Forensic Sci Int, 153:

231-6.

Bobillo, C., Sala, A., Gusmão, L. and Corach, D. (2011). Genetic analysis of 10 X-STRs

in Argentinian population. Forensic Sci Int Genet, 5: e14-e16.

Botstein, D., White, R.L., Skolnick, M. and Davis, R.W. (1980). Construction of a genetic

linkage map in man using restriction fragment length polymorphisms. Am J Hum

Genet, 32: 314.

Bourque, S.N., Valero, J.R., Mercier, J., Lavoie, M.C. and Levesque, R.C. (1993).

Multiplex polymerase chain reaction for detection and differentiation of the

microbial insecticide Bacillus thuringiensis. Appl Environ Microbiol, 59: 523-7.

Boutrand, L., Egyed, B., Furedi, S., Mommers, N., Mertens, G. and Vandenberghe, A.

(2001). Variations in primer sequences are the origin of allele drop-out at loci

D13S317 and CD4. Int J Legal Med, 114: 295-7.

Bowcock, A., Ruiz-Linares, A., Tomfohrde, J., Minch, E., Kidd, J. and Cavalli-Sforza,

L.L. (1994). High resolution of human evolutionary trees with polymorphic

microsatellites. Nature, 368: 455-457.

106

Branicki, W., Wolanska-Nowak, P., Parys-Proszek, A. and Kupiec, T. (2008).

Application of the Mentype Argus X-8 Kit to forensic casework. Problems of

Forensic Sci, 73: 53-64.

Budowle, B., Waye, J.S., Shutler, G.G. and Baechtel, F.S. (1990). Hae III--a suitable

restriction endonuclease for restriction fragment length polymorphism analysis of

biological evidence samples. J Forensic Sci, 35: 530-6.

Butler, J.M. (2012). miniSTR Homepage. National Institute of Standards and

Technology.

Butler, J.M. (2011). Advanced Topics in Forensic DNA Typing: Methodology: Academic

Press.

Butler, J.M. (2009). Fundamentals of Forensic DNA Typing, New York: Academic Press.

Butler, J.M. (2007). Short tandem repeat typing technologies used in human identity

testing. Biotechniques, 43: 2-5.

Butler, J.M., Vallone, P.M. and Coble, M.D. (2007). STRs vs. SNPs: thoughts on the

future of forensic DNA testing. Forensic Sci Med Pathol, 3: 200–205.

Butler, J.M. (2006a). Genetics and genomics of core short tandem repeat loci used in

human identity testing. J Forensic Sci, 51: 253-265.

Butler, J.M. (2006b). MiniSTRs: past, present, and future. Forensic News. Applied

Biosystems.

Butler, J.M. (2005). Forensic DNA Typing: Biology, Technology, and Genetics of STR

Markers: Elsevier Science & Technology Books.

Butler, J.M., Buel, E., Crivellente, F. and McCord, B.R. (2004). Forensic DNA typing by

capillary electrophoresis using the ABI Prism 310 and 3100 genetic analyzers for

STR analysis. Electrophoresis, 25: 1397-412.

Butler, J.M., Shen, Y. and McCord, B.R. (2003). The development of reduced size STR

amplicons as tools for analysis of degraded DNA. J Forensic Sci, 48: 1054-64.

Butler, J.M. (2001). Short Tandem Repeat Analysis for Human Identity Testing. Current

Protocols in Human Genetics. John Wiley & Sons, Inc.

Butler, J.M., Li, J., Shaler, T.A., Monforte, J.A. and Becker, C.H. (1999). Reliable

genotyping of short tandem repeat loci without an allelic ladder using time-of-

flight mass spectrometry. Int J Legal Med, 112: 45-49.

Butler, J.M., McCord, B.R., Jung, J.M., Lee, J.A., Budowle, B. and Allen, R.O. (1995).

Application of dual internal standards for precise sizing of polymerase chain

reaction products using capillary electrophoresis. Electrophoresis, 16: 974-80.

107

Calafell, F., Shuster, A., Speed, W.C., Kidd, J.R. and Kidd, K.K. (1998). Short tandem

repeat polymorphism evolution in humans. Eur J Hum Genet, 6: 38-49.

Caskey, C.T. and Hammond, H.A. (1992). Forensic use of short tandem repeats via PCR.

Adv Forensic Haemogenetics, 4: 18-25.

Castella, V., Dimo-Simonin, N., Morerod, M.L. and Mangin, P. (2006). In-house

validation of the PCR amplification kit «Mentype® Argus X-UL». Int Cong Ser,

1288: 310-312.

Chakraborty, R. (1992). Sample size requirements for addressing the population genetic

issues of forensic use of DNA typing. Human Biology, 64: 141.

Chakravarti, A. (1999). Population genetics—making sense out of sequence. Nature

Genet, 21: 56-60.

Chamberlain, J.S., Gibbs, R.A., Ranier, J.E., Nguyen, P.N. and Caskey, C.T. (1988).

Deletion screening of the Duchenne muscular dystrophy locus via multiplex DNA

amplification. Nucleic Acids Res, 16: 11141-56.

Chien, A., Edgar, D.B. and Trela, J.M. (1976). Deoxyribonucleic acid polymerase from

the extreme thermophile Thermus aquaticus. J Bacteriol, 127: 1550-7.

Chou, Q., Russell, M., Birch, D.E., Raymond, J. and Bloch, W. (1992). Prevention of pre-

PCR mis-priming and primer dimerization improves low-copy-number

amplifications. Nucleic Acids Res, 20: 1717-23.

Chung, D.T., Drabek, J., Opel, K.L., Butler, J.M. and McCord, B.R. (2004). A study on

the effects of degradation and template concentration on the amplification

efficiency of the STR Miniplex primer sets. J Forensic Sci, 49: 733-40.

Cipollaro, M., Di Bernardo, G., Galano, G., Galderisi, U., Guarino, F., Angelini, F. and

Cascino, A. (1998). Ancient DNA in human bone remains from Pompeii

archaeological site. Biochem Biophys Res Commun, 247: 901-4.

Clayton, T.M., Whitaker, J.P. and Maguire, C.N. (1995). Identification of bodies from the

scene of a mass disaster using DNA amplification of short tandem repeat (STR)

loci. Forensic Sci Int, 76: 7-15.

Coble, M.D. and Butler, J.M. (2005). Characterization of new miniSTR loci to aid

analysis of degraded DNA. J Forensic Sci, 50: 43-53.

Coovadia, A. (2008). And the helix may set you free. J Assoc Genet Technol, 34: 11-2.

Creighton, T.E. (1999). Encyclopedia of molecular biology: John Wiley and Sons.

108

Crow, J., Budowle, B., Erlich, H., Lederberg, J., Reeder, D., Schumm, J., Thompson, E.,

Walsh, P. and Weir, B. (2000). ‘The Future of Forensic DNA Testing: Predictions

of the Research and Development Working Group. National Institute of Justice.

D'Aquila, R.T., Bechtel, L.J., Videler, J.A., Eron, J.J., Gorczyca, P. and Kaplan, J.C.

(1991). Maximizing sensitivity and specificity of PCR by pre-amplification

heating. Nucleic Acids Res, 19: 3749.

Desjardins, P.R. and Conklin, D.S. (2001). Microvolume Quantitation of Nucleic Acids.

Current Protocols in Molecular Biology. John Wiley & Sons, Inc.

Desmarais, D., Zhong, Y., Chakraborty, R., Perreault, C. and Busque, L. (1998).

Development of a highly polymorphic STR marker for identity testing purposes at

the human androgen receptor gene (HUMARA). J Forensic Sci, 43: 1046-1049.

Diegoli, T.M. and Coble, M.D. (2011). Development and characterization of two mini-X

chromosomal short tandem repeat multiplexes. Forensic Sci Int Genet, 5: 415-21.

Dixon, L.A., Dobbins, A.E., Pulker, H.K., Butler, J.M., Vallone, P.M., Coble, M.D.,

Parson, W., Berger, B., Grubwieser, P., Mogensen, H.S., Morling, N., Nielsen, K.,

Sanchez, J.J., Petkovski, E., Carracedo, A., Sanchez-Diz, P., Ramos-Luis, E.,

Brion, M., Irwin, J.A., Just, R.S., Loreille, O., Parsons, T.J., Syndercombe-Court,

D., Schmitter, H., Stradmann-Bellinghausen, B., Bender, K. and Gill, P. (2006).

Analysis of artificially degraded DNA using STRs and SNPs--results of a

collaborative European (EDNAP) exercise. Forensic Sci Int, 164: 33-44.

Drabek, J., Chung, D.T., Butler, J.M. and McCord, B.R. (2004). Concordance study

between Miniplex assays and a commercial STR typing kit. J Forensic Sci, 49:

859-60.

Edelmann, J., Lutz-Bonengel, S., Naue, J. and Hering, S. (2012). X-chromosomal

haplotype frequencies of four linkage groups using the Investigator Argus X-12

Kit. Forensic Sci Int Genet, 6: e24.

Edelmann, J., Hering, S., Augustin, C., Kalis, S. and Szibor, R. (2010). Validation of six

closely linked STRs located in the chromosome X centromere region. Int J Legal

Med, 124: 83-7.

Edelmann, J., Hering, S., Augustin, C. and Szibor, R. (2008). Characterisation of the STR

markers DXS10146, DXS10134 and DXS10147 located within a 79.1 kb region at

Xq28. Forensic Sci Int Genet, 2: 41-6.

Edelmann, J. and Szibor, R. (2005). Validation of the X-linked STR DXS6801. Forensic

Sci Int, 148: 219-20.

109

Edelmann, J., Lessig, R., Klintschar, M. and Szibor, R. (2004). Advantages of X-

chromosomal microsatellites in deficiency paternity testing: presentation of cases.

Int Cong Ser, 1261: 257-259.

Edelmann, J. and Szibor, R. (2003). The X-linked STRs DXS7130 and DXS6803.

Forensic Sci Int, 136: 73-5.

Edelmann, J., Hering, S., Kuhlisch, E. and Szibor, R. (2002a). Validation of the STR

DXS7424 and the linkage situation on the X-chromosome. Forensic Sci Int, 125:

217-22.

Edelmann, J., Deichsel, D., Hering, S., Plate, I. and Szibor, R. (2002b). Sequence

variation and allele nomenclature for the X-linked STRs DXS9895, DXS8378,

DXS7132, DXS6800, DXS7133, GATA172D05, DXS7423 and DXS8377.

Forensic Sci Int, 129: 99-103.

Edelmann, J., Hering, S., Michael, M., Lessig, R., Deischel, D., Meier-Sundhausen, G.,

Roewer, L., Plate, I. and Szibor, R. (2001). 16 X-chromosome STR loci frequency

data from a German population. Forensic Sci Int, 124: 215-8.

Edwards, A., Hammond, H.A., Jin, L., Caskey, C.T. and Chakraborty, R. (1992). Genetic

variation at five trimeric and tetrameric tandem repeat loci in four human

population groups. Genomics, 12: 241-253.

Edwards, A., Civitello, A., Hammond, H.A. and Caskey, C.T. (1991). DNA typing and

genetic mapping with trimeric and tetrameric tandem repeats. Am J Hum Genet,

49: 746-56.

Edwards, M.C. and Gibbs, R.A. (1994). Multiplex PCR: advantages, development, and

applications. PCR Methods Appl, 3: S65-75.

El-Harouny, E., El-Dakroory, S., Attalla, S., Hasan, N. and Hassab El-Nabi, S. (2009).

The relationship between postmortem interval and DNA degradation in different

tissues of drowned rats. Internet J Forensic Sci, 4.

Eliasova, I., Mazura, I. and Smejtek, L. (2010). DNA analysis of ancient skeletal remains.

Folia Biol (Praha), 56: 47-50.

Esteve Codina, A., Niederstatter, H. and Parson, W. (2009). "GenderPlex" a PCR

multiplex for reliable gender determination of degraded human DNA samples and

complex gender constellations. Int J Legal Med, 123: 459-64.

Evison, M.P., Smillie, D.M. and Chamberlain, A.T. (1997). Extraction of single-copy

nuclear DNA from forensic specimens with a variety of postmortem histories. J

Forensic Sci, 42: 1032-8.

110

Excoffier, L., Laval, G. and Schneider, S. (2005). Arlequin (version 3.0): an integrated

software package for population genetics data analysis. Evol Bioinform Online, 1:

47.

Faerman, M., Filon, D., Kahila, G., Greenblatt, C.L., Smith, P. and Oppenheim, A.

(1995). Sex identification of archaeological human remains based on

amplification of the X and Y amelogenin alleles. Gene, 167: 327-32.

Fan, J.B., Oliphant, A., Shen, R., Kermani, B.G., Garcia, F., Gunderson, K.L., Hansen,

M., Steemers, F., Butler, S.L., Deloukas, P., Galver, L., Hunt, S., McBride, C.,

Bibikova, M., Rubano, T., Chen, J., Wickham, E., Doucet, D., Chang, W.,

Campbell, D., Zhang, B., Kruglyak, S., Bentley, D., Haas, J., Rigault, P., Zhou, L.,

Stuelpnagel, J. and Chee, M.S. (2003). Highly parallel SNP genotyping. Cold

Spring Harb Symp Quant Biol, 68: 69-78.

Feuk, L., Carson, A.R. and Scherer, S.W. (2006). Structural variation in the human

genome. Nat Rev Genet, 7: 85-97.

Flicek, P., Amode, M.R., Barrell, D., Beal, K., Brent, S., Carvalho-Silva, D., Clapham, P.,

Coates, G., Fairley, S. and Fitzgerald, S. (2012). Ensembl 2012. Nucleic Acids

Res, 40: D84-D90.

Fujita, P.A., Rhead, B., Zweig, A.S., Hinrichs, A.S., Karolchik, D., Cline, M.S.,

Goldman, M., Barber, G.P., Clawson, H. and Coelho, A. (2011). The UCSC

genome browser database: update 2011. Nucleic Acids Res, 39: D876-D882.

Gao, S., Qiao, K., Rakha, A., Lai, J. and Li, S. (2007). Allele frequencies for 10 X-STR

loci in Nu population of Yunnan, China. Leg Med (Tokyo), 9: 284-6.

Gehrig, C. and Teyssier, A. (2006). Validation of the Mentype® Argus X-UL kit. Int

Cong Ser, 1288: 325-327.

Gill, P., Werrett, D.J., Budowle, B. and Guerrieri, R. (2004). An assessment of whether

SNPs will replace STRs in national DNA databases--joint considerations of the

DNA working group of the European Network of Forensic Science Institutes

(ENFSI) and the Scientific Working Group on DNA Analysis Methods

(SWGDAM). Sci Justice, 44: 51-3.

Gomes, C., Magalhaes, M., Alves, C., Amorim, A., Pinto, N. and Gusmao, L. (2012).

Comparative evaluation of alternative batteries of genetic markers to complement

autosomal STRs in kinship investigations: autosomal indels vs. X-chromosome

STRs. Int J Legal Med: 1-5.

111

Gomes, I., Pereira, R., Mayr, W., Amorim, A., Carracedo, A. and Gusmão, L. (2009).

Evaluation of DXS9902, DXS7132, DXS6809, DXS7133, and DXS7423 in

humans and chimpanzees: sequence variation, repeat structure, and nomenclature.

Int J Legal Med, 123: 403-412.

Gomes, I., Prinz, M., Pereira, R., Meyers, C., Mikulasovich, R.S., Amorim, A.,

Carracedo, A. and Gusmao, L. (2007a). Genetic analysis of three US population

groups using an X-chromosomal STR decaplex. Int J Legal Med, 121: 198-203.

Gomes, I., Alves, C., Maxzud, K., Pereira, R., Prata, M.J., Sánchez-Diz, P., Carracedo,

A., Amorim, A. and Gusmão, L. (2007b). Analysis of 10 X-STRs in three African

populations. Forensic Sci Int Genet, 1: 208-211.

Gomes, I., Alves, C., Maxzud, K., Pereira, R., Prata, M.J., Sanchez-Diz, P., Carracedo,

A., Amorim, A. and Gusmao, L. (2007c). Analysis of 10 X-STRs in three African

populations. Forensic Sci Int Genet, 1: 208-11.

Graham, E.A.M. (2005). Mini-STRs. Forensic Sci Med Pathol, 1: 65-67.

Grossman, P.D. and Soane, D.S. (1991). Experimental and theoretical studies of DNA

separations by capillary electrophoresis in entangled polymer solutions.

Biopolymers, 31: 1221-8.

Grubwieser, P., Muhlmann, R., Berger, B., Niederstatter, H., Pavlic, M. and Parson, W.

(2006). A new "miniSTR-multiplex" displaying reduced amplicon lengths for the

analysis of degraded DNA. Int J Legal Med, 120: 115-20.

Grubwieser, P., Mühlmann, R. and Parson, W. (2003). New sensitive amplification

primers for the STR locus D2S1338 for degraded casework DNA. Int J Legal

Med, 117: 185-188.

Guo, S.W. and Thompson, E.A. (1992). Performing the exact test of Hardy-Weinberg

proportion for multiple alleles. Biometrics: 361-372.

Gusmão, L., Sánchez-Diz, P., Alves, C., Gomes, I., Zarrabeitia, M.T., Abovich, M.,

Atmetlla, I., Bobillo, C., Bravo, L. and Builes, J. (2009). A GEP-ISFG

collaborative study on the optimization of an X-STR decaplex: data on 15 Iberian

and Latin American populations. Int J Legal Med, 123: 227-234.

Haack, K., Hummel, S. and Hummel, B. (2000). Ancient DNA fragments longer than 300

bp. Anthropol Anz, 58: 51-6.

Haas-Rochholz, H. and Weiler, G. (1997). Additional primer sets for an amelogenin gene

PCR-based DNA-sex test. Int J Legal Med, 110: 312-5.

112

Hanson, E.K. and Ballantyne, J. (2004). A highly discriminating 21 locus Y-STR"

megaplex" system designed to augment the minimal haplotype loci for forensic

casework. J Forensic Sci, 49: 40.

Haqqi, T.M., Sarkar, G., David, C.S. and Sommer, S.S. (1988). Specific amplification

with PCR of a refractory segment of genomic DNA. Nucleic Acids Res, 16:

11844.

Hardenbol, P., Baner, J., Jain, M., Nilsson, M., Namsaraev, E.A., Karlin-Neumann, G.A.,

Fakhrai-Rad, H., Ronaghi, M., Willis, T.D., Landegren, U. and Davis, R.W.

(2003). Multiplexed genotyping with sequence-tagged molecular inversion

probes. Nat Biotechnol, 21: 673-8.

Hartzell, B., Graham, K. and McCord, B. (2003). Response of short tandem repeat

systems to temperature and sizing methods. Forensic Sci Int, 133: 228-34.

Hauge, X.Y. and Litt, M. (1993). A study of the origin of 'shadow bands' seen when

typing dinucleotide repeat polymorphisms by the PCR. Hum Mol Genet, 2: 411-

415.

Hearne, C. and Todd, J. (1991). Tetranucleotide repeat polymorphism at the HPRT locus.

Nucleic Acids Res, 19: 5450-5450.

Hellmann, A., Rohleder, U., Schmitter, H. and Wittig, M. (2001). STR typing of human

telogen hairs--a new approach. Int J Legal Med, 114: 269-73.

Hering, S., Edelmann, J., Augustin, C., Dressler, J. and Szibor, R. (2008). Forensische

Validierung des STR-Markers DXS10160: Strukturanalyse, Allelnomenklatur,

Populations, und Kopplungsdaten. Rechtsmedizin, 18: 333.

Hering, S., Augustin, C., Edelmann, J., Heidel, M., Dressler, J., Rodig, H., Kuhlisch, E.

and Szibor, R. (2006). DXS10079, DXS10074 and DXS10075 are STRs located

within a 280-kb region of Xq12 and provide stable haplotypes useful for complex

kinship cases. Int J Legal Med, 120: 337-45.

Hering, S., Brundirs, N., Kuhlisch, E., Edelmann, J., Plate, I., Benecke, M., Van, P.H.,

Michael, M. and Szibor, R. (2004). DXS10011: studies on structure, allele

distribution in three populations and genetic linkage to further q-telomeric

chromosome X markers. Int J Legal Med, 118: 313-9.

Hill, C.R., Kline, M.C., Mulero, J.J., Lagace, R.E., Chang, C.W., Hennessy, L.K. and

Butler, J.M. (2007). Concordance study between the AmpFlSTR MiniFiler PCR

amplification kit and conventional STR typing kits. J Forensic Sci, 52: 870-3.

Hjerten, S. (1967). Free zone electrophoresis. Chromatogr Rev, 9: 122-219.

113

Holland, M.M., Cave, C.A., Holland, C.A. and Bille, T.W. (2003). Development of a

quality, high throughput DNA analysis procedure for skeletal samples to assist

with the identification of victims from the World Trade Center attacks. Croat Med

J, 44: 264-72.

Hwa, H.L., Lee, J.C.I., Chang, Y.Y., Yin, H.Y., Chen, Y.H., Tseng, L.H., Su, Y.N. and

Ko, T.M. (2011). Genetic analysis of eight population groups living in Taiwan

using a 13 X-chromosomal STR loci multiplex system. Int J Legal Med, 125: 33-

37.

Hwa, H.L., Chang, Y.Y., Lee, J.C., Yin, H.Y., Chen, Y.H., Tseng, L.H., Su, Y.N. and Ko,

T.M. (2009). Thirteen X-chromosomal short tandem repeat loci multiplex data

from Taiwanese. Int J Legal Med, 123: 263-9.

Innis, M.A., Sninsky, J.J. and Gelfand, D.H. (1999). PCR applications: protocols for

functional genomics: Academic Pr.

Innis, M.A., Myambo, K.B., Gelfand, D.H. and Brow, M.A. (1988). DNA sequencing

with Thermus aquaticus DNA polymerase and direct sequencing of polymerase

chain reaction-amplified DNA. Proc Natl Acad Sci U S A, 85: 9436-40.

Israr, M., Shahid, A.A., Rahman, Z., Shahzad, M.S., Ullah, O. and Husnain, T. (2012).

Punjabi population data for seven X-chromosome short tandem repeat (X-STR)

loci using a new miniplex system. Afr J Biotechnol, 11: 10513-10516.

Jarne, P. and Lagoda, P.J. (1996). Microsatellites, from molecules to populations and

back. Trends Ecol Evol, 11: 424-9.

Jedrzejczyk, M., Jacewicz, R., Kozdraj, A., Szram, S. and Berent, J. (2010). Application

of X-STR Loci in Forensic Genetics. Problems of Forensic Sci, 82: 141-150.

Jeffreys, A.J., Wilson, V., Wong, Z., Royle, N., Patel, I., Kelly, R. and Clarkson, R.

(1987). Highly variable minisatellites and DNA fingerprints. Biochem Soc Symp,

53: 165-80.

Jeffreys, A.J., Wilson, V. and Thein, S.L. (1985a). Hypervariable 'minisatellite' regions in

human DNA. Nature, 314: 67-73.

Jeffreys, A.J., Wilson, V. and Thein, S.L. (1985b). Individual-specific 'fingerprints' of

human DNA. Nature, 316: 76-9.

Jeffreys, A.J., Brookfield, J.F. and Semeonoff, R. (1985c). Positive identification of an

immigration test-case using human DNA fingerprints. Nature, 317: 818-819.

114

Jia, Y., Zhou, B., Liang, W.B., Lv, M.L. and Zhang, L. (2004). Two X-chromosome STR

loci DXS6803 and XS6793 frequency data in Chinese population. J Forensic Sci,

49: 845-6.

Jobling, M.A. and Gill, P. (2004). Encoded evidence: DNA in forensic analysis. Nat Rev

Genet, 5: 739-51.

Jorgenson, J.W. and Lukacs, K.D. (1983). Capillary zone electrophoresis. Science, 222:

266-72.

Jorgenson, J.W. and Lukacs, K.D. (1981). Free-zone electrophoresis in glass capillaries.

Clin Chem, 27: 1551-3.

Kaboev, O.K., Luchkina, L.A., Tret'iakov, A.N. and Bahrmand, A.R. (2000). PCR hot

start using primers with the structure of molecular beacons (hairpin-like structure).

Nucleic Acids Res, 28: E94.

Kasai, K., Nakamura, Y. and White, R. (1990). Amplification of a variable number of

tandem repeats (VNTR) locus (pMCT118) by the polymerase chain reaction

(PCR) and its application to forensic science. J Forensic Sci, 35: 1196-200.

Kashyap, V., Guha, S., Sitalaximi, T., Bindu, G.H., Hasnain, S. and Trivedi, R. (2006).

Genetic structure of Indian populations based on fifteen autosomal microsatellite

loci. BMC Genetics, 7: 28.

Kashyap, V., Sitalaximi, T., Chattopadhyay, P. and Trivedi, R. (2004). DNA profiling

technologies in forensic analysis. Int J Hum Genet, 4: 11.

Kebelmann-Betzing, C., Seeger, K., Dragon, S., Schmitt, G., Moricke, A., Schild, T.A.,

Henze, G. and Beyermann, B. (1998). Advantages of a new Taq DNA polymerase

in multiplex PCR and time-release PCR. Biotechniques, 24: 154-8.

Kent, W.J. (2002). BLAT—the BLAST-like alignment tool. Genome Res, 12: 656-664.

Kishida, T., Wang, W., Fukuda, M. and Tamaki, Y. (1997). Duplex PCR of the Y-27H39

and HPRT loci with reference to Japanese population data on the HPRT locus.

Nihon Hoigaku Zasshi, 51: 67-9.

Koehler, J., Chia, A. and Lindsey, S. (2009). The random match probability (RMP) in

DNA evidence: Irrelevant and prejudicial? Jurimetrics Journal, 35: 201.

Kong, D., Shen, H., Huang, Y. and Mi, H. (2004). PCR hot-start using duplex primers.

Biotechnol Lett, 26: 277-80.

Krenke, B.E., Tereba, A., Anderson, S.J., Buel, E., Culhane, S., Finis, C.J., Tomsey, C.S.,

Zachetti, J.M., Masibay, A., Rabbach, D.R., Amiott, E.A. and Sprecher, C.J.

115

(2002). Validation of a 16-locus fluorescent multiplex system. J Forensic Sci, 47:

773-85.

Krüger, J., Fuhrmann, W., Lichte, K.H. and Steffens, C. (1968). Zur Verwendung des

Polymorphismus der sauren Erythrocytenphosphatase bei der

Vaterschaftsbegutachtung. Int J Legal Med, 64: 127-146.

La Spada, A.R., Wilson, E.M., Lubahn, D.B., Harding, A.E. and Fischbeck, K.H. (1991).

Androgen receptor gene mutations in X-linked spinal and bulbar muscular

atrophy. Nature, 352: 77-9.

Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K.,

Dewar, K., Doyle, M., FitzHugh, W., Funke, R., Gage, D., Harris, K., Heaford,

A., Howland, J., Kann, L., Lehoczky, J., LeVine, R., McEwan, P., McKernan, K.,

Meldrim, J., Mesirov, J.P., Miranda, C., Morris, W., Naylor, J., Raymond, C.,

Rosetti, M., Santos, R., Sheridan, A., Sougnez, C., Stange-Thomann, N.,

Stojanovic, N., Subramanian, A., Wyman, D., Rogers, J., Sulston, J., Ainscough,

R., Beck, S., Bentley, D., Burton, J., Clee, C., Carter, N., Coulson, A., Deadman,

R., Deloukas, P., Dunham, A., Dunham, I., Durbin, R., French, L., Grafham, D.,

Gregory, S., Hubbard, T., Humphray, S., Hunt, A., Jones, M., Lloyd, C.,

McMurray, A., Matthews, L., Mercer, S., Milne, S., Mullikin, J.C., Mungall, A.,

Plumb, R., Ross, M., Shownkeen, R., Sims, S., Waterston, R.H., Wilson, R.K.,

Hillier, L.W., McPherson, J.D., Marra, M.A., Mardis, E.R., Fulton, L.A.,

Chinwalla, A.T., Pepin, K.H., Gish, W.R., Chissoe, S.L., Wendl, M.C.,

Delehaunty, K.D., Miner, T.L., Delehaunty, A., Kramer, J.B., Cook, L.L., Fulton,

R.S., Johnson, D.L., Minx, P.J., Clifton, S.W., Hawkins, T., Branscomb, E.,

Predki, P., Richardson, P., Wenning, S., Slezak, T., Doggett, N., Cheng, J.F.,

Olsen, A., Lucas, S., Elkin, C., Uberbacher, E., Frazier, M., Gibbs, R.A., Muzny,

D.M., Scherer, S.E., Bouck, J.B., Sodergren, E.J., Worley, K.C., Rives, C.M.,

Gorrell, J.H., Metzker, M.L., Naylor, S.L., Kucherlapati, R.S., Nelson, D.L.,

Weinstock, G.M., Sakaki, Y., Fujiyama, A., Hattori, M., Yada, T., Toyoda, A.,

Itoh, T., Kawagoe, C., Watanabe, H., Totoki, Y., Taylor, T., Weissenbach, J.,

Heilig, R., Saurin, W., Artiguenave, F., Brottier, P., Bruls, T., Pelletier, E., Robert,

C., Wincker, P., Smith, D.R., Doucette-Stamm, L., Rubenfield, M., Weinstock,

K., Lee, H.M., Dubois, J., Rosenthal, A., Platzer, M., Nyakatura, G., Taudien, S.,

Rump, A., Yang, H., Yu, J., Wang, J., Huang, G., Gu, J., Hood, L., Rowen, L.,

Madan, A., Qin, S., Davis, R.W., Federspiel, N.A., Abola, A.P., Proctor, M.J.,

116

Myers, R.M., Schmutz, J., Dickson, M., Grimwood, J., Cox, D.R., Olson, M.V.,

Kaul, R., Raymond, C., Shimizu, N., Kawasaki, K., Minoshima, S., Evans, G.A.,

Athanasiou, M., Schultz, R., Roe, B.A., Chen, F., Pan, H., Ramser, J., Lehrach,

H., Reinhardt, R., McCombie, W.R., de la Bastide, M., Dedhia, N., Blocker, H.,

Hornischer, K., Nordsiek, G., Agarwala, R., Aravind, L., Bailey, J.A., Bateman,

A., Batzoglou, S., Birney, E., Bork, P., Brown, D.G., Burge, C.B., Cerutti, L.,

Chen, H.C., Church, D., Clamp, M., Copley, R.R., Doerks, T., Eddy, S.R.,

Eichler, E.E., Furey, T.S., Galagan, J., Gilbert, J.G., Harmon, C., Hayashizaki, Y.,

Haussler, D., Hermjakob, H., Hokamp, K., Jang, W., Johnson, L.S., Jones, T.A.,

Kasif, S., Kaspryzk, A., Kennedy, S., Kent, W.J., Kitts, P., Koonin, E.V., Korf, I.,

Kulp, D., Lancet, D., Lowe, T.M., McLysaght, A., Mikkelsen, T., Moran, J.V.,

Mulder, N., Pollara, V.J., Ponting, C.P., Schuler, G., Schultz, J., Slater, G., Smit,

A.F., Stupka, E., Szustakowski, J., Thierry-Mieg, D., Thierry-Mieg, J., Wagner,

L., Wallis, J., Wheeler, R., Williams, A., Wolf, Y.I., Wolfe, K.H., Yang, S.P.,

Yeh, R.F., Collins, F., Guyer, M.S., Peterson, J., Felsenfeld, A., Wetterstrand,

K.A., Patrinos, A., Morgan, M.J., de Jong, P., Catanese, J.J., Osoegawa, K.,

Shizuya, H., Choi, S. and Chen, Y.J. (2001). Initial sequencing and analysis of the

human genome. Nature, 409: 860-921.

Langaney, A., Roessli, D., van Blyenburgh, N.H. and Dard, P. (1992). Do most human

populations descend from phylogenetic trees? Human Evolution, 7: 47-61.

Lawyer, F.C., Stoffel, S., Saiki, R.K., Myambo, K., Drummond, R. and Gelfand, D.H.

(1989). Isolation, characterization, and expression in Escherichia coli of the DNA

polymerase gene from Thermus aquaticus. J Biol Chem, 264: 6427-37.

Lazaruk, K., Walsh, P.S., Oaks, F., Gilbert, D., Rosenblum, B.B., Menchen, S., Scheibler,

D., Wenz, H.M., Holt, C. and Wallin, J. (1998). Genotyping of forensic short

tandem repeat (STR) systems based on sizing precision in a capillary

electrophoresis instrument. Electrophoresis, 19: 86-93.

Leahy, J.G. and Colwell, R.R. (1990). Microbial degradation of hydrocarbons in the

environment. Microbiol Rev, 54: 305-15.

Leclair, B., Fregeau, C.J., Bowen, K.L. and Fourney, R.M. (2004). Enhanced kinship

analysis and STR-based DNA typing for human identification in mass fatality

incidents: the Swissair flight 111 disaster. J Forensic Sci, 49: 939-53.

Lee, H.Y., Park, M.J., Jeong, C.K., Lee, S.Y., Yoo, J.E., Chung, U., Choi, J.H., Kim,

C.Y. and Shin, K.J. (2004). Genetic characteristics and population study of 4 X-

117

chromosomal STRs in Koreans: evidence for a null allele at DXS9898. Int J Legal

Med, 118: 355-60.

Levy, S., Sutton, G., Ng, P.C., Feuk, L., Halpern, A.L., Walenz, B.P., Axelrod, N.,

Huang, J., Kirkness, E.F., Denisov, G., Lin, Y., MacDonald, J.R., Pang, A.W.,

Shago, M., Stockwell, T.B., Tsiamouri, A., Bafna, V., Bansal, V., Kravitz, S.A.,

Busam, D.A., Beeson, K.Y., McIntosh, T.C., Remington, K.A., Abril, J.F., Gill, J.,

Borman, J., Rogers, Y.H., Frazier, M.E., Scherer, S.W., Strausberg, R.L. and

Venter, J.C. (2007). The diploid genome sequence of an individual human. PLoS

Biol, 5: e254.

Li, J.Z., Absher, D.M., Tang, H., Southwick, A.M., Casto, A.M., Ramachandran, S.,

Cann, H.M., Barsh, G.S., Feldman, M. and Cavalli-Sforza, L.L. (2008).

Worldwide human relationships inferred from genome-wide patterns of variation.

Science, 319: 1100-1104.

Litt, M. and Luty, J.A. (1989). A hypervariable microsatellite revealed by in vitro

amplification of a dinucleotide repeat within the cardiac muscle actin gene. Am J

Hum Genet, 44: 397-401.

Liu, K. and Muse, S.V. (2005). PowerMarker: an integrated analysis environment for

genetic marker analysis. Bioinformatics, 21: 2128-2129.

Liu, Q. and Li, S. (2006). Patterns of genetic polymorphism at the 10 X-chromosome

STR loci in Mongol population. Forensic Sci Int, 158: 76-79.

Liu, Q.L., Lu, D.J., Quan, L., Chen, Y.F., Shen, M. and Zhao, H. (2012). Development of

multiplex PCR system with 15 X‐STR loci and genetic analysis in three

nationality populations from China. Electrophoresis, 33: 1299-1305.

Lonjou, C., Zhang, W., Collins, A., Tapper, W.J., Elahi, E., Maniatis, N. and Morton,

N.E. (2003). Linkage disequilibrium in human populations. Proc Natl Acad Sci U

S A, 100: 6069.

Łuczak, S., Rogalla, U., Malyarchuk, B.A. and Grzybowski, T. (2011). Diversity of 15

human X chromosome microsatellite loci in Polish population. Forensic Sci Int

Genet, 5: 71.

Lygo, J.E., Johnson, P.E., Holdaway, D.J., Woodroffe, S., Whitaker, J.P., Clayton, T.M.,

Kimpton, C.P. and Gill, P. (1994). The validation of short tandem repeat (STR)

loci for use in forensic casework. Int J Legal Med, 107: 77-89.

Macaulay, V., Hill, C., Achilli, A., Rengo, C., Clarke, D., Meehan, W., Blackburn, J.,

Semino, O., Scozzari, R. and Cruciani, F. (2005). Single, rapid coastal settlement

118

of Asia revealed by analysis of complete mitochondrial genomes. Science, 308:

1034-1036.

Machado, F.B. and Medina-Acosta, E. (2009). Genetic map of human X-linked

microsatellites used in forensic practice. Forensic Sci Int Genet, 3: 202-4.

Mahtani, M.M. and Willard, H.F. (1993). A polymorphic X-linked tetranucleotide repeat

locus displaying a high rate of new mutation: implications for mechanisms of

mutation at short tandem repeat loci. Hum Mol Genet, 2: 431-437.

Maniatis, T., Fritsch, E.F. and Sambrook, J. (1982). Molecular cloning : a laboratory

manual, Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory.

Marchi, E. (2004). Methods developed to identify victims of the World Trade Center

disaster. Am Lab, 36: 30-36.

Martins, J.A., Costa, J.C., Paneto, G.G., Gusmão, L., Sánchez-Diz, P., Carracedo, A. and

Cicarelli, R.M.B. (2009). Genetic data of 10 X-chromosomal loci in Vitória

population (Espírito Santo State, Brazil). Forensic Sci Int Genet (Supp), 2: 394-

395.

McDonald, O.G., Krynetski, E.Y. and Evans, W.E. (2002). Molecular haplotyping of

genomic DNA for multiple single-nucleotide polymorphisms located kilobases

apart using long-range polymerase chain reaction and intramolecular ligation.

Pharmacogenetics, 12: 93-9.

McEwen, J.E. (1995). Forensic DNA data banking by state crime laboratories. Am J Hum

Genet, 56: 1487-92.

Moretti, T.R., Baumstark, A.L., Defenbaugh, D.A., Keys, K.M., Smerick, J.B. and

Budowle, B. (2001). Validation of short tandem repeats (STRs) for forensic usage:

performance testing of fluorescent multiplex STR systems and analysis of

authentic and simulated forensic samples. J Forensic Sci, 46: 647-60.

Mukerjee, S., Ghosh, T., Kalpana, D., Mukherjee, M. and Sharma, A.K. (2010). Genetic

variation of 10 X chromosomal STR loci in Indian population. Int J Legal Med,

124: 327-330.

Muller, K., Klein, R., Miltner, E. and Wiegand, P. (2007). Improved STR typing of

telogen hair root and hair shaft DNA. Electrophoresis, 28: 2835-42.

Mullis, K., Faloona, F., Scharf, S., Saiki, R., Horn, G. and Erlich, H. (1986). Specific

enzymatic amplification of DNA in vitro: the polymerase chain reaction. Cold

Spring Harb Symp Quant Biol, 51 Pt 1: 263-73.

119

Nakamura, Y. and Minaguchi, K. (2010). Sixteen X-chromosomal STRs in two octaplex

PCRs in Japanese population and development of 15-locus multiplex PCR system.

Int J Legal Med, 124: 405-14.

Nakamura, Y., Leppert, M., O'Connell, P., Wolff, R., Holm, T., Culver, M., Martin, C.,

Fujimoto, E., Hoff, M., Kumlin, E. and et al. (1987). Variable number of tandem

repeat (VNTR) markers for human gene mapping. Science, 235: 1616-22.

Oppenheimer, S. (2012). Out-of-Africa, the peopling of continents and islands: tracing

uniparental gene trees across the map. Philos Trans R Soc Lond B Biol Sci, 367:

770-84.

Oppenheimer, S. (2004). Out of Eden: the peopling of the world: Constable & Robinson

Ltd.

Oppenheimer, S. (2003). The real eve: modern man's journey out of Africa: Basic Books.

Panneerchelvam, S. and Norazmi, M.N. (2003). Forensic DNA profiling and database.

Malaysian J Med Sci, 10: 20-26.

Parsons, T.J., Huel, R., Davoren, J., Katzmarzyk, C., Milos, A., Selmanovic, A.,

Smajlovic, L., Coble, M.D. and Rizvic, A. (2007). Application of novel "mini-

amplicon" STR multiplexes to high volume casework on degraded skeletal

remains. Forensic Sci Int Genet, 1: 175-9.

Pereira, R., Gomes, I., Amorim, A. and Gusmão, L. (2007). Genetic diversity of 10 X

chromosome STRs in northern Portugal. Int J Legal Med, 121: 192-197.

Pico, A., Castillo, A., Vargas, C., Amorim, A. and Gusmão, L. (2008). Genetic profile

characterization and segregation analysis of 10 X-STRs in a sample from

Santander, Colombia. Int J Legal Med, 122: 347-351.

Pinto, N., Gusmão, L. and Amorim, A. (2011). X-chromosome markers in kinship testing:

a generalisation of the IBD approach identifying situations where their

contribution is crucial. Forensic Sci Int Genet, 5: 27-32.

Poetsch, M., Knop, A., El-Mostaqim, D., Rakotomavo, N. and von Wurmb-Schwark, N.

(2011). Allele frequencies of 11 X-chromosomal loci of two population samples

from Africa. Int J Legal Med, 125: 307-314.

Poetsch, M., El-Mostaqim, D., Tschentscher, F., Browne, E.N.L., Timmann, C.,

Horstmann, R.D. and von Wurmb-Schwark, N. (2009). Allele frequencies of 11

X-chromosomal loci in a population sample from Ghana. Int J Legal Med, 123:

81-83.

120

Poetsch, M., Petersmann, H., Repenning, A. and Lignitz, E. (2005). Development of two

pentaplex systems with X-chromosomal STR loci and their allele frequencies in a

northeast German population. Forensic Sci Int, 155: 71-6.

Qamar, R., Ayub, Q., Mohyuddin, A., Helgason, A., Mazhar, K., Mansoor, A., Zerjal, T.,

Tyler-Smith, C. and Mehdi, S.Q. (2002). Y-chromosomal DNA variation in

Pakistan. Am J Hum Genet, 70: 1107-1124.

Rakha, A., Yu, B., Hadi, S. and Sheng-Bin, L. (2009). Population genetic data on 15

autosomal STRs in a Pakistani population sample. Leg Med (Tokyo), 11: 305-7.

Reid, T.M., Peterson, J.W., Baird, M.L., Lee, S.C. and Lee, R.F. (2008). The use of

supplemental STR loci for resolving difficult family relationship cases. Forensic

Sci Int Genet (Supp), 1: 520–521.

Ribeiro Rodrigues, E.M., Leite, F.P., Hutz, M.H., Palha Tde, J., Ribeiro dos Santos, A.K.

and dos Santos, S.E. (2008). A multiplex PCR for 11 X chromosome STR

markers and population data from a Brazilian Amazon Region. Forensic Sci Int

Genet, 2: 154-8.

Robino, C., Giolitti, A., Gino, S. and Torre, C. (2006). Development of two multiplex

PCR systems for the analysis of 12 X-chromosomal STR loci in a northwestern

Italian population sample. Int J Legal Med, 120: 315-318.

Rolf, B., Bulander, N. and Wiegand, P. (2011). Insertion-/deletion polymorphisms close

to the repeat region of STR loci can cause discordant genotypes with different

STR kits. Forensic Sci Int Genet, 5: 339-41.

Ross, M.T., Grafham, D.V., Coffey, A.J., Scherer, S., McLay, K., Muzny, D., Platzer, M.,

Howell, G.R., Burrows, C., Bird, C.P., Frankish, A., Lovell, F.L., Howe, K.L.,

Ashurst, J.L., Fulton, R.S., Sudbrak, R., Wen, G., Jones, M.C., Hurles, M.E.,

Andrews, T.D., Scott, C.E., Searle, S., Ramser, J., Whittaker, A., Deadman, R.,

Carter, N.P., Hunt, S.E., Chen, R., Cree, A., Gunaratne, P., Havlak, P., Hodgson,

A., Metzker, M.L., Richards, S., Scott, G., Steffen, D., Sodergren, E., Wheeler,

D.A., Worley, K.C., Ainscough, R., Ambrose, K.D., Ansari-Lari, M.A., Aradhya,

S., Ashwell, R.I., Babbage, A.K., Bagguley, C.L., Ballabio, A., Banerjee, R.,

Barker, G.E., Barlow, K.F., Barrett, I.P., Bates, K.N., Beare, D.M., Beasley, H.,

Beasley, O., Beck, A., Bethel, G., Blechschmidt, K., Brady, N., Bray-Allen, S.,

Bridgeman, A.M., Brown, A.J., Brown, M.J., Bonnin, D., Bruford, E.A., Buhay,

C., Burch, P., Burford, D., Burgess, J., Burrill, W., Burton, J., Bye, J.M., Carder,

C., Carrel, L., Chako, J., Chapman, J.C., Chavez, D., Chen, E., Chen, G., Chen,

121

Y., Chen, Z., Chinault, C., Ciccodicola, A., Clark, S.Y., Clarke, G., Clee, C.M.,

Clegg, S., Clerc-Blankenburg, K., Clifford, K., Cobley, V., Cole, C.G., Conquer,

J.S., Corby, N., Connor, R.E., David, R., Davies, J., Davis, C., Davis, J., Delgado,

O., Deshazo, D., Dhami, P., Ding, Y., Dinh, H., Dodsworth, S., Draper, H.,

Dugan-Rocha, S., Dunham, A., Dunn, M., Durbin, K.J., Dutta, I., Eades, T.,

Ellwood, M., Emery-Cohen, A., Errington, H., Evans, K.L., Faulkner, L., Francis,

F., Frankland, J., Fraser, A.E., Galgoczy, P., Gilbert, J., Gill, R., Glockner, G.,

Gregory, S.G., Gribble, S., Griffiths, C., Grocock, R., Gu, Y., Gwilliam, R.,

Hamilton, C., Hart, E.A., Hawes, A., Heath, P.D., Heitmann, K., Hennig, S.,

Hernandez, J., Hinzmann, B., Ho, S., Hoffs, M., Howden, P.J., Huckle, E.J.,

Hume, J., Hunt, P.J., Hunt, A.R., Isherwood, J., Jacob, L., Johnson, D., Jones, S.,

de Jong, P.J., Joseph, S.S., Keenan, S., Kelly, S., Kershaw, J.K., Khan, Z.,

Kioschis, P., Klages, S., Knights, A.J., Kosiura, A., Kovar-Smith, C., Laird, G.K.,

Langford, C., Lawlor, S., Leversha, M., Lewis, L., Liu, W., Lloyd, C., Lloyd,

D.M., Loulseged, H., Loveland, J.E., Lovell, J.D., Lozado, R., Lu, J., Lyne, R.,

Ma, J., Maheshwari, M., Matthews, L.H., McDowall, J., McLaren, S., McMurray,

A., Meidl, P., Meitinger, T., Milne, S., Miner, G., Mistry, S.L., Morgan, M.,

Morris, S., Muller, I., Mullikin, J.C., Nguyen, N., Nordsiek, G., Nyakatura, G.,

O'Dell, C.N., Okwuonu, G., Palmer, S., Pandian, R., Parker, D., Parrish, J.,

Pasternak, S., Patel, D., Pearce, A.V., Pearson, D.M., Pelan, S.E., Perez, L.,

Porter, K.M., Ramsey, Y., Reichwald, K., Rhodes, S., Ridler, K.A., Schlessinger,

D., Schueler, M.G., Sehra, H.K., Shaw-Smith, C., Shen, H., Sheridan, E.M.,

Shownkeen, R., Skuce, C.D., Smith, M.L., Sotheran, E.C., Steingruber, H.E.,

Steward, C.A., Storey, R., Swann, R.M., Swarbreck, D., Tabor, P.E., Taudien, S.,

Taylor, T., Teague, B., Thomas, K., Thorpe, A., Timms, K., Tracey, A.,

Trevanion, S., Tromans, A.C., d'Urso, M., Verduzco, D., Villasana, D., Waldron,

L., Wall, M., Wang, Q., Warren, J., Warry, G.L., Wei, X., West, A., Whitehead,

S.L., Whiteley, M.N., Wilkinson, J.E., Willey, D.L., Williams, G., Williams, L.,

Williamson, A., Williamson, H., Wilming, L., Woodmansey, R.L., Wray, P.W.,

Yen, J., Zhang, J., Zhou, J., Zoghbi, H., Zorilla, S., Buck, D., Reinhardt, R.,

Poustka, A., Rosenthal, A., Lehrach, H., Meindl, A., Minx, P.J., Hillier, L.W.,

Willard, H.F., Wilson, R.K., Waterston, R.H., Rice, C.M., Vaudin, M., Coulson,

A., Nelson, D.L., Weinstock, G., Sulston, J.E., Durbin, R., Hubbard, T., Gibbs,

122

R.A., Beck, S., Rogers, J. and Bentley, D.R. (2005). The DNA sequence of the

human X chromosome. Nature, 434: 325-37.

Rowold, D.J. and Herrera, R.J. (2003). Inferring recent human phylogenies using forensic

STR technology. Forensic Sci Int, 133: 260-265.

Rozen, S. and Skaletsky, H. (2000). Primer3 on the WWW for general users and for

biologist programmers. Methods Mol Biol, 132: 365-86.

Saiki, R.K., Gelfand, D.H., Stoffel, S., Scharf, S.J., Higuchi, R., Horn, G.T., Mullis, K.B.

and Erlich, H.A. (1988). Primer-directed enzymatic amplification of DNA with a

thermostable DNA polymerase. Science, 239: 487-91.

Saiki, R.K., Scharf, S., Faloona, F., Mullis, K.B., Horn, G.T., Erlich, H.A. and Arnheim,

N. (1985). Enzymatic amplification of beta-globin genomic sequences and

restriction site analysis for diagnosis of sickle cell anemia. Science, 230: 1350-4.

Sajantila, A., Puomilahti, S., Johnsson, V. and Ehnholm, C. (1992). Amplification of

reproducible allele markers for amplified fragment length polymorphism analysis.

Biotechniques, 12: 16, 18, 20.

Santos-Lopes, S.S., Pereira, R.W., Wilson, I.J. and Pena, S.D.J. (2007). A Worldwide

Phylogeography for the Human X Chromosome. PLoS One, 2: e557.

Schaffner, S.F. (2004). The X chromosome in population genetics. Nat Rev Genet, 5: 43-

51.

Schneider, P.M. (2003). STR Profiling Strategy and the Development of National DNA

Databases. Identity, Newsletter for Europe, Middle East and Africa.

Schoske, R., Vallone, P.M., Ruitberg, C.M. and Butler, J.M. (2003). Multiplex PCR

design strategy used for the simultaneous amplification of 10 Y chromosome short

tandem repeat (STR) loci. Anal Bioanal Chem, 375: 333-43.

Schumm, J.W. (1997). Why Use a Size Marker and Allelic Ladders in STR Analysis?

Profiles in DNA, 1: 11-13.

Shete, S., Tiwari, H. and Elston, R.C. (2000). On estimating the heterozygosity and

polymorphism information content value. Theor Popul Biol, 57: 265-71.

Shin, S.H., Yu, J.S., Park, S.W., Min, G.S. and Chung, K.W. (2005). Genetic analysis of

18 X-linked short tandem repeat markers in Korean population. Forensic Sci Int,

147: 35-41.

Shuber, A.P., Grondin, V.J. and Klinger, K.W. (1995). A simplified procedure for

developing multiplex PCRs. Genome Res, 5: 488-93.

123

Signer, E., Kuenzle, C.C., Thomann, P.E. and Hübscher, U. (1988). DNA fingerprinting:

improved DNA extraction from small blood samples. Nucleic Acids Res, 16:

7738.

Silveira, D., Silva, F.F., Jesus, P.R. and Whittle, M.R. (2007). Use of X-linked short

tandem repeat loci in routine parentage casework. Transfusion, 47: 1050-3.

Son, J.Y., Lee, Y.S., Choung, C.M. and Lee, S.D. (2002). Polymorphism of nine X

chromosomal STR loci in Koreans. Int J Legal Med, 116: 317-21.

Sullivan, K.M., Mannucci, A., Kimpton, C.P. and Gill, P. (1993). A rapid and quantitative

DNA sex test: fluorescence-based PCR analysis of X-Y homologous gene

amelogenin. Biotechniques, 15: 636-8, 640-1.

Sun, R., Zhu, Y., Zhu, F., Kuang, W., Feng, J., Tian, Z., Chai, Z., Liu, Y. and Li, S.

(2011). Genetic polymorphisms of 10 X-STR among four ethnic populations in

northwest of China. Mol Biol Reports: 1-5.

Szibor, R. (2007). X-chromosomal markers: past, present and future. Forensic Sci Int

Genet, 1: 93-9.

Szibor, R., Hering, S. and Edelmann, J. (2006). A new Web site compiling forensic

chromosome X research is now online. Int J Legal Med, 120: 252-4.

Szibor, R., Hering, S., Kuhlisch, E., Plate, I., Demberger, S., Krawczak, M. and

Edelmann, J. (2005a). Haplotyping of STR cluster DXS6801–DXS6809–

DXS6789 on Xq21 provides a powerful tool for kinship testing. Int J Legal Med,

119: 363-369.

Szibor, R., Hering, S. and Edelmann, J. (2005b). The HumARA genotype is linked to

spinal and bulbar muscular dystrophy and some further disease risks and should

no longer be used as a DNA marker for forensic purposes. Int J Legal Med, 119:

179-80.

Szibor, R., Krawczak, M., Hering, S., Edelmann, J., Kuhlisch, E. and Krause, D. (2003).

Use of X-linked markers for forensic purposes. Int J Legal Med, 117: 67-74.

Szibor, R., Lautsch, S., Plate, I. and Beck, N. (2000). Population data on the X

chromosome short tandem repeat locus HumHPRTB in two regions of Germany. J

Forensic Sci, 45: 231-3.

Takezaki, N., Nei, M. and Tamura, K. (2010). POPTREE2: Software for constructing

population trees from allele frequency data and computing other population

statistics with Windows interface. Mol Biol Evol, 27: 747-52.

124

Takezaki, N. and Nei, M. (2008). Empirical tests of the reliability of phylogenetic trees

constructed with microsatellite DNA. Genetics, 178: 385-392.

Takezaki, N. and Nei, M. (1996). Genetic distances and reconstruction of phylogenetic

trees from microsatellite DNA. Genetics, 144: 389-399.

Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M. and Kumar, S. (2011).

MEGA5: molecular evolutionary genetics analysis using maximum likelihood,

evolutionary distance, and maximum parsimony methods. Mol Biol Evol, 28:

2731-2739.

Tariq, M., Ullah, O., Riazuddin, S. and Riazuddin, S. (2008). Allele frequency

distribution of 13 X-chromosomal STR loci in Pakistani population. Int J Legal

Med, 122: 525-528.

Tariq, M.A., Sabir, M.F., Riazuddin, S.A. and Riazuddin, S. (2009). Haplotype analysis

of two X-chromosome STR clusters in the Pakistani population. Int J Legal Med,

123: 85-87.

Toni, C., Presciuttini, S., Spinetti, I., Rocchi, A. and Domenici, R. (2006). Usefulness of

X-chromosome markers in resolving relationships: Report of a court case

involving presumed half sisters. Int Cong Ser, 1288: 301-303.

Toni, C., Presciuttini, S., Spinetti, I. and Domenici, R. (2003). Population data of four X-

chromosome markers in Tuscany, and their use in a deficiency paternity case.

Forensic Sci Int, 137: 215.

Toscanini, U., Gusmão, L., Berardi, G. and Raimondi, E. (2009). Genetic data of 10 X-

STR in two Native American populations of Argentina. Forensic Sci Int Genet

(Supp), 2: 405-406.

Turrina, S., Atzei, R., Filippini, G. and De Leo, D. (2007). Development and forensic

validation of a new multiplex PCR assay with 12 X-chromosomal short tandem

repeats. Forensic Sci Int Genet, 1: 201-4.

Underhill, P.A., Shen, P., Lin, A.A., Jin, L., Passarino, G., Yang, W.H., Kauffman, E.,

Bonné-Tamir, B., Bertranpetit, J. and Francalacci, P. (2000). Y chromosome

sequence variation and the history of human populations. Nature Genet, 26: 358-

361.

Vallone, P.M., Hill, C.R. and Butler, J.M. (2008). Demonstration of rapid multiplex PCR

amplification involving 16 genetic loci. Forensic Sci Int Genet, 3: 42-5.

Vallone, P.M. and Butler, J.M. (2004). AutoDimer: a screening tool for primer-dimer and

hairpin structures. Biotechniques, 37: 226-231.

125

Venter, J.C., Adams, M.D., Myers, E.W., Li, P.W., Mural, R.J., Sutton, G.G., Smith,

H.O., Yandell, M., Evans, C.A. and Holt, R.A. (2001). The sequence of the human

genome. Science's STKE, 291: 1304.

Vogelstein, B. and Gillespie, D. (1979). Preparative and analytical purification of DNA

from agarose. Proc Natl Acad Sci U S A, 76: 615.

Wang, H.Y., Luo, M., Tereshchenko, I.V., Frikker, D.M., Cui, X., Li, J.Y., Hu, G., Chu,

Y., Azaro, M.A., Lin, Y., Shen, L., Yang, Q., Kambouris, M.E., Gao, R., Shih, W.

and Li, H. (2005). A genotyping system capable of simultaneously analyzing

>1000 single nucleotide polymorphisms in a haploid genome. Genome Res, 15:

276-83.

Weber, J.L. and May, P.E. (1989). Abundant class of human DNA polymorphisms which

can be typed using the polymerase chain reaction. Am J Hum Genet, 44: 388-396.

Wells, S. and Read, M. (2002). The journey of man: A genetic odyssey: Princeton Univ

Pr.

Whitaker, J.P., Clayton, T.M., Urquhart, A.J., Millican, E.S., Downes, T.J., Kimpton,

C.P. and Gill, P. (1995). Short tandem repeat typing of bodies from a mass

disaster: high success rate and characteristic amplification patterns in highly

degraded samples. Biotechniques, 18: 670-7.

Wiegand, P., Klein, R., Braunschweiger, G., Hohoff, C. and Brinkmann, B. (2006). Short

amplicon STR multiplex for stain typing. Int J Legal Med, 120: 160-164.

Wiegand, P., Berger, B., Edelmann, J. and Parson, W. (2003). Population genetic

comparisons of three X-chromosomal STRs. Int J Legal Med, 117: 62-5.

Wiegand, P. and Kleiber, M. (2001). Less is more--length reduction of STR amplicons

using redesigned primers. Int J Legal Med, 114: 285-7.

Williams, P.E., Marino, M.A., Del Rio, S.A., Turni, L.A. and Devaney, J.M. (1994).

Analysis of DNA restriction fragments and polymerase chain reaction products by

capillary electrophoresis. J Chromatogr, 680: 525-40.

Wyman, A.R. and White, R. (1980). A highly polymorphic locus in human DNA. Proc

Natl Acad Sci U S A, 77: 6754-8.

Zalan, A., Volgyi, A., Jung, M., Peterman, O. and Pamjav, H. (2007). Hungarian

population data of four X-linked markers: DXS8378, DXS7132, HPRTB, and

DXS7423. Int J Legal Med, 121: 74-7.

Zarrabeitia, M.T., Alonso, A., Martin, J., Gonzalez-Gay, M.A., Martin-Escudero, J.C., de

Pancorbo, M.M., Sanz, P., Ruiz-Cabello, F. and Riancho, J.A. (2006). Study of six

126

X-linked tetranucleotide microsatellites: population data from five Spanish

regions. Int J Legal Med, 120: 147-50.

Zarrabeitia, M.T., Amigo, T., Sañudo, C., Zarrabeitia, A., González-Lamuño, D. and

Riancho, J.A. (2002). A new pentaplex system to study short tandem repeat

markers of forensic interest on X chromosome. Forensic Sci Int, 129: 85-89.

127

Publication

Muhammad Israr, Ahmad Ali Shahid, Ziaur Rahman, Muhammad Saqib Shahzad, Obaid

Ullah and Tayyab Husnain (2012). Punjabi population data for seven X-chromosome

short tandem repeat (X-STR) loci using a new miniplex system. African Journal of

Biotechnology, 11: 10513-10516.