Gemma Cadby - UWA Research Repository€¦ · The genetic epidemiology of melanoma susceptibility...

376
The genetic epidemiology of melanoma susceptibility and prognosis and investigations of causal pathway modelling in complex disease aetiology. Gemma Cadby Bachelor of Science (Honours) This thesis is presented for the degree of Doctor of Philosophy of The University of Western Australia School of Population Health. May, 2011

Transcript of Gemma Cadby - UWA Research Repository€¦ · The genetic epidemiology of melanoma susceptibility...

The genetic epidemiology of melanoma

susceptibility and prognosis and investigations of

causal pathway modelling in complex disease

aetiology.

Gemma Cadby

Bachelor of Science (Honours)

This thesis is presented for the degree of

Doctor of Philosophy

of The University of Western Australia

School of Population Health.

May, 2011

ii

i

Declaration

This thesis is the author’s own composition. All sources have been acknowledged

and the author’s contribution is clearly identified in this thesis. This thesis con-

tains published work and/or work prepared for publication, some of which has

been co-authored. The permission of all co-authors has been obtained to include

the work in this thesis.

For work which has been prepared for publication, the work performed by the

author of this thesis has been described in detail.

This thesis was completed during the course of enrolment in this degree at the

University of Western Australia and has not previously been accepted for a degree

at this or another institution.

ii

iii

Abstract

Genetic epidemiological studies are increasingly being used to investigate the role

of genetic and environmental factors in common human disease aetiology, with the

ultimate aims of control and prevention. This thesis investigated the association

of candidate genes with melanoma susceptibility and prognosis in the Western

Australian population. In addition, methods research into novel statistical meth-

ods to model causal pathways in epidemiological studies was undertaken.

Australia has the highest incidence of cutaneous malignant melanoma in the world.

Several melanoma-susceptibility genes have been identified, however a greater un-

derstanding of these genes and their interactions with environmental factors would

lead to better interventions and control of the disease. The genetic determinants

of melanoma progression, and therefore prognosis, are largely unknown. Knowl-

edge of the genes which affect melanoma progression and prognosis would aid

in identifying individuals at risk of poorer prognosis, and may also elucidate the

mechanisms underlying melanoma progression. The collection of population-based

clinical, phenotypic and genetic data is integral in facilitating investigation into

the genetic and environmental factors affecting melanoma susceptibility and prog-

nosis.

This thesis involved the establishment of the Western Australian Melanoma Health

Study (WAMHS). The WAMHS is a population-based case-collection and linked

biospecimen resource established to investigate the genetic epidemiology of melanoma.

All eligible Western Australian adult cases of melanoma, diagnosed between Jan-

uary 2006 and September 2009 and notified to the Western Australian Cancer Reg-

iv

istry (WACR), were invited to participate in the study. Clinical and questionnaire-

based phenotypic data and blood samples for extraction of DNA, RNA and serum

were collected from consenting cases. Clinical data consisted of all pathological

data recorded by the WACR and the questionnaire covered major risk factors

for melanoma, such as sun exposure history and pigmentation. The final sample

consisted of 1,643 cases, of which 1,455 completed one or more components of

the study and 1,157 completed all components. The WAMHS comprises the only

population-based study of melanoma cases in Western Australia, and is one of the

largest single studies in the world.

A sub-sample of the WAMHS consisting of 800 European-ancestry participants

was used to investigate the genetic epidemiology of melanoma susceptibility and

melanoma prognosis. The genetic epidemiology of melanoma susceptibility was

investigated through association analyses of 42 single nucleotide polymorphisms

(SNPs) in genes previously reported to be associated with melanoma-risk, com-

paring the WAMHS sample to two general population samples. The genetic epi-

demiology of melanoma prognosis was investigated by association analyses of these

same SNPs with Breslow thickness, which is the best independent predictor for

disease prognosis. I hypothesised that some of the genes which increase melanoma-

risk may also be those which affect poorer disease prognosis, as reflected by in-

creased Breslow thickness.

As a result of this study, 14 SNPs in nine candidate genes were found to be

associated with melanoma-risk. The associations between six of these SNPs

(MC1R rs258322, TYR rs1393350, IRF4 rs12203592, MTAP rs7023329, MTAP

rs1011970 and BRAF rs6944385) remained significant after adjusting for multiple

v

testing. Four SNPs (IRF4 rs12203592, OCA2 rs1800401, TP53 rs1042522 and

BRAF rs1733826) were found to be associated with Breslow thickness. However,

none of these associations remained significant after adjustment for multiple test-

ing. The IRF4 rs12203592 SNP was associated with increased melanoma-risk and

thicker Breslow thickness, and therefore poorer prognosis. In addition, epidemio-

logical analyses of melanoma prognosis identified two phenotypic variables – age

at diagnosis and presence of naevi – which were associated with Breslow thickness.

In the second section of this thesis, I describe a statistical method known as

Mendelian randomisation, which is used to infer causal relationships between po-

tentially modifiable environmental exposures and some disease trait, using genetic

variants. More certain inferences regarding causal associations in epidemiological

studies are important as it may identify modifiable risk factors, i.e. information

that would allow individuals to modify their behaviours and exposures in order

to reduce the risk of some disease outcome.

As part of the methods work, I have developed an R library – MRsnphap – which

can be used to apply Mendelian randomisation to the genetic epidemiological set-

ting, by using SNPs and haplotypes as proxy variables for modifiable exposures.

While the use of SNPs as instruments is straightforward, the use of haplotypes

is not due to the uncertainty of haplotype phase. MRsnphap is the first soft-

ware for Mendelian randomisation analysis designed specifically for the genetic

epidemiological setting, which is able to incorporate the uncertainty surrounding

haplotypes into the analysis.

vi

The collection and analysis of population-based genotypic and phenotypic data

is integral in understanding the aetiology of disease. This is a critical step which

will enable the clinical utilisation of new knowledge and tools to improve clinical

practice and public health. The research presented in this thesis is both novel

and significant as it includes the first known study of the genetic determinants

of melanoma in a Western Australian population. This thesis also presents the

first comprehensive investigation into the role of candidate loci and melanoma

prognosis in a large, well-characterised sample of European-ancestry melanoma

cases. In addition, the availability of MRsnphap will help further elucidate the

role between modifiable exposures and disease, so that interventions can occur to

reduce the impact of disease in the community.

vii

Publications

Publications arising directly from this thesis

G. Cadby, S.V. Ward, (joint first authors), A. Lee, J.M. Cole, J. Heyworth,

M. Millward, T. Threlfall, F. Wood and L.J. Palmer. The Western Australian

Melanoma Health Study: Study design and participant characteristics. [IN PRESS:

CANCER EPIDEMIOLOGY]

G. Cadby, P.A. McCaskie and L.J. Palmer. (1923T). Mendelian Randomisation :

Modelling single SNPs and haplotypes in R. Presented at the 59th Annual Meeting

of The American Society of Human Genetics, October 24, 2009, Honolulu, Hawaii.

Available at http://www.ashg.org/2009meeting/abstracts/fulltext/. [ABSTRACT]

G. Cadby, S.V. Ward, J.M. Cole, M. Millward and L.J. Palmer. Breslow

thickness in the Western Australian Melanoma Health Study. Pigment Cell and

Melanoma Research, 22(6):903–904, 2009. [ABSTRACT]

S.V. Ward, G. Cadby, J.M. Cole, F. Wood, M. Millward and L.J. Palmer. The

Western Australian Melanoma Health Study. Genetic Epidemiology, 33:787, 2009.

[ABSTRACT]

L. Simpson, M. Cooper, G. Cadby, A.C. Fedson, K.L. Ward, J.D Lee, B. Szeg-

ner, C. Edwards, S. Mukherjee, D.R. Hillman, P. Eastwood and L.J. Palmer. Use

of the fat mass and obesity associated (FTO:RS9939609) single nucleotide poly-

viii

morphism to investigate the relationship between obesity and obstructive sleep

apnoea. Sleep and Biological Rhythms, 7:A18–19, 2009. [ABSTRACT]

M. Cooper, G. Cadby, J.D. Lee, A.C. Fedson, L. Simpson, K.L. Ward, D.R.

Hillman, S. Mukherjee and L.J. Palmer. Using Mendelian Randomisation to in-

vestigate the relationship between blood pressure and the severity of obstructive

sleep apnoea. Genetic Epidemiology, 33:778, 2009. [ABSTRACT]

G. Cadby, S.V. Ward, J.M. Cole, M. Millward and L.J. Palmer. Association

of candidate SNPs with melanoma susceptibility in Australian adults. Genetic

Epidemiology, 34(8):917-992, 2010. [ABSTRACT]

Publications not arising directly from this thesis

G. Cadby, K.W. Carter, S. Wiltshire and L.J. Palmer. Investigating aspects

of statistical power in meta–analysis of complex traits. Genetic Epidemiology,

33:791, 2009. [ABSTRACT]

S.V. Ward, G. Cadby, M. Millward, J.M. Cole, F. Wood and L.J. Palmer.

Scar outcome post melanoma excision. Pigment Cell and Melanoma Research,

22(6):903–904, 2009. [ABSTRACT]

ix

Contents

Declaration i

Abstract iii

Publications vii

Contents ix

List of Tables xviii

List of Figures xx

Glossary xxiii

Abbreviations xxvii

Acknowledgements xxix

Preface xxxi

1 Introduction 1

1.1 Introduction to Epidemiology . . . . . . . . . . . . . . . . . . . . 1

1.1.1 Causality in epidemiology . . . . . . . . . . . . . . . . . . 2

1.2 Introduction to Genetic Epidemiology . . . . . . . . . . . . . . . . 3

1.3 Genetic Concepts and Definitions . . . . . . . . . . . . . . . . . . 5

x

1.3.1 Genes and alleles . . . . . . . . . . . . . . . . . . . . . . . 5

1.3.2 Genetic variation . . . . . . . . . . . . . . . . . . . . . . . 7

1.3.2.1 Single nucleotide polymorphisms . . . . . . . . . 8

1.3.2.2 Haplotypes . . . . . . . . . . . . . . . . . . . . . 10

1.3.2.3 Haplotype inference . . . . . . . . . . . . . . . . 12

1.3.3 Hardy-Weinberg equilibrium principle . . . . . . . . . . . . 13

1.3.4 Linkage disequilibrium . . . . . . . . . . . . . . . . . . . . 14

1.4 Gene Discovery in Human Disease . . . . . . . . . . . . . . . . . . 15

1.4.1 Simple and complex disease . . . . . . . . . . . . . . . . . 16

1.4.2 Linkage analysis . . . . . . . . . . . . . . . . . . . . . . . . 17

1.4.3 Genetic association studies . . . . . . . . . . . . . . . . . . 18

1.4.4 Multiple testing . . . . . . . . . . . . . . . . . . . . . . . . 19

1.5 Aims . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

1.6 Outline of thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

2 Genetic Epidemiology of Malignant Melanoma: Susceptibility

and Prognosis in the WAMHS 23

2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

2.2 Literature Review . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

2.2.1 Skin cancer . . . . . . . . . . . . . . . . . . . . . . . . . . 24

2.2.2 Melanoma biology . . . . . . . . . . . . . . . . . . . . . . 25

2.2.3 Melanoma incidence and mortality . . . . . . . . . . . . . 27

2.2.3.1 Melanoma in Australia . . . . . . . . . . . . . . . 27

2.2.3.2 Melanoma in Western Australia . . . . . . . . . . 34

2.2.4 The aetiology of malignant melanoma . . . . . . . . . . . . 36

2.2.4.1 Environmental and host melanoma risk factors . 36

2.2.4.1.1 Sun exposure . . . . . . . . . . . . . . . 36

xi

2.2.4.1.2 Skin type and pigmentation . . . . . . . 39

2.2.4.1.3 Family history . . . . . . . . . . . . . . 41

2.2.4.1.4 Naevi . . . . . . . . . . . . . . . . . . . 42

2.2.4.2 Melanoma genetic risk factors . . . . . . . . . . . 43

2.2.4.2.1 High penetrance melanoma-susceptibility

genes . . . . . . . . . . . . . . . . . . . . 43

2.2.4.2.2 Low penetrance melanoma-susceptibility

genes . . . . . . . . . . . . . . . . . . . . 45

2.2.5 The diagnosis and treatment of melanoma . . . . . . . . . 49

2.2.6 Breslow thickness . . . . . . . . . . . . . . . . . . . . . . . 50

2.2.6.1 Breslow thickness and prognosis . . . . . . . . . . 50

2.2.6.1.1 Breslow thickness and melanoma risk fac-

tors . . . . . . . . . . . . . . . . . . . . 52

2.2.7 Literature Review Summary . . . . . . . . . . . . . . . . . 54

2.3 The Western Australian Melanoma Health Study . . . . . . . . . 56

2.3.1 Author’s contribution . . . . . . . . . . . . . . . . . . . . . 56

2.3.2 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . 56

2.3.3 WAMHS population . . . . . . . . . . . . . . . . . . . . . 57

2.3.3.1 Recruitment . . . . . . . . . . . . . . . . . . . . . 59

2.3.3.2 WAMHS pilot study . . . . . . . . . . . . . . . . 62

2.3.3.3 WAMHS full-scale implementation I . . . . . . . 64

2.3.3.4 WAMHS full-scale implementation II . . . . . . . 66

2.3.3.5 Biospecimens . . . . . . . . . . . . . . . . . . . . 67

2.3.3.6 Study variables . . . . . . . . . . . . . . . . . . . 68

2.3.3.6.1 Phenotypic variables . . . . . . . . . . . 68

2.3.3.6.2 Demographic and clinical variables . . . 73

xii

2.3.3.7 Sample size and response rates . . . . . . . . . . 74

2.3.3.7.1 Consent, non-consent and non-response

rates . . . . . . . . . . . . . . . . . . . . 74

2.3.3.8 Characteristics of participants . . . . . . . . . . . 78

2.3.3.9 WAMHS sample summary . . . . . . . . . . . . . 83

2.3.4 WAMHS sub-sample used in this thesis . . . . . . . . . . . 85

2.3.4.1 Introduction . . . . . . . . . . . . . . . . . . . . 85

2.3.4.2 SNP selection . . . . . . . . . . . . . . . . . . . . 86

2.3.4.3 Laboratory methods . . . . . . . . . . . . . . . . 89

2.3.4.3.1 DNA preparation . . . . . . . . . . . . . 89

2.3.4.3.2 Genotyping . . . . . . . . . . . . . . . . 89

2.3.4.4 Methods for descriptive analyses . . . . . . . . . 90

2.3.4.4.1 Descriptive statistics of study variables . 90

2.3.4.4.2 Descriptive statistics of genotype data . 91

2.3.4.5 Results for descriptive analyses . . . . . . . . . . 91

2.3.4.5.1 Phenotypic data . . . . . . . . . . . . . 91

2.3.4.5.2 Genotypic data . . . . . . . . . . . . . . 100

2.3.4.6 Generalisation of sub-sample to population and

participating cases . . . . . . . . . . . . . . . . . 103

2.3.4.6.1 Introduction . . . . . . . . . . . . . . . . 103

2.3.4.6.2 Methods . . . . . . . . . . . . . . . . . . 103

2.3.4.6.3 Results . . . . . . . . . . . . . . . . . . 104

2.3.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

2.4 Association of Candidate Loci with Melanoma Susceptibility . . . 111

2.4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . 111

2.4.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111

xiii

2.4.2.1 Study populations . . . . . . . . . . . . . . . . . 111

2.4.2.2 Statistical methods . . . . . . . . . . . . . . . . . 113

2.4.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

2.4.3.1 Summary of results . . . . . . . . . . . . . . . . . 123

2.4.3.1.1 MTAP . . . . . . . . . . . . . . . . . . . 123

2.4.3.1.2 CDC91C1 . . . . . . . . . . . . . . . . . 124

2.4.3.1.3 TYR . . . . . . . . . . . . . . . . . . . . 124

2.4.3.1.4 KITLG . . . . . . . . . . . . . . . . . . 126

2.4.3.1.5 SLC24A4 . . . . . . . . . . . . . . . . . 126

2.4.3.1.6 HERC2 . . . . . . . . . . . . . . . . . . 127

2.4.3.1.7 IRF4 . . . . . . . . . . . . . . . . . . . . 127

2.4.3.1.8 MC1R . . . . . . . . . . . . . . . . . . . 128

2.4.3.1.9 BRAF . . . . . . . . . . . . . . . . . . . 129

2.4.3.2 Discussion . . . . . . . . . . . . . . . . . . . . . . 129

2.5 Association of Candidate Loci with Breslow thickness . . . . . . . 131

2.5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . 131

2.5.2 Statistical methods . . . . . . . . . . . . . . . . . . . . . . 132

2.5.2.1 Association analyses . . . . . . . . . . . . . . . . 132

2.5.2.1.1 Epidemiological analyses . . . . . . . . . 132

2.5.2.1.1.1 Univariate analyses . . . . . . . . 132

2.5.2.1.1.2 Multivariate analyses . . . . . . . 133

2.5.2.1.2 Genotypic analyses . . . . . . . . . . . . 133

2.5.2.1.2.1 Univariate analyses . . . . . . . . 133

2.5.2.1.2.2 Multivariate analyses . . . . . . . 134

2.5.2.2 Statistical power . . . . . . . . . . . . . . . . . . 135

2.5.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

xiv

2.5.3.1 Epidemiological analyses . . . . . . . . . . . . . . 136

2.5.3.1.1 Univariate analyses . . . . . . . . . . . . 136

2.5.3.1.2 Multivariate analyses . . . . . . . . . . . 139

2.5.3.2 Genotypic analyses . . . . . . . . . . . . . . . . . 140

2.5.3.2.1 Univariate analyses . . . . . . . . . . . . 140

2.5.3.2.2 Multivariate analyses . . . . . . . . . . . 142

2.5.3.3 Statistical power . . . . . . . . . . . . . . . . . . 154

2.5.3.3.1 Genetic main effects . . . . . . . . . . . 154

2.5.3.3.2 Gene-environment interactions . . . . . 154

2.5.3.4 Summary of results . . . . . . . . . . . . . . . . . 156

2.5.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . 162

2.5.4.1 Introduction . . . . . . . . . . . . . . . . . . . . 162

2.5.4.2 Summary of population characteristics . . . . . . 162

2.5.4.2.1 Breslow thickness . . . . . . . . . . . . . 162

2.5.4.2.2 Sex . . . . . . . . . . . . . . . . . . . . . 167

2.5.4.2.3 Age at diagnosis . . . . . . . . . . . . . 168

2.5.4.2.4 Naevi . . . . . . . . . . . . . . . . . . . 168

2.5.4.3 Summary of genetic association analyses . . . . . 171

2.5.4.3.1 Review of genetic variants studied . . . 171

2.5.4.3.2 Details of associations . . . . . . . . . . 171

2.5.4.3.2.1 OCA2 . . . . . . . . . . . . . . . 172

2.5.4.3.2.2 TP53 . . . . . . . . . . . . . . . . 173

2.5.4.3.2.3 IRF4 . . . . . . . . . . . . . . . . 174

2.5.4.3.2.4 BRAF . . . . . . . . . . . . . . . 175

2.5.4.4 Causality . . . . . . . . . . . . . . . . . . . . . . 176

2.5.4.5 Potential limitations . . . . . . . . . . . . . . . . 176

xv

2.5.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . 179

2.6 Chapter Summary . . . . . . . . . . . . . . . . . . . . . . . . . . 180

3 Mendelian Randomisation: An Application of Instrumental

Variable Techniques 183

3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183

3.2 Vitamin D Levels and Breslow Thickness . . . . . . . . . . . . . . 184

3.3 Epidemiological Studies . . . . . . . . . . . . . . . . . . . . . . . 185

3.3.1 Experimental Studies . . . . . . . . . . . . . . . . . . . . . 186

3.3.2 Observational Studies . . . . . . . . . . . . . . . . . . . . . 187

3.3.2.1 Limitations of observational epidemiology . . . . 189

3.3.2.1.1 Confounding . . . . . . . . . . . . . . . 191

3.3.2.1.2 Reverse causation . . . . . . . . . . . . . 192

3.3.2.1.3 Selection bias . . . . . . . . . . . . . . . 192

3.3.2.1.4 Regression dilution bias . . . . . . . . . 193

3.4 Statistical Methods for Analysing Observational Epidemiological

Studies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194

3.4.1 Ordinary least squares . . . . . . . . . . . . . . . . . . . . 194

3.4.1.1 Assumptions of ordinary least squares . . . . . . 195

3.4.2 Instrumental variable methods . . . . . . . . . . . . . . . . 196

3.4.2.1 Assumptions of instrumental variables . . . . . . 198

3.4.2.2 Instrumental variable estimator of β . . . . . . . 198

3.4.2.3 Variance-covariance matrix estimator of βIV . . . 199

3.4.3 Instrumental variable diagnostic tests . . . . . . . . . . . . 199

3.4.3.1 Strength of excluded instruments . . . . . . . . . 199

3.4.3.2 Overidentification test . . . . . . . . . . . . . . . 200

3.4.3.3 Endogeneity test . . . . . . . . . . . . . . . . . . 201

xvi

3.4.4 Instrumental variable methods for observational epidemio-

logical studies . . . . . . . . . . . . . . . . . . . . . . . . . 203

3.5 Mendelian Randomisation . . . . . . . . . . . . . . . . . . . . . . 203

3.5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . 203

3.5.2 Genetic variants as excluded instruments . . . . . . . . . . 205

3.5.2.1 SNPs as excluded instruments . . . . . . . . . . . 205

3.5.2.2 Haplotypes as excluded instruments . . . . . . . 205

3.5.3 Mendelian randomisation example . . . . . . . . . . . . . . 206

3.5.4 Benefits of Mendelian randomisation . . . . . . . . . . . . 210

3.5.4.1 Confounding . . . . . . . . . . . . . . . . . . . . 210

3.5.4.2 Reverse causation . . . . . . . . . . . . . . . . . . 210

3.5.4.3 Selection bias . . . . . . . . . . . . . . . . . . . . 211

3.5.4.4 Regression dilution bias . . . . . . . . . . . . . . 211

3.5.5 Limitations of Mendelian randomisation . . . . . . . . . . 211

3.5.5.1 Identification of suitable genetic variant for exposure 211

3.5.5.2 Reliable gene associations . . . . . . . . . . . . . 212

3.5.5.3 Population stratification . . . . . . . . . . . . . . 212

3.5.5.4 Pleiotropy . . . . . . . . . . . . . . . . . . . . . . 213

3.5.6 Current software for Mendelian randomisation analysis . . 213

3.5.7 Software implementation . . . . . . . . . . . . . . . . . . . 214

3.5.7.1 Using SNPs as instruments: mrsnp.quant . . . . 214

3.5.7.1.1 Example: mrsnp.quant . . . . . . . . . . 215

3.5.7.2 Using haplotypes as instruments: mrhap.quant . 216

3.5.7.2.1 Example: mrhap.quant . . . . . . . . . . 217

3.5.8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

4 Summary and Suggestions for Further Research 219

xvii

4.1 The Western Australian Melanoma Health Study . . . . . . . . . 219

4.2 Mendelian Randomisation . . . . . . . . . . . . . . . . . . . . . . 221

4.2.1 Future development of MRsnphap . . . . . . . . . . . . . . 222

4.3 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224

Bibliography 227

A Letter to Doctor 271

B Doctor Information Sheet 273

C Letter to Patient 275

D Patient Information Brochure 277

E Patient Consent Form 281

F Mole-Counting Chart 285

G Questionnaire 287

H Questionnaire Brochure 329

I MRsnphap R Package User Manual 333

J Example output using mrsnp.quant 339

K Example output using mrhap.quant 341

xviii

List of Tables

1.3.2.2.1 Illustration of haplotypes derived from two bi-allelic loci or

SNPs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

2.2.6.1.1 Estimated 5-year survival rates by Breslow thickness (mm)

in Western Australia [147] . . . . . . . . . . . . . . . . . . . 51

2.2.6.1.2 Estimated 5-year survival rates by Breslow thickness (mm)

in New South Wales [148] . . . . . . . . . . . . . . . . . . . 52

2.3.3.1 Eligible ICD-9 codes for WAMHS eligibility . . . . . . . . . 58

2.3.3.7.1.1 Comparison between consenting, non-consenting and non-

responding subjects . . . . . . . . . . . . . . . . . . . . . . 76

2.3.3.8.1 Questionnaire, demographic and clinical characteristics of

WAMHS sample . . . . . . . . . . . . . . . . . . . . . . . . 83

2.3.4.2.1 Tagged SNPs from MC1R, BRAF and EGF genes . . . . . 87

2.3.4.2.2 Reported melanoma-risk assocations between genotyped SNPs

and melanoma-risk factors . . . . . . . . . . . . . . . . . . . 88

2.3.4.5.1.1 Questionnaire-based characteristics of subjects in the WAMHS

sub-sample . . . . . . . . . . . . . . . . . . . . . . . . . . . 94

2.3.4.5.1.2 Demographic and clinical characteristics of subjects in the

WAMHS sub-sample . . . . . . . . . . . . . . . . . . . . . . 95

2.3.4.5.2.1 Genotype frequencies of SNPs in the WAMHS sub-sample . 102

xix

2.3.4.6.3.1 Comparisons between the WAMHS sub-sample, consenting

cases, and eligible population . . . . . . . . . . . . . . . . . 106

2.4.2.1.1 Participant characteristics of the BHS sample . . . . . . . . 113

2.4.3.1 Comparison between WAMHS and HapMap and BHS geno-

type frequencies . . . . . . . . . . . . . . . . . . . . . . . . 121

2.4.3.2 Phenotypic variables multivariately associated with melanoma

risk in the WAMHS . . . . . . . . . . . . . . . . . . . . . . 122

2.4.3.3 Summary of associations between significant SNPs and melanoma

risk in the WAMHS . . . . . . . . . . . . . . . . . . . . . . 122

2.5.3.1.1.1 Univariate analysis between phenotypic variables with Bres-

low thickness in the WAMHS . . . . . . . . . . . . . . . . . 138

2.5.3.1.2.1 Multivariate model of phenotypic variables associated with

Breslow thickness in the WAMHS . . . . . . . . . . . . . . 139

2.5.3.2.1.1 Univariate associations between Breslow thickness and SNPs

modelled codominantly in the WAMHS . . . . . . . . . . . 142

2.5.3.2.2.1 Multivariate associations between Breslow thickness and SNPs

in the WAMHS modelled codominantly . . . . . . . . . . . 146

2.5.3.2.2.2 Association between codominant SNPs and Breslow thick-

ness in the WAMHS . . . . . . . . . . . . . . . . . . . . . . 147

2.5.3.2.2.3 Association between additive SNPs and Breslow thickness in

the WAMHS . . . . . . . . . . . . . . . . . . . . . . . . . . 149

2.5.3.2.2.4 Association between rs1042522, modelled dominantly, and

Breslow thickness in the WAMHS . . . . . . . . . . . . . . 151

2.5.3.4.1 Summary of associations between significant SNPs and Bres-

low thickness in the WAMHS . . . . . . . . . . . . . . . . . 161

3.5.3.1 IV and OLS coefficients and confidence intervals . . . . . . 209

xx

List of Figures

1.3.2.1.1 Inheritance of a dominant effect in a family . . . . . . . . . 9

1.3.2.1.2 Inheritance of a recessive effect in a family . . . . . . . . . . 10

2.2.2.1 Cross-section of the skin highlighting the epidermal layer . . 26

2.2.2.2 Clark model of the five stages of melanoma development . . 27

2.2.3.1.1 Melanoma incidence rates for selected countries for males and

females . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

2.2.3.1.2 Melanoma incidence rates from 1982 to 2005 . . . . . . . . 30

2.2.3.1.3 Melanoma incidence rates by age-group for 2005 for males

and females . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

2.2.3.1.4 Melanoma mortality rates from 1968 to 2005 for males and

females . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

2.2.3.2.1 Melanoma incidence rates for all Australian states and terri-

tories from 2001 to 2005 . . . . . . . . . . . . . . . . . . . . 34

2.3.3.1.1 Recruitment process through the WACR . . . . . . . . . . . 60

2.3.3.6.1.1 Degrees of naevi from the WAMHS questionnaire . . . . . . 70

2.3.3.6.1.2 Number of naevi in two sections of the back from the WAMHS

questionnaire . . . . . . . . . . . . . . . . . . . . . . . . . . 70

2.3.3.6.1.3 Degree of freckling from the WAMHS questionnaire . . . . . 71

2.3.3.7.1 Consent and participation figures and rates for the WAMHS 75

2.3.4.5.1.1 Distribution of Breslow thickness . . . . . . . . . . . . . . . 96

xxi

2.3.4.5.1.2 Distribution of log-transformed Breslow thickness . . . . . . 97

2.5.3.1.2.1 Model diagnostic plots from the multivariate phenotypic model

for Breslow thickness in the WAMHS . . . . . . . . . . . . . 140

2.5.3.2.2.1 Model diagnostic plots for the association between OCA2

rs1800401 and Breslow thickness in the WAMHS . . . . . . 148

2.5.3.2.2.2 Model diagnostic plots for the association between BRAF

rs1733826 and Breslow thickness in the WAMHS . . . . . . 150

2.5.3.2.2.3 Model diagnostic plots for the association between IRF4 rs12203592

and Breslow thickness in the WAMHS . . . . . . . . . . . . 151

2.5.3.2.2.4 Interaction between rs1042522 modelled dominantly and the

presence or absence of naevi in the WAMHS . . . . . . . . . 152

2.5.3.2.2.5 Model diagnostic plots for the association between rs1052522

and Breslow thickness in the WAMHS . . . . . . . . . . . . 153

2.5.3.3.1.1 Estimated main effects statistical power under an additive

model for n=800, under varying MAF and SD . . . . . . . . 155

2.5.3.3.1.2 Estimated main effects statistical power under a dominant

model for n=800, under varying MAF and SD . . . . . . . . 156

2.5.3.3.1.3 Estimated main effects statistical power under a recessive

model for n=800, under varying MAF and SD . . . . . . . . 157

2.5.3.3.2.1 Estimated interaction effects statistical power under an ad-

ditive model for n=800, under varying MAF and SD for en-

vironmental exposure prevalence of 0.2 . . . . . . . . . . . . 158

2.5.3.3.2.2 Estimated interaction effects statistical power under an ad-

ditive model for n=800, under varying MAF and SD for en-

vironmental exposure prevalence of 0.8 . . . . . . . . . . . . 159

xxii

3.3.2.1.1.1 Confounding – the observed association between Breslow thick-

ness and vitamin D levels is due to confounding by a healthy

lifestyle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191

3.4.2.1.1 DAG for the instrumental variables model. Z1, instrumental

variable; X1, exposure of interest; Y, outcome of interest;

and U, unmeasured confounders. . . . . . . . . . . . . . . . 199

3.5.3.1 DAG for the Mendelian Randomisation model of vitamin D

levels and Breslow thickness . . . . . . . . . . . . . . . . . . 208

xxiii

Glossary

Allele : alternative forms of DNA occurring at a single locus.

Association study : an investigation into the statistical association between an

allele and a phenotypic trait.

Bonferroni correction : a multiple testing correction method, where the false

positive rate is divided by the number of tests to form a new level of statistical

significance.

Candidate gene : a gene which can be reasonably posited to be involved in the

genesis of a phenotypic trait or disease on the grounds of biological plausibility.

Chromosome : a linear thread of DNA chains and proteins.

Complex disease : a disease which involves multiple genetic and environmental

factors, and does not exhibit a classic Mendelian pattern of inheritance.

Diplotype : the pair of haplotypes for a particular stretch of the chromosome.

Direct association : an observed association between a locus and a phenotypic

trait which is due to the causative nature of the locus.

Exposure : a factor or situation of potential aetiological contribution to a given

xxiv

trait or disease to which an individual is exposed.

Gene : a region of the genomic sequence which is inherited and is associated with

regulatory regions, transcribed regions, or other functional sequence regions.

Genome : the total genetic information carried by an individual.

Genotype : the combination of alleles present at one locus.

Haplotype : a series of alleles at linked loci along a single chromosome.

Hardy-Weinberg Equilibrium Principle : a principle which describes the dis-

tribution of genotypes at a locus in terms of allele frequencies.

Heterozygous : if the two alleles at a particular locus are different, the individual

is heterozygous at that loci.

Homozygous : if the two alleles at a particular locus are the same, the individual

is homozygous at that loci.

Indirect association : an observed association between a locus and a pheno-

typic trait as a result of linkage disequilibrium between the measured locus and

some unmeasured locus which has a direct effect on the trait.

xxv

Linkage : the situation where two syntenic loci are inherited together, such that

they are close on a chromosome so that recombination during meiosis is uncom-

mon.

Linkage disequilibrium : the increased frequency of haplotypes within a pop-

ulation due to co-inheritance of linked alleles.

Linkage study : a statistical study analysing the co-segregation of genetic loci

within families to infer their position in the genome.

Locus : a unique chromosomal location defining the position of an individual gene

or DNA sequence.

Mendelian randomisation : a method of using non-experimental studies to

examine the causal effect of a modifiable exposure on disease by using genotypes.

Penetrance : the proportion of individuals carrying an allele who will express

the phenotype associated with that allele.

Phase : the haplotypic configuration of linked loci.

Phenotype : a measurable characteristic of an individual, also referred to as a

trait.

xxvi

Recombination : the exchange of genetic information between two chromosomes

during meiosis.

Relative risk : a measure of disease-exposure relationship estimating the mag-

nitude of an association between exposure and disease.

Simple disease : a disease which exhibits a classic Mendelian pattern of inheri-

tance, where the inheritance of a mutation in a single gene will result in a certain

phenotype.

Single nucleotide polymorphism : a DNA variant that represents variation in

a single nucleotide.

xxvii

Abbreviations

AIC Akaike Information Criterion

AMS Atypical Mole Syndrome

BHS Busselton Health Study

BCC Basal Cell Carcinoma

BMI Body Mass Index

BRAF B-Raf proto-oncogene serine

CDK4 Cyclin-dependent kinase 4

CDKN2A Cyclin-dependent kinase inhibitor 2A

CEPH Centre d’Etude du Polymorphisme Humain

CEU Caucasian-European

CI Confidence Interval

DAG Directed Acyclic Graph

DNA Deoxyribonucleic Acid

EGF Epidermal Growth Factor

EM Expectation Maximisation

FDR False Discovery Rate

GWAS Genome Wide Association Study

HWE Hardy-Weinberg Equilibrium

IRF4 Interferon Regulatory Factor 4

IV Instrumental Variable

MAF Minor Allele Frequency

MC1R Melanocortin 1 Receptor

MTAP Methylthioadenosine Phosphorylase

OCA2 Oculocutaneous Albinism II

PLA2G6 Phospholipase A2, Group VI (cytosolic, calcium-independent)

xxviii

RCT Randomised Controlled Trial

RNA Ribonucleic Acid

RPH Royal Perth Hospital

QEII Queen Elizabeth II

SCC Squamous Cell Carcinoma

SE Standard Error

SLC2A4 Solute Carrier Family 2

SNP Single Nucleotide Polymorphism

TP53 Tumor Protein 53

TPCN2 Two Pore Segment Channel 2

TYR Tyrosinase (oculocutaneous albinism IA)

TYRP1 Tyrosinase-Related Protein 1

WADB Western Australian DNA Bank

WAGER Western Australian Genetic Epidemiology Resource

WAMHS Western Australian Melanoma Health Study

xxix

Acknowledgements

I must first thank my coordinating supervisor Professor Lyle Palmer, who took

me on as an Honours student in 2004, and then became my PhD supervisor in

2006. Thank you for your guidance, your enthusiasm, and your wisdom. I am so

fortunate to have a supervisor who has such belief in my abilities and who has

always encouraged me to be the best I can be.

To my co-supervisor, Dr Judith Cole, thank you for your assistance with estab-

lishing the study, and for reading my thesis so carefully and providing me with

invaluable feedback. I also need to thank Dr David Preen who stepped in and

became my coordinating supervisor in the final weeks of my PhD. Thank yous

must also go to Dr Steven Wiltshire and Dr Samuel Mueller who were part of my

supervisory team at various stages.

I could not have completed this PhD if it were not for the generosity of the study

participants who gave their time to be part of this study. Recruitment of study

participants would not have been possible if it were not for the kindness and assis-

tance of Dr Tim Threlfall and staff from the Western Australian Cancer Registry

who welcomed the study team into their workplace for three years.

I am very grateful to have spent the four years of my PhD at the Centre for Ge-

netic Epidemiology and Biostatistics. I have had the opportunity to work with

many wonderful people. Without the assistance of many of you, I would not have

completed this thesis.

xxx

Sarah, thank you for all the help you have given me during the past four years –

both in your role as study coordinator and for being so generous to read my many,

many thesis drafts. I would never have survived the past four years without you.

Thank you, thank you, thank you.

To my other friends from the Centre – Nicole, thank you for your friendship, and

for always being happy to answer my many, many questions – I promise that will

stop now. Jules, thank you for your encouragement and wisdom. Pam, thank you

for helping me when I first started my PhD and for allowing me to incorporate my

work into SimHap. Also, thank you to the boys in the Informatics team, without

whom I would not have been able to store or retrieve my data.

I wish to thank my parents, Sue and Alan, my brothers Dan and Matt, their

families, and Guy, for realising very early on that they should never ask me how

my thesis is going. Thank you for believing in me and for supporting me through

all my studies.

xxxi

Preface

Some of the work in this thesis has been prepared for publication.

Section 2.3 describes the establishment of the Western Australian Melanoma

Health Study, and this work has been accepted for publication. This is a jointly

co-authored paper by the author and Ms Sarah Ward. The author and Ms Ward

jointly wrote this paper, and the author of this thesis performed all analyses. The

other authors of this paper, Ms Amanda Lee, Dr Judith Cole, Dr Jane Heyworth,

Professor Michael Millward, Dr Fiona Wood, and Professor Lyle Palmer all pro-

vided feedback on the paper to prepare it for publication.

In addition, Section 2.3 also contains work contributed by others. Some of the

SNPs genotyped in this thesis were selected by the author, and others were se-

lected as part of the Western Australian Melanoma Health Study. Genotyping

was performed by the PathWest Molecular Genetics Service in Perth.

The Busselton Health Study data used in Section 2.4 was collected by the Bus-

selton Population Medical Research Foundation and was made available to the

author for use in this thesis.

All analyses in this thesis were performed and the results interpreted by the au-

thor.

The R library, MRsnphap, was developed by the author, using already established

xxxii

statistical methods. The MRhap function in this library includes haplotype esti-

mating functions from SimHap, which were developed by Dr Pamela McCaskie.

1CHAPTER 1

Introduction

It has been well established that our genes and environment contribute to the

pathogenesis of disease. The collection of population-based data is integral to fully

understand the role these factors play in both disease susceptibility and progno-

sis. This thesis describes the establishment of the Western Australian Melanoma

Health Study which is used to investigate the role of genetic, environmental and

host factors in melanoma susceptibility and prognosis. In addition, this thesis

describes a statistical technique which can be used to better understand causal

relationships between disease outcomes and modifiable exposures.

1.1 Introduction to Epidemiology

Epidemiology, literally meaning the ‘study of what is upon the people’, can be for-

mally defined as the ‘study of the distribution and determinants of health-related

states or events in specified populations, and the application of this study to con-

trol of health problems’ [1]. As early as the 5th century BC, the Greek physician

Hippocrates, often regarded as the world’s first epidemiologist, observed the ef-

fect of the environment on human health [2]. Since then, epidemiology has led to

a greater understanding of the distribution, causes and control of diseases. No-

table examples include Snow’s observations of the link between drinking water

and cholera in England in 1854 [3], the association between smoking and lung

cancer by Doll and Hill in 1950 [5], and the more recent discovery of the causal

link between certain human papillomavirus strains and cervical cancer [10].

Epidemiological studies fall into two major types of studies – intervention and

observational studies, and these are discussed further in Section 3.3 (Please see

2 Chapter 1. Introduction

page 185). One type of intervention study, Randomised Controlled Trials (RCT),

are often considered the ‘gold standard’ of epidemiological studies for aetiological

investigations. However, observational studies are often an effective precursor to

RCTs, and may even take the place of RCTs when it is not ethical or feasible

to conduct a RCT. The major difference between these studies is their ability to

determine causal associations, which is discussed in the next section.

1.1.1 Causality in epidemiology

Epidemiological associations between disease outcomes (for example, disease risk

or some quantitative measure of disease) and modifiable exposures are often ob-

served. However for these associations to make a public health impact by leading

to a reduction in disease, it is critical that these associations are causal. Causality

is an integral concept in epidemiology, and there are a number of aspects that

should be considered when determining if a significant association observed be-

tween an outcome and an exposure may be causal.

One commonly used set of conditions was described by Hill and includes consider-

ing the strength, consistency, temporality and plausibility of the association [11].

Hill stated that these conditions did not always need to be met; instead the

conditions should be considered when gathering evidence for causal associations.

However, for an association to be causal, it would seem necessary to have tem-

porality, that is, that the exposure occurred before the disease. Observational

epidemiological studies of disease outcomes are often unable to fully satisfy the

complete set of Hill’s conditions. For example, the strength of the observed asso-

ciation (e.g. odds ratio) is often small, and it is often not possible to determine a

temporal relationship between the exposure and the disease outcome. In view of

1.2. Introduction to Genetic Epidemiology 3

this, sophisticated statistical techniques, such as Mendelian randomisation, can

be used to provide evidence for a causal association; these are discussed further

in Chapter 3 (please see page 183).

1.2 Introduction to Genetic Epidemiology

There are many definitions of genetic epidemiology, including the definition by

Morton [12], who defined genetic epidemiology as ‘a science which deals with the

aetiology, distribution, and control of disease in groups of relatives and with in-

herited causes of disease in populations’. However, it is important to include the

role of environmental risk factors in the study of disease; therefore in this thesis,

genetic epidemiology is defined as the study of the genetic and environmental

causes of human disease, and their interactions.

During the 1980s, genetic epidemiology emerged as a new discipline incorporating

both genetics and traditional epidemiology. However, the importance of a dis-

cipline combining genetics and epidemiology, namely ‘epidemiological genetics’,

was first described as early as 1954 by Neel and Schull [14]. At this time, Neel

and Schull identified four main criteria to be used by epidemiologists to infer the

genetic causes of disease, namely:

1. the occurrence of the disease in definite numerical proportions among indi-

viduals related by descent,

2. failure of the disease to spread to non-related individuals,

3. onset of disease at a characteristic age without a known precipitating event,

and

4 Chapter 1. Introduction

4. greater concordance of the disease in identical than in fraternal twins.

Discovery of genetic and environmental factors associated with disease has the

ability to aid in the prevention of disease [15]. An individual who is at risk of a

disease due to their genetic makeup may be able to alter their lifestyle to ensure

the disease does not present, or does not have a large influence on their life. An

example of this is phenylketonuria which can be managed by a diet low in pheny-

lalanine, with little or no side effects [16]. Additionally, cardiovascular disease

may be prevented by diet and lifestyle modifications [17].

Knowledge of the genetic causes of disease can aid in disease diagnosis [18]. Iden-

tification of the genetic causes of disease enables high-risk individuals to be tested

before the onset of the disease, and may even help predict the probable age at

onset and probable severity of disease. Diseases such as Huntington’s disease and

cystic fibrosis are two diseases that can be diagnosed with a simple blood test

before the disease has become symptomatic [19,20].

Finally, knowledge of an individual’s genetic makeup can enable personalised

medicine; that is, matching therapeutic treatments to an individual’s genetic

makeup so that individuals can receive more effective treatments (often phar-

macological), which may result in a more successful disease treatment [21]. An

example of this is the drug abacavir which is used to treat individuals with human

immunodeficiency virus. Hypersensitivity reactions to abacavir in individuals are

dependent upon HLA-B*5701 genotypes. Therefore, it is recommended that prior

to commencing abacavir therapy, patients are screened for this mutation, so that

the drug can be given only to individuals who are not at high-risk of developing

1.3. Genetic Concepts and Definitions 5

hypersensitivity [22]. A recent study found this method significantly reduced the

prevalence of hypersensitivity, from 7.8% in the non-screened group to 3.4% in

the screened group [23].

1.3 Genetic Concepts and Definitions

This section presents an introduction to some fundamental genetic concepts and

terminology relevant to this thesis. Terms not defined here may be found in the

glossary (please see page xxiii).

1.3.1 Genes and alleles

A human individual possesses 23 pairs of chromosomes, one of each pair inherited

from their mother and the other from their father. Chromosomes are individual

structures consisting of deoxyribonucleic acid (DNA) chains and proteins. The

entire sequence of DNA across all chromosomes is referred to as the genome.

Contained by and stored along chromosomes are genes. Genes are the basic unit

of genetic information and consist of a sequence of DNA. Within each gene there

are four chemical building blocks called nucleotides. Each of the four nucleotides

differ in their nitrogen-containing base. These bases are adenine (A), cytosine (C),

thymine (T) and guanine (G).

It is the order of these bases which contains the genetic information responsible

for the developmental and metabolic processes of an organism. An allele is one of

the possible forms of DNA sequence that can exist at a particular position along

a gene, or genetic locus. Therefore, an individual will have two alleles at each

6 Chapter 1. Introduction

genetic locus, one allele inherited from each parent.

The two alleles at a genetic locus may come from a possible one, two or many dif-

ferent alleles. However, for simplicity, we will assume there are only two possible

alleles at each genetic locus. Therefore, an individual can have three possible ar-

rangements of these alleles on homologous chromosomes, referred to as genotypes.

By denoting A and a to be the possible alleles at a locus, an individual can have

either the AA, Aa or aa genotypes. If an individual has two copies of the same

allele (AA or aa), they are said to be homozygous. An individual with different al-

leles (Aa) is heterozygous. A phenotype is an observable biological or physiological

trait, most of which are influenced by a combination of an individual’s genotype

and their environment.

During gamete cell formation (i.e. ova and sperm), in which an individual inherits

one pair of each chromosome from their mother and one from their father, the

chromosomes exchange segments of DNA. This process is referred to as recombi-

nation. Recombination is a function of the distance between two loci, with loci

closer together experiencing less recombination than loci further apart. Loci that

are physically close along a chromosome, such that recombination between these

occurs less often than chance (i.e. 50%), are said to be linked. These loci will tend

to cosegregate within families, and this is referred to as linkage. Linkage studies

are a type of genetic study in families which exploit linkage, and this is discussed

further in Section 1.4.2 (please see page 17).

1.3. Genetic Concepts and Definitions 7

1.3.2 Genetic variation

A mutation in a gene, also known as a genetic variant, may cause a change in

the gene function. If this is a significant change, the gene may cause disease or

predispose an individual to disease.

The penetrance of a genetic variant refers to the proportion of individuals with

the genetic variant who will develop the disease. For example, if the penetrance

of a genetic variant is 0.1, then it is estimated that 10% of individuals with the

variant will also develop the disease; this would be referred to as low penetrance.

Penetrance values of approximately 1 indicate complete penetrance; this occurs

when all individuals with the variant develop the disease.

The relative risk of a genetic variant refers to the ratio of the disease rate in indi-

viduals with the variant to the disease rate in individuals without the variant. For

example, if we take a group of 100 individuals, and the probability of developing

disease with the risk variant is 20%, and the probability of developing disease

without the risk variant is 5%, then the relative risk will be 20/5 = 4. This means

that carriers of the disease variant are four times as likely to develop disease as

individuals without the disease variant.

There are many different types of mutations, including point mutations, insertions

and deletions. In this thesis, I deal only with a type of point mutations – single

nucleotide polymorphisms – and combinations of these as haplotypes. These are

described in the next sections.

8 Chapter 1. Introduction

1.3.2.1 Single nucleotide polymorphisms

A single nucleotide polymorphism, or SNP, is a small genetic change which occurs

within an individual’s DNA sequence, such that a single nucleotide differs between

individuals. Many SNPs are thought to have minimal effect on cell function, but

it is believed that others may cause disease, predispose an individual to disease,

or influence their response to drugs.

Suppose that the majority of individuals are homozygous for the A allele at a

locus. If a mutation occurs and another allele, such as a G, replaces one or both

of the A alleles, then this would be referred to as a SNP. In this case, the G allele

would be referred to as the minor or variant allele. The minor allele frequency

(MAF) of a SNP refers to the frequency of the less common or variant allele in a

given population. SNPs can be used as markers in genetic analysis to find asso-

ciations between genetic variation and the observed phenotypes of an individual,

and this is discussed further in Section 1.4.3 (please see page 18).

The effect of genes on phenotype expression can be either additive, co-dominant,

dominant or recessive. An additive effect is one where the expression of the phe-

notype increases or decreases linearly as the number of variant alleles increases,

while a co-dominant effect occurs when the expression of the phenotype increases

or decreases as the number of variant alleles increases, but not necessarily in a

linear pattern.

A dominant effect means that only one copy of the variant allele need be present

to change the expression of the phenotype. A recessive effect occurs when the

1.3. Genetic Concepts and Definitions 9

expression of the phenotype is altered only when two copies of the variant allele

are present.

Figure 1.3.2.1.1 shows the transmission of a dominant variant through a family.

A dominant effect requires only one copy of the variant allele, so the affected

father has one copy of the allele, and the unaffected mother carries no copies.

The four children represent the possible results from mating between affected and

unaffected individuals; that is, on average, 50% of the children will be unaffected

by disease, and 50% will be affected by disease.

Figure 1.3.2.1.1: Inheritance of a dominant effect in a family

Figure 1.3.2.1.2 shows the transmission of a recessive variant through a family.

Individuals who carry variant alleles, but do not have the disease are referred to

10 Chapter 1. Introduction

as carriers. A recessive effect requires two copies of the variant allele, so both

parents are carriers as they carry one disease allele and one non-disease allele.

The four children represent the possible results from mating between two carriers;

that is, on average, 25% of the children will be unaffected by disease, 50% will be

carriers, and 25% will be affected by disease.

Figure 1.3.2.1.2: Inheritance of a recessive effect in a family

1.3.2.2 Haplotypes

As a result of recombination during meiosis, each chromosome of a pair contains

a mixture of alleles from each parent. Syntenic alleles (i.e. on the same chro-

mosome) which have not taken part in recombination and have therefore been

inherited as a unit, are referred to as haplotypes. A pair of haplotypes is called a

diplotype.

1.3. Genetic Concepts and Definitions 11

The concept of a haplotype is best illustrated in an example. Consider two bi-

allelic loci with three possible genotypes at each locus; AA, Aa or aa at the first

locus and BB, Bb or bb at the second locus. To form a haplotype, one allele is

taken from each genotype and these are paired together. The remaining alleles in

the genotypes are also paired together to form a second haplotype.

Genotype BB Bb bb

AA AB AB AB Ab Ab Ab

Aa AB aB AB ab or Ab aB Ab ab

aa aB aB aB ab ab ab

Table 1.3.2.2.1: Illustration of haplotypes derived from two bi-allelic loci or SNPs

From Table 1.3.2.2.1, we can see that an individual who has the AA genotype

at one locus and the BB genotype at the second locus, will have the diplotype

AB AB. This means that this individual inherited one AB haplotype from each of

their parents. An individual who has the Aa genotype at one locus and the Bb

genotype at the second locus will have one of two possible diplotypes, AB ab or

Ab aB. Therefore we cannot be certain which alleles were inherited together from

the same parent. It may be that the individual inherited the AB haplotype from

one parent and the ab haplotype from their other parent or, alternatively, they

may have inherited the Ab haplotype from one parent and the aB haplotype from

their other parent.

The phase of a haplotype refers to the combination of alleles that an individual

inherits together from each parent.

12 Chapter 1. Introduction

1.3.2.3 Haplotype inference

It is not always possible to infer haplotype phase unambiguously. Recalling from

Figure 1.3.2.2.1, if an individual is homozygous at one or both loci, then their

haplotype pair, or phase, can be constructed with certainty. However, if the in-

dividual is heterozygous at both loci, then their haplotype pair is referred to as

phase-ambiguous. This is because the individual inherited either the AB ab or the

Ab aB diplotype.

Several methods can be employed to construct these haplotypes with certainty,

including collecting data from family members or the use of laboratory techniques

to isolate the haplotypes to a particular chromosome, however these procedures

are often costly and time consuming [24]. As a consequence, several statistical

procedures have been developed in order to estimate the phase of diplotypes from

available genotypic information.

The three main statistical methods of haplotype inference are Clark’s Algorithm

[25], a pseudo-Bayesian algorithm by Stephens et al. [26], and an Expectation

Maximisation (EM) algorithm by Excoffier and Slatkin [24]. Clark’s Algorithm is

prone to several problems, for example, the algorithm cannot begin if there are

no individuals who are homozygous at all loci. The pseudo-Bayesian algorithm

also exhibits problems, particularly with the calculation of distributions. Niu et

al. [27] have shown that the conditional distributions calculated do not correspond

to a proper joint distribution; also the distributions calculated may depend on the

order of the genotypes [28]. The EM algorithm appears to exhibit less problems,

and is utilised by the statistical software package, SimHap, which is introduced in

1.3. Genetic Concepts and Definitions 13

Section 3.5.7 (please see page 214) and described further in Section 3.5.7.2 (please

see page 216).

1.3.3 Hardy-Weinberg equilibrium principle

The Hardy-Weinberg Equilibrium (HWE) principle is a fundamental concept in

population genetics, describing the distribution of genotypes in a population. The

basic idea of this principle is that allele and genotype frequencies remain constant

from one generation to the next, if the following conditions are met [29]:

1. The population is large enough so that sampling variation is negligible,

2. mating within the population occurs at random,

3. there is no selective advantage for any genotype; all genotypes are equally

viable and fertile, and

4. there is an absence of factors (‘evolutionary forces’) such as mutation, mi-

gration and random genetic drift.

Consider a locus consisting of two alleles, A and a, and let the frequencies of these

alleles be represented by p and q, respectively, where q=1-p. The HWE princi-

ple states that the genotype frequencies of AA, Aa and aa will be in proportions

p2, 2pq and q2, respectively. As long as the above four conditions are met, then

these genotype frequencies will remain constant generation after generation. De-

viations from HWE can be tested using Pearson’s Chi-Square or an exact test;

if the observed genotype frequencies are significantly different from the expected

frequencies then we would conclude the population is not in HWE. Evidence for

14 Chapter 1. Introduction

deviations from HWE may point to the violation of one of the above four as-

sumptions, or undetected genotyping errors; both of which could bias statistical

analyses [30].

1.3.4 Linkage disequilibrium

Linkage disequilibrium is a measure of the association between two alleles in a

population. Linkage disequilibrium occurs when some combinations of alleles on

the same chromosome (i.e. a haplotype) occur more often than would be ex-

pected by chance. Linkage disequilibrium results from ancestral recombination

events. Both linkage and linkage disequilibrium measure cosegregation, however

linkage measures cosegregation within families, while linkage disequilibrium mea-

sures cosegregation within the population.

To explain linkage disequilibrium further, define PA, Pa, PB and Pb to be the fre-

quencies of the A, a, B and b alleles respectively, and PAB to be the frequency of

the AB haplotype. When there is no evidence for disequilibrium, that is, when al-

leles are in equilibrium, there will be independence between the allele frequencies

at the two loci. This means that PA|B = PA, or the probability of an individual

having the A allele given they have the B allele, is just equal to the probability

of having the A allele. Also, at equilibrium, PAB = PAPB, or the probability of

having the AB haplotype, is the product of the probabilities of having both the A

allele and the B allele.

When linkage disequilibrium is present, the probability of observing the AB hap-

lotype is PAB = PAPB + D, where D is the linkage disequilibrium coefficient and

1.4. Gene Discovery in Human Disease 15

measures the strength of disequilibrium. As D is highly dependent on allele fre-

quencies, D can be standardised to D′ (referred to as Lewontin’s D′) [31], by

D′ = DDmax

, where Dmax is the largest possible value that D can take and is equal

to min(PAPb, PaPB). D′ will take a value between 0 and 1, with a larger value

indicating stronger linkage disequilibrium. If D′ equals zero, then there is random

association between different alleles at different loci and we have equilibrium. A

D′ of 1 occurs when the two loci are perfectly correlated, and any value of D′

above 0.8 is generally considered evidence for strong disequilibrium.

Another measure of linkage disequilibrium is the correlation coefficient between

two loci, r [4] and is defined by DPA−Pa−P−BP−b

, where PA− is the probability of

the AB or Ab genotype, Pa− is the probability of the aB or ab genotype, P−B is

the probability of the AB aB genotype, and P−b is the probability of the Ab or

ab genotype. Similar to the standard correlation coefficient, r may take a value

between -1 and 1.

Evidence for linkage disequilibrium can be helpful in discovering disease genes,

and this is described further in the following section.

1.4 Gene Discovery in Human Disease

The recent sequencing of the human genome has facilitated a dramatic acceleration

in human disease gene discovery. This section describes some analytic methods

used to investigate and identify the genetic causes of human disease.

16 Chapter 1. Introduction

1.4.1 Simple and complex disease

Human diseases are often categorised into two types - simple disease and com-

plex disease. The terms ‘simple’ and ‘complex’ refer to the genetic nature of the

disease. Simple diseases are caused by a mutation in a single gene, and are often

referred to as monogenic or Mendelian diseases. The variants that cause simple

diseases are rare, and generally have high penetrance. These diseases are inher-

ited based on a known pattern of inheritance, whereby disease clearly segregates

within families. Examples of simple genetic diseases are cystic fibrosis which has

a recessive pattern of inheritance [32], and Huntington’s disease which has a dom-

inant inheritance pattern [33].

Complex diseases are caused by variation in many genes, and by environmental

factors and interactions of these; and it is likely that each individual genetic locus

contributes only modestly to the disease. Complex diseases are generally common

in a population relative to monogenic diseases and include cardiovascular disease,

asthma and many cancers, including melanoma. Complex diseases do not follow

a known pattern of inheritance, and while these diseases are more common in

relatives of those with the disease, they do not segregate within families with a

clear pattern of inheritance.

Due to the differences between simple and complex diseases, namely, that simple

diseases are rare and tend to segregate within families, while complex diseases

commonly occur in the general population, different gene discovery methods are

required. Two of these methods, linkage analysis and genetic association studies,

are described in the following sections – Section 1.4.2 and 1.4.3.

1.4. Gene Discovery in Human Disease 17

1.4.2 Linkage analysis

Traditionally, the identification of simple disease genes has begun with family-

based linkage analysis, as linkage analysis uses pedigrees and can be used to locate

areas of the genome that contain disease genes. Linkage analysis exploits the con-

cept of linkage, which was introduced in Section 1.3.4. That is, loci that are close

together on a chromosome will be inherited together more often than expected by

chance, as they will have experienced lower recombination events than loci further

apart. In linkage analysis, genetic markers that are evenly distributed through

the genome are genotyped and their segregation through a pedigree is studied.

If family members with disease often have the same genetic markers, that is, if

disease cosegregates with the markers, then it is possible to infer the position of

the disease gene.

Linkage analysis has been very successful in identifying disease-gene regions for

simple diseases involving a single locus, such as Huntington’s disease. Simple dis-

ease variants are often rare and highly penetrant, and therefore the disease allele

can be localised to a small chromosomal region, and linkage markers close to the

disease allele will cosegregate with disease [34]. In contrast, linkage analysis has

had only limited success at identifying the genes that cause complex diseases [35].

This lack of success is due to a number of factors, including the low heritability of

complex disease, low resolution to localise disease alleles to a particular chromo-

somal location and inadequately powered studies due to the small effect of each

disease locus [34]. It has been shown that for linkage analysis to have enough

statistical power to detect a relative risk of 2, at least 2,500 families need to be

studied [36]. A relative risk of 2 is considered to be a large relative risk conferred

18 Chapter 1. Introduction

by a disease locus in a complex disease [37], therefore any linkage analysis with less

than 2,500 families would be unlikely to detect genetic causes of complex disease.

1.4.3 Genetic association studies

Genetic association studies are used to discover both genetic and environmental

risk factors for disease. In contrast to linkage studies that are primarily used for

family-based studies, association studies can be used for both family-based and

population-based studies. However, association studies have often been used in

conjunction with linkage analysis; that is, when a disease locus is localised to a

chromosomal region, association studies can be used to investigate the region in

order to locate the specific disease-causing allele [38]; this is referred to as fine

mapping.

There are two major types of genetic association studies - candidate-gene stud-

ies and genome-wide association studies (GWAS). Both types of studies exploit

the concept of linkage disequilibrium between alleles which was described earlier

in Section 1.4.2. In a candidate-gene study, genes suspected of being involved in

causing disease are identified using selection criteria, such as biological plausibility,

animal models, or prior associations [39]. Tests of associations between variants in

these genes and the disease outcome are then performed. More recently, genetic

association studies are being performed over the entire genome, and are referred

to as GWAS. This approach is often preferred over the candidate-gene study in

complex disease genetics, as no assumptions are made about the identity of the

causal gene.

1.4. Gene Discovery in Human Disease 19

In both types of studies, associations detected between genetic variants and dis-

ease outcomes may be due to one of three reasons. The first of these is direct

association, that is, the variant tested is functional and directly causes the out-

come. These associations tend to be the easiest and most powerful to analyse [40].

However, the prior probability that a given associated variant is itself functional

is generally low. More likely is that indirect association occurred; that is, the

variant associated with the outcome is in linkage disequilibrium with the causal

variant. The third possibility is that the association was spurious, although this

possibility can be fully or partially eliminated by applying appropriate statistical

tools, such as adjusting for multiple testing.

1.4.4 Multiple testing

In statistical analyses, the probability of a type I error or ‘false positive’ is the prob-

ability of rejecting the null hypothesis when it is actually true. When performing

one statistical test, this probability, or α, is referred to as the significance level and

is commonly set at 0.05. This means that when performing one test, the probabil-

ity that the result will be a false positive is 0.05, and so for the association to be

deemed significant, the observed p-value needs to be below this value. When per-

forming n multiple comparisons or tests simultaneously, the probability of a false

positive increases to 1− (1− α)n. Multiple testing issues are common in genetic

association studies due to the testing of multiple genetic markers simultaneously.

For example, when testing associations between 42 genetic variants or SNPs and

some outcome, the probability of at least 1 false positive is 1−(1−0.05)42=0.8840.

Traditionally, the Bonferroni correction has been used to adjust for multiple com-

parisons. In this correction, the total error rate of all tests combined is set at α,

20 Chapter 1. Introduction

so that each test has an individual type I error rate of αn. When testing 42 SNPs,

this would lead to an adjusted significance level of 0.0542

= 0.00119. Therefore,

associations with a p-value < 0.00119 would be deemed significant. The Bonfer-

roni correction is considered by some to be too conservative when the number of

genetic variants tested is high [41].

Genetic analyses in this thesis use the False Discovery Rate (FDR) method [42]

instead of the Bonferroni correction to adjust for multiple testing. The FDR

approach controls the expected proportion of false positives, compared to the

Bonferroni correction that controls the chance of any false positives assuming a

null hypothesis at any genetic loci.

The FDR approach can be used to estimate an FDR threshold known as q. The

q value is calculated as q(r) =m∗p(r)r

, where m is the number of SNPs tested and

p(r) is the p-value of the rth SNP when the SNP p-values are ordered from lowest

to highest. In this thesis, this value of q was compared to α, and the test was

deemed significant when q < α, where α was defined at the 5% significance level.

The FDR approach was used to calculate the q threshold for each SNP at the

multivariate genotypic analysis stage.

1.5 Aims

This thesis investigates the genetic epidemiology of melanoma susceptibility and

prognosis in the Western Australian Melanoma Health Study (WAMHS). In ad-

dition, this thesis describes a technique known as Mendelian randomisation, and

an application of this technique for making causal inferences in epidemiological

1.6. Outline of thesis 21

studies.

The aims in this thesis are described below:

1. To assist in establishing the WAMHS,

2. to investigate associations between candidate loci and melanoma risk in the

WAMHS sample,

3. to investigate associations between known host, environmental and genetic

melanoma-risk factors with melanoma prognosis, as reflected by Breslow

thickness, in the WAMHS sample, and

4. to implement a statistical technique known as Mendelian randomisation in

R, and to undertake novel methods development to enable the use of both

SNP and haplotypes to model causality in epidemiological studies.

1.6 Outline of thesis

Chapter 2 of this thesis begins with a description of melanoma incidence and mor-

tality rates and follows with a comprehensive review of the genetic epidemiology

of melanoma susceptibility and prognosis. The WAMHS is described, and in the

following sections, these data are used to investigate the role of genetic variants

in melanoma susceptibility. WAMHS data are also used to test the hypothesis

that the genetic, environmental and host factors that increase melanoma risk are

also associated with Breslow thickness, which is the main prognostic feature of

melanoma. This chapter finishes with a summary of the key findings and their

implications in melanoma research.

22 Chapter 1. Introduction

In Chapter 3, a background to epidemiological studies, in particular observational

epidemiological studies, is presented. I then described a statistical method known

as Mendelian randomisation that can be used to detect causal associations between

disease outcomes and modifiable exposures in epidemiological association studies.

The implementation of this method in an R library, MRsnphap, is also described,

along with novel methods work to extend Mendelian randomisation to haplotypes.

Chapter 4 summarises the main results of this thesis and describes areas for further

research, including possible further development of MRsnphap.

23CHAPTER 2

Genetic Epidemiology of Malignant Melanoma:

Susceptibility and Prognosis in the WAMHS

This chapter begins with a description of the incidence and mortality of melanoma

in recent years, followed by a review of the genetic, host and environmental risk

factors for melanoma development and prognosis. Establishment of the WAMHS

is then described. Following this, I took known candidate variants for melanoma

risk and tested whether these genetic variants were associated with melanoma in

a sub-sample of the WAMHS. I also used the WAMHS data to investigate the role

of melanoma-risk factors in melanoma prognosis. The results of these analyses

are then presented and discussed.

2.1 Introduction

Melanoma is a significant public health issue in Australia and worldwide. Aus-

tralia has the highest incidence rate of melanoma in the world, with over 10,000

cases diagnosed each year [43]. Melanoma was first described as a disease by Rene

Laennec in 1804 [44], however the first recorded operation on metastatic melanoma

was performed some years earlier in 1787 by John Hunter [45]. As early as 1840,

the British surgeon Samuel Cooper recognised that ‘the only chance for benefit de-

pends upon early removal of disease’ [46]; to date, early excision of the melanoma

tumour is the standard treatment, and is the only potentially curative treatment

for melanoma [47].

Melanoma is a complex disease resulting from mutations in many genes, environ-

mental and host factors and their interactions. In 1956, John Lancaster observed

a geographical distribution of melanoma mortality which supported the role of

24Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

sun exposure in causing melanoma [48]. Sun exposure is still considered to be the

main environmental risk factor for melanoma, however its interactions with other

environmental, host and genetic factors and their role in melanoma development

and prognosis are less clearly understood. To date, several genes for melanoma

have been identified, however mutations in these genes are rare and only account

for a small proportion of melanoma diagnoses [49]. Population-based studies of

melanoma cases may help to elucidate the role of environmental, host and genetic

risk factors of melanoma development and prognosis.

2.2 Literature Review

2.2.1 Skin cancer

It is estimated that 2 in 3 Australians will be diagnosed with skin cancer be-

fore the age of 70 [50]. Cutaneous Malignant Melanoma (hereafter referred to as

melanoma), Basal Cell Carcinoma (BCC) and Squamous Cell Carcinoma (SCC)

are the three main types of skin cancers and accounted for approximately 82% of

all cancer diagnoses in Australia in 2002 [50]. Together, the non-melanoma skin

cancers, including mostly BCC and SCC, accounted for approximately 67% and

30% of Australian skin cancer diagnoses, respectively [51]. Melanoma is the rarest

form of these skin cancers and accounted for approximately 3% of Australian skin

cancer diagnoses [51].

While BCC and SCC are the most common types of skin cancer, mortality rates

from these cancers are much lower than those of melanoma. In 2005, approxi-

mately 3.1 per 100,000 males and 1 per 100,000 females died from non-melanoma

skin cancers [52]. In contrast, mortality rates from melanoma were approximately

2.2. Literature Review 25

three-fold those of the non-melanoma skin cancers, at 8.9 per 100,000 males

and 3.5 per 100,000 females [51]. In real terms, 411 Australians died from non-

melanoma skin cancers in 2005, while 1,262 Australians died from melanoma. As

such, it is important from a clinical and public health viewpoint that the mecha-

nisms underlying melanoma, in particular melanoma susceptibility and prognosis,

are more clearly understood.

2.2.2 Melanoma biology

Melanoma is a complex and aggressive form of cancer of the melanocytes. In the

skin, melanocytes are cells found in the deepest layer of the epidermis (See Figure

2.2.2.1). When exposed to sunlight, melanocytes produce and supply melanin to

protect the keratinocytes from mutation due to ultraviolet radiation. Melanin is

the pigment found in the skin, hair and eyes and once produced, is responsible for

tanning.

The Clark model [53,54] has been used to describe the progression of melanoma in

five stages, from normal melanocytes to development of metastatic melanoma. It is

estimated that up to 70% of melanomas may arise de novo from normal-appearing

skin, although this is difficult to determine [55–57]. Therefore, the development of

melanoma may begin in either stage 1 or stage 2 of the Clark’s model. In addition,

not all melanoma tumours experience radial or vertical growth [53]. As such, the

Clark model should be viewed as a general framework for melanoma growth, with

not all tumours progressing through all stages. The stages of Clark’s model are

shown in Figure 2.2.2.2.

26Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.2.2.1: Cross-section of the skin highlighting the epidermal layer

The five stages of the Clark model are described below:

1. Benign naevus – structurally normal melanocytes proliferate to form be-

nign naevus. These naevi may be flat or slightly raised lesions, are generally

a tan or dark brown colour, and rarely develop into melanoma.

2. Dysplastic naevus – aberrant growth begins in either a pre-existing be-

nign naevus, or in a new location. These lesions may be varying colours,

asymmetrical, or have irregular borders, and are generally 5mm or greater

in diameter.

3. Radial growth - cells acquire the ability to grow into the epidermis. If the

melanoma is removed before cells have spread outside of the epidermis, the

2.2. Literature Review 27

Figure 2.2.2.2: Clark model of the five stages of melanoma development, from

normal melanocytes to metastatic melanoma [54]

melanoma is referred to as in situ melanoma.

4. Vertical growth - lesions acquire the ability to grow into a deeper layer of

the skin, the dermis, and may also grow into the subcutis. These melanomas

are referred to as invasive melanomas, and are the focus of this thesis.

5. Metastatic melanoma - cancerous cells spread to other areas of the skin

and internal organs through the lymphatic system and via the blood stream,

where they can proliferate.

2.2.3 Melanoma incidence and mortality

2.2.3.1 Melanoma in Australia

Australia has the highest incidence of melanoma in the world [43] and this is ris-

ing [51]. Melanoma incidence rates in Australia are more than double those in

28Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

the United States of America, and about four times those in the United Kingdom

and Canada. Incidence rates for these countries and selected others are shown

in Figure 2.2.3.1.1. The age-standardised rates presented in Figure 2.2.3.1.1 are

based on the 2001 World Standard Population. To allow for comparisons between

Australian and Western Australian incidence and mortality rates, all rates here-

after are based on the 2001 Australian Standard Population.

Melanoma incidence rates in Australia and New Zealand are significantly higher

compared to other countries in the world. This may be due to a number of rea-

sons, including environmental, geographic, and cultural reasons. For example,

ultraviolet exposure is a known risk factor for melanoma [65], and ultraviolet ex-

posure levels are higher at lower latitudes. In addition, many Australians and

New Zealanders tend to have fair skin due to European ancestry, and therefore

their skin is naturally more suited to areas of low ultraviolet exposure. However,

even with skin poorly suited to the high levels of ultraviolet exposure in Australia

and New Zealand, many Australians and New Zealanders participate in outdoor

recreational activities throughout the year from childhood to adulthood, which

increases ultraviolet exposure and therefore melanoma incidence.

An estimated 10,684 new cases of melanoma were diagnosed in Australia in 2005.

Melanoma was the third most diagnosed cancer (excluding SCC and BCC) in

2005 in both Australian males and females, after prostate and colorectal can-

cer in males, and breast and colorectal cancer in females. Melanoma accounted

for approximately 10% of cancer diagnoses [51]. Males are approximately 1.5

times more likely to be diagnosed with melanoma compared to females, with 2005

age-standardised incidence rates of 60.9 per 100,000 males and 42.5 per 100,000

2.2. Literature Review 29

Figure 2.2.3.1.1: Melanoma incidence rates for selected countries for males and

females. Incidence rates are presented as age-standardised rates (per 100,000 per-

sons) based on the 2001 World Standard Population. Data obtained from Globocan

2002 [43].

females [51]. While the incidence of melanoma is significantly higher in males

compared to females in Australia, New Zealand, and the United States of Amer-

ica, female incidence tends to be higher than male incidence in European countries

(2.2.3.1.1). The reasons for this remain unclear.

Melanoma incidence has been increasing steadily since 1982 (see Figure 2.2.3.1.2),

with 2005 age-standardised rates approximately double those in 1982. Currently,

it is predicted that 1 in 26 Australians will develop melanoma before the age of

75 [51], however this risk was 1 in 35 in 1990 [58], and 1 in 49 in 1982 [59]. It

30Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

is predicted that incidence rates will continue to rise in Australia at least until

2011 [51]. It is evident from Figure 2.2.3.1.2 that in the early 1980s, incidence in

males and females was comparable, and it was only in 1985 that male incidence

increased in comparison to female incidence. As this disparity in male and female

melanoma incidence occurred only recently, it is not likely that the higher inci-

dence in males is driven by underlying biological differences between the sexes,

but may be due to an increase in ultraviolet exposure in males.

Figure 2.2.3.1.2: Melanoma incidence rates from 1982 to 2005 for males and females.

Incidence rates are presented as age-standardised rates (per 100,000 persons) based

on the 2001 Standard Australian Population. Data obtained from the Australian

Institute of Health and Welfare COGNOS data cube [60].

Melanoma is a cancer that affects both young and middle-aged individuals, unlike

most other cancers which tend to affect mainly older adults. In 2005, melanoma

was the most common cancer diagnosed in males aged between 16 and 51, and

2.2. Literature Review 31

females aged between 17 and 33 [51]. While melanoma is the most common can-

cer in these age groups, overall melanoma diagnoses are more common in older

individuals. Figure 2.2.3.1.3 shows melanoma incidence rates for each five-year

age group in 2005. Melanoma incidence increased linearly for females over all age

groups, while male rates increased at a similar rate to females until about age 50,

when male rates increased sharply.

Figure 2.2.3.1.3: Melanoma incidence rates by age-group in 2005 for males and fe-

males. Incidence rates are presented as age-standardised rates (per 100,000 persons)

based on the 2001 Standard Australian Population. Data obtained from Australian

Institute of Health and Welfare’s Cancer Incidence and Mortality series [59].

While melanoma is one of the most common cancer diagnoses, melanoma mortal-

ity rates are relatively low compared to other cancers, such as lung and colorectal

cancer. Mortality rates for males are over double those of females. In 2005, mor-

tality rates were 8.9 per 100,000 males and 3.5 per 100,000 females, accounting

32Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

for 3.9% and 2.4% of cancer deaths in males and females respectively [51].

The higher mortality rates in males may be due to the different characteristics of

male melanomas compared to female melanomas. For example, males are more

likely to have thicker melanomas excised, have melanomas excised from different

body sites, and have ulceration of the melanoma [271]. However, several studies

have found that even after adjustment for these factors, male sex remains an in-

dependent predictor for prognosis [270, 271], indicating that there may be other

sex-specific factors that affect mortality rates.

Figure 2.2.3.1.4 shows melanoma mortality rates from 1968 to 2005. Melanoma

mortality rates for females remained relatively stable from 1968 to 2005; however

male mortality rates have increased approximately two-fold since 1968. Melanoma

mortality rates are predicted to continue to rise slightly in males, while female

rates are predicted to remain stable [51]. The different trends in mortality be-

tween males and females is substantial, however few studies have been published

which attempt to explain this difference.

Although Australia has a much higher incidence rate than other countries, the

differences in melanoma mortality between Australia and other countries are not

as extreme. For example, the age-standardised incidence rate for melanoma in

the United Kingdom (UK) is approximately one–quarter of Australian melanoma

incidence, while the mortality rate of UK individuals with melanoma is less than

one–third of Australian mortality (8.9 in Australian males and 3.5 in Australian

females compared to 3.0 in UK males and 2.0 in UK females) [273]. This sug-

2.2. Literature Review 33

gests that while Australians are more likely to be diagnosed with melanoma, early

detection and improved treatment of melanoma results in proportionally less Aus-

tralians dying from the disease.

Figure 2.2.3.1.4: Melanoma mortality rates from 1968 to 2005 for males and females.

Mortality rates are presented as age-standardised rates (per 100,000 persons) based

on the 2001 Standard Australian Population. Data obtained from Australian Insti-

tute of Health and Welfare’s Cancer Incidence and Mortality series [59]

Australia-wide, five-year survival rates in both males and females increased over

the 22 year period from 1982 to 2004. Between 1982 and 1986, five-year survival

rates in males were 82.2%, increasing to 89.7% between 1998 and 2004. Similarly

for females, five-year survival rates increased from 90.5% between 1982 and 1986

to 94.1% between 1998 and 2004 [61]. These increases in survival may be partly

due to the earlier detection of melanoma (i.e. thinner melanomas with better

34Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

prognosis excised), and also advances in treatment.

2.2.3.2 Melanoma in Western Australia

Western Australia has the second-highest melanoma incidence in Australia, after

Queensland [51]. Average incidence rates of all Australian states and territories

from 2001 to 2005 can be seen in Figure 2.2.3.2.1.

Figure 2.2.3.2.1: Melanoma incidence rates for all Australian states and territories

from 2001 to 2005 for males and females. Incidence rates are presented as age-

standardised rates (per 100,000 persons) based on the 2001 Standard Australian

population. Data obtained from Australian Institute of Health and Welfare’s Can-

cer Report [51].

At the time of writing this thesis, 2007 melanoma incidence and mortality rates

in Western Australia were available; however due to an issue with data collec-

tion at the Western Australian Cancer Registry, the 2007 rates are believed to be

2.2. Literature Review 35

understated (further details are available in the 2007 Western Australian Cancer

Incidence and Mortality report [62]). Therefore, only melanoma rates for 2005

and 2006 will be discussed in this section.

In 2005, melanoma diagnoses accounted for 11% of cancer diagnoses in Western

Australian males and 9.8% in Western Australian females, which made melanoma

the 4th and 3rd ranked cancer in males and females respectively (after prostate,

lung and colorectal cancer in males, and breast and colorectal cancer in females)

[63]. In 2006, melanoma diagnoses accounted for slightly more of the cancer diag-

noses (11.1% for males and 10.6% for females), with melanoma the 2nd and 3rd

ranked cancer for males and females respectively (after prostate cancer in males,

and breast and colorectal cancer in females) [64].

Of the 10,684 melanomas diagnosed in Australia in 2005, approximately 9% were

in Western Australians [63]. This corresponded to incidence rates of 59.3 cases

per 100,000 males and 37.8 cases per 100,000 females [63]. Incidence rates in both

males and females increased in 2006 to 60.9 cases per 100,000 males and 42.5 cases

per 100,000 females [63].

In Western Australia, the 2005 mortality rates were 11.6 deaths per 100,000 males

and 3.2 deaths per 100,000 females [63]; mortality rates in females were compa-

rable to the 2005 Australian mortality rates, however male mortality rates were

significantly higher in Western Australian males. In 2006, Western Australian

mortality rates in males decreased to 9 deaths per 100,000, while mortality rates

in females increased to 4.3 deaths per 100,000 [64]. There is no obvious reason for

36Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2005 mortality rates to be higher in Western Australian males compared to other

Australian males. This may have been a chance fluctuation in rates in 2005, as

in 2006 male mortality rates were consistent with the national average.

2.2.4 The aetiology of malignant melanoma

Melanoma is a cancer that results from complex interactions between genetic,

host and environmental factors. Epidemiological research in melanoma has led to

some understanding of the host factors which increase an individual’s melanoma

risk; these include pale complexion, red or blonde hair, skin that freckles easily

and tans poorly, many naevi (or moles), and a family history of melanoma [65].

The strongest risk factor for melanoma is a large number of naevi [66], while

the main established environmental risk factor for melanoma is sun exposure, in

particular exposure to ultraviolet radiation (UVA and UVB) [65]. Several genes

which may predispose an individual to melanoma have been identified. Like any

complex disease, however, it is thought that a combination of the genetic, host

and environmental factors contribute to the development of melanoma.

2.2.4.1 Environmental and host melanoma risk factors

2.2.4.1.1 Sun exposure

Sun exposure is the principle established environmental risk factor for melanoma

and this was recognised by the International Agency for Cancer Research in

1992 [67]. Sun exposure history can be measured by either migrant studies or

personal exposure studies; however sun exposure is notoriously difficult to mea-

sure. Personal exposure case-control studies that ask individuals to recall their

2.2. Literature Review 37

sun exposure are often subject to recall and other biases [68,69]. In these studies,

a proxy measure of sun exposure is often used (for example, sunburn history)

and individuals must rely on recollection, sometimes from sun exposures experi-

enced decades earlier. However, these observational studies are often performed,

as RCTs are not considered ethical, and longitudinal cohort studies can be costly

and time-consuming. Migrant studies, also known as ambient studies, have an

advantage over personal exposure studies, as migration from areas of low/high in-

solation to areas of high/low insolation tend to be easy to classify and record [70].

Cumulative sun exposure throughout a whole lifetime has been established as a

risk factor for melanoma; however it has been hypothesised that childhood sun ex-

posure may confer a higher melanoma risk than similar exposure in later life [70].

This has been investigated in both migrant studies [71–82] and personal exposure

studies [83].

In a meta-analysis by Gandini et al. [83] of 33 personal exposure studies, sunburn

was associated with an increased risk of melanoma, with an overall relative risk

of 2.03 (95% CI = 1.73, 2.37). When further stratified by age, childhood sunburn

(less than 15 years of age) conferred a relative risk of 2.24 (95% CI = 1.73, 2.89),

while adulthood sunburn (greater than 19 years of age) conferred a relative risk

of 1.92 (95% CI = 1.55, 2.37). However, the meta-analyses was not adjusted for

skin type or the ability to tan which are likely to affect both incidence of sunburn

and melanoma.

Migrant studies are often used to investigate the relationship between sun expo-

38Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

sure in childhood and melanoma risk. A comprehensive review by Whiteman et

al. [70] of migration studies and melanoma risk investigated place of birth and

age at migration. In 11 of the 12 place of birth studies, individuals born in ar-

eas of low insolation (e.g. United Kingdom) who later migrated to areas of high

insolation (e.g. Australia) had lower risks of developing melanoma than natives

of these high insolation areas [71–81]. Only one study showed the opposite effect,

with migrants to Hawaii having an increased risk of melanoma [82]. The results

from these 11 studies suggest that individuals who migrate to areas of high in-

solation from areas of low insolation have, in general, lower melanoma incidence

than native-born and raised individuals. This indicates that living in areas of low

insolation for the earlier years of life may protect against melanoma compared to

living in high insolation areas for a lifetime. These studies, however, do not take

age at migration into account which may also play a significant role.

Of the five comparable studies examining age at migration from low insolation to

high insolation areas [74,75,79,84,85], three studies found that a decreased risk of

mortality from melanoma was associated with older age at migration [74, 75, 84].

In particular, a study by Khlat et al. [74] comparing mortality rates of migrants

to native-born individuals, found that when migration to Australia occurred be-

tween the ages of 15 and 24, male melanoma mortality dropped to 0.41 (95% CI

= 0.32, 0.54), and migration after the age of 25 was associated with an even lower

risk of mortality, 0.32 (95% CI = 0.27, 0.39).

Similarly, in a study by Darcy et al. [84] melanoma incidence was lower in in-

dividuals who migrated at a later age. In particular, melanoma incidence was

reduced significantly at migration after 19 years of age, with a risk ratio of 0.25

2.2. Literature Review 39

(95% CI = 0.08, 0.83), when compared to migration before 19 years of age. These

studies suggest that melanoma incidence and mortality are both affected by age

at migration. More specifically, migration in childhood increases melanoma risk

to closer to that of a native-born individual and migration before the age of 15

increases melanoma mortality.

There is continuing debate over what type of sun exposure confers the highest

melanoma risk. It has been suggested that intermittent exposure, which can be

categorised as either recreational or vacation exposure [86], confers a higher risk

of melanoma than occupational or chronic sun exposure. In a 2008 meta-analysis

by Gandini et al. [83] of 33 case-control studies, intermittent sun exposure was

associated with melanoma with a relative risk of 1.61 (95% CI = 1.31, 1.91), while

occupational sun exposure was not significantly associated with melanoma with a

relative risk of 0.95 (95% CI = 0.87, 1.04). These results suggest that intermittent

sun exposure may increase the risk of melanoma, while occupational sun exposure

either reduces or has no effect on melanoma risk.

2.2.4.1.2 Skin type and pigmentation

Pigmentation, including skin, hair and eye colour, plays an important role in

melanoma development. In particular, fair-skinned individuals are more likely to

develop melanoma than darker-skinned individuals [87], with melanoma risk in-

versely correlated to degree of pigmentation [88]. In fact, dark-skinned individuals

are unlikely to develop melanoma, with the majority of all melanomas diagnosed

in Caucasians. In a recent report of melanoma risk in an Australian Indigenous

population, the melanoma rate in the Indigenous population was 3.4 cases per

100,000, compared to 48.2 cases per 100,000 in the Australian non-Indigenous

40Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

population [62]. Bliss et al., [89] reviewed seven studies investigating the associa-

tion between melanoma and skin colour. When compared to dark skin, light skin

colour was associated with an approximate two-fold increased risk of melanoma,

mostly unaltered by naevus count, hair colour and freckling.

Melanoma risk is also independently associated with other pigmentation features,

including light hair colour, blue eyes and the presence of freckles [90,91]. A meta-

analysis of eight case-control studies [89] found that when compared to dark brown

hair, light brown, blonde and red hair conferred melanoma relative risks of 1.49

(95% CI = 1.31, 1.70), 1.84 (95% CI = 1.54, 2.21) and 2.38 (95% CI = 1.90,

2.97), respectively. These associations were observed independently of freckling

and skin colour. This meta-analysis also showed that when compared to brown

eyes, blue eyes conferred relative risks of 1.55 (95% CI = 1.35, 1.75), however this

association was not significant after adjustment for freckling [89]. Additionally,

in this meta-analysis, the presence of freckling when compared to sparse or no

freckling, was associated with a melanoma relative risk of 2.25 (95% CI=2.00,

2.54) and this association remained unchanged when adjusted for the presence of

naevi, skin and hair colour.

However, other researchers have cast doubt on the assertion that it is fair skin

that increases an individual’s melanoma risk, but instead suggest the tendency

to burn rather than tan following sun exposure is a better predictor of melanoma

risk [90,92]. A Western Australian study found that when compared to having a

deep tan, individuals with a moderate, light or no tan, had increased melanoma

relative risks of 1.4 (95% CI = 1.1, 1.9), 2.3 (95% CI = 1.6, 3.3) and 3.5 (95% CI

= 1.8, 6.8), respectively [87]. This association appeared to be explained by skin

2.2. Literature Review 41

and eye colour, but not hair colour. Similarly, a case-control study of American

adults found that individuals who had a deep tan had a reduced risk of melanoma

with an odds ratio of 0.47 (95% CI = 0.34, 0.65) for males and 0.43 (95% CI =

0.28, 0.65) for females, when compared to those with no tan [92].

While the relationship between pigmentation and melanoma development is com-

plex, fair-skinned individuals with light hair, blue eyes, freckles and who are un-

able to develop a deep tan, generally appear to be at a greater risk of developing

melanoma.

2.2.4.1.3 Family history

Melanoma clearly aggregates within families. It has been estimated that 10% of

individuals with melanoma have a first- or second-degree relative with melanoma

[93]. Early studies estimated that family history of melanoma caused a two- to

eight-fold relative risk for melanoma development [91,94]. However, a more recent

meta-analysis of 14 studies estimated a family history of melanoma (having one

or more affected first-degree relatives) confers a relative risk of only 1.74 (95% CI

= 1.41, 2.14) [95].

This aggregation of melanoma within families could possibly be due to shared host

factors such as eye colour, hair colour and naevus counts and also shared envi-

ronmental exposures found within families. However, Ford et al. [96] pooled eight

studies and found that, independent of these host factors, first-degree relatives of

individuals with melanoma had an estimated relative risk of 2.24 (95% CI = 1.76,

2.86). It has also been proposed that relatives of individuals with melanoma tend

42Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

to have an earlier age at onset, have thinner melanomas and are more likely to

develop multiple melanomas [94,97].

2.2.4.1.4 Naevi

The presence of multiple naevi is the main risk factor for melanoma. Most epi-

demiological studies that have investigated naevi and melanoma risk have found

that the presence of naevi are an independent risk factor for melanoma, particu-

larly the presence of atypical naevi [98]. The link between melanoma and naevi

was discovered in melanoma families who often have the Atypical Mole Syndrome

(AMS) phenotype. AMS is characterised by the presence of many common and

atypical naevi, typically over 100, which tend to appear in abnormal areas of the

body [99]. The AMS phenotype has been shown to be associated with melanoma

with an odds ratio of 10.4 (95% CI = 5.0, 21.5) [100]. Additionally, the same study

found that individuals under 40 years of age with melanoma were more than twice

as likely to have the AMS phenotype (odds ratio of 16.1; 95% CI = 4.6, 57.5),

compared to individuals aged over 40 (odds ratio of 6.9; 95% CI = 2.9, 16.6). This

highlights the importance of age in relation to the AMS phenotype and melanoma.

Children in Australia with increased numbers of naevi were found to have more

sun exposure than those with fewer naevi [101]. As sun exposure is a known risk

factor for melanoma, it follows that naevi may also be associated with melanoma,

although this relationship may not be causal. A case-control study of Australian

adolescents with melanoma found that the presence of 100 or more naevi was

strongly associated with melanoma risk, with an odds ratio of 46.5 (95% CI =

11.4, 190.8), independent of sun exposure levels [102]. Therefore, it appears the

presence of naevi, particularly in individuals younger than 40, plays an important

2.2. Literature Review 43

role in the development of melanoma.

2.2.4.2 Melanoma genetic risk factors

Historically, genetic studies of families selected on the basis of multiple cases of

melanoma have been used to discover melanoma-susceptibility genetic variants

[103, 104]. These studies have been successful in identifying two high-penetrance

genes for melanoma - Cyclin-dependent kinase inhibitor 2A (CDKN2A) and Cyclin-

dependent kinase 4 (CDK4). More recently, population- and family-based GWAS

have successfully identified several melanoma-susceptibility loci [105–108]. In

addition, candidate gene studies and GWAS of unrelated individuals have suc-

cessfully identified low-penetrance common genes which modify melanoma risk

through known risk factors, including skin pigmentation and naevi count [105,109].

2.2.4.2.1 High penetrance melanoma-susceptibility genes

Two high-penetrance melanoma-susceptibility genes have so far been identified

– CDKN2A and CDK4. In 1994, germline mutations in CDKN2A (located on

chromosome 9p21) were discovered in families with a history of melanoma [103,

110]. CDKN2A encodes for two tumour suppressor proteins, p16(INK4a) and

p14(ARF), which have been shown to have a role in cell cycle regulation and senes-

cence. The p16 and p14 mutations have been found to be strongly associated with

familial melanoma, and it has been estimated that 20-55% of melanoma families

have these mutations [111]. While relatively common in melanoma-prone fami-

lies, CDKN2A mutations are rare in melanoma cases in the general population. A

large international population-based study by Berwick et al. [112] found CDKN2A

mutations in 3% of individuals diagnosed with more than one melanoma, and mu-

44Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

tations in 1.2% of individuals with only one diagnosed melanoma.

Estimation of the likelihood of developing melanoma among CDKN2A muta-

tion carriers varies depending on ascertainment. Bishop et al. [113] estimated

melanoma penetrance in individuals with CDKN2A mutations who had a fam-

ily history of melanoma to be 0.3 by age 50, and 0.67 by age 80. In contrast,

a population-based study by Begg et al. [114] which ignored family history of

melanoma, estimated penetrance to be 0.14 by age 50, and 0.28 by age 80. There-

fore, from these studies it appears that CDKN2A mutation carriers in the general

population have a much lower risk of developing melanoma compared to carriers in

melanoma families. However, these differences may also be due to ascertainment

bias, as individuals with a strong family history of melanoma may have been more

likely to participate in the study by Bishop et al. It is clear, however, that the

combination of the rarity of CDKN2A mutations and their low penetrance in the

general population suggests that CDKN2A does not account for many melanoma

diagnoses. In fact, in a population-based study of melanoma cases in Queensland,

Aitken et al. [115] estimated that only 0.2% of melanomas in Queensland were

due to mutations in CDKN2A.

Germline mutations in CDK4 - an oncogene located on chromosome 12q13 - were

identified in melanoma families by Zuo et al. [104] in 1996. Mutations in CDK4

have only been found in several families to date [104, 116], thus accounting for

only a small proportion of familial melanoma. CDK4 interacts with CDKN2A

and as such, clinical phenotypes including age at onset of melanoma and the

number of melanomas, are similar between melanomas cases in CDKN2A and

CDK4 mutation carrier families [117]. Among two CDK4 families, Goldstein et

2.2. Literature Review 45

al. [118] assessed the penetrance of CDK4 mutation carriers to be 0.63 (95% CI

= 0.42, 0.85).

2.2.4.2.2 Low penetrance melanoma-susceptibility genes

The first low-penetrance melanoma-susceptibility gene identified was the melanocortin

1 receptor (MC1R) gene, located on the 16q24.3 chromosome. Variants in this

gene have been associated with red hair, fair skin and freckling [119–121]. As

these are well-established melanoma risk factors, it is not surprising that variants

in MC1R are also associated with melanoma risk. Valverde et al. [122] found

MC1R mutation carriers had an increased relative risk of melanoma of 3.9 (95%

CI = 1.48, 10.35). In a subsequent study, Palmer et al. [121] found the three

major red hair colour variants each conferred an odds ratio of melanoma of 2.2

(95% CI = 1.6, 3.0), while having two of these variants conferred an odds ratio

of approximately double this at 4.1 (95% CI = 2.1, 7.9). When adjusted for hair

colour and skin type, the association between one variant allele and melanoma

risk became non-significant for individuals with fair skin, however this association

remained significant for individuals with medium skin type (odds ratio of 2.2;

95% CI = 1.2, 4.1) and olive or dark skin type (odds ratio of 10.8; 95% CI = 1.2,

4.1). Thus, it appears that MC1R plays an independent role in melanoma risk,

independent of the association with skin pigmentation.

A recent meta-analysis of MC1R variants and melanoma risk and pigmentation

phenotypes by Raimondi et al. [123] found seven of the nine most commonly

investigated putative candidate risk variants were significantly associated with

melanoma. Additionally, two of these variants were associated with red hair and

fair skin, while three variants were associated with red hair only. More recently,

46Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

in a GWAS undertaken by Bishop et al. [106], the rs258322 SNP near the MC1R

locus was strongly associated with melanoma (combined GWAS and replication P

= 1.97 x 10−6). This SNP was previously identified in a GWAS by Han et al. [124]

as a hair colour and skin pigmentation locus, although Han et al. suggested that

the signal from this SNP was explained by three already identified MC1R variants.

This indicates that the rs258322 SNP may not be an independent risk SNP for

pigmentation.

While MC1R variants have been proven to be low-penetrance melanoma loci

[121,122,125], MC1R variants are also associated with increased CDKN2A pene-

trance and lower age of melanoma diagnosis [126]. In an Australian study of 15

melanoma-families, Box et al. [126] found CDKN2A penetrance to be 0.5, with a

mean age of diagnosis of 58.1. In the presence of MC1R variants, this penetrance

increased to 0.84 with a lower mean age of diagnosis of 37.8. Additionally, Gold-

stein et al. [127] suggested that in individuals with CDKN2A mutations, multiple

MC1R variants may be associated with the development of multiple melanomas.

Therefore, MC1R appears to not only be a melanoma risk gene, but it may also

interact with CDKN2A, increasing melanoma risk and number of melanomas, and

reducing the age at diagnosis.

Epidermal Growth Factor (EGF) located on chromosome 4q25-q27 has been im-

plicated as a melanoma-susceptibility gene. Shahbazi et al. [128] found a signif-

icant association between the rs4444903 SNP in EGF and melanoma risk, with

homozygous carriers of the rare allele having an increased odds of melanoma of

4.9 (95% CI = 2.3, 10.2). However, subsequent studies failed to detect any such

associations [129–132]. Some of these studies did detect significant associations

2.2. Literature Review 47

between the EGF variant and melanoma progression and this is discussed further

in Section 2.2.6 (please see page 50).

The recent advent of GWAS technology has enabled melanoma researchers to take

an unbiased approach in identifying melanoma-risk genes. To date, a number of

GWAS for melanoma risk, naevi count and pigmentation have been conducted.

A GWAS of naevi count by Falchi et al. [105] identified SNPs on the Methylth-

ioadenosine Phosphorylase (MTAP) gene (adjacent to CDKN2A on 9p21) that

were associated with naevi count. The strongest signal was from rs4636294 which

explained 1.5% of naevi count variance in the GWAS and 3.0% of naevi count

variance in the replication study (combined GWAS and replication P = 3.4 x

10−15).

In the same study, this SNP was also significantly associated with melanoma risk

in two independent samples, with each risk allele conferring a combined odds ratio

of 1.21 (95% CI = 1.14, 1.28; combined P = 3.7 x 10−8).

The MTAP rs4636294 SNP was also independently identified by Bishop et al. [106]

in a GWAS for melanoma risk, with each risk allele conferring a similar combined

odds ratio of 1.16 (95% CI = 1.09, 1.23; combined P = 1.97 x 10−6). Several

other SNPS in the same region were associated with both melanoma risk and

naevi count, including rs7023329, rs10757257 and rs2218220.

Falchi et al. [105] also identified the Phospholipase A2, Group VI (cytosolic,

calcium-independent) (PLA2G6) gene on chromosome 22q13.1 as a gene asso-

48Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

ciated with both naevi count and melanoma risk. Four SNPs in this region

(rs2284063, rs6001027, rs132985, rs73822) were significantly associated with naevi

count and melanoma risk. The strongest signal from these SNPs for naevi count

was rs2284063. Together, the two top naevi count SNPs – rs2284063 and rs4636294

– accounted for 9% and 20% of naevi count variance in the GWAS and replication

sample, respectively. The strongest signal for melanoma risk was from rs132985,

with each risk allele conferring a combined odds ratio of 1.23 (95% CI = 1.15,

1.30). Two of the four SNPs identified – rs2284063 and rs6001027 – were also

identified as melanoma-susceptibility loci in a GWAS by Bishop et al. [106].

In a GWAS by Brown et al. [107] using pooled DNA, two SNPs on chromosome

20q11.22 – rs910873 and rs1885120 – were identified as melanoma-susceptibility

loci, and these were replicated in two independent samples (combined P < 1 x

10−15). These SNPS were highly correlated and the risk allele conferred combined

odds ratios of 1.75 (95% CI = 1.53, 2.01) and 1.78 (95% CI = 1.54, 2.04) for

rs910873 and rs1885120, respectively.

Several studies have identified SNPs in the Tyrosinase (oculocutaneous albinism

IA) (TYR) gene located on chromosome 11q14-q21 that are associated with pig-

mentation and melanoma risk [106, 108, 109]. In a GWAS by Sulem et al. [109],

the risk allele of the rs1393350 SNP conferred an increased odds ratio for blue

eyes compared to green eyes of 1.52 (95% CI = 1.28, 1.81). This risk allele was

also associated with an increased odds of melanoma (odds ratio of 1.29; 95% CI

= 1.21, 1.38) in a GWAS by Bishop et al. [106]. The rs1393350 SNP is highly

correlated with another TYR SNP, rs1126809, and this was found to be associ-

ated with increased melanoma odds (odds ratio of 1.21; 95% CI = 1.13, 1.30) in

2.2. Literature Review 49

a candidate-gene study by Gudbjartsson et al. [108].

GWAS have therefore been successful in identifying melanoma risk, pigmentation

and naevi count loci. However, these SNPs may only be statistical markers for

causal variants, and fine-mapping of these regions is required to identify the causal

variant(s).

2.2.5 The diagnosis and treatment of melanoma

Melanoma is often identified as a lesion on the skin which has either changed (in

shape, outline, size or colour) or looks different to other naevi. The Australian

National Health and Medical Research Council (NHMRC) clinical practice guide-

lines state that ‘the diagnosis of melanoma should be considered in all cases of

pigmented or changing lesions on the skin’ [133].

A suspected melanoma is examined by a clinician, and if the lesion is deemed

suspicious, then the lesion may be fully or partially excised. This excised tissue

is then examined by a histopathologist to confirm diagnosis of melanoma. If the

lesion is melanoma and an adequate margin of normal tissue has been removed,

no further action is required.

Excision is often the only required treatment for thin melanoma. Metastatic

melanoma may also be treated by radiation or chemotherapy, however these treat-

ments are generally not a cure, and may often only relieve symptoms and extend

survival time [134].

50Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.2.6 Breslow thickness

Breslow thickness is a measure of the microscopic primary tumour depth of the

melanoma, and is measured in millimetres. Breslow thickness is measured from the

granular layer of the epidermis to the point of deepest invasion by the tumour cells.

Its value as a prognostic factor for melanoma was first reported by Dr A. Breslow

in 1970 [135], although as early as 1953, Allen and Spitz [136] reported that

patients with superficial melanomas had greater survival rates than those with

more invasive melanoma. Today, Breslow thickness is considered by clinical bodies

such as the Australian NHMRC and the American Joint Committee on Cancer to

be the most important predictor of survival in localised melanoma [133,137].

2.2.6.1 Breslow thickness and prognosis

Numerous studies in different countries have determined Breslow thickness to be

the best known predictor for prognosis of localised melanoma [138–146]. In gen-

eral, thinner melanomas have the best prognosis, whereas thicker melanomas are

more likely to lead to metastatic disease and concomitant poorer prognosis.

In a large international study of approximately 23,000 individuals with melanoma

by Balch et al. [137], Breslow thickness was significantly associated with sur-

vival probabilities (P < 0.001). Ten-year survival was 92% in individuals with

melanomas less than 1 mm thick, 80% in individuals with melanoma 1.01 to 2

mm thick, 63% in individuals with melanoma 2.01 to 4 mm thick, and 50% in

individuals with melanoma more than 4 mm thick. However, amongst the 17,841

individuals with melanoma less than 1 mm thick, survival rates ranged from 85%

to 99%. Further analysis indicated that in addition to Breslow thickness, mitotic

2.2. Literature Review 51

rate and presence of ulceration were important prognostic factors for these thin

melanomas.

A Western Australian study of individuals with melanoma by Heenan et al. [147]

found that five-year melanoma survival rates were significantly associated with

Breslow thickness. Survival after five years decreased from 98% for the thinnest

melanomas, to only 58% for melanomas over 4 mm thick (Table 2.2.6.1.1).

Thickness (mm) 5-year survival rates, % (95% CI)

0.00 - 0.75 98 (94 - 99)

0.76 - 1.49 96 (89 - 99)

1.50 - 2.99 86 (75 - 92)

3.00 - 3.99 79 (54 - 92)

> 4.00 58 (36 - 75)

Table 2.2.6.1.1: Estimated 5-year survival rates by Breslow thickness (mm) in West-

ern Australia [147]

A more recent study by Downing et al. [148] of approximately 30,000 individuals

with melanoma from New South Wales also found a significant association between

Breslow thickness and melanoma survival. While Breslow thickness categories are

not directly comparable between this study and the Western Australian study due

to the different breakpoints used, a similar trend can be seen, with the thinnest

melanomas having a survival rate of 99%, decreasing to 54% for melanomas over

4 mm thick (Table 2.2.6.1.2).

The consistency of associations between Breslow thickness and survival rates in-

52Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Thickness (mm) 5-year survival rates, % (95% CI)

0.00 - 1.00 99 (98-99)

1.01 - 2.00 89 (88-91)

2.01 - 4.00 73 (70-75)

> 4.00 54 (51-58)

Table 2.2.6.1.2: Estimated 5-year survival rates by Breslow thickness (mm) in New

South Wales [148]

dicate that the identification of the risk factors associated with Breslow thickness

may also lead to a greater understanding of prognosis, which in turn may result

in improved prognosis and survival of melanoma patients.

2.2.6.1.1 Breslow thickness and melanoma risk factors

Breslow thickness has been shown to be associated with host factors, some of

which are also associated with melanoma risk, such as age at diagnosis, sex and

family history of melanoma.

Breslow thickness is associated with increasing age [149–153]. A review of melanoma

cases from the United Kingdom spanning 14 years by Osborne et al. [153] found

thicker melanomas were strongly associated with increasing age (P < 0.001), with

a greater proportion of thicker melanomas in individuals older than 60 years of

age. This may be due to the different behaviour of melanomas in older individuals,

or more likely, older individuals having less frequent skin examinations, leading

to delays in tumour excision.

An association between Breslow thickness and male sex has been reported in nu-

2.2. Literature Review 53

merous studies [150, 152, 154, 155]. In the review by Osborne et al. [153], males

had a greater proportion of thicker melanomas when compared to females (P =

0.05), with thick melanomas accounting for 22.5% of male melanomas, compared

to 14.2% in females. In an Australian study by Heenan and Holman [150], mean

Breslow thickness for males was 2.27 mm compared to a mean Breslow thickness

of 1.85 mm in females. The differences in Breslow thickness between males and

females may be due to some underlying biological process resulting in melanomas

growing faster in males. However, there has been no published evidence to sup-

port this. Instead, it may be that males are less likely to examine their skin or

have regular skin examinations, and therefore have their melanomas diagnosed or

excised later than women, after the tumour has had more time to grow.

Some studies have reported associations between Breslow thickness and family

history, with thinner melanomas diagnosed in individuals with a family history of

melanoma [156, 157]. However, other studies have not found any evidence of this

association [158,159]. Family history was only verified in the two studies that did

not find any associations, and Fisher et al. suggested that the observed association

may be due to confounding factors. For example, individuals who have a greater

concern over skin examinations may falsely believe they have a family history of

melanoma, and therefore may be more likely to present for skin examinations and

have thinner tumours excised.

To the author’s knowledge, no studies to date have estimated heritability in Bres-

low thickness, that is, the proportion of variation in Breslow thickness that can

be explained by genetic factors. In fact, there have only been a limited number

of studies which have investigated the genetics of Breslow thickness. The few

54Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

studies that have investigated the genetics of Breslow thickness have concentrated

on the EGF gene which, in addition to its role as a possible melanoma risk gene,

has also been associated with Breslow thickness [128]. In a study by Shahbazi et

al., individuals who were common homozygotes at the rs4444903 SNP were more

likely to have melanomas greater than 3.5 mm (odds ratio of 3.7; 95% CI = 1.0,

13.2). However, the odds ratio was reduced to 2.3 when pigmentary factors were

also included in the logistic regression model. Two subsequent studies [129, 130]

replicated this finding by Shahbazi et al., however several other studies reported

conflicting results [131,132,160].

Breslow thickness has been associated with several melanoma risk factors, includ-

ing male sex, increasing age at diagnosis and melanoma-risk loci. Therefore, it

is plausible that other melanoma risk factors, in particular other melanoma-risk

loci, are also associated with Breslow thickness and melanoma prognosis. Further

investigations into these associations may enable identification of individuals at

increased risk of poorer melanoma prognosis.

2.2.7 Literature Review Summary

This section has described the incidence and mortality of melanoma, and the

known environmental, host and genetic risk factors for both melanoma risk and

melanoma prognosis (i.e. Breslow thickness). From the literature presented,

it is evident that there has been substantial research into the development of

melanoma. In particular, numerous studies over the past 20 years have investi-

gated the role of risk factors for melanoma development, such as sun exposure,

skin pigmentation, family history and naevi. This has led to a better understand-

ing of the risk factors associated with melanoma development.

2.2. Literature Review 55

In more recent years, the role of genetic variants in melanoma development have

been investigated; beginning with the early candidate gene studies in genes such

as CDKN2A and CDK4, to the current GWAS of melanoma risk. While these

genetic studies have identified new genetic loci for melanoma risk, it is probable

that there are still many unidentified risk loci. In addition, it is possible that

these unidentified genetic loci may interact with known melanoma risk factors to

increase melanoma risk.

Associations between environmental and host factors with Breslow thickness have

not been studied as widely. However, as Breslow thickness is considered the best

proxy for melanoma prognosis, it is imperative that further research is undertaken

to identify environmental and host risk factors associated with Breslow thickness.

To date, there have been no GWAS for Breslow thickness and only one gene (EGF)

has been identified as a possible risk variant for melanoma prognosis. It is due to

this gap in melanoma research, that this thesis investigates associations between

Breslow thickness and genetic variants.

From Section 2.2.6.1.1, it is evident that melanoma development and prognosis

have risk factors in common (i.e. age, sex, and family history of melanoma).

Therefore, it is proposed in this thesis, that melanoma development and prognosis

may also be affected by the same genetic variants. Facilitated by the development

of the WAMHS, this thesis investigates associations between previously identified

melanoma-risk genetic variants and melanoma prognosis, with the aim of further

understanding the genetics of melanoma prognosis.

56Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.3 The Western Australian Melanoma Health Study

The aim of this section is to describe the establishment and development of the

WAMHS cohort and to summarise the characteristics of the WAMHS participants.

In addition, this section describes the phenotypic and genotypic variables of the

sub-sample of the WAMHS which is used for analyses in later sections. The gen-

eralisability of this sub-sample to the entire population diagnosed with melanoma

in Western Australia over the WAMHS data collection period is also investigated.

2.3.1 Author’s contribution

The author assisted in establishing the WAMHS from its beginning in July 2006

until the end of data collection in March 2010. This included assisting in de-

veloping the questionnaires for both the pilot and full-launch studies, and other

materials and protocol. The author also attended the Western Australian Cancer

Registry (WACR) on a weekly basis to contact doctors and recruit subjects. After

data were collected, the author had primary responsibility for cleaning the data

and for making the questionnaire data available for use.

2.3.2 Introduction

The WAMHS database is a population-based database and linked biospecimen

resource of melanoma cases. The aim of the study was to collect a range of phe-

notypic, clinical and biospecimen data on a representative sample of all cases of

invasive melanoma diagnosed in Western Australia over the period 2006 to 2009,

and to establish a research resource that would enable a range of research projects.

Initial envisioned research directions included investigations into the genetic and

environmental factors associated with melanoma and their interactions in the sus-

2.3. The Western Australian Melanoma Health Study 57

ceptibility, progression and prognosis of melanoma.

In this thesis, the WAMHS database was used to investigate the role of known

environmental and genetic risk factors in melanoma susceptibility and a measure

of prognosis, Breslow thickness.

2.3.3 WAMHS population

The WAMHS database is a population-based resource established and maintained

by the Centre for Genetic Epidemiology and Biostatistics (CGEB) and funded by

the Scott Kirkbride Melanoma Research Centre. The database consists of all con-

senting adult incident cases of invasive cutaneous melanoma diagnosed in Western

Australia from January 2006 until September 2009 (the ‘diagnosis time frame’).

Subjects were identified through the WACR, which is notified of all incident cases

of cancer in the Western Australian population. Information collected by the

WACR includes the name, address, and date of birth of individuals diagnosed

with cancer. In addition, pathology reports are collected which include the loca-

tion and type of each cancer, and the name and address of the referring doctor.

Melanoma diagnoses notified to the WACR include both invasive and in situ

melanomas.

Melanoma diagnosis was defined as having an invasive melanoma of the skin diag-

nosed which was recorded on the WACR under an ICD-9 code ranging from C440

to C449 (Table 2.3.3.1). All eligible incident cases of melanoma diagnosed within

the diagnosis time frame were invited to participate in the WAMHS. Cases were

58Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

ICD-9 Code Description

C440 Skin of the lip

C441 Eyelid

C442 External ear

C443 Skin of other and unspecified parts of the face

C444 Skin of scalp and neck

C445 Skin of trunk

C446 Skin of upper limb and shoulder

C447 Skin of lower limb and hip

C448 Overlapping lesion of skin

C449 Skin, not otherwise specified

Table 2.3.3.1: Eligible ICD-9 codes for WAMHS eligibility

contacted regarding the melanoma which was diagnosed during the diagnosis time

frame. This may or may not have been the first melanoma diagnosed during their

life time.

Eligibility was defined as cases of invasive melanoma (excluding in situ melanomas)

notified to the WACR from 1st January 2006 to 30th September 2009. Eligible

subjects were all aged between 18 and 80 at the time their melanoma was diag-

nosed, and had to be residents of Western Australia at the time their melanoma

was diagnosed and during data collection.

Of the 3,420 individuals diagnosed with invasive melanoma within the diagnosis

time frame, 92.16% (3,152) were invited to participate in the WAMHS and 52.13%

2.3. The Western Australian Melanoma Health Study 59

(1,643) of these consented to some or all components of the study. Of these ini-

tially consenting subjects, 5.23% (86) of these subjects later withdrew from the

study.

The study protocol and all subsequent amendments were approved by the Uni-

versity of Western Australia Human Ethics Research Committee (UWA HREC)

(reference RA14/1/5552) and the Health Department of Western Australia’s Con-

fidentiality of Health Information Committee (reference 200633). All analysis con-

ducted in this thesis also had approval from UWA HREC (reference RA4/1/2308).

2.3.3.1 Recruitment

Subjects were recruited from the WACR primarily by the author and the WAMHS

study coordinator, Ms Sarah Ward. The recruitment process through the WACR

is outlined in Figure 2.3.3.1.1.

Before any doctor or subject contact, the most recent death register was checked

so that the doctor or family members of subjects who had died were not con-

tacted inadvertently. The doctor who requested the subject’s pathology report

was contacted initially. A letter from Dr Timothy Threlfall, Medical Director of

the WACR, was sent to each subject’s doctor and they were asked whether there

was any reason not to contact their patient regarding the study (Doctor letter in

Appendix A, please see page 271). A doctor may have deemed the subject un-

suitable for contact about the study for a number of reasons, such as, ill-health,

poor knowledge of English or anxiety regarding their diagnosis.

60Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Subject diagnosed with melanoma on WACR database

Update date of death to confirm subject’s status

No further action taken

Write to treating clinician (listed on pathology report) regarding

suitability to contact subject

No response from clinician within 5 weeks

Response from clinician within 5 weeks

Update date of death to confirm subject’s status

Advice from clinician that subject is suitable to contact

regarding the study

Clinician recommends subject is not suitable for contact

regarding the study

No further action taken

No further action taken

Write a letter to invite subject to participate in the

study

DEAD

DEAD ALIVE

ALIVE

No response from subject within 14 days

Update date of death to confirm subject’s status

No further action taken

Send reminder letter to subject

DEAD ALIVE

Response from subject within 14 days

No response from subject within 5 weeks

No further action taken

Subject response

Refusal to participate in study

No further action taken

Consent to participate in study

Subject details taken to CGEB

Figure 2.3.3.1.1: Recruitment process through the WACR, from subject appearing

on the WAMHS database, to consenting subject details taken to CGEB

2.3. The Western Australian Melanoma Health Study 61

If the doctor was being contacted for the first time about the study, they were also

sent a WAMHS information sheet (Appendix B, please see page 273), a WAMHS

information poster to display in their clinics and a Western Australian Melanoma

Advisory Service information brochure.

Initially, reminder letters were sent to doctors if they had not replied to the WACR

indicating patient suitability within a two-week period. Following this reminder,

if the doctor had not replied within five weeks of initial contact, it was assumed

that the subject was suitable to be contacted and they were subsequently invited

to participate in the study. In early 2009, the protocol was amended so that doc-

tors would not receive reminder letters. Instead, if the doctor did not reply to the

WACR within four weeks of the initial contact, it was assumed the subject was

suitable to be contacted.

If the requesting doctor deemed their patient not suitable to be contacted about

participation in the study, the reason given was recorded and their patient was

not contacted.

If the doctor deemed their patient suitable to be contacted regarding the study,

their patient was sent a letter (Appendix C, please see page 275), an information

brochure (Appendix D, please see page 277) and a consent form (Appendix E,

please see page 281) inviting them to participate in the study. The subject also

received a copy of the consent form to keep for their own records, information

about the WACR and an information sheet – DNA Storage and Testing – which

62Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

contained details on the storage and testing of DNA in medical research, provided

to the WAMHS by UWA HREC. Subjects were asked to complete and return

the consent form in the reply-paid envelope, indicating whether they wished to

participate in the study.

Reminder letters were also sent to subjects if they had not replied within a two-

week period. If the subject did not respond to their reminder letter, a telephone

call was placed. If the subject still did not return their consent form, no further

follow up occurred. During the latter stages of the study, the telephone reminder

calls ceased as they were time-consuming and did not significantly increase the

rate of return of consent forms.

Once subjects consented to be part of the study, their contact details were taken

to CGEB where they were sent the study materials which are described further in

Sections 2.3.3.2, 2.3.3.3 and 2.3.3.4 (please see pages 62, 64 and 66, respectively).

2.3.3.2 WAMHS pilot study

The WAMHS pilot study began in October 2006 after the development of subject

and doctor information brochures, consent forms and questionnaires. Consenting

subjects were sent a self-completion questionnaire and a mole-counting chart to

assist in their recognition of moles for several questions in the questionnaire.

The questionnaire was designed and compiled by the author and Ms Sarah Ward,

in collaboration with Professor Lyle Palmer, Dr Judith Cole, Dr Liz Milne, Profes-

sor Lin Fritschi and Professor Michael Millward. It was harmonised with the Aus-

2.3. The Western Australian Melanoma Health Study 63

tralian Melanoma Family Health Study, the Genes, Environment and Melanoma

Study and the Queensland Institute for Medical Research’s Melanoma Study ques-

tionnaires to facilitate possible future collaboration. The questionnaires were

printed and scanned by an independent research facilities company.

Subjects were also sent blood collection forms for PathWest blood services. Sub-

jects were asked to donate DNA and serum, with the additional option to donate

RNA, via blood samples. It was explained to subjects that if they wished to

donate RNA, they would need to attend the Queen Elizabeth II (QEII) Medical

Centre, Perth, as it was not practicable to send the PAXGene RNA tubes to all

PathWest Centres in Western Australia. If the participant was unable to travel to

Perth, or if they did not wish to give a sample of RNA, they were able to attend

any of the approximately 50 PathWest blood collection centres located through-

out Western Australia. Biospecimen collection and storage is described in more

detail in Section 2.3.3.5 (please see page 67).

Between October 2006 and January 2007, 235 subjects were contacted about par-

ticipating in the WAMHS (after the doctor on their pathology report confirmed

suitability for contact). Of these subjects, 137 (58.30%) consented and were sent

questionnaires and blood collection forms. However, 16 subjects withdrew or died

prior to completion of any subject components. By March 2007, 96 questionnaires

and 89 blood samples had been received. Of these, 87 subjects had returned both

the questionnaire and donated a blood sample. Based on the 121 subjects who

remained enrolled in the study, 67.97% of subjects completed both components

of the study.

64Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

The low completion rate of the questionnaire and blood samples highlighted that

the questionnaire was too time-consuming and complex which made subjects re-

luctant to complete and return it. As a result of this low response rate, the pro-

tocol was altered for the full-scale implementation of the study. Major changes

included shortening the questionnaire and changing part of the questionnaire from

self-completion to a telephone interview.

2.3.3.3 WAMHS full-scale implementation I

The WAMHS full-scale implementation I ran from October 2007 to December

2008. During this phase of the study, subjects were asked to complete two parts

of the questionnaire - Part 1, which was a written questionnaire taking approx-

imately 15 minutes to complete, and Part 2, which was a telephone interview

taking approximately 40 minutes to complete.

As subjects found the pilot study questionnaire difficult to complete, the ques-

tionnaires for the full-scale implementation were shortened and simplified. Part 1

of the questionnaire contained questions which were deemed easy to answer, such

as questions relating to hair and skin pigmentation. It also included questions

that required the subject to view a diagram, such as the degree of freckling and

naevi questions and also questions that required measurement, such as height and

weight.

Part 2 of the questionnaire contained more in-depth questions which required the

subject to recall details about their family history, past sun exposure, residence

history and ethnicity. The reasoning behind this was that with a trained inter-

2.3. The Western Australian Melanoma Health Study 65

viewer asking these questions, subjects could be assisted in a neutral manner to

remember details. It also ensured the whole questionnaire was completed and

that the data collected was of a higher quality compared to the self-completion

questionnaire.

Part 1 of the questionnaire was designed in the software package Teleforms by the

author. Returned questionnaires were scanned and verified by members of the

WAMHS team, including the author.

Part 2 of the questionnaire was designed in the software package LiquidOffice by

the author. Sections of the questionnaire were automated based on the age and

sex of the subjects so that only relevant questions would appear in the online

form for the interviewer to ask. Upon submission of the online form, data was

sent to an Oracle database and then extracted periodically. Part 1 and Part 2

questionnaire data was then cleaned and uploaded into the Western Australian

Genetic Epidemiology Resource (WAGER) by the author.

Upon consenting, subjects were sent blood collection forms, Part 1 of the ques-

tionnaire and a revised mole-counting chart (Appendix F, please see page 285).

Part 1 was completed by the subject at home and was returned to the CGEB in

a reply-paid envelope with the subject indicating their preferred day and time for

Part 2 of the questionnaire. Once the completed Part 1 questionnaire was received

by members of the WAMHS team, the questionnaire was reviewed and trained

interviewers conducted Part 2 of the questionnaire in a telephone interview.

66Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Blood collection locations altered slightly for the full-scale implementation of the

study compared to the pilot study. In the full-scale implementation I, subjects

were also able to consent to a scar examination (the Natural History of Scarring

project) which took place at Telstra Burns Unit at Royal Perth Hospital (RPH)

under the direction of Professor Fiona Wood. If a participant consented to the

scar examination, they were then able to donate RNA at RPH, as well as the

QEII Medical Centre. Subjects who did not wish to donate RNA continued to

attend any PathWest centre throughout Western Australia. The author assisted

with establishing the Natural History of Scarring project, however this project did

not form part of this thesis, and will not be discussed further.

2.3.3.4 WAMHS full-scale implementation II

In September 2008, a review of recruitment and completion data highlighted some

issues related to data collection, in particular the role of the scar examination and

donation of RNA. Availability of resourcing at the scar examination clinics meant

that only a limited number of scar examinations could be conducted each week.

Subjects who consented to the scar examination gave their blood sample at the

time of examination, resulting in a limited number of blood samples donated

weekly. Therefore, the study protocol was altered and subjects were no longer

invited to participate in the scar examinations for a period of time. The removal

of the scar examination meant that subjects no longer had the option of donating

RNA at RPH. Instead, RNA donation was only possible at the QEII Medical

Centre and DNA and serum donation was possible at any PathWest, as per the

aforementioned protocol.

The questionnaire delivery methods were also altered at this time. The questions

2.3. The Western Australian Melanoma Health Study 67

asked remained the same, however Parts 1 and 2 of the questionnaire were merged

and were all delivered via telephone interview (full questionnaire in Appendix G,

please see page 287). The use of the telephone interview for the entire question-

naire meant that subjects were not required to return Part 1 of the questionnaire

before the rest of the questionnaire could be completed. It was also done to

save the time spent scanning, verifying and uploading the data using Teleforms,

and to reduce the time spent following up on questionnaire completion. Instead,

data collected from the telephone interview could be saved directly into an Ora-

cle database via the online form, cleaned by the author, and then uploaded into

WAGER. Subjects were sent a questionnaire brochure before their interview con-

taining the diagrams required for answering questions (Appendix H, please see

page 329). This also contained instructions regarding questions subjects would

need to complete before their interview, such as height, weight and naevi counts.

2.3.3.5 Biospecimens

Subjects in the study attended one of either QE-II Medical Centre, RPH or any

PathWest centre to donate DNA and serum. RNA donation was available at the

QEII Medical Centre and for a limited time, at RPH. In the pilot study, a blood

sample for DNA was collected in one 9mL Vacutainer EDTA tube. This was

changed in the full-launch study, when samples for DNA were collected in two

9mL Vacutainer EDTA tubes. These tubes were stored at 4C and then trans-

ported to the Western Australian DNA Bank (WADB) within 24 to 48 hours.

At the WADB, the blood was centrifuged and the buffy coat removed, and then

dual-site banked at -40C at both RPH and the QE-II Medical Centre.

Blood samples for serum were collected in either one 10mL or two 5mL Vacutainer

68Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

SST serum tubes. After blood collection, the serum tubes sat at room tempera-

ture for 30 minutes prior to being centrifuged within 2 hours of collection. These

tubes were stored at 4C and then transported to the WADB within 24 to 48

hours. At the WADB, the collected sample was processed and the serum was

dual-site banked at -40C at both RPH and QE-II Medical Centre.

Blood samples for RNA were collected in two 2.5mL PAX Gene Blood RNA

tubes. After blood collection, the RNA tubes were inverted 8–10 times and stored

upright at room temperature. The RNA tubes were transported to the WADB

within 24–48 hours. At the WADB, the RNA tube was centrifuged and the PAX

Gene Blood RNA kit was used to extract RNA. The RNA was dual-site banked

at -80C at both RPH and the QE-II Medical Centre.

2.3.3.6 Study variables

As only the full-scale implementation data was used in this thesis, the following

descriptions of questionnaire, demographic and clinical data refers to only the

data collected as part of the full-scale implementation.

2.3.3.6.1 Phenotypic variables

The questionnaire-based variables formed the majority of the phenotypic variables

collected as part of the WAMHS. These included pigmentation variables such as

skin colour, hair colour, eye colour, and the skin’s ability to tan and propensity to

burn. Questions related to sun exposure were also collected, including freckling

during childhood and adulthood, and the number of episodes of sunburn. Some

of the phenotypic variables collected are described in further detail below.

2.3. The Western Australian Melanoma Health Study 69

Self-reported ethnicity was collected, as well as the ethnicity of the subject’s par-

ents and grandparents. More than one answer was allowed to be given if a subject

considered themselves to have a mixed ethnic background.

Country of birth was also collected, and if not born in Australia, the age of the

subject when they moved to Australia was recorded.

Height in centimetres and weight in kilograms were collected via the self-report

questionnaire. These were used to calculate Body Mass Index (BMI).

Three measures of naevi were collected. Subjects were shown a diagram of four

degrees of naevi ranging from ‘None’ to ‘Very Many’ and were asked to select the

diagram which most closely resembled themselves (Figure 2.3.3.6.1.1).

Two naevi counts were also performed which required the subject to count the

naevi in two sections of their back - from the base of their neck to the top of their

armpit and from the top of their armpit to the top of their underpants (Figure

2.3.3.6.1.2).

Degree of freckling during childhood and adulthood was ascertained from a di-

agram of six degrees of freckling, ranging from ‘None’ to ‘Very Many’ (Figure

2.3.3.6.1.3).

70Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.3.3.6.1.1: Degrees of naevi from the WAMHS questionnaire

Figure 2.3.3.6.1.2: Number of naevi in two sections of the back from the WAMHS

questionnaire

2.3. The Western Australian Melanoma Health Study 71

Figure 2.3.3.6.1.3: Degree of freckling from the WAMHS questionnaire

Subjects were asked about their eye colour, skin colour and hair colour at age 18.

Skin reaction to the sun was also collected from subjects. These questions were

used as an indicator of how uncovered skin reacted to sun exposure in terms of

sunburn and tanning and were based on the Fitzpatrick scale [161].

Smoking history was collected, with ‘smoking’ defined as having smoked at least

one cigarette per day for a period of six months or more. Subjects were asked if

they had ever smoked, how long they smoked for, and how many cigarettes per

day they smoked.

A subject’s history of melanoma was ascertained through both the questionnaire

and WACR records. Subjects who identified themselves during the questionnaire

as having had more than one melanoma were classified as having a history of

72Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

melanoma. Similarly, subjects who had more than one primary melanoma diag-

nosis notified to the WACR were classified as having a history of melanoma. In

both instances, the number of melanomas diagnosed was also recorded. A sub-

ject’s history of other cancers, including other skin cancers, was also recorded.

Sunbed use was collected and was defined as having used a sunbed on more than

one occasion. The overall number of sunbed sessions and the locations of these

sessions were recorded.

A subject’s family history of melanoma was also collected. A family history of

melanoma was defined as having more than one first-degree relative (i.e. mother,

father, brother, sister) diagnosed with melanoma, as reported by the subject. In

addition, a family history of other skin cancers, and other cancers was also col-

lected.

Subjects were asked to recall the number of times they had been sunburnt during

three periods of their lives – from 5-12 years, from 13-19 years and from 20 years

onwards. Sunburn was separated into two questions – sunburn to cause pain for

more than two days and sunburn severe enough to cause blisters. These variables

had four levels – 0, 1-5, 6-10 and more than 10 times.

Time spent outdoors in warmer and cooler months during various age periods was

also collected. This included the number of hours spent outside and not under

any shade between 9am and 5pm.

2.3. The Western Australian Melanoma Health Study 73

2.3.3.6.2 Demographic and clinical variables

The majority of demographic and clinical variables were collected from the WACR.

The WACR collects extensive details regarding an individual’s melanoma diagno-

sis from pathology reports, including Breslow thickness, Clark’s level, sex, WACR

history of melanoma and date of melanoma diagnosis.

Breslow thickness was collected for the excised melanoma and was measured in

millimetres. In addition, another measure of the thickness of the excised melanoma

was collected – Clark’s level. The possible levels were II, III, IV and V, indicating

the layers of skin the melanoma had invaded. Clark’s level I refers to an in situ

melanoma and as subjects were recruited into the study based on diagnosis of an

invasive melanoma, no subjects were recruited into the study who had only been

diagnosed with a Clark’s level I melanoma.

Sex was collected from the WACR. Age at diagnosis was also collected and was

calculated as the date of diagnosis minus date of birth. Date of diagnosis is the

date at which the melanoma was fully or partially excised by a clinician.

Age at data collection was calculated as the date of the telephone questionnaire

minus date of birth. WACR history of invasive and in situ melanoma was also

collected, where WACR history of melanoma was defined as the subject having

had more than one primary melanoma notified to the WACR.

74Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.3.3.7 Sample size and response rates

At the conclusion of the data collection period, the WAMHS consisted of 1,643

consenting cases of melanoma. This consisted of 144 subjects who enrolled in

the pilot study, and 1,499 subjects who enrolled in the full-scale implementation

of the study. 73 of the 1,499 subjects withdrew or died prior to participation in

the study. Of the 1,426 subjects who remained enrolled in the full-scale study,

91.87% (n=1,310) completed Part 1 of the questionnaire, 91.87% (n=1,310) com-

pleted Part 2 of the questionnaire and 84.43% (n=1,204) gave a blood sample for

DNA and serum. 81.14% (n=1,157) of the subjects completed all parts of the

study.

Figure 2.3.3.7.1 depicts the final figures and rates for the study, including both

the pilot study and the full-scale implementations.

2.3.3.7.1 Consent, non-consent and non-response rates

Statistical analysis was performed to compare the subject characteristics between

consenting subjects and non-consenting subjects, and also consenting subjects and

non-responders. Non-consenting subjects were defined as those who returned a

consent form indicating they did not wish to participate in the study, and non-

responders were defined as subjects who did not return a consent form indicating

if they wished to participate.

In total, of the 3,420 individuals diagnosed with melanoma within the diagnosis

timeframe, 1,643 (48%) subjects consented to participate in the study, 644 (19%)

2.3. The Western Australian Melanoma Health Study 75

Figure 2.3.3.7.1: Consent and participation figures and rates for the WAMHS

did not consent to participate, and 713 (21%) subjects failed to return their con-

sent form indicating consent or non-consent.

Fisher’s exact test [162] was used to identify any differences in distributions be-

tween these groups for sex, Breslow thickness, age at diagnosis, Clark’s level and

melanoma site, and these results are presented in Table 2.3.3.7.1.1. Fisher’s exact

test was used in place of Pearson’s Chi-Square test as Fisher’s test allows for zero

or small cell counts.

76Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Variable Consent Non-consent p-value No reply p-value

(n=1,643) (n=644) (n=713)

% [n] % [n] % [n]

Sex

Male 58.37 [959] 59.78 [385] 64.09 [457]

Female 41.63 [684] 40.22 [259] 0.57 35.91 [256] 0.0091

Breslow thickness, mm

0–1.00 71.94 [1182] 71.43 [460] 72.37 [516]

1.01–2.00 14.49 [238] 12.42 [80] 14.31 [102]

2.01–4.00 7.12 [117] 9.47 [61] 0.20 6.59 [47] 0.87

>4.00 3.77 [62] 3.26 [21] 4.49 [32]

Unknown 2.68 [44] 3.42 [22] 2.24 [16]

Age at diagnosis

18–29 2.31 [38] 1.70 [11] 8.56 [61]

30–44 13.94 [229] 8.85 [57] 25.95 [185]

45–59 32.56 [535] 25.47 [164] <0.0001 35.34 [252] <0.0001

60–74 40.54 [666] 42.24 [272] 23.98 [171]

75–80 10.65 [175] 21.74 [140] 6.17 [44]

Clark’s Level

II 36.76 [604] 37.42 [241] 38.71 [276]

III 25.63 [421] 22.98 [148] 23.56 [168]

IV 31.89 [524] 32.30 [208] 0.10 31.98 [228] 0.85

V 2.74 [45] 2.17 [14] 2.66 [19]

Unknown 2.98 [49] 5.12 [33] 3.09 [22]

Site

Head and neck 16.43 [270] 17.86 [115] 13.74 [98]

Trunk 34.04 [625] 39.60 [255] 37.03 [264]

Upper limb 23.25 [382] 22.05 [142] 0.63 25.95 [185] 0.39

Lower limb 21.85 [359] 20.34 [131] 22.86 [163]

Unknown 0.43 [7] 0.15 [1] 0.42 [3]

Table 2.3.3.7.1.1: Comparison of characteristics between consenting, non-consenting

and non-responding subjects

Analysis of consent versus non-consent found that the distributions of Breslow

thickness, sex, Clark’s level and site did not differ significantly. The frequency

distributions of these variables in consenters and non-consenters were similar.

However, the age of diagnosis distribution was significantly different (P<0.0001).

2.3. The Western Australian Melanoma Health Study 77

Analysis of consent versus non-response found that the distributions of Breslow

thickness, Clark’s level and site did not differ significantly. However, the sex

distribution was significantly different (P=0.0091), with a greater proportion of

males not responding (64.09%) versus consenting to participate (58.37%). The

age of diagnosis distribution also differed significantly between consenters and

non-responders (P<0.0001).

The differences in age distribution between consenters and non-consenters, and

consenters and non-responders was marked. In consenting individuals, the age

group 18 to 29 accounted for 2.31% of consenters. This was similar to the 1.70%

of non-consenters in this age category, however 8.56% of the non-responders were

aged between 18 to 29. This is a much larger proportion than would be expected.

Similar trends can be seen for the next two age groups – 30 to 44, and 45 to 59 –

with these age groups accounting for a smaller proportion of non-consenters, and

a larger proportion of non-responders. In the older age groups, the opposite trend

was observed, with the two age groups 60 to 74 and 75 to 80 accounting for a

larger proportion of non-consenters and a smaller proportion of non-responders.

These results suggest that individuals who did not respond tended to be younger

than individuals who consented. In addition, non-consenters appeared to be older

than consenting individuals. This may indicate that younger individuals who did

not wish to participate in the Study were less likely to respond, where as older

individuals who did not wish to participate were more likely to respond indicating

their non-consent.

78Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.3.3.8 Characteristics of participants

Table 2.3.3.8.1 shows demographic, clinical and questionnaire characteristics of

WAMHS participants who completed either Part 1 or Part 2 of the questionnaire.

These characteristics are stratified by sex and compared using Fisher’s exact test.

Participants who consented to participate in the study, but did not complete ei-

ther of the questionnaires are excluded from this table.

The sex distribution was skewed, with 58.17% of the sample male. Approximately

three-quarters of subjects were aged over 45, and the age distribution differed be-

tween sexes (P<0.0001). There were more females aged between 18 and 59 and

more males aged between 60 and 80.

Approximately 70% (n=944) of subjects had melanomas diagnosed less than 1 mm

thick, and only 3.55% (n=47) had melanomas thicker than 4 mm. Breslow thick-

ness differed between the sexes, with 69.57% (n=535) of males having melanomas

less than 1 mm thick, compared to 73.96% (n=409) of females. In addition, males

were more than twice as likely (4.55% compared to 2.17%) of having melanomas

more than 4 mm thick.

The most common melanoma site was the trunk, with 38.05% (n=203) of melanoma

diagnoses. The least common melanoma site was the head and neck region

(15.36%). Site distributions differed significantly between sexes (P<0.0001), with

male participants more likely to have melanomas diagnosed on their head, neck

and trunk, and females on their upper and lower limbs. The greatest difference

due to sex was observed at the lower limb site, with females approximately 2.4

2.3. The Western Australian Melanoma Health Study 79

times more likely to be diagnosed with melanomas on their lower limbs compared

to males.

The majority of subjects considered themselves to be Caucasian, with 50% (n=813)

considering themselves to be Caucasian of English background, and 27% (n=442)

Caucasian with Scottish, Irish or Welsh background. Approximately three-quarters

of subjects were born in Australia, with approximately 14% (n=182) born in Eng-

land, Ireland, Scotland, or Wales.

Over 90% (n=1,198) of subjects had some ability to tan, while only 8.47% (n=112)

reported that their skin never tanned, and only freckled. Men appeared to have

a greater ability to tan (P<0.0001) and also had a lower propensity to burn

(P<0.0001). This may be a result of self-reporting errors, with men more likely to

under-report their burning tendency, however there have not been any published

studies which support this theory.

Approximately 18% (n=237) of subjects considered their skin colour to be very

fair, 10.59% (n=140) had red hair at age 18, and 42.51% (n=562) had blue eyes.

Both hair (P<0.0001) and eye colour (P<0.0001) differed between sexes, with

men more likely to have black hair, and women more likely to have red hair. Men

were also more likely to have blue or brown eyes, with women more likely to have

hazel or green eyes. There is no scientific evidence for men and women having

different coloured eyes or hair, and this difference may be a result of reporting bias.

Degree of freckling decreased from childhood to adulthood. 23.45% (n=310) of

80Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

subjects had no freckling during childhood, which increased to 29.05% (n=384)

during adulthood. Similarly, 9.91% (n=131) of subjects considered themselves to

have ‘Many’ or ‘Very many’ freckles during childhood, which more than halved to

4.91% (n=65) in adulthood. Sex differences for childhood and adulthood freckling

also existed, with males less likely to have freckling compared to females for both

periods (P<0.0001). It is possible that this is due to differences in reporting by

males and females.

Another measure of sun exposure - blistering sunburn - also decreased from child-

hood to adolescence. 8.09% (n=107) of subjects reported more than 10 episodes of

blistering sunburn during childhood, which decreased to 5.98% (n=79) in adoles-

cence. Similarly, the percentage of subjects who reported no episodes of blistering

sunburn increased from childhood to adolescence, from 38.28% (n=506) in child-

hood, to 43.72% (n=578) in adolescence.

VariableWAMHS Males Females p-value

(n=1,322) (n=769) (n=553)

% [n] % [n] % [n]

Breslow thickness, mm

0–1.00 71.41 [944] 69.57 [535] 73.96 [409]

1.01–2.00 15.13 [200] 15.60 [200] 14.47 [80]

2.01–4.00 7.11 [94] 8.07 [94] 5.79 [32] 0.03

>4.00 3.55 [47] 4.55 [47] 2.17 [12]

Unknown 2.80 [37] 2.21 [17] 3.61 [20]

Age at diagnosis

18–29 2.19 [29] 1.43 [11] 3.26 [18]

30–44 14.14 [187] 9.75 [75] 20.25 [112]

45–59 33.06 [437] 30.94 [238] 35.99 [199] <0.0001

60–74 40.47 [535] 45.90 [353] 32.91 [182]

75–80 10.14 [134] 11.96 [92] 7.59 [42]

Clarks Level

II 36.23 [479] 36.02 [277] 36.53 [202]

III 25.72 [340] 24.58 [189] 27.31 [151]

Continued on Next Page. . .

2.3. The Western Australian Melanoma Health Study 81

Table 2.3.3.8.1 – Continued

VariableWAMHS Males Females p-value

(n=1,322) (n=769) (n=553)

IV 32.45 [429] 34.59 [266] 29.48 [163] 0.44

V 2.34 [31] 2.21 [17] 2.53 [14]

Unknown 0.45 [6] 0.52 [4] 0.26 [2]

Site

Head and neck 15.36 [203] 18.73 [144] 10.67 [59]

Trunk 38.05 [503] 46.69 [359] 26.04 [144]

Upper limb 23.68 [313] 19.90 [153] 28.93 [160] <0.0001

Lower limb 22.62 [299] 14.20 [110] 34.18 [189]

Unknown 0.30 [4] 0.39 [3] 0.18 [1]

Self-reported ethnicity

Caucasian - English 50.03 [813] 50.58 [478] 49.26 [335]

Caucasian - Scottish, Irish, Welsh 27.20 [442] 27.83 [263] 26.32 [179]

Caucasian - Northern European 7.69 [125] 8.47 [80] 6.62 [45]

Caucasian Southern European 2.52 [41] 1.69 [16] 3.68 [25]

Caucasian Eastern European 0.86 [14] 0.95 [9] 0.74 [5] 0.02

Caucasian Other 11.08 [180] 9.73 [92] 12.94 [88]

Indigenous 0.12 [2] 0.21 [2] 0.00 [0]

Middle Eastern 0.18 [3] 0.32 [2] 0.00 [0]

Pacific Islander 0.25 [4] 0.11 [1] 0.44 [3]

African 0.06 [1] 0.11 [1] 0.00 [0]

Place of Birth

Australia 74.58 [986] 75.68 [582] 73.06 [404]

New Zealand 4.31 [57] 3.77 [29] 5.06 [28]

England 10.82 [143] 10.40 [80] 11.39 [63]

Scotland/Ireland/Wales 2.95 [39] 2.99 [23] 2.89 [16] 0.78

Northern Europe 2.12 [28] 2.08 [16] 2.17 [12]

Other 4.31 [57] 4.42 [34] 4.16 [23]

Missing 0.91 [12] 0.65 [5] 1.27 [7]

Naevi

None 10.67 [141] 10.40 [80] 11.03 [61]

Few 35.02 [463] 34.20 [263] 36.17 [200]

Some 37.97 [502] 37.97 [292] 37.97 [210] 0.62

Many 15.20 [201] 15.99 [123] 14.10 [78]

Missing 1.14 [15] 1.43 [11] 0.72 [4]

Ability to tan

Very brown 12.55 [166] 15.47 [119] 8.50 [47]

Moderate brown 44.33 [586] 49.15 [378] 37.61 [208]

Slight tan 33.74 [446] 29.78 [229] 39.24 [217] <0.0001

No tan / freckle only 8.47 [112] 4.55 [35] 13.92 [77]

Missing 0.91 [12] 1.04 [8] 0.72 [4]

Continued on Next Page. . .

82Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.3.3.8.1 – Continued

VariableWAMHS Males Females p-value

(n=1,322) (n=769) (n=553)

Propensity to burn

Never burn, always tan 2.12 [28] 2.73 [21] 1.27 [7]

Sometimes burn, usually tan 50.76 [671] 55.40 [426] 44.30 [245]

Usually burn, sometimes tan 22.69 [300] 22.24 [171] 23.33 [129] <0.0001

Always burn, never tan 23.52 [311] 18.60 [143] 30.38 [168]

Missing 0.91 [12] 1.04 [8] 0.72 [4]

Skin Colour

Olive/brown 7.72 [102] 8.45 [65] 6.69 [37]

Fair 73.37 [970] 74.51 [573] 71.79 [397] 0.14

Very fair 17.93 [237] 15.99 [123] 20.61 [114]

Missing 0.98 [13] 1.04 [8] 0.90 [5]

Hair colour at age 18

Black 5.45 [72] 7.93 [61] 1.99 [11]

Dark brown 26.85 [355] 26.66 [205] 27.12 [150]

Grey 1.82 [24] 2.99 [23] 0.18 [1]

Light/mouse brown 33.59 [444] 31.73 [244] 36.17 [200] <0.0001

Fair/blonde 20.73 [274] 21.07 [162] 20.25 [112]

Red 10.59 [140] 8.58 [66] 13.38 [74]

Missing 0.98 [13] 1.04 [8] 0.90 [5]

Eye Colour

Brown 14.75 [195] 16.12 [124] 12.84 [71]

Hazel 22.77 [301] 19.51 [150] 27.31 [151]

Green 13.62 [180] 12.09 [93] 15.73 [87] <0.0001

Grey 5.37 [71] 5.07 [39] 5.79 [32]

Blue 42.51 [562] 46.16 [355] 37.43 [207]

Missing 0.98 [13] 1.04 [8] 0.90 [5]

Freckling, Childhood

None 23.45 [310] 27.57 [212] 17.72 [98]

Very Few 31.39 [415] 33.42 [257] 28.57 [158]

Few 20.42 [270] 19.90 [153] 21.16 [117]

Some 13.92 [184] 11.44 [88] 17.36 [96] <0.0001

Many/Very Many 9.91 [131] 6.63 [51] 14.47 [80]

Missing 0.91 [12] 1.04 [8] 0.72 [4]

Freckling, Adulthood

None 29.05 [384] 35.50 [273] 20.07 [111]

Very Few 37.90 [501] 39.27 [302] 35.99 [199]

Few 16.49 [218] 13.39 [103] 20.80 [115] <0.0001

Some 10.74 [142] 7.41 [57] 15.37 [85]

Many/Very Many 4.91 [65] 3.38 [26] 7.05 [39]

Missing 0.91 [12] 1.04 [8] 0.72 [4]

Continued on Next Page. . .

2.3. The Western Australian Melanoma Health Study 83

Table 2.3.3.8.1 – Continued

VariableWAMHS Males Females p-value

(n=1,322) (n=769) (n=553)

Blistering Sunburn, 5-12 years

0 times 38.28 [506] 35.24 [271] 42.50 [235]

1–5 times 40.54 [536] 42.26 [325] 38.16 [211]

6–10 times 10.06 [133] 11.05 [85] 8.68 [48]

> 10 times 8.09 [107] 8.84 [68] 7.05 [39] 0.07

Don’t know 1.74 [23] 1.43 [11] 2.17 [12]

Missing 1.29 [17] 1.17 [9] 1.45 [8]

Blistering Sunburn, 13-19 years

0 times 43.72 [578] 44.73 [344] 42.32 [234]

1–5 times 40.32 [533] 40.45 [311] 40.14 [222]

6–10 times 8.09 [107] 8.06 [62] 8.14 [45] 0.65

> 10 times 5.98 [79] 5.07 [39] 7.23 [40]

Don’t know 0.60 [8] 0.52 [4] 0.72 [4]

Missing 1.29 [17] 1.17 [9] 1.45 [8]

Table 2.3.3.8.1: Questionnaire, demographic and clinical characteristics of WAMHS sample, strat-

ified by sex

2.3.3.9 WAMHS sample summary

The WAMHS is a population-based case collection comprised of individual melanoma

cases recruited from the WACR. Phenotypic, demographic, clinical, and biospeci-

men data were collected as part of this study, and form a comprehensive resource

for melanoma researchers.

The WAMHS sample consisted of 1,643 consenting cases of melanoma diagnosed

between January 2006 and September 2009. Of these cases, 144 (8.76%) par-

ticipated in the pilot study, and 1,499 (91.24%) participated in the full-scale

implementation. Of consenting participants from the pilot study, 87 (67.97%)

completed both the questionnaire and donated a blood sample. After changes to

the study, and the implementation of the full-launch study, 1,157 (81.14%) partic-

84Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

ipants completed the questionnaire and donated a blood sample. This increase in

completion rates may be attributed to the shorter questionnaire and the change

to administering the questionnaire by telephone interview.

Analysis of consent versus non-consent revealed that the age distributions were

significantly different, with a greater proportion of subjects aged between 45 and

59 consenting to the study, and a lower proportion of subjects aged between 75

and 80 consenting. The lower proportion of older subjects consenting may be as

participation in a study can be particularly difficult for older individuals for many

reasons, e.g. they may be less mobile and suffering from poorer health.

Analysis of consent versus non-response found that the sex distribution was signif-

icantly different, with a greater proportion of males failing to respond, compared

to males who consented to participate. In addition, the age distribution differed

significantly, with more subjects aged less than 60 failing to respond, and more

individuals over 60 responding.

Analysis of the WAMHS sample revealed that the demographic and clinical vari-

ables, age at diagnosis, Breslow thickness, and site distributions, differed between

sexes. There were more females aged between 18 and 59 and more males aged be-

tween 60 and 80 in the study. Females were more likely to have thinner melanomas

(< 1 mm), while males were more likely to have thicker melanomas (> 4 mm).

Male subjects were more likely to have melanomas diagnosed on their head, neck

and trunk, while women were more likely to have melanomas diagnosed on their

upper and lower limbs. This site distribution is similar to those observed in pre-

2.3. The Western Australian Melanoma Health Study 85

vious studies [90,163].

Several questionnaire variables differed between sexes, including ability to tan,

propensity to burn, hair and eye colour. Males appeared to have a greater ability

to tan, and a lower propensity to burn. In addition, males were more likely to

have black hair, compared to females, and females were more likely to have red

hair, compared to males. Eye colour also differed between the sexes, with males

more likely to have brown or blue eyes, and females more likely to have hazel or

green eyes.

The imbalance in self-reported variables in the WAMHS sample between men and

women, such as hair and eye colour, does not appear to have a scientific basis.

These differences may be explained by systematic self-report bias. For example,

women in this sample may be more likely than men to select their eye colour as

green. This may have implications if these self-reported variables were included

as covariates in analyses.

2.3.4 WAMHS sub-sample used in this thesis

2.3.4.1 Introduction

The genetic association analyses presented in this thesis, which are found in Sec-

tions 2.4 (please see page 111) and 2.5 (please see page 131), use a subset of

the WAMHS sample. This subset consists of 800 unrelated individuals who had

completed both study questionnaires by February 2010, and had donated the

blood sample by September 2009. In addition, these subjects were chosen as they

had a Breslow thickness recorded and were of Caucasian descent. A subject was

86Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

categorised as Caucasian if they described both themselves and their parents as

Caucasian.

Blood samples from these 800 participants were genotyped for 42 SNPs, and the

selection of these SNPs is described in this section. This section also includes

methods for the descriptive analyses of both study variables and genotypic data,

and also presents the results of these descriptive analyses. In addition, the gener-

alisability of this sample to the entire population of melanoma diagnoses within

the diagnosis timeframe is investigated.

2.3.4.2 SNP selection

Forty two melanoma risk SNPs were chosen as they had been identified as vali-

dated melanoma-risk SNPs, or were in genes which have been validated as melanoma-

risk genes. Nineteen SNPs were tagged from three genes – MC1R, BRAF and

EGF. Variation in all three genes was captured by choosing tag-SNPs using a

pairwise tagging approach in the Haploview software package [164]. Gene coordi-

nates were determined by using the University of California Santra Cruz Genome

Browser Gateway [165], with the coordinates used for tagging including 10 kb

upstream and 10 kb downstream of each gene. Haploview tagger [166] was used

to select a set of tag-SNPs which captured maximal variation in the gene. The

minimum minor allele frequency for selected SNPs was set at 5% and the thresh-

old for LD was set to r2=0.80.

Table 2.3.4.2.1 shows the three genes tagged in Haploview, the coordinates used

for tagging, and the tag-SNPs successfully genotyped in the WAMHS sample.

2.3. The Western Australian Melanoma Health Study 87

Gene Chromosome SNP Tagging Coordinates

rs1805005

rs1805007

MC1R 16 rs3212363 88,502,527 – 88,524,885

rs3212369

rs1267635

rs1733826

rs10487888

BRAF 7 rs17161747 140,070,754 – 140, 281,033

rs2365151

rs17623382

rs4726020

rs6944385

rs7655579

rs929446

rs11568993

EGF 19 rs882471 111,053,499–111,152,868

rs4698803

rs6533485

rs11569121

Table 2.3.4.2.1: Tagged SNPs from MC1R, BRAF and EGF genes

Several SNPs initially chosen were not able to be genotyped as their SNP score

was too low (< 0.4), or they were in close proximity to an already selected SNP.

The remaining 23 SNPs were chosen as they had been identified as melanoma-risk

SNPs in GWAS or candidate gene studies. Table 2.3.4.2.2 lists these SNPs, the

risk variant, and the observed associations. Thirteen of these SNPs had been

associated with melanoma risk in earlier studies, while 18 of these SNPs had been

associated with melanoma risk factors, including pigmentation, freckling, naevi

count, and skin sensitivity to the sun.

88

Chapter

2.G

eneticE

pidemiology

ofM

alignantM

elanoma:

Susceptibility

andP

rognosisin

theW

AM

HS

Gene SNP Risk Allele Association

CDC91L1 rs910873 A Increased melanoma risk [106,107]

MC1R rs258322 A Increased melanoma risk [106,124]

MYH7B rs1885120 C Increased melanoma risk [106,107]

TYR rs1042602 C Freckling [109], increased melanoma risk [106]

TYR rs1393350 A Blonde hair [109], green eyes [109], increased skin sensitivity to sun [109], increased melanoma risk [106]

TYRP1 rs1408799 C Blue eyes [109], increased skin sensitivity to sun [109], increased melanoma risk [108]

KITLG rs12821256 C Blonde hair [109]

SLC2A4 rs12896399 T Blonde hair [109], increased skin sensitivity to sun [109]

IRF4 rs1540771 A Brown hair [109], increased freckling [109], increased skin sensitivity to sun [109]

IRF4 rs12203592 T Blonde hair [124], lighter eye and skin colour [124], reduced tanning ability [124]

TPCN2 rs35264875 T Blonde hair [167]

TPCN2 rs3829241 A Blonde hair [167]

OCA2 rs1800407 G Blue eyes [168]

OCA2 rs1800401 A Black or brown eyes [169]

OCA2 rs7495174 A Blue or green eyes [109,168]

HERC2 rs12913832 G Blue eyes [170,171]

TP53 rs1042522 C Increased melanoma risk [172]

MTAP rs4636294 A Increased melanoma risk [105,106], increased naevi count, [105]

MTAP rs10757257 G Increased melanoma risk [105,106], increased naevi count [105]

MTAP rs7023329 A Increased melanoma risk [105,106], increased naevi count [105]

MTAP rs1011970 T Increased melanoma risk [106]

PLA2G6 rs2284063 A Increased melanoma risk [105,106], increased naevi count [105]

PLA2G6 rs132985 C Increased melanoma risk [105], increased naevi count [105]

Table 2.3.4.2.2: Reported melanoma-risk associations between genotyped SNPs and melanoma-risk factors

2.3. The Western Australian Melanoma Health Study 89

2.3.4.3 Laboratory methods

The methods to prepare the DNA for genotyping and the genotyping methods are

described below.

2.3.4.3.1 DNA preparation

DNA extraction and sample preparation was performed by the WADB, Perth.

DNA was extracted at room temperature from the buffy coat using the phenol-

chloroform method, and then suspended in TE buffer. DNA was quantitated using

a ND-1000 Nanodrop Spectrophotometer to ensure that sufficient quantities of un-

contaminated DNA were present. DNA samples of > 20 ng/µL concentration were

diluted to approximately 2 ng/µL using the PerkinElmer Multiprobe II pipetting

robot. Two microlitres of each DNA sample was dispersed into each well of a

96-well plate using the Beckman Coulter biomek FX machine, resulting in 4 ng

of DNA sample per well. Plates were left to dry at room temperature overnight

prior to use.

2.3.4.3.2 Genotyping

DNA samples were genotyped by the PathWest Molecular Genetics Service, Perth.

All genotyping was performed using Illumina GoldenGate Assays. The DNA sam-

ple was first activated, and the DNA was added to the oligonucleotides and then

hybridised. Three oligonucleotides were used for each SNP locus. Two oligonu-

cleotides were specific to each allele of the SNP (‘allele specific oligo’), and the

third hybridised at a site downstream from the SNP site (‘locus specific oligo’).

Following hybridisation, wash steps were performed to reduce excess and mis-

hybridised oligonucleotides. The allele specific oligo joined with the locus specific

90Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

oligo, to provide information regarding the genotype present at the SNP site. A

set of fluorescent Polymerase Chain Reaction (PCR) primers were added and PCR

occurred, generating fluorescent products. These products were combined with

Illumina beads on the Sentrix Array Matrix, combining with their matching bead.

The fluorescence of each bead was analysed on the BeadArray Reader, generating

automated genotype clustering and calling.

Genotyping quality control was performed in several steps. Each plate contained

a negative control sample (water) and a positive control CEPH trio sample. If

these controls failed, then the plate was re-genotyped. The controls did fail on one

plate and this was re-genotyped with no further errors. Additionally, genotyping

was repeated on 10% of the samples to ensure accuracy which resulted in 100%

concordance.

2.3.4.4 Methods for descriptive analyses

This section describes the methods for descriptive statistics of both study variables

and genotypic data.

2.3.4.4.1 Descriptive statistics of study variables

Descriptive statistics for the phenotypic and genotypic data were calculated us-

ing the statistical package, R. Means and standard deviations were calculated for

Normally distributed continuous variables, and medians and interquartile range

(IQR) were calculated for skewed continuous variables. Counts and percentages

were calculated for categorical and dichotomous variables. A histogram of Bres-

low thickness was examined to identify any deviations from Normality. Breslow

2.3. The Western Australian Melanoma Health Study 91

thickness was transformed by taking the natural log, loge(Breslow thickness) and

the resulting histogram was examined. Due to the skewness of Breslow thickness,

both the geometric mean and and 95% confidence intervals were calculated.

2.3.4.4.2 Descriptive statistics of genotype data

Genotype and allele frequencies were calculated using R. Consistency of genotype

distributions with HWE in the WAMHS sub-sample were examined at each SNP

locus using Fisher’s exact test.

2.3.4.5 Results for descriptive analyses

2.3.4.5.1 Phenotypic data

This sub-sample consisted of 800 individuals who were diagnosed with melanoma

from the WAMHS. Questionnaire-based characteristics and demographic and clin-

ical characteristics of this sub-sample are presented in Table 2.3.4.5.1.1 and Table

2.3.4.5.1.2 respectively.

Variable Study Population

(n=800)

Continuous variable, mean (SD)

BMI, kg/m2 27.45 (5.22)

Continuous variables, median (IQR)

Number of naevi - upper back 4.00 (9.00)

Number of naevi - lower back 5.00 (13.00)

Smoking, pack years 11.63 (21.95)

Dichotomous and categorical variables, % [n]

Place of Birth

Australia 76.00 [608]

England 10.50 [84]

New Zealand 4.25 [34]

Continued on Next Page. . .

92Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.3.4.5.1.1 – Continued

Variable Study Population

(n=800)

Scotland/Ireland/Wales 3.25 [26]

Africa 2.38 [19]

Northern Europe 2.12 [17]

Other 1.50 [12]

Smoking

Never smoked 53.75 [430]

Ex-smokers 40.25[322]

Current Smokers 6.00 [48]

Naevi

None 11.88 [95]

Few 36.00 [288]

Some 38.50 [308]

Many 13.50[108]

Missing 0.02[1]

Sunbed use

Yes 11.38 [91]

No 88.25 [706]

Don’t know 0.37 [3]

Self-reported history of melanoma

Yes 27.37 [219]

No 72.63 [581]

WACR reported history of melanoma

Yes 21.13 [169]

No 78.87 [631]

Self-reported history of SCC or BCC

Yes 45.38 [363]

No 45.50 [364]

Don’t know 9.12 [71]

Ability to tan

Very brown 12.37 [99]

Moderate tan 44.50 [356]

Slight tan 34.00 [272]

No tan/freckle only 9.13 [73]

Propensity to burn

Continued on Next Page. . .

2.3. The Western Australian Melanoma Health Study 93

Table 2.3.4.5.1.1 – Continued

Variable Study Population

(n=800)

Never burn, always tan 2.25 [18]

Sometimes burn, usually tan 51.12 [409]

Usually burn, sometimes tan 24.13 [193]

Always burn, never tan 22.50 [180]

Skin colour

Olive/brown 7.00 [56]

Fair 74.30 [594]

Very fair 18.70 [150]

Hair colour at age 18

Black 5.50 [44]

Dark brown 29.38 [235]

Grey 2.25 [18]

Light/mouse brown 33.62 [269]

Fair/blonde 20.37 [163]

Red 8.88 [71]

Eye colour

Brown 14.12 [113]

Hazel 24.25 [194]

Green 14.00 [112]

Grey 6.13 [49]

Blue 41.50 [332]

Degree of freckling - childhood

None 24.38 [195]

Very few 33.00 [264]

Few 20.25 [162]

Some 12.87 [103]

Many/Very many 9.50 [76]

Degree of freckling - adulthood

None 30.38 [243]

Very few 38.25 [306]

Few 16.75 [134]

Some 10.62 [85]

Many/Very many 4.00 [32]

Family history of melanoma

Yes 23.25 [186]

Continued on Next Page. . .

94Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.3.4.5.1.1 – Continued

Variable Study Population

(n=800)

No 76.75 [614]

Sunburn causing blisters, 5–12 years

0 times 39.75 [318]

1–5 times 39.63 [317]

6–10 times 10.50 [84]

> 10 times 8.00 [64]

Don’t know 2.12[17]

Sunburn causing blisters, 13–19 years

0 times 43.25 [346]

1–5 times 42.00 [336]

6–10 times 8.88 [71]

> 10 times 5.37 [43]

Don’t know 0.50 [4]

Sunburn causing blisters, > 20 years

0 times 69.38 [555]

1–5 times 25.37 [203]

6–10 times 2.63 [21]

> 10 times 2.37 [19]

Don’t know 0.25 [2]

Table 2.3.4.5.1.1: Questionnaire-based characteristics of subjects in the WAMHS sub-

sample

All 800 subjects used in this analysis were of self-reported Caucasian descent.

Individuals were classified as Caucasian if they considered their ethnicity to be

‘Caucasian’ and if they considered both their parents to be ‘Caucasian’. The

sex distribution was skewed with 56.12% males and 43.88% females. The median

age at diagnosis was 62.67, while the average age at data collection was 63.84

years. This corresponds to a median lag-time of approximately 14 months be-

tween melanoma diagnosis and completion of the study questionnaire.

2.3. The Western Australian Melanoma Health Study 95

VariableStudy Population

(n=800)

Continuous variable, geometric mean (95% CI)

Breslow thickness, mm 0.73 (0.69, 0.77)

Continuous variables, median (IQR)

Age at melanoma diagnosis, years 62.67 (16.86)

Age at data collection, years 63.84 (16.77)

Dichotomous and categorical variables, % [n]

Sex

Male 56.12 [449]

Female 43.88 [351]

Clark’s Level

II 36.25 [290]

III 26.00 [208]

IV 33.87 [271]

V 2.75 [22]

Unknown 1.13 [9]

Melanoma Site

Head and neck 17.00 [136]

Trunk 38.25 [306]

Upper limb 23.37 [187]

Lower limb 21.38 [171]

Table 2.3.4.5.1.2: Demographic and clinical characteristics of subjects in the

WAMHS sub-sample

Mean Breslow thickness was 1.11 mm; the distribution of which was highly skewed

(see Figure 2.3.4.5.1.1), with a median of 0.65 mm, indicating that half of WAMHS

subjects had melanomas less than 0.65 mm. A natural log transformation of

Breslow thickness resulted in a more Normal-appearing histogram (see Figure

2.3.4.5.1.2). When back-transformed, the geometric mean of Breslow thickness

was 0.73 (95%CI = 0.69, 0.77).

Melanomas were approximately evenly distributed through Clark’s Levels II, III

96Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.3.4.5.1.1: Histogram of the distribution of Breslow thickness in the

WAMHS.

and IV, while only a very small percentage of melanomas (2.75%) had a Clark’s

level of V. A large percentage (38.25%) of melanomas were located on the trunk,

with the lowest percentage of melanomas located on the head and neck regions

(17.00%).

Approximately three-quarters of subjects were born in Australia, with 10.50%

(n=84) born in England, and 4.25% (n=34) born in New Zealand. 53.75% (n=430)

of subjects had not smoked for a period of greater than six months, with approx-

2.3. The Western Australian Melanoma Health Study 97

Figure 2.3.4.5.1.2: Histogram of the distribution of log-transformed Breslow thick-

ness in the WAMHS

imately 6% (n=48) of all subjects currently smoking. Mean BMI was calculated

as 27.45 kg/m2.

Fewer naevi were present on the upper back (median=4), compared to the lower

back (median=5). The majority of subjects (74.59%) considered themselves to

have ‘Few’ or ‘Some’ naevi, while only 11.88% (n=95) had ‘None’. The degree

of freckling from childhood to adulthood decreased, with 30.38% (n=243) of sub-

jects recording no freckling during adulthood, decreasing to 24.38% (n=195) of

98Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

subjects in childhood. Similarly, 9.50% (n=76) of subjects considered themselves

to have ‘Many’ or ‘Very Many’ freckles in childhood, which more than halved by

adulthood to 4.00% (n=32).

27.37% (n=219) of individuals reported to have had more than one melanoma

diagnosed, however, records from the WACR indicated that only 21.13% (n=169)

of individuals had more than one melanoma diagnosis notified to the WACR. The

concordance between these two histories was moderate (Cohen’s kappa of 0.54).

Of the 169 individuals with a WACR history of melanoma, 44 (26%) individuals

did not report to the WAMHS that they had been diagnosed with more than

one melanoma. This may be as they were unaware of additional diagnoses, par-

ticularly if more than one melanoma was excised on the same day. Similarly, of

the 219 individuals who reported to have a history of melanoma, the WACR did

not have 94 (43%) of these melanomas recorded. This may be due to individuals

falsely stating they had a history of melanoma, or the WACR may not have been

notified of all melanoma diagnoses, which may occur if the melanomas were diag-

nosed outside of Australia. However, it appears more likely that individuals may

have over- and under-stated their melanoma diagnoses.

45.38% (n=363) of individuals reported to have been diagnosed with either an

SCC or BCC, 45.50% (n=364) reported not to have been diagnosed with an SCC

or BCC, with the remaining 9% (n=71) unable to recall this information. It has

been estimated that by the age of 70 years, 69% of males and 60% of females in

Australia will have been diagnosed with an SCC or BCC [281]. Therefore, with a

mean age of 63 at data collection, this rate of SCC and BCC diagnoses appears

to be a reasonable estimate.

2.3. The Western Australian Melanoma Health Study 99

Approximately 23% of individuals reported to have a first degree relative diag-

nosed with melanoma. This was self-reported and not validated. This is consid-

erably higher than other estimates of family history [93] and may be due to in-

dividuals with melanoma falsely believing they have a family history of melanoma.

Nearly half of all subjects had blue eyes (41.50%), and approximately 9% had red

hair at age 18. The majority of subjects considered their skin colour to be ‘Fair’

or ‘Very fair’ (93.00%), with the remaining 7.00% having ‘Olive’ or ‘Brown’ skin.

Approximately one quarter of subjects responded that they ‘Always burn, never

tan’, while only 2.25% (n=18) of subjects indicated that they ‘Never burn, always

tan’. Approximately 75% (n=628) of subjects indicated that after repeated sun

exposure, they develop a ‘Slight or moderate tan’, while 12.37% (n=99) develop

a ‘Very brown tan’, with 9.13% (n=73) indicating that they never tan and only

develop freckles.

Approximately 40% (n=318) of individuals did not have any incidents of sunburn

causing blisters when aged 5 to 12 years old. This increased to 43.25% (n=346)

and 69.38% (n=555) for the periods 13 to 19, and over 20 years of age, respectively.

The number of individuals who were sunburnt in each age group decreased as the

number of times sunburnt increased. 8.00% (n=64) of individuals were sunburnt

more than 10 times between the ages of 5 and 12, and this decreased to 5.37%

(n=43) and 2.37% (n=19) for the periods 13 to 19, and over 20 years of age,

respectively. In addition, 11.38% (n=91) of subjects had used a sunbed at some

time in their life.

100Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.3.4.5.2 Genotypic data

Genotype and allele frequencies and counts for the 42 genotyped SNPs are pre-

sented in Table 2.3.4.5.2.1, along with HapMap-CEU minor allele frequencies.

Included in Table 2.3.4.5.2.1 is the exact p-value for the HWE test for each SNP.

Thirty seven of the 42 genotype distributions were consistent with HWE. How-

ever, five genotypic distributions (IRF4 rs12203592, BRAF rs12676365, BRAF

rs6944385, EGF rs929446 and EGF rs11569121) were not in HWE at the 5% level

of significance.

Only three of the 42 SNPs had MAF less than 5%, and 12 SNPS had MAF less

than 10%, indicating that the majority of the SNPS were fairly common.

2.3.T

heW

esternA

ustralianM

elanoma

Health

Study

101

Major Homozygote Heterozygote Minor Homozygote Minor Allele

SNP n Gene Gen % n Gen % n Gen % n SMAF1 hMAf2 HWE3

rs910873 798 CDC91L1 GG 77.19 616 AG 21.43 171 AA 1.38 11 A 12.09 8.4 1.00

rs1885120 798 MYH7B GG 78.32 625 CG 20.68 165 TT 1.00 8 C 11.34 7.1 0.60

rs1805005 793 MC1R GG 75.16 596 GT 23.33 185 TT 1.51 12 T 13.18 7.6 0.76

rs1805007 798 MC1R CC 74.06 591 CT 24.69 197 TT 1.25 10 T 13.60 12.0 0.18

rs3212363 798 MC1R AA 59.52 475 AT 33.96 271 TT 6.52 52 T 23.50 32.5 0.12

rs258322 800 MC1R CC 72.50 580 CT 25.87 207 TT 1.63 13 T 14.56 13.7 0.32

rs3212369 797 MC1R AA 67.13 535 AG 30.49 243 GG 2.38 19 G 17.63 13.7 0.18

rs1042602 799 TYR CC 38.92 311 AC 46.93 375 AA 14.15 113 A 37.61 43.0 1.00

rs1393350 796 TYR GG 47.49 378 AG 40.70 324 AA 11.81 94 A 32.16 22.6 0.06

rs1408799 793 TYRP1 CC 48.80 387 CT 41.87 332 TT 9.33 74 T 30.26 3.0 0.80

rs12821256 799 KITLG TT 80.98 647 CT 18.02 144 CC 1 .00 8 C 10.01 15.0 1.00

rs12896399 799 SLC24A4 GG 28.45 228 GT 51.69 413 TT 19.77 158 T 45.62 43.0 0.25

rs1540771 799 IRF4 AA 32.29 258 AG 49.56 396 GG 18.15 145 G 42.93 44.0 0.77

rs12203592 793 IRF4 CC 55.36 439 CT 34.93 277 TT 9.71 77 T 27.17 16.0 0.001

rs35264875 799 TPCN2 AA 69.09 552 AT 28.03 224 TT 2.88 23 T 16.90 17.5 1.00

rs3829241 797 TPCN2 GG 36.64 292 AG 45.92 366 AA 17.44 139 A 40.35 43.0 0.19

rs1800407 788 OCA2 GG 80.10 632 AG 18.76 148 AA 1.14 9 A 10.52 8.0 0.85

rs1800401 798 OCA2 CC 90.85 725 CT 9.15 73 TT 0.00 0 T 4.57 6.5 0.40

rs7495174 799 OCA2 AA 90.25 722 AG 9.63 77 GG 0.12 1 G 4.94 5.0 0.72

rs12913832 797 HERC2 GG 64.12 511 AG 30.74 245 AA 5.14 41 A 20.51 21.0 0.10

rs1042522 794 TP53 GG 58.57 465 CG 36.52 290 CC 4.91 39 C 23.17 23.0 0.55

rs4636294 791 MTAP AA 27.94 221 AG 48.93 391 GG 22.63 179 G 47.35 49.6 0.83

rs10757257 797 MTAP GG 38.27 305 AG 48.18 374 AA 14.80 118 A 38.36 37.6 0.88

rs7023329 797 MTAP AA 29.99 239 AG 48.18 384 GG 21.83 174 G 45.92 49.1 0.39

rs1011970 795 MTAP GG 65.54 521 GT 31.82 253 TT 2.64 21 T 18.55 17.4 0.16

Continued on Next Page. . .

102

Chapter

2.G

eneticE

pidemiology

ofM

alignantM

elanoma:

Susceptibility

andP

rognosisin

theW

AM

HS

Table 2.3.4.5.2.1 – Continued

Major Homozygote Heterozygote Minor Homozygote Minor Allele

SNP n Gene Gen % n Gen % n Gen % n SMAF1 hMAf 2 HWE3

rs2284063 797 PLA2G6 AA 45.18 360 AG 43.79 349 GG 11.04 88 G 32.94 31.9 0.81

rs132985 791 PLA2G6 CC 31.60 250 CT 49.56 392 TT 18.84 149 T 43.62 44.2 0.89

rs1267635 776 BRAF GG 71.52 555 AG 27.19 211 AA 1.29 10 A 14.88 12.8 0.05

rs1733826 792 BRAF GG 89.39 708 AG 10.10 80 AA 0.51 4 A 5.56 6.4 0.29

rs10487888 798 BRAF AA 29.70 237 AG 46.74 373 GG 23.56 188 G 46.93 47.8 0.08

rs17161747 783 BRAF GG 91.70 718 CG 8.30 65 CC 0.00 0 C 4.15 8.0 0.64

rs2365151 793 BRAF TT 88.02 698 CT 11.85 94 CC 0.13 1 C 6.05 5.0 0.35

rs17623382 800 BRAF TT 77.37 619 GT 21.50 172 GG 1.13 9 G 11.87 13.3 0.61

rs4726020 800 BRAF AA 55.87 447 AG 37.88 303 GG 6.25 50 G 25.19 21.9 0.93

rs6944385 800 BRAF AA 71.75 574 AT 27.00 216 TT 1.25 10 T 14.75 10.9 0.05

rs7655579 798 EGF AA 42.98 343 AG 43.99 351 GG 13.03 104 G 35.03 30.8 0.35

rs929446 798 EGF CC 35.84 286 CT 44.36 354 TT 19.80 158 T 41.98 39.7 0.01

rs11568993 799 EGF CC 84.11 672 CT 15.52 124 TT 0.37 3 T 8.13 9.0 0.47

rs882471 798 EGF GG 37.84 302 AG 46.62 372 AA 15.54 124 A 38.85 38.69 0.60

rs4698803 793 EGF TT 62.72 497 AT 33.04 262 AA 4.29 34 A 20.81 22.5 1.00

rs6533485 799 EGF GG 28.54 228 CG 49.44 395 CC 22.03 176 C 46.75 48.7 0.83

rs11569121 800 EGF GG 89.12 713 AG 10.13 81 AA 0.75 6 A 5.81 7.1 0.04

Table 2.3.4.5.2.1: Genotype frequencies of SNPs in the WAMHS sub-sample, including

HapMap-CEU frequencies and a test for HWE

1WAMHS minor allele frequency2HapMap-CEU minor allele frequency3P-value for exact test for HWE

2.3. The Western Australian Melanoma Health Study 103

2.3.4.6 Generalisation of sub-sample to population and participating

cases

2.3.4.6.1 Introduction

It is important that, as much as possible, a selected sub-sample represents the

eligible population and the participating cases so that results can be generalised

back to the participating sample, and in turn, the whole eligible population. The

eligible population in this thesis were all adult invasive cases of melanoma which

were diagnosed between January 2006 and September 2009 and were notified to

the WACR. WAMHS participating cases were all subjects who consented to par-

ticipate in the WAMHS, and who fully or partly completed questionnaires and/or

provided blood samples for DNA in either the pilot or the full-launch studies.

Subjects were included in the sub-sample on the basis that they had a recorded

Breslow thickness, therefore the eligible population and WAMHS participating

cases sample include only those subjects with a Breslow thickness recorded.

2.3.4.6.2 Methods

Comparisons between the selected sub-sample and both the WAMHS cases and

the eligible population were calculated for Breslow thickness, sex, age at diagnosis,

melanoma site and Clark’s level distributions. Differences in mean logged Breslow

thickness between the sub-sample and two other samples was calculated by a one-

sample t-test, with the null hypothesis that the sub-sample mean was equal to

both the WAMHS participating cases mean and population mean. Differences in

sex, age at diagnosis, Clark’s level and melanoma site distributions were calculated

using Fisher’s exact tests on contingency tables.

104Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.3.4.6.3 Results

Differences in Breslow thickness, sex, age at diagnosis, Clark’s level and melanoma

site distributions are shown in Table 2.3.4.6.3.1. The eligible population consisted

of 3,284 individuals who were eligible for inclusion in the WAMHS. The participat-

ing cases consisted of 1,418 individuals, and is a subset of the eligible population.

The sub-sample of 800 Caucasian subjects used in this thesis is a subset of the

WAMHS participating cases sample.

No significant differences were detected between the sub-sample used in this thesis

and the WAMHS case sample. This indicates that the sample of 800 subjects used

in this thesis is representative of the sample of 1,418 subjects who participated in

the WAMHS.

There were no significant differences in Breslow thickness, Clark’s level or melanoma

site between the WAMHS sub-sample used in this thesis and the eligible popula-

tion, with Fisher p-values of 0.75, 0.95 and 0.76, respectively.

Significant differences were observed between the distributions of sex (P=0.02)

and age (P<0.001). There was a higher proportion of females represented in the

WAMHS sub-sample compared to the eligible population, with females account-

ing for 43.88% (n=351) of the sample and only 39.37% (n=1,293) of the eligible

population. There was a higher proportion of individuals aged between 60 and

74 at the time of melanoma diagnosis in the WAMHS sub-sample, and a lower

proportion of all other age groups in the WAMHS sub-sample.

2.3. The Western Australian Melanoma Health Study 105

These differences in sex and age at diagnosis means that the WAMHS sub-sample

may not be representative of the eligible population.

106

Chapter

2.G

eneticE

pidemiology

ofM

alignantM

elanoma:

Susceptibility

andP

rognosisin

theW

AM

HS

VariableSub–sample WAMHS cases p-value Eligible Population p-value

(n=800) (n=1,418) (n=3,284)

Continuous variable, geometric mean (95% CI)

Breslow thickness, mm 0.73 (0.69, 0.77) 0.70 (0.67, 0.73) 0.24 0.73 (0.71, 0.75) 0.75

Dichotomous and categorical variables, % [n]

Sex

Male 56.12 [449] 58.89 [835] 60.60 [1991]

Female 43.88 [351] 41.11 [583] 0.22 39.37 [1293] 0.02

Age at diagnosis

18–29 1.38 [11] 2.05 [29] 3.87 [127]

30–44 12.13 [97] 13.61 [193] 15.99 [525]

45–59 30.37 [243] 32.79 [465] 0.28 31.18 [1024] <0.001

60–74 45.25 [362] 41.11 [583] 36.87 [1211]

75–80 10.87 [87] 10.44 [148] 12.09 [397]

Clark’s Level

II 36.25 [290] 37.73 [535] 37.42 [1229]

III 26.00 [208] 26.16 [371] 25.12 [825]

IV 33.88 [271] 32.93 [467] 0.28 33.43 [1098] 0.95

V 2.75 [22] 2.40 [34] 2.98 [98]

Unknown 1.12 [9] 0.78 [11] 0.12 [34]

Melanoma Site

Head and neck 17.00 [136] 15.73 [223] 16.60 [545]

Trunk 38.25 [306] 38.36 [544] 38.61 [1268]

Upper limb 23.37 [187] 23.70 [336] 0.79 23.54 [773] 0.76

Lower limb 21.38 [171] 22.00 [312] 20.98 [89]

Unknown 0.00 [0] 0.21 [3] 0.27 [9]

Table 2.3.4.6.3.1: Comparisons between the WAMHS sub-sample (n=800), WAMHS participating cases (n=1,418), and the

eligible population (n=3,284)

2.3. The Western Australian Melanoma Health Study 107

2.3.5 Summary

In this section, the establishment of the WAMHS resource has been described,

including the pilot study and full-scale launch of the study. Analysis of comple-

tion rates revealed that the completion rates of study components increased after

changes to the study protocol. Analysis of consenting, non-consenting and non-

responding subjects found that consenting and non-consenting subjects differed

in their age of diagnosis, while consenting and non-responding subjects differed in

both their age of diagnosis and sex.

Eligible subjects were identified through the WACR. Initially, each subject’s doc-

tor was contacted regarding their patient’s suitability to be contacted about the

study. If deemed suitable, subjects were subsequently contacted. This method

had both its benefits and limitations. Contacting the subject’s doctor initially

allowed the doctor to indicate whether their patient was suitable to be contacted

about participation in the study. Possible reasons not to contact their patient

included an inability to speak English, or the presence of co-morbidities which

meant contact or participation in the study was not suitable. Therefore, it is pos-

sible that the WAMHS sample has less non-English-speaking participants than

in the entire eligible population. There is also a possibility that individuals with

co-morbidities, such as dementia, or advanced melanoma were not invited to par-

ticipate in the study after their doctor deemed them not suitable to be contacted.

However, potential biases are likely to be small, as out of the 3,347 doctor contacts,

only 3% (n=112) of patients were not suitable to be contacted regarding the study.

Aside from these possible biases, one benefit of the study recruitment is that there

108Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

was limited selection bias when subjects were first identified through the WACR.

This meant that individuals contacted should have been representative of the en-

tire eligible population, and should not have differed in terms of demographic or

phenotypic variables. If recruiting subjects through a clinic, many selection biases

are possible, for example, it is possible that only subjects who live within a set

geographical distance from the clinic may be approached about participating in

the study. However, recruitment through the WACR meant that the WAMHS

was able to contact subjects from all over Western Australia, such that rural and

urban areas should be evenly represented. Also, as blood donation was possible

at many PathWest locations throughout Western Australia, there should not have

been an over-representation of any such geographical area. Analyses (not shown)

found that 70% of WAMHS subjects lived in urban areas, and 30% lived in rural

areas. These rates are comparable with the state geographical distributions, of

74% urban, and 26% rural [173].

Descriptive analyses of study variables identified several key variables which dif-

fered between sexes, including age at diagnosis, site, and pigmentation traits. The

WAMHS sub-sample used in this thesis was also described, with descriptive anal-

yses of phenotypic and genotypic variables performed.

Five of the 42 genotyped SNPs were not in HWE. These SNPs were IRF4 rs12203592,

BRAF rs12676365, BRAF rs6944385, EGF rs929446 and EGF rs11569121. Depar-

tures from HWE may be explained by genotyping errors, selection bias, violation

of the underlying assumptions, or chance [30]. However, departures from HWE

in cases does not necessarily represent a flawed study. Instead, departures from

HWE in cases may indicate an association between these SNPs and melanoma. In

2.3. The Western Australian Melanoma Health Study 109

fact, it has been proposed that deviations from HWE can be used as a screening

process to detect gene-disease associations [174].

Qualitative analysis of the key variables available from the WACR and the WAMHS

sub-sample – Breslow thickness, Clark’s level, age at diagnosis, sex and melanoma

site – revealed that Breslow thickness, Clark’s level, and site were similar between

the sub-sample used in this thesis and the eligible population for this thesis (sub-

jects with a recorded Breslow thickness).

The sub-sample of 800 subjects used in this thesis were the first 800 Caucasian

subjects with a recorded Breslow thickness who completed their questionnaire and

blood sample before November 2009. Therefore, there is a potential that these

early-responders may not have been similar to the late-responders. However, com-

parisons between the sub-sample and the WAMHS cases who participated in at

least one component of the study did not detect any significant differences between

these two samples for Breslow thickness, sex, age, Clark’s level and melanoma site.

In addition, the inclusion of only live subjects into the study may introduce bias.

Deceased subjects may have been different compared to live subjects, in particular

for characteristics which may have affected mortality, such as Breslow thickness.

However, this is not likely to have impacted greatly on the analyses as only 118

out of 3,420 (3.45%) individuals died prior to contact.

The sex distribution was skewed, with approximately 56% of the sub-sample male.

This a lower than expected number, given that it is expected that males are ap-

110Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

proximately 1.5 times more likely to be diagnosed with melanoma [63]. Therefore,

approximately 60% of the sub-sample would be expected to be male. In fact, anal-

ysis of the entire population diagnosed with melanoma in the diagnosis time frame

(n=3,284) revealed that 60.60% were male. The sex distribution of the eligible

population and the sub-sample used in this thesis differed, with 61% of the eligi-

ble population male, compared to only 56% males in the sub-sample. This may

indicate that women were more likely to participate in the study, or were more

likely to be early-responders, and therefore included in the sub-sample for this

thesis.

Age at diagnosis also differed between the sub-sample used in this thesis and the

eligible population. There was a higher proportion of individuals aged between 60

and 74 years in the sub-sample, and a lower proportion of individuals in all other

age groups. This may indicate that individuals aged between 60 and 74 were more

likely to participate in the study, or be early-responders.

With the exception of sex and age, analyses revealed that the WAMHS sub-

sample used in this thesis was generally representative of the eligible population.

Therefore, analyses which follow in the following sections should be generalisable

back to the entire melanoma population. These results may not be generalisable

if they are sex- or age-specific association results.

2.4. Association of Candidate Loci with Melanoma Susceptibility 111

2.4 Association of Candidate Loci with Melanoma Susceptibility

2.4.1 Introduction

The aim of this section is to investigate the genetic association of 42 candidate

SNPs in 16 candidate loci with melanoma risk using WAMHS cases and relevant

controls. The 42 genotyped SNPs described in Section 2.3.4.2 were chosen as they

had been identified as melanoma-risk SNPs, therefore replication of their associa-

tions with melanoma-risk in the WAMHS sample was investigated. Although not

a primary aim of this thesis, this was thought to be important to inform both

the generalisability of the WAMHS genetic association results and the interpre-

tation of any genetic associations with melanoma prognostic phenotypes. In this

section, SNP allelic frequencies in WAMHS cases were compared to two general

population (unselected) control samples: the Busselton Health Study (BHS) and

HapMap-CEU populations. The results of these analyses are then presented and

discussed.

2.4.2 Methods

2.4.2.1 Study populations

Data from two independent populations, one from the United States of America,

and one from Western Australia, were used to investigate the role of candidate

SNPs with melanoma risk.

The WAMHS sample (i.e. cases) used in these analyses was the sub-sample of 800

Caucasian individuals described earlier in Section 2.3.4.

112Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

The two non-melanoma populations (i.e. controls) were the HapMap-CEU and

the BHS populations. HapMap-CEU is a population of Caucasian individuals

from Utah in the United States of America, with Northern and Western Euro-

pean ancestry [175]. HapMap tag-SNPs have been shown to perform well in the

Australian population [176]. The HapMap-CEU sample size differs depending on

which SNPs are examined. For the SNPs investigated in this thesis, the HapMap

sample size ranged from 55 to 113 individuals. No phenotypic data, including age

or sex, were available for these individuals.

The BHS is a longitudinal epidemiological study of participants residing in Bus-

selton, a coastal town in the south-west of Western Australia. Since 1966, a series

of health surveys has collected epidemiological data on over 16,000 men, women

and children [177, 178]. For the current study, genotypic data from a GWAS un-

dertaken using the Illumina 610-Quad BeadChip was used. Phenotypic data was

collected as part of the 1994-1995 collection period. The BHS sample used in this

thesis consisted of 1,366 unrelated individuals, however not all individuals were

successfully genotyped at each SNP. For the SNPs investigated in this thesis, the

BHS sample size ranged from 1,322 to 1,366 individuals

Table 2.4.2.1.1 shows some characteristics of the BHS sample used in this thesis.

The sample consisted of 57.98% females and the median age of the sample was

53.05 years.

2.4. Association of Candidate Loci with Melanoma Susceptibility 113

VariableBHS Sample

(n=1,366)

Continuous variables, median (IQR)

Age at data collection, years 53.05 (28.1)

BMI, kg/m2 25.97 (5.12)

Dichotomous variable, % [n]

Sex

Male 42.02 [574]

Female 57.98 [792]

Table 2.4.2.1.1: Participant characteristics of the BHS sample

2.4.2.2 Statistical methods

Genotype frequency distributions were compared between the WAMHS sample

and Hapmap-CEU frequencies using Fisher’s exact test for contingency tables.

As additional covariate data were available in the BHS sample, multivariate case-

control analyses were undertaken for the WAMHS-BHS comparisons. WAMHS

genotype frequency distributions were compared to BHS genotype frequencies

using a generalised linear model, adjusted for any covariates common to both data

sets and significantly associated (P< 0.05) with melanoma risk. P-values were

presented as Likelihood Ratio Test (LRT) p-values. The LRT-statistic, D, was

calculated by D = −2 loge(likelihood [pheno]) − 2 loge(likelihood [pheno+SNP]),

where the model ‘pheno’ contains only the significant phenotypic variables and the

‘pheno+SNP’ model contains both phenotypic variables and the SNP. D is then

compared to a Chi-squared distribution with df1-df2 degrees of freedom, where

df1 and df2 are the degrees of freedom for the ‘pheno’ and ‘pheno+SNP’ models

respectively.

114Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.4.3 Results

Genotype frequency distributions between the WAMHS sample and HapMap-

CEU and BHS are presented in Table 2.4.3.1. Two SNPs, OCA2 rs1800401 and

TPCN2 rs35264875, were not genotyped in either the HapMap-CEU or the BHS

samples, and are therefore not shown in the table.

Genotype distributions were available for 40 of the 42 SNPs in the HapMap-CEU

sample. Statistical analysis revealed that seven genotypic distributions were sta-

tistically different from the HapMap genotypic distributions at the 5% level of sig-

nificance (MC1R rs3212363, TYR rs1042602, TYR rs1393350, KITLG rs12821256,

SLC2A4 rs12896399, IRF4 rs12203592, BRAF rs17161747), indicating their pos-

sible role as melanoma-risk SNPs. Three genotypic distributions showed marginal

significance (MYH7B rs1885120, MC1R rs1805005, MTAP rs1011970). When ad-

justed for multiple testing using FDR, no SNPs were significantly associated with

melanoma-risk compared to the HapMap-CEU sample.

Twenty seven of the 48 SNPs were available as genotyped or imputed SNPs in the

BHS GWAS. Multivariate modelling between WAMHS cases and BHS controls

found that the phenotypes sex, BMI, age, age2, age3 and an age:sex interaction

were significantly associated with melanoma-risk (Table 2.4.3.2).

After adjustment for these phenotypic variables, comparison of genotypic dis-

tributions between the WAMHS cases and BHS controls found nine distribu-

tions which were statistically different at the 5% level of significance (CDC91L1

rs910873, MC1R rs258322, TYR rs1393350, HERC2 12913832, IRF4 rs12203592

2.4. Association of Candidate Loci with Melanoma Susceptibility 115

MTAP rs4636294, MTAP rs7023329, MTAP rs1011970, BRAF rs6944385). In ad-

dition, five genotypic distributions were marginally significant (P<0.10), MYH7B

rs1885120, MTAP rs10757257, PLA2G6 rs2284063, PLA2G6 rs132985, BRAF

rs10487888). After adjustment for multiple testing, six SNP associations us-

ing the BHS sample remained significant with q-values < 0.05 (MC1R rs258322,

TYR rs1393350, IRF4 rs12203592, MTAP rs7023329, MTAP rs1011970, BRAF

rs6944385).

At the 5% level of significance, two SNPs, TYR rs1393350 and IRF4 rs12203592,

were both significantly associated with melanoma risk compared to both the

HapMap-CEU and BHS populations. However, after adjustment for multiple

testing, no SNPs were significantly associated with melanoma risk across both

populations.

116

Chapter

2.G

eneticE

pidemiology

ofM

alignantM

elanoma:

Susceptibility

andP

rognosisin

theW

AM

HS

WAMHS HapMap-CEU BHSGene SNP Genotype % [n] % [n] p-value q-value % [n] p-value1 q-value

GG 77.19 [616] 84.07 [95] 81.89 [1099]CDC91L1 rs910873 AG 21.43 [171] 15.04 [17] 0.25 0.62 16.99 [228] 0.05 0.06

AA 1.38 [11] 0.89 [1] 1.12 [15]

GG 78.32 [625] 86.73 [98] 82.45 [1109]MYH7B rs1885120 CG 20.68 [165] 12.39 [14] 0.08 0.35 16.65 [224] 0.06 0.06

CC 1.00 [8] 0.89 [1] 0.89 [12]

GG 75.16 [596] 84.82 [95] -MC1R rs1805005 GT 23.33 [185] 15.18 [17] 0.06 0.28 - - -

TT 1.51 [12] 0 [0] -

CC 74.06 [591] 76.79 [86] -MC1R rs1805007 CT 24.69 [197] 22.32 [25] 0.87 0.85 - - -

TT 1.25 [10] 0.89 [1] -

AA 59.52 [475] 40.00 [24] -MC1R rs3212363 AT 33.96 [272] 55.00 [33] 0.005 0.06 - - -

TT 6.52 [52] 5.00 [3] -

CC 72.50 [581] 73.45 [83] 79.73 [1082]MC1R rs258322 CT 25.87 [207] 25.66 [29] 1.00 0.87 18.79 [255] 0.00003 0.0001

TT 1.63 [13] 0.89 [1] 1.47 [20]

AA 67.13 [535] 72.57 [82] -MC1R rs3212369 AG 30.49 [243] 27.43 [31] 0.19 0.56 - - -

GG 2.38 [19] 0.00 [0] -

CC 38.92 [311] 37.17 [42] 39.82 [544]TYR rs1042602 AC 46.93 [376] 39.82 [45] 0.05 0.25 47.00 [642] 0.80 0.37Continued on Next Page. . .

2.4.A

ssociation

ofC

andidateL

oci

with

Melanom

aS

usceptibility117

Table 2.4.3.1 - Continued

WAMHS HapMap BHSGene SNP Genotype % [n] % [n] p-value q-value % [n] p-value1 q-value

AA 14.15 [113] 23.01 [26] 13.18 [180]

GG 47.55 [379] 62.83 [71] 52.12 [689]TYR rs1393350 AG 40.65 [324] 29.20 [33] 0.01 0.07 40.32 [533] 0.003 0.01

AA 11.79 [94] 7.97 [9] 7.56 [100]

CC 48.80 [387] 47.79 [54] 47.84 [653]TYRP1 rs1408799 CT 41.87 [332] 43.36 [49] 0.97 0.86 41.25 [563] 0.83 0.38

TT 9.33 [74] 8.85 [10] 10.92 [149]

TT 80.98 [647] 71.68 [81] 77.58 [1059]SLC24A4 rs12821256 CT 18.02 [144] 27.43 [31] 0.05 0.25 20.59 [281] 0.14 0.10

CC 1.00 [8] 0.89 [1] 1.83 [25]

GG 28.54 [228] 16.81 [19] 30.31 [414]KITLG rs12896399 GT 51.69 [413] 52.22 [59] 0.004 0.06 49.41 [675] 0.22 0.14

TT 19.77 [158] 30.97 [35] 20.28 [277]

AA 32.29 [258] 27.03 [111] -IRF4 rs1540771 AG 49.56 [396] 57.66 [64] 0.30 0.66 - - -

GG 18.15 [145] 15.31 [17] -

GG 36.64 [292] 32.74 [37] 35.16 [480]TPCN2 rs3829241 AG 45.92 [366] 49.56 [56] 0.71 0.82 49.45 [675] 0.37 0.21

AA 17.44 [139] 17.70 [20] 15.38 [210]

GG 80.10 [632] 85.72 [96] -OCA2 rs1800407 AG 18.76 [148] 13.39 [15] 0.35 0.70 - - -

AA 1.14 [9] 0.89 [1] -

Continued on Next Page. . .

118

Chapter

2.G

eneticE

pidemiology

ofM

alignantM

elanoma:

Susceptibility

andP

rognosisin

theW

AM

HS

Table 2.4.3.1 - Continued

WAMHS HapMap BHSGene SNP Genotype % [n] % [n] p-value q-value % [n] p-value1 q-value

AA 90.25 [722] 90.27 [102] 89.04 [1210]OCA2 rs7495174 AG 9.63 [77] 9.73 [11] 1.00 0.87 10.52 [143] 0.66 0.33

GG 0.12 [1] 0.00 [0] 0.44 [6]

GG 64.12 [512] 61.95 [70] 57.25 [782]HERC2 rs12913832 AG 30.74 [245] 34.51 [39] 0.65 0.81 35.80 [489] 0.05 0.06

AA 5.14 [41] 3.54 [4] 6.95 [95]

CC 55.36 [439] 70.80 [80] 62.42 [832]IRF4 rs12203592 CT 34.93 [277] 26.55 [30] 0.002 0.06 32.71 [436] 0.00002 0.0009

TT 9.71 [77] 2.65 [3] 4.88 [65]

GG 58.57 [465] 61.67 [37] -TP53 rs1042522 CG 36.52 [290] 30.00 [18] 0.31 0.67 - - -

CC 4.91 [39] 8.33 [5] -

AA 27.94 [221] 24.78 [28] 23.87 [326]MTAP rs4636294 AG 49.43 [391] 51.33 [58] 0.48 0.76 48.76 [666] 0.05 0.06

AA 22.63 [179] 23.89 [27] 27.38 [374]

GG 38.27 [305] 35.40 [40] 33.06 [451]MTAP rs10757257 AG 46. 93 [374] 53.98 [61] 0.30 0.66 49.41 [674] 0.07 0.07

AA 14.80 [118] 10.62 [12] 12.52 [239]

AA 29.99 [239] 26.13 [29] 24.74 [337]MTAP rs7023329 AG 48.18 [384] 49.55 [55] 0.66 0.81 49.05 [668] 0.04 0.05

GG 21.83 [174] 24.32 [27] 26.21 [357]

GG 65.54 [521] 70.54 [79] 71.23 [973]MTAP rs1011970 GT 31.82 [253] 24.11 [27] 0.09 0.39 26.57 [363] 0.02 0.04Continued on Next Page. . .

2.4.A

ssociation

ofC

andidateL

oci

with

Melanom

aS

usceptibility119

Table 2.4.3.1 - Continued

WAMHS HapMap BHSGene SNP Genotype % [n] % [n] p-value q-value % [n] p-value1 q-value

TT 2.64 [21] 5.35 [6] 2.20 [30]

AA 45.18 [360] 46.02 [52] 40.26 [550]PLA2G6 rs2284063 AG 43.79 [349] 44.25 [50] 0.96 0.86 46.49 [635] 0.09 0.07

GG 11.04 [88] 9.73 [11] 13.25 [181]

CC 31.60 [250] 30.09 [34] 27.11 [370]PLA2G6 rs132985 CT 49.56 [392] 51.33 [58] 0.95 0.86 50.04 [683] 0.09 0.07

TT 18.84 [149] 18.58 [21] 22.86 [312]

GG 71.52 [555] 76.11 [86] 75.11 [1026]BRAF rs1267635 AG 27.19 [211] 22.12[25] 0.44 0.74 22.99 [314] 0.19 0.13

AA 1.29 [10] 1.77 [2] 1.90 [26]

GG 89.39 [708] 87.27 [96] -BRAF rs1733826 AG 10.10 [80] 12.73 [14] 0.65 0.81 - - -

AA 0.51 [4] 0.00 [0] -

AA 29.70 [237] 23.89 [27] 31.77[434]BRAF rs10487888 AG 46.74 [373] 56.64 [64] 0.16 0.51 49.27 [673] 0.07 0.07

GG 23.56 [188] 19.47 [22] 18.96 [259]

GG 91.70 [718] 84.07 [95] -BRAF rs17161747 CG 8.30 [66] 15.93 [18] 0.01 0.70 - - -

CC 0.00 [0] 0.00 [0] -

TT 88.02 [698] 90.00 [54] -BRAF rs2365151 CT 11.85 [94] 10.00 [6] 0.85 0.85 - - -

CC 0.13 [1] 0.00 [0] -

Continued on Next Page. . .

120

Chapter

2.G

eneticE

pidemiology

ofM

alignantM

elanoma:

Susceptibility

andP

rognosisin

theW

AM

HS

Table 2.4.3.1 - Continued

WAMHS HapMap BHSGene SNP Genotype % [n] % [n] p-value q-value % [n] p-value1 q-value

TT 77.37 [619] 75.22 [85] 74.89 [1023]BRAF rs17623382 GT 21.50 [172] 23.01 [26] 0.68 0.82 23.28 [318] 0.31 0.19

GG 1.13 [9] 1.77 [2] 1.83 [25]

AA 55.87 [447] 61.61 [69] -BRAF rs4726020 AG 37.88 [303] 33.03 [37] 0.55 0.78 - - -

GG 6.25 [50] 5.36 [6] -

AA 71.75 [574] 80.00 [44] 82.25 [1098]BRAF rs6944385 AT 27.00 [216] 18.18 [10] 0.28 0.65 16.70 [223] 0.000001 0.0001

TT 1.25 [10] 1.82 [1] 1.05 [14]

AA 42.98 [343] 48.33 [29] 43.18 [589]EGF rs7655579 AG 43.99 [351] 41.67 [25] 0.71 0.82 45.09 [615] 0.66 0.33

GG 13.03 [104] 10.00[6] 11.73 [160]

CC 35.84 [286] 39.65 [23] 36.48 [494]EGF rs929446 CT 44.36 [354] 41.38 [24] 0.85 0.85 47.56 [644] 0.11 0.08

TT 19.80 [158] 18.97 [11] 15.95 [216]

CC 84.11 [672] 81.98 [91] 84.11 [1149]EGF rs11568993 CT 15.52 [124] 18.02 [20] 0.65 0.81 15.30 [209] 0.93 0.41

TT 0.37 [3] 0.00 [0] 0.59 [8]

GG 37.84 [302] 36.28 [41] 39.63 [541]EGF rs882471 AG 46.62 [372] 49.56 [56] 0.86 0.85 45.79 [625] 0.68 0.33

AA 15.54 [124] 14.16 [16] 14.58 [199]

TT 62.67 [497] 61.67 [37] -EGF rs4698803 AT 33.04 [262] 31.67 [19] 0.62 0.80 - - -Continued on Next Page. . .

2.4.A

ssociation

ofC

andidateL

oci

with

Melanom

aS

usceptibility121

Table 2.4.3.1 - Continued

WAMHS HapMap BHSGene SNP Genotype % [n] % [n] p-value q-value % [n] p-value1 q-value

AA 4.29 [34] 6.67 [4] -

GG 28.53 [228] 26.55 [30] -EGF rs6533485 CG 49.44 [395] 49.56 [56] 0.86 0.85 - - -

CC 22.03 [176] 23.89 [27] -

GG 89.12 [713] 86.72 [98] 90.30 [1220]EGF rs11569121 AG 10.13 [81] 12.39 [14] 0.57 0.79 9.25 [125] 0.43 0.24

AA 0.75 [6] 0.89 [1] 0.44 [6]

Table 2.4.3.1: Comparison between WAMHS and HapMap and BHS genotype frequencies

1P-value adjusted for age, age2, age3, sex, BMI and age:sex interaction

122Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Variable Odds ratio 95% CI p-value

Intercept 1.24 (0.04, 36.78) 0.90Sex 0.15 (0.07, 0.36) <0.001BMI 1.05 (1.03, 1.07) <0.001Age 0.75 (0.62, 0.92) <0.001Age2 1.01 (1.00, 1.01) <0.001Age3 1.00 (1.00, 1.00) <0.001Age:Sex 1.04 (1.03, 1.06) <0.001

Table 2.4.3.2: Phenotypic variables multivariately associated with melanoma risk

in the WAMHS, compared to the BHS controls and cases

Overall, 14 SNPs were associated with melanoma risk in the WAMHS at the 5%

level of significance. However, after adjusting for multiple testing using the FDR,

only six SNPs were significantly associated with melanoma risk. A summary of

these results can be seen in Table 2.4.3.3.

HapMap BHS

Gene SNP p-value q-value p-value q-value

CDC91C1 rs910873 0.25 0.62 0.05 0.06

MC1R rs3212363 0.005 0.06 - -

MC1R rs258322 1.00 0.87 0.00003 0.0001

TYR rs1042602 0.05 0.25 0.80 0.37

TYR rs1393350 0.01 0.07 0.003 0.01

KITLG rs12821256 0.05 0.25 0.14 0.10

SLC24A4 rs12896399 0.004 0.06 0.22 0.14

HERC2 rs12913832 0.65 0.81 0.05 0.06

IRF4 rs12203592 0.002 0.06 0.00002 0.00009

MTAP rs4636294 0.48 0.76 0.05 0.06

MTAP rs7023329 0.66 0.81 0.04 0.05

MTAP rs1011970 0.09 0.37 0.02 0.04

BRAF rs17161747 0.01 0.70 - -

BRAF rs6944385 0.28 0.65 0.00001 0.0001

Table 2.4.3.3: Summary of associations between significant SNPs and melanoma

risk in the WAMHS

2.4. Association of Candidate Loci with Melanoma Susceptibility 123

2.4.3.1 Summary of results

Seven SNPs (from six genes) were associated with melanoma risk when com-

pared to HapMap-CEU, and nine SNPs (from seven genes) were associated with

melanoma risk when compared to the BHS controls. Only two SNPs were sig-

nificantly associated with melanoma risk across both HapMap-CEU and BHS

samples. After adjustment for multiple testing, no SNPs were significantly asso-

ciated with melanoma risk when compared to the HapMap-CEU, and six SNPS

(from five genes) were significantly associated with melanoma risk compared to

the BHS controls. It is likely that the results using the BHS sample are more

reliable than those using the HapMap-CEU sample, as the BHS sample is larger,

and is also sampled from a Western Australian population. The significant gene

and SNP associations, both before and after adjustments for multiple testing, are

described below.

2.4.3.1.1 MTAP

Compared to the BHS controls, three SNPs in the MTAP gene were signifi-

cantly associated with melanoma risk - rs4636294, rs7023329 and rs1011970. The

melanoma-risk alleles for these SNPs were consistent with the melanoma-risk SNP

alleles identified in previous studies. In particular, the rare variants of rs4636294

and rs7023329 were both associated with increased melanoma risk and increased

naevi count in GWAS by Falchi et al. [105] and Bishop et al. [106], while the

rare variant of the rs1011970 SNP was associated with naevi count only [105].

The fourth MTAP SNP genotyped, rs10757257, which had been identified as a

melanoma risk and naevi count SNP [105, 106] was marginally associated with

melanoma risk, with a p-value of 0.07.

124Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

After adjusting for multiple testing, the associations between melanoma risk and

rs7023329 (q=0.05) and rs1011970 (q=0.04) remained significant.

Compared to the HapMap-CEU sample, no MTAP SNPs were significantly asso-

ciated with melanoma risk in this study.

2.4.3.1.2 CDC91C1

The rs910873 SNP in the CDC91C1 gene was significantly associated with melanoma

risk when compared to the BHS controls. In particular, the ‘A’ variant of this SNP

was associated with melanoma risk in this study. A similar association between

this variant and melanoma risk was also observed in the two GWAS studies by

Brown et al. [107] and Bishop et al. [106]. Compared to the HapMap sample, this

SNP was not associated with melanoma risk. However, the frequency of the ‘G’

allele in the HapMap and BHS samples was substantially lower when compared

to the WAMHS sample. The frequency of the ‘G’ allele in the HapMap and BHS

samples was 8.41% and 9.61% respectively, compared to the WAMHS sample fre-

quency of 12.09%.

After adjusting for multiple testing, the rs910873 SNP was marginally associated

with melanoma risk when compared to the BHS sample (q=0.06).

2.4.3.1.3 TYR

Two SNPs in the TYR gene were associated with melanoma risk in this study.

The ‘A’ variant of the rs1393350 SNP was significantly associated with melanoma

2.4. Association of Candidate Loci with Melanoma Susceptibility 125

risk in this study when compared to the HapMap-CEU and BHS samples. The

genotypic distributions differed greatly between samples, with 63% and 52% of

individuals in the control populations having the common homozygote, compared

to 48% in the WAMHS sample. Similarly, 8% of individuals in the control popu-

lations were rare homozygous, compared to 12% in the WAMHS sample.

This SNP was previously associated with melanoma risk in a study by Bishop et

al. [106]. In addition, the ‘A’ allele was also associated with blonde hair, green

eyes and increased skin sensitivity to the sun in a study by Sulem et al. [109],

which are all known melanoma risk factors.

After adjustment for multiple testing, the rs1393350 SNP remained significantly

associated with melanoma risk when compared to the BHS controls (q=0.01), and

marginally associated (q=0.07) when compared to the HapMap-CEU sample.

The ‘C’ variant of the rs1042602 SNP, also in TYR, was associated with melanoma

risk, compared to the HapMap sample. However, when compared to the BHS con-

trols, the rs1042602 SNP was not associated with melanoma risk, with a p-value

of 0.80. The ‘C’ variant has been associated with melanoma risk in the GWAS by

Bishop et al. [106], and was also associated with freckling in a study by Sulem et

al. [109] which is a known risk factor for melanoma.

After adjustment for multiple testing, the rs1042602 SNP was no longer associated

with melanoma risk when compared to the HapMap sample (q=0.25).

126Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.4.3.1.4 KITLG

The rs12821256 SNP in the KITLG gene was significantly associated with melanoma

risk when compared to the HapMap sample, however no association was detected

between this SNP and melanoma risk using the BHS controls (P=0.14). The ‘T’

variant was associated with melanoma risk in this study, however this association

is in direct contrast to a previous study by Sulem et al. [109]. In the study by

Sulem et al., the ‘C’ variant was associated with blonde hair, which is a melanoma

risk factor, therefore suggesting that the ‘C’ variant is associated with melanoma

risk.

I am unable to explain this difference. However, it may be that the association ob-

served in this study was a false positive, which is supported by the non-significance

of the association when adjusted for multiple testing (q=0.25).

2.4.3.1.5 SLC24A4

The rs12896399 SNP from the SLC24A4 gene was significantly (P=0.05) asso-

ciated with melanoma risk when compared to the HapMap sample, however no

association was detected between this SNP and melanoma risk when compared

to the BHS controls (P=0.22). The ‘G’ variant was associated with melanoma

risk in this study. In a previous study by Sulem et al. [109], the ‘T’ variant was

associated with melanoma risk factors, including blonde hair and increased skin

sensitivity to the sun. The results from the study by Sulem et al. are therefore in

direct contrast from the results of this study.

I am unable to explain this difference. However, as with rs12821256 from KITLG,

2.4. Association of Candidate Loci with Melanoma Susceptibility 127

it may be that the association observed in this study was a false positive. This is

supported by the only marginal significance of the association when adjusted for

multiple testing (q=0.06).

2.4.3.1.6 HERC2

The ‘G’ variant of the rs12913832 SNP in the HERC2 gene was associated with

melanoma risk in this study, when compared to BHS controls. 61.96% of the

WAMHS sample were homozygous for the ‘G’ allele, compared to 57.25% of the

BHS sample. No association was detected between this SNP and melanoma risk

compared to the HapMap sample.

The ‘G’ allele of this SNP has been associated with blue eyes in previous stud-

ies [170,171], and melanoma risk in a recent study by Duffy et al. [179].

After adjustment for multiple testing, the rs12913832 SNP was marginally asso-

ciated with melanoma risk when compared to the BHS controls (q=0.06).

2.4.3.1.7 IRF4

The ‘T’ variant of the rs12203592 SNP in the IRF4 gene was associated with

melanoma risk in this study, compared to both BHS controls and the HapMap

sample. In particular, 71% and 62% of individuals in the control populations had

the common homozygote, compared to 55% in the WAMHS sample. Similarly, 3%

and 5% of individuals in the control populations were rare homozygotes, compared

to 10% in the WAMHS sample. This variant has been found to be associated with

melanoma risk factors, such as blonde hair, lighter skin colour, lighter eye colour

128Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

and reduced tanning ability in a study by Han et al. [124].

After adjustment for multiple testing, the rs12203592 SNP was marginally associ-

ated with melanoma risk compared to the the HapMap sample (q=0.06), however

this SNP was strongly associated with melanoma risk when using the BHS con-

trols (q=0.0009). HWE analysis from Section 2.3.4.5 revealed that this SNP was

not in HWE in the WAMHS cases, which may have been due to its role as a

melanoma-risk SNP.

The rs1540771 SNP also in the IRF4 gene was not associated with melanoma risk

when compared to the HapMap sample, and this SNP was not available in the

BHS study.

2.4.3.1.8 MC1R

Two SNPS in the MC1R gene were associated with melanoma risk. The rs3212363

SNP was associated with melanoma risk when compared to the HapMap sample,

however this SNP was not available in the BHS sample.

The ‘T ’ variant of the rs258322 SNP was associated with melanoma risk, when

using BHS controls. This variant allele was also identified by Bishop et al. [106]

and Han et al. [124] as a melanoma risk variant. Compared to the HapMap sam-

ple, this SNP was not associated with melanoma risk.

After adjustment for multiple testing, rs3212363 was marginally associated with

melanoma risk compared to the HapMap controls (q=0.06). However, the asso-

2.4. Association of Candidate Loci with Melanoma Susceptibility 129

ciation between rs258322 and melanoma risk was significant when using the BHS

controls (q=0.0001).

2.4.3.1.9 BRAF

Two SNPs on the BRAF gene were significantly associated with melanoma risk.

The ‘G’ variant of the rs17161747 SNP was associated with melanoma risk using

the HapMap sample, however this SNP was not available in the BHS sample. The

rs6944385 SNP was associated with melanoma risk when using the BHS controls,

however there was no association between this SNP and melanoma risk when us-

ing the HapMap sample. HWE analysis from Section 2.3.4.5 revealed that this

SNP was not in HWE in the WAMHS cases, which may have been due to its role

as a melanoma-risk SNP.

After adjustment for multiple testing, both the rs17161747 and the rs6944385

SNPs were no longer associated with melanoma risk when using the HapMap

sample, with q-values of 0.70 and 0.65, respectively. However, when using the

BHS controls, the rs6944385 SNP remained strongly associated with melanoma

risk (q=0.0001).

2.4.3.2 Discussion

The SNPs used in this analysis were chosen as they had been identified as melanoma-

risk SNPs, or were on melanoma candidate genes. This analysis has found 14 SNPs

which were significantly associated with melanoma risk, out of a possible 67 SNP

associations. More specifically, 25 SNPs were not associated with melanoma risk

using either the HapMap sample or BHS controls. These results may be due to a

130Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

number of reasons which are discussed below.

The WAMHS sample (n=800) is relatively small, and it is likely that the analyses

were underpowered. Similarly, the HapMap sample is small, generally consisting

of approximately 100 individuals. For the rarer SNPs, the rare homozygote count

is very small, often with less than 20 individuals. Therefore, the power to de-

tect differences in genotypic distributions was likely to be low. This was further

evidenced by the substantial differences in genotype distributions between the

WAMHS and the HapMap samples which may have been significant differences if

the HapMap sample size was larger.

In addition, the WAMHS sample did not have many phenotypic variables in com-

mon with both the HapMap and BHS samples. In fact, the HapMap sample did

not contain any phenotypic variables, therefore it was not possible to adjust for

any possible covariates when analysing melanoma risk. There was a limited num-

ber of common phenotypic variables collected as part of the WAMHS and BHS

samples; these were age, sex and BMI. Therefore only these three variables were

considered as covariates. It would be preferable to adjust for more phenotypic

variables, including sun exposure and naevi counts.

There is also a discrepancy between the methods of obtaining the SNP data in

cases and controls. The case (WAMHS) SNPs were all obtained through geno-

typing, while the control SNPs from the BHS consisted of both genotyped and

imputed SNPs. For the imputed SNPs, the program MACH [266] was used to de-

termine the ‘most likely’ genotype for each individual based on a genotype prob-

2.5. Association of Candidate Loci with Breslow thickness 131

ability greater than 0.50. This may have led to biases within the analyses and a

reduction in power, as some genotype uncertainty will remain unaccounted [267].

If possible, it would be preferable to use only genotyped or imputed SNPs in a

case-control study; and if using imputed data, to incorporate the genotype uncer-

tainty into the analysis.

One particular advantage of the BHS sample over the HapMap-CEU sample is

that the BHS individuals are from Western Australia, and are likely to have experi-

enced a similar level of sun exposure, therefore limiting biases due to confounding.

However, the sun exposure or other exposures experienced by the HapMap-CEU

sample are likely to be different to the WAMHS sample.

This is, to the author’s knowledge, the first study which has used Western Aus-

tralian individuals to investigate the role of genetic variants and melanoma risk.

As such, large-scale replication is required in larger Australian samples. This

is particularly relevant for SNPs in MC1R (rs3212363) and BRAF (rs17161747,

rs6944385) which have not been identified previously as melanoma-risk SNPs.

2.5 Association of Candidate Loci with Breslow thickness

2.5.1 Introduction

The aim of the research described in this section was to investigate the associ-

ation between the primary melanoma prognosis phenotype - Breslow thickness

- and genetic variants in candidate genes within the WAMHS population. The

SNPs genotyped (n=42 in 16 genes) were chosen as they had been previously

identified as variants associated with melanoma risk or were from melanoma-risk

132Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

genes. The associations between these SNPs and melanoma risk in the WAMHS

was reported in Section 2.4 (please see page 111).

In this section, a genetic association study between these SNPs and the best in-

dependent predictor of melanoma prognosis, Breslow thickness, was undertaken.

The results of these genetic association analyses are presented, and their implica-

tions are discussed.

2.5.2 Statistical methods

The methods used for univariate and multivariate association analyses for both

phenotypic and genotypic variables are described in this section. All analyses were

performed in Rv2.10.1. The sub-samples used in these analyses was the WAMHS

sub-sample (n=800) described earlier in Section 2.3.4.

2.5.2.1 Association analyses

2.5.2.1.1 Epidemiological analyses

2.5.2.1.1.1 Univariate analyses

Possible associations were investigated between Breslow thickness and several di-

chotomous, categorical and continuous phenotypic variables, using linear regres-

sion. Due to its skewed distribution, Breslow thickness was transformed using the

formula loge(Breslow thickness) for all analyses. The magnitude of the association

of variables with Breslow thickness were calculated by back-transforming the co-

efficients using the formula 100 ∗ (expcoefficient−1). For continuous phenotypes,

model coefficients were presented as a percentage change for a one unit increase in

2.5. Association of Candidate Loci with Breslow thickness 133

the phenotype. For dichotomous and categorical phenotypes, model coefficients

were presented as a percentage change for each level of the phenotype, compared

to the baseline.

2.5.2.1.1.2 Multivariate analyses

Multivariate models were fitted using generalised linear models (linear regres-

sion) [180]. Independent predictors of Breslow thickness were determined by using

a stepwise variable selection procedure. Starting with the full model of all possible

predictors, a backwards elimination approach was used, based on the Akaike Infor-

mation Criterion (AIC). The AIC attempts to find the model which best explains

the data, while penalising the model for the number of parameters estimated. AIC

is calculated as AIC=2p − 2 loge(L), where p is the number of parameters, and

L is the maximised value of the likelihood function for the estimated model. In

these analyses, the model with the smallest AIC was chosen as the preferred model.

Quadratic effects of age at diagnosis and phenotypic interactions, in particular sex

interactions, were also investigated. Model diagnostic plots, including residual and

Normal quantile-quantile plots were used to check that the final model did not

violate linear regression assumptions.

2.5.2.1.2 Genotypic analyses

2.5.2.1.2.1 Univariate analyses

All 42 SNPs were tested for univariate linear associations with loge(Breslow thick-

ness). These SNPs were tested under a codominant model, with the common ho-

mozygote set as the reference category. A codominant model was chosen initially

134Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

as inspection of codominant model coefficients can be used to identify the best fit-

ting model (i.e. co-dominant, additive, dominant, or recessive). ANOVA p-values

derived from the F-distribution were used to determine statistical significance,

which was defined at the 5% level of significance.

2.5.2.1.2.2 Multivariate analyses

Multivariate analysis of loge(Breslow thickness) and the 42 SNPs were undertaken.

Each SNP was added to the multivariate phenotypic models derived in Section

2.5.2.1.1.2 (please see page 133). Multivariate analysis was performed to reduce

the variability in the outcome which may enhance the ability to recognize statis-

tically significant effects. Initially each SNP was modelled codominantly. If the

SNP was significantly (P<0.05) or marginally (P<0.10) associated with Breslow

thickness, the model coefficients were investigated to determine if the SNP was

more predictive under either a codominant, additive, dominant or recessive model.

P-values are presented for each genotype factor relative to the baseline (common)

homozygote.

SNP:phenotype interactions were also considered to investigate any non-additive

effect of the SNPs and phenotypes on Breslow thickness. A final model of all

significant SNPs adjusted for significant phenotypes was fitted, including possi-

ble SNP:phenotype interactions. Residual analysis and Normal quantile-quantile

plots were used to check that the final model did not violate linear regression

assumptions.

2.5. Association of Candidate Loci with Breslow thickness 135

2.5.2.2 Statistical power

In statistical analyses, the probability of a ‘false negative’ or Type II error is the

probability of failing to reject the null hypothesis when the null hypothesis is

false. Power is defined as 1 − β, where β is the Type II error rate. Therefore as

the Type II error rate decreases, power increases. Conventionally, power of 0.8 is

considered to be adequate. This means that if 100 tests are performed, when the

null hypothesis is not true, 80 of these tests would reject the null hypothesis and

detect significant differences.

Power in genetic analyses is dependent on the sample size, MAF, size of the

effect and the type of disease model being tested, such as dominant, additive or

recessive. Statistical power was calculated for both genetic main effects and gene-

environment interactions. An alpha level of 0.0012 was used for the genetic main

effects, calculated by 0.0542

. For the interaction effects, 42 x 2 = 84 interaction tests

were performed, and therefore the alpha level used was 0.0006 (0.0584

). In this thesis,

power calculations were performed in the statistical software package Quanto [181]

for a sample size of 800, MAF ranging from 0.1 to 0.5, effect sizes of 0.1, 0.2,

0.3 and 0.5 standard deviations (SD). Main effect power was calculated under

dominant, additive and recessive models. Interaction effect power was calculated

for an additive model, assuming main effects for the gene and environment of

0.5SD, and an environment prevalence of 0.2 and 0.8.

2.5.3 Results

Descriptive statistics for the phenotypic variables and genotyped SNPs were pre-

sented in Section 2.3.4.5 (please see page 91). Results of phenotypic and genotypic

136Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

associations with Breslow thickness are presented in the following section.

2.5.3.1 Epidemiological analyses

2.5.3.1.1 Univariate analyses

Four phenotypic variables were significantly associated with Breslow thickness

in the WAMHS population when analysed univariately. Univariate associations

between Breslow thickness and questionnaire-based, demographic, and clinical

variables are presented in Table 2.5.3.1.1.1.

VariableMean change in p-value

logged Breslow thickness1

Continuous variables

BMI, kg/m2 0.009 0.09

Age at melanoma diagnosis, years 0.009 <0.001

Age at data collection, years 0.009 <0.001

Number of naevi - upper back 0.0006 0.71

Number of naevi - lower back -0.001 0.47

Smoking, pack years 0.0008 0.71

Dichotomous and categorical variables

Sex

Males reference 0.64

Females -0.027

Clark’s Level

II reference

III 0.569

IV 1.307 <0.001

V 2.699

Unknown 1.139

Melanoma Site

Head and neck reference

Trunk -0.142 0.32

Continued on Next Page. . .

2.5. Association of Candidate Loci with Breslow thickness 137

Table 2.5.3.1.1.1 - Continued

VariableMean change in p-value

logged Breslow thickness1

Upper limb -0.096

Lower limb -0.037

Ever Smoked

No reference

Yes 0.112 0.06

Presence of Naevi

No reference

Yes -0.392 <0.001

Self-reported history of melanoma

No reference

Yes -0.054 0.41

Family history of melanoma

No reference

Yes -0.027 0.69

Skin colour

Very fair reference

Fair -0.021 0.31

Olive/brown 0.155

Hair colour at age 18

Red reference

Fair/blonde -0.119

Light/mouse brown -0.039 0.75

Grey -0.201

Dark brown -0.031

Black -0.149

Eye colour

Blue reference

Grey 0.179

Green 0.089 0.42

Hazel 0.117

Brown 0.035

Sunburn causing blisters, 5-12 years

0 times reference

1-5 times -0.0618

6-10 times -0.029 0.81

Continued on Next Page. . .

138Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.5.3.1.1.1 - Continued

VariableMean change in p-value

logged Breslow thickness1

> 10 times 0.046

Don’t know 0.0607

Sunburn causing blisters, 13-19 years

0 times reference

1-5 times 0.0376

6-10 times -0.090 0.61

> 10 times -0.134

Don’t know 0.067

Sunburn causing blisters, > 20 years

0 times reference

1-5 times -0.014

6-10 times 0.048 0.79

> 10 times 0.186

Don’t know 0.467

Table 2.5.3.1.1.1: Univariate analysis between phenotypic variables with Breslow thick-

ness in the WAMHS

Significant associations were observed between Breslow thickness and age at di-

agnosis (P<0.001), age at data collection (P<0.001), Clark’s level (P<0.001) and

the presence of naevi (P<0.001). Older age at diagnosis, older age at data collec-

tion and increasing Clark’s level were associated with increased Breslow thickness,

while the presence of naevi was associated with decreased Breslow thickness. BMI

and smoking status were marginally associated with Breslow thickness, with p-

values of 0.09 and 0.06 respectively. Both BMI and smoking status were marginally

associated with increased Breslow thickness.

1For continuous variables, this is the mean change in logged Breslow thickness per unit of

phenotype. For dichotomous and categorical variables, this is the mean change in logged Breslow

thickness for each level of the variable, in relation to the reference or baseline category.

2.5. Association of Candidate Loci with Breslow thickness 139

2.5.3.1.2 Multivariate analyses

Multivariate modelling suggested significant associations between Breslow thick-

ness and age at diagnosis and the presence of naevi in the WAMHS population

(Table 2.5.3.1.2.1).

Variable Mean change in % change 95% CI for % change p-value

logged Breslow

thickness (SE)

Intercept -0.016 (0.085) - - -

Age at diagnosis 0.007 (0.002) 0.71 (0.25, 1.17) 0.003

Naevi -0.343 (0.090) -29.01 (-15.25, -40.55) 0.0001

Table 2.5.3.1.2.1: Multivariate model of phenotypic variables associated with Bres-

low thickness in the WAMHS

Clark’s level was not considered as a possible predictor for Breslow thickness as

Clark’s level is determined at the same time as the Breslow thickness from the

excised melanoma and therefore is not able to predict melanoma thickness. Age

at diagnosis was associated with an increase in Breslow thickness of 0.71% per

year increase in age of diagnosis. The presence of naevi was associated with a

decrease in Breslow thickness of 29.01% compared to no naevi. The multivariate

phenotypic model accounted for 3.2% of variation in Breslow thickness.

Model diagnostic plots (Figure 2.5.3.1.2.1) show a fairly straight Normal Q-Q

plot, indicating the residuals appear to be Normally distributed. The residuals

also appear to be homoscedastic, independent and without pattern. Therefore the

major assumptions underlying the linear regressions do not appear to be violated.

140Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.5.3.1.2.1: Model diagnostic plots from the multivariate phenotypic model

for Breslow thickness in the WAMHS

2.5.3.2 Genotypic analyses

2.5.3.2.1 Univariate analyses

Univariate analyses revealed that when modelled codominantly, four SNPs were

significantly associated with Breslow thickness (Table 2.5.3.2.1.1). These were

IRF4 rs12203592 (P=0.008), TP53 rs1042522 (P=0.03), BRAF rs1733826 (P=0.05)

and EGF rs6533485 (P=0.04).

2.5. Association of Candidate Loci with Breslow thickness 141

Gene SNP ANOVA p-value

CDC91L1 rs910873 0.32

MYH7B rs1885120 0.80

MC1R rs1805005 0.15

MC1R rs1805007 0.21

MC1R rs3212363 0.34

MC1R rs258322 0.46

MC1R rs3212369 0.48

TYR rs1042602 0.23

TYR rs1393350 0.32

TYRP1 rs1408799 0.31

KITLG rs12896399 0.64

SLC24A4 rs12821256 0.75

IRF4 rs1540771 0.11

TPCN2 rs35264875 0.14

TPCN2 rs3829241 0.94

OCA2 rs1800401 0.11

OCA2 rs1800407 0.51

OCA2 rs7495174 0.51

HERC2 rs12913832 0.37

IRF4 rs12203592 0.008

TP53 rs1042522 0.03

MTAP rs4636294 0.79

MTAP rs10757257 0.62

MTAP rs7023329 0.21

MTAP rs1011970 0.36

PLA2G6 rs2284063 0.41

PLA2G6 rs132985 0.64

BRAF rs1267635 0.69

BRAF rs1733826 0.05

BRAF rs10487888 0.57

BRAF rs17161747 0.82

BRAF rs2365151 0.87

BRAF rs17623382 0.62

BRAF rs4726020 0.37

BRAF rs6944385 0.76

EGF rs7655579 0.93

EGF rs929446 0.74

Continued on Next Page. . .

142Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.5.3.2.1.1 - Continued

Gene SNP ANOVA p-value

EGF rs11568993 0.43

EGF rs882471 0.74

EGF rs4698803 0.86

EGF rs6533485 0.04

EGF rs11569121 0.80

Table 2.5.3.2.1.1: Univariate associations between Breslow thickness and SNPs modelled codom-

inantly in the WAMHS

2.5.3.2.2 Multivariate analyses

The model fitted in Section 2.5.3.1.2 formed the basis of the multivariate analysis

of the 42 genotyped genetic variants. Table 2.5.3.2.2.1 shows the coefficients for

each genotype and likelihood ratio p-values for all the SNPs, adjusted for age at

diagnosis and the presence of naevi. When modelled codominantly, three SNPs

were significantly associated with Breslow thickness at the 5% level of significance,

OCA2 rs1800401 (P=0.04), TP53 rs10425422 (P=0.009) and BRAF rs1733826

(P=0.03). Two SNPs were marginally associated with Breslow thickness, IRF4

rs12203592 (P=0.09) and EGF rs6533485 (P=0.07).

Also presented in Table 2.5.3.2.2.1 is the q-value for the FDR. After adjustment

for multiple testing, no SNPs were significantly associated with Breslow thickness.

Gene SNP Genotypes Mean change in LRT p-value q-value

logged Breslow thickness1

GG

CDC91L1 rs910873 AG 0.049 0.21 0.89

AA 0.408

GG

Continued on Next Page. . .

2.5. Association of Candidate Loci with Breslow thickness 143

Table 2.5.3.2.2.1 - Continued

Gene SNP Genotypes Mean change in LRT p-value q-value

logged Breslow thickness1

MYH7B rs1885120 CG 0.064 0.61 0.89

CC 0.138

GG

MC1R rs1805005 GT -0.004 0.18 0.89

TT 0.432

CC

MC1R rs1805007 CT -0.078 0.24 0.89

TT 0.301

AA

MC1R rs3212363 AT 0.013 0.59 0.89

TT -0.113

CC

MC1R rs258322 CT -0.020 0.62 0.89

TT 0.208

AA

MC1R rs3212369 AG -0.035 0.59 0.89

GG -0.178

CC

TYR rs1042602 AC 0.095 0.29 0.89

AA 0.081

GG

TYR rs1393350 AG -0.023 0.42 0.89

AA -0.124

CC

TYRP1 rs1408799 CT 0.031 0.52 0.89

TT -0.086

TT

SLC24A4 rs12821256 CT -0.049 0.81 0.89

CC -0.021

GG

KITLG rs12896399 GT 0.012 0.74 0.89

TT -0.046

Continued on Next Page. . .

144Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.5.3.2.2.1 - Continued

Gene SNP Genotypes Mean change in LRT p-value q-value

logged Breslow thickness1

AA

IRF4 rs1540771 AG -0.124 0.16 0.89

GG -0.080

AA

TPCN2 rs35264875 AG 0.104 0.27 0.89

GG 0.024

GG

TPCN2 rs3829241 AG 0.002 1.0 1.00

AA -0.007

GG

OCA2 rs1800407 AG 0.038 0.57 0.89

AA 0.262

CC

OCA2 rs1800401 CT -0.211 0.04 0.56

TT

AA

OCA2 rs7495174 AG 0.062 0.51 0.89

GG -0.791

GG

HERC2 rs12913832 AG -0.003 0.30 0.89

AA 0.201

CC

IRF4 rs12203592 CT 0.107 0.09 0.76

TT 0.183

GG

TP53 rs1042522 CG 0.174 0.009 0.38

CC 0.208

AA

MTAP rs4636294 AG -0.050 0.65 0.89

AA 0.007

GG

MTAP rs10757257 AG -0.007 0.52 0.89

Continued on Next Page. . .

2.5. Association of Candidate Loci with Breslow thickness 145

Table 2.5.3.2.2.1 - Continued

Gene SNP Genotypes Mean change in LRT p-value q-value

logged Breslow thickness1

AA 0.088

AA

MTAP rs7023329 AG -0.060 0.32 0.89

GG 0.046

GG

MTAP rs1011970 GT -0.034 0.53 0.89

TT -0.188

AA

PLA2G6 rs2284063 AG 0.027 0.63 0.89

GG -0.066

CC

PLA2G6 rs132985 CT 0.047 0.73 0.89

TT 0.002

GG

BRAF rs1267635 AG 0.036 0.68 0.89

AA 0.186

GG

BRAF rs1733826 AG -0.150 0.03 0.56

AA -0.865

AA

BRAF rs10487888 AG -0.056 0.56 0.89

GG 0.012

GG

BRAF rs17161747 CG 0.044 0.67 0.89

CC

TT

BRAF rs2365151 CT -0.009 0.94 0.99

CC -0.267

TT

BRAF rs17623382 GT 0.005 0.50 0.89

GG -0.317

AA

Continued on Next Page. . .

146Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Table 2.5.3.2.2.1 - Continued

Gene SNP Genotypes Mean change in LRT p-value q-value

logged Breslow thickness1

BRAF rs4726020 AG 0.038 0.36 0.89

GG 0.167

AA

BRAF rs6944385 AT 0.024 0.75 0.89

TT 0.179

AA

EGF rs7655579 AG 0.015 0.90 0.97

GG -0.025

CC

EGF rs929446 CT 0.040 0.68 0.89

TT -0.023

CC

EGF rs11568993 CT 0.127 0.25 0.89

TT -0.206

GG

EGF rs882471 AG 0.040 0.76 0.89

AA -0.009

TT

EGF rs4698803 AT 0.003 0.99 1.00

AA -0.013

GG

EGF rs6533485 CG 0.140 0.07 0.73

CC 0.022

GG

EGF rs11569121 AG -0.037 0.80 0.89

AA -0.184

Table 2.5.3.2.2.1: Multivariate associations between Breslow thickness and SNPs in the WAMHS

modelled codominantly

1This is the mean change in logged Breslow thickness for each genotype level, compared to

the baseline genotype.

2.5. Association of Candidate Loci with Breslow thickness 147

Analysis of the genotype coefficients determined that two of the significantly as-

sociated SNPs (rs6533485 and rs1800401) were best modelled codominantly (see

Table 2.5.3.2.2.2). The association between Breslow thickness and rs6533485 was

marginally significant (P=0.07), however only the heterozygote was significantly

associated with Breslow thickness (P=0.04).

The rs1800401 SNP did not have any rare homozygotes, however when compared

to the common homozygote, the heterozygote was significantly associated with

decreased Breslow thickness (P=0.04). This SNP accounted for 0.70% of vari-

ation in Breslow thickness. Model diagnostic plots (Figure 2.5.3.2.2.1) indicate

that the residuals appear Normally distributed, and also appear independent and

with constant variance.

SNP Genotype Mean change in % change 95% CI for % change p-value

logged Breslow

thickness1 (SE)

GG - - - -

rs6533485 CG 0.140 (0.068) 15.04 (0.72, 31.39) 0.04

CC 0.022 (0.082) 2.22 (-12.90, 19.97) 0.79

rs1800401 CC - - - -

CT -0.211 (0.100) -19.00 (-33.47, -1.37) 0.04

Table 2.5.3.2.2.2: Association between codominant SNPs and Breslow thickness in

the WAMHS, adjusted for age at diagnosis and presence of naevi

Two SNPs were best modelled additively, BRAF rs1733826 and IRF4 rs12203592,

and these results are presented in Table 2.5.3.2.2.3. Each copy of the minor allele

1This is the mean change in logged Breslow thickness for each genotype level, compared to

the baseline genotype.

148Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.5.3.2.2.1: Model diagnostic plots for the association between OCA2

rs1800401 and Breslow thickness in the WAMHS, adjusted for age at diagnosis

and presence of naevi

of the rs1733826 SNP decreased Breslow thickness, on average, by 18.06% (95%

CI = -2.70, -30.99). This SNP accounted for 0.44% of the variation in Breslow

thickness.

Conversely, each variant allele of the IRF4 rs12203592 SNP increased Breslow

thickness, on average, by 10.22% (95% CI = 1.08, 20.18). This SNP accounted

for 0.58% of variation in Breslow thickness.

2.5. Association of Candidate Loci with Breslow thickness 149

Model diagnostics for the two additive SNPs are presented in Figures 2.5.3.2.2.2

and 2.5.3.2.2.3. The residuals appear to be Normally distributed and do not ap-

pear to be violating any of the major assumptions underlying linear regression.

SNP Allele Mean change in % change 95% CI for % change p-value

logged Breslow

thickness1 (SE)

rs1733826 G - - - -

A -0.199 (0.088) -18.06 (-2.70, -30.99) 0.02

rs12203592 C - - -

T 0.097 (0.044) 10.22 (1.08, 20.18) 0.03

Table 2.5.3.2.2.3: Association between additive SNPs and Breslow thickness in the

WAMHS, adjusted for age at diagnosis and presence of naevi

One SNP was best modelled dominantly - TP53 rs1042522 - and was subject to a

significant interaction with the presence of naevi (Table 2.5.3.2.2.4). Inclusion of

the SNP and the SNP:(naevi count) interaction accounted for 1.87% of the varia-

tion in Breslow thickness. When the SNP:(naevi count) interaction was included

in the model, the presence of naevi was no longer significantly associated with

Breslow thickness (P=0.10). However, the SNP and the SNP:(naevi count) inter-

action were both significantly associated with Breslow thickness, with p-values of

0.0004 and 0.007, respectively. The interaction between TP53 rs1042522 and the

presence of naevi can be seen graphically in Figure 2.5.3.2.2.4.

Compared to a baseline model of no naevi and no variant alleles, an individual of

average age with no naevi and one of the CG or GG genotypes had, on average,

1This is the mean change in logged Breslow thickness for each variant allele.

150Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.5.3.2.2.2: Model diagnostic plots for the association between BRAF

rs1733826 and Breslow thickness in the WAMHS, adjusted for age at diagnosis

and presence of naevi

a melanoma 85% thicker. An individual with naevi and the GG genotype had

a melanoma 17% thinner, and an individual with both naevi and the GG geno-

type had a melanoma approximately 6% thicker. Therefore, there is evidence to

suggest that the effect of the TP53 rs1042522 SNP on Breslow thickness differs

according to whether an individual has naevi.

1For naevi, this is the mean change in logged Breslow thickness for individuals with naevi,

compared to individuals with no naevi. For rs1042522, this is the mean change in logged Breslow

thickness for CG+CC, compared to the baseline genotype. For rs1042522:Naevi, this is the mean

change in logged Breslow thickness for individuals with naevi and CG+CC, compared to the

2.5. Association of Candidate Loci with Breslow thickness 151

Figure 2.5.3.2.2.3: Model diagnostic plots for the association between IRF4

rs12203592 and Breslow thickness in the WAMHS, adjusted for age at diagnosis

and presence of naevi

SNP Genotype Mean change in % change 95% CI for % change p-value

logged Breslow

thickness1 (SE)

Naevi - -0.181(0.111) -16.59 (-32.93, 3.74) 0.10

rs1042522 GG - - - -

CG + CC 0.617 (0.174) 85.41 (31.88, 160.69) 0.0004

rs1042522:Naevi GG - - - -

rs1042522:Naevi CG+CC -0.495 (0.184) -39.04 (-57.54, -12.48) 0.007

Table 2.5.3.2.2.4: Association between rs1042522, modelled dominantly, and Bres-

low thickness in the WAMHS, adjusted for age at diagnosis and presence of naevi

152Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.5.3.2.2.4: Interaction between rs1042522 modelled dominantly and the

presence or absence of naevi in the WAMHS. CG+CC represents one or two copies

of the variant allele. The combination GG and no naevi forms the baseline.

Model diagnostics for the the rs1042522 model are presented in Figure 2.5.3.2.2.5.

The residuals appear to be Normally distributed and do not appear to be violating

any of the major assumptions underlying linear regression.

Modelling of Breslow thickness, with the four significant SNPS (from four genes)

and the rs1042522:(naevi count) interaction, adjusted for age at diagnosis and

the presence of naevi, found that all SNPs remained significantly associated with

Breslow thickness (P<0.05). This indicated that, before adjustment for multiple

baseline genotype and no naevi.

2.5. Association of Candidate Loci with Breslow thickness 153

Figure 2.5.3.2.2.5: Model diagnostic plots for the association between rs1052522

and Breslow thickness in the WAMHS, adjusted for age at diagnosis and presence

of naevi

testing, each SNP was an independent predictor of Breslow thickness. This final

model, including age at diagnosis, the presence of naevi and the four significant

SNPs, accounted for 6.8% of variation in Breslow thickness. However, after ad-

justment for multiple testing, these SNP associations were not significant.

154Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.5.3.3 Statistical power

2.5.3.3.1 Genetic main effects

Post-hoc statistical power analyses for an additive model with n=800 and alpha

level of 0.0012 demonstrated sufficient power (>0.80) to detect a 0.3SD change or

greater in Breslow thickness for most MAF (Figure 2.5.3.3.1.1). However, changes

in Breslow thickness of 0.1SD are not likely to have been detected by the current

study (maximum power for 0.1SD = 0.11).

Under a dominant model with n=800, there was sufficient power to detect a 0.3SD

change or greater in Breslow thickness for most MAF (Figure 2.5.3.3.1.2). How-

ever, changes in Breslow thickness of less than 0.2SD are not likely to have been

detected by the current study.

Under a recessive model with n=800, there was sufficient power to detect a 0.5SD

change in Breslow thickness for MAF > 0.30. (Figure 2.5.3.3.1.3). However,

changes in Breslow thickness of less than 0.5SD would likely not be detected.

2.5.3.3.2 Gene-environment interactions

Post–hoc statistical power analyses for an additive model with n=800, alpha level

of 0.0006, and environment exposure prevalence of 0.2 demonstrated sufficient

power (>0.80) to detect a 0.5SD change or greater in Breslow thickness for MAF

0.3 and greater (Figure 2.5.3.3.2.1). However, other combinations of MAF and

changes in Breslow thickness are not likely to have been detected by the current

2.5. Association of Candidate Loci with Breslow thickness 155

Figure 2.5.3.3.1.1: Estimated main effects statistical power under an additive model

for n=800, under varying MAF and SD

study.

For an additive model with n=800, alpha level of 0.0006, and environment expo-

sure prevalence of 0.8, there was sufficient power (>0.80) to detect a 0.5SD change

or greater in Breslow thickness for MAF 0.2 and greater (Figure 2.5.3.3.2.2). How-

ever, other combinations of MAF and changes in Breslow thickness are not likely

156Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.5.3.3.1.2: Estimated main effects statistical power under a dominant model

for n=800, under varying MAF and SD

to have been detected by the current study.

2.5.3.4 Summary of results

Four SNPs (from four different genes - IRF4, TP53, OCA2 and BRAF) were as-

sociated with Breslow thickness when modelled multivariately (Table 2.5.3.4.1).

2.5. Association of Candidate Loci with Breslow thickness 157

Figure 2.5.3.3.1.3: Estimated main effects statistical power under a recessive model

for n=800, under varying MAF and SD

The final multivariate model with all four SNPs accounted for 7.3% of variation

in Breslow thickness. Of this explained variance, 3.6% was due to these SNPs

and interactions, and 3.2% of the variance was due to age at diagnosis, and the

presence of naevi.

One SNP (OCA2 rs1800401) was modelled codominantly, and the heterozygote

158Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

Figure 2.5.3.3.2.1: Estimated interaction effects statistical power under an addi-

tive model for n=800, under varying MAF and SD for environmental exposure

prevalence of 0.2

genotype was associated with a decrease in Breslow thickness. Two SNPs were

modelled additively, with the variant allele associated with a thinner melanoma for

BRAF rs1733826, and a thicker melanoma for IRF4 rs12203592. The remaining

SNP, TP53 rs1042522, was modelled dominantly, with the variant allele associ-

ated with a thicker melanoma, which was moderated by the presence of naevi.

2.5. Association of Candidate Loci with Breslow thickness 159

Figure 2.5.3.3.2.2: Estimated interaction effects statistical power under an addi-

tive model for n=800, under varying MAF and SD for environmental exposure

prevalence of 0.8

However, after adjusting for multiple testing using the FDR, none of these asso-

ciations were significant at the 5% significance level.

The rs6533485 SNP was significant (without adjustment for multiple testing) when

modelled univariately, however it was not significant when adjusted for age at di-

160Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

agnosis and naevi, and is therefore not discussed further.

2.5.A

ssociation

ofC

andidateL

oci

with

Breslow

thickness161

SNP Model Genotype Mean change in % change 95% CI for % change p-value R2

logged Breslow

thickness (SE)

GG - - - - -

rs6533485 Codominant CG 0.140 (0.068) 15.04 (0.72,31.39) 0.04 0.04

CC 0.022 (0.082) 2.22 (-12.90,19.97) 0.79 -

rs1800401 Codominant CC - - - - -

CT -0.211 (0.100) -19.00 (-33.47,-1.37) 0.04 0.70

rs1733826 Additive GG - - - - -

AG+AA -0.199 (0.088) -18.06 (-2.70, -30.99) 0.02 0.44

rs12203592 Additive CC - - - -

CT+TT 0.097 (0.044) 10.22 (1.08, 20.18) 0.03 0.58

rs1042522 Dominant GG - - - - -

CG + CC 0.617 (0.174) 85.41 (31.88, 160.69) 0.0004 -

rs1042522:Naevi GG - - - - 1.87

rs1042522:Naevi CG+CC -0.495 (0.184) -39.04 (-57.54, -12.48) 0.007 -

Table 2.5.3.4.1: Summary of associations between significant SNPs and Breslow thickness in the WAMHS, adjusted for age

at diagnosis and presence of naevi

162Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.5.4 Discussion

2.5.4.1 Introduction

The aim of the analyses presented in this section was to investigate associations

between 42 identified melanoma-risk candidate SNPs and melanoma prognosis

in the WAMHS sample. In this study, four loci were significantly associated

with melanoma prognosis, however these associations become non-significant after

adjustment for multiple testing. These associations and their implications are

discussed further in Section 2.5.4.3 (please see page 171).

2.5.4.2 Summary of population characteristics

The WAMHS sub-sample used in this analysis were 800 Caucasian individuals who

were diagnosed with an invasive melanoma between January 2006 and September

2009. To be eligible for inclusion in this analysis, subjects were required to have a

Breslow thickness recorded, as this was the outcome variable. Breslow thickness

and associations between phenotypic variables and Breslow thickness are discussed

in this section.

2.5.4.2.1 Breslow thickness

Breslow thickness was the outcome variable in these analyses and was used as a

surrogate for melanoma prognosis. There is continued debate over whether Bres-

low thickness is a measure of the time from melanoma development to examination

and diagnosis by a physician (‘diagnosis delay’), a measure of the biological ag-

gressivity of the tumour, or a combination of both [182–188].

2.5. Association of Candidate Loci with Breslow thickness 163

The relationship between Breslow thickness, diagnosis delay and aggressiveness

of the tumour is complex. It is difficult to accumulate sufficient evidence to re-

ject the hypotheses that Breslow thickness is not a proxy for either the biological

aggressiveness of the melanoma tumour, or diagnosis delay, as prospective stud-

ies to evaluate the natural history of a melanoma tumour from its onset raises

both practical and ethical issues. Studies investigating these relationships must

rely on patients to recall when they first noticed the new lesion or the change in

a pre-existing lesion. These studies also assume that when the melanoma first

developed, there was some noticeable change to the skin that was recognised by

the patient. However, for melanomas arising on areas of the body that are not

frequently viewed, such as the back, the tumour may remain undetected for some

time.

Within these framework limitations, several studies have attempted to investigate

the relationship between tumour thickness, delay in diagnosis and the aggres-

siveness of the tumour. However, these studies have failed to identify a clear

association between thicker melanomas and a longer delay of diagnosis [182–188].

The largest of these studies, by Richard et al. [187], investigated the correlation

between Breslow thickness and diagnosis delay in a sample of 590 French indi-

viduals (57.6% female, mean age of 51.2 years, and median Breslow thickness of

1.13 mm). Richard et al. found some evidence for a positive correlation (r=0.17,

P=0.009) between Breslow thickness and the time the subject noticed the lesion

to the time the subject became suspicious of the lesion. However, this association

depended greatly on the thickness of the melanoma. For tumours less than 3mm

thick, thicker melanoma tumours were associated with an increase in diagnosis

164Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

delay, while for tumours greater than 3mm thick, thicker melanoma tumours were

associated with a decrease in diagnosis delay. Additionally, a negative correlation

was observed between the time the subject became suspicious of the lesion and

the first examination of the lesion by a physician, and Breslow thickness (r=-0.20,

p<0.001). These results suggest that Breslow thickness may be a result of both

diagnosis delay, and also the aggressiveness of the tumour.

Furthermore, Richard et al. suggested that in populations uninformed about

melanoma and the importance of skin examinations, diagnosis delay is possibly

the main factor responsible for tumour thickness, while in informed populations,

the biological behaviour of the tumour may be the major factor affecting tumour

thickness. Australia has had a large number of public education campaigns, par-

ticularly over the past 25 years [189]. These numerous campaigns, primarily run

by the Cancer Council of Australia, would suggest that Australia is an informed

population. As such, in Australian communities of informed individuals, Breslow

thickness is more likely to be a measure of the agressiveness of the melanoma

tumour.

The most recent study by Lui et al [188] investigated tumour rate of growth (es-

timated by Breslow thickness divided by delay in diagnosis) in a sample of 404

Australian individuals (45.0% female, mean age of 54.2 years, and median Bres-

low thickness of 1.14 mm). Liu et al. found that the rate of growth was highly

associated with thicker melanomas, providing evidence to suggest that thicker

melanomas are thicker as they have grown faster, not because there has been a

delay in diagnosis.

2.5. Association of Candidate Loci with Breslow thickness 165

If Breslow thickness is solely a measure of diagnosis delay, then the observed as-

sociations in this study between Breslow thickness, age at diagnosis, naevi and

several SNPs would represent associations between these variables and diagnosis

delay. However, if Breslow thickness is a measure of the aggressiveness of the

tumour, then these observed associations would represent associations between

these variables and melanoma prognosis. If the latter is true, then this study will

allow the identification of individuals who are at higher risk of melanoma with a

poorer prognosis.

Without the use of prospective, longitudinal cohort studies, it is difficult to assess

the relationship between Breslow thickness and the aggressiveness of the tumour

with any certainty. However, there is some evidence to suggest that Breslow thick-

ness is a sensible measure for melanoma growth and prognosis, and is largely de-

pendent on the aggressiveness of the tumour. Therefore, the identification of phe-

notypic variables and genetic variants that are associated with Breslow thickness

is important as it may help to identify individuals at risk of poor melanoma prog-

nosis, and may also elucidate the biological mechanisms underpinning melanoma

growth.

In the current study, Breslow thickness was analysed as a continuous variable. In

previous studies, convention has been to categorise Breslow thickness using cut-

points which represent staging classifications [153]. A review of melanoma studies

by Vollmer et al. [190] found that in these studies, Breslow thickness was cate-

gorised with as few as one cutpoint, and as many as seven. However, it is unclear

if natural cutpoints exist and if so, which cutpoints are optimal for modelling

Breslow thickness. Analyses by Vollmer et al. and Stadelmann et al. [191] found

166Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

that using Breslow thickness as a continuous variable more clearly modelled sur-

vival, than using various different cutpoints to create arbitrary categories.

In addition, the use of continuous outcomes in genetic association studies is of-

ten more powerful and informative than categorising outcomes [192]. However,

Breslow thickness is still often categorised [112, 129, 158, 193, 194]. This may be

due to various reasons, such as the ability to easily match thickness categories to

treatment. It may also be categorised due to the high positive skewness of Breslow

thickness measurements, with a large number of thin melanomas (< 1mm). In this

thesis, the natural log transformation sufficiently transformed Breslow thickness,

and modelling Breslow thickness this way did not appear to violate any linear

regression assumptions.

Median Breslow thickness in this study was 0.65. The comparison of this median

thickness with median thicknesses from other studies is difficult due to the conven-

tion of reporting Breslow thickness as a categorical variable. In addition, median

Breslow thickness has decreased over the past few decades due to the increase

in diagnosis of thin melanomas [195], which further compounds the difficultly in

comparing Breslow thickness in this study with earlier studies. However, the few

recent studies which have published median Breslow thickness found similar me-

dians of 0.55 mm [112], 0.6 mm [196] and 0.56 mm [112]. Therefore, the median

Breslow thickness in this study appears comparable to other studies.

Furthermore, analysis of Breslow thickness between the eligible population and

the WAMHS sub-sample from Section 2.3.4 found that median Breslow thickness

2.5. Association of Candidate Loci with Breslow thickness 167

in the sample did not differ significantly from the median Breslow thickness of all

adult individuals diagnosed with melanoma in Western Australia from January

2006 and September 2009.

Therefore, the distribution of Breslow thickness in this study appears compa-

rable to Breslow thickness described in Australian and international studies of

melanoma cases. This indicates that the associations observed in this study be-

tween Breslow thickness and age at diagnosis, the presence of naevi and several

genetic variants may be generalisable back to melanoma cases in Caucasian indi-

viduals world-wide.

2.5.4.2.2 Sex

No association was observed between Breslow thickness and sex in this study.

Median Breslow thickness for males was 0.63, compared to a median thickness

for females of 0.65, however this association was not statistically significant. This

thicker Breslow thickness for females contradicts other studies which have found

males tend to have thicker melanomas [150, 152, 154, 155]. Further analysis (not

shown) found that for the eligible population, the median Breslow thickness was

0.67 mm (range of 0.04 mm to 35.00 mm) for males and 0.60 mm (range of 0.10

mm to 23.00 mm) for females. Therefore, it appears that the sub-sample used

in this analysis was not representative of the eligible population, and included

a greater proportion of women with thicker melanomas, when compared to the

eligible population. However, it is unlikely that this had a great effect on the

analysis as sex was not significantly associated with Breslow thickness, and no

sex interactions with either age at diagnosis, the presence of naevi, or any genetic

variants were identified.

168Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.5.4.2.3 Age at diagnosis

Age at diagnosis was significantly associated with Breslow thickness. In this study,

each year of age resulted in an average increase in Breslow thickness of 0.71%,

when adjusted for the presence of naevi. Breslow thickness has been associated

with age at diagnosis in previous studies [149–153], in particular, with older indi-

viduals being diagnosed with thicker melanomas. One reason for this, suggested

by Hanrahan et al. [197], is that older individuals are less likely to notice changes

in melanoma, and are therefore less likely to have their tumours detected and

subsequently diagnosed.

However, as previously described, evidence suggests that melanoma thickness is

not solely related to diagnosis delay, and therefore this argument does not appear

to be wholly valid. In their study of rate of growth of melanoma, Liu et al. [188]

found that individuals aged over 70 years had higher rates of melanoma growth,

when compared to individuals aged under 70 years. As rate of growth is believed

to be an independent risk factor for Breslow thickness, it is therefore possible that

older individuals would have thicker melanomas. However, the evidence is not

clear and older individuals being diagnosed with thicker melanomas may be due

to a combination of these higher rates of tumour growth and a reduced ability for

older individuals to detect changes in melanoma.

2.5.4.2.4 Naevi

The presence of multiple naevi is the principle risk-factor for melanoma. In this

study, 75% of subjects considered their bodies to have ‘Few’ or ‘Some’ naevi, 14%

had ‘Many’ naevi and only 11% had ‘None’. The median number of naevi on the

2.5. Association of Candidate Loci with Breslow thickness 169

upper back was four, while the median number of naevi on the lower back was five.

The number of naevi on either section of the back was not significantly associated

with Breslow thickness. The four categories of naevi counts, ranging from ‘Few’

to ‘Many’ were also not significantly associated with Breslow thickness. This vari-

able was then transformed into a binary trait - the presence of naevi - which was

recorded as ‘No’ or ‘Yes’, and this new variable was significantly associated with

Breslow thickness.

In this study, having no naevi resulted in an increase of Breslow thickness of 29%,

when compared to having naevi, adjusted for age at diagnosis. While this supports

the hypothesis that melanoma risk factors will also be associated with melanoma

prognosis, the direction of the association is contrary to what may be expected.

That is, it seems more plausible that thicker melanomas would be associated with

more naevi, not that thicker melanomas are associated with fewer naevi.

In their study of Breslow thickness, Liu et al. [188] found a significant association

between the rate of growth of melanoma, and naevi. In particular, individuals

with fewer naevi had a faster melanoma growth rate and individuals with more

naevi had a slower melanoma growth rate.

One reason for the inverse relationship between naevi count and melanoma growth

rate may be that thicker melanomas tend to be the nodular type [198] and these

are associated with fewer naevi. In this study, the types of melanoma were not

available directly from the WACR as they are not uniformly recorded on pathology

forms, however it may be possible to obtain this information. If it was possible

170Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

to adjust for the type of melanoma, then the association between fewer naevi and

thicker melanomas may no longer be observed.

Another reason for this inverse relationship may be that individuals with many

naevi may be more likely to visit their doctor and be screened for melanoma. This

may result in individuals with naevi having thinner melanomas diagnosed, which

would result in naevi appearing to be associated with thinner melanomas.

The number of naevi on the back was not associated with melanoma thickness,

while the presence of naevi was associated with melanoma thickness. This discrep-

ancy may be explained by the way naevi were reported. In this study, subjects or

a family member of the subject were asked to count the number of naevi on two

sections of the back. The subject was sent an information sheet explaining how

to identify naevi, however anecdotal evidence suggested that WAMHS subjects

found it difficult to identify naevi. This may be as the back is a difficult area to

examine, and there may also be a large amount of freckling which may be confused

with naevi. Therefore, the naevi counts on the back may not be wholly reliable.

However, when the subject examined four different diagrams representing degrees

of naevi on the whole body, ranging from ‘None’ to ‘Many’, their responses may

have been more accurate and less likely to be subjected to biases. In particular,

it may have been easier for an individual to identify themselves as having ‘No’

naevi, while the categories ‘Few’, ‘Some’ or ‘Many’, may have been more difficult

to choose between.

2.5. Association of Candidate Loci with Breslow thickness 171

The only way to determine if the naevi counts were accurate would be for a

trained individual to count the number of naevi on subjects’ backs for a subset

of the sample. The concordance between the counts performed by an expert and

those performed by the subject could be measured to identify how accurately naevi

were counted. As part of the scar examination which some subjects underwent

as part of the WAMHS, each subject’s back was photographed. Therefore, it is

possible to estimate the concordance between self-reported naevi counts and those

identified by a trained individual. This was beyond the scope of the present study,

however it would make for interesting further research.

2.5.4.3 Summary of genetic association analyses

2.5.4.3.1 Review of genetic variants studied

Forty two genetic polymorphisms in 16 candidate genes and their associations with

Breslow thickness were analysed in the current study. Twenty three of these SNPs

were chosen as they had been identified as melanoma-risk SNPs in earlier GWAS

and candidate-gene studies, or were associated with melanoma-risk factors, such

as light pigmentation and naevi. The remaining nineteen SNPs were tag-SNPs

derived from HapMap [175] in the EGF, BRAF and MC1R genes.

2.5.4.3.2 Details of associations

After adjustment for age at diagnosis and naevi, four SNPs provided some evidence

for association with Breslow thickness. These SNPs were rs1800401 in the OCA2

gene, rs1042522 in the TP53 gene, rs12203592 in the IRF4 gene and rs1733826 in

the BRAF gene. After adjustment for multiple testing, none of these associations

remained significant. Therefore, it is possible that the associations detected were

172Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

false positives. However under different conditions, such as a larger sample size,

these associations may have remained significant after multiple testing.

Prior to this study, associations between these SNPs and Breslow thickness had

not been investigated. Therefore, the results of the current association study are

novel and warrant further investigations, including replication of these associations

in a larger sample.

2.5.4.3.2.1 OCA2

The rs1800401 SNP was modelled codominantly, with the heterozygote associ-

ated with a 19% thinner melanoma, compared to the major homozygote. The

rs1800401 SNP is located in the OCA2 (oculocutaneous albinism) gene on chro-

mosome 15q11. Mutations in OCA2 have been associated with albinism [199,200]

and pigmentation [169,201,202]. In particular, the ‘A’ allele of the rs1800401 SNP

was associated with greater odds of having brown or black eyes [169], or reduced

odds of having lighter eye colours. Lighter eyes are thought to be a melanoma-risk

factor, in particular, blue eyes have been identified as a melanoma risk factor [89],

therefore the ‘A’ allele would likely be associated with reduced melanoma risk. In

this thesis, the complementary DNA strand was analysed (‘T’ allele), and this ‘T’

allele was associated with thinner melanoma. Therefore the risk allele (‘T’ or ‘A’)

is associated with lower melanoma risk, and also thinner melanomas.

It is interesting to note that no subjects in this thesis were rare homozygotes for

this SNP. This was also observed in a study by Duffy et al. [203] of Australian

twin families without melanoma, where 1,268 were common homozygous, 136 were

2.5. Association of Candidate Loci with Breslow thickness 173

heterozygous, and none were rare homozygous.

2.5.4.3.2.2 TP53

The rs1042522 SNP in the TP53 gene was modelled dominantly, with one or two

copies of the variant C allele associated with thicker melanoma. This relationship

was moderated by the presence of naevi, with the ‘C’ variant associated with 85%

thicker melanomas if no naevi, and 6% thicker melanomas with some naevi, when

compared to having no variant alleles and no naevi.

The TP53 gene is found on chromosome 17p31.1, and encodes the tumour sup-

pressor protein p53. P53 plays a critical role in preventing cancer development, in

particular by preventing tumour cell growth [204]. Mutations in TP53 have been

associated with a wide spectrum of cancers, including Li-Fraumeni syndrome [205],

breast cancer, and leukemia, and is also associated with early onset cancer [206].

The most common polymorphism is rs1042522, and this has been found to be

associated with melanoma prognosis in several studies [172, 207–209]. However,

these associations have been contradictory, with two studies identifying the ‘C’ al-

lele as the risk allele [207,208], and the other two studies identifying the ‘A’ allele

as the risk allele [172, 209]. Further replication is therefore required to reconcile

these findings.

Few investigations have studied the relationship between naevi and TP53, however

there is some evidence of somatic mutations of TP53 in naevi [210]. In addition,

a linkage study of naevi by Zhu et al. [211] identified a region of chromosome 17

174Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

which was linked with naevi, which may have been due to TP53.

The association between Breslow thickness and the rs1042522 SNP is unclear, and

further research is required to unravel the relationship between Breslow thickness,

naevi and TP53.

2.5.4.3.2.3 IRF4

The rs12203592 SNP in the IRF4 gene was modelled additively, with each ‘T’

variant allele associated with a 10% thicker melanoma. The ‘T’ variant of the

rs12203592 SNP has been associated with many pigmentation factors, such as

blonde hair, light skin colour, light eye colour and an inability to tan [124], which

are known melanoma-risk factors. Therefore, it appears that the ‘T’ variant is

associated with both melanoma risk and also thicker melanomas.

The IRF4 (Interferon Regulatory Factor 4) gene is located on chromosome 6p25.3,

and encodes the IRF4 protein. This protein regulates the transcription of inter-

feron proteins which are made and released in response to pathogens, including

tumour cells [124].

The IRF4 gene was first shown to be associated with pigmentation factors in a

2007 GWAS by Sulem et al. [109]. Subsequent to this GWAS, a second GWAS

of pigmentation in 2008 [124] found the rs12203592 SNP on the IRF4 gene to be

associated with pigmentation factors and skin tanning response to sunlight. In

their paper, Han et al. [124] reported that this association had been replicated

in several studies. IRF4 encodes the B-cell protein which has been proposed as

2.5. Association of Candidate Loci with Breslow thickness 175

a marker for both primary and metastatic melanoma and benign naevi [212].

However, the role of IRF4 in pigmentation and therefore melanoma risk, is still

generally not understood.

2.5.4.3.2.4 BRAF

The rs1733826 SNP in the BRAF gene was associated with a decrease in Breslow

thickness, in particular, each copy of the ‘A’ allele decreased Breslow thickness by

18%.

The BRAF gene is located on chromosome 7q34 and encodes the B-RAF protein.

This protein is associated with serine/threonine kinase activities, and mediates

signals involving cell growth, transformation and differentiation [272]. Somatic

mutations in BRAF have been identified in many cancers, including melanoma,

lung cancer, and colorectal cancer [213]. Associations between germline mutations

in BRAF have been detected [130,214], however other studies have not found any

association [215,216], and therefore the role of BRAF as a melanoma-susceptibility

gene is still largely unclear.

No associations have been detected between somatic mutations in BRAF and

Breslow thickness [217]. However, to the author’s knowledge, no associations

between germline mutations and Breslow thickness have been investigated before

this study.

176Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

2.5.4.4 Causality

The set of criteria described by Hill and explained earlier in Section 1.1.1 can

be used to investigate if the associations observed in this thesis may be causal.

These criteria include strength, consistency and temporality. The strength of the

associations between genetic variants and Breslow thickness were not strong, with

each variant only associated with a small change in Breslow thickness.

Consistency of these associations is difficult to determine as the genetic vari-

ants identified in this study as associated with Breslow thickness have not been

replicated in other studies to date. Temporality is easier to identify in genetic

association studies, as genotypes are assigned to offspring from parental germline

DNA at conception, and therefore the genotype must predate the development of

melanoma.

The observed associations do not appear to violate Hill’s criteria. However, due

to the millions of common SNPs in the human genome, it is unlikely that any of

the variants found to be associated with Breslow thickness are themselves causal

variants. If the observed associations are true, it is more likely that these variants

are simply markers in LD with nearby causal variants.

2.5.4.5 Potential limitations

As with all genetic association studies, this study has potential limitations, in-

cluding the use of self-reported variables, low statistical power, and the possibility

of population stratification.

2.5. Association of Candidate Loci with Breslow thickness 177

All variables collected as part of the WAMHS questionnaire were self-reported.

This included relatively easy to measure data, such as height and weight, and also

more difficult to recall measures such as sun exposure. The use of self-reported

data has been extensively studied [169,218–222]. Sun exposure is notoriously dif-

ficult to measure, and there is debate surrounding the reliability and validity of

sun exposure recollection [70,219,221]. It is possible that sun exposure, sometimes

from decades earlier, cannot be reliably recalled. In this study, we attempted to

minimise recall bias due to sun exposure measures by using events such as blis-

tering sunburn, which may be recalled more easily than total time spent in the

sun. One way to reduce recall bias would be to conduct a prospective study of a

sample from the general population, where a group of individuals is followed, and

their sun exposures are measured. This would negate the need for individuals to

recall their sun exposure from years earlier. However, as melanoma is a relatively

rare disease, thousands of individuals would need to be followed-up in order to

obtain an adequate sample size of melanoma cases.

Many other variables were also self-reported, including height, weight, naevi

counts and pigmentation traits. It is possible that all of these were not mea-

sured reliably. Measurement error could be reduced if trained individuals were

employed by the study to measure these traits, however this would likely be more

costly, and time-consuming for the study.

The measure of freckling used in this thesis was taken from an estimate of freckling

on the face. Freckling on the face has been shown to be highly correlated with

freckling on other anatomic sites, including the arms and back [280]. Therefore,

freckling on the face, which is easy to measure, was used as a proxy for freckling

178Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

on other anatomic sites.

Ethnicity collected as part of the WAMHS was also self-reported. Subjects were

asked their ethnicity and their parents’ ethnicity. If either of these was non-

Caucasian, the subject was not considered Caucasian and was not included in

the analyses for this thesis. In a study of self-reported ethnicity, Mez et al. [223]

found self-reported ethnicity was a reasonable proxy for genetic ancestry [223].

In addition, any bias due to population stratification is likely to be small [224],

particularly for well conducted and analysed studies. However, there is a possibil-

ity that self-reported ethnicity, along with other self-reported variables were not

reliably measured.

The inclusion of only Caucasian individuals in the study also means that the

genetic association results are only relevant to Caucasian populations. Further

replication in other ethnic groups is required in order to generalise these results

back to different ethnic groups.

Another limitation of this study is the relatively small sample size, resulting in

modest statistical power. At this start of this thesis, it was estimated that with

1,000 melanoma cases diagnosed in Western Australia each year, approximately

2,000 individuals would be recruited and available for use in this thesis. However,

due to a number of issues with data collection, recruitment took longer than

expected and therefore only 800 individuals were able to be included in this thesis

which will have reduced statistical power substantially. Power analyses showed

that for an additive genetic model, for SNPs with a MAF greater than 0.2, this

2.5. Association of Candidate Loci with Breslow thickness 179

sample would only be sufficiently powered to detect a change in Breslow thickness

of more than 0.2SD (1.3% of variance). Therefore, the study was unable to detect

small genetic effects. Replication is therefore required in larger sample sizes which

would be more adequately powered.

2.5.5 Summary

In this section, possible associations between Breslow thickness and phenotypic

variables and genotypic variants were investigated. Two phenotypic variables

were significantly associated with Breslow thickness. These were age at diagnosis,

where older age of diagnosis was associated with thicker melanomas, and the pres-

ence of naevi, which was associated with thinner melanomas. The use of Breslow

thickness as a proxy for progression of the tumour means that these factors would

likely also be associated with melanomas which tend to grow faster.

In addition, four SNPs were found to be associated with Breslow thickness. Fur-

ther replication of these results is necessary, as after adjustment for multiple test-

ing, none of these associations remained significant. However, this study has

provided some evidence for the role of four genes in melanoma prognosis, which

may lead to an increased understanding of the mechanisms by which melanomas

grow. The results of this study will also assist with identifying individuals who

are at a higher risk of thicker melanomas, and therefore, poorer prognosis.

In total, age at diagnosis, the presence of naevi, and the four SNPs accounted

for 6.8% of variation in Breslow thickness. This means there is a large amount

of variation which is as yet unaccounted for. This is likely to include many other

180Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

SNPs, and also environment and host factors. Further studies of other genes,

and collection of more phenotypic data in larger samples may assist in elucidating

the role of these phenotypic and genetic variables in melanoma development and

prognosis.

2.6 Chapter Summary

Melanoma is a significant public health issue in Australia, particularly in Western

Australia. In Western Australia, more than 1,000 individuals are diagnosed with

melanoma each year, accounting for approximately 10% of all cancer diagnoses.

Invasive melanoma is an aggressive form of skin cancer, and it is expected that

Australian incidence and mortality rates from this disease will continue to rise [51].

The main environmental risk factor for melanoma is sun exposure, and the main

host risk factor is the presence of naevi, however the association of other risk-

factors with melanoma risk is uncertain and many associations have not been

consistently replicated. Several genetic variants which contribute to melanoma

risk have been identified, however, these account for only a small proportion of

melanoma diagnoses. In addition, the role of specific genetic and environmental

factors in melanoma prognosis are largely unknown. Therefore, the collection of

genetic epidemiological data from melanoma cases is required, along with further

investigations in order to elucidate the role of environmental, host and genetic

risk factors in melanoma-susceptibility and prognosis.

This chapter described the establishment of the WAMHS, and the use of these

data to conduct genetic epidemiological associations into melanoma susceptibility

2.6. Chapter Summary 181

and prognosis.

The WAMHS is a population-based case-collection and biospecimen resource com-

prised of consenting adults diagnosed with melanoma in Western Australia be-

tween January 2006 and September 2009. WAMHS subjects completed a ques-

tionnaire and also gave a blood sample for DNA, serum, and an additional blood

sample for RNA. The final sample consisted of 1,643 participants, with 1,157 hav-

ing complete questionnaire data and blood samples. This is the largest population-

based study of melanoma cases in Western Australia, and one of the largest in

the world. As such, the WAMHS is an ideal resource for genetic epidemiological

investigations into melanoma susceptibility and prognosis.

As part of this PhD, 42 SNPs in 16 genes were genotyped in the WAMHS sample.

The SNPs were chosen on the basis that they were in or close to loci which have

been previously identified as melanoma-risk genes. The association of these SNPs

and melanoma-susceptibility were investigated by comparing the genotypic dis-

tributions of the WAMHS cases, with the genotypic distributions of two general

population samples - the HapMap-CEU and the BHS samples. After adjustment

for multiple testing, the association of six SNPs in five genes with melanoma

risk was replicated, with only two SNPs associated when compared to both the

HapMap-CEU and BHS control populations.

In this chapter, possible associations between these 42 melanoma-risk SNPs and

Breslow thickness were also investigated. Melanoma prognosis was determined by

Breslow thickness, with a thicker Breslow thickness indicating a poorer progno-

182Chapter 2. Genetic Epidemiology of Malignant Melanoma: Susceptibility and

Prognosis in the WAMHS

sis. Phenotypic analyses found increasing age at diagnosis was associated with

thicker Breslow thickness, and therefore poorer prognosis, while the presence of

naevi was associated with thinner melanomas, and therefore a better prognosis.

Four SNPs were associated with Breslow thickness, however after adjustment for

multiple testing, none of these associations remained significant.

One SNP - rs12203592 in the IRF4 gene - was associated with melanoma risk

in the WAMHS sample, compared to both the HapMap and BHS control sam-

ples, and was also associated with thicker melanomas in the WAMHS sample. In

particular, the ‘T’ allele was associated with both increased melanoma risk, and

thicker melanomas. However, the latter association was not significant after ad-

justment for multiple testing. No other SNPs were consistent in their associations.

It is possible that genetic variants which increase melanoma-susceptibility are

also associated with thicker Breslow thickness and therefore poorer prognosis.

However, the current study had several limitations, and was not able to provide

sufficient evidence to support this hypothesis. Replication of these genetic results,

particularly in larger samples, is required.

183CHAPTER 3

Mendelian Randomisation: An Application of

Instrumental Variable Techniques

3.1 Introduction

‘Mendelian randomisation’ is an application of a technique known as instrumental

variables (IVs), which can be used to better understand and model causality in

epidemiological studies. In this chapter I provide a background for IV methods

and Mendelian randomisation, including their advantages and disadvantages. I

also describe a novel implementation of Mendelian randomisation analyses incor-

porating haplotypes in a software library for R, MRsnphap.

When this thesis was being planned, I intended to investigate causal pathways

underlying melanoma. Mendelian randomisation approaches were therefore at-

tractive as they would have allowed me to investigate such pathways assuming the

conditions for Mendelian randomisation analyses were met (these are described

further in Section 3.4.2). One potential analysis was to investigate the effect of

serum vitamin D levels on Breslow thickness in the WAMHS cases, and if an as-

sociation was detected, to investigate whether this association was causal. The

rationale for this is discussed further in Section 3.2. Due to financial constraints,

vitamin D levels in the WAMHS were not measured, and therefore this analysis

was not able to be performed. However, this example is used throughout this

section to illustrate the possible use of Mendelian randomisation studies using

melanoma cases from the WAMHS.

184Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

3.2 Vitamin D Levels and Breslow Thickness

Vitamin D is a fat-soluble secosteroid and is a term generally used to refer to

vitamin D2 and/or vitamin D3. Vitamin D3 is produced when the skin is ex-

posed to sunlight, in particular, ultraviolet-B radiation. In recent years, vitamin

D levels, in particular vitamin D deficiency has been linked to increased risks of

chronic disease, including osteoarthritis, diabetes, prostate cancer, breast cancer,

and melanoma [277].

Vitamin D levels have been associated with both melanoma susceptibility and

prognosis [263]. In particular, increased vitamin D levels are associated with a

lower risk of melanoma recurrence, and thinner melanomas. Vitamin D levels are

often higher in individuals with substantial sun exposure, as exposure to ultra-

violet radiation leads to an increased production of vitamin D. Therefore, this

association suggests that individuals with higher vitamin D levels (and higher

ultraviolet radiation exposure) are at a lower risk of melanoma recurrence, and

have better melanoma prognosis. This is particularly interesting as high ultravio-

let radiation exposure is the main known environmental risk factor for melanoma,

therefore the association between high vitamin D levels and improved melanoma

prognosis and reduced recurrence warrants further investigation.

In a study of 872 patients with melanoma in the United Kingdom, Newton-Bishop

et al. [263] found that vitamin D protected against the recurrence of melanoma.

In addition, higher vitamin D levels were associated with thinner Breslow thick-

ness (P=0.002). For individuals with a melanoma less than 0.75mm, mean serum

vitamin D levels were 55.8 nmol/L and this decreased linearly to 48.5 nmol/L for

3.3. Epidemiological Studies 185

individuals with Breslow thickness of more than 3mm.

However, it is possible that vitamin D levels are not causally associated with

Breslow thickness or melanoma recurrence, rather these associations may be con-

founded by other factors, such as socioeconomic status or a healthy lifestyle. If

this relationship is causal, then an intervention to increase vitamin D levels may

reduce Breslow thickness and therefore improve melanoma prognosis, and may

also reduce melanoma recurrence.

3.3 Epidemiological Studies

One aim of epidemiological studies is to detect associations between disease-

associated outcomes and modifiable exposures so that interventions can be per-

formed on the exposure to reduce the impact on disease outcomes. Disease-

associated outcomes may include quantitative measures associated with disease,

such as Breslow thickness as a close correlate of melanoma prognosis, or a binary

measure, such as diagnosed melanoma. A modifiable exposure is some environ-

mental factor to which an individual is exposed, such as vitamin D levels. An

example of this would be a positive association detected between vitamin D levels

and Breslow thickness, where an intervention to increase vitamin D levels leads

to a decrease in the Breslow thickness of excised melanomas.

For an intervention to make a meaningful impact on disease outcomes, the ex-

posure must be causally associated with these outcomes. However, a causal as-

sociation may be difficult to determine. Broadly speaking, there are two types

of epidemiological studies – experimental and observational studies. The benefits

186Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

and limitations of these studies and their ability to infer causality are discussed

in the following section.

3.3.1 Experimental Studies

Experimental studies are epidemiological studies where the investigator is able to

manipulate who is exposed and who is unexposed to some treatment or exposure.

Randomised controlled trials (RCTs) are a type of experimental study which al-

locates individuals randomly into exposed and non-exposed groups, or treatment

and non-treatment groups, and are considered the ‘gold-standard’ for evidence

of causality [9, 231, 233, 243]. RCTs have the advantage that, as individuals are

randomly allocated into treatment and non-treatment groups, there should not be

any confounding between unobserved factors and the treatment received. There-

fore if the treatment and non-treatment groups groups exhibit different disease

characteristics, this should be wholly due to the treatment received. It is then

plausible to suggest that the treatment has a causal effect on the disease outcome.

As such, RCTs are recognised as the gold-standard for epidemiological studies.

However, it is not always possible to perform a RCT as it may not be ethical

or feasible to allocate individuals to a particular exposure. For example, if it is

hypothesised that smoking causes high blood pressure, it would not be ethical to

randomize individuals into smoking and non-smoking groups due to the known

harmful effects of smoking. In this example, an observational study may be pre-

ferred, where an individual’s blood pressure is measured and their smoking history

is collected either prospectively or retrospectively. In a study of vitamin D lev-

els, it may be difficult to allocate individuals into low and high vitamin D level

groups as it may be difficult to restrict an individual’s exposure to UVB in the

3.3. Epidemiological Studies 187

low-vitamin D level group. Therefore, in situations such as these, observational

studies may be utilised.

3.3.2 Observational Studies

Observational epidemiological studies are often a precursor to RCTs as they are

cheaper and easier to conduct, and are unlikely to violate ethical standards.

The three main types of observational epidemiological studies, cross-sectional,

case-control and cohort studies, and their abilities to infer causality in epidemio-

logical studies are described below [13].

Cross-sectional studies involve collecting information (usually through a ques-

tionnaire) regarding an individual’s disease status and possible risk exposures at

a point in time. Researchers then use this information to determine if there is

an association between risk exposures and some disease outcome. For example,

a study to investigate the effect of vitamin D levels on Breslow thickness may

involve the recruitment of individuals recently diagnosed with melanoma. Indi-

viduals may then have their vitamin D levels measured as soon as possible after

melanoma diagnosis, and their Breslow thickness recorded to investigate whether

vitamin D levels are associated with Breslow thickness, and therefore melanoma

prognosis.

Case-control studies involve recruiting two groups of individuals: cases, who have

some disease, and controls, who do not. These two groups are then compared

188Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

based on their past exposure to suspected disease risk factors. If the risk factor

is more prevalent in the cases, then it may be that this risk factor contributes to

disease. For example, two groups of individuals may be recruited – those diag-

nosed with melanoma, and those who have not been diagnosed with melanoma.

Individuals may then have their vitamin D levels measured to determine if the

levels of vitamin D are different amongst cases and controls. If vitamin D lev-

els are higher in controls, this suggests that vitamin D levels are associated with

melanoma susceptibility.

Cohort studies follow groups of healthy individuals through time to record ex-

posure to risk factors and incidence of disease or disease outcomes. This allows

researchers to determine if those individuals who develop the disease or disease

outcome had increased exposures to risk factors before the disease occurred. For

example, a cohort study may recruit a group of individuals all aged 20. These

individuals may then be followed for many decades, with their vitamin D levels

recorded annually. If melanoma diagnoses are also recorded, researchers will be

able to identify individuals who develop melanoma and compare their vitamin D

level histories with the individuals who have not developed melanoma. If individ-

uals with diagnosed melanoma also have low vitamin D levels, this suggests that

low vitamin D levels are associated with melanoma susceptibility.

These studies all have respective advantages and disadvantages. Data collection

in cross-sectional and case-control studies is often easier and less time-intensive

compared to cohort studies, however these studies are more likely to be subject

to recall bias, as participants with disease (cases) may be more likely to recall

exposures that they believe caused disease, compared to controls. In addition,

3.3. Epidemiological Studies 189

cohort studies are also able to demonstrate a temporal sequence between the ex-

posure and the occurrence of disease. The disadvantage of cohort studies lies in

the cost and time of conducting such studies – a great number of subjects may

need to be followed for a long period of time to accrue enough cases, particularly

when the disease is rare, or if there is a long lag-time between exposure and dis-

ease occurrence [225]. All three types of observational studies may be subject to

confounding, as an individual is self-selecting their exposure to risk factors which

may be related to unobserved factors, such as socioeconomic status or diet. For

example, individuals who have higher levels of vitamin D may have a healthier

diet and be more educated. These factors may also affect melanoma susceptibility

and prognosis, therefore confounding the observed association between vitamin D

levels and melanoma.

These potential biases mean that observational studies may not always be able to

detect true associations. The potential disparity in results between observational

studies and subsequent RCTs are discussed in the next section.

3.3.2.1 Limitations of observational epidemiology

Observational epidemiology has had many successes at identifying risk exposures

for disease, including perhaps the most high-profile, establishing the link between

smoking and cancers, in particular, lung cancer [226, 227]. However, there have

also been many notable failures; that is, associations reported from observational

studies which have not been replicated in ‘gold-standard’ RCTs.

One such failure is the putative association between hormone replacement therapy

190Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

(HRT) and CVD. An initial meta-analysis of observational epidemiological studies

showed that the use of HRT halved a woman’s risk of CVD, with a relative risk of

0.56 (95% CI = 0.50, 0.61) [228]. Similarly, a subsequent meta-analysis conferred

a relative risk of 0.65 (95% CI= 0.59, 0.71), prompting the authors to state that

women with CVD or at high-risk of CVD should probably be recommended for

HRT [229].

To test the hypothesis generated from these observational studies, a series of

RCTs were performed to investigate the effect of HRT on CVD risk. In a pooled

analysis of these subsequent RCTs, women given HRT were not found to have a

reduced risk of CVD; in fact, if any change was detected, it was an increased risk

of CVD [230].

Other reported associations from observational studies that have not been repli-

cated by RCTs include the association between elevated homocysteine level and

increased CVD risk, the use of antioxidant vitamins and reduced CVD and cancer

risk, and the use of beta-carotene and reduced smoking-related cancers [231–234].

The disparity between associations detected in epidemiological studies and RCTs

suggest that the initially reported associations in observational epidemiological

studies were not in fact causal. Instead, these observations are likely to be ex-

plained by a number of biases, which were removed or reduced in the RCT. These

possible biases include confounding by lifestyle, behavioural and socioeconomic

factors, reverse causation, and selection bias [235].

3.3. Epidemiological Studies 191

Regression dilution bias is another bias which may effect associations in observa-

tional epidemiological studies [235]. These biases and their impact on observa-

tional epidemiological studies are described in the following section.

3.3.2.1.1 Confounding

Confounding occurs when the exposure of interest is not associated directly with

a disease outcome, but rather is observed to be associated due to its association

with some other unidentified (‘confounding’) factor. For instance, the association

between higher vitamin D levels and decreasing Breslow thickness, may be due to

confounding. That is, vitamin D levels may be higher in individuals who exercise

more outdoors, take dietary supplements, and are generally in good health. These

individuals may also be more likely to examine their skin, and have regular visits

to the doctor. This may result in earlier melanoma diagnosis, and therefore thin-

ner tumours excised. In this example, the association between vitamin D levels

and Breslow thickness is confounded by a healthy lifestyle.

Healthy lifestyle

$$zzBreslow thickness Vitamin D levelsoo

Figure 3.3.2.1.1.1: Confounding – the observed association between Breslow thick-

ness and vitamin D levels is due to confounding by a healthy lifestyle

192Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

3.3.2.1.2 Reverse causation

Reverse causation occurs when instead of the exposure of interest influencing some

disease outcome, the disease outcome influences the exposure. This is a violation

of the concept of temporality which was used to define causality in Section 1.1.1.

Continuing the Breslow thickness and vitamin D example from earlier, reverse

causation may occur if it is the presence of a thicker melanoma tumour (thicker

Breslow thickness) which actually reduces vitamin D levels, rather than lower vi-

tamin D levels increasing Breslow thickness. If the reported association between

vitamin D levels and Breslow thickness was due to reverse causation, then indi-

viduals with thicker tumours would have lower vitamin D levels and it may be

wrongly assumed that lower vitamin D levels actually increase melanoma thick-

ness.

3.3.2.1.3 Selection bias

Selection bias occurs when study participants have been selected based on their

exposure and disease combination in case-control studies, or to their exposure and

disease outcome in association studies. An example of this was seen in the Cancer

Prevention Study II which consisted of 1.2 million volunteers [238]. This study

found that in a middle-aged and elderly population, heavy alcohol consumption

was associated with a reduced risk of death from stroke. Alcohol consumption in-

creases blood pressure - a major risk factor for stroke - so this association seemed

counterintuitive. Instead, Ebrahim and Davey Smith suggested [239] that this as-

sociation may have been reported as healthy individuals who drank heavily were

more likely to volunteer for and participate in a health study, compared to heavy

3.3. Epidemiological Studies 193

drinkers who had poor health. Therefore, study participants may have been se-

lected with bias, and associations that indicated heavy drinking reduced stroke

mortality may have been an artefact of the selection.

Similarly, for the Breslow thickness and vitamin D level example, selection bias

may occur if healthy individuals (with higher vitamin D levels) who have thicker

melanomas are more likely to volunteer for a study, compared to individuals who

are less healthy (with lower vitamin D levels) with thicker melanomas. In this

situation, it may appear that high vitamin D levels are associated with thicker

melanomas, but it may just be that individuals with low vitamin D levels and

thicker melanomas did not volunteer for the study.

3.3.2.1.4 Regression dilution bias

Regression dilution bias, or attenuation by errors, is the dilution of the associa-

tion between the exposure and disease risk, due to measurement error, or random

noise in the exposure. This dilution will result in a smaller reported effect of the

exposure on disease [240,241].

This commonly occurs in observational epidemiological studies where exposures

are often only measured once, but are used as a proxy for an individual’s long-

term average value. For example, in a study of vitamin D levels and Breslow

thickness, vitamin D levels may only be measured once during the study, and

these one-off measurements may be unrepresentative of an individual’s lifetime

vitamin D levels. Therefore, the effect of vitamin D levels on Breslow thickness

may be diluted and the estimated effect of vitamin D levels on Breslow thickness

194Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

may be underestimated.

3.4 Statistical Methods for Analysing Observational Epidemiological

Studies

In this section the statistical methods for observational epidemiological studies,

in particular cross-sectional studies, are described.

3.4.1 Ordinary least squares

Ordinary least squares (OLS), in particular linear OLS, is often used in regression

analysis to model relationships between measurements collected in studies, for ex-

ample, associations between some disease outcome and exposures in observational

epidemiological studies.

Consider a linear regression model,

y = Xβ + u,

where y is a column vector of observed response variables, X is the design matrix

for the intercept and regressors (explanatory variables), β is the column vector of

regression parameters, and u is the column vector of random errors.

The estimated regression parameter βOLS is given by,

βOLS = (XTX)−1XTy,

3.4. Statistical Methods for Analysing Observational Epidemiological Studies 195

where T and −1 indicate the transpose and inverse of a matrix, respectively.

The variance of βOLS is given by,

V (βOLS) = σ2OLS(XTX)−1,

where σ2OLS =

(uTOLS uOLS)

n, where uOLS is the OLS residual, uOLS = y − XβOLS,

and n is the number of observations.

3.4.1.1 Assumptions of ordinary least squares

There are three assumptions underlying the use of OLS:

1. Explanatory variables and residuals are independent, such that E(Xiui) = 0,

2. residuals are Normally distributed with expected value 0 and a common

variance, u ∼ N(0, σ2OLS), and

3. residuals, u, are independent.

If the above assumptions are not violated, the model is valid to both predict values

of y given X and to make inferences about the causal nature of the modelled asso-

ciation. For example, if there is an observed association between Breslow thickness

and vitamin D levels, and the assumptions of OLS are not violated (implying the

association is causal), then an intervention in the community to increase vitamin

196Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

D levels will have an effect on the Breslow thickness of melanomas.

However, if Assumption 1 is violated, that is, when E(Xiui) 6= 0, βOLS will be a

biased and inconsistent estimator of β. This correlation between the explanatory

variables and residuals may occur if the association between Breslow thickness

and vitamin D levels is due to confounding; that is, there is an omitted variable

which is associated with both Breslow thickness and vitamin D levels, such as a

healthy lifestyle.

In this situation, OLS would not be an appropriate technique to model the data,

and a different technique may be required. Instrumental Variables (IVs) is one

such method, and is described in the following section.

3.4.2 Instrumental variable methods

IV methods are a statistical technique which can be used to estimate the possi-

ble nature and direction of a causal relationship between two variables. Imagine

a researcher has modelled a relationship between Breslow thickness and vitamin

D levels from a cross-sectional observational study. They may be interested in

whether vitamin D levels are associated with the Breslow thickness, and if vita-

min D levels also have a causal effect on Breslow thickness.

As before, consider the linear regression model,

y = Xβ + u,

3.4. Statistical Methods for Analysing Observational Epidemiological Studies 197

where each term is as described in Equation 3.4.1.

Under conditional homoskedasticity, the error, u, is distributed as follows : u ∼ (0,Ω),

where Ω is the covariance matrix, Ω = σ2IV I, where I is the identity matrix. As

before, σ2IV =

(uTIV uIV )

n, where uIV is the IV residual, uIV = y−XβIV ; the formula

for βIV is described in Section 3.4.2.2 (please see page 198).

IV methods can be utilised when some regressors are endogenous; that is, when

they are correlated with the error term, so that Assumption 1 is violated, or

E(Xiui) 6= 0. In this case, by using IV methods, βIV will be consistent. En-

dogeneity of the regressors may occur if the association modelled is not causal.

Instead, the association may be subject to some bias as discussed in Section 3.3.2.1

(please see page 189), leading to correlation between the regressor and the error

term, resulting in endogeneity of the regressor.

The IV method can be modelled in the following way (adapted from Baum

[242]). Define X to be the nxk design matrix and partition X into two parts,

X = [X1, X2], where X1 consists of the k1 endogenous regressors (including the

intercept term) which are assumed to be correlated with the error term and X2

are the k − k1 exogenous regressors which are not assumed to be correlated with

the error term.

Define Z to be the nxl matrix of instrumental variables, which are the vari-

ables assumed to be exogenous, or not correlated with the error term, such that

E(Ziui) = 0. Z can also be partitioned into two parts, Z = [Z1, Z2], where Z1

198Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

consists of the l1 excluded instruments and Z2 are the l− l1 included instruments.

These included instruments are also the exogenous regressors described above.

3.4.2.1 Assumptions of instrumental variables

An excluded instrument, Z1, is defined as a variable that satisfies the following

assumptions [243]:

1. Z1 is strongly correlated with X1,

2. Z1 is independent of U (the factors that confound the association of X and

Y ),

3. Z1 is independent of outcome Y , given X1 and U , and

4. all of the associations between Z1, X1, Y , and U are linear and are unaffected

by statistical interactions.

These assumptions are depicted in the directed acyclic graph (DAG) in Figure

3.4.2.1.1.

3.4.2.2 Instrumental variable estimator of β

The IV estimator of β can be calculated by,

βIV = (XTPZX)−1XTyPZ ,

3.4. Statistical Methods for Analysing Observational Epidemiological Studies 199

Z1

X1

U

oo

Y

Figure 3.4.2.1.1: DAG for the instrumental variables model. Z1, instrumental vari-

able; X1, exposure of interest; Y, outcome of interest; and U, unmeasured con-

founders.

where PZ is the projection matrix and is defined by Z(ZTZ)−1ZT .

3.4.2.3 Variance-covariance matrix estimator of βIV

The variance-covariance matrix of the IV estimator, βIV , is,

V (βIV ) = σ2IV (XTPZX)−1,

where σ2IV and PZ are as defined earlier.

3.4.3 Instrumental variable diagnostic tests

There are several diagnostic tests which should be performed either during or after

IV analysis to ensure that the assumptions of IV analysis have not been violated.

These tests are described below.

3.4.3.1 Strength of excluded instruments

The excluded instruments must be strongly correlated with the endogenous re-

gressor (Assumption 1 in Section 3.4.2.1). The strength of this correlation can

200Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

be calculated by first regressing the endogenous regressor on both the exogenous

regressor and excluded instruments, and also regressing the endogenous regressor

on only the exogenous regressor. Comparison of these two models by analysis of

variance techniques yields an F-statistic for the relationship between the excluded

instruments and the endogenous regressor.

It has been shown [244, 268] that an F-statistic ≥ 10 indicates a strong correla-

tion. When the F-statistic < 10, a weak correlation exists and the instrument is

referred to as weak. While this is a convenient rule of thumb, the strength of the

F statistic will depend on the number of instruments used. For example, Stock et.

al. [268] estimated that for 1 instrument, bias was minimised with an F-statistic

of 8.96. However, with 15 instruments, an F-statistic of 26.80 would be required.

The use of weak instruments increases both the bias and inconsistency in the IV

coefficient. In general, IV estimates are biased towards the OLS estimates. As

the correlation between the instrument and endogenous regressor decreases, the

bias of βIV increases. Therefore, the stronger the correlation, and therefore the

strength of the instrument, the smaller the bias in βIV [279].

3.4.3.2 Overidentification test

If there are more instruments than endogenous regressors, L1 > K1 (which is

equivalent to L > K), it is possible to test the assumption that the excluded

instruments are independent of the unobservable error process (Assumption 2

in Section 3.4.2.1). As this can only occur when the equation is overidentified

(or when L > K), this is referred to as the overidentification test, and can be

3.4. Statistical Methods for Analysing Observational Epidemiological Studies 201

performed by calculating Sargan’s test statistic [245],

S =(y −X ˆβIV )TZ(ZTZ)−1ZT (y −X ˆβIV )

(y−X ˆβIV )T y−X ˆβIVn

.

Under the null hypothesis that all instruments are independent of the error, S

∼ χ2l−k. If there is sufficient evidence to reject the null hypothesis, this suggests

the instruments are not truly exogenous (independent of the error term) and there-

fore are violating Assumption 2 in Section 3.4.2.1.

This test may also provide evidence that each instrument is providing a consistent

estimate of the exposure-outcome association, which increases confidence regard-

ing the causal nature of the exposure [243].

3.4.3.3 Endogeneity test

When using IV estimation, the asymptotic variance of the IV estimator is al-

ways larger than the asymptotic variance of the OLS estimator [246]. This loss

of efficiency is necessary if the OLS estimator would be biased and inconsistent.

However, if IV estimation is used when not required, for example, if regressors

are treated as endogenous when they are truly exogenous and not correlated with

the error, the loss of efficiency is unnecessary. Therefore, the test for endogeneity

tests whether the endogenous regressors are truly endogenous. This is done by

essentially testing the null hypothesis that the OLS coefficient and IV coefficient

are consistent with each other.

202Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

An endogeneity test can be performed by calculating the Durbin–Wu–Hausman

(DWH) statistic [247–249]. The hypothesis being tested are:

H0: OLS estimator, βOLS, is efficient

H1: OLS estimator, βOLS, is not efficient.

The DWH statistic can be calculated in the following steps:

1. Calculate OLS coefficient,

βOLS = (XTX)−1XTy.

2. Calculate OLS variance,

σ2OLS =

(y −XβOLS)T (y −XβOLS)

n.

3. Calculate DWH statistic,

DWH = n(βIV − βOLS)TD−(βIV − βOLS)T ,

where D− is the generalised inverse of D, defined as,

D = (σ2OLS(XTPZX)−1 − (XTX)−1).

Under the null hypothesis, DWH ∼ χ2K1

. If there is sufficient evidence to reject

the null hypothesis, this suggests the tested endogenous regressors are correlated

with the error term and IV methods should be employed.

3.5. Mendelian Randomisation 203

3.4.4 Instrumental variable methods for observational epidemiological

studies

IV methods can be employed for determining causal associations from observa-

tional cross-sectional epidemiological studies. In this framework the response, y,

is some disease outcome and the endogenous regressor, X1, is the exposure which

is believed may be associated with the error term. The excluded instrument, Z1,

is typically a genetic variant, and any adjusting explanatory variables are the in-

cluded instruments, or exogenous variables, X2 and Z2. When used in this way,

the IV method is referred to as Mendelian randomisation which is introduced and

discussed in the following section.

3.5 Mendelian Randomisation

3.5.1 Introduction

Mendelian randomisation is a term given to observational studies that use ge-

netic variants known to be associated with an outcome to make causal inferences

about modifiable risk factors for disease and health-related outcomes [233]. This

is an application of IV methods, where the genetic variant is the excluded instru-

ment for the non-genetic exposures which may be correlated with the error term.

Mendelian randomisation studies are based upon the principle of Mendel’s 2nd

law, or the Law of Independent Assortment, which states that the inheritance

pattern of one trait is independent of the inheritance pattern of other traits. That

is, alleles on paired chromosomes are transmitted by parents to their offspring

at random (i.e. 50:50 probability), and the transmission of these alleles will be

independent of other traits at a population level.

204Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

The concept of Mendelian randomisation and using genetic variants as instruments

has recently been attributed to Katan [233]. Katan commented on an observa-

tional study which showed that low serum cholesterol levels were associated with

cancer. Katan questioned whether this relationship was causal, or if it may have

been due to a hidden tumour which had decreased cholesterol levels (reverse cau-

sation), or if the association was confounded by other factors related to both low

cholesterol and cancer risk, for example, smoking [250].

Katan suggested the authors of the study may wish to use another method to

determine if this relationship is causal; this method would later be referred to as

Mendelian randomisation. Katan’s idea was as follows: the E2 allele of the Apo E

polymorphism has been shown to be associated with lower cholesterol levels, com-

pared to the E3 and E4 alleles. These alleles, which are transmitted from parent to

offspring at conception, will not be affected by cancer status (reverse causation),

nor will they be associated with lifestyle or socioeconomic status (confounding).

Therefore, a comparison of Apo E alleles in cancer cases and controls may help

explain the relationship between cholesterol levels and cancer.

If the association was causal and low cholesterol causes cancer, then in Katan’s

design, there would be more E2 alleles amongst the cases, compared to the con-

trols. If the association was not causal, then the E2, E3 and E4 alleles would be

distributed equally amongst the cancer cases and controls.

This Mendelian randomisation study was recently investigated by Trompet et

3.5. Mendelian Randomisation 205

al. [251], who found no evidence to suggest that low cholesterol levels were causally

associated with increased cancer risk. This is just one of the numerous Mendelian

randomisation studies which have been conducted [252–256].

3.5.2 Genetic variants as excluded instruments

The genetic variants used in MR analysis may be either SNPs or haplotypes. The

use of both of SNPs and haplotypes is discussed in Sections 3.5.2.1 and 3.5.2.2

respectively.

3.5.2.1 SNPs as excluded instruments

In Mendelian randomisation analyses, the excluded instruments may be one or

more SNPs. This situation is straightforward and can be modelled easily based

on the IV methods described earlier in Section 3.4.2 (please see page 196).

3.5.2.2 Haplotypes as excluded instruments

In Mendelian randomisation analyses, the excluded instruments may be one or

more haplotypes. One advantage of haplotype analysis over single SNP analysis

is the greater statistical power of using haplotypes compared to single SNP analy-

ses [274,275]. In addition, as complex diseases may be caused by multiple variants

in the same gene or genetic region, haplotypes represent the combined effects of

these SNPs (which may not be detected when analysing one SNP at a time).

However, the use of haplotypes in statistical analyses is not as straightforward as

the SNP case, as when an individual is heterozygous at more than one loci, their

haplotype pair, or diplotype, cannot be known with certainty using conventionally

genotyped data.

206Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

As described in Section 1.3.2.3, the EM algorithm can be utilised to calculate the

probability of an individual having certain haplotypes given their genotype set.

In Mendelian randomisation studies, when using haplotypes as instruments, one

analytic option which has been employed by Timpson et al. [257], is to perform

the analysis using the most probable haplotype (i.e. the realisation of the hap-

lotype with the highest posterior probability) for each individual. However, this

may fail to account appropriately for the variability within the data. This would

be particularly troublesome if the analysis was sensitive to the haplotype cho-

sen. Instead, it would be preferable to incorporate this uncertainty directly into

the Mendelian randomisation analysis. The inclusion of haplotype uncertainty

in Mendelian randomisation analysis, is the method employed by the R library

MRsnphap which was created by the author. This is discussed further in Section

3.5.7 (please see page 214).

3.5.3 Mendelian randomisation example

To further clarify the use of Mendelian randomisation analysis for epidemiologi-

cal studies, consider a researcher is investigating the possible causal relationship

between high vitamin D levels and Breslow thickness, such that if there exists a

casual relationship, an intervention which increases vitamin D levels will result in

thinner melanomas excised.

In the first stage of this analysis, an OLS regression model can be fitted, with Bres-

low thickness as the response, vitamin D levels as the main explanatory variable,

or exposure, adjusting for the confounders, age and sex, such that,

3.5. Mendelian Randomisation 207

Breslow Thickness = β0 + β1Age + β2Sex + β3Vitamin D level.

Having fitted the above linear regression model, a significant negative association

between vitamin D levels and Breslow thickness may be observed, such that high

vitamin D levels are associated with lower Breslow thickness. However, the re-

searcher may also wish to investigate whether this relationship is causal and that

an increase in vitamin D levels will have the effect of reducing Breslow thickness.

It may be suspected that this relationship is not causal, and that both vitamin D

levels and Breslow thickness may be associated with unmeasured confounders, or

that melanomas which have penetrated deeper into the skin (with thicker Breslow

thickness) may be influencing vitamin D levels, such that β3 is correlated with the

error term. If this is the case, vitamin D levels may be treated as an endogenous

regressor (X1).

The next stage of the analysis, and possibly the most vital, is to identify a genetic

variant which is strongly associated with vitamin D levels. One possible genetic

variant may be the rs2282679 SNP on the Vitamin D Receptor gene which has

been shown to be associated with vitamin D levels [278]. Therefore, this SNP may

be used as the excluded instrument (Z1). The rs2282679 SNP may be modelled

additively, and coded as 0, 1 and 2, where 0, 1, and 2 indicates the number of

variant alleles. The confounders age and sex are the included instruments, or the

exogenous regressors (X2 and Z2).

208Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

This can be modelled using a DAG shown in Figure 3.5.3.1.

Z1 : rs2282679

))))X1 : Vitamin D

**

Ue.g.

healthylifestyle

tt

oo

Y : Breslow thickness

Figure 3.5.3.1: DAG for the Mendelian Randomisation model. Z1:rs2282679, in-

strumental variable; X1:Vitamin D levels, exposure of interest; Y:Breslow thick-

ness, outcome of interest; and U, unmeasured confounders. [Adapted from Lawlor

et al. [243]]

The strength of the excluded instrument-exposure relationship can be tested using

ANOVA to compare the model of vitamin D levels regressed on all instruments,

Vitamin D level = β0 + β1rs2282679 + β2Age + β3Sex,

with the model of vitamin D regressed only on the exogenous regressors,

Vitamin D level = β0 + β2Age + β3Sex.

The F-statistic for the strength of the association between rs2282679 and vita-

min D levels, may be calculated as F = 11.5. As this is > 10, the association

between vitamin D levels and the rs2282679 SNP is considered strong, and so the

Mendelian randomisation analysis may continue.

If the F-statistic calculated from the ANOVA was < 10, the recommendation

would be to not continue with the Mendelian randomisation analysis, as the esti-

mates would likely be biased.

3.5. Mendelian Randomisation 209

In order to test for overidentification, there must be more instruments than expo-

sures, such that L > K. In this example, k = 3 consisting of k1 = 1 endogenous

regressors and k − k1 = 2 exogenous regressors. There are also l = 3 instruments

consisting of l1 = 1 excluded instruments, l − l1 = 2 included instruments. As

l = k, the analysis is referred to as exactly identified. As such, it is not possible

to test for overidentification.

To test for endogeneity, a Durbin-Wu-Hausman test can be performed. After per-

forming this test, there may be sufficient evidence to reject the null hypothesis

(DWH statistic p-value < 0.05). This would indicate that vitamin D levels are

endogenous and the OLS methods are not efficient, therefore IV methods should

be used. If the DWH p-value > 0.05, vitamin D levels would not be endogenous,

therefore OLS methods should be employed.

Finally, the p-value for the vitamin D/rs2282679 coefficient of the Mendelian ran-

domisation analysis must be interpreted. Interpretation of the analysis should

involve the careful consideration of the OLS and IV coefficients, confidence inter-

vals and p-values. In this case, the OLS and IV coefficients, confidence intervals

and p-values are presented in Table 3.5.3.1.

Model Vitamin D/rs2282679 Coefficient Confidence Interval p-value

OLS 1.05 (1.01, 1.09) 0.01

IV 0.98 (0.95, 1.01) 0.08

Table 3.5.3.1: IV and OLS coefficients and confidence intervals

This difference between OLS and IV coefficient estimates and confidence intervals

210Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

(along with the DWH test) suggest that the observational epidemiological associ-

ation may be subject to bias. That is, that vitamin D levels may not be causally

associated with Breslow thickness and that the different OLS and IV estimates

may be due to unmeasured confounding or reverse causation. Therefore in this

situation, Mendelian randomisation analysis suggests that a public health inter-

vention to reduce the severity of melanoma by encouraging individuals to increase

their vitamin D levels through sun exposure or diet would not be beneficial.

3.5.4 Benefits of Mendelian randomisation

The use of genetic variants as excluded instruments in Mendelian randomisation

analysis has major advantages. It may help overcome the potential limitations of

observational epidemiological associations which were discussed in Section 3.3.2.1,

namely confounding, reverse causation, selection bias, and regression dilution.

3.5.4.1 Confounding

Exposures may be associated with socioeconomic, behavioural and lifestyle fac-

tors, however individual genetic variants are unlikely a priori to be associated

with these factors at a population level, due to their random transmission at con-

ception. This means that any reported association between a disease outcome and

the genetic variant will generally not be due to confounding.

3.5.4.2 Reverse causation

Presence of, and severity of disease may influence exposures in epidemiological

studies. This may be due to behavioural changes made by individuals, or through

unobserved chemical processes. However, genetic variants will not be influenced

by disease and will remain unchanged regardless of disease status.

3.5. Mendelian Randomisation 211

3.5.4.3 Selection bias

While individuals may be selected or volunteer for a study dependent on their

exposure and its influence on some disease measure, an individual is unlikely to

volunteer for or be selected either as a case or a control based solely on their

genetic variant.

3.5.4.4 Regression dilution bias

Unlike exposures, genetic variants only need to be measured once as they will

remain unchanged, and if these variants are associated with some exposure, the

genetic variant should be indicative of these exposure levels across the individual’s

lifetime.

3.5.5 Limitations of Mendelian randomisation

Along with the many advantages of Mendelian randomisation analysis, there

are several limitations [233, 243] which need to be considered when undertak-

ing Mendelian randomisation studies. The most significant of these, namely the

identification of a suitable genetic variant known with some certainty to be as-

sociated with the trait of interest, population stratification, and pleiotropy are

discussed in the following section.

3.5.5.1 Identification of suitable genetic variant for exposure

In order to use Mendelian randomisation, there must be a known genetic variant

that is clearly and consistently associated with the trait of interest [7]. Such a

variant may itself be directly functional or may be a marker in strong linkage

disequilibrium with one or more functional genetic variants. Historically, these

212Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

variants have not been easy to identify, however with the increase in genome-wide

association studies and functional genomic work, more such genetic variants are

becoming available with each passing month [8, 243].

3.5.5.2 Reliable gene associations

As with any genetic epidemiological study, Mendelian randomisation studies re-

quire reliable associations between the genetic variant and exposure and also be-

tween the genetic variant and the disease outcome. Reliability of associations can

be determined by ensuring that the associations have been replicated in several

independent studies, and should also be confirmed within the current Mendelian

randomisation study [243]. However, genetic association studies are renowned

for producing statistical associations that are not able to be subsequently repli-

cated. In fact, an examination of 301 published studies of 25 different reported

associations found less than half of these associations had strong evidence of repli-

cation [259]. This lack of replication may be due to publication bias, small power

to detect associations and the reporting of spurious associations.

3.5.5.3 Population stratification

Population stratification occurs when a population consists of subgroups who have

systematic differences in disease rates and allele frequencies [224]. This can result

in associations between genetic variants and disease in the whole population which

are related to, and confounded, by ethnicity. This may result in confounding in a

Mendelian randomisation study, particularly when gene-exposure associations are

performed in different study populations to the gene-outcome associations.

3.5. Mendelian Randomisation 213

3.5.5.4 Pleiotropy

Mendelian randomisation analysis assumes that the genetic variant is associated

with the disease outcome only through the exposure. It is possible that this genetic

variant is pleiotropic and actually effects multiple traits. However, this is only a

problem if one of these additionally affected traits influence the outcome [243].

For example, in the vitamin D levels and Breslow thickness example discussed

earlier: if the rs2282679 SNP affects not only vitamin D levels, but also skin

pigmentation, then skin pigmentation may also affect Breslow thickness. This

would be violating the assumption (Section 3.4.2.1) that rs2282679 affects Breslow

thickness only through vitamin D levels. However, if rs2282679 is pleiotropic and

also affects height, it is unlikely that Breslow thickness will be affected by this.

3.5.6 Current software for Mendelian randomisation analysis

To the author’s knowledge, there are two software packages currently available

for Mendelian randomisation analysis; however neither were designed specifically

for use in the epidemiological framework. IVREG2 is a comprehensive module

for Stata which can perform Mendelian randomisation analysis using SNPs and

known haplotypes as the excluded instrument [242]. However, IVREG2 is only

able to use the most probable haplotype for each individual, which as discussed

in Section 3.5.2.2 may introduce bias. IVREG2 also performs comprehensive di-

agnostic tests described in Section 3.4.3 (please see page 199).

TSLS is a function in the R library sem [260] which, like IVREG2, can perform

Mendelian randomisation analysis using SNPs and the most probable haplotype

for each individual as excluded instruments. However, this function does not per-

214Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

form any of the diagnostic tests described in Section 3.4.3, which are integral in

interpreting the results of the Mendelian randomisation analysis.

3.5.7 Software implementation

As there are no software packages available which model haplotype uncertainty in

Mendelian randomisation studies, the author designed a library within the soft-

ware package R [261], MRsnphap. MRsnphap has been designed to work within

the current SimHap [262] framework. SimHap is a statistical analysis program

designed to analyse associations between SNPs and haplotypes and various out-

comes. Two functions are available in MRsnphap – these are mrsnp.quant which

models Mendelian randomisation studies with the SNP as the excluded instru-

ment, and mrhap.quant which models these studies using phase-ambiguous hap-

lotypes. These functions also perform the testing described in Section 3.4.3, which

includes a test of overidentification, a test of endogeneity and an F-test for weak

instruments.

The methods employed in MRsnphap for Mendelian randomisation analyses are

described further in Sections 3.5.7.1 and 3.5.7.2 respectively.

3.5.7.1 Using SNPs as instruments: mrsnp.quant

The R function mrsnp.quant enables the user to perform Mendelian randomisa-

tion analysis, with a SNP as the excluded instrument. The mrsnp.quant function

is able to model continuous outcomes and exposures, while the SNP may be mod-

elled under additive, dominant or recessive assumptions, using a generalised linear

3.5. Mendelian Randomisation 215

model.

3.5.7.1.1 Example: mrsnp.quant

Referring back to the Mendelian randomisation example in Section 3.5.3 (please

see page 206), a researcher is interested in determining if there exists a causal

association between Breslow thickness and vitamin D levels. As they believe vita-

min D is an endogenous regressor, they use the rs2282679 SNP as an instrument

for vitamin D levels.

In order to use mrsnp.quant function, the MRsnphap library must be loaded.

When loading the MRsnphap library, several other libraries, including the SimHap

library are loaded simultaneously. In addition, the SimHap function SNP2Geno

must be used to generate the necessary SNP file. The code to load these pack-

ages, and run the necessary SimHap functions are included in the MRsnphap

manual in Appendix I.

The appropriate R input is:

mymodel < − mrsnp.quant(outcome = “BT”, exposure= “VitD”, covariates =

“AGE”, geneticmodel = “rs2282679 add”, geno=geno.dat, pheno=pheno.dat),

where BT (Breslow thickness) is a continuous outcome, VitD (vitamin D) is the

endogenous regressor and age is the exogenous regressor. The genetic model is

the rs2282679 SNP modelled additively, taking the values 0, 1 or 2, where these

indicate the number of variant alleles. The files containing the genotypic and

216Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

phenotypic data are geno.dat and pheno.dat, respectively.

The output produced by mrsnp.quant can be seen in Appendix J.

3.5.7.2 Using haplotypes as instruments: mrhap.quant

The mrhap.quant function enables the user to perform Mendelian randomisa-

tion analysis, with a haplotype as the excluded instrument. Like mrsnp.quant,

mrhap.quant models continuous outcomes and exposures, and the haplotype can

be modelled as either additive, dominant or recessive. The mrhap.quant func-

tion uses the same maximum likelihood method as SimHap to infer diplotypes

for phase-ambiguous individuals; that is, for individuals whose diplotypes are not

known for certain. Each phase-ambiguous individual is assigned possible diplo-

types consisting of two haplotypes, where each haplotype has a probability at-

tached to it derived from the EM algorithm. This is because for phase-ambiguous

individuals, it is not clear which haplotypes each individual has within their diplo-

type. All that can be calculated (through the EM algorithm) is the probability of

an individual having certain haplotypes given their genotype set. As these hap-

lotypes occur with a probability and are not known for certain, this uncertainty

can be included within the modelling framework.

In SimHap, this is done by simulating data sets in which individuals are assigned

haplotypes with the appropriate probabilities. Particular statistical models are

then fitted to each simulated data set and the results are aggregated.

For example, in mrhap.quant, a Mendelian randomisation model is fitted to each

3.5. Mendelian Randomisation 217

simulated data set and coefficients and p-values for the Mendelian randomisa-

tion analysis are averaged. In addition, the mean F-statistic is presented for the

test for weak instruments, and the mean p-values are presented for the tests for

endogeneity and overidentification.

3.5.7.2.1 Example: mrhap.quant

The mrsnp.quant example described in Section 3.5.7.1.1 can be extended to in-

clude haplotypes in mrhap.quant. The researcher is interested in determining if

there exists a causal association between increased Breslow thickness and lower

vitamin D levels, and wishes to use a haplotype to test this. They choose a haplo-

type of several SNPs from the Vitamin D Receptor gene, CCTG, which has been

found to be associated with vitamin D levels.

As before, to use the mrhap.quant function, the MRsnphap library must first

be loaded. In addition, the SimHap functions SNP2Haplo, infer.haplos and

make.haplos must be used to generate the necessary haplotype files. The code

to load these packages, and run the necessary SimHap functions are included in

the MRsnphap manual in Appendix I.

The appropriate R input is:

mymodel < − mrhap.quant(outcome = “BT”, exposure= “VitD”, covariates

= “AGE”, geneticmodel = “CCTG”, haplo=myhaplo.dat, effect=“add”, sim=

1000, pheno=pheno.dat),

218Chapter 3. Mendelian Randomisation: An Application of Instrumental Variable

Techniques

where BT, VitD, and age are as described earlier. The genetic model is a haplo-

type modelled as an additive effect. In this example, the Mendelian randomisation

is simulated 1000 times. The files containing the haplotype and phenotypic data

are myhaplo.dat and pheno.dat, respectively.

The output produced by mrhap.quant can be seen in Appendix K.

3.5.8 Summary

This chapter has described an overview of epidemiological studies, and their failure

to identify causal associations between disease outcomes and modifiable exposures.

IV methods were proposed as a method to infer causality in epidemiological stud-

ies when the use of RCTs are not possible. An application of this technique –

Mendelian randomisation – was also proposed, and the benefits and limitations

of this technique were described. The remainder of this chapter described the

implementation of a novel extension to MR –the inclusion of imputed haplotypes

and the construction of an R library, MRsnphap, which performs Mendelian ran-

domisation using both SNPs and haplotypes as instruments.

219CHAPTER 4

Summary and Suggestions for Further Research

This final chapter provides a discussion of the findings of this thesis, and suggests

areas for further research.

This thesis has described the establishment of a population-based collection of

melanoma cases in Western Australia. The WAMHS facilitated investigation into

the genetic epidemiology of both melanoma susceptibility and prognosis. In addi-

tion, this thesis described the implementation of Mendelian randomisation tech-

niques in an R library, MRsnphap, which can be used to infer causal associations

in epidemiological studies.

4.1 The Western Australian Melanoma Health Study

The WAMHS was established with the aim of investigating the genetic epidemi-

ology of melanoma susceptibility and prognosis. The WAMHS is one the single

largest population-based studies of melanoma cases in the world, and the analysis

in this thesis has shown that the WAMHS is a valuable resource for investigating

genetic risk factors associated with melanoma susceptibility and progression.

The results of the association analyses between candidate genetic variants and

melanoma susceptibility replicated some of the associations identified by earlier

studies. However, not all associations were replicated in the WAMHS. This may

be due to one of four reasons: (1) the initial associations were spurious; (2) the

genetic variants associated with melanoma in other populations are not the same

as in the WAMHS; (3) the WAMHS sample (n=800) was underpowered to detect

these associations; or (4) the controls used in these analyses were not suitable. The

220 Chapter 4. Summary and Suggestions for Further Research

latter reason may be particularly relevant in the use of HapMap controls. Several

SNPs were also relatively rare (MAF < 0.15), which may have contributed to

low power as described in (3). As such, further investigation of these associations

in a larger WAMHS sample would help elucidate the potential role these genetic

variants have in melanoma susceptibility in the Western Australian population.

The analysis of Breslow thickness, as a surrogate for melanoma prognosis, pro-

vided evidence for novel associations between genetic variants and Breslow thick-

ness, which had not been widely studied previously. In particular, the current

study was the first of its kind to test the hypothesis that genetic variants which

predispose an individual to melanoma, are also associated with a poorer prog-

nosis. These associations, however, did not remain significant after adjustment

for multiple testing. This may have been due to two reasons: (1) the associa-

tions were spurious; or (2) this study had low statistical power to detect small

differences in Breslow thickness due to low MAF or small sample size (n=800).

As such, replication in larger samples is required. If the associations observed

in the current study are replicated, this may lead to an improved understanding

of the genetic mechanisms underlying both melanoma susceptibility and prognosis.

However, further studies should not be restricted only to testing associations be-

tween Breslow thickness and genetic variants which are associated with melanoma

susceptibility. While the analyses presented in this thesis provide some evidence

for the association of genetic variants with both increased melanoma susceptibility

and increased Breslow thickness, it is probable that other genetic variants exist

which affect only Breslow thickness. A GWAS of Breslow thickness would allow

an unbiased approach in identifying genetic variants associated with melanoma

4.2. Mendelian Randomisation 221

prognosis, however this would need to be performed on a large sample for suffi-

cient power.

The analyses presented in this thesis identified genetic variants which were associ-

ated with melanoma susceptibility and prognosis in the WAMHS. However, these

analyses had several limitations. The WAMHS sample consisted of only Western

Australian, adult, Caucasian individuals diagnosed with melanoma, and therefore

these results are generalisable only to Western Australian, adult, Caucasian indi-

viduals. In addition, the WAMHS sample used in this thesis was small (n=800),

therefore it would be preferable to attempt to replicate these associations in larger

samples. Furthermore, these analyses identified genetic variants associated with

melanoma susceptibility and prognosis, however these variants may not be causal,

and may only be markers for causal variants. Functional genetic investigations

are required to determine whether these associations are causal, and this may also

lead to an enhanced understanding of melanoma susceptibility and prognosis.

4.2 Mendelian Randomisation

Mendelian randomisation has been shown to overcome some of the problems with

observational epidemiological studies, by its ability to identify causal relationships.

The statistical methodological work to model imputed haplotypes in Mendelian

randomisation analyses is, to the best of my knowledge, novel. Prior to the imple-

mentation of Mendelian randomisation methods in the R library MRsnphap, to

the author’s knowledge, no software designed specifically for use in the setting of

genetic association analysis was available to perform these analyses. As such, the

uncertainty surrounding haplotype assignment had not previously been able to be

included in Mendelian randomisation analyses. The development of MRsnphap

222 Chapter 4. Summary and Suggestions for Further Research

will therefore allow researchers to perform Mendelian randomisation analyses us-

ing both SNPs and imputed haplotypes as instruments.

The MRsnphap library does have some limitations, and further development of

MRsnphap would make it more useful to epidemiological researchers. Possible

future developments of MRsnphap are described in the following section.

4.2.1 Future development of MRsnphap

In the current release of MRsnphap, the functions mrsnp.quant and mrhap.quant

assume conditional homoskedasticity of the errors. However, it is possible, partic-

ularly for family studies and longitudinal studies, that heteroskedasticity will be

present. As such, the methods described in Section 3.4.2 will no longer be valid;

the IV coefficient estimate will be unaffected by heteroskedasticity, however the

standard error estimates will be inconsistent [264] and therefore tests of statistical

significance will be invalid. Additionally, diagnostic tests, including the test for

endogeneity and overidentification will be affected.

Therefore, heteroskedasticity-consistent standard errors and statistics need to be

calculated. One method for this is Generalised Methods of Moments (GMM). In

fact, IV and OLS are special cases of GMM estimation. GMM is robust to het-

eroskedasticity, however its use does require very large samples [264]. Therefore,

a later release of MRsnphap could be developed to allow for both homoskedastic

and heteroskedastic errors, including the Pagan and Hall test [265] to determine

if heteroskedasticity is in fact present.

4.2. Mendelian Randomisation 223

Currently Mendelian randomisation analysis in MRsnphap requires the outcome

to be continuous, otherwise Assumption 4 in Section 3.4.2.1 will be violated. To

date, no methods have been published that allow the inclusion of binary outcomes,

however when available, these could be incorporated into MRsnphap. This would

allow for Mendelian randomisation analysis to determine causal associations be-

tween both binary and continuous measures of disease.

MRsnphap could also be extended to include imputed SNP data in Mendelian

randomisation studies that has been generated from GWAS. One commonly used

imputation program is IMPUTE [276]. After running the IMPUTE imputation

procedure, an output file is created which contains three probabilities for each

individual at each SNP. These probabilities represent the probability of each in-

dividual having each of the three genotypes, i.e. individual 1 at SNP A may have

the probabilities 0.2, 0.5, and 0.3 which represent the probability of having the

AG, GG, and GG genotypes, respectively. These probabilities could be used in

MRsnphap using the same modelling framework which is currently used for hap-

lotype analysis.

Some limitations of Mendelian randomisation described earlier in Section 3.5.5,

such as population stratification and pleiotropy, may be reduced by the use of

multiple instruments. That is, the use of two or more genetic variants which are

both associated with the exposure. At the current time, methods to incorporate

multiple genetic variants in Mendelian randomisation analysis are not available.

However, when available, these methods could be incorporated into MRsnphap.

224 Chapter 4. Summary and Suggestions for Further Research

4.3 Conclusion

The aims of this thesis, as described in Chapter 1, were to establish the WAMHS

database and to use these data to investigate the environmental, host and genetic

risk factors associated with melanoma susceptibility and prognosis. The main

hypothesis of these analyses was that factors associated with increased melanoma

susceptibility are also associated with poorer melanoma prognosis. One genetic

variant on the IRF4 gene was associated with both increased melanoma risk, and

thicker melanomas, resulting in a poorer melanoma prognosis.

In addition, this thesis aimed to extend a statistical technique for modelling causal

associations in epidemiological studies, known as Mendelian randomisation, to in-

clude the use of imputed haplotypes as instruments, and to implement this tech-

nique for use by other researchers in a library for R, MRsnphap.

The collection and use of high-quality genetic and epidemiological data is integral

in identifying risk factors for disease susceptibility and prognosis. The discovery

of genetic variants associated with melanoma susceptibility has the potential to

identify individuals who are high risk genetically of developing melanoma. These

identified individuals would then be able to modify their behaviour, for example,

by limiting their sun exposure, with the hope that the disease does not develop.

Similarly, the discovery of genetic variants associated with melanoma prognosis

may help identify individuals at increased risk of poorer melanoma prognosis,

and may also potentially lead to the development of new treatments. The de-

velopment of the R library MRsnphap may enable future melanoma (and other

disease) researchers to determine if environmental risk factors are causally asso-

4.3. Conclusion 225

ciated with disease. These lifestyle and behavioural risk factors for disease may

then be modified in order to reduce the impact of disease in the community.

226 Chapter 4. Summary and Suggestions for Further Research

227

Bibliography

[1] J.M. Last. A Dictionary of Epidemiology. Oxford University Press, 4th

edition, 2001.

[2] Hippocrates. On airs, waters, and places, 400 BC.

[3] J. Snow. The cholera near Golden Square, and at Deptford.

[4] W.G. Hill and B.S. Weir. Maximum-likelihood estimation of gene location

by linkage disequilibrium. American Journal of Human Genetics, 54:705–74,

1994.

[5] R. Doll and A.B. Hill. Smoking and carcinoma of the lung; preliminary

report. British Medical Journal, 2(4682):739–48, 1950.

[6] The Homocysteine Studies Collaboration. Homocysteine and risk of ischemic

heart disease and stroke. A meta-analysis. Journal of the American Medical

Association, 288:2015–22, 2002.

[7] S.J. Chanock, T. Manolio, M. Boehnke, E. Boerwinkle, D.J. Hunter, G.

Thomas, J.N Hirschhorn, G. Abecasis, D. Altshuler, J.E Bailey-Wilson, L.D

Brooks, L.R. Cardon, M. Daly, P. Donnelly, J.F Fraumeni, N.B Freimer,

D.S Gerhard, C. Gunter, A.E Guttmacher , M.S Guyer, E.L Harris, J.

Hoh, R. Hoover, C.A. Kong, K.R. Merikangas, C.C. Morton, L.J. Palmer,

E.G. Phimister, J.P. Rice, J. Roberts, C. Rotimi, M.A. Tucker, K.J. Vo-

gan, S. Wacholder, E.M Wijsman, D.M Winn, and F.S Collins. Replicating

genotype-phenotype associations Nature, 447(7145):655–60, 2007.

228 BIBLIOGRAPHY

[8] C.S. Ku, E.Y. Loy, Y. Pawitan, and K.S. Chia. The pursuit of genome-

wide association studies: where are we now? Journal of Human Genetics,

55(4):195–206, 2010.

[9] V. Didelez, and N. Sheehan. Mendelian randomization as an instrumen-

tal variable approach to causal inferences. Statistical Methods in Medical

Research, 16:309–30, 2007.

[10] H. Zur Hausen. Papillomaviruses in the causation of human cancers – a

brief historical account. Virology, 384(2):260–65, 2009.

[11] A.B. Hill. The environment and disease: Association or causation? Pro-

ceedings of the Royal Society of Medicine, 58:295–300, 1965.

[12] N.E. Morton. Outline of Genetic Epidemiology. Karger, Basel, 1982.

[13] D. Coggon, D. Barker, and G. Rose. Epidemiology for the Uninitiated. BMJ

Books, 2003.

[14] J.V. Neel and W.J. Schull. Human Heredity. University of Chicago Press,

Chicago, IL, 1954.

[15] M.J. Khoury. Genetic epidemiology and the future of disease prevention

and public health. Epidemiological Reviews, 19(1):175–80, 1997.

[16] H.L. Levy. Phenylketonuria: Old disease, new approach to treatment. Pro-

ceedings of the National Academy of Sciences of the United States of Amer-

ica, 96(5):1811–13, 1999.

[17] D. Wood, G. De Backer, O. Faergeman, I. Graham, G. Mancia, and K. Py-

orala. Prevention of coronary heart disease in clinical practice. European

Heart Journal, 19(10):1434–1503, 1998.

BIBLIOGRAPHY 229

[18] J. Bell. The new genetics: the new genetics in clinical practice. British

Medical Journal, 316(7131):618–20, 1998.

[19] R.H. Myers. Huntington’s disease genetics. Neurotherapeutics, 1(2):255–62,

2004.

[20] S.T. Lommatzsch and R. Aris. Genetics of cystic fibrosis. Seminars in

Respiratory and Critical Care Medicine, 30(5):531–38, 2009.

[21] W. Kalow. Pharmacogenetics and personalised medicine. Fundamental and

Clinical Pharmacology, 16(5):337–42, 2002.

[22] S. Mallal, D. Nolan, C. Witt, G. Masel, A.M. Martin, C. Moore, D. Sayer,

A. Castley, C. Mamotte, D. Maxwell, I. James, and F.T. Christiansen. As-

sociation between presence of HLA-B∗5701, HLA-DR7, and HLA-DQ3 and

hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir. Lancet,

359(9308):727–32, 2002.

[23] S. Mallal, E. Phillips, G. Carosi, J.-M. Molina, C. Workman, J. Tomazic,

E. Jagel-Guedes, S. Rugina, O. Kozyrev, J.F. Cid, P. Hay, D. Nolan,

S. Hughes, A. Hughes, S. Ryan, N. Fitch, D. Thorborn, A. Benbow, and The

Predict-Study team. HLA-B∗5701 screening for hypersensitivity to abacavir.

New England Journal of Medicine, 358(6):568–79, 2008.

[24] L. Excoffier and M. Slatkin. Maximum-likelihood estimation of molecular

haplotype frequencies in a diploid population. Molecular Biology and Evo-

lution, 12(5):921–27, 1995.

[25] A.G. Clark. Inference of haplotypes from PCR-amplified samples of diploid

populations. Molecular Biology and Evolution, 7(2):111–22, 1990.

230 BIBLIOGRAPHY

[26] M. Stephens, N.J. Smith, and P. Donnelly. A new statistical method for hap-

lotype reconstruction from population data. American Journal of Human

Genetics, 68(4):978–89, 2001.

[27] T. Niu, Z.S. Qin, X. Xu, and J.S. Liu. Bayesian haplotype inference for mul-

tiple linked single-nucleotide polymorphisms. American Journal of Human

Genetics, 70(1):157–69, 2002.

[28] A. Gelman and T.P. Speed. Characterizing a joint probability distribution

by conditionals. Journal of the Royal Statistical Society. Series B (Method-

ological), 55(1):185–88, 1993.

[29] M.R. Cummings. Human Heredity:Principles and Issues. Wadsworth Pub-

lishing Company, 4th edition, 1997.

[30] A.A. Mitchell, D.J. Cutler, and A. Chakravarti. Undetected genotyping

errors cause apparent overtransmission of common alleles in the transmis-

sion/disequilibrium test. American Journal of Human Genetics, 72(3):598–

610, 2003.

[31] R.C. Lewontin. The interaction of selection and linkage. I. General consid-

erations; heterotic models. Genetics, 49(1):49–67, 1964.

[32] B.S. Kerem, J.M. Rommens, J.A. Buchanan, D. Markiewicz, T.K. Cox,

A. Chakravarti, M. Buchwald, and L.C. Tsui. Identification of the cystic-

fibrosis gene - genetic-analysis. Science, 245(4922):1073–80, 1989.

[33] M.E. Macdonald, C.M. Ambrose, M.P. Duyao, R.H. Myers, C. Lin,

L. Srinidhi, G. Barnes, S.A. Taylor, M. James, N. Groot, H. Macfar-

lane, B. Jenkins, M.A. Anderson, N.S. Wexler, J.F. Gusella, G.P. Bates,

S. Baxendale, H. Hummerich, S. Kirby, M. North, S. Youngman, R. Mott,

BIBLIOGRAPHY 231

G. Zehetner, Z. Sedlacek, A. Poustka, A.M. Frischauf, H. Lehrach, A.J.

Buckler, D. Church, L. Doucettestamm, M.C. Odonovan, L. Ribaramirez,

M. Shah, V.P. Stanton, S.A. Strobel, K.M. Draths, J.L. Wales, P. Dervan,

D.E. Housman, M. Altherr, R. Shiang, L. Thompson, T. Fielder, J.J. Was-

muth, D. Tagle, J. Valdes, L. Elmer, M. Allard, L. Castilla, M. Swaroop,

K. Blanchard, F.S. Collins, R. Snell, T. Holloway, K. Gillespie, N. Datson,

D. Shaw, and P.S. Harper. A novel gene containing a trinucleotide repeat

that is expanded and unstable on Huntington’s-disease chromosomes. Cell,

72(6):971–83, 1993.

[34] J.N. Hirschhorn and M.J. Daly. Genome-wide association studies for com-

mon diseases and complex traits. Nature Review Genetics, 6(2):95–108,

2005.

[35] J. Altmuller, L.J. Palmer, G. Fischer, H. Scherb, and M. Wjst. Genomewide

scans of complex human diseases: true linkage is hard to find. American

Journal of Human Genetics, 69(5):936–50, 2001.

[36] N. Risch and K. Merikangas. The future of genetic studies of complex human

diseases. Science, 273(5281):1516–17, 1996.

[37] Wellcome Trust Case Control Consortium. Genome-wide association study

of 14,000 cases of seven common diseases and 3,000 shared controls. Nature,

447(7145):661–78, 2007.

[38] R. Mayeux. Mapping the new frontier: complex genetic disorders. The

Journal of Clinical Investigation, 115(6):1404–07, 2005.

[39] A.T. Hattersley and M.I. McCarthy. What makes a good genetic association

study? Lancet, 366(9493):1315–23, 2005.

232 BIBLIOGRAPHY

[40] H.J. Cordell and D.G. Clayton. Genetic association studies. Lancet,

366(9491):1121–31, 2005.

[41] T.V. Perneger. What’s wrong with Bonferroni adjustments. BMJ,

316(7139):1236–38, 1998.

[42] Y. Benjamini and Y. Hochberg. Controlling the false discovery rate: a

practical and powerful approach to multiple testing. Journal of the Royal

Statistical Society, 57(1):289–300, 1995.

[43] J. Ferlay, F. Bray, P. Pisani, and D.M. Parkin. Globocan 2002: Cancer

incidence, mortality and prevalence worldwide, 2004.

[44] R. Laennec. Sur les melanoses. Bulletin de la Faculte de Medecine de Paris,

1:24–26, 1806.

[45] D.C. Bodenham. A study of 650 observed malignant melanomas in the

south-west region. Annals of the Royal College of Surgeons of England,

43(4):218–39, 1968.

[46] S. Cooper. First lines of theory and practice of surgery. Longman, Orme,

Brown, Green and Longman, London, 1840.

[47] M.J. Sladden, C. Balch, D.A. Barzilai, D. Berg, A. Freiman, T. Handiside,

S. Hollis, M.B. Lens, and J.F. Thompson. Surgical excision margins for

primary cutaneous melanoma. Cochrane Database of Systematic Reviews,

(4):CD004835, 2009.

[48] H.O. Lancaster. Some geographical aspects of the mortality from melanoma

in Europeans. Medical Journal of Australia, 43(26):1082–87, 1956.

BIBLIOGRAPHY 233

[49] J. Lin, T.L. Hocker, M. Singh, and H. Tsao. Genetics of melanoma predis-

position. British Journal of Dermatology, 159(2):286–91, 2008.

[50] M.P. Staples, M. Elwood, R.C. Burton, J.L. Williams, R. Marks, and G.G.

Giles. Non-melanoma skin cancer in Australia: the 2002 national survey

and trends since 1985. Medical Journal of Australia, 184(1):6–10, 2006.

[51] Australian Institute of Health, Welfare, and Australasian Association

of Cancer Registries. Cancer in Australia: an overview, 2008. Cancer series

no.046. AIHW, Canberra, 2008.

[52] Australian Institute of Health, Welfare, and Cancer Australia. Non-

melanoma skin cancer: general practice consultations, hospitalisation and

mortality. Technical Report Series no. 43. Cat. no. 39., Australian Institute

of Health and Welfare, 2008.

[53] Clark, W.H., D.E. Elder, D.T. Guerry, M.N. Epstein, M.H. Greene, and

M. Van Horn. A study of tumor progression: the precursor lesions of super-

ficial spreading and nodular melanoma. Human Pathology, 15(12):1147–65,

1984.

[54] A.J. Miller and M.C. Mihm. Mechanisms of disease - melanoma. New

England Journal of Medicine, 355(1):51–65, 2006.

[55] S.B. Gruber, R.L. Barnhill, K.S. Stenn, and G.C. Roush. Nevomelanocytic

proliferations in association with cutaneous malignant melanoma: a multi-

variate analysis. Journal of the American Academy of Dermatology, 21(4 Pt

1):773–80, 1989.

234 BIBLIOGRAPHY

[56] R. Marks, A.P. Dorevitch, and G. Mason. Do all melanomas come from

“moles”? a study of the histological association between melanocytic naevi

and melanoma. Australas Journal of Dermatology, 31(2):77–80, 1990.

[57] Y. Chudnovsky, P.A. Khavari, and A.E. Adams. Melanoma genetics and

the development of rational therapeutics. Journal of Clinical Investigation,

115(4):813–24, 2005.

[58] P. Jelfs, M. Coates, G. Giles, D. Shugg, T. Threlfall, D. Roder, I. Ring,

B. Shadbolt, and J. Condon. Cancer in Australia, 1989 to 1990 (with projec-

tions to 1995). Technical report, Australian Institute of Health and Welfare,

1996.

[59] Australian Institute of Health and Welfare. Australian Cancer Incidence

and Mortality. Australian Institute of Health and Welfare, Canberra, 2007.

[60] Australian Institute of Health and Welfare. Number of new cases and age-

specific rates for selected cancers by year of registration, sex and 5 year-age

groups. COGNOS data cube. Australian Institute of Health and Welfare,

Canberra, 2007.

[61] Australian Institute of Health, Welfare, Cancer Australia, and Aus-

tralasian Association of Cancer Registries. Cancer survival and prevalence

in Australia: cancers diagnosed from 1982 to 2004. Technical report, Aus-

tralian Institute of Health and Welfare, 2008.

[62] T. Threlfall and J. Thompson. Cancer incidence and mortality in Western

Australia, 2007. Technical report, Department of Health, 2009.

[63] T. Threlfall and J. Thompson. Cancer incidence and mortality in Western

Australia, 2005. Technical report, Department of Health, 2007.

BIBLIOGRAPHY 235

[64] T. Threlfall and J. Thompson. Cancer incidence and mortality in Western

Australia, 2006. Technical report, Department of Health, 2008.

[65] V. Bataille and E. De Vries. Melanoma–Part 1: Epidemiology, risk factors,

and prevention. British Medical Journal, 337:1287–91, 2008.

[66] S. Gandini, F. Sera, M.S. Cattaruzza, P. Pasquini, D. Abeni, P. Boyle,

and C.F. Melchi. Meta-analysis of risk factors for cutaneous melanoma:

I. Common and atypical naevi. European Journal of Cancer, 41(1):28–44,

2005.

[67] International Agency for Research on Cancer. IARC monograph on the

evaluation of carcinogenic risks to humans; ultraviolet radiation. Technical

report, 1992.

[68] Y. Chen and M. Berwick. Reproducibility of reported sun exposure be-

haviour in a case-control study of cutaneous malignant-melaoma. American

Journal of Epidemiology, 138(8):668–668, 1993.

[69] D.R. English, B.K. Armstrong, and A. Kricker. Reproducibility of reported

measurements of sun exposure in a case-control study. Cancer Epidemiology

Biomarkers & Prevention, 7(10):857–63, 1998.

[70] D.C. Whiteman, C.A. Whiteman, and A.C. Green. Childhood sun exposure

as a risk factor for melanoma: a systematic review of epidemiologic studies.

Cancer Causes & Control, 12(1):69–82, 2001.

[71] R. Steinitz, D.M. Parkin, J.L. Young, C.A. Bieber, and L. Katz. Cancer inci-

dence in Jewish migrants to Israel, 1961-1981. IARC Scientific Publications,

(98):1–311, 1989.

236 BIBLIOGRAPHY

[72] D. Anaise, R. Steinitz, and N.B. Hur. Solar-radiation - possible etiological

factor in malignant-melanoma in Israel - retrospective study (1960-1972).

Cancer, 42(1):299–304, 1978.

[73] M. Movshovi and B. Modan. Role of sun exposure in etiology of malignant-

melanoma - epidemiologic inference. Journal of the National Cancer Insti-

tute, 51(3):777–79, 1973.

[74] M. Khlat, A. Vail, M. Parkin, and A. Green. Mortality from melanoma

in migrants to Australia - variation by age at arrival and duration of stay.

American Journal of Epidemiology, 135(10):1103–13, 1992.

[75] A.J. Dobson and S.R. Leeder. Mortality from malignant-melanoma in Aus-

tralia - effects due to country of birth. International Journal of Epidemiol-

ogy, 11(3):207–11, 1982.

[76] M. McCredie, M.S. Coates, and J.M. Ford. Cancer incidence in European

migrants to New south Wales. Annals of Oncology, 1(3):219–25, 1990.

[77] C.D.J. Holman, C.D. Mulroney, and B.K. Armstrong. Epidemiology of

pre-invasive and invasive malignant-melanoma in Western Australia. Inter-

national Journal of Cancer, 25(3):317–23, 1980.

[78] A.J. McMichael and G.G. Giles. Cancer in migrants to Australia - extending

the descriptive epidemiological data. Cancer Research, 48(3):751–56, 1988.

[79] K.R. Cooke and J. Fraser. Migration and death from malignant-melanoma.

International Journal of Cancer, 36(2):175–78, 1985.

[80] R.S. Paffenbarger, A.L. Wing, and R.T. Hyde. Chronic diseases in for-

mer college-students. 15. Characteristics in youth predictive of adult-onset

BIBLIOGRAPHY 237

malignant-lymphomas, melanomas, and leukemias. Journal of the National

Cancer Institute, 60(1):89–92, 1978.

[81] L. Katz, S. Ben-Tuvia, and R. Steinitz. Malignant melanoma of the skin in

Israel: effect of migration. Trends in Cancer Incidence, Causes and Practical

Implications, 419–26, 1982.

[82] M.W. Hinds and L.N. Kolonel. Malignant-melanoma of the skin in Hawaii,

1960-1977. Cancer, 45(4):811–17, 1980.

[83] S. Gandini, F. Sera, M.S. Cattaruzza, P. Pasquini, O. Picconi, P. Boyle, and

C.F. Melchi. Meta-analysis of risk factors for cutaneous melanoma: II. Sun

exposure. European Journal of Cancer, 41(1):45–60, 2005.

[84] C.D.J. Holman, and B.K. Armstrong. Cutaneous malignant-melanoma and

indicators of total accumulated exposure to the sun - an analysis separating

histogenetic types. Journal of the National Cancer Institute, 73(1):75–82,

1984.

[85] A. Green, C. Bain, R. McLennan, and V. Siskind. Risk factors for cutaneous

melanoma in Queensland. Recent Results in Cancer Research, 102:76–97,

1986.

[86] J.M. Elwood. Melanoma and sun exposure - contrasts between intermittent

and chronic exposure. World Journal of Surgery, 16(2):157–65, 1992.

[87] C.D.J. Holman and B.K. Armstrong. Pigmentary traits, ethnic-origin,

benign nevi, and family history as risk-factors for cutaneous malignant-

melanoma. Journal of the National Cancer Institute, 72(2):257–66, 1984.

238 BIBLIOGRAPHY

[88] J.A.H. Lee and D. Strickland. Malignant-melanoma - social-status and out-

door work. British Journal of Cancer, 41(5):757–63, 1980.

[89] J.M. Bliss, D. Ford, A.J. Swerdlow, B.K. Armstrong, M. Cristofolini, J.M.

Elwood, A. Green, E.A. Holly, T. Mack, R.M. Mackie, A. Osterlind, S.D.

Walter, J. Peto, and D.F. Easton. Risk of cutaneous melanoma-associated

with pigmentation characteristics and freckling - systematic overview of 10

case-control studies. International Journal of Cancer, 62(4):367–76, 1995.

[90] B.K. Armstrong and A. Kricker. The epidemiology of UV induced skin

cancer. Journal of Photochemistry and Photobiology B: Biology, 63(1-3):8–

18, 2001.

[91] A.R. Rhodes, M.A. Weinstock, T.B. Fitzpatrick, M.C. Mihm, and A.J.

Sober. Risk-factors for cutaneous melanoma - a practical method of rec-

ognizing predisposed individuals. Journal of the American Medical Associ-

ation, 258(21):3146–54, 1987.

[92] T.R. Fears, C.C. Bird, D. Guerry, R.W. Sagebiel, M.H. Gail, D.E. Elder,

A. Halpern, E.A. Holly, P. Hartge, and M.A. Tucker. Average midrange

ultraviolet radiation flux and time outdoors predict melanoma risk. Cancer

Research, 62(14):3992–96, 2002.

[93] N.K. Hayward. Genetics of melanoma predisposition. Oncogene,

22(20):3053–62, 2003.

[94] A.W. Kopf, L.J. Hellman, G.S. Rogers, D.F. Gross, D.S. Rigel, R.J. Fried-

man, M. Levenstein, J. Brown, F.M. Golomb, and D.F. Roses. Familial ma-

lignant melanoma. Journal of the American Medical Association, 256:1915–

19, 1986.

BIBLIOGRAPHY 239

[95] S. Gandini, F. Sera, M.S. Cattaruzza, P. Pasquini, R. Zanetti, C. Masini,

P. Boyle, and C.F. Melchi. Meta-analysis of risk factors for cutaneous

melanoma: III. Family history, actinic damage and phenotypic factors. Eu-

ropean Journal of Cancer, 41(14):2040–59, 2005.

[96] D. Ford, J.M. Bliss, A.J. Swerdlow, B.K. Armstrong, S. Franceschi,

A. Green, E.A. Holly, T. Mack, R.M. Mackie, A. Osterlind. Risk of cu-

taneous melanoma associated with a family history of the disease. Interna-

tional Journal of Cancer, 62(4):377–81, 1995.

[97] R.L. Barnhill, G.C. Roush, L. Titus-Ernstoff, M.S. Ernstoff, P.H. Duray,

and J.M. Kirkwood. Comparison of nonfamilial and familial melanoma.

Dermatology, 184(1):2–7, 1992.

[98] M.A. Tucker and A.M. Goldstein. Melanoma etiology: where are we? Onco-

gene, 22(20):3042–52, 2003.

[99] J.A. Newton. Familial melanoma. Clinical and Experimental Dermatology,

18(1):5–11, 1993.

[100] V. Bataille, J.A.Newton–Bishop, P. Sasieni, A.J. Swerdlow, E. Pinney,

K. Griffiths, and J. Cuzickz. Risk of cutaneous melanoma in relation to

the numbers, types and sites of naevi: A case-control study. British Journal

of Cancer, 73(12):1605–11, 1996.

[101] M. Cristofolini, S. Franceschi, L. Tasin, G. Zumiani, F. Piscioli, R. Ta-

lamini, and C. Lavecchia. Risk-factors for cutaneous malignant-melanoma in

a northern Italian population. International Journal of Cancer, 39(2):150–

54, 1987.

240 BIBLIOGRAPHY

[102] P. Youl, J. Aitken, N. Hayward, D. Hogg, L. Liu, N. Lassam, N. Martin, and

A. Green. Melanoma in adolescents: A case-control study of risk factors in

Queensland, Australia. International Journal of Cancer, 98(1):92–98, 2002.

[103] C.J. Hussussian, J.P. Struewing, A.M. Goldstein, P.A.T. Higgins, D.S. Ally,

M.D. Sheahan, W.H. Clark, M.A. Tucker, and N.C. Dracopoli. Germline

p16 mutations in familial melanoma. Nature Genetics, 8(1):15–21, 1994.

[104] L. Zuo, J. Weger, Q. Yang, A.M. Goldstein, M.A. Tucker, G.J. Walker,

N. Hayward, and N.C. Dracopoli. Germline mutations in the p16INK4a

binding domain of CDK4 in familial melanoma. Nature Genetics, 12(1):97–

99, 1996.

[105] M. Falchi, V. Bataille, N.K. Hayward, D.L. Duffy, J.A.N. Bishop, T. Pasti-

nen, A. Cervino, Z.Z. Zhao, P. Deloukas, N. Soranzo, D.E. Elder, J.H.

Barrett, N.G. Martin, D.T. Bishop, G.W. Montgomery, and T.D. Spector.

Genome-wide association study identifies variants at 9p21 and 22q13 asso-

ciated with development of cutaneous nevi. Nature Genetics, 41(8):915–19,

2009.

[106] D.T. Bishop, F. Demenais, M.M. Iles, M. Harland, J.C. Taylor, E. Corda,

J. Randerson-Moor, J.F. Aitken, M.-F. Avril, E. Azizi, B. Bakker,

G. Bianchi-Scarra, B. Bressac-De Paillerets, D. Calista, L.A. Cannon-

Albright, T. Chin-a Woeng, T. Debniak, G. Galore-Haskel, P. Ghiorzo,

I. Gut, J. Hansson, M. Hocevar, V. Hoiom, J.L. Hopper, C. Ingvar, P.A.

Kanetsky, R.F. Kefford, M.T. Landi, J. Lang, J. Lubinski, R. Mackie,

J. Malvehy, G.J. Mann, N.G. Martin, G.W. Montgomery, F.A. Van Nieuw-

poort, S. Novakovic, H. Olsson, S. Puig, M. Weiss, W. Van Workum, D. Ze-

lenika, K.M. Brown, A.M. Goldstein, E.M. Gillanders, A. Boland, P. Galan,

BIBLIOGRAPHY 241

D.E. Elder, N.A. Gruis, N.K. Hayward, G.M. Lathrop, J.H. Barrett, and

J.A. Newton Bishop. Genome-wide association study identifies three loci

associated with melanoma risk. Nature Genetics, 41(8):920–25, 2009.

[107] K.M. Brown, S. MacGregor, G.W. Montgomery, D.W. Craig, Z.Z. Zhao,

K. Iyadurai, A. K Henders, N. Homer, M.J. Campbell, M. Stark, S. Thomas,

H. Schmid, E.A. Holland, E.M. Gillanders, D.L. Duffy, J.A. Maskiell, J. Je-

tann, M. Ferguson, D.A. Stephan, A.E. Cust, D. Whiteman, A. Green,

H. Olsson, S. Puig, P. Ghiorzo, J. Hansson, F. Demenais, A.M. Goldstein,

N.A. Gruis, D.E. Elder, J.N. Bishop, R.F. Kefford, G.G. Giles, B.K. Arm-

strong, J.F. Aitken, J.L. Hopper, N.G. Martin, J.M. Trent, G.J. Mann, and

N.K. Hayward. Common sequence variants on 20q11.22 confer melanoma

susceptibility. Nature Genetics, 40(7):838–40, 2008.

[108] D.F. Gudbjartsson, P. Sulem, S.N. Stacey, A.M. Goldstein, T. Rafnar,

B. Sigurgeirsson, K.R. Benediktsdottir, K. Thorisdottir, R. Ragnarsson,

S.G. Sveinsdottir, V. Magnusson, A. Lindblom, K. Kostulas, R. Botella-

Estrada, V. Soriano, P. Juberias, M. Grasa, B. Saez, R. Andres, D. Scherer,

P. Rudnai, E. Gurzau, K. Koppova, L.A. Kiemeney, M. Jakobsdottir,

S. Steinberg, A. Helgason, S. Gretarsdottir, M.A. Tucker, J.I. Mayordomo,

E. Nagore, R. Kumar, J. Hansson, J.H. Olafsson, J. Gulcher, A. Kong,

U. Thorsteinsdottir, and K. Stefansson. ASPI and TYR pigmentation vari-

ants associate with cutaneous melanoma and basal cell carcinoma. Nat

Genet, 40(7):886–91, 2008.

[109] P. Sulem, D.F. Gudbjartsson, S.N. Stacey, A. Helgason, T. Rafnar, K.P.

Magnusson, A. Manolescu, A. Karason, A. Palsson, G. Thorleifsson,

M. Jakobsdottir, S. Steinberg, S. Palsson, F. Jonasson, B. Sigurgeirsson,

242 BIBLIOGRAPHY

K. Thorisdottir, R. Ragnarsson, K.R. Benediktsdottir, K.K. Aben, L.A.

Kiemeney, J.H. Olafsson, J. Gulcher, A. Kong, U. Thorsteinsdottir, and

K. Stefansson. Genetic determinants of hair, eye and skin pigmentation in

Europeans. Nature Genetics, 39(12):1443–52, 2007.

[110] A. Kamb, D. Shattuck-Eidens, R. Eeles, Q. Liu, N.A. Gruis, W. Ding,

C. Hussey, T. Tran, Y. Miki, J. Weaver-Feldhaus, M. Mcclure, J.F. Aitken,

D.E. Anderson, W. Bergman, R. Frants, D.E. Goldgar, A. Green, R. Maclen-

nan, N.G. Martin, L.J. Meyer, P. Youl, J.J. Zone, M.H. Skolnick, and L.A.

Cannon-Albright. Analysis of the p16 gene (CDKN2) as a candidate for

the chromosome 9p melanoma susceptibility locus. Nat Genet, 8(1):22–26,

1994.

[111] A.M. Goldstein, M. Chan, M. Harland, N.K. Hayward, F. Demenais,

D. Timothy Bishop, E. Azizi, W. Bergman, G. Bianchi-Scarra, W. Bruno,

D. Calista, L.A. Cannon Albright, V. Chaudru, A. Chompret, F. Cuellar,

D.E. Elder, P. Ghiorzo, E.M. Gillanders, N.A. Gruis, J. Hansson, D. Hogg,

E.A. Holland, P.A. Kanetsky, R.F. Kefford, M. Teresa Landi, J. Lang, S.A.

Leachman, R.M. Mackie, V. Magnusson, G.J. Mann, J. Newton Bishop,

J.M. Palmer, S. Puig, J.A. Puig-Butille, M. Stark, H. Tsao, M.A. Tucker,

L. Whitaker, and E. Yakobson. Features associated with germline CDKN2A

mutations: a GenoMEL study of melanoma-prone families from three con-

tinents. Journal of Medical Genetics, 44(2):99–106, 2007.

[112] M. Berwick, I. Orlow, A.J. Hummer, B.K. Armstrong, A. Kricker, L.D.

Marrett, R.C. Millikan, S.B. Gruber, H. Anton-Culver, R. Zanetti, R.P.

Gallagher, T. Dwyer, T.R. Rebbeck, P.A. Kanetsky, K. Busam, L. From,

U. Mujumdar, H. Wilcox, and C.B. Begg. The prevalence of CDKN2A

BIBLIOGRAPHY 243

germ-line mutations and relative risk for cutaneous malignant melanoma:

An international population-based study. Cancer Epidemiology Biomarkers

& Prevention, 15(8):1520–25, 2006.

[113] D.T. Bishop, F. Demenais, A.M. Goldstein, W. Bergman, J.N. Bishop,

B. Bressac-De Paillerets, A. Chompret, P. Ghiorzo, N. Gruis, J. Hansson,

M. Harland, N. Hayward, E.A. Holland, G.J. Mann, M. Mantelli, D. Nancar-

row, A. Platz, and M.A. Tucker. Geographical variation in the penetrance

of CDKN2A mutations for melanoma. Journal of the National Cancer In-

stitute, 94(12):894–903, 2002.

[114] C.B. Begg, I. Orlow, A.J. Hummer, B.K. Armstrong, A. Kricker, L.D. Mar-

rett, R.C. Millikan, S.B. Gruber, H. Anton-Culver, R. Zanetti, R.P. Gal-

lagher, T. Dwyer, T.R. Rebbeck, N. Mitra, K. Busam, L. From, M. Berwick,

for the Genes Environment Melanoma Study Group. Lifetime risk of

melanoma in CDKN2A mutation carriers in a population-based sample.

Journal of the National Cancer Institute, 97(20):1507–15, 2005.

[115] J. Aitken, J. Welch, D. Duffy, A. Milligan, A. Green, N. Martin, and N. Hay-

ward. CDKN2A variants in a population-based sample of Queensland fami-

lies with melanoma. Journal of the National Cancer Institute, 91(5):446–52,

1999.

[116] N. Soufir, M.F. Avril, A. Chompret, F. Demenais, J. Bombled, A. Spatz,

D. Stoppa-Lyonnet, J. Benard, and B. Bressac-De Paillerets. Prevalence of

p16 and CDK4 germline mutations in 48 melanoma-prone families in France.

The French Familial Melanoma Study group. Human Molecular Genetics,

7(2):209–16, 1998.

244 BIBLIOGRAPHY

[117] A.M. Goldstein, J.P. Struewing, A. Chidambaram, M.C. Fraser, and M.A.

Tucker. Genotype-phenotype relationships in U.S. melanoma-prone fami-

lies with CDKN2A and CDK4 mutations. Journal of the National Cancer

Institute, 92(12):1006–10, 2000.

[118] A.M. Goldstein, A. Chidambaram, A. Halpern, E.A. Holly, D.I. Guerry,

R. Sagebiel, D.E. Elder, and M.A. Tucker. Rarity of CDK4 germline muta-

tions in familial melanoma. Melanoma Research, 12(1):51–55, 2002.

[119] P. Valverde, E. Healy, I. Jackson, J.L. Rees, and A.J. Thody. Variants of

the melanocyte-stimulating hormone receptor gene are associated with red

hair and fair skin in humans. Nature Genetics, 11(3):328–30, 1995.

[120] N.F. Box, J.R. Wyeth, L.E. O’Gorman, N.G. Martin, and R.A. Sturm.

Characterization of melanocyte stimulating hormone receptor variant alleles

in twins with red hair. Human Molecular Genetics, 6(11):1891–1897, 1997.

[121] J.S. Palmer, D.L. Duffy, N.F. Box, J.F. Aitken, L.E. O’Gorman, A.C. Green,

N.K. Hayward, N.G. Martin, and R.A. Sturm. Melanocortin-1 receptor

polymorphisms and risk of melanoma: Is the association explained solely by

pigmentation phenotype? American Journal of Human Genetics, 66(1):176–

86, 2000.

[122] P. Valverde, E. Healy, S. Sikkink, F. Haldane, A.J. Thody, A. Carothers,

I.J. Jackson, and J.L. Rees. The Asp84Glu variant of the melanocortin 1

receptor (MC1R) is associated with melanoma. Human Molecular Genetics,

5(10):1663–66, 1996.

BIBLIOGRAPHY 245

[123] S. Raimondi, F. Sera, S. Gandini, S. Iodice, S. Caini, P. Maisonneuve, and

M.C. Fargnoli. MC1R variants, melanoma and red hair color phenotype: A

meta-analysis. International Journal of Cancer, 122(12):2753–60, 2008.

[124] J. Han, P. Kraft, H. Nan, Q. Guo, C. Chen, A. Qureshi, S.E. Hankinson, F.B.

Hu, D.L. Duffy, Z.Z. Zhao, N.G. Martin, G.W. Montgomery, N.K. Hayward,

G. Thomas, R.N. Hoover, S. Chanock, and D.J. Hunter. A genome-wide

association study identifies novel alleles associated with hair color and skin

pigmentation. PLoS Genetics, 4(5):e1000074, 2008.

[125] C. Kennedy, J. Ter Huurne, M. Berkhout, N. Gruis, M. Bastiaens,

W. Bergman, R. Willemze, and J.N.B. Bavinck. Melanocortin 1 receptor

(MC1R) gene variants are associated with an increased risk for cutaneous

melanoma which is largely independent of skin type and hair color. Journal

of Investigative Dermatology, 117(2):294–300, 2001.

[126] N.F. Box, D.L. Duffy, W. Chen, M. Stark, N.G. Martin, R.A. Sturm, and

N.K. Hayward. MC1R genotype modifies risk of melanoma in families

segregating CDKN2A mutations. American Journal of Human Genetics,

69(4):765–73, 2001.

[127] A.M. Goldstein, M.T. Landi, S. Tsang, M.C. Fraser, D.J. Munroe, and M.A.

Tucker. Association of MC1R variants and risk of melanoma in melanoma-

prone families with CDKN2A mutations. Cancer Epidemiol Biomarkers &

Prevention, 14(9):2208–12, 2005.

[128] M. Shahbazi, V. Pravica, N. Nasreen, H. Fakhoury, A.A. Fryer, R.C.

Strange, P.E. Hutchinson, J.E. Osborne, J.T. Lear, A.G. Smith, and I.V.

246 BIBLIOGRAPHY

Hutchinson. Association between functional polymorphism in EGF gene

and malignant melanoma. Lancet, 359(9304):397–401, 2002.

[129] S.L. McCarron, A.C. Bateman, J.M. Theaker, and W.M. Howell. Egf +61

gene polymorphism and susceptibility to and prognostic markers in cuta-

neous malignant melanoma. International Journal of Cancer, 107(4):673–5,

2003.

[130] M.R. James, N.K. Hayward, T. Dumenil, G.W. Montgomery, N.G. Martin,

and D.L. Duffy. Epidermal growth factor gene (EGF) polymorphism and risk

of melanocytic neoplasia. Journal of Investigative Dermatology, 123(4):760–

2, 2004.

[131] J.A. Randerson-Moor, R. Gaut, F. Turner, L. Whitaker, J.H. Barrett,

I.D. Silva, A.J. Swerdlow, D.T. Bishop, and J.A.N. Bishop. The rela-

tionship between the epidermal growth factor (EGF) 5’ utr variant A61G

and melanoma/nevus susceptibility. Journal of Investigative Dermatology,

123(4):755–59, 2004.

[132] K.L. Amend, J.T. Elder, L.P. Tomsho, J.D. Bonner, T.M. Johnson,

J. Schwartz, M. Berwick, and S.B. Gruber. EGF gene polymorphism and

the risk of incident primary melanoma. Cancer Research, 64(8):2668–72,

2004.

[133] National Health and Medical Research Council. The management of cuta-

neous melanoma. Technical report, NHMRC, 1999.

[134] K.S. Smalley, N.K. Haass, P.A. Brafford, M. Lioni, K.T. Flaherty, and

M. Herlyn. Multiple signaling pathways must be targeted to overcome drug

BIBLIOGRAPHY 247

resistance in cell lines derived from melanoma metastases. Molecular Cancer

Therapeutics, 5(5):1136–44, 2006.

[135] A. Breslow. Thickness, cross-sectional areas and depth of invasion in prog-

nosis of cutaneous melanoma. Annals of Surgery, 172(5):902–8, 1970.

[136] A.C. Allen and S. Spitz. Malignant melanoma - a clinicopathological analysis

of the criteria for diagnosis and prognosis. Cancer, 6(1):1–45, 1953.

[137] C.M. Balch, J.E. Gershenwald, S.J. Soong, J.F. Thompson, M.B. Atkins,

D.R. Byrd, A.C. Buzaid, A.J. Cochran, D.G. Coit, S. Ding, A.M. Egger-

mont, K.T. Flaherty, P.A. Gimotty, J.M. Kirkwood, K.M. Mcmasters, Jr.

Mihm, M.C., D.L. Morton, M.I. Ross, A.J. Sober, and V.K. Sondak. Fi-

nal version of 2009 AJCC melanoma staging and classification. Journal of

Clinical Oncology, 27(36):6199–6206, 2009.

[138] R.L. Barnhill, J.A. Fine, G.C. Roush, and M. Berwick. Predicting five-year

outcome for patients with cutaneous melanoma in a population-based study.

Cancer, 78(3):427–32, 1996.

[139] C.M. Balch, S.J. Soong, J.E. Gershenwald, J.F. Thompson, D.S. Reintgen,

N. Cascinelli, M. Urist, K.M. Mcmasters, M.I. Ross, J.M. Kirkwood, M.B.

Atkins, J.A. Thompson, D.G. Coit, D. Byrd, R. Desmond, Y.T. Zhang, P.Y.

Liu, G.H. Lyman, and A. Morabito. Prognostic factors analysis of 17,600

melanoma patients: Validation of the American Joint Committee on Cancer

melanoma staging system. Journal of Clinical Oncology, 19(16):3622–34,

2001.

[140] C.M. Balch, T.M. Murad, S.J. Soong, A.L. Ingalls, N.B. Halpern, and W.A.

Maddox. A multifactorial analysis of melanoma: prognostic histopatholog-

248 BIBLIOGRAPHY

ical features comparing Clark’s and Breslow’s staging methods. Annals of

Surgery, 188(6):732–42, 1978.

[141] C. Garbe, P. Buttner, J. Bertz, G. Burg, B. D’hoedt, H. Drepper,

I. Guggenmoos-Holzmann, W. Lechner, A. Lippold, and C.E. Orfanos. Pri-

mary cutaneous melanoma. Identification of prognostic groups and estima-

tion of individual prognosis for 5093 patients. Cancer, 75(10):2484–91, 1995.

[142] N. Cascinelli, E. Marubini, A. Morabito, and R. Bufalino. Prognostic factors

for stage I melanoma of the skin: a review. Statistics in Medicine, 4(3):265–

78, 1985.

[143] K. Sondergaard and G. Schou. Survival with primary cutaneous malignant

melanoma, evaluated from 2012 cases. A multivariate regression analysis.

Virchows Archives: A. Pathological Anatomy & Histopathology, 406(2):179–

95, 1985.

[144] S. Retsas, K. Henry, M.Q. Mohammed, and K. Macrae. Prognostic factors

of cutaneous melanoma and a new staging system proposed by the American

Joint Committee on Cancer (AJCC): Validation in a cohort of 1284 patients.

European Journal of Cancer, 38(4):511–6, 2002.

[145] D.S. Rigel, G.S. Rogers, and R.J. Friedman. Prognosis of malignant

melanoma. Dermatologic Clinics, 3(2):309–14, 1985.

[146] C.M. Balch, S. Soong, M.I. Ross, M.M. Urist, C.P. Karakousis, W.J. Tem-

ple, M.C. Mihm, R.L. Barnhill, W.R. Jewell, H.J. Wanebo, and R. Harrison.

Long-term results of a multi-institutional randomized trial comparing prog-

nostic factors and surgical results for intermediate thickness melanomas (1.0

BIBLIOGRAPHY 249

to 4.0 mm). Intergroup Melanoma Surgical Trial. Annals of Surgical On-

cololgy, 7(2):87–97, 2000.

[147] P.J. Heenan, D.R. English, C. Darcy, J. Holman, and B.K. Armstrong. Sur-

vival among patients with clinical stage I cutaneous malignant-melanoma

diagnosed in Western Australia in 1975/1976 and 1980/1981. Cancer,

68(9):2079–87, 1991.

[148] A. Downing, X.Q. Yu, J. Newton-Bishop, and D. Forman. Trends in prog-

nostic factors and survival from cutaneous melanoma in Yorkshire, UK and

New South Wales, Australia between 1993 and 2003. International Journal

of Cancer, 123(4):861–66, 2008.

[149] H.J. Cohen, E. Cox, K. Manton, and M. Woodbury. Malignant-melanoma

in the elderly. Journal of Clinical Oncology, 5(1):100–106, 1987.

[150] P.J. Heenan and C.D.J. Holman. A histological comparison of cutaneous

malignant-melanoma between the Oxford region and Western Australia.

Histopathology, 6(6):703–16, 1982.

[151] P. Hersey, R.W. Sillar, C.G. Howe, R.C. Burton, S.V. Darbar, H.M. Foster,

S.M. Collins, D.E. Bradley, and D. Owens. Factors related to the presenta-

tion of patients with thick primary melanomas. Medical Journal of Australia,

154(9):583–87, 1991.

[152] J. Levine, A. Kopf, D. Rigel, R. Bart, P. Hennessey, R. Friedman, and

M. Mintzis. Correlation of thicknesses of superficial spreading malignant

melanomas and ages of patients. The Journal of Dermatologic Surgery and

Oncology, 7(4):311–16, 1981.

250 BIBLIOGRAPHY

[153] J.E. Osborne and P.E. Hutchinson. Clinical correlates of Breslow thickness

of malignant melanoma. British Journal of Dermatology, 144(3):476–83,

2001.

[154] L. Bucchi, M. Serafini, and G. Lanzanova. Breslow thickness of cuta-

neous malignant-melanoma in Ravenna (northern Italy) 1981-1990. Tumori,

78(2):94–97, 1992.

[155] J. Melia, E.J. Cooper, T. Frost, R. Grahambrown, J. Hunter, A. Marsden,

A. Duvivier, J. White, S. Whitehead, A.P. Warin, M. Wroughton, R. Ell-

man, and J. Chamberlain. Cancer-research campaign health-education pro-

gram to promote the early detection of cutaneous malignant-melanoma.

2.Characteristics and incidence of melanoma. British Journal of Dermatol-

ogy, 132(3):414–21, 1995.

[156] J.L. Schwartz, T.S. Wang, T.A. Hamilton, L. Lowe, V.K. Sondak, and T.M.

Johnson. Thin primary cutaneous melanomas: associated detection pat-

terns, lesion characteristics, and patient characteristics. Cancer, 95(7):1562–

8, 2002.

[157] D. Duke, J. Castresana, L. Lucchina, T.H. Lee, A.J. Sober, W.P. Carey, D.E.

Elder, and R.L. Barnhill. Familial cutaneous melanoma and two-mutational-

event modeling. Cancer, 72(11):3239–43, 1993.

[158] N.M. Fisher, J.V. Schaffer, M. Berwick, and J.L. Bolognia. Breslow depth of

cutaneous melanoma: Impact of factors related to surveillance of the skin,

including prior skin biopsies and family history of melanoma. Journal of the

American Academy of Dermatology, 53(3):393–406, 2005.

BIBLIOGRAPHY 251

[159] F. Grange, A. Chompret, M. Guilloud-Bataille, J.C. Guillaume, A. Mar-

gulis, M. Prade, F. Demenais, and M.F. Avril. Comparison between fa-

milial and nonfamilial melanoma in France. Archives of Dermatology,

131(10):1154–9, 1995.

[160] I. Okamoto, F. Roka, J. Krogler, G. Endler, S. Kaufmann, S. Tockner,

C. Marsik, B. Jilma, C. Mannhalter, O. Wagner, and H. Pehamberger.

The EGF A61G polymorphism is associated with disease-free period and

survival in malignant melanoma. Journal of Investigative Dermatology,

126(10):2242–46, 2006.

[161] T.B. Fitzpatrick. Soleil et peau. Journal of Medical Esthetics, 2:33–34, 1975.

[162] R.A. Fisher. On the interpretation of Chi2 from contingency tables, and the

calculation of p. Journal of the Royal Statistical Society, 85(1):87–94, 1922.

[163] D.C. Whiteman, M. Stickley, P. Watt, M.C. Hughes, M.B. Davis, and A.C.

Green. Anatomic site, sun exposure, and risk of cutaneous melanoma. Jour-

nal of Clinical Oncology, 24(19):3172–77, 2006.

[164] J.C. Barrett, B. Fry, J. Maller, and M.J. Daly. Haploview: analysis and

visualization of LD and haplotype maps. Bioinformatics, 21(2):263–5, 2005.

[165] B. Rhead, D. Karolchik, R.M. Kuhn, A.S. Hinrichs, A.S. Zweig, P.A. Fu-

jita, M. Diekhans, K.E. Smith, K.R. Rosenbloom, B.J. Raney, A. Pohl,

M. Pheasant, L.R. Meyer, K. Learned, F. Hsu, J. Hillman-Jackson, R.A.

Harte, B. Giardine, T.R. Dreszer, H. Clawson, G.P. Barber, D. Haussler,

and W.J. Kent. The UCSC genome browser database: update 2010. Nucleic

Acids Research, 38(Database issue):D613–9, 2010.

252 BIBLIOGRAPHY

[166] P.I. De Bakker, R. Yelensky, I. Pe’er, S.B. Gabriel, M.J. Daly, and D. Alt-

shuler. Efficiency and power in genetic association studies. Nature Genetics,

37(11):1217–23, 2005.

[167] P. Sulem, D.F. Gudbjartsson, S.N. Stacey, A. Helgason, T. Rafnar,

M. Jakobsdottir, S. Steinberg, S.A. Gudjonsson, A. Palsson, G. Thorleifsson,

S. Palsson, B. Sigurgeirsson, K. Thorisdottir, R. Ragnarsson, K.R. Benedik-

tsdottir, K.K. Aben, S.H. Vermeulen, A.M. Goldstein, M.A. Tucker, L.A.

Kiemeney, J.H. Olafsson, J. Gulcher, A. Kong, U. Thorsteinsdottir, and

K. Stefansson. Two newly identified genetic determinants of pigmentation

in Europeans. Nature Genetics, 40(7):835–7, 2008.

[168] D.L. Duffy, G.W. Montgomery, W. Chen, Z.Z. Zhao, L. Le, M.R. James,

N.K. Hayward, N.G. Martin, and R.A. Sturm. A three-single-nucleotide

polymorphism haplotype in intron 1 of OCA2 explains most human eye-

color variation. American Journal of Human Genetics, 80(2):241–52, 2007.

[169] T.R. Rebbeck, P.A. Kanetsky, A.H. Walker, R. Holmes, A.C. Halpern,

L.M. Schuchter, D.E. Elder, and D. Guerry. P gene as an inherited

biomarker of human eye color. Cancer Epidemiology Biomarkers & Pre-

vention, 11(8):782–84, 2002.

[170] R.A. Sturm, D.L. Duffy, Z.Z. Zhao, F.P. Leite, M.S. Stark, N.K. Hayward,

N.G. Martin, and G.W. Montgomery. A single SNP in an evolutionary

conserved region within intron 86 of the HERC2 gene determines human

blue-brown eye color. American Journal of Human Genetics, 82(2):424–31,

2008.

BIBLIOGRAPHY 253

[171] H. Eiberg, J. Troelsen, M. Nielsen, A. Mikkelsen, J. Mengel-from, K.W.

Kjaer, and L. Hansen. Blue eye color in humans may be caused by a per-

fectly associated founder mutation in a regulatory element located within

the HERC2 gene inhibiting OCA2 expression. Human Genetics, 123(2):177–

87, 2008.

[172] H. Shen, Z. Liu, S.S. Strom, M.R. Spitz, J.E. Lee, J.E. Gershenwald, M.I.

Ross, P.F. Mansfield, M. Duvic, H.N. Ananthaswamy, and Q. Wei. p53

codon 72 Arg homozygotes are associated with an increased risk of cutaneous

melanoma. Journal of Investigative Dermatology, 121(6):1510–4, 2003.

[173] Australian Bureau of Statistics. Population by age and sex, regions of Aus-

tralia, May 2008.

[174] W.C. Lee. Searching for disease-susceptibility loci by testing for Hardy-

Weinberg disequilibrium in a gene bank of affected individuals. American

Journal of Epidemiology, 158(5):397–400, 2003.

[175] International Hapmap Consortium. The international Hapmap project. Na-

ture, 426(6968):789–96, 2003.

[176] J. Stankovich, C.J. Cox, R.B. Tan, D.S. Montgomery, S.J. Huxtable, J.P.

Rubio, M.G. Ehm, L. Johnson, H. Butzkueven, T.J. Kilpatrick, T.P. Speed,

A.D. Roses, M. Bahlo, and S.J. Foote. On the utility of data from the

international Hapmap project for Australian association studies. Human

Genetics, 119(1-2):220–2, 2006.

[177] I.W. Webster. Editorial: Busselton, a community study in cardiovascular

disease. Australia & New Zealand Journal of Medicine, 4(4):412–4, 1974.

254 BIBLIOGRAPHY

[178] A.L. James, M.W. Knuiman, M.L. Divitini, J. Hui, M. Hunter, L.J. Palmer,

G. Maier, and A.W. Musk. Changes in the prevalence of asthma in adults

since 1966: the Busselton health study. European Respiratory Journal,

35(2):273–8, 2010.

[179] D.L. Duffy, Z.Z. Zhao, R.A. Sturm, N.K. Hayward, N.G. Martin, and G.W.

Montgomery. Multiple pigmentation gene polymorphisms account for a sub-

stantial proportion of risk of cutaneous malignant melanoma. Journal of

Investigative Dermatology, 130(2):520–8, 2010.

[180] P. Mccullagh and J. Nelder. Generalized Linear Models, Second Edition

(Chapman & Hall/CRC Monographs on Statistics & Applied Probability).

Chapman and Hall/CRC, 1989.

[181] W.J. Gauderman. Sample size requirements for association studies of gene-

gene interaction. American Journal of Epidemiology, 155(5):478–84, 2002.

[182] B.R. Cassileth, W.H. Clark, R.M. Heiberger, V. March, and A. Tenaglia.

Relationship between patients early recognition of melanoma and depth of

invasion. Cancer, 49(1):198–200, 1982.

[183] L. Temoshok, R.J. Diclemente, D.M. Sweet, M.S. Blois, and R.W. Sagebiel.

Factors related to patient delay in seeking medical attention for cutaneous

malignant-melanoma. Cancer, 54(12):3048–53, 1984.

[184] L. Temoshok, B.W. Heller, R.W. Sagebiel, M.S. Blois, D.M. Sweet, R.J.

Diclemente, and M.L. Gold. The relationship of psychosocial factors to

prognostic indicators in cutaneous malignant-melanoma. Journal of Psy-

chosomatic Research, 29(2):139–53, 1985.

BIBLIOGRAPHY 255

[185] J.E.J. Krige, S. Isaacs, D.A. Hudson, H.S. King, R.M. Strover, and C.A.

Johnson. Delay in the diagnosis of cutaneous malignant-melanoma - a

prospective-study in 250 patients. Cancer, 68(9):2064–68, 1991.

[186] M. Baccard, S. Chevret, P. Chemaly, and P. Morel. Delay in diagnosis of

melanoma. Prospective study in 102 patients. Annales De Dermatologie Et

De Venereologie, 124(9):601–606, 1997.

[187] M.A. Richard, J.J. Grob, M.F. Avril, M. Delaunay, X. Thirion, P. Wolken-

stein, P. Souteyrand, B. Dreno, J.J. Bonerandi, S. Dalac, L. Machet, J.C.

Guillaume, J. Chevrant-Breton, C. Vilmer, F. Aubin, B. Guillot, M. Beylot-

Barry, C. Lok, N. Raison-Peyron, and P. Chemaly. Melanoma and tu-

mor thickness - challenges of early diagnosis. Archives of Dermatology,

135(3):269–274, 1999.

[188] W. Liu, J.P. Dowling, W.K. Murray, G.A. Mcarthur, J.F. Thompson,

R. Wolfe, and J.W. Kelly. Rate of growth in melanomas - characteristics

and associations of rapidly growing melanomas. Archives of Dermatology,

142(12):1551–1558, 2006.

[189] Cancer Council Australia. Sunsmart : campaigns and events, October 2009.

[190] R.T. Vollmer and H.F. Seigler. Using a continuous transformation of the

Breslow thickness for prognosis in cutaneous melanoma. American Journal

of Clinical Pathology, 115(2):205–12, 2001.

[191] W. Stadelmann, D. Rapaport, S. Soong, D. Reintgen, A. Buzaid, and

C. Balch. Prognostic clinical and pathologic features. In C. Balch,

A. Houghton, A. Sober, and S. Soong, editors, Cutaneous Melanoma, vol-

256 BIBLIOGRAPHY

ume 3rd edition, pages 11–35. Quality Medical Publishing, Inc., St. Louis,

1998.

[192] E.K. Silverman and L.J. Palmer. Case-control association studies for the

genetics of complex respiratory diseases. American Journal of Respiratory

Cell & Molecular Biology, 22(6):645–8, 2000.

[193] C.B. Begg, A.J. Hummer, U. Mujumdar, B.K. Armstrong, A. Kricker, L.D.

Marrett, R.C. Millikan, S.B. Gruber, H.A. Culver, R. Zanetti, R.P. Gal-

lagher, T. Dwyer, T.R. Rebbeck, K. Busam, L. From, and M. Berwick. A

design for cancer case-control studies using only incident cases: experience

with the GEM study of melanoma. International Journal of Epidemiology,

35(3):756–64, 2006.

[194] E. De Vries, T.E.C. Nijsten, O. Visser, E. Bastiaannet, S. Van Hattem,

M.L. Janssen-Heijnen, and J.W.W. Coebergh. Superior survival of females

among 10,538 dutch melanoma patients is independent of Breslow thickness,

histologic type and tumor site. Annals of Oncology, 19(3):583–89, 2008.

[195] M. Berwick. Could sun exposure improve melanoma survival? Solar Radi-

ation and Human Health, page 95, 2008.

[196] K.B. Stitzenberg, N.E. Thomas, M. Berwick, H. Anton-Culver, U.J. Mu-

jumdar, and R.C. Millikan. Sociodemographic factors influencing Breslow

thickness at diagnosis for patients with melanoma. Journal of Clinical On-

cology (Meeting Abstracts), 25(18 suppl):6552–, 2007.

[197] P.F. Hanrahan, P. Hersey, and C.A. D’este. Factors involved in presenta-

tion of older people with thick melanoma. Medical Journal of Australia,

169(8):410–14, 1998.

BIBLIOGRAPHY 257

[198] A.J. Chamberlain, L. Fritschi, G.G. Giles, J.P. Dowling, and J.W. Kelly.

Nodular type and older age as the most significant associations of thick

melanoma in Victoria, Australia. Archives of Dermatology, 138(5):609–14,

2002.

[199] R.A. King and W.S. Oetting. Oculocutaneous albinism type 2. In R.P.

Pagon, T.C. Bird, T.C. Dolan, and K. Stephens, editors, GeneReviews.

University of Washington, Seattle, 2010/03/20 edition, 1993.

[200] W.S. Oetting and R.A. King. Molecular basis of albinism: mutations and

polymorphisms of pigmentation genes associated with albinism. Human

Mutation, 13(2):99–115, 1999.

[201] R.A. Sturm. Molecular genetics of human pigmentation diversity. Human

Molecular Genetics, 18(R1):R9–17, 2009.

[202] R.A. Sturm and M. Larsson. Genetics of human iris colour and patterns.

Pigment Cell Melanoma Research, 22(5):544–62, 2009.

[203] D.L. Duffy, N.F. Box, W. Chen, J.S. Palmer, G.W. Montgomery, M.R.

James, N.K. Hayward, N.G. Martin, and R.A. Sturm. Interactive effects of

MC1R and OCA2 on melanoma risk phenotypes. Human Molecular Genet-

ics, 13(4):447–61, 2004.

[204] K. Ryan, A. Phillips, and K. Vousden. Regulation and function of the p53

tumor suppressor protein. Current Opinion in Cell Biology, 13(3):332–37,

2001.

[205] T. Frebourg, N. Barbier, Y.X. Yan, J.E. Garber, M. Dreyfus, Jr. Fraumeni,

J., F.P. Li, and S.H. Friend. Germline p53 mutations in 15 families with Li-

258 BIBLIOGRAPHY

Fraumeni syndrome. American Journal of Human Genetics, 56(3):608–15,

1995.

[206] K.D. Gonzalez, K.A. Noltner, C.H. Buzin, D. Gu, C.Y. Wen-Fong, V.Q.

Nguyen, J.H. Han, K. Lowstuter, J. Longmate, S.S. Sommer, and J.N.

Weitzel. Beyond Li Fraumeni syndrome: Clinical characteristics of families

with p53 germline mutations. Journal of Clinical Oncology, 27(8):1250–56,

2009.

[207] I. Stefanaki, A.J. Stratigos, G. Dimisianos, V. Nikolaou, O. Papadopoulos,

D. Polydorou, H. Gogas, D. Tsoutsos, P. Panagiotou, E. Kanavakis, C. An-

toniou, and A.D. Katsambas. p53 codon 72 Pro homozygosity increases

the risk of cutaneous melanoma in individuals with dark skin complexion

and among noncarriers of melanocortin 1 receptor red hair variants. British

Journal of Dermatology, 156(2):357–62, 2007.

[208] J. Han, G. Cox, D., A. Colditz, G., and J. Hunter, D. The p53 codon 72

polymorphism, sunburns, and risk of skin cancer in US Caucasian women.

Molecular Carcinogenesis, 45(9):694–700, 2006.

[209] C. Li, K. Chen, Z. Liu, L.-E. Wang, J.E. Gershenwald, J.E. Lee, V.G. Prieto,

M. Duvic, E.A. Grimm, and Q. Wei. Polymorphisms of TP53 Arg72Pro, but

not p73 G4C14>A4TA4 and p21 Ser31Arg, contribute to risk of cutaneous

melanoma. Journal of Investigative Dermatology, 128(6):1585–88, 2007.

[210] M.R.A.-E. Hussein and G.S. Wood. Molecular aspects of melanocytic dys-

plastic nevi. Journal of Molecular Diagnostics, 4(2):71–80, 2002.

[211] G. Zhu, G.W. Montgomery, M.R. James, J.M. Trent, N.K. Hayward, N.G.

Martin, and D.L. Duffy. A genome-wide scan for naevus count: linkage to

BIBLIOGRAPHY 259

CDKN2A and to other chromosome regions. European Journal of Human

Genetics, 15(1):94–102, 2006.

[212] U. Sundram, J.D. Harvell, R.V. Rouse, and Y. Natkunam. Expression of the

B-cell proliferation marker MUM1 by melanocytic lesions and comparison

with S100, gp100 (HMB45), and melanA. Modern Pathology, 16(8):802–10,

2003.

[213] H. Davies, G.R. Bignell, C. Cox, P. Stephens, S. Edkins, S. Clegg, J. Teague,

H. Woffendin, M.J. Garnett, W. Bottomley, N. Davis, E. Dicks, R. Ew-

ing, Y. Floyd, K. Gray, S. Hall, R. Hawes, J. Hughes, V. Kosmidou,

A. Menzies, C. Mould, A. Parker, C. Stevens, S. Watt, S. Hooper, R. Wil-

son, H. Jayatilake, B.A. Gusterson, C. Cooper, J. Shipley, D. Hargrave,

K. Pritchard-Jones, N. Maitland, G. Chenevix-Trench, G.J. Riggins, D.D.

Bigner, G. Palmieri, A. Cossu, A. Flanagan, A. Nicholson, J.W. Ho, S.Y.

Leung, S.T. Yuen, B.L. Weber, H.F. Seigler, T.L. Darrow, H. Paterson,

R. Marais, C.J. Marshall, R. Wooster, M.R. Stratton, and P.A. Futreal.

Mutations of the BRAF gene in human cancer. Nature, 417(6892):949–54,

2002.

[214] P. Meyer, C. Sergi, and C. Garbe. Polymorphisms of the BRAF gene pre-

dispose males to malignant melanoma. Journal of Carcinogenesis, 2(1):7,

2003.

[215] S. Jackson, M. Harland, F. Turner, C. Taylor, P.A. Chambers, J. Randerson-

Moor, A.J. Swerdlow, I. Dos Santos Silva, S. Beswick, D.T. Bishop, and J.A.

Newton Bishop. No evidence for BRAF as a melanoma/nevus susceptibility

gene. Cancer Epidemiology, Biomarkers & Prevention, 14(4):913–8, 2005.

260 BIBLIOGRAPHY

[216] K. Laud, C. Kannengiesser, M.F. Avril, A. Chompret, D. Stoppa-Lyonnet,

L. Desjardins, A. Eychene, F. Demenais, G.M. Lenoir, and B. Bressac-

De Paillerets. BRAF as a melanoma susceptibility candidate gene? Cancer

Research, 63(12):3061–5, 2003.

[217] M. Shinozaki, A. Fujimoto, D.L. Morton, and D.S. Hoon. Incidence of braf

oncogene mutation and clinical relevance for primary cutaneous melanomas.

Clinical Cancer Research, 10(5):1753–7, 2004.

[218] S.C. Gorber, M. Tremblay, D. Moher, and B. Gorber. A comparison of

direct vs. self-report measures for assessing height, weight and body mass

index: a systematic review. Obesity Reviews, 8(4):307–26, 2007.

[219] I.A. Van Der Mei, L. Blizzard, A.L. Ponsonby, and T. Dwyer. Validity and

reliability of adult recall of past sun exposure in a case-control study of mul-

tiple sclerosis. Cancer Epidemiology, Biomarkers & Prevention, 15(8):1538–

44, 2006.

[220] L.K. Dennis, E. White, J.A. Lee, A. Kristal, B. Mcknight, and P. Odland.

Constitutional factors and sun exposure in relation to nevi: a population-

based cross-sectional study. American Journal of Epidemiology, 143(3):248–

56, 1996.

[221] M. Berwick and Y.T. Chen. Reliability of reported sunburn history in a

case-control study of cutaneous malignant melanoma. American Journal of

Epidemiology, 141(11):1033–7, 1995.

[222] M.A. Weinstock, G.A. Colditz, W.C. Willett, M.J. Stampfer, B. Rosner, and

F.E. Speizer. Recall (report) bias and reliability in the retrospective assess-

BIBLIOGRAPHY 261

ment of melanoma risk. American Journal of Epidemiology, 133(3):240–5,

1991.

[223] J. Mez, J. Cole, T. Howard, L. Macclellan, O. Stine, J. O’Connell, M. Woz-

niak, B. Stern, J. Sorkin, B. Mitchell, and S. Kittner. Evaluation of self-

reported ethnicity in a case-control population: the stroke prevention in

young women study. BMC Research Notes, 2(1):260, 2009.

[224] L.R. Cardon and L.J. Palmer. Population stratification and spurious allelic

association. Lancet, 361(9357):598–604, 2003.

[225] C.J. Mann. Observational research methods. research design. II: Cohort,

cross sectional, and case-control studies. Emergency Medical Journal,

20(1472-0205):54–60, 2003.

[226] R. Doll, R. Peto, J. Boreham, and I. Sutherland. Mortality in relation to

smoking: 50 years’ observations on male British doctors. British Medical

Journal, 328(7455):1519–28, 2004.

[227] R. Doll, R. Peto, J. Boreham, and I. Sutherland. Mortality from cancer

in relation to smoking: 50 years observations on British doctors. British

Journal of Cancer, 92(3):426–29, 2005.

[228] M.J. Stampfer and G.A. Colditz. Estrogen replacement therapy and coro-

nary heart-disease - a quantitative assessment of the epidemiologic evidence.

Preventive Medicine, 20(1):47–63, 1991.

[229] D. Grady, S.M. Rubin, D.B. Petitti, C.S. Fox, D. Black, B. Ettinger,

V.L. Ernster, and S.R. Cummings. Hormone-therapy to prevent disease

and prolong life in postmenopausal women. Annals of Internal Medicine,

117(12):1016–37, 1992.

262 BIBLIOGRAPHY

[230] D.A. Lawlor and G. Davey Smith. Cardiovascular risk and hormone replace-

ment therapy. Current Opinion in Obstetrics & Gynecology, 18(6):658–65,

2006.

[231] D.A. Lawlor, G. Davey Smith, K.R. Bruckdorfer, D. Kundu, and

S. Ebrahim. Those confounded vitamins: what can we learn from the dif-

ferences between observational versus randomised trial evidence? Lancet,

363(9422):1724–27, 2004.

[232] G. Davey Smith and S. Ebrahim. What can Mendelian randomisation tell

us about modifiable behavioural and environmental exposures? British

Medical Journal, 330(7499):1076–9, 2005.

[233] G. Davey Smith and S. Ebrahim. ‘Mendelian randomization’: can genetic

epidemiology contribute to understanding environmental determinants of

disease? International Journal of Epidemiology, 32(1):1–22, 2003.

[234] L.A. Bazzano, K. Reynolds, K.N. Holder, and J. He. Effect of folic acid

supplementation on risk of cardiovascular diseases - a meta-analysis of ran-

domized controlled trials. Journal of the American Medical Association,

296(22):2720–26, 2006.

[235] G. Davey Smith and S. Ebrahim. Mendelian randomization: prospects,

potentials, and limitations. International Journal of Epidemiology, 33(1):30–

42, 2004.

[236] D.A. Lawlor, G. Davey Smith, and S. Ebrahim. Socioeconomic position and

hormone replacement therapy use: explaining the discrepancy in evidence

from observational and randomized controlled trials. American Journal of

Public Health, 94(12):2149–54, 2004.

BIBLIOGRAPHY 263

[237] G. Davey Smith, Y. Ben-Shlomo, and J. Lynch. Life course approaches to

inequalities in coronary heart disease risk. In S.A. Stansfeld, M.G. Mar-

mot, editors, Stress and the heart. Psychosocial pathways to coronary heart

disease. BMJ Books, London, 2002.

[238] M.J. Thun, R. Peto, A.D. Lopez, J.H. Monaco, S.J. Henley, C.W. Heath,

and R. Doll. Alcohol consumption and mortality among middle-aged and

elderly US adults. New England Journal of Medicine, 337(24):1705–14, 1997.

[239] S. Ebrahim and G. Davey Smith. Mendelian randomization: can genetic epi-

demiology help redress the failures of observational epidemiology? Human

Genetics, 123(1):15–33, 2008.

[240] S. Macmahon, R. Peto, R. Collins, J. Godwin, S. Macmahon, J. Cutler,

P. Sorlie, R. Abbott, R. Collins, J. Neaton, R. Abbott, A. Dyer, and J. Stam-

ler. Blood pressure, stroke, and coronary heart disease : Part 1, prolonged

differences in blood pressure: prospective observational studies corrected for

the regression dilution bias. The Lancet, 335(8692):765–74, 1990.

[241] G. Davey Smith and A.N. Phillips. Inflation in epidemiology: “the proof and

measurement of association between two things” revisited. British Medical

Journal, 312(7047):1659–61, 1996.

[242] M. E. Baum, C.A.S. and S. Stillman. IVREG2: Stata module for extended

instrumental variables/2SLS and GMM estimation, 2002. Statistical Soft-

ware Components.

[243] D.A. Lawlor, R.M. Harbord, J.A. Sterne, N. Timpson, and G. Davey Smith.

Mendelian randomization: Using genes as instruments for making causal

inferences in epidemiology. Statistics in Medicine, 27(8):1133–63, 2007.

264 BIBLIOGRAPHY

[244] D. Staiger and J.H. Stock. Instrumental variables regression with weak

instruments. Econometrica, 65(3):557–86, 1997.

[245] J.D. Sargan. The estimation of economic relationships using instrumental

variables. Econometrica, 26(3):393–415, 1958.

[246] J.M. Wooldridge. Introductory Econometrics: A Modern Approach. Thom-

son Learning, New York, 2nd edition, 2003.

[247] J. Durbin. Errors in variables. Review of the International Statistical Insti-

tute, 22:23–32, 1954.

[248] D.M. Wu. Alternative tests of independence between stochastic regressors

and disturbances. Econometrica, 41(4):733–50, 1973.

[249] J. Hausman. Specication tests in econometrics. Econometrica, 46(3):1251–

71, 1978.

[250] M.B. Katan. Apolipoprotein-E isoforms, serum-cholesterol, and cancer, Mar

1986.

[251] S. Trompet, J.W. Jukema, M.B. Katan, G.J. Blauw, N. Sattar, B. Buckley,

M. Caslake, I. Ford, J. Shepherd, R.G. Westendorp, and A.J. De Craen.

Apolipoprotein E genotype, plasma cholesterol, and cancer: a Mendelian

randomization study. Am J Epidemiol, 170(11):1415–21, 2009.

[252] K.H. Allin, B.G. Nordestgaard, J. Zacho, A. Tybjaerg-Hansen, and S.E. Bo-

jesen. C-reactive protein and the risk of cancer: a Mendelian randomization

study. Journal of the National Cancer Institute, 102(3):202–6, 2010.

BIBLIOGRAPHY 265

[253] S.J. Lewis. Mendelian randomization as applied to coronary heart disease,

including recent advances incorporating new technology. Circulation: Car-

diovascular Genetics, 3(1):109–17, 2010.

[254] G. Pare and S.S. Anand. Mendelian randomisation, triglycerides, and CHD.

Lancet, 375(9726):1584–6, 2010.

[255] J. Danesh and M.B. Pepys. C-reactive protein and coronary disease: is there

a causal link? Circulation, 120(21):2036–9, 2009.

[256] A. Ioannidis, E. Ikonomi, N.L. Dimou, L. Douma, and P.G. Bagos. Poly-

morphisms of the insulin receptor and the insulin receptor substrates genes

in polycystic ovary syndrome: a Mendelian randomization meta-analysis.

Molecular Genetics & Metabolism, 99(2):174–83, 2010.

[257] N.J. Timpson, D.A. Lawlor, R.M. Harbord, T.R. Gaunt, I.N.M. Day, L.J.

Palmer, A.T. Hattersley, S. Ebrahim, G.D.O. Lowe, A. Rumley, and G.

Davey Smith. C-reactive protein and its role in metabolic syndrome:

Mendelian randomisation study. Lancet, 366(9501):1954–59, 2005.

[258] P.A. Mccaskie, G. Cadby, J. Hung, B.M. Mcquillan, C.M. Chapman, K.W.

Carter, P.L. Thompson, L.J. Palmer, and J.P. Beilby. The C-480T hepatic

lipase polymorphism is associated with HDL-C but not with risk of coronary

heart disease. Clinical Genetics, 70(2):114–21, 2006.

[259] K.E. Lohmueller, C.L. Pearce, M. Pike, E.S. Lander, and J.N. Hirschhorn.

Meta-analysis of genetic association studies supports a contribution of

common variants to susceptibility to common disease. Nature Genetics,

33(2):177–82, 2003.

[260] J. Fox. SEM: Structural equation models, 2009.

266 BIBLIOGRAPHY

[261] R Development Core Team. R: A Language and Environment for Statistical

Computing. Vienna, 2005.

[262] K.W. Carter, P.A. Mccaskie, and L.J. Palmer. SimHap GUI: an intuitive

graphical user interface for genetic association analysis. BMC Bioinformat-

ics, 9:557, 2008.

[263] J.A. Newton-Bishop, S. Beswick, J. Randerson-Moor, Y.M. Chang, P. Af-

fleck, F. Elliott, M. Chan, S. Leake, B. Karpavicius, S. Haynes, K. Kukalizch,

L. Whitaker, S. Jackson, E. Gerry, C. Nolan, C. Bertram, J. Marsden, D.E.

Elder, J.H. Barrett, and D.T. Bishop. Serum 25-hydroxyvitamin d3 lev-

els are associated with Breslow thickness at presentation and survival from

melanoma. Journal of Clinical Oncology, 27(32):5439–44, 2009.

[264] F. Hayashi. Econometrics. Princeton University Press, Princeton, 1st edi-

tion, 2000.

[265] A.R. Pagan and A.D. Hall. Diagnostic tests as residual analysis. Economet-

ric Reviews, 2(2):159–218, 1983.

[266] Y. Li and G.R. Abecasis. MACH 1.0: rapid haplotype reconstruction

and missing genotype inference. American Journal of Human Genetics,

S79:2290, 2006.

[267] P.I.W. de Bakker, M.A.R. Ferreira, X. Jia, B.M. Neale, S. Raychaudhuri,

and B.F. Voight. Practical aspects of imputation-driven meta-analysis of

genome-wide association studies. Human Molecular Genetics, 17(2):R122-

R128, 2008.

[268] J.H. Stock, J.H. Wright, and M. Yogo. A Survey of Weak Instruments and

Weak Identification in Generalized Method of Moments. American Statisti-

BIBLIOGRAPHY 267

cal Association Journal of Business and Economic Statistics, 20(4):518-529,

2002.

[269] D.L. Duffy, M.M. Iles, D. Glass, G. Zhu, J.H. Barrett, V. Hoiom, Z.Z Zhao,

R.A. Sturm, N. Soranzo, C. Hammond, M. Kvaskoff, D.C. Whiteman, M.

Mangino, J. Hansson, J.A. Newton-Bishop, GenoMEL, V. Bataille, N.K.

Hayward, N.G. Martin, D.T. Bishop, T.D. Spector, and G.W. Montgomery.

IRF4 variants have age-specific effects on nevus count and predispose to

melanoma. American Journal of Human Genetics, 87(1):6-16, 2010.

[270] K. Lasithiotakis, U. Leiter, F. Meier, T. Eigentler, G. Metzler, M. Moehrle,

H. Breuninger, and C. Garbe. Age and gender are significant independent

predictors of survival in primary cutaneous melanoma. Cancer, 112(8):1795-

804, 2008.

[271] C.R Scoggins, M.I. Ross, D.S. Reintgen, R.D. Noyes, J.S. Goydos, P.D.

Beitsch, M.M. Urist, S. Ariyan, J.J. Sussman, M.J. Edwards, A.B. Chag-

par, R.C. Martin, A.J. Stromberg, L. Hagendoorn, and K.M. McMasters.

Gender-related differences in outcome for melanoma patients. Annals of

Surgery, 243(5):693-8, 2006.

[272] N.G. Williams and T.M. Roberts. Signal transduction pathways involving

the raf proto-oncogene. Cancer and Metastasis Reviews, 13(1):105, 1994.

[273] Office for National Statistics. Mortality Statistics: Cause. England and

Wales 2008. London TSO, 2010.

[274] R.W. Morris and N.L. Kaplan. On the advantage of haplotype analysis in

the presence of multiple disease susceptibility alleles. Genetic Epidemiology,

23(3):221-223,2002.

268 BIBLIOGRAPHY

[275] P.S. Rosenberg, A. Che, and B.E. Chen. Multiple hypothesis testing strate-

gies for genetic case-control association studies. Statistics in Medicine,

25(18):3134-3149, 2006.

[276] B.N. Howie, P. Donnelly, and J. Marchini. A flexible and accurate geno-

type imputation method for the next generation of genome-wide association

studies. PLoS Genetics, 5(6):e1000529, 2009.

[277] M.F. Holick. Vitamin D Deficiency. New England Journal of Medicine,

357:266-281, 2007.

[278] J. Ahn, D. Albanes, S.I. Berndt, U. Peters, N. Chatterjee, N.D. Freed-

man, C.C. Abnet, W.Y. Huang, A.S. Kibel, E.D.Crawford, S.J. Weinstein,

S.J. Chanock, A. Schatzkin, and R.B. Hayes. Vitamin D-related genes,

serum vitamin D concentrations and prostate cancer risk. Carcinogenesis,

30(5):769776, 2009.

[279] J. Bound, D.A. Jaeger, and R.M. Baker. Problems with instrumental vari-

ables estimation when the correlation between the instruments and the en-

dogenous explanatory variable is weak. Journal of the American Statistical

Association, 90(430):443-450, 1995.

[280] R.P. Gallagher, D.I. McLean, C.P. Yang, A.J. Coldman, H.K.B. Silver,

J.J. Spinelli, and M. Beagrie. Suntan, Sunburn, and Pigmentation Fac-

tors and the Frequency of Acquired Melanocytic Nevi in Children. Similar-

ities to Melanoma: The Vancouver Mole Study Archives of Dermatology,

126(6):770-776, 1990.

BIBLIOGRAPHY 269

[281] M.P. Staples, M. Elwood, R.C. Burton, J.L. Williams, R. Marks, and G.G

Giles. Non-melanoma skin cancer in Australia: the 2002 national survey and

trends since 1985 Medical Journal of Australia, 184(1):6-10, 2006.

270 BIBLIOGRAPHY

271APPENDIX A

Letter to Doctor

Western Australian Cancer Registry Information Collection and Management, DOH (WA)

1st floor C Block, 189 Royal St EAST PERTH WA 6004

Ph. 08 9222 4022 Fax 08 9222 4236 Website: http://www.health.wa.gov.au/wacr

e-mail: [email protected]

«DOCNAME» «Doctor_Mail_Add1» «Doctor_Mail_Add2» «Doctor_Mail_Add3» «Doctor_Mail_Town» «Doctor_Mail_PCode»

Dear Dr «DOCSNAM»

I am writing to you about the Western Australian Melanoma Health Study (WAMHS) which has recently been established in order to develop a database of information and blood specimens to assist in research into this disease, which has significant mortality and morbidity.

The researchers wish to approach persons diagnosed with melanoma, via the Western Australian Cancer Registry (WACR), to seek their consent and participation. Participation involves the patient completing a questionnaire containing health and lifestyle information, as well as donating a blood sample for storage and use in medical research.

I am attaching details of persons whom WACR information suggests were your patients. I would appreciate it if you would indicate on the list whether there is any reason not to approach particular people about this project and return it to us in the reply paid envelope provided. If we do not receive a reply from you within four weeks, we will assume that you have no concerns about the researchers contacting any of the people listed.

Should you have any questions or would like further information about the WACR, please do not hesitate to contact me using the details above. For more information about the Study, please contact the WAMHS study coordinator, Ms Sarah Ward, on (08) 9346 1119.

This study has been approved by The University of Western Australia Ethics Committee and the Confidentiality of Health Information Committee of the Department of Health.

Yours sincerely,

Dr Timothy Threlfall Principal Medical Officer/Manager

<<Date>>

273APPENDIX B

Doctor Information Sheet

1

WESTERN AUSTRALIAN MELANOMA

HEALTH STUDY

Information for Doctors

BACKGROUND AND PURPOSE

Researchers from the Centre for Genetic Epidemiology and Biostatistics at The University of Western Australia have established the Western Australian Melanoma Health Study (WAMHS) in an effort to investigate the genetic and environmental factors that may cause malignant melanoma. This study has been approved by the University of Western Australia’s Human Research Ethics Committee and the Department of Health’s Human Research Ethics Committee (formerly Confidentiality of Health Information Committee).

Your patient/s are being asked to take part in this study because they have recently been diagnosed with melanoma, and have had this noted in the Western Australian Cancer Registry (WACR). We are asking all patients who have been diagnosed with melanoma in Western Australia to consider participating in this study.

Taking part in this study is completely voluntary. This information sheet will give you information regarding the study that your patient/s are being invited to participate in.

WHAT DO YOU HAVE TO DO?

You are asked to indicate on the enclosed list of your patients that have been diagnosed with malignant melanoma, if there is any reason, medical or other, not to approach particular patients about this project. You are then asked to return the list to the WACR in the reply paid envelope provided. Those patients that you consider suitable to participate in the study will be contacted by the WACR and can then decide if they wish to participate.

How will consent be handled? Once approval to contact the patient has been indicated by you, information about the WAMHS and a consent form will be sent directly to them by the WACR, on behalf of the WAMHS research staff. They will have ample time to discuss their decision with you, their family or WAMHS research staff and will be asked to return the consent form indicating their decision either way to the WACR. If the patient consents to participate in the study, the WACR will notify the WAMHS research staff.

WHAT IS THE PATIENT GIVING CONSENT FOR?

If patients choose to take part in the study, they will be asked to:

• Complete a questionnaire: This would involve the patient answering questions on their health, lifestyle, background and family history. The questionnaire will be conducted as a telephone interview and will take approximately 45 minutes.

• Provide a blood sample: This would be collected for serum and DNA at a PathWest collection centre, with the option of providing an additional sample for RNA.

2

• Health information access: In addition we will ask for consent to access health information kept about the patient. This information may come from hospital case notes, GP records or may be information kept about them by the Department of Health as part of its regular function.

DOES THE PATIENT HAVE TO TAKE PART IN THE STUDY?

No. Participation in this study is entirely up to the individual and their care will not be affected in any way by their decision. We will not inform you as the treating doctor whether your patient chose to participate. Patients can also withdraw their consent at any time by communicating this to WAMHS staff in writing.

WHO CAN ANSWER QUESTIONS ABOUT THE STUDY?

If you or your patients have any concerns or questions about the Western Australian Melanoma Health Study, please contact the WAMHS staff:

Website: http://www.wamhs.org.au

Phone: (08) 6488 6753 or 1800 145 511 (Sarah Ward - Study Coordinator)

Mail: Western Australian Melanoma Health Study Centre for Genetic Epidemiology and Biostatistics

The University of Western Australia M409, 35 Stirling Highway CRAWLEY WA 6009

OTHER ISSUES

If you or your patients have any queries about the Western Australian Cancer Registry, please contact the registry directly:

Mail: Dr. Timothy Threlfall Principal Medical Officer and Manager Western Australia Cancer Registry Department of Health (WA) 1st Floor C Block, 189 Royal Street East Perth WA 6004

Phone: (08) 9222 4022

If you or your patients have any complaints about any aspect of the study, or any questions about your patients’ rights as participants, then you may contact this committee:

Mail: Secretary, Human Research Ethics Committee Registrar’s Office - University of Western Australia 35 Stirling Highway Crawley WA 6009

Phone: (08) 6488 3703

275APPENDIX C

Letter to Patient

«crn»

Western Australian Cancer Registry Information Collection and Management, DOH (WA)

1st floor C Block, 189 Royal St EAST PERTH WA 6004

Ph. 08 9222 4022 Fax 08 9222 4236 Website: http://www.health.wa.gov.au/wacr

e-mail: [email protected]

«title» «FORENAME1» «FORENAME2» «SURNAME» «Person_Mail_Address» «Person_Mail_Town» WA «Person_Mail_PostCode»

Dear «title» «SURNAME» I am writing to ask for your assistance in an important state wide research project, interested in identifying the factors that cause malignant melanoma. This study is called the Western Australian Melanoma Health Study (WAMHS), and is being conducted by researchers at the Centre for Genetic Epidemiology and Biostatistics at The University of Western Australia. Currently, melanoma is the third most common form of cancer in Australia. The researchers wish to collect a blood sample and information on health, lifestyle and family history from people recently diagnosed with melanoma. Please find an information brochure about the study enclosed. Your name has been notified to the Western Australian Cancer Registry (WACR) as a result of medical tests, as required by law, and I am writing to invite you to participate in this study (please also find an information brochure about the WACR enclosed). If we could identify your doctor, we have contacted them as a courtesy prior to writing to you. However, your doctor will not be notified as to whether you chose to participate in this study or not. Further, no personal information will be given to any researchers without your consent. If you would like to participate in this study, I would be grateful if you would read and complete the enclosed consent form and post it back to me, using the reply paid envelope provided. No stamp is required (free postage). An extra copy of the consent form is also enclosed for you to keep for your records. If you do agree to participate, I will forward your name and consent to the WAMHS team at the Centre for Genetic Epidemiology and Biostatistics. They will then send you a questionnaire and a blood collection form to take with you if you have a blood sample taken. I will send a reminder letter about this study in 2 weeks, as it is most important to the success of the research that we get information from everyone we contact about whether they want to take part or not. If you do NOT wish to participate, you can indicate this on the consent form, so I can avoid sending you another letter and your name will NOT be given to the researchers. I appreciate your time in considering this request, and would be happy to discuss any problems or other WACR matters with you. Alternatively, you can call the Study Coordinator, Ms Sarah Ward, on 1800 145 511 (free-call number) or (08) 9346 1119 about the study itself. Yours sincerely,

Dr Timothy Threlfall Principal Medical Officer/Manager

«DATE»

277APPENDIX D

Patient Information Brochure

Participant Information Brochure

What is the purpose of this study?

Researchers at the Centre for Genetic Epidemiology and Biostatistics

(CGEB) at the University of Western Australia are investigating why people get malignant melanoma, a type of skin cancer, as part of the Western

Australian Melanoma Health Study (WAMHS).

Causes of melanoma include genetic (hereditary factors passed on to you

from your parents) and environmental factors (such as lifestyle differences

between people).

You are being asked to take part in this study because you have recently

been diagnosed with melanoma, and this has been notified to the Western Australian Cancer Registry (WACR). All patients who have been diagnosed

with malignant melanoma in Western Australia are being asked if they would

like to be a part of this study.

This information brochure will give you information to help you decide

whether or not you would like to be a part of the study. Please read this

carefully and discuss with members of your family and friends, as necessary.

Do I have to take part in this study?

No. Taking part in this study is completely voluntary and entirely up to you. Your decision will not be revealed to your doctor and will not affect your care

in any way.

What does this study involve?

There are three parts to this study:

1. Questionnaire (necessary)

2. Blood samples (two options)

a. One type of blood sample (DNA and serum)

b. Two types of blood samples (DNA, serum and RNA)

3. Health information access

You may participate in one part, more than one part, or all parts. However, if you participate in any parts of the study, you will need to

complete the questionnaire (Part 1) so that researchers can better

understand any environmental factors that are relevant to you.

1

Part 1. Questionnaire

What is this for?

The questionnaire provides information about your health and lifestyle, your

background and family history, and your melanoma. This information will help

us to understand environmental factors that cause melanoma.

What does this involve?

The questionnaire will be completed in a telephone interview with WAMHS

staff at a time convenient to you. This should take approximately 45 minutes.

Before the interview, some questions will be sent to you, so that you can

complete them beforehand (e.g. height) and have the answers ready to give

the interviewer.

What will be done with this information?

This information will be used by medical researchers interested in

environmental or lifestyle factors which may be involved in melanoma, e.g.

sun exposure. In addition, information regarding your family history will help

to see if melanoma runs in families.

Part 2. Blood Samples

What is this for?

Blood samples are very important to help in understanding malignant

melanoma. Specifically, blood samples can be used to identify possible

causes, improve diagnosis and potentially lead to new treatments. Your

sample will greatly assist melanoma research in the following ways:

1. A number of blood tests are currently performed to detect certain types

of disease, such as the cholesterol test for high cholesterol or the blood

sugar test for diabetes.

New tests are currently being developed for other forms of skin cancer

(basal cell carcinoma (BCC) and squamous cell carcinoma (SCC)). Your

blood sample would be very valuable in helping to evaluate new tests

that may be able to be used for melanoma.

2. Genes or DNA carry the information in your body that determines many

2

of your characteristics, including the colour of your hair or eyes. RNA is

produced from genes and in turn makes all the proteins in your body.

Your DNA and RNA will be used in research looking at the genes that

affect melanoma.

Some researchers may wish to perform tests to see if you are carrying

genes that might predict risk of melanoma or that may be involved in

whether or not you respond to a particular type of therapy.

What does this involve?

There are two options for the type of blood sample that you can donate:

a. One type of blood sample (DNA and serum)

This blood sample will provide a sample of your DNA (along with serum - the

proteins and chemicals in your blood).

If you would only like to donate this blood sample, it can be collected at any

PathWest centre in Western Australia (both metropolitan and regional

centres).

You will be asked to give approximately 30ml (about 2 and a half

tablespoons) of blood with this option.

b. Two types of blood samples (DNA, serum and RNA)

An extra blood sample is needed to collect RNA. This is collected at the

same time as the sample for DNA and serum, as part of the same blood test.

As a special tube is used to collect the RNA, it can only be done at certain

PathWest centres.

If you would like to donate both types of blood samples, this will be collected

at the PathWest centre at QEII Medical Centre in Nedlands, Perth.

You will be asked to give approximately 35ml (about 3 tablespoons) of blood

with this option.

What will be done with this information?

Serum, DNA and RNA will be extracted from your blood sample and stored in

small plastic tubes in a secure freezer located at the Western Australian DNA

Bank (WADB), within the CGEB.

3

The samples will be used by medical researchers in biochemical and genetic

studies of melanoma and other types of skin cancer such as basal cell

carcinoma (BCC) and squamous cell carcinoma (SCC).

These types of skin cancer occur more often than melanoma but are not as

dangerous, as they are less likely to spread to other parts of the body or

spread as quickly. All three types of skin cancer develop in different types of

cells in the skin and differ in their appearance.

Your blood will not be used for research that involves reproductive

cloning. Researchers will not be permitted to work out a genetic profile of

you as an individual.

With your permission, we would like to store this blood indefinitely for future

research into skin cancer. As such, your blood will be stored until it has all

been used, unless you contact the WAMHS to request its destruction

(contact details on the back page of this brochure).

What are the future implications of donating blood?

Feedback to participants

Your donated samples are not intended to be used in your diagnosis or

treatment. Feedback will not normally be given to participants regarding

personal genetic information. This is because melanoma is not thought to

be caused by just one gene (e.g. like cystic fibrosis is), but a complex

disease with many genes contributing to its development.

These genes will have small effects by themselves for each individual and

will generally combine together with other factors (such as other genes, the

environment, or lifestyle factors) to increase disease risk.

The disease risk attributable to each factor will be small and we do not yet

know exactly how all these elements interact, so releasing genetic

information would not be clinically meaningful or useful to you or your family.

However, it is possible that future DNA testing may result in new information

about diseases or potential diseases that you carry. Such information will

require extensive testing and validation before it can be determined to be

clinically useful but some of this information may have health implications for

yourself and your family, including your descendants.

4

Contact with you about any potentially important findings will only be made

where there is clear evidence of the medical importance to you or your

family. Each situation will be carefully reviewed by a Human Research Ethics

Committee before you are informed of the availability of results.

At that time, you may choose whether you would like to receive information

and what further information and tests you would like to proceed with.

Information will be provided through approved medical channels. Please note

it is your responsibility to keep the WAMHS up to date with your contact

details.

Disclosure to family members

You should consider whether or not any significant health information

obtained as a result of DNA testing should be disclosed to your family

members or descendents in the future. We will ask whether you consent to

the disclosure of your information to your family members or descendents (on

the attached consent form).

Family members would have to supply a written request to the Chief

Investigator and a Human Research Ethics Committee would review and

consider the request to disclose information.

Part 3. Health Information Access

What is this for?

The discovery of factors important in understanding disease may also require

additional knowledge of other relevant medical information about you,

beyond what is already stored in the WACR in the Department of Health of

Western Australia .

What does this involve?

We will ask for your consent for the WAMHS to access medical information

kept about you that is relevant to medical research, for example specific

information about other illnesses you may have had in the past. This

information may come from hospital case notes, GP records or from the

Department of Health as part of its routine data collection.

What will be done with this information?

This information will be used for research into melanoma and related health

areas.

5

281APPENDIX E

Patient Consent Form

1

WESTERN AUSTRALIAN MELANOMA HEALTH STUDY

CONSENT FORM I have read the information given to me and have had all of my questions answered. I have also been given a copy of the:

• Information Brochure entitled “DNA (Genetic) Testing and Storing”;

• Patient Information Brochure for the Western Australian Melanoma Health Study and

• Consent Form for my records

There are 8 parts to this consent form, Parts A to H. If you WOULD LIKE to participate in the WA Melanoma Health Study, please complete parts A to H of this consent form including completing and signing the declaration in Part H. You may consent to any or all parts of the study. However, it is necessary to consent to participate in the questionnaire (Part A) in order to consent to participate in the other parts. If you WOULD NOT LIKE to participate in the WA Melanoma Health Study, please go straight to Part H: Declaration, tick the box stating that you do not agree to participate and sign the form.

Part A: Consent to participate in questionnaire

I consent to complete a questionnaire about the lifestyle and environmental factors associated with melanoma. The questionnaire will be completed by phone interview at a time convenient to me. Some questions will be mailed to me before the interview, so that I can complete them beforehand and have the answers ready at the time of the interview (e.g. height).

I Do Do Not (Please continue below) (Please go to Part H)

Part B: Consent for Blood Sample and Storage for Medical Research

I consent to blood being taken and donate that blood absolutely for testing and research into melanoma and related health areas. In making my donation of blood, I understand and agree that: (a) the blood (which in this consent form includes its constituents and any genetic material or any

cell lines derived from the blood) will be stored and used for research into melanoma and related health areas;

(b) storage of blood samples will be conducted in accordance with the National Health & Medical Research Council’s1* Guidelines for Genetic Registers and Associated Genetic Material. Storage will be conducted under a coded system, to ensure that my confidentiality is maintained;

(c) samples of blood held will be discarded upon my written request;

1 *The National Health & Medical Research Council advises the Australian community and Commonwealth and the

State Governments on standards of individual and public health, and supports research to improve these standards.

2

(d) access to my blood donation for research will be managed by an Advisory Committee and only released where the research project that wishes to use my blood donation has been approved by a Human Research Ethics Committee;

(e) feedback will not normally be given to me regarding my personal genetic research information as these results would not be clinically meaningful or useful to me;

(f) I can ask to know more specific details of any studies that used my blood sample by contacting the WA Melanoma Health Study;

(g) the results of the research may be of interest to my immediate family, including my descendants, and I may decide whether or not the information may be disclosed to my family in accordance with ‘Options for disclosure to family members’ detailed in Part C;

(h) international research collaborations using my blood will only take place where researchers abide by equal or more stringent regulations of privacy and ethics as those in Australia, as assessed by a Human Research Ethics Committee;

(i) the Chief Investigator of this project and his or her associates involved with this project as well as The University of Western Australia will not be liable for any loss or damage to the blood/tissue taken or used in accordance with this form.

I consent to the collection of a blood sample for DNA extraction and biochemical analyses (from serum) and understand that this donation is given under the conditions (a) to (i) stated above. I understand that this sample may be taken at any PathWest centre in WA.

I Do Do Not (Please continue below) (Please go to Part C)

IN ADDITION:

I also consent to the collection of an additional blood sample for RNA to be taken at the same time as the sample for DNA and serum is collected. I understand that this donation is also given under the conditions (a) to (i) stated above. I understand that this sample must be taken at the QE-II Medical Centre, Nedlands.

I Do Do Not (Please go to Part C) (Please go to Part C)

Part C: Options for disclosure to family members

Please only complete this section if you have consented to any part of Part B.

I understand that future DNA testing may result in new information about diseases or potential diseases that may have health implications for my family, including my descendants. I understand that extensive testing and validation would be required before the information can be determined to be useful and that information can only be provided through approved medical channels when there is clear evidence of the medical importance to my family.

I consent to genetic information obtained from my DNA, as a result of this study, being revealed to my family members, upon their written request to the Chief Investigator, in the event of my death.

I Do Do Not (Please go to Part D) (Please go to Part D)

N.B: Part D is no longer applicable

3

Part E: Consent to access health information

I consent to allow access to health information about me relevant to melanoma-related research, such as is kept in a medical record or by the WA Department of Health, to assist medical research only where the research proposal has been approved by a Human Research Ethics Committee.

I Do Do Not (Please go to Part F) (Please go to Part F)

Part F: Future clinical trials for new treatments for melanoma

I consent to allow WAMHS staff to contact me in the future if there are any new treatments for melanoma which I may be suitable for (for example a clinical trial for a new drug or other treatment).

I Do Do Not (Please go to Part G) (Please go to Part G)

Part G: General

I understand that: (a) If a researcher wishes to obtain additional information or samples from me, my name will not be

divulged to that researcher without my written permission; (b) Any research results about me, and the fact that I have taken part in this study, will not be

revealed to any third party without my written consent, except if required by law; (c) The researchers will not reveal my identity and personal information about this project if

published in any public form; (d) I will not receive, or be entitled to, any reward or remuneration for providing my blood sample for

this project. I understand that the sample being donated could be used for commercial development and acknowledge the public interest in the research and donate the sample absolutely;

(e) I understand the potential benefits and risks involved in taking part in this study which have been

explained to me and I accept the risks involved. I have had the opportunity to ask questions and am satisfied with the explanation and the answers to my questions;

(f) I may withdraw from the study at any time, no questions asked, and without it negatively

affecting my future medical treatment; (g) If I choose to withdraw from the study, I understand that any information about me already

collected by the researchers will be retained unless I request otherwise in writing; (h) I understand that my information will only be collected and used for the study of melanoma and

related conditions. Related conditions are defined as other types of skin cancer; basal cell carcinoma (BCC) and squamous cell carcinoma (SCC).

4

Part H: Declaration

(Please tick the option that applies)

I agree to participate in the WA Melanoma Health Study and consent to the parts of the study that I have indicated in Part A through F. I have also read and understood Part G.

OR

I do not agree to participate in the WA Melanoma Health Study. Please sign:

……………………………... …………...………… …………………...

Name of Participant Signature Date ……………………………... …………...………… …………………...

Name of Witness Signature Date

The Human Research Ethics Committee at The University of Western Australia requires that all participants are informed that, if they have any complaint regarding the manner in which a research project is conducted, it may be given to the researcher or, alternatively, to the Secretary, Human Research Ethics Committee, Registrar’s Office, The University of Western Australia, Nedlands, WA 6009 (telephone number 6488-3703, fax number 6488-3861, email [email protected]). Name:

Address: CRN:

5

IF YOU HAVE CONSENTED TO PARTICIPATE

PLEASE COMPLETE THE FOLLOWING:

QUESTIONNAIRE INTERVIEW TIMES The questionnaire part of the study is completed by telephone interview. Could you please place a √ next to the day you would prefer us to call you for this interview?

Monday Tuesday Wednesday Thursday Friday

Could you please place a √ next to the time you would prefer us to call you for this interview:

Morning (9am-12pm) Afternoon (12pm-5pm) Evening (5pm – 8pm)

We will try to contact you on your home number first to complete the interview. If we are unable to contact you on this number, could you please place a √ next to the phone number you would prefer us to try contacting you on.

Work phone Mobile

CONTACT DETAILS

Please complete the following details: Home Phone Number Work Phone Number

Mobile Number

Please check the following details that we have for you and amend any that are incorrect. Name: Address: Date of Birth:

285APPENDIX F

Mole-Counting Chart

MOLE COUNTING CHART This chart will help you to complete Section D of the questionnaire

What are moles? Moles are collections of the skin’s “colour” cells lying just below the surface of the skin. Moles: • Vary from pale brown to almost black in colour. • Vary in size from about 1mm to 10mm. • May be completely flat but are usually raised above the surrounding

skin and can be felt if you run your fingers over them. • Usually appear during childhood or the teenage years. • Do not come and go with sunlight.

Freckles The hardest part of counting moles is telling the difference between them and freckles.

• Moles are usually darker and more regular in shape than freckles. • Freckles have a ‘splotchier’ appearance and come and go with sunlight. • Freckles are never raised above the surrounding skin and usually occur

together in patches, particularly on the upper part of the body and especially on the face and shoulders.

• Freckles on the back will mainly occur on the upper part of the back and the shoulders.

• Usually raised and occur in a range of colours, often light to dark brown. • Seem to be “stuck on” and may flake a bit at the edges. • Look dull and waxy. • Mostly on the face, neck and trunk but can

be anywhere on the body. • More common with increasing age but can

appear in a person’s 20’s or 30’s.

Sebborhoeic keratoses (‘Senile warts’) It can be hard to tell the difference between moles and senile warts (sebborhoeic keratoses). Senile warts are:

PLEASE COUNT THESE DO NOT COUNT THESE

287APPENDIX G

Questionnaire

IdId

EntryEntry

EntryEntry

EntryEntry

IDID Start TimeStart TimeDateDate

Good morning/afternoon, could I please speak to

Please enter your ID, today's date and the participant's sex.

Participant's AgeParticipant's Age SexSex

MHS No.MHS No.

EntryEntry

My name is ......... and I am calling about the telephone interview for the Western Australian Melanoma Health Study. Would now be a convenient time to do the interview and do you have your brochure with you?

If:- asked who is calling/why you are calling- someone else answers- the participant is not there- you get an answering machinePlease refer to your questionnaire guide for standard responses.

Once speaking to the participant:

If YES - continue to next page (they MUST have the brochure to continue)If NO - refer to questionnaire guide

Have you had a chance to complete the height and weight questions and do your mole count ?

If NO, arrange time to call back today, or another day [refer to interviewing roster], for these answers.

Now go to page 3a.

If YES, ask questions below and continue to page 3a.

cm

What is your weight in kilograms? kg

We would also like to know the number of moles on your back. Please refer to the diagram in question D2.

How many moles do you have in section A, or from the base of your neck to the top of your armpits ?

D2

How many moles do you have in section B, or from the top of your armpit to the top of your underpants ?

moles

What is your height in centimetres?A2

A3

moles

Before we start, I’d like to ask you to get a pen and paper, as I’ll ask you to note down a few things a bit later on in the interview to help you with your answers. [Wait whilst items obtained].

SECTION B : COLOURING AND SKIN TYPE

SECTION A : PERSONAL DETAILS

Single De facto Married Divorced WidowedSingle De facto Married Divorced Widowed

Red (including auburn)

Fair or blonde (including white)

Light or mouse brown

Grey

Dark brown

Black

Red (including auburn)

Fair or blonde (including white)

Light or mouse brown

Grey

Dark brown

Black

Blue

Grey

Green

Hazel

Brown

Black

Blue

Grey

Green

Hazel

Brown

Black

Very fair

Fair

Olive or brown

Dark

Very fair

Fair

Olive or brown

Dark

Which colour best describes your skin before tanning or on areas never exposed to the sun, such as the inside of your upper arm?

Could you confirm your date of birth is :

B3

Which colour best describes the colour of your eyes?B2

Which colour best describes your natural hair colour at age 18?

B1

Page 3Version 4

What is your marital status?

B4 Which statement best describes what would happen to your skin if it were exposed to bright sunlight for the first time in summer, for one hour in the middle of the day, without any protection (e.g. sunscreen, clothing)?

Get a severe sunburn with blistering

Have a painful sunburn for a few days followed by peeling

Get mildly burnt followed by some tanning

Go brown without any sunburn

Get a severe sunburn with blistering

Have a painful sunburn for a few days followed by peeling

Get mildly burnt followed by some tanning

Go brown without any sunburn

B5 Which of the following best describes what would happen to your skin if it were repeatedly exposed to bright sunlight in summer without any protection (e.g. sunscreen, clothing)?

Which statement best describes what would happen to your skin if it were exposed to bright sunlight for the first time in summer, for one hour in the middle of the day, without any protection (e.g. sunscreen, clothing)?

B6

Always burns, never tans

Usually burns, never tans

Sometimes burns, usually tans

Never burns, always tans

Always burns, never tans

Usually burns, never tans

Sometimes burns, usually tans

Never burns, always tans

Go very brown and deeply tanned

Get moderately tanned

Get mildly or occasionally tanned

Get no suntan at all or only get freckled

Go very brown and deeply tanned

Get moderately tanned

Get mildly or occasionally tanned

Get no suntan at all or only get freckled

Please look at the faces above questions B7 and B8 in your brochure. Each of the faces shows some degree of freckling, from none to very many. For the following questions, we are looking for your best estimate.

B7

None Very few Few Some Many Very ManyNone Very few Few Some Many Very Many

Which of these faces best describes how many freckles you had at the end of summer as an adult; that is, when you were over 18 ?

B8

None Very few Few Some Many Very manyNone Very few Few Some Many Very many

Page 5

Which of these faces best describes how many freckles you had at the end of summer during childhood; that is, up until you were 17 years old?

How old were you when you first started smoking cigarettes or tobacco at least once a day?

Have you ever smoked cigarettes or tobacco once a day for six months or more?

C1

No

Yes

No

Yes

SECTION C : LIFESTYLE HISTORY

C2

years old

C3 Do you smoke cigarettes or tobacco at least once a day now?

No

Yes

No

Yes

C4 How old were you when you stopped smoking cigarettes or tobacco at least once a day?

years old

C5 Over your life time, how many total years have you smoked?

years

C6 On average, over the entire time that you smoked, how much did you smoke each day?

cigarettes per day (manufactured cigarettes only, not self rolled cigarettes)

of loose tobacco per day

cigars per day50g = 1 pouch of tobacco 1g = 1 self rolled cigarette

grams

ounces

grams

ounces (includes loose tobacco smoked in pipes or self rolled cigarettes)

Page 6

SECTION D : MOLES

The following question is about the number of moles that you have. Again, we are looking for your best estimate.

Please look at the diagrams in question D1. They show various numbers of moles. Which picture best describes how many moles you have now ?

D1

None Few Some ManyNone Few Some Many

Page 7

F2

In which country were you born ?

I would now like to ask you some questions about your background and family history.

DO NOT READ OPTIONS TO PARTICIPANT, CODE FROM RESPONSE

If other selected, please type country's name in the box.

What year did you first arrive in Australia?What year did you first arrive in Australia?

Where did you live for the majority of the time from when you were born to when you were 12 years of age ?

Country Country

State/County/Province/RegionState/County/Province/Region

Where did you live for the majority of the time between the ages of 13 and 19 ?

Where did you live for the majority of the time from the age of 20 to now ?

CountryCountry

State/County/Province/Region State/County/Province/Region

CountryCountry

State/County/Province/Region State/County/Province/Region

F1

F3

F4

F5

What is the highest level of education you have completed?

F6

City/TownCity/Town

City/TownCity/Town

City/TownCity/Town

I'd now like to ask you some questions about your ethnic background and the ethnic background of your family.

How would you describe your ethnic background ?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

F7A

DO NOT READ OPTIONS TO PARTICIPANT, CODE FROM RESPONSEIf participant wishes to select two answers, they may, however do not suggest this option.

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

B How would you describe the ethnic background of your biological mother?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

How would you describe the ethnic background of your biological father?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

C

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

D How would you describe the ethnic background of your mother's mother?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

How would you describe the ethnic background of your mother's father?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

E

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

F How would you describe the ethnic background of your father's mother?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

G How would you describe the ethnic background of your father's father?

Indigenous Australian (Aboriginal, Torres Strait Islander)Indigenous Australian (Aboriginal, Torres Strait Islander)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

South East Asian (Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar/Burma, Philippines,

Singapore, Thailand, Vietnam)

North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)North-East Asian (China, Hong Kong, Japan, Korea, Macau, Taiwan)

South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)South-Asian (Afghanistan, Bangladesh, India, Nepal, Pakistan, Sri Lanka)

Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)Middle Eastern (Israel, Iran, Iraq, Lebanon, Turkey, Egypt)

Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)Pacific Islander (New Zealand Maori or originated from Pacific Islands, Hawaii, New Guinea)

African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)African (Originating from North Africa, Sub-Saharan Africa, Zimbabwe, Indigenous South African)

Caucasian - English (Orginated from England)Caucasian - English (Orginated from England)

Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)Caucasian - Scottish, Irish or Welsh (Orginated from Scotland, Ireland or Wales)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Other Northern European (Austria, Latvia, Lithuania, Estonia, Denmark, France, Germany, Luxembourg, Netherlands/Holland, Sweden, Norway, Finland, Switzerland, other Western/Northern European countries)

Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)Caucasian - Southern European (Greece, Italy, Portugal, Spain, Former Yugoslavia, Malta, Cyprus, other Southern European countries)

Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)Caucasian - Eastern European (Bulgaria, Former Czechoslovakia, Hungary, Poland, Romania, Former USSR, other Eastern European countries)

Caucasian - Other CaucasianCaucasian - Other Caucasian Please specify - Please specify -

I’d now like to ask you some questions about your family history.

F8

Yes

No

Don't know

Has your biological mother ever been told by a doctor that she has any type of skin cancer?Has your biological mother ever been told by a doctor that she has any type of skin cancer?

No

Don't know

F9A

Yes

No

Don't know

Did she have a melanoma?Did she have a melanoma?

No

Don't know

F9B

Yes

No

Don't know

Did she have another type of skin cancer that wasn’t a melanoma?Did she have another type of skin cancer that wasn’t a melanoma?

No

Don't know

F10

Yes - Please specify

No

Don't know

Has your biological mother ever been told by a doctor that she has any other type of cancer?Has your biological mother ever been told by a doctor that she has any other type of cancer?

No

Don't know BreastBreast

BladderBladder

ColorectalColorectal

KidneyKidney

StomachStomach

PancreasPancreas

OvarianOvarian

LeukemiaLeukemia

ThyroidThyroid

LungLung

LymphomaLymphoma

UterusUterus

Other - Please SpecifyOther - Please Specify

F11

Yes

No

Don't know

Has your biological father ever been told by a doctor that he has any type of skin cancer ?Has your biological father ever been told by a doctor that he has any type of skin cancer ?

No

Don't know

F12A

F12B

Yes

No

Don't know

Did he have a malignant melanoma?Did he have a malignant melanoma?

No

Don't know

Yes

No

Don't know

Did he have another type of skin cancer that wasn’t a melanoma?Did he have another type of skin cancer that wasn’t a melanoma?

No

Don't know

Yes - Please specify

No

Don't know

Has your biological father ever been told by a doctor that he has any other type of cancer?Has your biological father ever been told by a doctor that he has any other type of cancer?

No

Don't know

F13

BladderBladder

ColorectalColorectal

KidneyKidney

LeukemiaLeukemia

LungLung

LymphomaLymphoma

Other - Please SpecifyOther - Please Specify

ThyroidThyroid

StomachStomach

PancreasPancreas

ProstateProstate

F14

Yes

No

Don't know

Do you have any full brothers? That is, brothers with the same mother and father as you.Do you have any full brothers? That is, brothers with the same mother and father as you.

No

Don't know

How many full brothers do you have? How many full brothers do you have?

Yes

No

Don't know

Have any of your full brothers ever been told by a doctor that they had any type of skin cancer?Have any of your full brothers ever been told by a doctor that they had any type of skin cancer?

No

Don't know

How many brothers were told this? How many brothers were told this?

Yes

No

Don't know

Did any of them have a malignant melanoma?Did any of them have a malignant melanoma?

No

Don't know

Yes

No

Don't know

Did any of them have another type of skin cancer that wasn’t a melanoma?Did any of them have another type of skin cancer that wasn’t a melanoma?

No

Don't know

How many brothers were told this?How many brothers were told this?

How many brothers were told this?How many brothers were told this?

Yes - Please Specify How Many

No

Don't know

Have any of your full brothers ever told by a doctor that they had any other type of cancer?Have any of your full brothers ever told by a doctor that they had any other type of cancer?

No

Don't know

BladderBladder

ColorectalColorectal

KidneyKidney

LungLung

LymphomaLymphoma

PancreasPancreas

F21F21

F15F15

F16F16

F17F17

F18AF18A

F19F19

F18BF18B

F20F20

LeukemiaLeukemia ProstateProstate

StomachStomach

ThyroidThyroid

OtherOther

F23

Yes

No

Don't know

Do you have any half brothers? That is, brothers with the same mother OR father as you.Do you have any half brothers? That is, brothers with the same mother OR father as you.

No

Don't know

How many half brothers do you have? How many half brothers do you have?

Yes

No

Don't know

Have any of your half brothers ever been told by a doctor that they have any type of skin cancer?Have any of your half brothers ever been told by a doctor that they have any type of skin cancer?

No

Don't know

How many brothers were told this? How many brothers were told this?

Yes

No

Don't know

Did any of them have a malignant melanoma?Did any of them have a malignant melanoma?

No

Don't know

Yes

No

Don't know

Did any of them have another type of skin cancer that wasn’t a melanoma?Did any of them have another type of skin cancer that wasn’t a melanoma?

No

Don't know

How many brothers were told this?How many brothers were told this?

How many brothers were told this?How many brothers were told this?

Yes - Please Specify How Many

No

Don't know

Were any of your half brothers ever told by a doctor that they had any other type of cancer?Were any of your half brothers ever told by a doctor that they had any other type of cancer?

No

Don't know

F24F24

F25F25

F26F26

F27AF27A

F28F28

F27BF27B

F29F29

F30F30

BladderBladder

ColorectalColorectal

KidneyKidney

LeukemiaLeukemia ProstateProstate

PancreasPancreas

LymphomaLymphoma

LungLung StomachStomach

ThyroidThyroid

OtherOther

F32

Yes

No

Don't know

Do you have any full sisters? That is, sisters with the same mother and father as you.Do you have any full sisters? That is, sisters with the same mother and father as you.

No

Don't know

How many full sisters do you have? How many full sisters do you have?

Yes

No

Don't know

Have any of your full sisters ever been told by a doctor that they have any type of skin cancer?Have any of your full sisters ever been told by a doctor that they have any type of skin cancer?

No

Don't know

How many sisters were told this? How many sisters were told this?

Yes

No

Don't know

Did any of them have a malignant melanoma?Did any of them have a malignant melanoma?

No

Don't know

Yes

No

Don't know

Did any of them have another type of skin cancer that wasn’t a melanoma?Did any of them have another type of skin cancer that wasn’t a melanoma?

No

Don't know

How many sisters were told this?How many sisters were told this?

How many sisters were told this?How many sisters were told this?

Yes - Please Specify How Many

No

Don't know

Were any of your full sisters ever told by a doctor that they had any other type of cancer?Were any of your full sisters ever told by a doctor that they had any other type of cancer?

No

Don't know

F33F33

F34F34

F35F35

F36AF36A

F37F37

F36BF36B

F38F38

F39F39

OtherOther

ThyroidThyroidLungLung

LymphomaLymphoma

PancreasPancreas

StomachStomach

BladderBladder

ColorectalColorectal

KidneyKidney

LeukemiaLeukemia

BreastBreast

OvarianOvarian

UterusUterus

F41

Yes

No

Don't know

Do you have any half sisters? That is, sisters with the same mother or father as you.Do you have any half sisters? That is, sisters with the same mother or father as you.

No

Don't know

How many half sisters do you have? How many half sisters do you have?

Yes

No

Don't know

Have any of your half sisters ever been told by a doctor that they had any type of skin cancer?Have any of your half sisters ever been told by a doctor that they had any type of skin cancer?

No

Don't know

How many sisters were told this? How many sisters were told this?

Yes

No

Don't know

Did any of them have a malignant melanoma?Did any of them have a malignant melanoma?

No

Don't know

Yes

No

Don't know

Did any of them have another type of skin cancer that wasn’t a melanoma?Did any of them have another type of skin cancer that wasn’t a melanoma?

No

Don't know

How many sisters were told this?How many sisters were told this?

How many sisters were told this?How many sisters were told this?

Yes - Please Specify How Many

No

Don't know

Were any of your half sisters ever been told by a doctor that they had any other type of cancer?Were any of your half sisters ever been told by a doctor that they had any other type of cancer?

No

Don't know

F42F42

F43F43

F44F44

F45AF45A

F46F46

F45BF45B

F47F47

F48F48

OtherOther

LymphomaLymphoma

PancreasPancreas

StomachStomach

OvarianOvarian

UterusUterus

ThyroidThyroidLungLungBladderBladder

ColorectalColorectal

KidneyKidney

LeukemiaLeukemia

BreastBreast

F50 Are you a twin, triplet or other multiple birth?

Yes, a twin

Yes, a triplet

Yes, other multiple birth

No

Don't know

Are you a twin, triplet or other multiple birth?Are you a twin, triplet or other multiple birth?

Yes, a triplet

Yes, other multiple birth

No

Don't know

F51

Yes

No

Don't know

Do you have any biological children? Do you have any biological children?

No

Don't know

How many children do you have? How many children do you have?

Yes

No

Don't know

Have any of your children ever been told by a doctor that they have any type of skin cancer?Have any of your children ever been told by a doctor that they have any type of skin cancer?

No

Don't know

F52F52

F53F53

How many children have been told this? How many children have been told this?

This only refers to biological children, not step-children.

F54F54

Yes

No

Don't know

Did any of them have a malignant melanoma?Did any of them have a malignant melanoma?

No

Don't know

F55AF55A

How many children were told this?How many children were told this?

Yes

No

Don't know

Did any of them have another type of skin cancer that wasn’t a melanoma?Did any of them have another type of skin cancer that wasn’t a melanoma?

No

Don't know

F56F56

F55BF55B

How many children were told this?How many children were told this?F57F57

Yes

No

Don't know

Were any of your children ever told by a doctor that they had any other type of cancer?Were any of your children ever told by a doctor that they had any other type of cancer?

No

Don't know

F58F58

BreastBreast

OtherOtherPancreasPancreas

OvarianOvarianColorectalColorectal

KidneyKidney

LymphomaLymphoma

BladderBladder LungLung

LeukemiaLeukemia ProstateProstate

StomachStomach

UterusUterus

ThyroidThyroid

I am now going to ask you some questions about time that you've spent outdoors at various ages. By 'outdoors', I mean the time you spent outdoors and not under any shade between 9am and 5pm.

Please remember we are looking for your best estimate.

To help you with the next set of questions, it might be useful to write down ‘not under any shade’ and ‘between 9am and 5pm’, to act as a reminder when answering the questions.”

I'd like to start by asking you about time outdoors in the warmer months, that is, spring and summer, when you were 5-12 years of age or in primary school.

How many hours per day did you usually spend outdoors and not under any shade, between 9am and 5pm in the warmer months on:

School days School days

WeekendsWeekends

ORDon't knowDon't know

ORDon't knowDon't know

G1.3

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, in the warmer months, during the hours that you were at school, were your activities:On average, in the warmer months, during the hours that you were at school, were your activities:

Split between indoors and outdoorsMainly outdoors

G1.4

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, were these activities:On average, were these activities:

Split between indoors and outdoorsMainly outdoors

I'd now like you to think about recreational activities that you did during the warmer months (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

G1.1,1.2

hrs

hrs

I'd now like to ask you about time outdoors in the cooler months, that is, autumn and winter, when you were 5-12 years of age.

How many hours per day did you usually spend outdoors and not under any shade, between 9am and 5pm in the cooler months on:

School days School days

WeekendsWeekends

ORDon't knowDon't know

ORDon't knowDon't know

G1.7

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, in the cooler months, during the hours that you were at school, were your activities:On average, in the cooler months, during the hours that you were at school, were your activities:

Split between indoors and outdoorsMainly outdoors

G1.8

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, were these activities:On average, were these activities:

Split between indoors and outdoorsMainly outdoors

I'd now like you to think about recreational activities that you did during the cooler months (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

G1.5,1.6

hrs

hrs

I'd now like to ask you some questions about when you were 13-19 years of age.

How many hours per day did you usually spend outdoors and not under any shade, between 9am and 5pm in the warmer months on:

School or week days School or week days

WeekendsWeekends

ORDon't knowDon't know

ORDon't knowDon't know

G2.4

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, in the warmer months, during the hours that you were at school or working, were your activities:On average, in the warmer months, during the hours that you were at school or working, were your activities:

Split between indoors and outdoorsMainly outdoors

G2.5

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, were these activities:On average, were these activities:

Split between indoors and outdoorsMainly outdoors

I'd now like you to think about recreational activities that you did during the warmer months (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

G2.1

During these years, what was your main occupation?

I'd now like to ask you about time outdoors in the warmer months, that is, spring and summer, when you were 13-19 years of age.

I'd now like you to think about all the jobs that you had during these years (including studying).

G2.2,2.3

hrs

hrs

OtherOther

How many hours per day did you usually spend outdoors and not under any shade, between 9am and 5pm in the cooler months on:

School or week days School or week days

WeekendsWeekends

ORDon't knowDon't know

ORDon't knowDon't know

G2.8

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, in the cooler months, during the hours that you were at school or working, were your activities:On average, in the cooler months, during the hours that you were at school or working, were your activities:

Split between indoors and outdoorsMainly outdoors

G2.9

I'd now like you to think about recreational activities that you did during the cooler months (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

I'd now like to ask you about time outdoors in the cooler months, that is, autumn and winter, when you were 13-19 years of age.

I'd now like you to think about all the jobs that you had during these years (including studying).

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, were these activities:On average, were these activities:

Split between indoors and outdoorsMainly outdoors

G2.6,2.7

hrs

hrs

I'd now like to ask you some questions about when you were 20-39 years of age.

G3.2

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, during the hours that you were working between 9am and 5pm, were your activities:On average, during the hours that you were working between 9am and 5pm, were your activities:

Split between indoors and outdoorsMainly outdoors

G3.3

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

In the warmer months, on average, were these activities:In the warmer months, on average, were these activities:

Split between indoors and outdoorsMainly outdoors

I'd now like you to think about recreational activities that you did (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

G3.1

During these years, what was your main occupation?

I'd now like you to think about all the jobs that you had during these years (including studying).

G3.4

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

In the cooler months, on average, were these activities:In the cooler months, on average, were these activities:

Split between indoors and outdoorsMainly outdoors

OtherOther

I'd now like to ask you some questions about when you were 40-65 years of age.

G4.3

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, during the hours that you were working between 9am and 5pm, were your activities:On average, during the hours that you were working between 9am and 5pm, were your activities:

Split between indoors and outdoorsMainly outdoors

G4.4

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

In the warmer months, on average, were these activities:In the warmer months, on average, were these activities:

Split between indoors and outdoorsMainly outdoors

I'd now like you to think about recreational activities that you did (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

G4.1

During these years, what was your main occupation?

I'd now like you to think about all the jobs that you had during these years (including studying).

G4.5

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

In the cooler months, on average, were these activities:In the cooler months, on average, were these activities:

Split between indoors and outdoorsMainly outdoors

Year Retired ?Year Retired ?

OtherOther

I'd now like to ask you some questions about when you were 65 years of age till now.

G5.3

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

On average, during the hours that you were working between 9am and 5pm, were your activities:On average, during the hours that you were working between 9am and 5pm, were your activities:

Split between indoors and outdoorsMainly outdoors

G5.4

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

In the warmer months, on average, were these activities:In the warmer months, on average, were these activities:

Split between indoors and outdoorsMainly outdoors

I'd now like you to think about recreational activities that you did (for example, playing sport, going to the beach, playing musical instruments) between 9am and 5pm.

G5.1

During these years, what was your main occupation?

I'd now like you to think about all the jobs that you had during these years (including studying).

G5.5

Mainly indoorsSplit between indoors and outdoorsMainly outdoors

In the cooler months, on average, were these activities:In the cooler months, on average, were these activities:

Split between indoors and outdoorsMainly outdoors

Year Retired ?Year Retired ?

OtherOther

I'd now like to ask you some questions about sunburn for various ages. Before we start, can I ask you to write down the following options, which might help you with your responses: "0, 1-5, 6-10, more than 10".

G6.1

0Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

How many times were you sunburnt so as to cause pain for 2 or more days when you were between 5-12 years of age?

How many times were you sunburnt so as to cause pain for 2 or more days when you were between 5-12 years of age?

Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

0Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

How many times were you sunburnt so severely as to cause blisters between 5-12 years of age?How many times were you sunburnt so severely as to cause blisters between 5-12 years of age?

Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

G6.2

G7.1

0Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

How many times were you sunburnt so as to cause pain for 2 or more days when you were between 13-19 years of age?

How many times were you sunburnt so as to cause pain for 2 or more days when you were between 13-19 years of age?

Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

0Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

How many times were you sunburnt so severely as to cause blisters between 13-19 years of age?How many times were you sunburnt so severely as to cause blisters between 13-19 years of age?

Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

G7.2

0Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

How many times were you sunburnt so severely as to cause blisters since you were 20 years of age? How many times were you sunburnt so severely as to cause blisters since you were 20 years of age?

Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

0Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

How many times were you sunburnt so as to cause pain for 2 or more days since you were 20 years of age?How many times were you sunburnt so as to cause pain for 2 or more days since you were 20 years of age?

Between 1 and 5 timesBetween 6 and 10 timesMore than 10 timesDon't know

G8.1G8.1

G8.2G8.2

I'd now like to ask you about sunbed use.

G9

YesNoDon't know

Have you ever used a sunlamp or sunbed for any reason on more than one occasion?Have you ever used a sunlamp or sunbed for any reason on more than one occasion?

NoDon't know

How old were you when you first used one?How old were you when you first used one?

How old were you when you last used one?How old were you when you last used one?

About how many sunlamp or sunbed sessions have you had in total, over your lifetime?

About how many sunlamp or sunbed sessions have you had in total, over your lifetime?

G13 In what types of locations have you used a sunlamp or sunbed?

Tanning salonTanning salon

Hairdressers, beauty salonsHairdressers, beauty salons

Gymnasium, health club/spa or fitness club/spaGymnasium, health club/spa or fitness club/spa

Hospital or medical facilityHospital or medical facility

Private homePrivate home

Other - Please specifyOther - Please specify

Don't knowDon't know

DO NOT READ OPTIONS TO PARTICIPANT.

The participant may select more than one answer.

G10G10

G11G11

G12G12

I'd now like to ask you some questions about your medical history.

How many melanomas have you been told you've had by a doctor?How many melanomas have you been told you've had by a doctor?

H1.1

How many non-melanoma skin cancers have you been told you've had by a doctor? How many non-melanoma skin cancers have you been told you've had by a doctor?

H1.2a

H1.3

How many skin cancers are you unsure about?How many skin cancers are you unsure about?

Yes

No

Have you had any skin cancers and are unsure what they were?Have you had any skin cancers and are unsure what they were? No

H1.4H1.4

Yes

No

Don't know

Have you had any non-melanoma skin cancers, that is basal cell carcinomas or squamous cell carcinomas? These are often called BCC's or SCC's.

Have you had any non-melanoma skin cancers, that is basal cell carcinomas or squamous cell carcinomas? These are often called BCC's or SCC's.

No

Don't know

H1.2b

When did a doctor tell you that you had a melanoma for the X time?

Where on your body was your X melanoma situated?

Where were you living when you were told by a doctor about your X melanoma?

How large is the scar where your X melanoma was removed?

Please fill in the table for each melanoma.

You should then ask the questions above the table, replacing X with first, second, third, etc.

Number of melanomasNumber of melanomas

YearYear

Don't knowDon't know

MonthMonth

1st

AnatomicalAnatomical

SideSide

StateState

Please specify countryPlease specify country

Size of scarSize of scar

MonthMonth

2nd

Don't knowDon't know

SideSide

AnatomicalAnatomical StateState

Please specify countryPlease specify country

Size of scarSize of scar

MonthMonth

3rd

Don't knowDon't know

SideSide

AnatomicalAnatomical StateState

Please specify countryPlease specify country

Size of scarSize of scar

YearYear

YearYear

When did a doctor tell you that you had a melanoma for the X time?

Where on your body was your X melanoma situated?

YearYear

MonthMonth AnatomicalAnatomical

SideSide

Don't knowDon't know

MonthMonth AnatomicalAnatomical

Where were you living when you were told by a doctor about your X melanoma?

StateState

Please specify countryPlease specify country

5th

Don't knowDon't know

SideSide

StateState

Please specify countryPlease specify country

MonthMonth AnatomicalAnatomical

6th SideSide

StateState

How large is the scar where your X melanoma was removed?

Size of scarSize of scar

Please specify countryPlease specify country

Size of scarSize of scar

Don't knowDon't know

Size of scarSize of scar

YearYear

YearYear

MonthMonth

YearYear

Don't knowDon't know

AnatomicalAnatomical

SideSide

StateState

Please specify countryPlease specify country

Size of scarSize of scar

MonthMonth

YearYear

Don't knowDon't know

AnatomicalAnatomical

SideSide

StateState

Please specify countryPlease specify country

Size of scarSize of scar

4th

7th

8th

When did a doctor tell you that you had a melanoma for the X time?

Where on your body was your X melanoma situated?

YearYear

MonthMonth AnatomicalAnatomical

SideSide

Don't knowDon't know

MonthMonth AnatomicalAnatomical

Where were you living when you were told by a doctor about your X melanoma?

StateState

Please specify countryPlease specify country

10th

Don't knowDon't know

SideSide

StateState

Please specify countryPlease specify country

MonthMonth AnatomicalAnatomical

11th SideSide

StateState

How large is the scar where your X melanoma was removed?

Size of scarSize of scar

Please specify countryPlease specify country

Size of scarSize of scar

Don't knowDon't know

Size of scarSize of scar

YearYear

YearYear

MonthMonth

YearYear

Don't knowDon't know

AnatomicalAnatomical

SideSide

StateState

Please specify countryPlease specify country

Size of scarSize of scar

MonthMonth

YearYear

Don't knowDon't know

AnatomicalAnatomical

SideSide

StateState

Please specify countryPlease specify country

Size of scarSize of scar

9th

12th

13th

How many have you been told you've had?How many have you been told you've had?H16H16

H17H17 What year were you first told you had an SCC?What year were you first told you had an SCC?

H15H15Yes

No

Don't know

Of the non-melanoma skin cancers that you've been told about, were any of them squamous cell carcinomas or SCC?

Of the non-melanoma skin cancers that you've been told about, were any of them squamous cell carcinomas or SCC?

No

Don't know

Yes

No

Don't know

Of the non-melanoma skin cancers that you've been told about, were any of them basal cell carcinomas or BCC?

Of the non-melanoma skin cancers that you've been told about, were any of them basal cell carcinomas or BCC?

No

Don't know

H12H12

How many have you been told you've had?How many have you been told you've had?H13H13

H14H14 What year were you first told you had a BCC?What year were you first told you had a BCC?

H18 Yes

No

Don't know

Have you ever been told by a doctor that you've had any other type of cancer?Have you ever been told by a doctor that you've had any other type of cancer? No

Don't know

H19H19

H20 Yes

No

Don't know

Has a doctor ever told you that you have any other medical problems apart from cancer?

Has a doctor ever told you that you have any other medical problems apart from cancer?

No

Don't know

BreastBreast

BladderBladder

ColorectalColorectal

KidneyKidney

LungLung

LymphomaLymphoma

Ovarian Ovarian

PancreasPancreas

ThyroidThyroid

UterusUterus

LeukemiaLeukemia ProstateProstate

StomachStomach

What type of cancer were you told you had?What year were you told you had that cancer?

OtherOther

OtherOther

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

YearYear

AsthmaAsthma

H21 What medical conditions were they?

DiabetesDiabetes

Heart attackHeart attack

High blood pressureHigh blood pressure

StrokeStroke

DepressionDepression

OtherOther

OtherOther

OtherOther

OtherOther

I1 Do you have any comments, or other information that you think would be useful?Do you have any comments, or other information that you think would be useful?

I2 For the interviewer: do you have any comments regarding this interview?For the interviewer: do you have any comments regarding this interview?

End TimeEnd Time

Please check with participant whether or not they have done their blood sample yet, if they consented to give one.

If YES - please thank themIf NO - please let them know it would be greatly appreciated if they could do it ASAP.

Please see QxQ for details on types of blood forms.

328 Appendix G. Questionnaire

329APPENDIX H

Questionnaire Brochure

330

App

endixH

.Q

uestionnaireB

rochure

Appendix H. Questionnaire Brochure 331

332 Appendix H. Questionnaire Brochure

333APPENDIX I

MRsnphap R Package User Manual

Package ‘MRsnphap’

May 18, 2010

Type Package

Title A modeling framework for Mendelian Randomisation studies of single SNPsand haplotypes

Version 1.0.0

Date 2010-05-05

Depends MASS, stats, SimHap

Author Gemma Cadby

Maintainer Gemma Cadby <[email protected]>

Description MRsnphap is a package for modelling Mendelian Randomisation studies. Itcan perform single SNP and haplotype association analyses for continuousNormal outcomes measured in population-based samples. MRsnphap usesthe estimation maximisation techniques for inferring haplotypic phase, andincorporates a multiple-imputation approach to deal with the uncertainty ofimputed haplotypes in Mendelian Randomisation analyses.

License GPL2

R topics documented:

mrhap.quant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1mrsnp.quant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

mrhap.quant Mendelian Randomisation using Haplotypes as Instruments

Description

mrhap.quant is used for Mendelian Randomisation modelling of epidemiological data withcontinuous outcomes, where one or more haplotypes are used as instruments for somemodifiable exposure.

Usage

mrhap.quant(outcome, exposure, covariates = NULL, geneticmodel, pheno,

haplo, sim, effect="add",small = NULL)

1

Arguments

outcome a continuous outcome.

exposure a modifiable exposure

covariates a list of covariates in the form"x1+...+xn". Defaults to NULL.

geneticmodel the haplotype/s used as the instrument. May be coded as either additive,recessive or dominant.

haplo a haplotype object made by make.haplo.rare.

pheno a dataframe containing phenotype data.

sim the number of simulations from which to evaluate the results.

effect the haplotypic effect type: "add" for additive, "dom" for dominant and"rec" for recessive. Defaults to additive.

small a small sample correction. Defaults to NULL.

Details

If an instrument is considered weak, a warning file with the current date will be stored inthe working directory. Use getwd() to locate this directory.

Value

mrhap.quant returns an object of class MRhap.

The summary function can be used to obtain and print a summary of the results.

An object of class MRhap is a list containing the following components:

instruments formula containing instruments and covariates

structual structural formula containing the outcome regressed on exposure and co-variates

results a table containing the coefficients, standard errors, p-values and 95% con-fidence limits of the parameter estimates, averaged over the sim modelsperformed

empiricalResults

a list containing the coefficients, standard errors and p-values calculatedat each simulation.

observations number of observations

weakfstat mean F-Statistic for the first stage regression of exposure on instrument/s,averaged over the sim models performed

sarganp mean P-values for the Sargan test statistic, averaged over the sim mod-els performed. The sarganp will only be shown when there are moreinstruments than exposures

DWHp mean P-values for the Durbin-Wu-Haussman statistic, averaged over thesim models performed

effect the haplotypic effect modelled, ‘ADDITIVE’, ‘DOMINANT’ or ‘RECES-SIVE’.

2

Author(s)

Gemma Cadby

References

Carter, K.W., McCaskie, P.A., Palmer, L.J. (2008). SimHap GUI: An intuitive graphicaluser interface for genetic association analysis. BMC Bioinformatics 2008 Dec 25;9(1):557.

McCaskie, P.A., Carter, K.W, Hazelton, M., Palmer, L.J. (2007) SimHap: A comprehen-sive modeling framework for epidemiological outcomes and a simulation-based approach tohaplotypic analysis of population-based data, [online] www.genepi.org.au/simhap.

Baum, C.F., Schaffer, M.E., Stillman, S. (2007) Enhanced routines for instrumental vari-ables/GMM Estimation and testing, Stata Journal, 7 (4), 465-506.

See Also

mrsnp.quant

Examples

data(SNP.dat)

# convert SNP.dat to format required by infer.haplos

haplo.dat <- SNP2Haplo(SNP.dat)

data(pheno.dat)

# generate haplotype frequencies and haplotype design matrix

myinfer<-infer.haplos(haplo.dat)

# print haplotype frequencies generated by infer.haplos

myinfer$hap.freq

# generate haplo object where haplotypes with a frequency

# below min.freq are grouped as a category called "rare"

myhaplo<-make.haplo.rare(myinfer,min.freq=0.05)

mymodel.hap<- mrhap.quant(outcome="BT", exposure="VitD", covariates="AGE", geneticmodel="h.N1AA",

pheno=pheno.dat, haplo=myhaplo, sim=10, effect="add")

summary(mymodel.hap)

mrsnp.quant Mendelian Randomisation using Single SNPs as Instruments

Description

mrsnp.quant is used for Mendelian Randomisation modelling of epidemiological data withcontinuous outcomes, where one or more SNPs are used as instruments for some modifiableexposure.

3

Usage

mrsnp.quant(outcome, exposure, covariates = NULL, geneticmodel,

geno, pheno, small = NULL)

Arguments

outcome a continuous outcome.

exposure a modifiable exposure

covariates a list of covariates in the form"x1+...+xn". Defaults to NULL.

geneticmodel the SNP used as the instrument. May be coded as either additive, reces-sive or dominant.

geno a dataframe containing genotype data.

pheno a dataframe containing phenotype data.

small a small sample correction. Defaults to NULL.

Details

If an instrument is considered weak, a warning message will be printed.

Value

mrsnp.quant returns an object of class MRsnp.

The summary function can be used to obtain and print a summary of the results.

An object of class MRsnp is a list containing the following components:

instruments formula containing instruments and covariates

structual structural formula containing the outcome regressed on exposure and co-variates

results a table containing the coefficients, standard errors, p-values and 95% con-fidence limits of the parameter estimates

observations number of observations

weakfstat F-Statistic for the first stage regression of exposure on instrument/s

sarganstat Sargan test statistic for the overidentification test of all instruments. Thesarganstat will only be shown when there are more instruments thanexposures

sarganp P-value for the Sargan test statistic. The sarganp will only be shownwhen there are more instruments than exposures

DWHstat Durbin-Wu-Haussman statistic for the test of endogeneity.

DWHp P-value for the Durbin-Wu-Haussman statistic

residualerror Residual error of the model

Author(s)

Gemma Cadby

4

References

Carter, K.W., McCaskie, P.A., Palmer, L.J. (2008). SimHap GUI: An intuitive graphicaluser interface for genetic association analysis. BMC Bioinformatics 2008 Dec 25;9(1):557

McCaskie, P.A., Carter, K.W, Hazelton, M., Palmer, L.J. (2007) SimHap: A comprehen-sive modeling framework for epidemiological outcomes and a simulation-based approach tohaplotypic analysis of population-based data, [online] www.genepi.org.au/simhap.

Baum, C.F., Schaffer, M.E., Stillman, S. (2007) Enhanced routines for instrumental vari-ables/GMM Estimation and testing, Stata Journal, 7 (4), 465-506.

See Also

mrhap.quant

Examples

data(SNP.dat)

#convert SNP.dat to format required by mrsnp.quant

geno.dat<- SNP2Geno(SNP.dat, baseline=c("MM", "11", "GG", "CC"))

data(pheno.dat)

mymodel.snp<- mrsnp.quant(outcome="BT", exposure="VitD", covariates="AGE", geneticmodel="SNP_1_add",

geno=geno.dat, pheno=pheno.dat)

summary(mymodel.snp)

5

339APPENDIX J

Example output using mrsnp.quant

The output in this appendix was produced by modelling the association between

a prognostic measure of melanoma, Breslow thickness (BT), and Vitamin D levels

(VitD), adjusted for age, using the mrsnp.quant function in MRsnphap. The

instrument is a SNP which is modelled additively, and was produced by the

SNP2Geno function in SimHap.

mymodel.snp<- mrsnp.quant(outcome="BT", exposure="VitD", covariates="AGE",

geneticmodel="SNP_1_add", geno=geno.dat, pheno=pheno.dat)

summary(mymodel.snp)

Instruments:

~SNP_1_add + AGE

Structural Model:

BT ~ VitD + AGE

Coefficient Std.Error Lower 95% CI Upper 95% CI P.Value

(Intercept) 90.0988 29.8852 31.1194 149.078 0.00295 **

VitD 1.7064 25.2587 -48.1424 51.555 0.94621

AGE 0.7266 0.1326 0.4650 0.988 1.45e-07 ***

---

Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1

Number of Observations:

179

Weak Instrument F Statistic:

5.6199

Sargan F Statistic:

NA: equation exactly identified

Sargan P-Value:

NA: equation exactly identified

Durbin-Wu-Hausman Statistic:

0.15521

Durbin-Wu-Hausman P-Value:

0.69361

Residual Error:

19.141

The F-statistic for the weak instrument test (F=5.6199) indicates that the SNP

340 Appendix J. Example output using mrsnp.quant

is a weak instrument for Vitamin D and should not be used. The Durbin-Wu

Hausman p-value of 0.69361 indicates that Vitamin D should not be treated as

an endogeneous regressor.

341APPENDIX K

Example output using mrhap.quant

The output in this appendix was produced by modelling the association between

a prognostic measure of melanoma, Breslow thickness (BT), and Vitamin D levels

(VitD), adjusted for age, using the mrhap.quant function in MRsnphap. The in-

strument is a haplotype, h.N1AA, which is modelled additively, and was produced

by the haplo.dat and infer.haplos functions in SimHap.

The Mendelian randomisation model is run 100 times, and the average of the co-

efficients, statistics and p-values are reported.

mymodel.hap<- mrhap.quant(outcome="BT", exposure="VitD", covariates="AGE",

geneticmodel="h.N1AA", pheno=pheno.dat, haplo=myhaplo, sim=100, effect="add")

[1] "* Finding highest individual frequency ..."

[1] " Done"

[1] "* Populating individual haplotypes and posterior probabilities ..."

[1] " Done"

[1] "* Distributing individual occurrences across the simulations by posterior probability ..."

[1] " 25%"

[1] " 50%"

[1] " 75%"

[1] " 90%"

[1] " Done"

[1] "* Generating a random pattern of individuals for each simulation ..."

[1] " Done"

[1] "* Constructing datafiles and performing model for each simulation ..."

[1] " Done"

summary(mymodel.hap)

Instruments:

~h.N1AA + AGE

Structural Model:

BT ~ VitD + AGE

Coefficients:

Coefficient Std.error Lower 95% CI Upper 95% CI P.Value

(Intercept) 204.0052 580.5689 -941.7227 1349.733 0.1655

VitD -97.0130 502.4302 -1088.5376 894.511 0.4170

AGE 0.9529 1.1717 -1.3594 3.265 0.0815 .

---

Signif. codes: 0 *** 0.001 ** 0.01 * 0.05 . 0.1 1

Number of Observations:

180

Weak Instrument F Statistic:

342 Appendix K. Example output using mrhap.quant

[1] 1.4697

Sargan P-Value:

[1] NA

Durbin-Wu-Hausman P-Value:

[1] 0.25544

The F-statistic for the weak instrument test (F=1.4697) indicates that the h.N1AA

haplotype is a weak instrument for Vitamin D levels and should not be used. The

Durbin-Wu Hausman p-value of 0.25544 indicates that Vitamin D should not be

treated as an endogeneous regressor.