Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human...

12
Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 21 1 Michael A. Rogers, Lutz Langbein , Hermelita Winter, Iris Beckmann, Silke Praetzel, and Ju ¨ rgen Schweizer Section of Normal and Neoplastic Epidermal Differentiation and Division of Cell Biology, German Cancer Research Center, Heidelberg, Germany Analysis of the EBI/GeneBank database using nonhuman hair keratin associated protein (KAP) gene sequences as a query resulted in the identification of two human KAP gene domains on chromosome 21, one of which, located at 21q22.1, has recently been characterized. The second domain presented here, an approximately 90 kb domain on chromosome 21q23, harbored 16 KAP genes and two KAP pseudogenes. By comparison with known sheep and mouse KAP families, these genes could be assigned to two KAP families, KAP10 and KAP12, with the KAP10 family (12 members) being distinctly larger than the KAP12 family (four members). Systematic cDNA/3 0 rapid amplification of cDNA ends isolation studies using human scalp mRNA led to the identification of eight KAP10 and two KAP12 cDNA sequences. In situ hybridization analyses of human anagen hair follicles using specific 3 0 -noncoding sequences of the various KAP10/KAP12 genes revealed mRNA expression of nearly all KAP10 and KAP12 members exclusively in a narrow region of the middle portion of the hair fiber cuticle. Bioinformatic analyses of the promoter regions of the KAP10/KAP12 genes demonstrated several enhancer elements that were present in nearly all of the KAP genes. Primary among these were binding elements for the ETS, heat shock factor, AML, and HOX families of transcription factors. Key words: cDNA/gene expression/hair follicle/in situ hybridization/keratin. J Invest Dermatol 122:147 –158, 2004 In the past decade, significant advances have been made toward the characterization of human hair keratins and their associated proteins. These proteins make up the main structural components of the hair fiber. The human hair keratins consist of 15 members, which can be subdivided into two families, comprising the type I (acidic) and type II (basic/neutral) hair keratins (Rogers et al, 1998, 2000). Copolymers of individual type I and type II hair keratin members, which are differentially expressed in the hair fiber, make up the structural backbone of the 8–10 nm hair keratin intermediate filaments of trichocytes. The gene structures of all human hair keratins have been identified and character- ized, the nine members of the type I hair keratin genes being found on human chromosome 17q21.2 and the six members of the type II hair keratin genes on chromosome 12q13.3 (Rogers et al, 1998, 2000). In addition, the expression patterns of all human hair keratins have also been characterized, with specific type I and type II hair keratin proteins being found in the hair matrix as well as the cortex and the cuticle of the hair fiber (Langbein et al, 1999, 2001). Based on gene sequence characteristics, differential expression patterns, and proximity to each other, the type I and type II hair keratin gene domains could be divided into three gene groups (AC, type I hair keratins; A and C, type II hair keratins; Langbein et al, 1999, 2001). Located between the keratin intermediate filament bundles in the hair fiber is a largely amorphous mass con- taining the hair keratin associated proteins (KAP), which are assumed to be responsible for the lateral linkage of keratin intermediate filament bundles in trichocytes (Powell and Rogers, 1997). In humans, the number of individual KAPs appears to be very large (480, unpublished data). KAPs are currently divided into 23 families, the majority of which have been recently characterized in humans (Powell and Rogers, 1997; Rogers et al, 2001, 2002). KAPs can be subdivided into three groups based on their chemical properties: (1) The high sulfur KAPs (o30 mol % cysteine content) con- sisting of the KAP1KAP3, KAP10KAP16, and KAP23 families; (2) the ultrahigh sulfur KAP (430 mol % cysteine content) comprising the KAP4, KAP5, KAP9, and KAP17 families; and (3) the high glycine tyrosine KAP consisting of the KAP6KAP8 and KAP18KAP22 families. Recent stu- dies have characterized two domains of human high/ultrahigh sulfur and high glycine tyrosine KAP genes (Rogers et al, 2001, 2002). The first domain encompasses approximately 600 kb on chromosome 17q21.2 and harbors the members of the KAP1KAP3, KAP4, KAP9, KAP16, and KAP17 families, which are embedded inside the type I keratin gene cluster. The second domain, approximately 535 kb in size, is found on human chromosome 21q22.1 and contains the members of the KAP6KAP8 and KAP19KAP22 high glycine tyrosine KAP families, as well as the KAP11, KAP13, 1 In fond memory of Dr Peter Steinert. Abbreviations: ISH, in situ hybridization; KAP, hair keratin associated protein; RACE, rapid amplification of cDNA ends. Copyright r 2003 by The Society for Investigative Dermatology, Inc. 147

Transcript of Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human...

Page 1: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

Hair Keratin Associated Proteins: Characterization of a SecondHigh Sulfur KAP Gene Domain on Human Chromosome 211

Michael A. Rogers, Lutz Langbein�, Hermelita Winter, Iris Beckmann, Silke Praetzel,� and Jurgen SchweizerSection of Normal and Neoplastic Epidermal Differentiation and �Division of Cell Biology, German Cancer Research Center, Heidelberg, Germany

Analysis of the EBI/GeneBank database using nonhuman hair keratin associated protein (KAP) gene sequences as

a query resulted in the identification of two human KAP gene domains on chromosome 21, one of which, located at

21q22.1, has recently been characterized. The second domain presented here, an approximately 90 kb domain on

chromosome 21q23, harbored 16 KAP genes and two KAP pseudogenes. By comparison with known sheep and

mouse KAP families, these genes could be assigned to two KAP families, KAP10 and KAP12, with the KAP10 family

(12 members) being distinctly larger than the KAP12 family (four members). Systematic cDNA/30 rapid amplification

of cDNA ends isolation studies using human scalp mRNA led to the identification of eight KAP10 and two KAP12

cDNA sequences. In situ hybridization analyses of human anagen hair follicles using specific 30-noncoding

sequences of the various KAP10/KAP12 genes revealed mRNA expression of nearly all KAP10 and KAP12 members

exclusively in a narrow region of the middle portion of the hair fiber cuticle. Bioinformatic analyses of the promoter

regions of the KAP10/KAP12 genes demonstrated several enhancer elements that were present in nearly all of the

KAP genes. Primary among these were binding elements for the ETS, heat shock factor, AML, and HOX families of

transcription factors.

Key words: cDNA/gene expression/hair follicle/in situ hybridization/keratin.J Invest Dermatol 122:147 –158, 2004

In the past decade, significant advances have been madetoward the characterization of human hair keratins and theirassociated proteins. These proteins make up the mainstructural components of the hair fiber. The human hairkeratins consist of 15 members, which can be subdividedinto two families, comprising the type I (acidic) and type II(basic/neutral) hair keratins (Rogers et al, 1998, 2000).Copolymers of individual type I and type II hair keratinmembers, which are differentially expressed in the hair fiber,make up the structural backbone of the 8–10 nm hair keratinintermediate filaments of trichocytes. The gene structures ofall human hair keratins have been identified and character-ized, the nine members of the type I hair keratin genes beingfound on human chromosome 17q21.2 and the sixmembers of the type II hair keratin genes on chromosome12q13.3 (Rogers et al, 1998, 2000). In addition, theexpression patterns of all human hair keratins have alsobeen characterized, with specific type I and type II hairkeratin proteins being found in the hair matrix as well as thecortex and the cuticle of the hair fiber (Langbein et al, 1999,2001). Based on gene sequence characteristics, differentialexpression patterns, and proximity to each other, the type Iand type II hair keratin gene domains could be divided into

three gene groups (A�C, type I hair keratins; A and C, type IIhair keratins; Langbein et al, 1999, 2001).

Located between the keratin intermediate filamentbundles in the hair fiber is a largely amorphous mass con-taining the hair keratin associated proteins (KAP), which areassumed to be responsible for the lateral linkage of keratinintermediate filament bundles in trichocytes (Powell andRogers, 1997). In humans, the number of individual KAPsappears to be very large (480, unpublished data). KAPs arecurrently divided into 23 families, the majority of which havebeen recently characterized in humans (Powell and Rogers,1997; Rogers et al, 2001, 2002). KAPs can be subdividedinto three groups based on their chemical properties: (1)The high sulfur KAPs (o30 mol % cysteine content) con-sisting of the KAP1�KAP3, KAP10�KAP16, and KAP23families; (2) the ultrahigh sulfur KAP (430 mol % cysteinecontent) comprising the KAP4, KAP5, KAP9, and KAP17families; and (3) the high glycine tyrosine KAP consisting ofthe KAP6�KAP8 and KAP18�KAP22 families. Recent stu-dies have characterized two domains of human high/ultrahighsulfur and high glycine tyrosine KAP genes (Rogers et al,2001, 2002). The first domain encompasses approximately600 kb on chromosome 17q21.2 and harbors the membersof the KAP1�KAP3, KAP4, KAP9, KAP16, and KAP17families, which are embedded inside the type I keratin genecluster. The second domain, approximately 535 kb in size, isfound on human chromosome 21q22.1 and contains themembers of the KAP6�KAP8 and KAP19�KAP22 highglycine tyrosine KAP families, as well as the KAP11, KAP13,1In fond memory of Dr Peter Steinert.

Abbreviations: ISH, in situ hybridization; KAP, hair keratinassociated protein; RACE, rapid amplification of cDNAends.

Copyright r 2003 by The Society for Investigative Dermatology, Inc.

147

Page 2: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

KAP15, and KAP23 high sulfur KAP families. The mRNAexpression of many individual human KAP has beendescribed, the majority being found in the hair fiber cortexand to a lesser extent in the cuticle (Rogers et al, 2001,2002; Shimomura et al, 2002a, 2002b, 2003). In addition, anantibody recognizing the members of the human KAP1family has successfully been used to demonstrate KAPexpression at the protein level (Shimomura et al, 2002b).Moreover, several KAP family specific polymorphisms havebeen recently identified (Shimomura et al, 2002a, 2002b,2003).

This report, a continuation of our efforts to elucidatehuman KAP genes and to characterize their expression inthe human hair follicle, identifies 16 novel high sulfur KAPgenes as well as two KAP pseudogenes present on asecond locus of human chromosome 21. We show them tobe the orthologs of the single KAP10 and KAP12 familymembers found previously in other species, and demon-strate the mRNA expression of all of these KAP members ina narrow region of the hair fiber cuticle.

Results

KAP gene analysis The recent characterization of adomain of high/ultrahigh sulfur KAP genes on humanchromosome 17q21.2, as well as a further domain of highsulfur and high glycine tyrosine KAP genes on chromosome21q22.1, led us to continue our search for further humanorthologs of KAP genes previously described in otherspecies. One family of interest was the KAP12 family,whose single gene had been previously isolated during theinitial sequencing of the mouse genome (Cole and Reeves,1998). Using this sequence as a query, we analyzed thehuman GeneBank/EBI database. Several sequences of highhomology were found on a 90 kb region of humanchromosome 21q23, a region that had been initiallybioinformatically identified as a KAP-gene-containing do-main during the sequencing of chromosome 21 (see Fig 1and Table II; Hattori et al, 2000). Eighteen putative KAPgenes were found on the contiguous human genomic se-quences AP001754 and AL163300 (previously AP001755),which covered the region in question. Sixteen of thesesequences were determined to be intact genes based onthe completeness of their ORF. Nearly all of the genesequences possessed an upstream tataaa box-like struc-ture within 100 bp of the initiation codon. In addition,putative polyadenylation signals were found within a regionof 600 bp downstream of the end of the ORF. The putativegenes/pseudogenes were small in size (approximately 1 kb)and consisted of a single exon. The genes were separatedby 3.5–21 kb of intervening DNA sequence, and possessedno unified direction of transcription. Amino acid homologycomparisons of ORF translations of each putative KAP genewith known KAP proteins from other species and theirgraphical representation via evolutionary tree analysisidentified 12 proteins that were orthologs to sheepKAP10.1 (Powell and Rogers, 1997) and four that wereorthologs to mouse KAP12.1 (Cole and Reeves, 1998; seeFig 2). Direct amino acid comparisons of the human KAP10and KAP12 family members with their orthologs from sheep

and mouse (Figs 3, 4) further confirmed the familyclassification of these KAP. The KAP10 family membersappear to be unique due to the relatively large size of theirproteins (25.1–40.4 kDa, see Table II). The cysteine contentof the KAP10 family members (24.3–27.4 mol %) placesthem in the category of high sulfur KAP. Similar to all KAPfamilies described to date, the human KAP10 familypossesses largely conserved, family-specific, amino- andcarboxy-terminal amino acid sequences (MA(A/T)(S/C)TMS(V/I)(C/R)SS, amino-terminus; S(G/T)(Q/K)KSSC, car-boxy-terminus; see Fig 3). Several exceptions occur. Forexample, KAP10.4 and KAP10.12 possess shorter amino-terminal and carboxy-terminal sequences; KAP10.8 showsa larger amino-terminal and a shorter carboxy-terminalsequence; and KAP10.1 and KAP10.5 exhibit shortercarboxy-terminal sequences. The center of the KAP10protein members consists largely of varying amounts ofdouble cysteine containing pentameric repeats (boxed inFig 3). Interspersed between these repeats are severalregions of single cysteine-containing tetrameric/pentamericrepeat structures (Fig 3). The four KAP12 protein familymembers (KAP12.1�KAP12.4) are much smaller in size(9.7–14.7 kDa) and possess a slightly larger cysteinecontent (21.2–23.2 mol %) compared to the KAP10 familymembers (Table II). Like the other KAP families, the KAP12proteins possess conserved amino- and carboxy-terminalamino acid sequences (MCHTS(C/H)S(S/p)(G/A), amino-terminus; (C/W/Y)S(T/I/N/)(p/S)(S/T)(C/G)C, carboxy-termi-nus; see Fig 4). The repeat structures of the KAP12 familymembers consist of single cysteine-containing pentamericstructures.

KAP expression analysis 30-noncoding region PCRprobes derived by amplification of human genomic DNAwere used to screen an arrayed human scalp cDNA libraryaccording to previously described methods (Rogers et al,2001). This allowed the isolation of a total of 10 partial KAPcDNA clones encompassing eight members of the KAP10family (KAP10.1�KAP10.5, KAP10.7, KAP10.9, KAP10.10)and two members of the KAP12 family (KAP12.1, KAP12.2,see Table I). As all of these cDNA sequences wereincomplete, we tried to obtain full-length sequences bymeans of 30 RACE procedures using an upstream primerlocated in front of the initiation of translation codon of genesKAP10.7 and KAP12.2 (Table II). The 30 RACE productswere subcloned into the PCR cloning vector CR4.1 and alarge number of individual clones were analyzed. Themajority of these PCR clones gave incorrect products,most of them being incomplete KAP cDNA sequences.Three full-length 30 RACE products were also isolated,however: KAP12.1 and KAP12.2, found using the KAP12.2upstream oligonucleotide, as well as KAP10.10, obtained asa cross-hybridization product of the KAP10.7 oligonucleo-tide. All full-length sequences contained an intact ORF, amethionine initiation codon, and a polyadenylation signalfollowed by a poly A tail.

Previous studies had shown that weakly expressed KAPgenes were not always detected in our arrayed scalp cDNAlibrary. Therefore, in order to obtain information concerningthe region of expression of all the KAP10 and KAP12 familymembers, we subcloned all of the 30-noncoding region PCR

148 ROGERS ET AL THE JOURNAL OF INVESTIGATIVE DERMATOLOGY

Page 3: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

products generated for screening into the CR4.1 vector andused these plasmid constructs to obtain antisense cRNAprobes for in situ hybridization (see Table I). All of the in situhybridizations performed localized the mRNA of therespective KAP10 or KAP12 family members to a narrowregion of the hair fiber cuticle, lying approximately 20 celllayers above the apex of the dermal papilla (Figs 5, 6). As

the 30-noncoding regions of KAP10.2 and KAP10.9 as wellas KAP12.1 and KAP12.2 were almost identical, theexpression of these KAP genes could only be analyzedcollectively. No expression of any KAP member was seeneither in the matrix, lower cuticle, cortex, and medulla, orthe inner or outer root sheaths of the hair follicle.Furthermore, KAP expression in the interfollicular epidermis

Table I. PCR primers and isolated cDNA clones

KAP PCRProducts PCR primer sequence Size of PCR product

Annealingtemperature cDNAs found

cDNA accessionnumbers

bp C1

KAP10.1 ccccgtgtctcccctgtgct 160 62 DKFZp636 AJ566380gtcgagaagtgacccagag K1765Q4

KAP10.2 cross-hybridisation see KAP10.9 DKFZp636 AJ566381KAP10.9 G1021Q4

KAP10.3 ttccagggccatctctgacttc 186 59 DKFZp636 AJ566383ggggcaggacacagtgacc B1012Q4

KAP10.4 atttcctggtgtctgctgttga 206 58 DKFZp636 AJ556382gaaggctctgcggctctg K2422Q4

KAP10.5 acctggtcctgactcgggttag 190 60 DKFZp636 AJ566384agtgtcctggggcagagaac K2052Q4

KAP10.6 agatggggcaggctctttgtct 143? 59 NF cross-hybridization

tgcccagggtggaaccag KAP12.1

KAP10.7 ttctttacccttgacggctctc 232 59 DKFZp636 AJ566385cctcgtttctccccacacc M2324Q4

KAP10.8 ggtaagtggctgcccctacctg 190 60 NF cross-hybridization

cgaggccaagtgacccagag KAP10.1

KAP10.9 agcccacccagcctcagcaca 187 63 DKFZp636 AJ566386cgtccccaaccagcgaccag G0427Q4

KAP10.10 gcaggctctttgtcttgg 3-RACE –ttagaactagagggggtgctg

KAP10.11 gagcgcgtcacactttc 132 54 DSP –cagttcttctttgtcacattgg

KAP10.12 ctgctcaggccagaagtccag 256 61 DSP –caaggaggcgagaccacagg

KAP12.1 crosshybridization – – 3-RACE –KAP12.2

KAP12.2 acctgtacactccctggatg 201 55 3-RACE –gctggtttatttggctctcat

KAP12.3 acacctgtatccctccgtgaa 170 56 DSP –

KAP12.4 ggcctcctatgaccctcgta 191 58 DSP –tgaacccaggagaaactgtgt

cKAP10A tggctgacctcataccctaca 226 58 DSP –aaggccagtttattcattgtg

cKAP12A tgacagatccacctcccactg 161 54 DSP –ttggtcaggaaggcaaaggac

KAP 3-RACEproducts

50primer sequence size of product cDNA found cDNA accessionnumber

KAP 10.10 ttcactcactcacccactcact 1120 scl459 AJ566387

KAP12.1/KAP12.2

cactgccctctgcctcgacctt 604 scl460 AJ566388

761 scl458 AJ566389

SECOND CHROMOSOME 21 KAP GENE DOMAIN 149122 : 1 JANUARY 2004

Page 4: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

was also not found. Minor variations in the onset of KAPexpression were not discernible using the methods de-scribed above, and therefore cannot be excluded.

Because all of the KAP10 and KAP12 family membersshowed mRNA expression in a very limited region of the haircuticle, a bioinformatic analysis of the region 1 kb upstreamof the initiation of translation codon was performed in orderto identify enhancer elements common to all of the KAPgenes described here (Fig 7). Upon analysis, a large numberof infrequently found canonical enhancer elements weredetected (presented as black boxes in Fig 7) as well as eightelements that were present in nearly all of the promoterregions (colored boxes in Fig 7; see the legend for details).These were, based on their frequency of occurrence, theETS-1 enhancer element and the heat shock factorenhancer, which were present in all KAP promoter regions,as well as the binding elements for CTCF, Yy1, AML-1,MZF1, HOXC12/HOXC13, and TBP (see below).

Discussion

The recent studies of human KAP genes, initiating out ofnearly three decades of work by many laboratories on KAPgenes and proteins derived from sheep, mouse, rat, andrabbit (for an overview, see Powell and Rogers, 1997), haveresulted in the identification of a number of human KAPgenes orthologous to those found in these species(MacKinnon et al, 1990; Emonet et al, 1997; Perez et al,1999; Rogers et al, 2001, 2002; unpublished data). To date,complete gene sequence data are available for the humanKAP1�KAP4, KAP6�KAP9, KAP11, KAP13, and KAP15�KAP23 families. In addition, partial sequence data exist forthe human KAP5 family members. No data existed,however, for the human ortholog of sheep KAP10.1, whichwas originally presented in a review paper as an unpub-lished observation (Powell and Rogers, 1997; no databaseentry exists for this sequence), as well as for mKAP12.1, a

Table II. Genomic location and molecular characteristics of the KAP10/KAP12 family members: high sulfur KAP genes/

pseudogenes/proteins

KAP designationAccession no.

genomic sequences ORF StrandMolecular weightof protein (Da)

Cysteine residues(mol %)

KAP10.1 AP001574 254923�255771 � 28659 26.6

KAP10.2 AP001574 266312�267079 � 25615 27.1

KAP10.3 AP001574 273661�274336 � 22347 25.8

KAP10.4 AP001574 289381�290586 þ 40428 27.4

KAP10.5 AP001574 295385�296200 � 27625 26.9

KAP10.6 AP001574 307018�308115 � 36790 26.6

KAP10.7 AP001574 316278�317405 þ 37815 26.7

KAP10.8 AP001574 327774�328553 þ 26298 25.5

KAP10.9 AP001575 3846�2724 þ 30036 25.0

KAP10.0 AP001575 14092�14847 þ 25570 24.3

KAP10.11 AP001575 23133�24029 þ 30189 26.1

KAP10.12 AP001575 3874�74621 þ 25107 25.7

KAP12.4 AP001575 30950�131288 � 11432 23.2

KAP12.3 AP001575 34654�34944 þ 9946 22.9

KAP12.2 AP001575 43120�43160 � 14689 21.2

KAP12.1 AP001575 58505�58795 � 9737 21.9

cKAP12A AP001575 63033�63327 þ

cKAP10A AP001575 78892�79440 þ

c before a KAP gene designates a pseudogene.

Figure 1Physical map of the chromosome 21q23 KAP gene domain. Black lines show relevant portions of DNA sequences AP001754 and AL163300(AP001755), which harbor this domain. Black boxes represent KAP genes, white boxes KAP pseudogenes. Horizontal black arrows denote thedirection of gene transcription. The numbers below the boxes indicate the name of the respective KAP gene (e.g., 10.1¼KAP10.1). c before anumber designates a KAP pseudogene. Numbers in bold show KAP genes for which a corresponding cDNA/3-RACE product has been isolated.

150 ROGERS ET AL THE JOURNAL OF INVESTIGATIVE DERMATOLOGY

Page 5: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

mouse gene sequence whose human ortholog was shownby genomic hybridization to be present on chromosome21q23 (Cole and Reeves, 1998). Using the mKAP12.1 genesequence as a database query, it was possible to locate 12members of the KAP10 family and four members of theKAP12 family grouped together on two Human GenomeProject chromosome 21q23 DNA sequences (Hattori et al,2000). Similar to what was described in the original papercovering sheep KAP10.1 (Powell and Rogers, 1997), thehuman KAP10 family comprises the largest known KAPproteins, ranging in size from 22 to 40 kDa. Only oneadditional single KAP family member of similar size,KAP17.1, has recently been discovered on the chromosome17q21 KAP gene cluster (Rogers et al, 2001). KAP17.1 alsoshows the highest homology (approximately 50%) with theKAP10 protein family members. Compared to the KAP10family members, the KAP12 family members are muchsmaller (9.7–14.6 kDa), being similar in size to members ofthe cortically expressed KAP3 family (Rogers et al, 2001).Both the KAP10 and KAP12 family members not only

possess a fairly high cysteine content (Table II), but alsoexhibit a high serine and proline content similar to thecortically expressed KAP2, KAP3, and KAP9 family mem-bers (Rogers et al, 2001). Moreover, the KAP10 familyexhibits similarities in repeat structures with members of theKAP1, KAP2, KAP4, and KAP9 families. All of these KAPcontain di-cysteine repeat structures containing, in addition,proline frequently in position four and either a hydrophobicor weakly charged hydroxy-amino acid residue in positionfive. Previously described higher ordered repeat structures(28 and 42 amino acid repeats, Powell and Rogers, 1997)could not be found in the human KAP10 members.

Novel, when compared to the expression patterns of therecently described KAP genes on chromosomes 17q21.2and 21q22.1, is the restriction of mRNA expression of allKAP10 and KAP12 genes to the hair fiber cuticle, where itoccurs in a region located well above the apex of the dermalpapilla. This site of expression, in terms of distance alongthe hair shaft from the dermal papilla, corresponds to theexpression site of several cortical KAP families and the

Figure 2Division of human KAP into families via CLUSTREE analysis. Multiple amino acid alignments of the KAP identified in this paper were performedtogether with representative members of the human KAP1�KAP5, KAP9�KAP13, and KAP15�KAP17 as well as KAP family members from otherspecies using the CLUSTAL program. Graphical analyses of the homology data were performed using CLUSTREE. Names in black show previouslydescribed human KAP family members characterized in this laboratory; names in violet designate previously described human KAP5 familymembers; names in red indicate mouse KAP sequences; names in blue designate sheep KAP sequences. The CLUSTAL alignment allowed thedivision of KAP proteins into families, but was not statistically significant enough for determination of paralogous evolutionary relationships.�KAP23.1 did not segregate correctly. Sequence accession numbers: mouse KAP9.1 (m27685), mKAP11.1 (uo3686), hacl1 (mKAP12.1, af081797),mEMB-UHSP (mKAP13.1, af031485), mKAP14.1 (d85925), sheep KAP4.1 (x73462), sKAP5.1 (x55294), sKAP5.4 (x73434), sKAP5.5 (x73435),sKAP10.1 found in Powell and Rogers (1997), human KAP5 members KERA (aj006692), KERB (aj006693), and UHSP3 (x63755).

SECOND CHROMOSOME 21 KAP GENE DOMAIN 151122 : 1 JANUARY 2004

Page 6: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

152 ROGERS ET AL THE JOURNAL OF INVESTIGATIVE DERMATOLOGY

Page 7: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

group A hair keratins in the keratogenous zone of the hairfollicle (Langbein et al, 1999, 2001; Rogers et al, 2001,2002). Also surprising is the limited region in which thecuticular gene expression occurs. In general, very fewproteins have been identified that possess an exclusivelycuticular location. Hair keratins hHa2 and hHb2, twomembers of the group C hair keratin proteins, were shownto exhibit a cuticular expression pattern, which started,however, much lower in the hair bulb, beginning in thematrix/precortex region (Langbein et al, 1999, 2001). More-over, the mRNA of a human KAP gene termed hKAP5.1 inthis report (see Fig 2), which belongs to the ultrahigh sulfurKAP5 gene family identified by homology analysis to sheepKAP5 family members (MacKinnon et al, 1990; Emonet et al,1997; Perez et al, 1999), was shown to be specificallyexpressed in the cuticle of the hair follicle, apparently closeto the narrow expression region of the KAP10 and KAP12family members described here (MacKinnon et al, 1990).Similarly, S100A3, a likewise cysteine-rich, calcium-bindingprotein, considered to be a member of the epithelial keratinassociated protein family due to the localization of its genewithin the ‘‘epidermal differentiation complex’’ on humanchromosome 1q21, was shown by both in situ hybridizationand immunogold electron microscopy to be expressed inthe so-called endocuticular region of the hair cuticle(Kizawa et al, 1996, 2002; Takizawa et al, 1999). As noneof the expression studies of S100A3 showed longitudinalsections of the hair follicle, however, it was not possible to

discern where the expression of this protein initiates.Collectively, these data suggest the presence of about 20different KAP proteins in a narrow area of the cuticle. Thisprobably results in a strong crosslinking of hair keratinshHa2/hHb2 and hHa5/hHb5 expressed in this region(Langbein et al, 1999, 2001). Further work is necessary,however, to elucidate in which subcellular compartment ofthe hair cuticle these proteins are located.

The surprising site-specific expression of all KAP10 andKAP12 family members in the hair cuticle led us to analyzethe region 1 kb upstream of the ORF of all KAP genesdescribed here, in order to ascertain if a distinct pattern ofenhancer elements were present, which might be specificfor cuticular KAP gene expression and thus provide astarting point for future promoter characterizations. Ouranalyses showed that two elements, the ETS-1 enhancerand the heat shock factor enhancer, were present in allpromoter regions analyzed. In addition, we identified sixfurther binding elements for AML1 (Wheeler et al, 2000),TBP (Green, 2000), Yy1 (Thomas and Seto, 1999), CTCF(Bell and Felsenfeld, 1999), MZF1 (Hromas et al, 1991), andHOXC12/HOXC13 (Awgulewitsch, 2003), which were foundin the majority of the KAP10/KAP12 promoter regions. Theposition of TBP, the canonical binding protein for TFIIDmediated transcription, is assumed to demarkate thepromoter regions for the KAP genes analyzed in this study.

One of the most interesting binding elements occurring inall KAP10 and KAP12 promoter regions was ETS, which

Figure 4Multiple sequence alignment of KAP12 family members. For details, see Fig 3. An asterisk beside one protein name indicates mouse KAP12.1(Cole and Reeves, 1998).

Figure 3Multiple sequence alignments of KAP10 family members. The amino acid sequences shown are ORF translations of the respective genesequences. Multialignments were constructed using the CLUSTAL program. The asterisk beside one protein name designates sheep KAP10.1(Powell and Rogers, 1997). Asterisks below the amino acid sequence indicate amino acid identity; dots below the sequences show amino acidhomology; hyphens found within the sequences are gaps induced by the CLUSTAL software program. Sequences in bold show KAP proteins forwhich the respective cDNA has been isolated. Repeat structures are boxed.

SECOND CHROMOSOME 21 KAP GENE DOMAIN 153122 : 1 JANUARY 2004

Page 8: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

comprises a large family of transcription factors (approxi-mately 15 members in humans, Li et al, 2000; Sharrocks,2001). Recently, ETS-1 and ETS-2 expression has beenfound in the epidermis (Andreoli et al, 1997). Interestingly,ETS-2 partially rescued knockout mice exhibit wavy hair,curly whiskers, and abnormal hair follicle shape associatedwith incorrect spatial arrangement (Yamamoto et al, 1998).

ETS-1 is generally present in an inactive conformationalform that can be lifted by physical association with AML1(Kim et al, 1999), a transcription factor whose bindingelement was found in 10 of the 16 KAP10/KAP12 promoterregions. The members of the AML family associate with avariety of cofactors, some of which also modulate theexpression of LEF1, a transcription factor crucial for hair

Figure5Regions of mRNA expression of KAP10family members. The numbers in the upperleft-hand corner of each panel designate thename of the individual KAP members. co,cortex; cu, hair fiber cuticle; icu, inner rootsheath cuticle; dp, dermal papilla. Bar: 150 mm.

154 ROGERS ET AL THE JOURNAL OF INVESTIGATIVE DERMATOLOGY

Page 9: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

follicle morphogenesis (Zhou et al, 1995; Bruhn et al, 1997;Levanon et al, 1998; Wheeler et al, 2000). Although LEF1binding sites have been found previously in several hairkeratin and KAP promoter regions (Zhou et al, 1995), theywere absent from the promoters of the KAP10/KAP12genes. A further element found in all KAP10/KAP12promoter regions was the heat shock factor, generallyresponsible for gene activation following cellular responseto stress or temperature change (Sonna et al, 2002). Micepossessing a homozygous deletion of the HSF1 genedevelop normally, however, and possess a phenotypicallynormal hair coat (Xiao et al, 1999). Although the epidermisand its appendages would appear as appropriate targets forthe heat shock response, very little is known about the roleof heat shock factor in this tissue. Further factors havingimportance in hair fiber protein expression are the Hoxfamily members HOXC12 and HOXC13 (Awgulewitsch,2003). In this study, binding elements for HOXC12/HOXC13could be found in the promoter regions of 12 members ofthe KAP10/KAP12 family. Mice exhibiting overexpression ofHOXC13 show dramatic changes in both hair keratin andKAP mRNA expressions, and the phenotype in HOXC13knockout mice is also consistent with alterations in hairkeratins or KAP (Godwin and Capecchi, 1998; Tkatchenkoet al, 2001).

Methods

Identification of KAP10 and KAP12 gene family members Theexperiments shown in this paper have been performed with theapproval of the German Cancer Research Center. Patient consentfor the use of genomic DNA or hair scalp tissue was obtained.

Using the mouse KAP12.1 gene sequence as a query, theGeneBank/EBI database was analyzed using the BLASTN2program (Cole and Reeves, 1998). This resulted in the identificationof a second domain of human KAP genes on chromosome 21q23,which were found on the DNA sequence accession numbersAP001754 and AL163300 (previously termed AP001755, Hattoriet al, 2000). Further homology analyses of these sequences wereperformed using the SIMILARITY program. Regions of highhomology to the previously described KAP genes containing anintact open reading frame (ORF) and also possessing a putativepolyadenylation signal within 600 bp downstream of the ORF wereconsidered putative KAP genes. Amino acid translations of theORF were performed and grouped into families using the programCLUSTAL and graphically displayed using the CLUSTREE programas previously described (Rogers et al, 2001). Analyses for aminoacid repeat structures were performed using the DOTPLOTprogram. All programs are part of the Heidelberg UNIX SequenceAnalysis Resource (HUSAR). For KAP promoter gene analysis, theregion 1 kb upstream of the initiation of translation codon wasanalyzed using the software program PATCH of the TRANSFACdatabase (Wingender et al, 1997) as found on the website ofBIOBASE. Only mammalian consensus enhancer sequences wereused in the analysis, and one mismatch was allowed. Detailedinformation on the enhancer elements presented and the proteinsto which they bind can be found on the BIOBASE website.HOXC12/HOXC13 enhancer elements were analyzed manuallyusing consensus enhancer sequences determined in a previousstudy (Jave-Suarez et al, 2002). All consensus elements thatscored positive in the analysis are presented in Fig 7.

Isolation of KAP cDNA clones 30 noncoding region probes fromeach KAP gene were PCR amplified (Expand Long PCR System,Roche, Mannheim, Germany) with specific oligonucleotide primersusing conditions shown in Table I. The PCR products were purified(DNA Gel Extraction Kit, Roche) and used to screen an arrayedhuman scalp cDNA library according to previously describedprocedures (Rogers et al, 2001). In addition, the DNA probes were

Figure 6Regions of mRNA expression of KAP12 familymembers. For details see Fig 5. (A), (B) Enlarge-ments of KAP12.2 and KAP12.4, showing theabsence of staining in the inner root sheathcuticle. Bar: (A) 150 mm; (B) 50 mm.

SECOND CHROMOSOME 21 KAP GENE DOMAIN 155122 : 1 JANUARY 2004

Page 10: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

subcloned into the PCR cloning vector pCR4.1 (Invitrogen,Karlsruhe, Germany) for use as templates to create antisense 35ScRNA probes for in situ hybridization (ISH).

30 rapid amplification of cDNA ends (RACE) analysis Total RNAwas isolated from freshly plucked anagen hair follicles using theRNeasy system (Qiagen, Hilden, Germany). cDNA synthesis and 30

RACE were performed using a 30 RACE kit (Qiagen), according to

the manufacturer’s instructions, in combination with 50 primersfrom KAP10.7 and KAP10.2 (see Table I). The isolated PCRproducts were subcloned into cloning vector CR4.1 (Invitrogen)and sequenced.

DNA sequencing The isolated cDNA clones were sequenced byfluorescent dye terminator cycle sequencing (Big Dye SequencingKit, Applied Biosystems, Darmstadt, Germany) and analyzed on an

Figure 7Bioinformatic analysis of KAP10 andKAP12 promoter regions. The regions 1kb upstream of the ORF of all KAP10/KAP12 genes described here were ana-lyzed for consensus enhancer sequen-ces using the TRANSFAC database (seeMethods; Wingender et al, 1997). Thehorizontal black scale at the top showsthe distance from the initiation of transla-tion codon in base pairs. Black linesdenote the region of individual geneanalysis. Boxes above the line are posi-tive strand elements; boxes below theline are negative strand elements. Or-ange boxes, TBP enhancer; green boxes,ETS-1 enhancer; light blue boxes, CTCFenhancer; yellow boxes, AML1 enhancer;violet boxes, MZF1 enhancer; red boxes,heat shock factor enhancer; white boxes,HOXC12/HOXC13 enhancer. Blackboxes indicate further enhancer ele-ments that are named in the graphic.

156 ROGERS ET AL THE JOURNAL OF INVESTIGATIVE DERMATOLOGY

Page 11: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

ABI310 capillary DNA sequencing apparatus (Rogers et al, 2000).Assembly and correction of the DNA sequences were performedusing the STADEN program.

ISH Using the subcloned 30 noncoding region PCR probesdescribed above and shown in Table I, ISH was performed onhuman scalp (material kindly provided by M. Deb, Mannheim,Germany) or plucked beard hair follicle cryosections as previouslydescribed (Langbein et al, 1993; Rogers et al, 1997). Antisense 35ScRNA transcripts of the subcloned DNA were used as probes fordetection of the respective KAP mRNA. Visualizations of the ISHsignals were performed using a confocal laser scanning micro-scope (LSM510, Carl Zeis, Jena/Oberkochen). Simultaneousvisualization of reflected ISH signals derived from epi-illuminationwere combined with transmitted light images in bright field by anoverlay and were assigned pseudocolors (transmission image ingreen, electronically changed to black/white; reflection image (ISHsignal) shown in red).

Accession numbers (KAP cDNA sequences) The accessionnumbers were as follows: KAP10.1, AJ566380; KAP10.2,AJ566381; KAP10.3, AJ566383; KAP10.4, AJ566382; KAP10.5,AJ566384, KAP10.7, AJ566385; KAP10.9, AJ566386; KAP10.10,AJ566387; KAP12.1, AJ566388; KAP12.2, AJ566389.

We thank Dr Herbert Spring (German Cancer Research Center) forsupport with confocal microscopy. The majority of the cDNA clonesisolated in this study can be obtained from the German HumanGenome Resource Center (RZPD, Berlin, Germany) under the namesshown in Table I. We thank them for their support.

DOI: 10.1046/j.0022-202X.2003.22128.x

Manuscript received July 8, 2003; revised August 22, 2003; acceptedfor publication August 25, 2003

Address correspondence to: Dr Michael A. Rogers, Research ProgramA (A145), German Cancer Research Center, Im Neuenheimer Feld 280,69120 Heidelberg, Germany. Email: [email protected]

References

Andreoli JM, Jang SI, Chung E, Cottichia CM, Steinert PM, Markova NG: The

expression of a novel, epithellium-specific ets transcriptor factor is

restricted to the most differentiated layers in the epidermis. Nucleic Acids

Res 25:4287–4295, 1997

Awgulewitsch A: Hox in hair growth and development. Naturwissenschaften

90:193–211, 2003

Bell AC, Felsenfeld G: Stopped at the border: Boundaries and insulators. Curr

Opin Genet Dev 9:191–198, 1999

Bruhn L, Munnerlyn A, Grosschedl R: ALY, a context-dependent coactivator of

LEF-1 and AML-1, is required for TCRa enhancer function. Genes Dev

11:640–653, 1997

Cole SE, Reeves RH: A cluster of keratin-associated proteins on mouse

chromosome 10 in the region of conserved linkage with human

chromosome 21. Genomics 54:437–442, 1998

Emonet N, Michaille JJ, Dhouailly D: Isolation and characterization of genomic

clones of human sequences presumably coding for hair cysteine-rich

proteins. J Dermatol Sci 14:1–11, 1997

Godwin AR, Capecchi MR: Hoxc13 mutant mice lack external hair. Genes Dev

12:11–20, 1998

Green MR: TBP-associated factors (TAFIIs): Multiple, selective transcriptional

mediators in common complexes. Trends Biochem Sci 25:59–63, 2000

Hattori M, Fujiyama A, Taylor TD, et al: The DNA sequence of human

chromosome 21. The chromosome 21 mapping and sequencing

consortium. Nature 405:311–319, 2000

Hromas R, Collins SJ, Hickstein D, et al: A retinoic acid-responsive human zinc

finger gene, MZF-1, preferentially expressed in myeloid cells. J Biol Chem

266:14183–14187, 1991

Jave-Suarez LF, Winter H, Langbein L, Rogers MA, Schweizer J: HOXC13 is

involved in the regulation of human hair keratin gene expression. J Biol

Chem 277:3718–3726, 2002

Kim WY, Sieweke M, Ogawa E, Wee HJ, Englmeier U, Graf T, Ito Y: Mutual

activation of Ets-1 and AML1 DNA binding by direct interaction of their

autoinhibitory domains. EMBO J 18:1609–1620, 1999

Kizawa K, Uchiwa H, Murakami U: Highly-expressed S100A3, a calcium-binding

protein, in human hair cuticle. Biochim Biophys Acta 1312:94–98, 1996

Kizawa K, Troxler H, Kleinert P, Inoue T, Toyoda M, Morohashi M, Heizmann CW:

Characterization of the cysteine-rich calcium-binding S100A3 protein

from human hair cuticles. Biochem Biophys Res Commun 299:857–862,

2002

Langbein L, Heid HW, Moll I, Franke WW: Molecular characterization of the body

site-specific human epidermal cytokeratin 9: cDNA cloning, amino acid

sequence, and tissue specificity of gene expression. Differentiation

55:57–71, 1993

Langbein L, Rogers MA, Winter H, Praetzel S, Beckhaus U, Rackwitz H-R,

Schweizer J: The catalog of human hair keratins: I. Expression of the nine

type I members in the hair follicle. J Biol Chem 274:19874–19884, 1999

Langbein L, Rogers MA, Winter H, Praetzel S, Schweizer J: The catalog of human

hair keratins. II. Expression of the six type II members in the hair follicle

and the combined catalog of human type I and II keratins. J Biol Chem

276:35123–35132, 2001

Levanon D, Goldstein RE, Bernstein Y, et al: Transcriptional repression by AML1

and LEF-1 is mediated by the TLE/Groucho corepressors. Proc Natl Acad

Sci USA 95:11590–11595, 1998

Li R, Pei H, Watson DK: Regulation of Ets function by protein–protein interactions.

Oncogene 19:6514–6523, 2000

MacKinnon PJ, Powell BC, Rogers GE: Structure and expression of genes for a

class of cysteine-rich proteins of the cuticle layers of differentiating wool

and hair follicles. J Cell Biol 111:2587–2600, 1990

Perez C, Auriol J, Gerst C, Bernard BA, Egly JM: Genomic organization and

promoter characterization of two human UHS keratin genes. Gene

227:137–148, 1999

Powell BC, Rogers GE: The role of keratin proteins and their genes in the growth,

structure and properties of hair. In: Jolles P, Zahn H, Hocker H, eds.

Formation and Structure of Human Hair. Basel: Birkhauser, 1997,

pp 59–148

Rogers MA, Langbein L, Praetzel S, Moll I, Krieg T, Winter H, Schweizer J:

Sequences and differential expression of three novel human type-II hair

keratins. Differentiation 61:187–194, 1997

Rogers MA, Winter H, Wolf C, Heck M, Schweizer J: Characterization of a 190-

kilobase pair domain of human type I hair keratin genes. J Biol Chem

273:26683–26691, 1998

Rogers MA, Winter H, Langbein L, Wolf C, Schweizer J: Characterization of a 300

kbp region of human DNA containing the type II hair keratin gene domain.

J Invest Dermatol 114:464–472, 2000

Rogers MA, Langbein L, Winter H, Ehmann C, Praetzel S, Korn BSJ:

Characterization of a cluster of human high/ultrahigh sulfur keratin

associated protein (KAP) genes imbedded in the type I keratin gene

domain on chromosome 17q12-21. J Biol Chem 276:19440–19451, 2001

Rogers MA, Langbein L, Winter H, Ehmann C, Praetzel S, Schweizer J:

Characterization of a first domain of human high glycine-tyrosine and

high sulfur keratin-associated protein (KAP) genes on chromosome

21q22.1. J Biol Chem 277:48993–49002, 2002

Sharrocks AD: The ETS-domain transcription factor family. Nat Rev Mol Cell Biol

2:827–837, 2001

Shimomura Y, Aoki N, Rogers MA, Langbein L, Schweizer J, Ito M: hKAP1.6 and

hKAP1.7, two novel human high sulfur keratin-associated proteins are

expressed in the hair follicle cortex. J Invest Dermatol 118:226–231,

2002a

Shimomura Y, Aoki N, Schweizer J, Langbein L, Rogers MA, Winter H, Ito M:

Polymorphisms in the human high sulfur hair keratin associated protein

(KAP) 1 gene family. J Biol Chem, 277: 45493–45501, 2002b

Shimomura Y, Aoki N, Rogers MA, Langbein L, Schweizer J, Ito M:

Characterization of human keratin-associated protein 1 family members.

J Invest Dermatol Symp Proc The 8:96–99, 2003

Sonna LA, Fujita J, Gaffin SL, Lilly CM: Invited review. Effects of heat and cold

stress on mammalian gene expression. J Appl Physiol 92:1725–1742,

2002

Takizawa T, Takizawa T, Arai S, Kizawa K, Uchiwa H, Sasaki I, Inoue T:

Ultrastructural localization of S100A3, a cysteine-rich, calcium binding

protein, in human scalp hair shafts revealed by rapid-freezing immuno-

cytochemistry. J Histochem Cytochem 47:525–532, 1999

Thomas MJ, Seto E: Unlocking the mechanisms of transcription factor YY: Are

chromatin modifying enzymes the key? Gene 236:197–208, 1999

Tkatchenko AV, Visconti RP, Shang L, et al: Overexpression of Hoxc13 in

differentiating keratinocytes results in downregulation of a novel hair

keratin gene cluster and alopecia. Development 128:1547–1558, 2001

SECOND CHROMOSOME 21 KAP GENE DOMAIN 157122 : 1 JANUARY 2004

Page 12: Hair Keratin Associated Proteins: Characterization of a Second High Sulfur KAP Gene Domain on Human Chromosome 211

Wheeler JC, Shigesada K, Gergen JP, Ito Y: Mechanisms of transcriptional

regulation by Runt domain proteins. Semin Cell Dev Biol 11:369–375,

2000

Wingender E, Kel AE, Karas H, et al: Transfac, TRDD and Compel: Toward a

federated database system of transcriptional regulation. Nucl Acids Res

25:265–268, 1997

Xiao X, Zuo X, Davis AA, McMillan DR, Curry BB, Richardson JA, Benjamin IJ:

HSF1 is required for extra-embryonic development, postnatal growth and

protection during inflammatory responses in mice. EMBO J 18:5943–

5952, 1999

Yamamoto H, Flannery ML, Kupriyanov S, et al: Defective trophoblast function in

mice with a targeted mutation of Ets2. Genes Dev 12:1315–1326, 1998

Zhou P, Byrne C, Jacobs J, Fuchs E: Lymphoid enhancer factor 1 directs hair

follicle patterning and epithelial cell fate. Genes Dev 9:700–713, 1995

158 ROGERS ET AL THE JOURNAL OF INVESTIGATIVE DERMATOLOGY