ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to...

13
1 ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 2012-01-24 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation ะœะตะถะดัƒะฝะฐั€ะพะดะฝะฐั ะพั€ะณะฐะฝะธะทะฐั†ะธั ะฟะพ ัั‚ะฐะฝะดะฐั€ั‚ะธะทะฐั†ะธะธ Doc Type: Working Group Document Title: Proposal for encoding the Old Permic script in the SMP of the UCS Source: UC Berkeley Script Encoding Initiative (Universal Scripts Project) Authors: Michael Everson Status: Liaison Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Replaces: N1687 (1998-01-18), N1947 (1999-01-29) Date: 2012-01-24 1. Introduction. The Old Permic script was devised in the 14th century by Russian missionary Stefan of Perm (fl. 1373โ€“1395), and was used to write the Uralic languages Komi and Komi-Permyak. Its model was based on the Greek and Cyrillic alphabets (as can be seen from the order of the letters) but many of the glyphs were taken from the โ€œTamga signsโ€, used in indigenous Komi religious practices; in this way Stefan made a link between traditional signs, which had much to do with the self-identity of the Komi, and Christian traditions. In addition to writing Komi, the script was used cryptographically to write Russian, since it was unknown to most readers of Russian; this explains a number of the extensions made at the end of the alphabet. 2. Processing. Old Permic is a simple, caseless, alphabetic script, read from left to right in horizontal lines running from top to bottom. Several combining diacritics are attested in use in Old Permic. In the table, EF U+1036B and HA U+1036C are given with glyphs which are not the most commonly found ones. More common representation of the corresponding phonetic entities makes use of combining characters: is represented by U+1035F PEEI + ยจ and is represented by U+103FA KOKE + ยจ. Note the similarity of and to Greek ฯ‰ and ฯ‡. Five combining characters may be found with Old Permic characters: COMBINING DIAERESIS, COMBINING DOT ABOVE, COMBINING BREVE, COMBINING COMMA ABOVE, and COMBINING THREE DOTS ABOVE (providing that U+20DB can be used with letters; otherwise a new combining character should be added). The use of diacritical marks is peculiar in Old Permic. Sometimes it appears that the diacritic is semantically null, analogous to the use of a breve in German handwriting (especially the angular handwritten script Fraktur) over lower-case u to differentiate it from n (that is, where u ห˜ is not really supposed to be u + breve and would not be so encoded in text). Sometimes, particularly in word-initial position, vowel quality or consonant palatalization may be marked, apparently indifferently, by these diacritics. There may be more information on this in Lytkin 1952 but the Russian is slow going for me. Characters like COMBINING CYRILLIC TITLO may be used, e.g. to indicate abbreviation or numeric use of a character (see sample texts below). 3. Script name. Stipa calls the script โ€œPermischโ€; Pillu uses the terms โ€œvanapermiโ€, โ€œะดั€ะตะฒะฝะตะฟะตั€ะผัะบะธะนโ€, and โ€œOld-Permianโ€ in Estonian, Russian, and English respectively. Haarmann calls it โ€œAburโ€ and โ€œAltsyrjรคnischโ€ (= โ€˜Old Komiโ€™); โ€œAburโ€ is a name derived from the names of the first two characters AN and BUR (so - is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other script names in the UCS, as well as consistent with Pilluโ€™s terminology.

Transcript of ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to...

Page 1: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

1

ISO/IEC JTC1/SC2/WG2 N4177L2/12-0252012-01-24

Universal Multiple-Octet Coded Character SetInternational Organization for StandardizationOrganisation Internationale de Normalisation

ะœะตะถะดัƒะฝะฐั€ะพะดะฝะฐั ะพั€ะณะฐะฝะธะทะฐั†ะธั ะฟะพ ัั‚ะฐะฝะดะฐั€ั‚ะธะทะฐั†ะธะธ

Doc Type: Working Group DocumentTitle: Proposal for encoding the Old Permic script in the SMP of the UCSSource: UC Berkeley Script Encoding Initiative (Universal Scripts Project)Authors: Michael EversonStatus: Liaison ContributionAction: For consideration by JTC1/SC2/WG2 and UTCReplaces: N1687 (1998-01-18), N1947 (1999-01-29)Date: 2012-01-24

1. Introduction. The Old Permic script was devised in the 14th century by Russian missionary Stefan of Perm (fl.1373โ€“1395), and was used to write the Uralic languages Komi and Komi-Permyak. Its model was basedon the Greek and Cyrillic alphabets (as can be seen from the order of the letters) but many of the glyphswere taken from the โ€œTamga signsโ€, used in indigenous Komi religious practices; in this way Stefanmade a link between traditional signs, which had much to do with the self-identity of the Komi, andChristian traditions. In addition to writing Komi, the script was used cryptographically to write Russian,since it was unknown to most readers of Russian; this explains a number of the extensions made at theend of the alphabet.

2. Processing. Old Permic is a simple, caseless, alphabetic script, read from left to right in horizontallines running from top to bottom. Several combining diacritics are attested in use in Old Permic. In thetable, ๐ซ EF U+1036B and ๐ฌ HA U+1036C are given with glyphs which are not the most commonly foundones. More common representation of the corresponding phonetic entities makes use of combiningcharacters: ๐ท is represented by U+1035F ๐Ÿ PEEI + ยจ and ๐ธ is represented by U+103FA ๐š KOKE + ยจ. Notethe similarity of ๐ซ and ๐ฌ to Greek ฯ‰ and ฯ‡. Five combining characters may be found with Old Permiccharacters: COMBINING DIAERESIS, COMBINING DOT ABOVE, COMBINING BREVE, COMBINING COMMA ABOVE,and COMBINING THREE DOTS ABOVE (providing that U+20DB can be used with letters; otherwise a newcombining character should be added). The use of diacritical marks is peculiar in Old Permic. Sometimesit appears that the diacritic is semantically null, analogous to the use of a breve in German handwriting(especially the angular handwritten script Fraktur) over lower-case u to differentiate it from n (that is,where u is not really supposed to be u + breve and would not be so encoded in text). Sometimes,particularly in word-initial position, vowel quality or consonant palatalization may be marked, apparentlyindifferently, by these diacritics. There may be more information on this in Lytkin 1952 but the Russian isslow going for me. Characters like COMBINING CYRILLIC TITLO may be used, e.g. to indicate abbreviationor numeric use of a character (see sample texts below).

3. Script name. Stipa calls the script โ€œPermischโ€; Pillu uses the terms โ€œvanapermiโ€, โ€œะดั€ะตะฒะฝะตะฟะตั€ะผัะบะธะนโ€,and โ€œOld-Permianโ€ in Estonian, Russian, and English respectively. Haarmann calls it โ€œAburโ€ andโ€œAltsyrjรคnischโ€ (= โ€˜Old Komiโ€™); โ€œAburโ€ is a name derived from the names of the first two characters AN

and BUR (so ๐๐-๐‘๐ž๐  is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with otherscript names in the UCS, as well as consistent with Pilluโ€™s terminology.

Page 2: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

4. Character names and repertoire. The names used for the characters here are based on the namesgiven in Lytkin 1952. The name SHCHOOI is not attested but has been modeled on SHOOI on the analogy ofRussian SHA and SHCHA.

5. Numerals. Script-specific numerals are not known. Letters of the alphabet can be marked withCYRILLIC COMBINING TITLO to indicate numeric use.

6. Punctuation. In the manuscript spaces are used to separate words; a middle dot, a separating colon,and a sort of apostrophe can be seen. No script-specific punctuation is used in Old Permic.

7. Ordering. Alphabetical ordering is a little bit erratic in the sources, especially toward the end of thealphabet. This proposal follows the order 1โ€“34 given in Lytkin 1952, where 1โ€“24 are primary letters and25โ€“34 are supplementary (see Figure 1 below). Four characters are considered to be subordinate in theorder to another character. Thus Lytkin gives U+10365 ๐ฅ SHOOI the value 22 and evidently U+10366 ๐ฆSHCHOOI is โ€œ22aโ€; similarly we have U+10367 ๐ฏ YRY โ€œ23โ€ and U+10368 ๐จ YERU โ€œ23aโ€, U+10369 ๐ฉ O

โ€œ24โ€ and U+1036A ๐ช OO โ€œ24aโ€, and U+10374 ๐ด YA โ€œ34โ€ and U+10375 ๐ต IA โ€œ34aโ€. These โ€œsubordinateโ€letters correspond to U+0449 ั‰ SHCHA, U+044B ั‹ YERU, U+0461 ัก OMEGA, and U+0467 ัง LITTLE YUS.They should not be considered subordinate in ordering, however. The ordering of Old Permic letters is asin the code chart, with each letter having a primary weight.

8. Unicode Character Properties10350;OLD PERMIC LETTER AN;Lo;0;L;;;;;N;;;;;10351;OLD PERMIC LETTER BUR;Lo;0;L;;;;;N;;;;;10352;OLD PERMIC LETTER GAI;Lo;0;L;;;;;N;;;;;10353;OLD PERMIC LETTER DOI;Lo;0;L;;;;;N;;;;;10354;OLD PERMIC LETTER E;Lo;0;L;;;;;N;;;;;10355;OLD PERMIC LETTER ZHOI;Lo;0;L;;;;;N;;;;;10356;OLD PERMIC LETTER DZHOI;Lo;0;L;;;;;N;;;;;10357;OLD PERMIC LETTER ZATA;Lo;0;L;;;;;N;;;;;10358;OLD PERMIC LETTER DZITA;Lo;0;L;;;;;N;;;;;10359;OLD PERMIC LETTER I;Lo;0;L;;;;;N;;;;;1035A;OLD PERMIC LETTER KOKE;Lo;0;L;;;;;N;;;;;1035B;OLD PERMIC LETTER LEI;Lo;0;L;;;;;N;;;;;1035C;OLD PERMIC LETTER MENOE;Lo;0;L;;;;;N;;;;;1035D;OLD PERMIC LETTER NENOE;Lo;0;L;;;;;N;;;;;1035E;OLD PERMIC LETTER VOOI;Lo;0;L;;;;;N;;;;;1035F;OLD PERMIC LETTER PEEI;Lo;0;L;;;;;N;;;;;10360;OLD PERMIC LETTER REI;Lo;0;L;;;;;N;;;;;10361;OLD PERMIC LETTER SII;Lo;0;L;;;;;N;;;;;10362;OLD PERMIC LETTER TAI;Lo;0;L;;;;;N;;;;;10363;OLD PERMIC LETTER U;Lo;0;L;;;;;N;;;;;10364;OLD PERMIC LETTER CHERY;Lo;0;L;;;;;N;;;;;10365;OLD PERMIC LETTER SHOOI;Lo;0;L;;;;;N;;;;;10366;OLD PERMIC LETTER SHCHOOI;Lo;0;L;;;;;N;;;;;10367;OLD PERMIC LETTER YRY;Lo;0;L;;;;;N;;;;;10368;OLD PERMIC LETTER YERU;Lo;0;L;;;;;N;;;;;10369;OLD PERMIC LETTER O;Lo;0;L;;;;;N;;;;;1036A;OLD PERMIC LETTER OO;Lo;0;L;;;;;N;;;;;1036B;OLD PERMIC LETTER EF;Lo;0;L;;;;;N;;;;;1036C;OLD PERMIC LETTER HA;Lo;0;L;;;;;N;;;;;1036D;OLD PERMIC LETTER TSIU;Lo;0;L;;;;;N;;;;;1036E;OLD PERMIC LETTER VER;Lo;0;L;;;;;N;;;;;1036F;OLD PERMIC LETTER YER;Lo;0;L;;;;;N;;;;;10370;OLD PERMIC LETTER YERI;Lo;0;L;;;;;N;;;;;10371;OLD PERMIC LETTER YAT;Lo;0;L;;;;;N;;;;;10372;OLD PERMIC LETTER IE;Lo;0;L;;;;;N;;;;;10373;OLD PERMIC LETTER YU;Lo;0;L;;;;;N;;;;;10374;OLD PERMIC LETTER YA;Lo;0;L;;;;;N;;;;;10374;OLD PERMIC LETTER IA;Lo;0;L;;;;;N;;;;;

9. Bibliography(รˆpifanij Premudaryj) ะญะฟะธั„ะฐะฝะธะน ะŸั€ะตะผัƒะดั€ั‹ะน. 1862. โ€œะŸะพะฒะตัั‚ัŒ ะพ ะกั‚ะตะฟะฐะฝะต, ัะฟะธัะบะพะฟะต ะŸะตั€ะผัะบะพะผโ€, in

ะŸะฐะผัั‚ะฝะธะบะธ ัั‚ะฐั€ะธะฝะฝะพะน ั€ัƒััะบะพะน ะปะธั‚ะตั€ะฐั‚ัƒั€ั‹ 4: 119-71. ะก. ะŸะตั‚ะตั€ะฑัƒั€ะณ.Haarmann, Harald. 1990. Universalgeschichte der Schrift. Frankfurt/Main; New York: Campus. ISBN 3-

593-34346-02

Page 3: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

Hรคmรคlรคinen, A. 1909. โ€œรˆpifanij Viisan tiedot Pyhรคstรค Tapanista ja syrjรครคneistรคโ€, in Suomi 4,6,2: 1-27.(Tr. of ะญะฟะธั„ะฐะฝะธะน ะŸั€ะตะผัƒะดั€ั‹. 1862.)

(Kuznecova) ะšัƒะทะฝะตั†ะพะฒะฐ, ะ—. ะ˜. 1958. โ€œะžะฑะทะพั€ ะฟะฐะผัั‚ะฝะธะบะพะฒ ะบะพะผะธ ะฟะธััŒะผะตะฝะฝะพัั‚ะธ XVIII ะฒ.โ€, inะ˜ัั‚ะพั€ะธะบะพ-ั„ะธะปะพะปะพะณะธั‡ะตัะบะธะน ัะฑะพั€ะฝะธะบ 4: 213-41. ะกั‹ะบั‚ั‹ะฒะบะฐั€.

Lepechin, I. I. 1790. โ€œHistoire de lโ€™Acadรฉmie Imperiale des Sciences annรฉe MDCCLXXXVIII [1787]โ€,in Nova Acta Acadamiae Scientiarum Imperiales Petropolitanae 6: 1-34. Petropoli.

(Lytkin) ะ›ั‹ั‚ะบะธะฝ, ะ’. ะ˜. 1952. ะ”ั€ะตะฒะฝะตะฟะตั€ะผัะบะธะน ัะทั‹ะบ: ั‡ั‚ะตะฝะธะต ั‚ะตะบัั‚ะพะฒ, ะณั€ะฐะผะผะฐั‚ะธะบะฐ, ัะปะพะฒะฐั€ัŒ.ะœะพัะบะฒะฐ.

(Lytkin) ะ›ั‹ั‚ะบะธะฝ, ะ’. ะ˜. 1957. ะ˜ัั‚ะพั€ะธั‡ะตัะบะฐั ะณั€ะฐะผะผะฐั‚ะธะบะฐ ะบะพะผะธ ัะทั‹ะบะฐ. 1: ะ’ะฒะตะดะตะฝะธะต. ะคะพะฝะตั‚ะธะบะฐ.ะกั‹ะบั‚ั‹ะฒะบะฐั€.

Lytkin, V. I. 1962. โ€œรœber die Geschichte der syrjรคnischen Literaturโ€, in Ural-Altรคische Jahrbรผcher 34:214-22.

(Lytkin) ะ›ั‹ั‚ะบะธะฝ, ะ“. ะก. 1889. ะ—ั‹ั€ัะฝัะบะธะน ะšั€ะฐะน ะฟั€ะธ ัะฟะธัะบะพะฟะฐั… ะฟะตั€ะผัะบะธั… ะธ ะทั‹ั€ัะฝัะบะธะน ัะทั‹ะบ. ะกะฐะฝะบั‚ -ะฟะตั‚ะตั€ ะฑัƒั€ะณ.

Pillu, Kaljo. 1990. Tallinna Kunstiรผlikooli kรผmme soome-ugri uurimisreisi. Tallinn: Olion. ISBN 5-450-00582-2

(Sidorov) ะกะธะดะพั€ะพะฒ, ะ. ะก. 1962. โ€œะะพะฒั‹ะต ะฟะฐะผัั‚ะฝะธะบะธ ะดั€ะตะฒะฝะตะบะพะผะธ ะฟะธััŒะผะตะฝะฝะพัั‚ะธโ€. Ed. ะ’. ะ˜. ะ›ั‹ั‚ะบะธะฝ. Inะ’ะพะฟั€ะพัั‹ 1962: 178-211.

Stipa, Gรผnter Johannes. 1961. โ€œDer Ursprung der permischen Schriftโ€, in Zeitschrift der DeutschenMorgenlรคndischen Gesellschaft 110: 342-64. Wiesbaden.

Stipa, Gรผnter Johannes. 1963. โ€œDer Ursprung der permischen Schriftโ€, in Congressus internationalisFenno-ugristarum Budapestini habitus 20-24. Pp. 281-87. Budapest.

Stipa, Gรผnter Johannes. 1990. Finnishc-ugrische Sprachforschung von der Renaissance bis zumNeupositivismus. (Mรฉmoires de la Sociรฉtรฉ Finno-ougrienne; 206) Helsinki: Suomalais-ugrilainen Seura.ISBN 951-9403-35-3

10. Acknowledgements. This project was made possible in part by a grant from the U.S. NationalEndowment for the Humanities, which funded the Universal Scripts Project (part of the Script EncodingInitiative at UC Berkeley) in respect of the Old Permic encoding. Any views, findings, conclusions orrecommendations expressed in this publication do not necessarily reflect those of the NationalEndowment for the Humanities.

3

Page 4: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

Letters

10350 ๐ OLD PERMIC LETTER AN

10351 ๐‘ OLD PERMIC LETTER BUR

10352 ๐’ OLD PERMIC LETTER GAI

10353 ๐“ OLD PERMIC LETTER DOI

10354 ๐” OLD PERMIC LETTER E

10355 ๐• OLD PERMIC LETTER ZHOI

10356 ๐– OLD PERMIC LETTER DZHOI

10357 ๐— OLD PERMIC LETTER ZATA

10358 ๐˜ OLD PERMIC LETTER DZITA

10359 ๐™ OLD PERMIC LETTER I

1035A ๐š OLD PERMIC LETTER KOKE

1035B ๐› OLD PERMIC LETTER LEI

1035C ๐œ OLD PERMIC LETTER MENOE

1035D ๐ OLD PERMIC LETTER NENOE

1035E ๐ž OLD PERMIC LETTER VOOI

1035F ๐Ÿ OLD PERMIC LETTER PEEI

10360 ๐  OLD PERMIC LETTER REI

10361 ๐ก OLD PERMIC LETTER SII

10362 ๐ข OLD PERMIC LETTER TAI

10363 ๐ฃ OLD PERMIC LETTER U

10364 ๐ค OLD PERMIC LETTER CHERY

10365 ๐ฅ OLD PERMIC LETTER SHOOI

10366 ๐ฆ OLD PERMIC LETTER SHCHOOI

10367 ๐ง OLD PERMIC LETTER YRY

10368 ๐จ OLD PERMIC LETTER YERU

10369 ๐ฉ OLD PERMIC LETTER O

1036A ๐ช OLD PERMIC LETTER OO

1036B ๐ซ OLD PERMIC LETTER EF

1036C ๐ฌ OLD PERMIC LETTER HA

1036D ๐ญ OLD PERMIC LETTER TSIU

1036E ๐ฎ OLD PERMIC LETTER VER

1036F ๐ฏ OLD PERMIC LETTER YER

10370 ๐ฐ OLD PERMIC LETTER YERI

10371 ๐ฑ OLD PERMIC LETTER YAT

10372 ๐ฒ OLD PERMIC LETTER IE

10373 ๐ณ OLD PERMIC LETTER YU

10374 ๐ด OLD PERMIC LETTER YA

10375 ๐ต OLD PERMIC LETTER IA

Printed using UniBookโ„ข

(http://www.unicode.org/unibook/)

Date: 2012-01-244

1037FOld Permic10350

1035 1036 1037

๐

๐‘

๐’

๐“

๐”

๐•

๐–

๐—

๐˜

๐™

๐š

๐›

๐œ

๐

๐ž

๐Ÿ

๐ 

๐ก

๐ข

๐ฃ

๐ค

๐ฅ

๐ฆ

๐ง

๐จ

๐ฉ

๐ช

๐ซ

๐ฌ

๐ญ

๐ฎ

๐ฏ

๐ฐ

๐ฑ

๐ฒ

๐ณ

๐ด

๐ต

10350

10351

10352

10353

10354

10355

10356

10357

10358

10359

1035A

1035B

1035C

1035D

1035E

1035F

10360

10361

10362

10363

10364

10365

10366

10367

10368

10369

1036A

1036B

1036C

1036D

1036E

1036F

10370

10371

10372

10373

10374

10375

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Page 5: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

5

10. Figures.

Figure 1a. The first part of a table from Lytkin 1952, showing glyph variants of Old Permic characters.Note columns 1 and 2 which give ordering of primary (1โ€“24) and secondary (25โ€“34) characters.

Page 6: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

Figure 1b. The second part of a table from Lytkin 1952, showing glyph variants of Old Permiccharacters. Note that the first three columns from the Nomokanon 1510 are repeated from Part 1 of the

table.

6

Page 7: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

Figure 2. Old Permic inscription on the icon โ€œTroitsyโ€. The text is the beginning of a story aboutAbraham and Sarah (Genesis 18:1โ€“8), from Lytkin 1952.

7

Page 8: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

Figure 3. ะกะบะฒะพั€ั†ะพะฒัะบะธะนโ€™s transcription of the Old Permic inscription on the icon โ€œTroitsyโ€ from Lytkin1952. Linebreaks here were to get a 40cm by 5cm text to fit on an A4 sheet.

8

Page 9: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

Figure 4. Lytkinโ€™s Cyrillic transcription of the Old Permic inscription on the icon โ€œTroitsyโ€. On the left(โ€œour reproductionโ€) is the normalized Komi text; on the right (โ€œour readingโ€) is the literal transcription,

with additions and corrections in parentheses.

9

Page 10: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

๐ก๐ฆ[๐ฅโ‰ ] ๐ฎ๐ฑ๐›๐œ๐ฉ๐ก[๐ฅ โ‰ ]๐™๐ก ๐™๐ฆ๐ ๐š๐ฃ๐ ๐š๐œ๐›๐ฃ ๐œ๐š๐œ๐œ๐ ๐™ ๐ข๐ฃ๐Ÿ๐ฃ๐“๐ฉ๐ ๐จ๐ ๐Ÿ๐ฃ๐š๐จ๐ฅ๐›๐ฃ ๐ก๐จ๐›๐ฃ ๐ฉร†[๐˜๐ฉ]๐ก ๐ฃ[๐ช]๐˜๐จ๐ ๐ค๐ช๐œ๐›๐ฉ๐ ๐ก๐จ๐›๐ฉ๐๐›๐ฃ๐๐ฅ๐ฉ[๐ ๐ฉ] ๐ฎ๐™๐˜๐™๐ก ๐šโ‰ ๐ฃ๐ ๐š๐š ร†๐œ ๐šร†[๐ก] [๐ฅโ‰ ]๐™๐[๐™]๐š[๐ก๐ฉ๐] ๐™ร† ๐š ร†[๐“]๐˜๐™๐ก ๐šร†๐ก [๐ฅโ‰ ]-๐™๐๐œ๐ฉ๐ ๐ฃ๐ช๐˜๐˜๐š ๏ฟฝ๏ฟฝ ๐ฎ๐ฆ๐ ๐ฉ๐ก ๐ก๐ฃ๐›๐š๐›๐ฉ [๐œ]๐š๐ข๐จ๐“ ๐ก๐จ๐›๐จ๐ฅ ๐š โ‰ ๐“๐˜๐จ๐œ๐จ๐ฅ ๐Ÿ๐จ๐ฅ๐™๐™๐ก ๐จ๐ช๐˜๐˜๐š ๐๐จ๐›๐ฃ ๐šโ‰ ๐ก ๐ค๐ช๐œ๐›๐ฉ๐ ๐ฉร†๐จร†๐ก ๐“[๐ฉ]๐ ๐จ๐ฅ ๐™ ๐™๐ฎ๐ -๐œ๐จ๐ ๐ข๐™๐ก ๐œ๐ฃ๐ฉ[๐˜] ๐™ ร˜ ๐ฅ๐™๐™๐ก ๏ฟฝ๏ฟฝ๐— ๐™๐ฆ๐๐œ๐ฉ [๐œ๐š]๐ ๐š ๐ฉ ๐šร†๐“๐˜๐™๐œ ๐š๐ฉ ๐œ-๐ฆ๐“๐š๐› [๐ก๐ฉร†] ๐ข๐ฆ๐จ๐ช๐˜๐š๐“ ๐šร†[๐ก] ๐ฃ๐ฆ๐ ๐™๐“ ๐ฃ๐จ๐›๐ข๐™ ๐™ ร†๐ ๐œ๐ฃ๐ ๐œ๐ฆ๐“๐ฃ๐š๐™๐ฅ๐š[๐ก] ๐ฃ[๐š] [๐œ๐จ๐ฅ๐š]๐จ๐๐จ๐ก ๐Ÿ[๐ช]๐“๐ฉ๐ก ๐ข๐™๐™๐š๐. ๐™ร˜ ๐ก๐š๐™โ‰ [๐š]๐ฉ๐“[๐ค๐š]๐“๐-๐จ๐“ [๐ข]๐ฃ๐Ÿ๐ฃ ๐“๐จ[๐๐จ]๐ [๐™] ๐ฃ[๐š]๐™๐ฅ๐š๐ก๐๐™๐ก ๐[๐š]๐๐™๐š๐ก ๐œ[๐ฆ]๐“ ๐ฅ๐ช๐™๐š๐“-๐๐จ๐“ ๐ก๐ฆ๐ฅ ๐œ๐ฃ๐๐š๐“[๐]๐จ๐“ ๐ข๐ฃ ๐ข๐ฃ๐™๐ฉ [๐š๐ช]๐“๐ฃ๐“๐›๐จ๐ฅ โ‰  ๐š๐ฑ๐ช๐™๐“-๐[๐จ]๐“ ๐š[๐ก] ๐ฃ๐ฆ๐  ๐ฉ๐ ๐“๐ฉ ๐™ร† ๐ฅ๐™๐™ร†๐ก๐๐จ๐ก ๐ก๐™๐˜ ๐ฃ๐–๐š๐› ๐š๐ฃ๐˜[๐™๐š] [๐ฅ]๐™[๐™ร†]๐ ๐™ ๐Ÿ-๐จ[๐ฅ]๐™๐™๐ก ๐šโ‰ ๐ฃ๐ ๐š โ‰ ๐œ ๐ก๐š๐ ๐š ๐ฉ๐ ๐“๐ฉ [๐คโ‰ ]๐ช๐œ๐™๐ฉ ๐™ ๐ฅ๐™๐™ร†๐ก ๐ข๐ฆ๐ ๐œ๐ฉ๐“-๐ฅ๐™ ๐›๐ฉ๐™ ๐ ๐ข๐š๐ฅ๐ข๐™ [๐ก๐ฉ]๐ก๐ข๐ฉ๐œ ๐Ÿ๐จ๐ฅโ‰  ๐™ร† ๐ฃ๐–๐š๐› ๏ฟฝ๏ฟฝ ๐š๐ฉ๐›๐“-๐ฉ๐œ ๐โ‰ ๐š๐ ๐Ÿ๐จ๐ฅ๐™๐™ ๐šร†๐ฃ๐ ๐š๐šร†๐œ ๐œ๐ฉ๐ก๐™๐ฉ๐ก ๐ฉร†๐ ๐“๐ฉ [๐š]๐ฃ๐ข๐™๐ก๐ค๐ฉ๐› ๐ฉร†๐ฅ๐ฉร†๐ก ๐œ๐ฃ๐ ๐ฉ๐ก ๐™ร˜ ๐ฅ๐ฆ๐ข๐™๐ก ๐ก๐™๐™๐ฉ ๐ฃ๐ฆ๐ ๐›๐ฃ ๐™ร† ๐ ๐ฑ๐๐จ-๐“ ๐“๐š๐ฅ๐ข๐™๐ก ๐™ร† ๐œ๐ฉ[๐ฅโ‰ ]๐ข๐™[๐ก] ๐ฃ[๐จ๐™] ๐œ[๐š] ๐™๐ฉ๐› ๐š๐ฃ๐š ๐š[๐ช]๐“๐ฉ๐ก[๐ก] ๐“[๐š๐ฅ๐ข]๐™๐ก ๐™ ๐Ÿ๐ฃ๐๐ข๐™๐ก ๐๐จ ๐ฃ๐ช๐˜๐จ๐ ๐™ร† ๐ฅ๐ช๐™๐ก๐๐จ๐ก[๐š๐ค๐จ๐ก] ๐ฎ[๐™๐˜]๐™๐ก ๐๐จ ๐จ๐ช๐˜๐จ๐ ๐ข๐ฃ๐Ÿ๐ฃ ๐“[๐จ]๐[๐จ๐].

Figure 5. My typeset version of Lytkinโ€™s Cyrillic literal transcription of the Old Permic inscription on theicon โ€œTroitsyโ€. The linebreaks in the text above are the same as those of the text in Figure 3 (hyphenshave been added at line-final word breaks). Diacritical marks and brackets are given in red here. Note theuse of COMBINING DOT ABOVE, COMBINING DIAERESIS, COMBINING COMMA ABOVE, and CYRILLIC COMBINING

TITLO.

10

Page 11: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

18 ๐ก๐ฆ๐ฅ โ‰  ๐ฎ๐ฑ๐›๐œ๐ฉ๐ก๐ฅ โ‰ ๐ฉ๐ก ๐™ ยจ๐ฆ๐ ๐š๐ฎ๐ ๐š๐œ๐›๐ฃ ๐œ๐š๐œ๐œ๐ ๐™ ๐ข๐ฃ๐Ÿ๐ฃ ๐“๐ฉ๐ ๐จ๐,๐Ÿ๐ฃ๐š๐จ๐ฅ โ‰ ๐›๐ฃ ๐ก๐จ๐›๐ฃ ๐ฉร†๐˜๐ฉร†๐ก ๐ฃ๐ช๐˜๐จ๐ ๐คโ‰ ๐ช๐œ๐›๐ฉ ร†๐ ๐ก๐จ๐›๐ฉร†๐ ๐›๐ฃ๐๐ฅ๐ฉร†๐ ๐ฉร†.

2๐ฎ๐™๐˜๐™๐ก ๐š๐ฎ๐ ๐š๐š๐œ ๐š๐ก ๐ฅ โ‰ ๐™๐๐™ ยจ๐š๐ก๐ฉ ร†๐ ๐™ ๐š๐“๐˜๐™๐ก ๐š๐ก ๐ฅ โ‰ ๐™๐๐œ๐ฉ ร†๐: ๐ฎ๐ช๐˜๐˜๐š 3๐ฎ๐ฆ๐ ๐ฉ ร†๐ก ๐ก๐ฃ๐›๐š๐›๐ฉ ร† ๐œ๐š๐“๐จ๐“ ๐ก๐จ๐›๐จ๐ฅ โ‰ ; ๐š๐“๐˜๐จ๐œ๐จ๐ฅ โ‰  ๐Ÿ๐จ๐ฅ๐™ ยจ๐™๐ก ๐ฎ๐ช๐˜๐˜๐š ๐๐จ๐›๐ฃ๐š๐ก ๐คโ‰ ๐ช๐œ๐›๐ฉ ร†๐ ๐ฉ ร†๐™โ‰ ๐จ๐ก ๐“๐ฉ๐ ๐จ๐ฅโ‰  ๐™ ๐™ ยจ๐ฃ๐ ๐œ๐จ๐ ๐ข๐™๐ก ๐œ๐ฃ๐ฉร†๐˜.

3๐™ ๐ฅ๐™๐™๐ก: ห๐œ๐ฆ (๐œ๐ฆ๐—๐ฉ ร†๐ก๐ฉ ร†) ๐™ ยจ๐ฆ๐๐œ๐ฉ ร† ๐œ๐š๐ ๐š-๐ฉ ร† ๐š๐“๐˜๐™๐œ ๐š๐ฉ ร† ๐œ๐ฆ๐“๐š๐›๐ก๐ฉ ร†๐ข๐ฆ ๐ฎ๐ช๐˜๐š๐“ ๐š๐ก ๐ฎ๐ฆ๐ ๐™๐“ ๐ฎ๐จ๐›๐ข๐™ ๐™๐ ๐œ๐ฃ๐.

4ห๐œ๐ฆ๐“ ๐ฎ๐š๐™ ยจ๐ฅ โ‰ ๐š๐ก ๐ฎ๐š ๐œ๐จ๐ฅ โ‰ ๐š๐จ๐๐จ๐ก ๐Ÿ๐ช๐“๐ฉ ร†๐ก ๐ข๐™๐™ ยจ๐š๐ ๐™๐ก๐š๐™ ยจ๐š๐ฉ ร†๐“๐ค โ‰ ๐š๐“๐๐จ๐“ ๐ข๐ฃ๐Ÿ๐ฃ ๐“๐จ๐๐จ๐ ๐™ ๐ฎ๐š๐™ ยจ๐ฅ โ‰ ๐š๐ก๐๐™๐ก ๐ โ‰ ๐š๐ โ‰ ๐™ ยจ๐š๐ก, ๐œ๐ฆ๐“๐ฅ โ‰ ๐ช๐™ ยจ๐š๐“๐๐จ๐“ ๐ก๐ฆ๐ฅ โ‰  ๐œ๐ฃ๐๐š๐“๐๐จ๐“ ๐ข๐ฃ ๐ข๐ฃ๐™ ยจ๐ฉ ร† , ๐š๐ช๐“๐ฃ๐“๐›๐จ๐ฅ โ‰ ๐š๐ฑ๐ช๐™๐“๐๐จ๐“ ๐š๐ก ๐ฎ๐ฆ๐  ๐ฉ๐ ๐“๐ฉร†.ห› ๐™ ๐ฅ๐™๐™๐ก๐๐จ๐ก: ห๐ก๐™๐˜ ๐ฃ๐–๐š๐›, ๐š๐ฃ๐˜๐™๐š ๐ฅ๐™๐™๐.ห›

6๐™ ๐Ÿ๐จ๐ฅ๐™ ยจ๐™๐ก ๐š๐ฎ๐ ๐š๐œ ๐ก๐š๐ ๐š ๐ฉ๐ ๐“๐ฉ ร† ๐ค โ‰ ๐ช๐œ๐™ ยจ๐ฉ ร† ๐™ ๐ฅ๐™๐™๐ก: ห๐ข๐ฆ๐ ๐œ๐ฉ ร†๐“๐ฅ โ‰ ๐™,๐›๐ฉ๐™ยจ 3 ๐ข๐š๐ฅโ‰ ๐ข๐™ ๐ก๐ฉร†๐ก๐ข๐ฉร†๐œ ๐Ÿ๐จ๐ฅโ‰  ๐™ ๐ฃ๐–๐š๐› 3 ๐š๐ฉร†๐›๐“๐ฉ ร†๐œ ๐โ‰ ๐š๐ โ‰ !ห›

7๐Ÿ๐จ๐ฅ๐™ ยจ๐™ ๐š๐ฎ๐ ๐š๐š๐œ ๐œ๐ฉ ร†๐ก๐™ ยจ๐ฉ ร†๐ก ๐ฉ๐ ๐“๐ฉ ร†, ๐š๐ฃ๐ข๐™๐ก ๐ค โ‰ ๐ฉ ร†๐› ๐ฉ ร†๐ฅ๐ฉ ร†๐ก, ๐œ๐ฃ๐ ๐ฉ ร†๐ก, ๐™๐ฅ โ‰ ๐ฆ๐ข๐™๐ก ๐ก๐™๏ฟฝ๏ฟฝ๐ฉร† ๐ฎ๐ฆ๐ ๐›๐จ, ๐™ ๐ ๐ฑ๐๐จ๐“ ๐“๐š๐ฅ โ‰ ๐ข๐™๐ก.

8๐™ ๐œ๐ฉ๐ฅ โ‰ ๐ข๐™๐ก ๐ฎ๐จ๐™ ยจ, ๐œ๐š, ๐™ ยจ๐ฉ ร†๐›, ๐š๐ฃ๐š, ๐š๐ช๐“๐ฉ ร†๐ก ๐“๐š๐ฅ โ‰ ๐ข๐™๐ก ๐™ ๐Ÿ๐ฃ๐๐ข๐™๐ก ๐๐จ๐ฃ๐ช๐˜๐จ๐; ๐™ ๐ฅโ‰ ๐ช๐™๐ก๐๐จ๐ก; ๐š๐คโ‰ ๐จ๐ก ๐ฎ๐™๐˜๐™๐ก ๐๐จ ๐ฎ๐ช๐˜๐จ๐ ๐ข๐ฃ๐Ÿ๐ฃ ๐“๐จ๐๐จ๐.Figure 6. My typeset version of Lytkinโ€™s Cyrillic normalized transcription of the Old Permic inscriptionon the icon โ€œTroitsyโ€. It is possible that this is the first time that a text in this script has ever been typesetwith fonts. Linebreaks, punctuation, and numbering follow normal biblical editorial practice. Diacriticalmarks and punctuation are given in red here. Note the use of COMBINING DOT ABOVE, COMBINING

DIAERESIS, COMBINING COMMA ABOVE, and CYRILLIC COMBINING TITLO.

11

Page 12: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

A. Administrative1. TitleFinal proposal for encoding the Caucasian Albanian script in the SMP of the UCS2. Requesterโ€™s nameUC Berkeley Script Encoding Initiative (Universal Scripts Project) Author: Michael Everson3. Requester type (Member body/Liaison/Individual contribution)Liaison contribution.4. Submission date2012-01-245. Requesterโ€™s reference (if applicable)6. Choose one of the following:6a. This is a complete proposalYes.6b. More information will be provided laterNo.

B. Technical โ€“ General1. Choose one of the following:1a. This proposal is for a new script (set of characters)Yes.1b. Proposed name of scriptOld Permic.1c. The proposal is for addition of character(s) to an existing blockNo.1d. Name of the existing block2. Number of characters in proposal38.3. Proposed category (A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attestedextinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols)Category E.4a. Is a repertoire including character names provided?Yes.4b. If YES, are the names in accordance with the โ€œcharacter naming guidelinesโ€ in Annex L of P&P document?Yes.4c. Are the character shapes attached in a legible form suitable for review?Yes.5a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard?Michael Everson.5b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.).Michael Everson.6a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided?Yes.6b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached?Yes.7. Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching,indexing, transliteration etc. (if yes please enclose information)?Yes.8. Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist incorrect understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casinginformation, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combiningbehaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibilityequivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such informationon other scripts. Also see Unicode Character Database http://www.unicode.org/Public/UNIDATA/ UnicodeCharacterDatabase.html andassociated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in theUnicode Standard.See above.

C. Technical โ€“ Justification1. Has this proposal for addition of character(s) been submitted before? If YES, explain.No.2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, otherexperts, etc.)?Yes. 2b. If YES, with whom?There has been contact with a major centre for Uralic studies, Kotimaisten kielten tutkimuskeskus, Helsinki. There has been nocontact in the Komi Republic. An earlier version of the proposal was circulated on [email protected]. The principle authors ofsome of the materials used to prepare the proposal (chiefly Lytkin 1952 and Stipa 1961), are deceased.

12

Page 13: ISO/IEC JTC1/SC2/WG2 N4177 L2/12-025 - Unicodeย ยท and BUR (so ๐๐-๐‘๐ž๐ is equivalent to ฮฌฮปฯ†ฮฑ-ฮฒฮฎฯ„ฮฑ and ะฐะท-ะฑัƒะบะธ). โ€œOld Permicโ€ is consistent with other

2c. If YES, available relevant documents3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, orpublishing use) is included?Scholars and Komi language enthusiasts.4a. The context of use for the proposed characters (type of use; common or rare)Old Permic was used chiefly to write Komi texts, but a number of Russian texts are also found in Old Permic script.4b. Reference5a. Are the proposed characters in current use by the user community?Yes.5b. If YES, where?Marginal internet use (some fonts).6a. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP?No.6b. If YES, is a rationale provided?6c. If YES, reference7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)?Yes.8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence?No.8b. If YES, is a rationale for its inclusion provided?8c. If YES, reference9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposedcharacters?No.9b. If YES, is a rationale for its inclusion provided?9c. If YES, reference10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character?No; though this is a Greek-derived script like Coptic, Cyrillic, Armenian, Georgian, and Gothic.10b. If YES, is a rationale for its inclusion provided?Yes.10c. If YES, referenceยง2 above.11a. Does the proposal include use of combining characters and/or use of composite sequences?Yes.11b. If YES, is a rationale for such use provided?No.11c. If YES, reference11d. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?No. 11e. If YES, reference12a. Does the proposal contain characters with any special properties such as control function or similar semantics?No.12b. If YES, describe in detail (include attachment if necessary)13a. Does the proposal contain any Ideographic compatibility character(s)?No.13b. If YES, is the equivalent corresponding unified ideographic character(s) identified?

13