Chinese Domain Name Consortium (CDNC) Status Update for IDN CDNC Vincent WS Chen, TWNIC 2003.03.25.
Chinese Domain Name Consortium (CDNC)
Status Update for IDN
CDNC
Vincent WS Chen, TWNIC
2003.03.25
CDNC Status Update
  - CDNC Introduction
  - Major Activities on IDN/CDN
  - Status of Chinese Variant Table
  - Issues
  - Next Step
The Chinese Domain Name Consortium (CDNC)
  - Founded in May 2000 in Beijing by CNNIC, TWNIC, HKNIC, and MONIC
  - Current co-chairs
      - Prof. Hua-lin Qian (CNNIC)
      - Dr. Shian-shyong Tseng (TWNIC)
  - Members
      - CNNIC, TWNIC, HKNIC, MONIC
      - Other organizations with domain name (DN) technology and knowledge that are interested in CDN (Chinese domain names)
Other Related Organizations
  - JET: joint engineering task force for the IDN effort
      - Members: TWNIC, CNNIC, JPNIC, KRNIC
  - TWNIC IDN Task Force (since May 2000)
      - Mission: study CDN technology and implementation solutions
  - TW Variant Table Working Group (since Feb. 2002)
      - Mission: edit the Traditional Chinese variant table and coordinate the Simplified Chinese variant table with CNNIC for CDN
Major Activities on IDN/CDN
  - CDNC announcements on the current IDN solution, Feb. and June 2002
      - No solution for Chinese TC/SC equivalence in the IETF architecture
      - No complementary documents for the corresponding IDN registration and administration
  - Aug. 2002, 8th CDNC meeting in Yokohama
      - Joint development of CDN client software
      - Modification of the algorithm in the IDN Admin Guideline draft
      - Consideration of creating a Chinese variant table
  - Oct. 2002, 9th CDNC meeting in Shanghai
      - Discussion of Chinese character variant tables
      - Plan to merge the CN/TW tables into one table in Dec. 2002
      - Development of the client and its supporting system
      - Suggestions on options for the algorithm in the guideline draft
  - Feb. 2003, 10th CDNC meeting in Taipei
      - Merge the CN/TW tables
      - CDN demonstration system to gather user experience with the draft table
Major Activities on IDN/CDN
  - IETF drafts
      - Internationalized Domain Names and Unique Identifiers/Names (uname)
      - Traditional and Simplified Chinese Conversion (TSCONV)
      - Requirements of Chinese Domain Name (CDNREQ)
      - Phased Implementation for Internationalized Domain Names in Applications (PIIDNA)
      - Other drafts
  - IDN Admin Guideline
      - http://www1.ietf.org/mail-archive/ietf-announce/Current/msg18308.html
      - http://www1.ietf.org/mail-archive/ietf-announce/Current/msg20812.html
      - http://www.ietf.org/internet-drafts/draft-jseng-idn-admin-02.txt
Major Activities on IDN/CDN
  - TC/SC interchangeability is a must
  - Chinese variant table for CDN registration and administration
      - TW variant table
      - CN variant table
      - Chinese variant table (CVT): one table that includes the TW and CN variant tables
Status of Chinese Variant Table
  - Why TC/SC
      - In 1956 and 1964, mainland China published “A Complete Set of Simplified Chinese Characters”
      - It converted some Traditional Chinese characters to Simplified Chinese
      - Ex: 釘 -> 钉, 許 -> 许
  - TC/SC mapping
      - TC -> SC: 1-to-1 mapping
      - TC -> SC: many-to-1 mapping
          Ex: 發 -> 发, 髮 -> 发
      - TC -> SC: 1-to-many mapping
          Ex: 餘 -> 馀, 餘 -> 余
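The three mapping kinds can be told apart mechanically from a list of TC/SC pairs. The sketch below is only an illustration: it uses the six example pairs from the slide, and the function name `classify` and the labels it returns are assumptions made here, not part of any CDNC specification.

```python
from collections import defaultdict

# Traditional -> Simplified pairs taken from the slide examples only.
TC_TO_SC = [
    ("釘", "钉"),
    ("許", "许"),
    ("發", "发"),
    ("髮", "发"),
    ("餘", "馀"),
    ("餘", "余"),
]


def classify(pairs):
    """Label each (TC, SC) pair as 1-to-1, many-to-1, or 1-to-many."""
    forward = defaultdict(set)   # TC -> all SC forms it maps to
    backward = defaultdict(set)  # SC -> all TC forms that map to it
    for tc, sc in pairs:
        forward[tc].add(sc)
        backward[sc].add(tc)

    kinds = {}
    for tc, sc in pairs:
        if len(forward[tc]) > 1:
            kinds[(tc, sc)] = "1-to-many"
        elif len(backward[sc]) > 1:
            kinds[(tc, sc)] = "many-to-1"
        else:
            kinds[(tc, sc)] = "1-to-1"
    return kinds


KINDS = classify(TC_TO_SC)
for pair, kind in KINDS.items():
    print(pair, kind)
```

The many-to-1 and 1-to-many cases are the ones that prevent a simple character-by-character conversion and motivate a variant table.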
TW Variant Table Working Group
  - The 16th TWVT WG meeting was held on February 20, 2003
  - Members
      - Linguist from Academia Sinica
      - Computer expert from National Central University (NCU)
      - Linguist from Taipei Computer Association (TCA)
      - Linguist from CMEX (Foundation)
      - Linguist from the Directorate General of Budget, Accounting and Statistics, Executive Yuan, R.O.C. (DGBAS)
      - Linguist from IBM Taiwan
Status of TW Variant Table
  - Revised draft tables submitted to the Bureau of Standards, Taiwan
      - June 2002
      - October 2002
      - December 2002
  - Next revised draft table
      - Expected to be submitted this March
      - After review by the Bureau of Standards, Taiwan, the Chinese Variant Table would become a CNS standard
Status of CN Variant Table
  - Linguists invited as advisers to review the variant table created by the TW Variant Table Working Group
  - The first version of the CN variant table was finished last month
      - Based on the TWNIC table, with minor modifications to be adopted
      - Will be completed soon
CVT Structure and Definition
  - Character for registration (CR)
      - All the Chinese character code points that can be registered as CDN
  - TW corresponding character
      - T-source Chinese character code point that corresponds to the CR
  - CN corresponding character
      - G-source Chinese character code point that corresponds to the CR
  - Relevant character
      - All the variant code points related to the CR
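One way to picture a CVT entry is as a record with the four fields defined above. This is a minimal sketch, not the table format used by TWNIC or CNNIC: the class name `CvtRow`, the field names, and the lookup dictionary are assumptions, and the two rows are the 會/会 entries from the sample table shown later in the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CvtRow:
    """One hypothetical row of the Chinese variant table (CVT)."""
    cr: str      # character for registration
    twcc: tuple  # TW corresponding character(s), T-source
    cncc: tuple  # CN corresponding character(s), G-source
    rc: tuple    # relevant characters: all variants related to the CR


# Two rows from the sample table in the slides.
ROWS = [
    CvtRow(cr="會", twcc=("會",), cncc=("会",), rc=("會", "会")),
    CvtRow(cr="会", twcc=("會",), cncc=("会",), rc=("會", "会")),
]

# Index by CR so a registration system can look up each character of a name.
TABLE = {row.cr: row for row in ROWS}

print(TABLE["会"].twcc)  # TW corresponding character(s) of the simplified form
```

Keeping TWCC and CNCC as tuples leaves room for the 1-to-many cases, where a single CR has several corresponding characters.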
CVT Structure and Definition
  - Character for registration
      - CJK Han characters
          - 20902 CJK Unified Ideographs (4E00-9FA5)
          - 52 characters in Extension A
      - LDH (RFC 1035)
          - Alphabet ‘a-z’
          - Numeric ‘0-9’
          - Symbol ‘-’
  - 21017 code points will be included in this table
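The code point total can be cross-checked with a few lines of arithmetic. The range 4E00-9FA5 and the figures 52 and 21017 come from the slide. The LDH count is an inference made here: the slide lists only ‘a-z’, but the stated total is reached only if 63 LDH code points (both letter cases, the digits, and the hyphen) are counted.

```python
# CJK Unified Ideographs block as given on the slide (inclusive range).
CJK_UNIFIED = range(0x4E00, 0x9FA5 + 1)
EXTENSION_A_COUNT = 52  # figure from the slide

han_total = len(CJK_UNIFIED) + EXTENSION_A_COUNT

# Two possible readings of "LDH" (letters, digits, hyphen).
ldh_lowercase_only = 26 + 10 + 1   # a-z, 0-9, '-'
ldh_both_cases = 26 + 26 + 10 + 1  # assumption: A-Z counted as well

print(len(CJK_UNIFIED))                # size of the 4E00-9FA5 range
print(han_total + ldh_lowercase_only)  # total if only a-z is counted
print(han_total + ldh_both_cases)      # total if both cases are counted
```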
Sample of CVT

Character for registration (CR) | TW Corresponding character (TWCC) | CN Corresponding character (CNCC) | Relevant character(s) (RC) | Remarks
公 | 公 | 公 | 公   | Mapping to oneself
司 | 司 | 司 | 司   |
會 | 會 | 会 | 會会 | 1-n mapping; only CR, CC; 2 RC; consistent RCs
会 | 會 | 会 | 會会 |
議 | 議 | 议 | 議议 |
议 | 議 | 议 | 議议 |
Sample of TCVT

Character for registration (CR) | TW Corresponding character (TWCC) | CN Corresponding character (CNCC) | Relevant character(s) (RC) | Remarks
風 | 風       | 风 | 風风凨   | 1-1 mapping; consistent RCs
风 | 風       | 风 | 風风凨   |
凨 | 風       | 风 | 風风凨   |
台 | 台臺颱檯 | 台 | 台臺颱檯 | 1-n mapping; inconsistent RCs
臺 | 台臺     | 台 | 台臺     |
颱 | 颱       | 台 | 台颱     |
檯 | 檯       | 台 | 台檯     |
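The difference between the "consistent RCs" and "inconsistent RCs" remarks can be checked from the RC column alone. The sketch below reads "consistent" as meaning that every relevant character of a CR lists exactly the same RC set. That reading is an assumption drawn from the two samples, and `is_consistent` is a hypothetical helper name.

```python
# RC column of the sample table above: CR -> relevant characters.
RC = {
    "風": "風风凨",
    "风": "風风凨",
    "凨": "風风凨",
    "台": "台臺颱檯",
    "臺": "台臺",
    "颱": "台颱",
    "檯": "台檯",
}


def is_consistent(ch, rc=RC):
    """True when every relevant character of ch has the same RC set as ch."""
    own = set(rc[ch])
    return all(set(rc[other]) == own for other in own)


for ch in RC:
    print(ch, "consistent" if is_consistent(ch) else "inconsistent")
```

Under this reading the 風/风/凨 group is consistent, while 台 is not: 台 lists four relevant characters, but 臺, 颱, and 檯 each list only two.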
CVT Structure and Definition
  - TW corresponding character
      - Adopt the corresponding character selected by TW
      - 1 to 1: 19,029
      - 1 to many: 1,925
  - CN corresponding character
      - Adopt the corresponding character selected by CN
      - 1 to 1: 20,943
      - 1 to many: 11
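As a quick sanity check, both pairs of counts should add up to the number of Han characters for registration given earlier in the slides (20902 CJK Unified Ideographs plus 52 Extension A characters). The variable names below are illustrative only.

```python
# Han characters for registration, from the earlier slide.
HAN_CHARACTERS = 20902 + 52

# Mapping counts from this slide.
TWCC_COUNTS = {"1 to 1": 19029, "1 to many": 1925}
CNCC_COUNTS = {"1 to 1": 20943, "1 to many": 11}

tw_total = sum(TWCC_COUNTS.values())
cn_total = sum(CNCC_COUNTS.values())

# Share of characters that force a choice between several corresponding characters.
tw_share = TWCC_COUNTS["1 to many"] / HAN_CHARACTERS
cn_share = CNCC_COUNTS["1 to many"] / HAN_CHARACTERS

print(tw_total, cn_total, HAN_CHARACTERS)
print(f"TW 1-to-many share: {tw_share:.2%}, CN 1-to-many share: {cn_share:.2%}")
```

The 1-to-many cases are far more frequent on the TW side, which is why they are singled out for reduction in the next steps.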
CVT Structure and Definition
  - Relevant character
      - All the variant code points related to the character for registration
CVT Operation algorithm
  - Character for registration (CR)
      - Let the user register one CDN
      - Put the registered CDN in the zone file
  - TW corresponding character (TWCC)
      - If there is only one TWCC: put the TWCC DN in the zone file
      - If there is more than one TWCC: let the user choose one TWCC DN, then put it in the zone file

CVT Operation algorithm
  - CN corresponding character (CNCC)
      - If there is only one CNCC: put the CNCC DN in the zone file
      - If there is more than one CNCC: let the user choose one CNCC DN, then put it in the zone file
  - Relevant character (RC)
      - Reserve the RC DNs in the registration database
      - If an RC DN is activated (ARC DN): put the ARC DN in the zone file

CVT Operation algorithm
  - CDN registration package
      - CDN-CR (character for registration)
      - TWCC DN
      - CNCC DN
      - RC DN
      - ARC DN
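The operation algorithm above can be sketched as follows. This is an illustration under stated assumptions, not the CDNC implementation: the table holds only the sample rows from the slides, names are handled as bare labels without the .TW suffix, RC DNs are taken to be every per-character combination of relevant characters that is not already in the zone file, and `build_package`, `activate`, and the `choose` callback are hypothetical names.

```python
from itertools import product

# CR -> (TWCC, CNCC, RC), copied from the sample table rows on the slides.
CVT = {
    "公": ("公", "公", "公"),
    "司": ("司", "司", "司"),
    "會": ("會", "会", "會会"),
    "会": ("會", "会", "會会"),
    "議": ("議", "议", "議议"),
    "议": ("議", "议", "議议"),
    "風": ("風", "风", "風风凨"),
    "风": ("風", "风", "風风凨"),
    "凨": ("風", "风", "風风凨"),
    "台": ("台臺颱檯", "台", "台臺颱檯"),
    "臺": ("台臺", "台", "台臺"),
    "颱": ("颱", "台", "台颱"),
    "檯": ("檯", "台", "台檯"),
}
TWCC, CNCC, RC = 0, 1, 2


def variants(label, column):
    """Every label formed by picking one character per position from a column."""
    return ["".join(chars) for chars in product(*(CVT[ch][column] for ch in label))]


def pick(options, choose):
    """Use a single option directly; otherwise defer to the registrant's choice."""
    return options[0] if len(options) == 1 else choose(options)


def build_package(label, choose=lambda options: options[0]):
    """Build a hypothetical CDN registration package for one label."""
    twcc_dn = pick(variants(label, TWCC), choose)
    cncc_dn = pick(variants(label, CNCC), choose)
    zone = list(dict.fromkeys([label, twcc_dn, cncc_dn]))  # de-duplicate, keep order
    rc_dn = sorted(set(variants(label, RC)) - set(zone))   # reserved, not in zone
    return {"cr": label, "twcc": twcc_dn, "cncc": cncc_dn,
            "rc": rc_dn, "arc": [], "zone": zone}


def activate(package, rc_label):
    """Move a reserved RC DN into the zone file at the registrant's request."""
    if rc_label not in package["rc"]:
        raise ValueError(f"{rc_label} is not a reserved RC DN of this package")
    if rc_label not in package["arc"]:
        package["arc"].append(rc_label)
        package["zone"].append(rc_label)


package = build_package("會議公司")
print(package["zone"], package["rc"])
activate(package, "會议公司")
print(package["zone"])
```

A label containing a character that is missing from `CVT` raises `KeyError` here; a real registry would reject such a name with a proper error instead.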
Demonstration (公司.TW)
  - Register CDN: 公司.TW
  - TWCC DN: 公司.TW
  - CNCC DN: 公司.TW
  - RC DN: none
  - Zone file: only 1 CDN
      1. 公司.TW

Demonstration (會議公司.TW)
  - Register CDN: 會議公司.TW
  - TWCC DN: 會議公司.TW
  - CNCC DN: 会议公司.TW
  - RC DN
      1. 會议公司.TW
      2. 会議公司.TW
  - Zone file: 2 CDN
      1. 會議公司.TW
      2. 会议公司.TW

Demonstration (會議公司.TW)
  - RC DN
      1. 會议公司.TW
      2. 会議公司.TW
  - An RC DN can be activated by the registrant
      1. 會议公司.TW
  - Zone file: 3 CDN
      1. 會議公司.TW
      2. 会议公司.TW
      3. 會议公司.TW

Demonstration (颱風.TW)
  - Register CDN: 颱風.TW
  - TWCC DN: 颱風.TW
  - CNCC DN: 台风.TW
  - RC DN
      1. 台凨.TW
      2. 台風.TW
      3. 颱凨.TW
      4. 颱风.TW
  - Zone file: 2 CDN
      1. 颱風.TW
      2. 台风.TW

Demonstration (台風.TW)
  - Register CDN: 台風.TW
  - There are 4 TWCC DN
      1. 檯風.TW
      2. 臺風.TW
      3. 颱風.TW
      4. 台風.TW
  - Option process: let the user choose one TWCC DN
  - TWCC DN: 台風.TW

Demonstration (台風.TW)
  - CNCC DN: 台风.TW
  - RC DN
      1. 台凨.TW
      2. 檯凨.TW
      3. 檯風.TW
      4. 檯风.TW
      5. 臺凨.TW
      6. 臺風.TW
      7. 臺风.TW
      8. 颱風.TW
      9. 颱凨.TW
      10. 颱风.TW
  - Zone file: 2 CDN
      1. 台風.TW
      2. 台风.TW
Corresponding DN overlap
  - CDN 颱風.TW and 台風.TW
      - Have the same CNCC DN: 台风.TW
      - FCFS (first come, first served) principle: the second registrant will not get the CNCC DN
      - 4 overlapped RC DN
          1. 台凨.TW
          2. 台風.TW
          3. 颱凨.TW
          4. 颱风.TW

Relevant Character DN overlap
  - The resulting RC DNs depend on the registration order
  - Case 1
      - If 颱風.TW is registered first, it gets 4 RC DN
      - If 台風.TW is then registered, it gets 10 - 4 = 6 RC DN
  - Case 2
      - If 台風.TW is registered first, it gets 10 RC DN
      - If 颱風.TW is then registered, it gets no RC DN
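The order dependence in the two cases can be reproduced with a small first-come, first-served simulation. It is a sketch under assumptions: only the CNCC and RC columns for 台, 颱, and 風 from the sample table are used, labels omit the .TW suffix, and `register` is a hypothetical function. Like the slide, it only counts what each registrant would obtain and does not model whether the second name would be refused because it is already reserved.

```python
from itertools import product

# CNCC and RC columns for the characters involved (from the sample table).
CNCC = {"台": "台", "颱": "台", "風": "风"}
RC = {"台": "台臺颱檯", "颱": "台颱", "風": "風风凨"}


def register(label, taken):
    """Claim names first come, first served.

    Returns (got_cncc, granted_rc) and records every claimed name in `taken`.
    """
    cncc = "".join(CNCC[ch] for ch in label)
    got_cncc = cncc not in taken
    combos = {"".join(chars) for chars in product(*(RC[ch] for ch in label))}
    candidates = combos - {label, cncc}  # RC DNs exclude the zone-file names
    granted = candidates - taken         # names already claimed are lost
    taken |= {label, cncc} | granted
    return got_cncc, granted


# Case 1: 颱風 first, then 台風.
taken = set()
case1_first = register("颱風", taken)
case1_second = register("台風", taken)
print(len(case1_first[1]), len(case1_second[1]), case1_second[0])

# Case 2: 台風 first, then 颱風.
taken = set()
case2_first = register("台風", taken)
case2_second = register("颱風", taken)
print(len(case2_first[1]), len(case2_second[1]), case2_second[0])
```

In both orders the second registrant also loses the shared CNCC DN 台风, matching the FCFS rule stated for corresponding DNs.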
Case study: .tw CDN Statistics with CVT

CDN length   | Sample numbers | Total number of relevant characters | Multiple
2            | 9349           | 25532                               | 2.73
3            | 6102           | 28947                               | 4.74
4            | 16964          | 148183                              | 8.74
5            | 4131           | 56936                               | 13.78
6            | 5196           | 137571                              | 26.48
7            | 5074           | 247015                              | 48.68
8            | 2218           | 132665                              | 59.81
9            | 1445           | 191016                              | 132.19
10           | 2199           | 476125                              | 216.52
11           | 1291           | 242827                              | 188.09
12 and above | 2261           | 1418233                             | 627.26
Total        | 56230          | 3105050                             | 55.22
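The "multiple" column appears to be the total number of relevant characters divided by the number of samples for each length. The sketch below recomputes it from the figures in the table. That interpretation of the column, and the names used in the code, are assumptions.

```python
# (CDN length, sample numbers, total relevant characters) from the table above.
ROWS = [
    ("2", 9349, 25532),
    ("3", 6102, 28947),
    ("4", 16964, 148183),
    ("5", 4131, 56936),
    ("6", 5196, 137571),
    ("7", 5074, 247015),
    ("8", 2218, 132665),
    ("9", 1445, 191016),
    ("10", 2199, 476125),
    ("11", 1291, 242827),
    ("12 and above", 2261, 1418233),
]


def multiple(samples, relevant):
    """Average number of relevant-character names per registered name."""
    return relevant / samples


total_samples = sum(samples for _, samples, _ in ROWS)
total_relevant = sum(relevant for _, _, relevant in ROWS)
overall = multiple(total_samples, total_relevant)

for length, samples, relevant in ROWS:
    print(f"{length:>12}  {multiple(samples, relevant):8.2f}")
print(f"{'Total':>12}  {overall:8.2f}")
```

The ratio grows quickly with name length because each additional character multiplies the number of variant combinations.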
Too many Reserved CDN
  - Register CDN: 經濟部標準檢驗局
      - 972 RC DN
  - Register CDN: 經濟部標準檢驗局第一會議室
      - More than 19,000 RC DN
Issues
  - Technical issues
      - Reserved CDN overlap makes the registration system more complex
      - Package register/delete/cancel/activate/deactivate operations need to rely on a sophisticated registration system to maintain the correct CDN package
      - The amount of zone file data will increase after zone delegation
Issues
  - Policy issues
      - Package register/delete/cancel/activate/deactivate policies, especially for overlapped characters
      - Too many RCs: how to regulate them through registration policy to reduce database size and system workload
      - Backward compatibility before and after registration, including implementing the variant table
      - New DRP for the package concept, if possible
  - Variant table issue
      - Table version and its backward compatibility
Next Step
  - Tune the Chinese (CN/TW) variant table
      - Consider users' expectations
      - Reduce the RC size as much as possible
      - Reduce the TW corresponding character 1-to-many cases (1 to many: 1,925 -> 988)
  - Set up the rules for table use
  - Implement the table and develop an API for CDN registration and resolution
  - Work out a feasible registration policy
      - Adopt the IDN Admin Guideline
      - Sunrise period
Q & A