Thai IDN Table
Table Information:
Table ID: |
|
th |
Table Type: |
|
Language |
Version: |
|
2.2 |
Date Introduced: |
|
2020-01-13 |
Domain Extensions which support this table:
This IDN table is enabled for the following domain extensions:
Policy Statement:
Additional validation rules for Thai:
References for contextual rules
RFC 5891: Internationalized Domain Names in Applications (IDNA): Protocol
(proposed standard), http://tools.ietf.org/html/rfc5891
.SCB IDN Registration Policy by Thai Name Server Co.,Ltd,
https://www.nic.scb/docs/scb/SCB_IDN_Policy.pdf
Code point Description of rule/Reference
U+002D Label must neither start nor end with U+002D.
HYPHEN-MINUS Label must not have U+002D in both third and fourth
position. RFC 5891 (sec 4.2.3.1)
U+0E31, 0E34..0E3A, Not in initial position. RFC 5891 (4.2.3.2).
0E46..0E4D
Marks and Modifier
Letter
U+0E40..0E44 Must be directly followed by a code point of
Leading Vowels 0E01..0E2E.
U+0E31, U+0E34..U+0E37, Must be directly preceded by a code point of
U+0E47, U+0E4C..U+0E4D 0E01..0E2E.
Upper Vowel Symbol
and Upper Diacritics
U+0E48..0E4B Must be directly preceded by a code point of
Tone Marks 0E01..0E2E or U+0E31, U+0E34..U+0E37.
Must not be followed by a
code point of U+0E31, U+0E34..U+0E37, U+0E47,
U+0E4C..U+0E4D.
U+0E30, U+0E32 Must be directly preceded by a code point of
Following Vowels 0E01..0E2E or a code point of U+0E31,
U+0E34..U+0E37, U+0E47, U+0E4C, U+0E4D or a
code point of 0E48..0E4B.
U+0E45 Must be directly preceded by a code point of
U+0E24 or U+0E26
U+0E38..U+0E3A Must be directly preceded by a code point of
Lower Vowel Symbols 0E01..0E2E or a code point of U+0E31,
U+0E34..U+0E37, U+0E47, U+0E4C, U+0E4D or a
code point of 0E48..0E4B.
Additionally, Thai labels cannot contain both European and Thai digits.
A max of 40 allocatable labels are allowed.
Character Table:
Download th-2.2.txt