VISION 1995-01-02
INTERNATIONAL STANDARD
Information technology -8-bit single-byte coded graphic character sets
Part 11 : Latin/Thai character set
Contents
A Bibliography
ISO/IEC 8859 consists of several parts. Each part specifies a set of up to 191 graphic characters and the coded representation of these characters by means of a single 8-bit bytes. Each set is intended for use for a particular group of languages.
Control funtions may be used in conjunction with the coded characters specified in the parts of ISO/IEC 8859. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see 6).
This part of ISO/IEC 8859 specifies a set of 183 coded graphic characters identified as Latin/Thai character set.
This set of coded graphic characters is intended for use in data and text processing applications and may also be used for information interchange.
The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages :
English, Thai.
This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1.
This part of ISO/IEC 8859 may not be used in conjuction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3.
The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429.
NOTE - ISO/IEC 8859 is not intended for use with Telematic services sdefined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.
A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with this part of this International Standard if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause 6.
A device is in conformance with this International Standard if it conforms to the requirements of 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which contains the description specified in 2.2.1.
A device that conforms to this International Standard shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in 2.2.2 and 2.2.3.
An originating device shall allow its user to supply any sequence of characters from those specified in clause 6, and shall be capable of transmitting their coded representations within a CC-data-element.
A receiving device shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to clause 6, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those specified there, and can distinguish them from each other.
ISO/IEC 2022, Information technology - Character code structure and extensin techniques.
ISO/IEC 4873, Information technology - ISO 8-bit code for information interchange - Structure and rules for implementation.
ISO/IEC 8824, Information technology - Open systems interconnection - Abstract Syntax Notation One (ASN.1).
An ordered set of bits used for the representation of characters.
A bit string that is operated upon as a unit.
A member of a set of elements used for the organization, control or representation of data.
A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations.
A table showing the characters allocated to each bit combination in a code.
A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations.
Note - In ISO/IEC 8859 a single bit combination is used to represent each character.
A visual representation of a graphic character or of a control function.
That part of a code table identified by its column and row co-ordinates.
The bits of the bit combinations of the 8-bit code are identified by b8, b7, b6, b5, b4, b3, b2, and b1, where b8 is the highest-order, or most-significant bit and b1 is the lowest-order, or least-significant bit.
The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits:
| Bit | b8 | b7 | b6 | b5 | b4 | b3 | b2 | b1 |
| Weight | 128 | 64 | 32 | 16 | 8 | 4 | 2 | 1 |
Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b8 to b1 is as follows:
- xx is the number represented by b8, b7, b6, and b5 where these bits are given the weights 8, 4, 2, and 1, respectively;
- yy is the number represented by b4, b3, b2, and b1 where these bits are given the weights 8, 4, 2, and 1 respectively.
A 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and the rows are numbered 00 to 15.
The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number.
The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, is the same as that of the corresponding bit combination.
This part of ISO/IEC 8859 assigns a unique name to each graphic character. These names have been taken from ISO/IEC 10646-1 (E). It also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text.
The names chosen to denote graphic characters are intended to reflect their customary meaning. However, except for SPACE(SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY), this part of ISO/IEC 8859 does not define and does not restrict the meanings of graphic characters. Neither does it specify a particular style or font design for imaging graphic characters.
This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corresponding position of the code table.
A graphic character the visual representation of which consists of the absence of a graphic symbol.
A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented.
A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word.
This part of ISO/IEC 8859 specifies 183 characters allocated to the bit combinations of the code table (table 2).
Control functions , such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are graphic symbols made up from the graphic representations of two or more characters.
1. Characters of the set and their coded representation
Table 1A - Name and Coded Representation of the Characters in Columns 02 to 07
| Bit combination | Name |
| 02/00 | SPACE |
| 02/01 | EXCLAMATION MARK |
| 02/02 | QUOTATION MARK |
| 02/03 | NUMBER SIGN |
| 02/04 | DOLLAR SIGN |
| 02/05 | PERCENT SIGN |
| 02/06 | AMPERSAND |
| 02/07 | APOSTROPHE |
| 02/08 | LEFT PARENTHESIS |
| 02/09 | RIGHT PARENTHESIS |
| 02/10 | ASTERISK |
| 02/11 | PLUS SIGN |
| 02/12 | COMMA |
| 02/13 | HYPHEN-MINUS |
| 02/14 | FULL STOP |
| 02/15 | SOLIDUS |
| 03/00 | DIGIT ZERO |
| 03/01 | DIGIT ONE |
| 03/02 | DIGIT TWO |
| 03/03 | DIGIT THREE |
| 03/04 | DIGIT FOUR |
| 03/05 | DIGIT FIVE |
| 03/06 | DIGIT SIX |
| 03/07 | DIGIT SEVEN |
| 03/08 | DIGIT EIGHT |
| 03/09 | DIGIT NINE |
| 03/10 | COLON |
| 03/11 | SEMICOLON |
| 03/12 | LESS-THAN SIGN |
| 03/13 | EQUALS SIGN |
| 03/14 | GREATER-THAN SIGN |
| 03/15 | QUESTION MARK |
| 04/00 | COMMERCIAL AT |
| 04/01 | LATIN CAPITAL LETTER A |
| 04/02 | LATIN CAPITAL LETTER B |
| 04/03 | LATIN CAPITAL LETTER C |
| 04/04 | LATIN CAPITAL LETTER D |
| 04/05 | LATIN CAPITAL LETTER E |
| 04/06 | LATIN CAPITAL LETTER F |
| 04/07 | LATIN CAPITAL LETTER G |
| 04/08 | LATIN CAPITAL LETTER H |
| 04/09 | LATIN CAPITAL LETTER I |
| 04/10 | LATIN CAPITAL LETTER J |
| 04/11 | LATIN CAPITAL LETTER K |
| 04/12 | LATIN CAPITAL LETTER L |
| 04/13 | LATIN CAPITAL LETTER M |
| 04/14 | LATIN CAPITAL LETTER N |
| 04/15 | LATIN CAPITAL LETTER O |
| 05/00 | LATIN CAPITAL LETTER P |
| 05/01 | LATIN CAPITAL LETTER Q |
| 05/02 | LATIN CAPITAL LETTER R |
| 05/03 | LATIN CAPITAL LETTER S |
| 05/04 | LATIN CAPITAL LETTER T |
| 05/05 | LATIN CAPITAL LETTER U |
| 05/06 | LATIN CAPITAL LETTER V |
| 05/07 | LATIN CAPITAL LETTER W |
| 05/08 | LATIN CAPITAL LETTER X |
| 05/09 | LATIN CAPITAL LETTER Y |
| 05/10 | LATIN CAPITAL LETTER Z |
| 05/11 | LEFT SQUARE BRACKET |
| 05/12 | REVERSE SOLIDUS |
| 05/13 | RIGHT SQUARE BRACKET |
| 05/14 | CIRCUMFLEX ACCENT |
| 05/15 | LOW LINE |
| 06/00 | GRAVE ACCENT |
| 06/01 | LATIN SMALL LETTER A |
| 06/02 | LATIN SMALL LETTER B |
| 06/03 | LATIN SMALL LETTER C |
| 06/04 | LATIN SMALL LETTER D |
| 06/05 | LATIN SMALL LETTER E |
| 06/06 | LATIN SMALL LETTER F |
| 06/07 | LATIN SMALL LETTER G |
| 06/08 | LATIN SMALL LETTER H |
| 06/09 | LATIN SMALL LETTER I |
| 06/10 | LATIN SMALL LETTER J |
| 06/11 | LATIN SMALL LETTER K |
| 06/12 | LATIN SMALL LETTER L |
| 06/13 | LATIN SMALL LETTER M |
| 06/14 | LATIN SMALL LETTER N |
| 06/15 | LATIN SMALL LETTER O |
| 07/00 | LATIN SMALL LETTER P |
| 07/01 | LATIN SMALL LETTER Q |
| 07/02 | LATIN SMALL LETTER R |
| 07/03 | LATIN SMALL LETTER S |
| 07/04 | LATIN SMALL LETTER T |
| 07/05 | LATIN SMALL LETTER U |
| 07/06 | LATIN SMALL LETTER V |
| 07/07 | LATIN SMALL LETTER W |
| 07/08 | LATIN SMALL LETTER X |
| 07/09 | LATIN SMALL LETTER Y |
| 07/10 | LATIN SMALL LETTER Z |
| 07/11 | LEFT CURLY BRACKET |
| 07/12 | VERTICAL LINE |
| 07/13 | RIGHT CURLY BRACKET |
| 07/14 | TILDE |
Table 1B - Name and Coded Representation of the Characters in Columns 10 to 15
Bit combination Name 10/00 NO-BREAK SPACE 10/01 THAI CHARACTER KO KAI 10/02 THAI CHARACTER KHO KHAI 10/03 THAI CHARACTER KHO KHUAT 10/04 THAI CHARACTER KHO KHWAI 10/05 THAI CHARACTER KHO KHON 10/06 THAI CHARACTER KHO RAKHANG 10/07 THAI CHARACTER NGO NGU 10/08 THAI CHARACTER CHO CHAN 10/09 THAI CHARACTER CHO CHING 10/10 THAI CHARACTER CHO CHANG 10/11 THAI CHARACTER SO SO 10/12 THAI CHARACTER CHO CHOE 10/13 THAI CHARACTER YO YING 10/14 THAI CHARACTER DO CHADA 10/15 THAI CHARACTER TO PATAK 11/00 THAI CHARACTER THO THAN 11/01 THAI CHARACTER THO NANGMONTHO 11/02 THAI CHARACTER THO PHUTHAO 11/03 THAI CHARACTER NO NEN 11/04 THAI CHARACTER DO DEK 11/05 THAI CHARACTER TO TAO 11/06 THAI CHARACTER THO THUNG 11/07 THAI CHARACTER THO THAHAN 11/08 THAI CHARACTER THO THONG 11/09 THAI CHARACTER NO NU 11/10 THAI CHARACTER BO BAIMAI 11/11 THAI CHARACTER PO PLA 11/12 THAI CHARACTER PHO PHUNG 11/13 THAI CHARACTER FO FA 11/14 THAI CHARACTER PHO PHAN 11/15 THAI CHARACTER FO FAN 12/00 THAI CHARACTER PHO SAMPHAO 12/01 THAI CHARACTER MO MA 12/02 THAI CHARACTER YO YAK 12/03 THAI CHARACTER RO RUA 12/04 THAI CHARACTER RU 12/05 THAI CHARACTER LO LING 12/06 THAI CHARACTER LU 12/07 THAI CHARACTER WO WAEN 12/08 THAI CHARACTER SO SALA 12/09 THAI CHARACTER SO RUSI 12/10 THAI CHARACTER SO SUA 12/11 THAI CHARACTER HO HIP 12/12 THAI CHARACTER LO CHULA 12/13 THAI CHARACTER O ANG 12/14 THAI CHARACTER HO NOK HUK 12/15 THAI CHARACTER PAI YAN NOI 13/00 THAI CHARACTER SARA A 13/01 THAI CHARACTER MAI HAN-AKAT 13/02 THAI CHARACTER SARA AA 13/03 THAI CHARACTER SARA AM 13/04 THAI CHARACTER SARA I 13/05 THAI CHARACTER SARA II 13/06 THAI CHARACTER SARA UE 13/07 THAI CHARACTER SARA UEE 13/08 THAI CHARACTER SARA U 13/09 THAI CHARACTER SARA UU 13/10 THAI CHARACTER PHINTHU 13/11 (This position shall not be used) 13/12 (This position shall not be used) 13/13 (This position shall not be used) 13/04 (This position shall not be used) 13/15 THAI CURRENCY SYMBOL BAHT 14/00 THAI CHARACTER SARA E 14/01 THAI CHARACTER SARA AE 14/02 THAI CHARACTER SARA O 14/03 THAI CHARACTER SARA AI MAI MUAN 14/04 THAI CHARACTER SARA AI MAI MALAI 14/05 THAI CHARACTER LAK KHANG YAO 14/06 THAI CHARACTER MAI YAMOK 14/07 THAI CHARACTER MAI TAI KHU 14/08 THAI CHARACTER MAI EK 14/09 THAI CHARACTER MAI THO 14/10 THAI CHARACTER MAI TRI 14/11 THAI CHARACTER MAI CHATTAWA 14/12 THAI CHARACTER THANTHAKHAT 14/13 THAI CHARACTER NIKHAHIT 14/14 THAI CHARACTER YAMAKKAN 14/15 THAI CHARACTER FONGMAN 15/00 THAI CHARACTER LEK SUN 15/01 THAI CHARACTER LEK NUNG 15/02 THAI CHARACTER LEK SONG 15/03 THAI CHARACTER LEK SAM 15/04 THAI CHARACTER LEK SI 15/05 THAI CHARACTER LEK HA 15/06 THAI CHARACTER LEK HOK 15/07 THAI CHARACTER LEK CHET 15/08 THAI CHARACTER LEK PAET 15/09 THAI CHARACTER LEK KAO 15/10 THAI CHARACTER ANGKHANKHU 15/11 THAI CHARACTER KHOMUT 15/12 (This position shall not be used) 15/13 (This position shall not be used) 15/14 (This position shall not be used) 15/15 (This position shall not be used) 2. Code table
For each character in the set the code table (table 2) shows a graphic symbol at the position in the code table corresponding to the bit combination specified in table 1A or 1B.
The shaded positions in the code table correspond to bit combinations that do not represent graphic characters. Their use is outside the scope of ISO/IEC 8859; it is specified in other International Standards, for example ISO/IEC 6429.
Table 2 - Code Table
[To be supplied. A photocopy of the table from the ECMA standard is attached to the printed text]
8. Identification of the character set
- Identification according to ISO/IEC 2022 and ISO/IEC 4873
The graphic characters of this part of ISO/IEC 8859 constitute a single coded character set. However in accordance with ISO/IEC 2022 and ISO/IEC 4873 the code table of this part of ISO/IEC 8859 may be considered to consist of the following components:
- the character SPACE represented by bit combination 02/00;
- a 94-character G0 graphic character set represented by bit combinations 02/01 to 07/14;
- a 96-character G1 graphic character set represented by bit combinations 10/00 to 15/15.
When the identification methods of ISO/IEC 2022 or ISO/IEC 4873 are used this part of ISO/IEC 8859 shall be identified by the following pair of designation functions :
GZD4 04/02 (ESC 02/08 04/02) G1D6 05/04 (ESC 02/13 05/04)
Note : the corresponding escape sequences are shown in parentheses.
- Identification according to ISO/IEC 8824 (ASN.1)
In the terminology of ISO/IEC 8824 the character set of this part of ISO/IEC 8859 and the corresponding coded representations are distinct, and are known as the "character abstract syntax" and the "character transfer syntax" respectively.
When the identification methods of ISO/IEC 8824 are used this part of ISO/IEC 8859 shall be identified by the following object identifiers :
- character set {iso standard 8859 11 abstract-syntax (1)}
- coded representations {iso standard 8859 11 abstract-syntax (0)}
The corresponding object descriptors shall be :
- character set "ISO 8859 part 11 repertoire"
- coded representations "ISO 8859 part 11 code"
- Identification using the ISO International register of coded character sets to be used with escape sequences
According to 7.1 above the character set of this part of ISO/IEC 8859 may be considered to consist of the character SPACE, a 94-character G0 and G1 graphic character sets may be identified by the use of the Registration Numbers from the ISO International register of coded character sets to be used with escape sequences.
When these registration numbers are used this part of ISO/IEC 8859 shall be identified by the following pair of registration numbers :
- G0 graphic character set ISO-IR 6
- G1 graphic character set ISO-IR 166
(Informative)
Bibliography
- ISO/IEC 6429, Information technology - Control functions for coded character sets.
- ISO/IEC 10646-1, Information technology - Universal Multiple-Octet Coded Character Set (UCS).
- ISO International register of coded character sets to be used with escape sequences.
