VISION 1995-01-02

INTERNATIONAL STANDARD

Information technology -8-bit single-byte coded graphic character sets

Part 11 : Latin/Thai character set

Contents

  1. Introduction
  2. Scope
  3. Conformance
  4. Normative references
  5. Definitions
  6. Notation, code table and character names
  7. Specification of the coded character set
  8. Identification of the character set

Annexes

A Bibliography




  1. Introduction

    ISO/IEC 8859 consists of several parts. Each part specifies a set of up to 191 graphic characters and the coded representation of these characters by means of a single 8-bit bytes. Each set is intended for use for a particular group of languages.

    Control funtions may be used in conjunction with the coded characters specified in the parts of ISO/IEC 8859. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see 6).




  2. Scope

    This part of ISO/IEC 8859 specifies a set of 183 coded graphic characters identified as Latin/Thai character set.

    This set of coded graphic characters is intended for use in data and text processing applications and may also be used for information interchange.

    The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages :

    English, Thai.

    This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1.

    This part of ISO/IEC 8859 may not be used in conjuction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3.

    The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429.

    NOTE - ISO/IEC 8859 is not intended for use with Telematic services sdefined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.




  3. Conformance
    1. Conformance of information interchange

      A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with this part of this International Standard if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause 6.

    2. Conformance of devices

      A device is in conformance with this International Standard if it conforms to the requirements of 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which contains the description specified in 2.2.1.

      1. Device description

        A device that conforms to this International Standard shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in 2.2.2 and 2.2.3.

      2. Originating devices

        An originating device shall allow its user to supply any sequence of characters from those specified in clause 6, and shall be capable of transmitting their coded representations within a CC-data-element.

      3. Receiving devices

        A receiving device shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to clause 6, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those specified there, and can distinguish them from each other.  




  4. Normative references

    ISO/IEC 2022, Information technology - Character code structure and extensin techniques.

    ISO/IEC 4873, Information technology - ISO 8-bit code for information interchange - Structure and rules for implementation.

    ISO/IEC 8824, Information technology - Open systems interconnection - Abstract Syntax Notation One (ASN.1).




  5. Definitions
    1. bit combination:

      An ordered set of bits used for the representation of characters.

    2. byte:

      A bit string that is operated upon as a unit.

    3. character:

      A member of a set of elements used for the organization, control or representation of data.

    4. coded character set; code:

      A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations.

    5. code table:

      A table showing the characters allocated to each bit combination in a code.

    6. graphic character:

      A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations.

      Note - In ISO/IEC 8859 a single bit combination is used to represent each character.

    7. graphic symbol:

      A visual representation of a graphic character or of a control function.

    8. position:

      That part of a code table identified by its column and row co-ordinates.




  6. Notation, code table and character names
    1. Notation

      The bits of the bit combinations of the 8-bit code are identified by b8, b7, b6, b5, b4, b3, b2, and b1, where b8 is the highest-order, or most-significant bit and b1 is the lowest-order, or least-significant bit.

      The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits:

      Bitb8b7b6b5b4b3b2b1
      Weight1286432168421

      Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b8 to b1 is as follows:

      - xx is the number represented by b8, b7, b6, and b5 where these bits are given the weights 8, 4, 2, and 1, respectively;

      - yy is the number represented by b4, b3, b2, and b1 where these bits are given the weights 8, 4, 2, and 1 respectively.

    2. Layout of the code table

      A 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and the rows are numbered 00 to 15.

      The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number.

      The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, is the same as that of the corresponding bit combination.

    3. Names and meanings

      This part of ISO/IEC 8859 assigns a unique name to each graphic character. These names have been taken from ISO/IEC 10646-1 (E). It also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text.

      The names chosen to denote graphic characters are intended to reflect their customary meaning. However, except for SPACE(SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY), this part of ISO/IEC 8859 does not define and does not restrict the meanings of graphic characters. Neither does it specify a particular style or font design for imaging graphic characters.

      This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corresponding position of the code table.

      1. SPACE(SP)

        A graphic character the visual representation of which consists of the absence of a graphic symbol.

      2. NO-BREAK SPACE (NBSP)

        A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented.

      3. SOFT HYPHEN (SHY)

        A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word.




  7. Specification of the coded character set

    This part of ISO/IEC 8859 specifies 183 characters allocated to the bit combinations of the code table (table 2).

    Control functions , such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are graphic symbols made up from the graphic representations of two or more characters.

    1. Characters of the set and their coded representation

    Table 1A - Name and Coded Representation of the Characters in Columns 02 to 07

    Bit combinationName
    02/00SPACE
    02/01EXCLAMATION MARK
    02/02QUOTATION MARK
    02/03NUMBER SIGN
    02/04DOLLAR SIGN
    02/05PERCENT SIGN
    02/06AMPERSAND
    02/07APOSTROPHE
    02/08LEFT PARENTHESIS
    02/09RIGHT PARENTHESIS
    02/10ASTERISK
    02/11PLUS SIGN
    02/12COMMA
    02/13HYPHEN-MINUS
    02/14FULL STOP
    02/15SOLIDUS
    03/00DIGIT ZERO
    03/01DIGIT ONE
    03/02DIGIT TWO
    03/03DIGIT THREE
    03/04DIGIT FOUR
    03/05DIGIT FIVE
    03/06DIGIT SIX
    03/07DIGIT SEVEN
    03/08DIGIT EIGHT
    03/09DIGIT NINE
    03/10COLON
    03/11SEMICOLON
    03/12LESS-THAN SIGN
    03/13EQUALS SIGN
    03/14GREATER-THAN SIGN
    03/15QUESTION MARK
    04/00COMMERCIAL AT
    04/01LATIN CAPITAL LETTER A
    04/02LATIN CAPITAL LETTER B
    04/03LATIN CAPITAL LETTER C
    04/04LATIN CAPITAL LETTER D
    04/05LATIN CAPITAL LETTER E
    04/06LATIN CAPITAL LETTER F
    04/07LATIN CAPITAL LETTER G
    04/08LATIN CAPITAL LETTER H
    04/09LATIN CAPITAL LETTER I
    04/10LATIN CAPITAL LETTER J
    04/11LATIN CAPITAL LETTER K
    04/12LATIN CAPITAL LETTER L
    04/13LATIN CAPITAL LETTER M
    04/14LATIN CAPITAL LETTER N
    04/15LATIN CAPITAL LETTER O
    05/00LATIN CAPITAL LETTER P
    05/01LATIN CAPITAL LETTER Q
    05/02LATIN CAPITAL LETTER R
    05/03LATIN CAPITAL LETTER S
    05/04LATIN CAPITAL LETTER T
    05/05LATIN CAPITAL LETTER U
    05/06LATIN CAPITAL LETTER V
    05/07LATIN CAPITAL LETTER W
    05/08LATIN CAPITAL LETTER X
    05/09LATIN CAPITAL LETTER Y
    05/10LATIN CAPITAL LETTER Z
    05/11LEFT SQUARE BRACKET
    05/12REVERSE SOLIDUS
    05/13RIGHT SQUARE BRACKET
    05/14CIRCUMFLEX ACCENT
    05/15LOW LINE
    06/00GRAVE ACCENT
    06/01LATIN SMALL LETTER A
    06/02LATIN SMALL LETTER B
    06/03LATIN SMALL LETTER C
    06/04LATIN SMALL LETTER D
    06/05LATIN SMALL LETTER E
    06/06LATIN SMALL LETTER F
    06/07LATIN SMALL LETTER G
    06/08LATIN SMALL LETTER H
    06/09LATIN SMALL LETTER I
    06/10LATIN SMALL LETTER J
    06/11LATIN SMALL LETTER K
    06/12LATIN SMALL LETTER L
    06/13LATIN SMALL LETTER M
    06/14LATIN SMALL LETTER N
    06/15LATIN SMALL LETTER O
    07/00LATIN SMALL LETTER P
    07/01LATIN SMALL LETTER Q
    07/02LATIN SMALL LETTER R
    07/03LATIN SMALL LETTER S
    07/04LATIN SMALL LETTER T
    07/05LATIN SMALL LETTER U
    07/06LATIN SMALL LETTER V
    07/07LATIN SMALL LETTER W
    07/08LATIN SMALL LETTER X
    07/09LATIN SMALL LETTER Y
    07/10LATIN SMALL LETTER Z
    07/11LEFT CURLY BRACKET
    07/12VERTICAL LINE
    07/13RIGHT CURLY BRACKET
    07/14TILDE

    Table 1B - Name and Coded Representation of the Characters in Columns 10 to 15

Bit combinationName
10/00NO-BREAK SPACE
10/01THAI CHARACTER KO KAI
10/02THAI CHARACTER KHO KHAI
10/03THAI CHARACTER KHO KHUAT
10/04THAI CHARACTER KHO KHWAI
10/05THAI CHARACTER KHO KHON
10/06THAI CHARACTER KHO RAKHANG
10/07THAI CHARACTER NGO NGU
10/08THAI CHARACTER CHO CHAN
10/09THAI CHARACTER CHO CHING
10/10THAI CHARACTER CHO CHANG
10/11THAI CHARACTER SO SO
10/12THAI CHARACTER CHO CHOE
10/13THAI CHARACTER YO YING
10/14THAI CHARACTER DO CHADA
10/15THAI CHARACTER TO PATAK
11/00THAI CHARACTER THO THAN
11/01THAI CHARACTER THO NANGMONTHO
11/02THAI CHARACTER THO PHUTHAO
11/03THAI CHARACTER NO NEN
11/04THAI CHARACTER DO DEK
11/05THAI CHARACTER TO TAO
11/06THAI CHARACTER THO THUNG
11/07THAI CHARACTER THO THAHAN
11/08THAI CHARACTER THO THONG
11/09THAI CHARACTER NO NU
11/10THAI CHARACTER BO BAIMAI
11/11THAI CHARACTER PO PLA
11/12THAI CHARACTER PHO PHUNG
11/13THAI CHARACTER FO FA
11/14THAI CHARACTER PHO PHAN
11/15THAI CHARACTER FO FAN
12/00THAI CHARACTER PHO SAMPHAO
12/01THAI CHARACTER MO MA
12/02THAI CHARACTER YO YAK
12/03THAI CHARACTER RO RUA
12/04THAI CHARACTER RU
12/05THAI CHARACTER LO LING
12/06THAI CHARACTER LU
12/07THAI CHARACTER WO WAEN
12/08THAI CHARACTER SO SALA
12/09THAI CHARACTER SO RUSI
12/10THAI CHARACTER SO SUA
12/11THAI CHARACTER HO HIP
12/12THAI CHARACTER LO CHULA
12/13THAI CHARACTER O ANG
12/14THAI CHARACTER HO NOK HUK
12/15THAI CHARACTER PAI YAN NOI
13/00THAI CHARACTER SARA A
13/01THAI CHARACTER MAI HAN-AKAT
13/02THAI CHARACTER SARA AA
13/03THAI CHARACTER SARA AM
13/04THAI CHARACTER SARA I
13/05THAI CHARACTER SARA II
13/06THAI CHARACTER SARA UE
13/07THAI CHARACTER SARA UEE
13/08THAI CHARACTER SARA U
13/09THAI CHARACTER SARA UU
13/10THAI CHARACTER PHINTHU
13/11(This position shall not be used)
13/12(This position shall not be used)
13/13(This position shall not be used)
13/04(This position shall not be used)
13/15THAI CURRENCY SYMBOL BAHT
14/00THAI CHARACTER SARA E
14/01THAI CHARACTER SARA AE
14/02THAI CHARACTER SARA O
14/03THAI CHARACTER SARA AI MAI MUAN
14/04THAI CHARACTER SARA AI MAI MALAI
14/05THAI CHARACTER LAK KHANG YAO
14/06THAI CHARACTER MAI YAMOK
14/07THAI CHARACTER MAI TAI KHU
14/08THAI CHARACTER MAI EK
14/09THAI CHARACTER MAI THO
14/10THAI CHARACTER MAI TRI
14/11THAI CHARACTER MAI CHATTAWA
14/12THAI CHARACTER THANTHAKHAT
14/13THAI CHARACTER NIKHAHIT
14/14THAI CHARACTER YAMAKKAN
14/15THAI CHARACTER FONGMAN
15/00THAI CHARACTER LEK SUN
15/01THAI CHARACTER LEK NUNG
15/02THAI CHARACTER LEK SONG
15/03THAI CHARACTER LEK SAM
15/04THAI CHARACTER LEK SI
15/05THAI CHARACTER LEK HA
15/06THAI CHARACTER LEK HOK
15/07THAI CHARACTER LEK CHET
15/08THAI CHARACTER LEK PAET
15/09THAI CHARACTER LEK KAO
15/10THAI CHARACTER ANGKHANKHU
15/11THAI CHARACTER KHOMUT
15/12(This position shall not be used)
15/13(This position shall not be used)
15/14(This position shall not be used)
15/15(This position shall not be used)

 2. Code table

For each character in the set the code table (table 2) shows a graphic symbol at the position in the code table corresponding to the bit combination specified in table 1A or 1B.

The shaded positions in the code table correspond to bit combinations that do not represent graphic characters. Their use is outside the scope of ISO/IEC 8859; it is specified in other International Standards, for example ISO/IEC 6429.

Table 2 - Code Table

[To be supplied. A photocopy of the table from the ECMA standard is attached to the printed text]




  8. Identification of the character set

  1. Identification according to ISO/IEC 2022 and ISO/IEC 4873

    The graphic characters of this part of ISO/IEC 8859 constitute a single coded character set. However in accordance with ISO/IEC 2022 and ISO/IEC 4873 the code table of this part of ISO/IEC 8859 may be considered to consist of the following components:

    • the character SPACE represented by bit combination 02/00;
    • a 94-character G0 graphic character set represented by bit combinations 02/01 to 07/14;
    • a 96-character G1 graphic character set represented by bit combinations 10/00 to 15/15.

    When the identification methods of ISO/IEC 2022 or ISO/IEC 4873 are used this part of ISO/IEC 8859 shall be identified by the following pair of designation functions :

    GZD404/02(ESC 02/08 04/02)
    G1D605/04(ESC 02/13 05/04)

    Note : the corresponding escape sequences are shown in parentheses.

  2. Identification according to ISO/IEC 8824 (ASN.1)

    In the terminology of ISO/IEC 8824 the character set of this part of ISO/IEC 8859 and the corresponding coded representations are distinct, and are known as the "character abstract syntax" and the "character transfer syntax" respectively.

    When the identification methods of ISO/IEC 8824 are used this part of ISO/IEC 8859 shall be identified by the following object identifiers :

    - character set {iso standard 8859 11 abstract-syntax (1)}

    - coded representations {iso standard 8859 11 abstract-syntax (0)}

    The corresponding object descriptors shall be :

    - character set "ISO 8859 part 11 repertoire"

    - coded representations "ISO 8859 part 11 code"

  3. Identification using the ISO International register of coded character sets to be used with escape sequences

    According to 7.1 above the character set of this part of ISO/IEC 8859 may be considered to consist of the character SPACE, a 94-character G0 and G1 graphic character sets may be identified by the use of the Registration Numbers from the ISO International register of coded character sets to be used with escape sequences.

    When these registration numbers are used this part of ISO/IEC 8859 shall be identified by the following pair of registration numbers :

    - G0 graphic character set ISO-IR 6

    - G1 graphic character set ISO-IR 166




 Annex A

(Informative)

Bibliography

Table 2 - Code table of Latin alphabet No.11


View full-size table