DICOM PS3.5 2024e - Data Structures and Encoding

K.3 Example of Long Text Value Representation in the Chinese Language with GB2312 G1

Example K.3-1. Example of Long Text Value Representation in the Chinese Language with GB2312 G1

Chinese (ISO 2022 IR 58) and ASCII [ISO 646] character sets can be used intermingled without explicit escape sequences between them. The Chinese character set ISO IR 58 is invoked to the G1 area, and the ASCII character set is invoked to the G0 area. The following is an example of a Long Text Value Representation that includes ASCII and Chinese character set. Every line is presumed to start in the default character set and requires an explicit invocation of [GB 2312] into G1, but does not require re-invocation of the default ASCII character set into G0.

Specific Character Set:

  • (0008,0005) \ISO 2022 IR 58

Character String (with CR LF after each line):

  • 1) 第一行文字。

    2) 第二行文字。

    3) 第三行文字。

Encoded String:

  • 1) ESC 02/04 02/09 04/01 第一行文字。

    2) ESC 02/04 02/09 04/01 第二行文字。

    3) ESC 02/04 02/09 04/01 第三行文字。

Character encoded representation (GB2312):

  • 0x31 0x2e 0x1B 0x24 0x29 0x41 0xB5 0xDA 0xD2 0xBB 0xD0 0xD0 0xCE 0xC4 0xD7 0xD6 0xA1 0xA3 0x0D 0x0A

    0x32 0x2e 0x1B 0x24 0x29 0x41 0xB5 0xDA 0xB6 0xFE 0xD0 0xD0 0xCE 0xC4 0xD7 0xD6 0xA1 0xA3 0x0D 0x0A

    0x33 0x2e 0x1B 0x24 0x29 0x41 0xB5 0xDA 0xC8 0xFD 0xD0 0xD0 0xCE 0xC4 0xD7 0xD6 0xA1 0xA3 0x0D 0x0A

Note

The underlined byte codes correspond to double byte characters, the bold byte codes to escape sequences.


Table K.3-1. Character Sets and Escape Sequences used in the Examples of Person Name

Character Set Description

Component Group

Value of (0008,0005) Defined Term

ISO registration number

Standard for Code Extension

ESC Sequence

Code Element

Character Set: Purpose of Use

Chinese

First:

Alphabetic (English name)

Value 1:

none

ISO-IR 6

G0

[ISO 646]

Second:

Ideographic

Value 1:

ISO 2022 IR 58

ISO-IR58

ISO 2022

ESC 02/04 02/09 04/01

G1

ISO 2022 CN:

Chinese

Third:

Phonetic

Value 1:

none

ISO-IR 6

ISO 2022

ESC 02/08 04/02

G0

[ISO 646]:

For delimiters


DICOM PS3.5 2024e - Data Structures and Encoding