DICOM PS3.5 2024d - Data Structures and Encoding

I.2 Example of Person Name Value Representation in the Korean Language

Example I.2-1. Example of Person Name Value Representation in the Korean Language

Person names in the Korean language may be written in Hangul (phonetic characters), Hanja (ideographic characters), or Latin (alphabetic characters). The three component groups should be written in the order of alphabetic, ideographic, and phonetic (see Table 6.2-1).

Specific Character Set:

  • (0008,0005) \ISO 2022 IR 149

Character String:

  • Hong^Gildong=洪^吉洞=홍^길동

  • Hong^Gildong= ESC 02/04 02/09 04/03 洪 ^ ESC 02/04 02/09 04/03 吉洞 = ESC 02/04 02/09 04/03 홍 ^ ESC 02/04 02/09 04/03 길동

Encoded representation:

  • 04/08 06/15 06/14 06/07 05/14 04/07 06/09 06/12 06/04 06/15 06/14 06/07 03/13 01/11 02/04 02/09 04/03 15/11 15/03 05/14 01/11 02/04 02/09 04/03 13/01 12/14 13/04 13/07 03/13 01/11 02/04 02/09 04/03 12/08 10/11 05/14 01/11 02/04 02/09 04/03 11/01 14/06 11/05 11/15

An example of what might be displayed or printed by an ASCII based machine that displays or prints the Control Character ESC (01/11) using \033:

  • Hong^Gildong=\033$) C\373\363^\033$)C\321\316\324\327=\033$)C\310\253^\033$)C\261\346\265\277


Note

  1. The multi-byte character set (ISO-IR 149) and single-byte character set [ISO 646] can be used intermixed without any explicit escape sequence after the initial escape sequence. Once [ISO 646] has been designated to the GL area and ISO-IR 149 to the GR area, each character set has different code area, thus can be used intermixed. The decoder will check the most significant bit of a character to know whether it is a two byte character in the GR area (high bit one) or a one byte character in the GL area (high bit zero).

  2. In the above example of person name representation, explicit escape sequences precede each Hangul and Hanja string. These escape sequences are to meet the requirements of the code extension technique that specifies a switch to the Default Character Repertoire before delimiters. In the previous example, it is assumed that the Default Character Repertoire (ISO-646) is invoked to G0 code area and no character set to G1 area after delimiters ("^" and "=" signs). See Section 6.1.2.5.3.

DICOM PS3.5 2024d - Data Structures and Encoding