DICOM PS3.5 2025b - Data Structures and Encoding

H.3 Example of Person Name Value Representation in the Japanese Language

Character strings representing person names are encoded using a convention for PN Value Representation based on component groups with 5 components.

For languages that use ideographic characters, it is sometimes necessary to write names both in ideographic characters and in phonetic characters. Ideographic characters may be required for official purposes, while phonetic characters may be needed for pronunciation and data processing purposes.

For the purpose of writing names in ideographic characters and in phonetic characters, up to 3 component groups may be used. The delimiter of the component group shall be the equals character "=" (3DH). The three component groups in their order of occurrence are: an alphabetic representation, an ideographic representation, and a phonetic representation.

H.3.1 Value 1 of Attribute Specific Character Set (0008,0005) is Not Present.

Example H.3-1. Value 1 of Attribute Specific Character Set (0008,0005) is Not Present

In this case, ISO-IR 6 is used by default in Specific Character Set:

  • (0008,0005) \ISO 2022 IR 87

Character String:

  • Yamada^Tarou=山田^太郎=やまだ^たろう

  • Yamada^Tarou= ESC 02/04 04/02 山田 ESC 02/08 04/02 ^ ESC 02/04 04/02 太郎 ESC 02/08 04/02 = ESC 02/04 04/02 やまだ ESC 02/08 04/02 ^ ESC 02/04 04/02 たろう ESC 02/08 04/02

Encoded representation:

  • 05/09 06/01 06/13 06/01 06/04 06/01 5/14 05/04 06/01 07/02 06/15 07/05 03/13 01/11 02/04 04/02 03/11 03/03 04/05 04/04 01/11 02/08 04/02 05/14 01/11 02/04 04/02 04/02 04/00 04/15 03/10 01/11 02/08 04/02 03/13 01/11 02/04 04/02 02/04 06/04 02/04 05/14 02/04 04/00 01/11 02/08 04/02 05/14 01/11 02/04 04/02 02/04 03/15 02/04 06/13 02/04 02/06 01/11 02/08 04/02

An example of what might be displayed or printed by an ASCII based machine that displays or prints the Control Character ESC (01/11) using \033:

  • Yamada^Tarou=\033$B;3ED\033(B^\033$BB@O:\033(B=\033$B$d$^$@\033(B^\033$B$?$m$&\033(B


Table H.3-1. Character Sets and Escape Sequences Used in Example 1

Character Set Description

Component Group

Value of (0008,0005) Defined Term

ISO Registration Number

Standard for Code Extension

ESC Sequence

Code Element

Character Set: Purpose of Use

Japanese

First:

Single-byte

Value 1:

none

ISO-IR 6

GL

[ISO 646]

Second:

Ideographic

Value 2:

ISO 2022 IR 87

ISO-IR 87

ISO 2022

ESC 02/04 04/02

GL

[JIS X 0208]:

Japanese kanji, hiragana, katakana

Value 1:

none

ISO-IR 6

ISO 2022

ESC 02/08 04/02

GL

[ISO 646]

for delimiters

Third:

Phonetic

Value 2:

ISO 2022 IR 87

ISO-IR 87

ISO 2022

ESC 02/04 04/02

GL

[JIS X 0208]:

Japanese hiragana, and katakana

Value 1:

none

ISO-IR 6

ISO 2022

ESC 02/08 04/02

GL

[ISO 646]

for delimiters


DICOM PS3.5 2025b - Data Structures and Encoding