
066  Character Sets Present (NR)

Input Standards

System supplied/System supplied

1st Indicator: Undefined
    blank character: Undefined
2nd Indicator: Undefined
    blank character: Undefined

Subfields (R = Repeatable, NR = Nonrepeatable) and input standards
‡c  Alternate graphic character set identification (R): System supplied/System supplied

Definition

Field 066 is system supplied and identifies the presence of any character sets for non-Latin scripts in the record. You cannot add, edit, or delete field 066. For more technical information about this field, see MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media (http://www.loc.gov/marc/specifications/) and Chapter 4 of OCLC-MARC Records (http://oclc.org/support/services/worldcat/documentation/records/subscription.en.html).

1st Indicator

Undefined. The 1st indicator position is undefined and contains a blank character.

blank character: Undefined

2nd Indicator

Undefined. The 2nd indicator position is undefined and contains a blank character.

blank character: Undefined

Subfields

 
‡c Alternate graphic character set identification

Subfield ‡c contains a code identifying an alternate graphic character set used in the record. The subfield is repeated for each additional character set present. The following codes may appear:

$1 Chinese, Japanese, Korean script
(2 Basic Hebrew script
(3 Basic Arabic script
(4 Extended Arabic script
(N Basic Cyrillic script
(Q Extended Cyrillic script
(S Extended Greek script
Armn Armenian script
Beng Bengali script
Cyrl Cyrillic script (outside the MARC-8 character set)
Deva Devanagari script
Ethi Ethiopic script
Syrc Syriac script
Taml Tamil script
Thai Thai script

There are no MARC-8 character sets for Armenian, Bengali, Cyrillic (outside the MARC-8 character set), Devanagari, Ethiopic, Syriac, Tamil, and Thai. For these scripts, OCLC implemented the four-character script identification codes listed above, based on the ISO 15924 Code Lists (http://www.unicode.org/iso15924/codelists.html). OCLC supports Unicode UTF-8 characters for these scripts.

Note: Records containing non-MARC-8 characters are expected to be output in the UTF-8 (Unicode) data format. Field 066 does not appear in records exported in UTF-8 (Unicode) and the script code does not appear in subfield ‡6 of the 880 linkage field.

If multiple non-Latin scripts exist in a single field or a single record and the MARC-8 data format is used, all non-MARC-8 characters are expressed as numeric character references (NCR). An NCR consists of an ampersand ( & ), a pound sign ( # ), a lowercase letter x, the four-position Unicode character code, and a trailing semicolon ( ; ). For example, &#x091A; represents the Devanagari script character ( च ), which displays as the appropriate character when fonts for that script are available. The non-MARC-8 script code does not appear in subfield ‡6 of the 880 linkage field.
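The numeric character reference format described above can be illustrated with a short Python sketch. This is not OCLC's implementation. The function names are hypothetical, the uppercase hexadecimal digits are an assumption, and the is_marc8 test passed in by the caller is only a stand-in, because deciding which characters belong to MARC-8 requires the full MARC-8 code tables.

```python
import re


def to_ncr(ch):
    """Return the NCR for one character: &, #, x, four hex positions, ;."""
    # Assumption: hex digits are uppercase and zero-padded to four positions.
    return "&#x{:04X};".format(ord(ch))


def from_ncr(text):
    """Replace every &#xHHHH; reference in text with the character it names."""
    return re.sub(
        r"&#x([0-9A-Fa-f]{4,6});",
        lambda m: chr(int(m.group(1), 16)),
        text,
    )


def encode_non_marc8(text, is_marc8):
    """Keep characters accepted by is_marc8; express all others as NCRs.

    is_marc8 is a hypothetical caller-supplied predicate. A real one would
    consult the MARC-8 code tables.
    """
    return "".join(ch if is_marc8(ch) else to_ncr(ch) for ch in text)


if __name__ == "__main__":
    # Stand-in predicate for illustration only: treat ASCII as MARC-8.
    ascii_only = lambda ch: ord(ch) < 128
    print(to_ncr("च"))
    print(encode_non_marc8("ca: च", ascii_only))
    print(from_ncr("&#x091A;"))
```

Converting in both directions is useful when checking that a record exported in MARC-8 round-trips to the same Unicode text.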

066     ‡c (2 ‡c $1 ‡c (3
[Hebrew, Chinese, and Arabic scripts present]
066     ‡c Thai
[Thai script present]
066     ‡c Armn ‡c (N
[Armenian and Basic Cyrillic script present]
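As a rough illustration of how the examples above can be read, the following Python sketch maps each ‡c code to the script name given in the code list. The function name and the handling of the displayed field text (splitting on the ‡ delimiter) are assumptions made for this example. A real MARC record would normally be read with a MARC parsing library rather than from its display form.

```python
# Codes and script names as listed for subfield ‡c of field 066.
SCRIPT_CODES = {
    "$1": "Chinese, Japanese, Korean script",
    "(2": "Basic Hebrew script",
    "(3": "Basic Arabic script",
    "(4": "Extended Arabic script",
    "(N": "Basic Cyrillic script",
    "(Q": "Extended Cyrillic script",
    "(S": "Extended Greek script",
    "Armn": "Armenian script",
    "Beng": "Bengali script",
    "Cyrl": "Cyrillic script (outside the MARC-8 character set)",
    "Deva": "Devanagari script",
    "Ethi": "Ethiopic script",
    "Syrc": "Syriac script",
    "Taml": "Tamil script",
    "Thai": "Thai script",
}


def scripts_in_066(field_text):
    """Return the script names for every ‡c subfield in a displayed 066 field."""
    names = []
    # Everything before the first ‡ is the tag and indicators; skip it.
    for chunk in field_text.split("‡")[1:]:
        if chunk.startswith("c"):
            code = chunk[1:].strip()
            names.append(SCRIPT_CODES.get(code, "unknown code {!r}".format(code)))
    return names


if __name__ == "__main__":
    for line in ("066     ‡c (2 ‡c $1 ‡c (3", "066     ‡c Thai", "066     ‡c Armn ‡c (N"):
        print(line, "->", scripts_in_066(line))
```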

Indexing

For indexing and searching information, see Searching WorldCat Indexes, field 066.

Printing

Field 066 does not print.

This page last revised: April 7, 2014