Character Sets

Traditional Chinese   Simplified Chinese   Multilingual
Big5                  GB                   CCCII & EACC
CNS                                        Unicode - ISO/IEC 10646
HK GCCS - SCS

Definition

The term 'character set' is much debated. The related terms are listed below, each with the source of its definition.
  • Character
    ECMA 35 : "A member of a set of elements used for the organisation, control or representation of data".
    Unicode Consortium : "The smallest component of written language that has semantic value; refers to the abstract meaning and/or shape, rather than a specific shape (see also glyph)".
    W3C : "An atom of information".
  • Coded Character Set
    ECMA 35 : "A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations".
    Chinese : 編碼字符集 (biānmǎ zìfújí)
  • Character Repertoire
    ECMA 35 : "A specified set of characters that are represented by means of one or more bit combinations of a coded character set".
    Chinese : 字彙 (zìhuì)
  • Character Encoding Scheme
  • Charset
  • Glyph
    Unicode Consortium : "An abstract form that represents one or more glyph images". A glyph image is "The actual, concrete image of a glyph representation having been rasterized or otherwise imaged onto some display surface".
    ISO 9541-1 : "A recognizable abstract graphic symbol which is independent of a specific design".
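
The ECMA 35 definitions of coded character set and character repertoire can be illustrated with a small sketch. It is only an approximation: it assumes that Python's built-in codecs "big5", "gb2312" and "utf-8" are acceptable stand-ins for the character sets named at the top of this page (strictly speaking they are encodings of those sets), and the helper name bit_combination is hypothetical, not taken from any standard. The same abstract character gets a different bit combination in each coded character set, and a character outside a set's repertoire gets none at all.

```python
from typing import Optional


def bit_combination(char: str, codec: str) -> Optional[str]:
    """Return the bytes for `char` under `codec` as a hex string.

    Returns None when the character is outside the codec's repertoire.
    `bit_combination` is a hypothetical helper written for this example.
    """
    try:
        return char.encode(codec).hex()
    except UnicodeEncodeError:
        # The coded character set has no bit combination for this character.
        return None


# Assumption: these Python codec names stand in for Big5, GB 2312 and Unicode.
CODECS = ("big5", "gb2312", "utf-8")

# One character shared by all three, one traditional form, one simplified form.
for char in ("中", "漢", "汉"):
    for codec in CODECS:
        result = bit_combination(char, codec)
        print(f"U+{ord(char):04X} {char} {codec:<7} {result}")
```

In this sketch 中 has a bit combination in all three codecs, while the traditional form 漢 has none under "gb2312" and the simplified form 汉 has none under "big5". This is what it means for a character to fall outside a repertoire.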


You need Unicode fonts, a version 4 or later browser, and Acrobat Reader to fully explore and enjoy this webpage. (If necessary, you can download the Asian font packs for Acrobat Reader.)

Currently translating my thesis to English : more info

© Seba - contact at seba at ulyssis dot org