Transliteration - Software Library

Character Set Mapping and Transliteration

Are you facing the challenge of working with foreign languages like Greek, Russian, Japanese or Chinese without being a native speaker and having the respective input devices at your disposal?

AddressDoctor Character Set Mapping and Transliteration is an invaluable help when working with strings in almost all common character sets. It supports 40 different character sets and can transform six non-Latin writing systems into Latin characters through Transliteration:

  • Mapping between 40 different character sets, including UTF-8, ISO 8859-1, GBK, BIG5, JIS, EBCDIC
  • Character filter on 'a'-'Z' and '0'-'9'
  • Correct "removal" of diacritics according to language specific rules
  • HTML and URL encoding and decoding
  • Unix <-> Windows line break conversions

AddressDoctor supports the following Non-Latin writing systems

  1. Greek transliteration (BGN/PCGN 1962, ISO 843 - 1997)
  2. Cyrillic transliteration (BGN/PCGN 1947, ISO 9 - 1995)
  3. Hebrew transliteration
  4. Japanese Katakana, Hiragana and Kanji transliteration
  5. Chinese Pinyin transliteration (Mandarin, Cantonese)
  6. Korean Hangul transliteration

AddressDoctor Character Set Mapping and Transliteration is fully Unicode enabled and is the basis for all AddressDoctor products.

AddressDoctor supports the following writing systems:

  1. Latin (Transliteration of various Latin code page characters ISO 8859-X to Latin, Eastern and Northern European alphabets)
  2. Greek (BGN/PCGN 1962, ISO 843 – 1997)
  3. Cyrillic (BGN/PCGN 1947, ISO 9 – 1995)
  4. Hebrew
  5. Japanese
  6. Chinese
  7. Korean