Buy
Download Tour
This is a list of the 145 Unicode conversions and encoding conversions offered
by TextPipe, in addition to 151 code
page conversions.
- Convert Unicode to ANSI
- Convert ANSI to Unicode
- Convert Unicode to ASCII
- Convert ASCII to Unicode
The Unicode conversions are found under Filters Menu\Unicode.
Unicode Normalization filters:
- NFC - Canonical Decomposition, followed by Canonical Composition
- NFD - Canonical Decomposition
- NFKD - Compatibility Decomposition
- NFKC - Compatibility Decomposition, followed by Canonical Composition
- Compose
Conversions between Unicode and:
- European languages
- ASCII
- ISO-8859-1 (Western)
- ISO-8859-2 (Central European)
- ISO-8859-3 (South European)
- ISO-8859-4 (Baltic)
- ISO-8859-5 (Cyrillic)
- ISO-8859-7 (Greek)
- ISO-8859-9 (Turkish)
- ISO-8859-10 (Nordic)
- ISO-8859-13 (Baltic)
- ISO-8859-14 (Celtic)
- ISO-8859-15 (Western)
- ISO-8859-16 (Romanian)
- Windows 1250 (Central Europe)
- Windows 1251 (Cyrillic)
- Windows 1252 (Latin 1)
- Windows 1253 (Greek)
- Windows 1254 (Turkish)
- Windows 1255 (Hebrew)
- Windows 1256 (Arabic)
- Windows 1257 (Baltic)
- Windows 1258 (Vietnam)
- CP437, CP737 DOS Greek, CP775 DOS BaltRim, CP850, CP852, CP853, CP855,
CP856 Hebrew PC, CP857, CP858, CP860, CP861, CP863, CP865, CP866, CP869,
CP1125
- MacRoman, MacCentralEurope, MacIceland, MacCroatian, MacRomaniaCyrillic,
MacUkraine, MacGreek, Mac Dingbats, Mac Farsi , Mac Romania
- Semitic languages
- ISO-8859-6 (Arabic)
- ISO-8859-8 (Hebrew Visual)
- CP255, CP1256
- CP862, CP864
- MacHebrew, MacArabic
- Japanese
- EUC-JP
- SHIFT_JIS
- P932
- ISO-2022-JP, ISO-2022-JP-1, ISO-2022-JP-2, ISO-2022-JP-3
- EUC-JISX0213
- Shift_JISX0213
- Chinese
- EUC-CN
- HZ, GBK
- GB18030 Standard Chinese
- UC-TW
- BIG5
- CP950
- BIG5-HKSCS,
- ISO-2022-CN, ISO-2022-CN-EXT
- Korean
- KOI8-R, KOI8-U, KOI8-RU
- EUC-KR
- CP949
- ISO-2022-KR
- JOHAB
- Armenian
- Georgian
- Georgian-Academy
- Georgian-PS
- Tajik
- Thai
- TIS-620
- CP874 Thai
- MacThai
- Laotian
- Vietnamese
- Platform specific/other
- HP-ROMAN8
- NEXTSTEP
- RISCOS-LATIN1
- C99
- JAVA
- IBM424
- IBM437
- IBM850
- IBM852
- IBM855
- IBM857
- IBM860
- IBM861
- IBM862
- IBM863
- IBM864
- IBM865
- IBM866
- IBM869
- JIS_X0201
- TIS-620
- Full Unicode
- UTF-8
- UCS-2, UCS-2BE, UCS-2LE
- UCS-4, UCS-4BE, UCS-4LE
- UTF-16, UTF-16BE, UTF-16LE
- UTF-32, UTF-32BE, UTF-32LE
- UTF-7, UTF-7 Optional Direct Characters
Note:
- UCS-4 is UTF-32 with support for code points beyond U+10FFFF (which are
supposed to be unassignable forever).
- UCS-2 is UTF-16 with surrogate support removed (so code points beyond
U+FFFF cannot be represented).
- Turkmen
Buy
Download Tour