State of Unidecode 0.04.1 mapping
This report shows the state of the Unidecode 0.04.1 code mapping data. It was generated by the unicheck.py utility.
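The per-range counts in this report can be pictured with a short sketch. It is not the unicheck.py source. It assumes each table file holds a tuple of replacement strings indexed by the low byte of the code point, and that "[?]" is the marker for an unknown mapping. The name count_block and the sample table are hypothetical, and the sketch only handles a range that fits inside a single 256-entry table.

```python
# Hypothetical sketch; not the actual unicheck.py implementation.
UNKNOWN = "[?]"  # marker the report counts as "unknown"


def count_block(data, start, end):
    """Return (nodef, defined, width) for code points start..end inclusive.

    `data` stands in for the tuple of replacement strings of one table file;
    data[i] is the replacement for the code point whose low byte is i.
    Assumes start and end share the same high byte.
    """
    width = end - start + 1
    lo, hi = start & 0xFF, end & 0xFF
    present = data[lo:hi + 1]  # slicing tolerates tables with fewer than 256 entries
    nodef = sum(1 for s in present if s == UNKNOWN)
    return nodef, len(present), width


# Synthetic 255-entry table (last entry absent), made up for illustration only.
sample = ("x",) * 0xB0 + (UNKNOWN,) * 16 + ("y",) * 63
print(count_block(sample, 0x02B0, 0x02FF))  # (16, 79, 80)
```

A table that is one entry short of 256 yields a "defined" count one below the range width, which is the pattern many rows of this report show.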
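COUNT OF CODE POINTS MARKED [?] (UNKNOWN), BY RANGE
Columns, as the rows indicate: NoDef = entries marked [?]; Defnd = entries present in the mapping data; Width = code points in the range; Bad% = NoDef as a percentage of Defnd, truncated (100% when Defnd is 0).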
NoDef Defnd Width Bad% Code Range Range Description
----- ----- ----- ---- --------- ----------------------------------------
0 128 128 0% 0000-007f Basic Latin
0 128 128 0% 0080-00ff Latin-1 Supplement
0 128 128 0% 0100-017f Latin Extended-A
30 208 208 14% 0180-024f Latin Extended-B
2 96 96 2% 0250-02af IPA Extensions
16 79 80 20% 02b0-02ff Spacing Modifier Letters
30 112 112 26% 0300-036f Combining Diacritical Marks
33 143 144 23% 0370-03ff Greek and Coptic
17 255 256 6% 0400-04ff Cyrillic
48 48 48 100% 0500-052f Cyrillic Supplement
10 96 96 10% 0530-058f Armenian
29 111 112 26% 0590-05ff Hebrew
49 255 256 19% 0600-06ff Arabic
9 80 80 11% 0700-074f Syriac
48 48 48 100% 0750-077f Arabic Supplement
15 64 64 23% 0780-07bf Thaana
63 63 64 100% 07c0-07ff NKo
0 0 64 100% 0800-083f Samaritan
24 128 128 18% 0900-097f Devanagari
38 127 128 29% 0980-09ff Bengali
53 128 128 41% 0a00-0a7f Gurmukhi
49 127 128 38% 0a80-0aff Gujarati
48 128 128 37% 0b00-0b7f Oriya
65 127 128 51% 0b80-0bff Tamil
48 128 128 37% 0c00-0c7f Telugu
47 127 128 37% 0c80-0cff Kannada
49 128 128 38% 0d00-0d7f Malayalam
47 127 128 37% 0d80-0dff Sinhala
41 128 128 32% 0e00-0e7f Thai
62 127 128 48% 0e80-0eff Lao
63 255 256 24% 0f00-0fff Tibetan
82 160 160 51% 1000-109f Myanmar
17 95 96 17% 10a0-10ff Georgian
15 255 256 5% 1100-11ff Hangul Jamo
40 384 384 10% 1200-137f Ethiopic
32 32 32 100% 1380-139f Ethiopic Supplement
10 95 96 10% 13a0-13ff Cherokee
11 640 640 1% 1400-167f Unified Canadian Aboriginal Syllabics
3 32 32 9% 1680-169f Ogham
14 95 96 14% 16a0-16ff Runic
32 32 32 100% 1700-171f Tagalog
32 32 32 100% 1720-173f Hanunoo
32 32 32 100% 1740-175f Buhid
32 32 32 100% 1760-177f Tagbanwa
24 127 128 18% 1780-17ff Khmer
22 176 176 12% 1800-18af Mongolian
79 79 80 100% 18b0-18ff Unified Canadian Aboriginal Syllabics Ex
0 0 80 100% 1900-194f Limbu
0 0 48 100% 1950-197f Tai Le
0 0 96 100% 1980-19df New Tai Lue
0 0 32 100% 19e0-19ff Khmer Symbols
0 0 32 100% 1a00-1a1f Buginese
0 0 144 100% 1a20-1aaf Tai Tham
0 0 128 100% 1b00-1b7f Balinese
0 0 64 100% 1b80-1bbf Sundanese
0 0 80 100% 1c00-1c4f Lepcha
0 0 48 100% 1c50-1c7f Ol Chiki
0 0 48 100% 1cd0-1cff Vedic Extensions
0 0 128 100% 1d00-1d7f Phonetic Extensions
0 0 64 100% 1d80-1dbf Phonetic Extensions Supplement
0 0 64 100% 1dc0-1dff Combining Diacritical Marks Supplement
9 255 256 3% 1e00-1eff Latin Extended Additional
22 255 256 8% 1f00-1fff Greek Extended
29 112 112 25% 2000-206f General Punctuation
17 48 48 35% 2070-209f Superscripts and Subscripts
32 48 48 66% 20a0-20cf Currency Symbols
27 47 48 57% 20d0-20ff Combining Diacritical Marks for Symbols
21 80 80 26% 2100-214f Letterlike Symbols
15 64 64 23% 2150-218f Number Forms
11 111 112 9% 2190-21ff Arrows
255 255 256 100% 2200-22ff Mathematical Operators
255 255 256 100% 2300-23ff Miscellaneous Technical
25 64 64 39% 2400-243f Control Pictures
21 32 32 65% 2440-245f Optical Character Recognition
20 159 160 12% 2460-24ff Enclosed Alphanumerics
0 128 128 0% 2500-257f Box Drawing
10 32 32 31% 2580-259f Block Elements
7 95 96 7% 25a0-25ff Geometric Shapes
146 255 256 57% 2600-26ff Miscellaneous Symbols
5 192 192 2% 2700-27bf Dingbats
48 48 48 100% 27c0-27ef Miscellaneous Mathematical Symbols-A
15 15 16 100% 27f0-27ff Supplemental Arrows-A
0 256 256 0% 2800-28ff Braille Patterns
0 0 128 100% 2900-297f Supplemental Arrows-B
0 0 128 100% 2980-29ff Miscellaneous Mathematical Symbols-B
0 0 256 100% 2a00-2aff Supplemental Mathematical Operators
0 0 256 100% 2b00-2bff Miscellaneous Symbols and Arrows
0 0 96 100% 2c00-2c5f Glagolitic
0 0 32 100% 2c60-2c7f Latin Extended-C
0 0 128 100% 2c80-2cff Coptic
0 0 48 100% 2d00-2d2f Georgian Supplement
0 0 80 100% 2d30-2d7f Tifinagh
0 0 96 100% 2d80-2ddf Ethiopic Extended
0 0 32 100% 2de0-2dff Cyrillic Extended-A
128 128 128 100% 2e00-2e7f Supplemental Punctuation
127 127 128 100% 2e80-2eff CJK Radicals Supplement
224 224 224 100% 2f00-2fdf Kangxi Radicals
16 16 16 100% 2fe0-2fef NOT SPECIFIED
15 15 16 100% 2ff0-2fff Ideographic Description Characters
3 64 64 4% 3000-303f CJK Symbols and Punctuation
6 96 96 6% 3040-309f Hiragana
1 95 96 1% 30a0-30ff Katakana
8 48 48 16% 3100-312f Bopomofo
2 96 96 2% 3130-318f Hangul Compatibility Jamo
0 16 16 0% 3190-319f Kanbun
8 32 32 25% 31a0-31bf Bopomofo Extended
48 48 48 100% 31c0-31ef CJK Strokes
15 15 16 100% 31f0-31ff Katakana Phonetic Extensions
53 255 256 20% 3200-32ff Enclosed CJK Letters and Months
6 255 256 2% 3300-33ff CJK Compatibility
192 192 6592 100% 3400-4dbf CJK Unified Ideographs Extension A
63 63 64 100% 4dc0-4dff Yijing Hexagram Symbols
350 20991 20992 1% 4e00-9fff CJK Unified Ideographs
3 1168 1168 0% a000-a48f Yi Syllables
14 64 64 21% a490-a4cf Yi Radicals
47 47 48 100% a4d0-a4ff Lisu
0 0 320 100% a500-a63f Vai
0 0 96 100% a640-a69f Cyrillic Extended-B
0 0 96 100% a6a0-a6ff Bamum
0 0 32 100% a700-a71f Modifier Tone Letters
0 0 224 100% a720-a7ff Latin Extended-D
0 0 48 100% a800-a82f Syloti Nagri
0 0 16 100% a830-a83f Common Indic Number Forms
0 0 64 100% a840-a87f Phags-pa
0 0 96 100% a880-a8df Saurashtra
0 0 32 100% a8e0-a8ff Devanagari Extended
0 0 48 100% a900-a92f Kayah Li
0 0 48 100% a930-a95f Rejang
0 0 32 100% a960-a97f Hangul Jamo Extended-A
0 0 96 100% a980-a9df Javanese
0 0 96 100% aa00-aa5f Cham
0 0 32 100% aa60-aa7f Myanmar Extended-A
0 0 96 100% aa80-aadf Tai Viet
0 0 64 100% abc0-abff Meetei Mayek
12 11184 11184 0% ac00-d7af Hangul Syllables
79 79 80 100% d7b0-d7ff Hangul Jamo Extended-B
0 0 896 100% d800-db7f High Surrogates
0 0 128 100% db80-dbff High Private Use Surrogates
0 0 1024 100% dc00-dfff Low Surrogates
0 0 6400 100% e000-f8ff Private Use Area
222 511 512 43% f900-faff CJK Compatibility Ideographs
22 80 80 27% fb00-fb4f Alphabetic Presentation Forms
94 687 688 13% fb50-fdff Arabic Presentation Forms-A
16 16 16 100% fe00-fe0f Variation Selectors
16 16 16 100% fe10-fe1f Vertical Forms
12 16 16 75% fe20-fe2f Combining Half Marks
4 32 32 12% fe30-fe4f CJK Compatibility Forms
4 32 32 12% fe50-fe6f Small Form Variants
4 144 144 2% fe70-feff Arabic Presentation Forms-B
17 240 240 7% ff00-ffef Halfwidth and Fullwidth Forms
9 16 16 56% fff0-ffff Specials
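The Bad% figures above are consistent with an integer-truncated ratio of NoDef to Defnd, with 100% reported when nothing is defined. The sketch below parses one row and recomputes the percentage and the range width. The formula is inferred from the rows rather than taken from unicheck.py, and the names parse_row and bad_percent are hypothetical.

```python
def bad_percent(nodef, defined):
    """Truncated percentage of unknown entries; 100 when nothing is defined."""
    if defined == 0:
        return 100
    return 100 * nodef // defined


def parse_row(line):
    """Split one table row into typed fields (description may contain spaces)."""
    nodef, defined, width, pct, code_range, desc = line.split(None, 5)
    start, end = (int(h, 16) for h in code_range.split("-"))
    return {
        "nodef": int(nodef),
        "defined": int(defined),
        "width": int(width),
        "bad_pct": int(pct.rstrip("%")),
        "start": start,
        "end": end,
        "description": desc,
    }


row = parse_row("30 208 208 14% 0180-024f Latin Extended-B")
# Both checks hold for this row of the report.
assert bad_percent(row["nodef"], row["defined"]) == row["bad_pct"]
assert row["end"] - row["start"] + 1 == row["width"]
```

Because the ratio is taken against Defnd rather than Width, a range whose table files are mostly absent can still show a percentage based only on the few entries that exist.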
UNKNOWN TOTAL: 4340
RANGE FOUND: 4340
RANGE UNKNOWN: 0
RANGE DESCRIPTION NOT FOUND FOR FOLLOWING CODE: []
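One plausible reading of the summary lines is that every unknown code point is looked up in a list of named ranges: RANGE FOUND counts the hits, RANGE UNKNOWN counts the misses, and the final list names the code points that had no range description. The sketch below illustrates that reading only. The three-entry BLOCKS list and the names find_block and tally are assumptions for illustration, not part of unicheck.py.

```python
from bisect import bisect_right

# Tiny excerpt of a range list, sorted by start; a real list would cover every range.
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x0080, 0x00FF, "Latin-1 Supplement"),
    (0x0100, 0x017F, "Latin Extended-A"),
]


def find_block(cp, blocks=BLOCKS):
    """Return the (start, end, name) range containing cp, or None."""
    starts = [b[0] for b in blocks]
    i = bisect_right(starts, cp) - 1
    if i >= 0 and blocks[i][0] <= cp <= blocks[i][1]:
        return blocks[i]
    return None


def tally(unknown_cps, blocks=BLOCKS):
    """Count unknown code points that do and do not fall in a named range."""
    missing = [cp for cp in unknown_cps if find_block(cp, blocks) is None]
    return {
        "unknown_total": len(unknown_cps),
        "range_found": len(unknown_cps) - len(missing),
        "range_unknown": len(missing),
        "no_description": missing,
    }
```

Under this reading, the totals reported above (found equal to the unknown total, none unmatched, and an empty list) mean every unknown code point fell inside a described range.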
MISSING FILES: 76 ['x08.py', 'x19.py', 'x1a.py', 'x1b.py', 'x1c.py',
'x1d.py', 'x29.py', 'x2a.py', 'x2b.py', 'x2c.py', 'x2d.py', 'x34.py',
'x35.py', 'x36.py', 'x37.py', 'x38.py', 'x39.py', 'x3a.py', 'x3b.py',
'x3c.py', 'x3d.py', 'x3e.py', 'x3f.py', 'x40.py', 'x41.py', 'x42.py',
'x43.py', 'x44.py', 'x45.py', 'x46.py', 'x47.py', 'x48.py', 'x49.py',
'x4a.py', 'x4b.py', 'x4c.py', 'xa5.py', 'xa6.py', 'xa7.py', 'xa8.py',
'xa9.py', 'xaa.py', 'xab.py', 'xd8.py', 'xd9.py', 'xda.py', 'xdb.py',
'xdc.py', 'xdd.py', 'xde.py', 'xdf.py', 'xe0.py', 'xe1.py', 'xe2.py',
'xe3.py', 'xe4.py', 'xe5.py', 'xe6.py', 'xe7.py', 'xe8.py', 'xe9.py',
'xea.py', 'xeb.py', 'xec.py', 'xed.py', 'xee.py', 'xef.py', 'xf0.py',
'xf1.py', 'xf2.py', 'xf3.py', 'xf4.py', 'xf5.py', 'xf6.py', 'xf7.py',
'xf8.py']
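The file names in this list follow the pattern "x" plus the high byte of the code point in two hex digits, so each file covers 256 code points. The sketch below maps a code point or a range to the files it would need, which helps relate a missing file to the rows that show 0 defined entries. The function names are hypothetical, and the two-digit pattern is assumed to apply only to code points up to ffff, as in this report.

```python
def table_file(cp):
    """File name covering a BMP code point, e.g. 0x0800 -> 'x08.py'."""
    return "x%02x.py" % (cp >> 8)


def files_for_range(start, end):
    """All file names needed to cover code points start..end inclusive."""
    return ["x%02x.py" % page for page in range(start >> 8, (end >> 8) + 1)]


print(table_file(0x0800))               # x08.py
print(files_for_range(0x1900, 0x194F))  # ['x19.py']
```

For example, the Samaritan range 0800-083f depends on x08.py alone, which appears in the missing list, and that matches its row showing no defined entries.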
===========================end of line
