OCR Languages Supported by CaseGuard May 23, 2024 | 1 minute read Print page Connect on LinkedIn Follow us on X CaseGuard’s Optical Character Recognition (OCR) supports over 100 languages: Afrikaans Amharic Arabic Assamese Azerbaijani Azerbaijani – Cyrilic Belarusian Bengali Tibetan Bosnian Breton Bulgarian Catalan; Valencian Cebuano Czech Chinese – Simplified Chinese – Traditional Cherokee Corsican Welsh Danish German German (Fraktur Latin) Dzongkha Greek, Modern (1453-) English English, Middle (1100-1500) Esperanto Math / equation detection module Estonian Basque Faroese Persian Filipino (old – Tagalog) Finnish French German – Fraktur (now deu_latf) French, Middle (ca.1400-1600) Western Frisian Scottish Gaelic Irish Galician Greek, Ancient (to 1453) (contrib) Gujarati Haitian; Haitian Creole Hebrew Hindi Croatian Hungarian Armenian Inuktitut Indonesian Icelandic Italian Italian – Old Javanese Japanese Kannada Georgian Georgian – Old Kazakh Central Khmer Kirghiz; Kyrgyz Kurmanji (Kurdish – Latin Script) Korean Korean (vertical) Lao Latin Latvian Lithuanian Luxembourgish Malayalam Marathi Macedonian Maltese Mongolian Maori Malay Burmese Nepali Dutch; Flemish Norwegian Occitan (post 1500) Oriya Orientation and script detection module Panjabi; Punjabi Polish Portuguese Pushto; Pashto Quechua Romanian; Moldavian; Moldovan Russian Sanskrit Sinhala; Sinhalese Slovak Slovenian Sindhi Spanish; Castilian Spanish; Castilian – Old Albanian Serbian Serbian – Latin Sundanese Swahili Swedish Syriac Tamil Tatar Telugu Tajik Thai Tigrinya Tonga Turkish Uighur; Uyghur Ukrainian Urdu Uzbek Uzbek – Cyrilic Vietnamese Yiddish Yoruba