Tesseract OCR 识别语言编码 简体中文chi_sim
Teseeract ORC 是一款开源的ORC识别库。备注下识别语言编码:简体中文是chi_sim。Tesseract uses 3-character ISO 639-2 language codes。如下从其gitHub摘抄的:地址:https://github.com/tesseract-ocr/tesseract/blob/a75ab450a8cc9a2b69cf05f5c4f7a39