Optical Character Recognition

Introduction
This project is to improved OCR for Indian languages for existing open source OCR (optical character recognition) system. Initially we will concentrate on Gujarati language OCR with tesserect OCR system.

Current Goals

 * 1) Gujarati OCR has very bad accuracy in tesserect. Investigate which modules or parts are introducing errors.
 * 2) When image quality if slightly bad  (E.g. photo of book taken by cellphone) but human can very easily read it, tesserect accuracy is very low (even for English). Investigate why.

Team

 * 1) Gautam Navapara: Surat
 * 2) Ishan Kalavadiya: Rajkot
 * 3) Mitesh Kalal: Ahmedabad

Knowledge Base
How to

Resources

Activity
Task Updates

TestBench

Logbook

Questions