Publication Date

Spring 2014

Degree Type

Master's Project

Degree Name

Master of Science (MS)

Department

Computer Science

Abstract

This project investigates the principles of optical character recognition used in the Tesseract OCR engine and techniques to improve its efficiency and runtime. Optical character recognition (OCR) method has been used in converting printed text into editable text in various applications over a variety of devices such as Scanners, computers, tablets etc. But now Mobile is taking over the computer in all the domains but OCR still remains one not so conquered field. So programmers need to improve the efficiency of the OCR system to make it run properly on Mobile devices. This paper focuses on improving the Tesseract OCR efficiency for Hindi language to run on Mobile devices as there a not many applications for the same and most of them are either not open source or not for mobile devices. Improving Hindi text extraction will increase Tesseract's performance for Mobile phone apps and in turn will draw developers to contribute towards Hindi OCR . This paper presents a preprocessing technique being applied to the Tesseract Engine to improve the recognition of the characters keeping the runtime low. Hence the system runs smoothly and efficiently on mobile devices(Android) as it does on the bigger machines.

Recommended Citation

Badla, Sahil, "IMPROVING THE EFFICIENCY OF TESSERACT OCR ENGINE" (2014). Master's Projects. 420.
DOI: https://doi.org/10.31979/etd.5avd-kf2g
https://scholarworks.sjsu.edu/etd_projects/420

Download

Included in

Computer Sciences Commons

COinS

DOI

https://doi.org/10.31979/etd.5avd-kf2g

Master's Projects

IMPROVING THE EFFICIENCY OF TESSERACT OCR ENGINE

Publication Date

Degree Type

Degree Name

Department

Abstract

Recommended Citation

Included in

DOI

Search

Browse All

Links

Master's Projects

IMPROVING THE EFFICIENCY OF TESSERACT OCR ENGINE

Author

Publication Date

Degree Type

Degree Name

Department

Abstract

Recommended Citation

Included in

Share

DOI

Search

Browse All

Links