Implementation of Tesseract Algorithm to Extract Text from Different Images
5 Pages Posted: 1 May 2020
Date Written: May 1, 2020
Abstract
Image processing is one of the most growing fields in research and technology in today’s world. There is a high demand of a computer system that can store the information available in newspapers and other hard copy paper documents. One of the most simplest ways to store the information of text into computer systems in by scanning the paper. It can then be stored in the computer and changes can be made on it if required. But, detection of text from the captured image is a very challenging task. Thus, an attempt has been made using the Tesseract algorithm that makes it easier to extract text from images.
Keywords: Tesseract, Text Recognition, Optical Character Recognition, Flatbed scanner, Guilloche pattern, Leptonica
Suggested Citation: Suggested Citation