OCR – Optical Character Recognition


Optical Character Recognition, usually abbreviated as OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten, or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize office record-keeping systems, or to publish text on a website. OCR makes it possible to edit the text, search it for a word or phrase, store it more compactly, display or print a copy free of scanning artifacts, and apply techniques such as machine translation, text-to-speech, and text mining to it. OCR is a field of research in pattern recognition, artificial intelligence, and computer vision.

In other words, OCR takes an image of typewritten text, that is, a picture of characters, and converts it into a standard character encoding such as ASCII, so that the result is stored as text rather than as pixels.
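To make the idea of turning a picture of characters into encoded text concrete, here is a minimal toy sketch in Python. It is not how production OCR engines work; it only illustrates the principle by matching small bitmaps against stored templates. The 3x5 glyph templates, the three-letter alphabet, and the names `TEMPLATES`, `hamming`, `recognize_glyph`, and `recognize` are all assumptions invented for this illustration.

```python
# Hypothetical 3x5 bitmap templates for a tiny alphabet ('#' = ink, '.' = blank).
TEMPLATES = {
    "O": ("###", "#.#", "#.#", "#.#", "###"),
    "C": ("###", "#..", "#..", "#..", "###"),
    "R": ("##.", "#.#", "##.", "#.#", "#.#"),
}


def hamming(a, b):
    """Count the pixels that differ between two equally sized glyph bitmaps."""
    return sum(
        pixel_a != pixel_b
        for row_a, row_b in zip(a, b)
        for pixel_a, pixel_b in zip(row_a, row_b)
    )


def recognize_glyph(glyph):
    """Return the template character with the fewest differing pixels."""
    return min(TEMPLATES, key=lambda ch: hamming(TEMPLATES[ch], glyph))


def recognize(glyphs):
    """Recognize a sequence of already-segmented glyph bitmaps as a string."""
    return "".join(recognize_glyph(g) for g in glyphs)


# A "scanned" word, already split into glyphs. The second glyph has one
# missing pixel in its last row to imitate a scanning artifact.
scanned = [
    ("###", "#.#", "#.#", "#.#", "###"),
    ("###", "#..", "#..", "#..", "##."),
    ("##.", "#.#", "##.", "#.#", "#.#"),
]

text = recognize(scanned)
codes = list(text.encode("ascii"))  # the machine-encoded form of the text
print(text, codes)  # OCR [79, 67, 82]
```

The noisy second glyph is still read as "C" because it differs from that template by a single pixel and from the others by more. Real systems must additionally locate and segment the characters, cope with varying fonts and handwriting, and typically rely on statistical or learned models rather than fixed templates.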

