Optical Character recognition(OCR)
-
i want to make an OCR application & extract text from an image. i am trying the following approach. 1) i can convert an image to PDF. 2) i can extract text from PDF. So, theoretically, perhaps OCR is possible this way- to convert image to PDF & then convert the PDF into text & do whatever search operations you have to. http://www.altimate.ca/flash/ocrdsktp.htmshows the above approach. However, i can extract text from any normal pdf, it is not happening with a pdf that has been generated from an image.What things may i be missing? Kindly Guide. Ankush Mehta
-
i want to make an OCR application & extract text from an image. i am trying the following approach. 1) i can convert an image to PDF. 2) i can extract text from PDF. So, theoretically, perhaps OCR is possible this way- to convert image to PDF & then convert the PDF into text & do whatever search operations you have to. http://www.altimate.ca/flash/ocrdsktp.htmshows the above approach. However, i can extract text from any normal pdf, it is not happening with a pdf that has been generated from an image.What things may i be missing? Kindly Guide. Ankush Mehta
Simply because your image will be recognized as an image within the pdf file. So you cannot extract text from an image. From the link you gave us, they probably do character recognition inside their program.