Code Project
Tiff Image -> Searchable PDF [modified]

  • abcxyz82 wrote (post #1):

    We have several scanned documents (TIFF) on CDs, around 10 GB of data, and we are looking for a way to turn them into searchable PDFs for easy access. We are using PDF creator to convert TIFF to PDF, and we have built a custom module on top of an OCR engine that saves the text page by page as XML. How can I create a searchable PDF from the image PDF and the OCRed text in XML? We are not looking for high-end server-side solutions; it would be nice if we could develop our own app or use open-source software. Regards, MaulikCE -- modified at 10:13 Thursday 25th May, 2006

    • Sumit_Ghosh wrote (post #2):

      Hi Maulik, You can use an OCR engine such as Tesseract from Google, or a paid one, to convert the TIFFs to text. Store the text in a Lucene index using Lucene.Net, and then create a search interface for that index. Let me know if you need paid consultation for this solution; my company can provide it. We are experts in enterprise search. Thanks, Sumit, Globussoft
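
The index-and-search idea in the reply can be sketched roughly as below. This is not Lucene.Net and not C#; it is a minimal Python illustration of the inverted index that such a library maintains, using only the standard library. The XML layout (`<document source=...>` containing `<page number=...>` elements) is a made-up assumption, since the thread does not show the schema of the custom OCR module, and `build_index` and `search` are hypothetical helper names. Note that this gives full-text search over the OCR output; it does not embed a text layer in the PDF itself.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical OCR output: one <page> element per scanned page.
# The real schema of the custom OCR module is not given in the thread.
SAMPLE_XML = """
<document source="invoice_0001.tif">
  <page number="1">Invoice for scanned archive services</page>
  <page number="2">Payment terms and archive retention policy</page>
</document>
"""


def build_index(xml_text):
    """Map each lowercased word to the set of (source, page) pairs containing it."""
    root = ET.fromstring(xml_text)
    source = root.get("source")
    index = defaultdict(set)
    for page in root.iter("page"):
        page_no = int(page.get("number"))
        for word in re.findall(r"\w+", (page.text or "").lower()):
            index[word].add((source, page_no))
    return index


def search(index, query):
    """Return the sorted (source, page) pairs that contain every query word."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return []
    # Intersect the posting sets so that only pages matching all words remain.
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)


index = build_index(SAMPLE_XML)
print(search(index, "archive"))        # pages containing "archive"
print(search(index, "payment terms"))  # pages containing both words
```

Each hit identifies a source file and page number, so a search interface could open the matching PDF at that page. A real deployment would likely swap the dictionary for a Lucene.Net index, which adds on-disk storage, ranking, and analyzers.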
