Removing tables froma pdf file using VB application
-
HI all I am working on a application where i am taking PDF file as source and writing it into a text file. My problem is i don't want the tables from the pdf file, I just want the paragraphs. My final output in text file should only contain paragraphs ( i just want to escape the table part). Can any one help me out, I tried in Google searching for any solution but all provides way to extract the data. I want some kind of filter mechanism in my VB application. It would be great if any one can provide the code. PL help Regards
Prakash Mishra(Banglore,India)
-
HI all I am working on a application where i am taking PDF file as source and writing it into a text file. My problem is i don't want the tables from the pdf file, I just want the paragraphs. My final output in text file should only contain paragraphs ( i just want to escape the table part). Can any one help me out, I tried in Google searching for any solution but all provides way to extract the data. I want some kind of filter mechanism in my VB application. It would be great if any one can provide the code. PL help Regards
Prakash Mishra(Banglore,India)
I'd imagine this kind of functionality would be exposed by the PDF library you're using?? I don't use any of them, so I really couldn't tell you. But I'd probably start by researching how the PDF document is exposed by the library and search for anything that might describe the document in sections or if it exposes any kind of Select and/or Copy methods that allow you to specify text only.
A guide to posting questions on CodeProject[^]
Dave Kreskowiak Microsoft MVP Visual Developer - Visual Basic
2006, 2007