What's the best PDF to HTML program?

    Sean Ewington
    wrote on last edited by
    #1

    I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

    Thanks, Sean Ewington The Code Project

      Shog9 0
      wrote on last edited by
      #2

      ...Please define "Best". ;) (I just use PDF Download[^] - it's easy, and my needs are simple)

        Sean Ewington
        wrote on last edited by
        #3

        Fair enough. So say I have this in PDF:

        32 [finishedSquiggles removeAllObjects]; // clear the array of squiggles
        33 [self setNeedsDisplay]; // refresh the display
        34 } // end method resetView
        35
        36 // draw the view
        37 - (void)drawRect:(CGRect)rect

        When I make the conversion, I want it to come out looking as close to the above as possible. And not like this:

        <TABLE align="center" border=0 cellspacing=0 cellpadding=2>
        <TR>
        <TH align="left" width="25" valign="top" height="12">88</TH>
        <TD align="left" width="448" valign="top" height="12">// called whenever the user places a finger on the screen </TD>
        </TR>
        <TR>
        <TH align="left" width="25" valign="top" height="14">89</TH>
        <TD align="left" width="448" valign="top" height="14">- (void)touchesBegan:(NSSet *)touches withEvent:(UIEvent *)event </TD>
        </TR>

        Which is how Adobe Online does it, and "the best" I've seen at doing this so far. Sadly :((

        Thanks, Sean Ewington The Code Project
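
For table output of the kind quoted above, one possible cleanup step is to flatten each row back into a single line of preformatted text. The following is only a minimal sketch using Python's standard `html.parser`; the names `RowCollector`, `table_to_pre`, and `SAMPLE` are made up for illustration, and it assumes the converter always emits one `<TR>` per source line with the line number in a `<TH>` and the code in a `<TD>`. Leading indentation is not recovered, since the table cells do not carry it.

```python
from html import escape
from html.parser import HTMLParser


class RowCollector(HTMLParser):
    """Collect the text of each table row as a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell texts
        self._row = None   # cells of the row currently being parsed
        self._cell = None  # text chunks of the cell currently being parsed

    def handle_starttag(self, tag, attrs):
        # HTMLParser lower-cases tag names, so <TR> arrives as "tr".
        if tag == "tr":
            self._row = []
        elif tag in ("th", "td") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def table_to_pre(html_fragment):
    """Turn one-row-per-line table markup into a single <pre> block."""
    parser = RowCollector()
    parser.feed(html_fragment)
    parser.close()
    lines = [" ".join(cells).rstrip() for cells in parser.rows]
    # Escape &, < and > so the source code survives inside <pre>.
    return "<pre>\n" + escape("\n".join(lines), quote=False) + "\n</pre>"


# Hypothetical input modelled on the fragment quoted in the post.
SAMPLE = (
    '<TABLE align="center" border=0 cellspacing=0 cellpadding=2>'
    '<TR><TH align="left" width="25" valign="top" height="12">88</TH>'
    '<TD align="left" width="448" valign="top" height="12">'
    '// called whenever the user places a finger on the screen </TD></TR>'
    '<TR><TH align="left" width="25" valign="top" height="14">89</TH>'
    '<TD align="left" width="448" valign="top" height="14">'
    '- (void)touchesBegan:(NSSet *)touches withEvent:(UIEvent *)event </TD></TR>'
)

if __name__ == "__main__":
    print(table_to_pre(SAMPLE))
```

Run on the sample, this gives a `<pre>` block with one line per table row, line number first, which is closer to the listing as it appears in the PDF.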

          peterchen
          wrote on last edited by
          #4

          Free, crap-free (as in, doesn't want to install G|A|Y toolbar or put eBay links in random places), no hassle to set up or use, works on every PDF you come across, results look good in all mainstream browsers, fast enough, ideally can be automated. Now *that* was simple.

          Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
          | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

            peterchen
            wrote on last edited by
            #5

            Does it have to be .pdf? You can also save a whole page with all data to a single .MHT file that can be displayed in IE.

              Sean Ewington
              wrote on last edited by
              #6

              Yes sir, it has to be .pdf

              Thanks, Sean Ewington The Code Project

                Rama Krishna Vavilala
                wrote on last edited by
                #7

                Are you doing an iPhone application for CP?

                  Shog9 0
                  wrote on last edited by
                  #8

                  Excellent! With that, we can rule out PDF Download for your needs: Before[^] After[^]

                    Hans Dietrich
                    wrote on last edited by
                    #9

                    Have you tried G|A|Y toolbar v2.0? It's much better than 1.0.

                    Best wishes, Hans


                    [Hans Dietrich Software]

                      Sean Ewington
                      wrote on last edited by
                      #10

                      That'd be cool, but not what I'm doing. I'm prepping a book chapter on iPhone development. But what kind of Code Project iPhone application would you want to see?

                      Thanks, Sean Ewington The Code Project

                        Sean Ewington
                        wrote on last edited by
                        #11

                        X|

                        Thanks, Sean Ewington The Code Project

                          Shog9 0
                          wrote on last edited by
                          #12

                          Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

                            Sean Ewington
                            wrote on last edited by
                            #13

                            Starting out with HTML would be ideal, but in this case I only have PDF :( Thanks for trying though! I'm starting to think PDF to HTML is just a bad idea

                            Thanks, Sean Ewington The Code Project

                              Rama Krishna Vavilala
                              wrote on last edited by
                              #14

                              Shog9 wrote:

                              an odd little "viewer" built around images

                              That's one of the most brilliant JavaScript + Web hacks I have seen. A lot of effort went into making it look and behave like a PDF.

                                Shog9 0
                                wrote on last edited by
                                #15

                                Heh... I was tickled when I saw that I could actually select text in what appeared to be a static image. :-)

                                  Shog9 0
                                  wrote on last edited by
                                  #16

                                  Sean Ewington wrote:

                                  I'm starting to think PDF to HTML is just a bad idea

                                  Well, if you think about it, it's pretty low-level; even basic HTML has a lot more semantic / meta information encoded in it, and you just can't reliably recreate that from what ends up in a PDF. It'd be like trying to decompile a binary executable program without any information on which compiler originally created it...
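
To make the "low-level" point concrete: a PDF page is, roughly speaking, a set of text fragments placed at coordinates, with nothing saying "this is a code listing" or "this number is a line number". The sketch below is an illustration only, not how any of the tools mentioned in this thread work. The function name `fragments_to_lines`, the `y_tolerance` parameter, and the coordinates are all invented; it shows the kind of guessing a converter has to do just to rebuild lines of text.

```python
def fragments_to_lines(fragments, y_tolerance=2.0):
    """Group (x, y, text) fragments into lines of text.

    Assumes PDF-style coordinates, where y grows upward, so a larger y
    means higher on the page. Fragments whose y lies within y_tolerance
    of the first fragment of a line are guessed to share that line.
    """
    lines = []  # list of (anchor_y, [(x, text), ...]), top to bottom
    for x, y, text in sorted(fragments, key=lambda f: (-f[1], f[0])):
        if lines and abs(lines[-1][0] - y) <= y_tolerance:
            lines[-1][1].append((x, text))
        else:
            lines.append((y, [(x, text)]))
    # Within a line, read the fragments left to right.
    return [" ".join(text for _, text in sorted(parts)) for _, parts in lines]


# Hypothetical fragments in arbitrary order, as a content stream might hold them.
FRAGMENTS = [
    (72.0, 700.0, "88"),
    (100.0, 700.5, "// called whenever the user places a finger on the screen"),
    (100.0, 686.0, "- (void)touchesBegan:(NSSet *)touches"),
    (72.0, 686.0, "89"),
    (330.0, 686.2, "withEvent:(UIEvent *)event"),
]

if __name__ == "__main__":
    for line in fragments_to_lines(FRAGMENTS):
        print(line)
```

Even in this toy case the result hinges on a tolerance picked by hand, and whether "88" is a line number, a table cell, or body text is still undecided. That is the information HTML would have carried explicitly.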

                                    Rage
                                    wrote on last edited by
                                    #17

                                    peterchen wrote:

                                    G|A|Y toolbar

                                    :laugh: :laugh:

                                    ~RaGE();

                                    I think words like 'destiny' are a way of trying to find order where none exists. - Christian Graus Do not feed the troll ! - Common proverb

                                      CurtainDog
                                      wrote on last edited by
                                      #18

                                      I've heard OpenOffice produces non-insane HTML from docs... maybe it could handle PDFs as well?

                                        Alan Burkhart
                                        wrote on last edited by
                                        #19

                                        Open Office to my knowledge doesn't import pdf. It'll export as pdf, but it doesn't import. That's as of version 3.1.

                                          Terence Wallace
                                          wrote on last edited by
                                          #20

                                          Have you tried Many2PDFX.dll from Gnostice? They offer .NET, VCL, and Java components for PDF. http://www.gnostice.com/[^]

                                          "If you think it's expensive to hire a professional to do the job, wait until you hire an amateur." - Red Adair
