Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. What's the best PDF to HTML program?

What's the best PDF to HTML program?

Scheduled Pinned Locked Moved The Lounge
htmladobequestionannouncement
23 Posts 12 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • S Shog9 0

    ...Please define "Best". ;) (I just use PDF Download[^] - it's easy, and my needs are simple)

    P Offline
    P Offline
    peterchen
    wrote on last edited by
    #4

    Free, crap-free (as in, doesn't want to install G|A|Y toolbar or put e-bay links to random places), no hassle to set up or use, works on all PDF you come across, results look good in all mainstream browsers, fast enough, ideally can be automated. Now *that* was simple.

    Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
    | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

    H R 2 Replies Last reply
    0
    • S Sean Ewington

      I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

      Thanks, Sean Ewington The Code Project

      P Offline
      P Offline
      peterchen
      wrote on last edited by
      #5

      Does it have to be .pdf? You can also save a whole page with all data to a single .MHT file that can be displayed in IE.

      Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
      | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

      S 1 Reply Last reply
      0
      • P peterchen

        Does it have to be .pdf? You can also save a whole page with all data to a single .MHT file that can be displayed in IE.

        Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
        | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

        S Offline
        S Offline
        Sean Ewington
        wrote on last edited by
        #6

        Yes sir, it has to be .pdf

        Thanks, Sean Ewington The Code Project

        1 Reply Last reply
        0
        • S Sean Ewington

          Fair enough. So say I have this in PDF:

          32 [finishedSquiggles removeAllObjects]; // clear the array of squiggles
          33 [self setNeedsDisplay]; // refresh the display
          34 } // end method resetView
          35
          36 // draw the view
          37 - (void)drawRect:(CGRect)rect

          When I make the conversion, I want it to come out looking as close to the above as possible. And not like this:

          <
          TABLE
          align="center" border=0 cellspacing=0 cellpadding=2

          <
          TR

          <
          TH
          align="left" width="25" valign="top" height="12"

          88<
          /TH

          <
          TD
          align="left" width="448" valign="top" height="12"
          // called whenever the user places a finger on the screen <
          /TD
          <
          /TR
          <
          TR

          <
          TH
          align="left" width="25" valign="top" height="14"

          89<
          /TH

          <
          TD
          align="left" width="448" valign="top" height="14"

          • (void)touchesBegan:(NSSet *)touches withEvent:(UIEvent *)event <
            /TD
            <
            /TR

          Which is how Adobe Online does it, and "the best" I've seen at doing this so far. Sadly :((

          Thanks, Sean Ewington The Code Project

          R Offline
          R Offline
          Rama Krishna Vavilala
          wrote on last edited by
          #7

          Are you doing an iPhone application for CP?

          S 1 Reply Last reply
          0
          • S Sean Ewington

            Fair enough. So say I have this in PDF:

            32 [finishedSquiggles removeAllObjects]; // clear the array of squiggles
            33 [self setNeedsDisplay]; // refresh the display
            34 } // end method resetView
            35
            36 // draw the view
            37 - (void)drawRect:(CGRect)rect

            When I make the conversion, I want it to come out looking as close to the above as possible. And not like this:

            <
            TABLE
            align="center" border=0 cellspacing=0 cellpadding=2

            <
            TR

            <
            TH
            align="left" width="25" valign="top" height="12"

            88<
            /TH

            <
            TD
            align="left" width="448" valign="top" height="12"
            // called whenever the user places a finger on the screen <
            /TD
            <
            /TR
            <
            TR

            <
            TH
            align="left" width="25" valign="top" height="14"

            89<
            /TH

            <
            TD
            align="left" width="448" valign="top" height="14"

            • (void)touchesBegan:(NSSet *)touches withEvent:(UIEvent *)event <
              /TD
              <
              /TR

            Which is how Adobe Online does it, and "the best" I've seen at doing this so far. Sadly :((

            Thanks, Sean Ewington The Code Project

            S Offline
            S Offline
            Shog9 0
            wrote on last edited by
            #8

            Excellent! With that, we can rule out PDF Download for your needs: Before[^] After[^]

            S 1 Reply Last reply
            0
            • P peterchen

              Free, crap-free (as in, doesn't want to install G|A|Y toolbar or put e-bay links to random places), no hassle to set up or use, works on all PDF you come across, results look good in all mainstream browsers, fast enough, ideally can be automated. Now *that* was simple.

              Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
              | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

              H Offline
              H Offline
              Hans Dietrich
              wrote on last edited by
              #9

              Have you tried G|A|Y toolbar v2.0? It's much better than 1.0.

              Best wishes, Hans


              [Hans Dietrich Software]

              1 Reply Last reply
              0
              • R Rama Krishna Vavilala

                Are you doing an iPhone application for CP?

                S Offline
                S Offline
                Sean Ewington
                wrote on last edited by
                #10

                That'd be cool, but not what I'm doing. I'm prepping a book chapter on iPhone development. But what kind of Code Project iPhone application would you want to see?

                Thanks, Sean Ewington The Code Project

                1 Reply Last reply
                0
                • S Shog9 0

                  Excellent! With that, we can rule out PDF Download for your needs: Before[^] After[^]

                  S Offline
                  S Offline
                  Sean Ewington
                  wrote on last edited by
                  #11

                  X|

                  Thanks, Sean Ewington The Code Project

                  S 1 Reply Last reply
                  0
                  • S Sean Ewington

                    X|

                    Thanks, Sean Ewington The Code Project

                    S Offline
                    S Offline
                    Shog9 0
                    wrote on last edited by
                    #12

                    Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

                    S R 2 Replies Last reply
                    0
                    • S Shog9 0

                      Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

                      S Offline
                      S Offline
                      Sean Ewington
                      wrote on last edited by
                      #13

                      Starting out with HTML would be ideal, but in this case I only have PDF :( Thanks for trying though! I'm starting to think PDF to HTML is just a bad idea

                      Thanks, Sean Ewington The Code Project

                      S 1 Reply Last reply
                      0
                      • S Shog9 0

                        Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

                        R Offline
                        R Offline
                        Rama Krishna Vavilala
                        wrote on last edited by
                        #14

                        Shog9 wrote:

                        an odd little "viewer" built around images

                        That's one of the brilliant JavaScript + Web hacks I have seen. Lot of efforts to make it look like behave like PDF.

                        S 1 Reply Last reply
                        0
                        • R Rama Krishna Vavilala

                          Shog9 wrote:

                          an odd little "viewer" built around images

                          That's one of the brilliant JavaScript + Web hacks I have seen. Lot of efforts to make it look like behave like PDF.

                          S Offline
                          S Offline
                          Shog9 0
                          wrote on last edited by
                          #15

                          Heh... I was tickled when I saw that I could actually select text in what appeared to be a static image. :-)

                          1 Reply Last reply
                          0
                          • S Sean Ewington

                            Starting out with HTML would be ideal, but in this case I only have PDF :( Thanks for trying though! I'm starting to think PDF to HTML is just a bad idea

                            Thanks, Sean Ewington The Code Project

                            S Offline
                            S Offline
                            Shog9 0
                            wrote on last edited by
                            #16

                            Sean Ewington wrote:

                            I'm starting to think PDF to HTML is just a bad idea

                            Well, if you think about it, it's pretty low-level; even basic HTML has a lot more semantic / meta information encoded in it, and you just can't reliably recreate that from what ends up in a PDF. It'd be like trying to decompile a binary executable program without any information on which compiler originally created it...

                            1 Reply Last reply
                            0
                            • P peterchen

                              Free, crap-free (as in, doesn't want to install G|A|Y toolbar or put e-bay links to random places), no hassle to set up or use, works on all PDF you come across, results look good in all mainstream browsers, fast enough, ideally can be automated. Now *that* was simple.

                              Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
                              | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

                              R Offline
                              R Offline
                              Rage
                              wrote on last edited by
                              #17

                              peterchen wrote:

                              G|A|Y toolbar

                              :laugh: :laugh:

                              ~RaGE();

                              I think words like 'destiny' are a way of trying to find order where none exists. - Christian Graus Do not feed the troll ! - Common proverb

                              J 1 Reply Last reply
                              0
                              • S Sean Ewington

                                I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                                Thanks, Sean Ewington The Code Project

                                C Offline
                                C Offline
                                CurtainDog
                                wrote on last edited by
                                #18

                                I've heard open office produces non-insane html from docs... maybe it could handle pdfs as well?

                                A 1 Reply Last reply
                                0
                                • C CurtainDog

                                  I've heard open office produces non-insane html from docs... maybe it could handle pdfs as well?

                                  A Offline
                                  A Offline
                                  Alan Burkhart
                                  wrote on last edited by
                                  #19

                                  Open Office to my knowledge doesn't import pdf. It'll export as pdf, but it doesn't import. That's as of version 3.1.

                                  1 Reply Last reply
                                  0
                                  • S Sean Ewington

                                    I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                                    Thanks, Sean Ewington The Code Project

                                    T Offline
                                    T Offline
                                    Terence Wallace
                                    wrote on last edited by
                                    #20

                                    Have you tried Many2PDFX.dll from Gnostice? The offer a .NET, VCL, and Java components for PDF. http://www.gnostice.com/[^]

                                    "If you think it's expensive to hire a professional to do the job, wait until you hire an amateur." - Red Adair

                                    1 Reply Last reply
                                    0
                                    • S Sean Ewington

                                      I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                                      Thanks, Sean Ewington The Code Project

                                      M Offline
                                      M Offline
                                      Moa144
                                      wrote on last edited by
                                      #21

                                      Not to HTML but pdftoword.com has a fantastic online converter to .doc (also does .rtf) This may be a useful path for you. All the best

                                      1 Reply Last reply
                                      0
                                      • R Rage

                                        peterchen wrote:

                                        G|A|Y toolbar

                                        :laugh: :laugh:

                                        ~RaGE();

                                        I think words like 'destiny' are a way of trying to find order where none exists. - Christian Graus Do not feed the troll ! - Common proverb

                                        J Offline
                                        J Offline
                                        James Lonero
                                        wrote on last edited by
                                        #22

                                        G|A|Y toolbar? It must be one of those San Francisco Specials'

                                        1 Reply Last reply
                                        0
                                        • S Sean Ewington

                                          I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                                          Thanks, Sean Ewington The Code Project

                                          A Offline
                                          A Offline
                                          aldo hexosa
                                          wrote on last edited by
                                          #23

                                          try somepdf.com

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups