Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. What's the best PDF to HTML program?

What's the best PDF to HTML program?

Scheduled Pinned Locked Moved The Lounge
htmladobequestionannouncement
23 Posts 12 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • S Shog9 0

    Excellent! With that, we can rule out PDF Download for your needs: Before[^] After[^]

    S Offline
    S Offline
    Sean Ewington
    wrote on last edited by
    #11

    X|

    Thanks, Sean Ewington The Code Project

    S 1 Reply Last reply
    0
    • S Sean Ewington

      X|

      Thanks, Sean Ewington The Code Project

      S Offline
      S Offline
      Shog9 0
      wrote on last edited by
      #12

      Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

      S R 2 Replies Last reply
      0
      • S Shog9 0

        Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

        S Offline
        S Offline
        Sean Ewington
        wrote on last edited by
        #13

        Starting out with HTML would be ideal, but in this case I only have PDF :( Thanks for trying though! I'm starting to think PDF to HTML is just a bad idea

        Thanks, Sean Ewington The Code Project

        S 1 Reply Last reply
        0
        • S Shog9 0

          Yeah. ;-) FWIW, Google's even given up: they appear to have gone to using an odd little "viewer" built around images (similar to their book search) instead of offering "view as HTML" for PDF search results. Have you considered starting with HTML (or DocBook...), and just converting to PDF as-needed?

          R Offline
          R Offline
          Rama Krishna Vavilala
          wrote on last edited by
          #14

          Shog9 wrote:

          an odd little "viewer" built around images

          That's one of the brilliant JavaScript + Web hacks I have seen. Lot of efforts to make it look like behave like PDF.

          S 1 Reply Last reply
          0
          • R Rama Krishna Vavilala

            Shog9 wrote:

            an odd little "viewer" built around images

            That's one of the brilliant JavaScript + Web hacks I have seen. Lot of efforts to make it look like behave like PDF.

            S Offline
            S Offline
            Shog9 0
            wrote on last edited by
            #15

            Heh... I was tickled when I saw that I could actually select text in what appeared to be a static image. :-)

            1 Reply Last reply
            0
            • S Sean Ewington

              Starting out with HTML would be ideal, but in this case I only have PDF :( Thanks for trying though! I'm starting to think PDF to HTML is just a bad idea

              Thanks, Sean Ewington The Code Project

              S Offline
              S Offline
              Shog9 0
              wrote on last edited by
              #16

              Sean Ewington wrote:

              I'm starting to think PDF to HTML is just a bad idea

              Well, if you think about it, it's pretty low-level; even basic HTML has a lot more semantic / meta information encoded in it, and you just can't reliably recreate that from what ends up in a PDF. It'd be like trying to decompile a binary executable program without any information on which compiler originally created it...

              1 Reply Last reply
              0
              • P peterchen

                Free, crap-free (as in, doesn't want to install G|A|Y toolbar or put e-bay links to random places), no hassle to set up or use, works on all PDF you come across, results look good in all mainstream browsers, fast enough, ideally can be automated. Now *that* was simple.

                Personally, I love the idea that Raymond spends his nights posting bad regexs to mailing lists under the pseudonym of Jane Smith. He'd be like a super hero, only more nerdy and less useful. [Trevel]
                | FoldWithUs! | sighist | µLaunch - program launcher for server core and hyper-v server

                R Offline
                R Offline
                Rage
                wrote on last edited by
                #17

                peterchen wrote:

                G|A|Y toolbar

                :laugh: :laugh:

                ~RaGE();

                I think words like 'destiny' are a way of trying to find order where none exists. - Christian Graus Do not feed the troll ! - Common proverb

                J 1 Reply Last reply
                0
                • S Sean Ewington

                  I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                  Thanks, Sean Ewington The Code Project

                  C Offline
                  C Offline
                  CurtainDog
                  wrote on last edited by
                  #18

                  I've heard open office produces non-insane html from docs... maybe it could handle pdfs as well?

                  A 1 Reply Last reply
                  0
                  • C CurtainDog

                    I've heard open office produces non-insane html from docs... maybe it could handle pdfs as well?

                    A Offline
                    A Offline
                    Alan Burkhart
                    wrote on last edited by
                    #19

                    Open Office to my knowledge doesn't import pdf. It'll export as pdf, but it doesn't import. That's as of version 3.1.

                    1 Reply Last reply
                    0
                    • S Sean Ewington

                      I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                      Thanks, Sean Ewington The Code Project

                      T Offline
                      T Offline
                      Terence Wallace
                      wrote on last edited by
                      #20

                      Have you tried Many2PDFX.dll from Gnostice? The offer a .NET, VCL, and Java components for PDF. http://www.gnostice.com/[^]

                      "If you think it's expensive to hire a professional to do the job, wait until you hire an amateur." - Red Adair

                      1 Reply Last reply
                      0
                      • S Sean Ewington

                        I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                        Thanks, Sean Ewington The Code Project

                        M Offline
                        M Offline
                        Moa144
                        wrote on last edited by
                        #21

                        Not to HTML but pdftoword.com has a fantastic online converter to .doc (also does .rtf) This may be a useful path for you. All the best

                        1 Reply Last reply
                        0
                        • R Rage

                          peterchen wrote:

                          G|A|Y toolbar

                          :laugh: :laugh:

                          ~RaGE();

                          I think words like 'destiny' are a way of trying to find order where none exists. - Christian Graus Do not feed the troll ! - Common proverb

                          J Offline
                          J Offline
                          James Lonero
                          wrote on last edited by
                          #22

                          G|A|Y toolbar? It must be one of those San Francisco Specials'

                          1 Reply Last reply
                          0
                          • S Sean Ewington

                            I've tried some free ones, some online ones, Adobe Online (best so far), BCL's version, PDF Ripper, and PrimoPDF. There has to be something better. Is there?

                            Thanks, Sean Ewington The Code Project

                            A Offline
                            A Offline
                            aldo hexosa
                            wrote on last edited by
                            #23

                            try somepdf.com

                            1 Reply Last reply
                            0
                            Reply
                            • Reply as topic
                            Log in to reply
                            • Oldest to Newest
                            • Newest to Oldest
                            • Most Votes


                            • Login

                            • Don't have an account? Register

                            • Login or register to search.
                            • First post
                              Last post
                            0
                            • Categories
                            • Recent
                            • Tags
                            • Popular
                            • World
                            • Users
                            • Groups