Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. HTML de-bloater

HTML de-bloater

Scheduled Pinned Locked Moved The Lounge
questionhtmlcomannouncement
41 Posts 18 Posters 1 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • J J Dunlap

    basementman wrote: strip out all of the unnecessary spaces and tabs :omg: Just don't do that with the copy you're going to be working on!!! Also, IMHO with computer-generated HTML, it's all those extra <font> tags that really hog space. Reduce them to only the necessary ones, and you've cut the file's size to ¼ the size. I guess I'm a hardcore programmer - I never use HTML designers, and I always head straight for the SQL view in the MS Access query designer.

    "Blessed are the peacemakers, for they shall be called sons of God." - Jesus
    "You must be the change you wish to see in the world." - Mahatma Gandhi

    B Offline
    B Offline
    basementman
    wrote on last edited by
    #11

    We either hand code the HTML or generate the HTML based on hand-coded templates. You should be using CSS rather than font tags anyway. As far the the space stripper goes, we run it against code that is being migrated out to production servers. Also, it allows you to specify a source dir and output dir so that you don't overwrite your development copy.  onwards and upwards...

    J 1 Reply Last reply
    0
    • V Vikram A Punathambekar

      Short question: Does anybody know of a good HTML debloater that can strip useless HTML code? Long reason: I happen to be a Frontpage fan. I'm rewriting my site from scratch using FP, and the only gripe I have with it is that it produces incredibly bloated HTML code. It doesn't rub well with those on dialup, like me. On a related note: Following CP advice, I decided to try out Dreamweaver MX. I have 25 more days to go before the trial version expires and it can prove to me that it doesn't suck. So far, it's losing - badly. If I can't get my hands on a debloater, I'll have to do it by hand. X| TIA,
      Vikram. ----------------------------- My soon-to-be-updated site KI klike KDE kand kuse kit, kbut KI kmust kadmit, kstarting kall knames kwith K kis ksilly. KI khope kthey kwill kgive kup kthis kwhole kscheme ksoon kand kcome kup kwith kreal knames. pI vThink aHungarian nNotation vIs iA aWonderful nThing cAnd pEveryone avShould vUse pIt aAll dThe nTime, adNo nMatter pWhat dThe nContext, adEven adWhen vSpeaking.

      C Offline
      C Offline
      Codin Carlos
      wrote on last edited by
      #12

      Why do I use bloated Word generated HTML you ask?!?!?! I get most of these files form a computer-challenged person, whom only knows how to open MSWord, and email them to me. The person want to be able to update the docs every few months.... X| any suggestions?

      1 Reply Last reply
      0
      • B basementman

        We either hand code the HTML or generate the HTML based on hand-coded templates. You should be using CSS rather than font tags anyway. As far the the space stripper goes, we run it against code that is being migrated out to production servers. Also, it allows you to specify a source dir and output dir so that you don't overwrite your development copy.  onwards and upwards...

        J Offline
        J Offline
        J Dunlap
        wrote on last edited by
        #13

        basementman wrote: You should be using CSS rather than font tags anyway That's the other thing - many WYSIWYG editors don't even have CSS support, or if they do, they put all the styles on the page, rather than in a seperate file. This makes the user have to download the whole style def every time they go to a new page.

        "Blessed are the peacemakers, for they shall be called sons of God." - Jesus
        "You must be the change you wish to see in the world." - Mahatma Gandhi

        1 Reply Last reply
        0
        • J Jamie Hale

          Ya serious. I never could make those WYSIWYG editors work. And I've never ever ever seen a "generated" page that's even remotely readable. TextPad is the way to go. J

          "You can get anything you want at Alice's Restaurant."

          realJSOPR Offline
          realJSOPR Offline
          realJSOP
          wrote on last edited by
          #14

          I edit the outbound TCP/IP packets in realtime with a binary editor (that's why my web page lags a bit sometimes). ------- signature starts "...the staggering layers of obscenity in your statement make it a work of art on so many levels." - Jason Jystad, 10/26/2001 Please review the Legal Disclaimer in my bio. ------- signature ends

          V J 2 Replies Last reply
          0
          • realJSOPR realJSOP

            I do all my HTML "code" by hand. ------- signature starts "...the staggering layers of obscenity in your statement make it a work of art on so many levels." - Jason Jystad, 10/26/2001 Please review the Legal Disclaimer in my bio. ------- signature ends

            D Offline
            D Offline
            dandy72
            wrote on last edited by
            #15

            > I do all my HTML "code" by hand. I'm a Notepad fan myself. If there's any bloat, you only have yourself to blame. ;P

            W 1 Reply Last reply
            0
            • realJSOPR realJSOP

              I edit the outbound TCP/IP packets in realtime with a binary editor (that's why my web page lags a bit sometimes). ------- signature starts "...the staggering layers of obscenity in your statement make it a work of art on so many levels." - Jason Jystad, 10/26/2001 Please review the Legal Disclaimer in my bio. ------- signature ends

              V Offline
              V Offline
              Venet
              wrote on last edited by
              #16

              John Simmons / outlaw programmer wrote: I edit the outbound TCP/IP packets in realtime with a binary editor (that's why my web page lags a bit sometimes). :-D:-D:-D:-D You must have a super fast editor to calculate CRC. Regards, Venet. Donec eris felix, multos numerabis amicos.

              1 Reply Last reply
              0
              • V Vikram A Punathambekar

                Short question: Does anybody know of a good HTML debloater that can strip useless HTML code? Long reason: I happen to be a Frontpage fan. I'm rewriting my site from scratch using FP, and the only gripe I have with it is that it produces incredibly bloated HTML code. It doesn't rub well with those on dialup, like me. On a related note: Following CP advice, I decided to try out Dreamweaver MX. I have 25 more days to go before the trial version expires and it can prove to me that it doesn't suck. So far, it's losing - badly. If I can't get my hands on a debloater, I'll have to do it by hand. X| TIA,
                Vikram. ----------------------------- My soon-to-be-updated site KI klike KDE kand kuse kit, kbut KI kmust kadmit, kstarting kall knames kwith K kis ksilly. KI khope kthey kwill kgive kup kthis kwhole kscheme ksoon kand kcome kup kwith kreal knames. pI vThink aHungarian nNotation vIs iA aWonderful nThing cAnd pEveryone avShould vUse pIt aAll dThe nTime, adNo nMatter pWhat dThe nContext, adEven adWhen vSpeaking.

                V Offline
                V Offline
                Venet
                wrote on last edited by
                #17

                Textpad :) Regards, Venet. Donec eris felix, multos numerabis amicos.

                1 Reply Last reply
                0
                • P Paul Watson

                  You want a HTML optimiser? Something that looks at your code and goes "those tags are unnecesary, strip them out"? I don't think beyond the most basic level that exists or is possible. If it were do you think I would spend days doing HTML templates with CSS and what not. I would rather use FP, produce a TABLE layout and have HTML Optimiser go and re-write it all nicely for me. :) You do get tools to rip out all the tags Office puts into HTML. Not sure if you get the same thing for the crap that FP outputs, but I doubt it as it is not a simple case like Office HTML. Learn HTML and code by hand, only way you will produce clean code at this present point. Dreamweaver is getting there, but no cigar yet.

                  Paul Watson
                  Bluegrass
                  Cape Town, South Africa

                  Robert Edward Caldecott wrote: My father-in-law calls yer man bits "weasels"

                  C Offline
                  C Offline
                  Chris Maunder
                  wrote on last edited by
                  #18

                  Paul Watson wrote: Learn HTML and code by hand All good advice as long as you don't want to use the ASP.NET web controls. <dark looks in the direction of Redmond> cheers, Chris Maunder

                  D T 2 Replies Last reply
                  0
                  • realJSOPR realJSOP

                    I edit the outbound TCP/IP packets in realtime with a binary editor (that's why my web page lags a bit sometimes). ------- signature starts "...the staggering layers of obscenity in your statement make it a work of art on so many levels." - Jason Jystad, 10/26/2001 Please review the Legal Disclaimer in my bio. ------- signature ends

                    J Offline
                    J Offline
                    Jamie Hale
                    wrote on last edited by
                    #19

                    I find you can cut down on that lag by moving the de-bloatification code to firmware. I reprogrammed an old toaster of mine to do it for me. Keep up with the times, Johnny. :cool: J

                    "You can get anything you want at Alice's Restaurant."

                    L 1 Reply Last reply
                    0
                    • V Vikram A Punathambekar

                      Short question: Does anybody know of a good HTML debloater that can strip useless HTML code? Long reason: I happen to be a Frontpage fan. I'm rewriting my site from scratch using FP, and the only gripe I have with it is that it produces incredibly bloated HTML code. It doesn't rub well with those on dialup, like me. On a related note: Following CP advice, I decided to try out Dreamweaver MX. I have 25 more days to go before the trial version expires and it can prove to me that it doesn't suck. So far, it's losing - badly. If I can't get my hands on a debloater, I'll have to do it by hand. X| TIA,
                      Vikram. ----------------------------- My soon-to-be-updated site KI klike KDE kand kuse kit, kbut KI kmust kadmit, kstarting kall knames kwith K kis ksilly. KI khope kthey kwill kgive kup kthis kwhole kscheme ksoon kand kcome kup kwith kreal knames. pI vThink aHungarian nNotation vIs iA aWonderful nThing cAnd pEveryone avShould vUse pIt aAll dThe nTime, adNo nMatter pWhat dThe nContext, adEven adWhen vSpeaking.

                      J Offline
                      J Offline
                      Jack Puppy
                      wrote on last edited by
                      #20

                      I'm like alot of the others and use Notepad. I use the following tool to trim whitespace and such after my page is working properly: http://www.serpik.com/ahc/ Just make sure you keep master copies. My 20 favorite films:
                      http://www.ymdb.com/user_top20_view.asp?usersid=8912

                      C 1 Reply Last reply
                      0
                      • C Chris Maunder

                        Paul Watson wrote: Learn HTML and code by hand All good advice as long as you don't want to use the ASP.NET web controls. <dark looks in the direction of Redmond> cheers, Chris Maunder

                        D Offline
                        D Offline
                        David Stone
                        wrote on last edited by
                        #21

                        Chris Maunder wrote: All good advice as long as you don't want to use the ASP.NET web controls That's why ASP.NET support in Dreamweaver is almost a joke. Dreamweaver is supposed to output clean HTML code...which it does. But ASP.NET goes in and screws it all up. :( I personally hate the ASP.NET controls. But I use them anyway...


                        Hawaian shirts and shorts work too in Summer. People assume you're either a complete nut (in which case not a worthy target) or so damn good you don't need to worry about camouflage... -Anna-Jayne Metcalfe on Paintballing

                        1 Reply Last reply
                        0
                        • J Jack Puppy

                          I'm like alot of the others and use Notepad. I use the following tool to trim whitespace and such after my page is working properly: http://www.serpik.com/ahc/ Just make sure you keep master copies. My 20 favorite films:
                          http://www.ymdb.com/user_top20_view.asp?usersid=8912

                          C Offline
                          C Offline
                          Codin Carlos
                          wrote on last edited by
                          #22

                          OOH! The Manchurian Candidate! Good one! ;-)

                          J 1 Reply Last reply
                          0
                          • J J Dunlap

                            basementman wrote: strip out all of the unnecessary spaces and tabs :omg: Just don't do that with the copy you're going to be working on!!! Also, IMHO with computer-generated HTML, it's all those extra <font> tags that really hog space. Reduce them to only the necessary ones, and you've cut the file's size to ¼ the size. I guess I'm a hardcore programmer - I never use HTML designers, and I always head straight for the SQL view in the MS Access query designer.

                            "Blessed are the peacemakers, for they shall be called sons of God." - Jesus
                            "You must be the change you wish to see in the world." - Mahatma Gandhi

                            D Offline
                            D Offline
                            David Stone
                            wrote on last edited by
                            #23

                            jdunlap wrote: Also, IMHO with computer-generated HTML, it's all those extra <font> tags that really hog space. Reduce them to only the necessary ones, and you've cut the file's size to ¼ the size. :omg: You use font tags? In the name of God I command you to stop! Go learn CSS!


                            Hawaian shirts and shorts work too in Summer. People assume you're either a complete nut (in which case not a worthy target) or so damn good you don't need to worry about camouflage... -Anna-Jayne Metcalfe on Paintballing

                            J 1 Reply Last reply
                            0
                            • P Paul Watson

                              You want a HTML optimiser? Something that looks at your code and goes "those tags are unnecesary, strip them out"? I don't think beyond the most basic level that exists or is possible. If it were do you think I would spend days doing HTML templates with CSS and what not. I would rather use FP, produce a TABLE layout and have HTML Optimiser go and re-write it all nicely for me. :) You do get tools to rip out all the tags Office puts into HTML. Not sure if you get the same thing for the crap that FP outputs, but I doubt it as it is not a simple case like Office HTML. Learn HTML and code by hand, only way you will produce clean code at this present point. Dreamweaver is getting there, but no cigar yet.

                              Paul Watson
                              Bluegrass
                              Cape Town, South Africa

                              Robert Edward Caldecott wrote: My father-in-law calls yer man bits "weasels"

                              S Offline
                              S Offline
                              Steven Hicks n 1
                              wrote on last edited by
                              #24

                              Well if you could get a browser to render it and then write it out to a file, this way it would be "reinterpeting" the page, and the result would be optimized, becuase it wouldn't be putting a font tag after everyletter, unless the font kept changing. -Steven "the yellow dart" Hicks

                              CPA

                              CodeProjectAddict

                              Actual Linux Penguins were harmed in the creation of this message.

                              More tutorials: Ltpb.8m.com: Tutorials |404Browser.com (Download Link)

                              P 1 Reply Last reply
                              0
                              • S Steven Hicks n 1

                                Well if you could get a browser to render it and then write it out to a file, this way it would be "reinterpeting" the page, and the result would be optimized, becuase it wouldn't be putting a font tag after everyletter, unless the font kept changing. -Steven "the yellow dart" Hicks

                                CPA

                                CodeProjectAddict

                                Actual Linux Penguins were harmed in the creation of this message.

                                More tutorials: Ltpb.8m.com: Tutorials |404Browser.com (Download Link)

                                P Offline
                                P Offline
                                Paul Watson
                                wrote on last edited by
                                #25

                                I am not sure if I am following your logic Steven, could you explain a bit more please?

                                Paul Watson
                                Bluegrass
                                Cape Town, South Africa

                                Robert Edward Caldecott wrote: My father-in-law calls yer man bits "weasels"

                                S 1 Reply Last reply
                                0
                                • D David Stone

                                  jdunlap wrote: Also, IMHO with computer-generated HTML, it's all those extra <font> tags that really hog space. Reduce them to only the necessary ones, and you've cut the file's size to ¼ the size. :omg: You use font tags? In the name of God I command you to stop! Go learn CSS!


                                  Hawaian shirts and shorts work too in Summer. People assume you're either a complete nut (in which case not a worthy target) or so damn good you don't need to worry about camouflage... -Anna-Jayne Metcalfe on Paintballing

                                  J Offline
                                  J Offline
                                  J Dunlap
                                  wrote on last edited by
                                  #26

                                  I don't use font tags, but many WYSIWYG editors do. If I used <font> tags I'd have gone mad by now.* I use a good ol' external CSS file. *Imagine updating 1000+ <font> font tags by hand just because you wanted to change the font size. :omg: X|

                                  "Blessed are the peacemakers, for they shall be called sons of God." - Jesus
                                  "You must be the change you wish to see in the world." - Mahatma Gandhi

                                  1 Reply Last reply
                                  0
                                  • C Chris Maunder

                                    Paul Watson wrote: Learn HTML and code by hand All good advice as long as you don't want to use the ASP.NET web controls. <dark looks in the direction of Redmond> cheers, Chris Maunder

                                    T Offline
                                    T Offline
                                    Terry Denham
                                    wrote on last edited by
                                    #27

                                    Chris, What problem are you having with the APS.NET controls? I don't like how hard they are to use in the VS.NET designer especially the Panel control but I work around this by using Tables to control my layout. The controls themselves appear to render just fine although I've seen some problem with them rendering against NS 4.x. but then they would also have problems rendering against IE 4.x

                                    C 1 Reply Last reply
                                    0
                                    • P Paul Watson

                                      I am not sure if I am following your logic Steven, could you explain a bit more please?

                                      Paul Watson
                                      Bluegrass
                                      Cape Town, South Africa

                                      Robert Edward Caldecott wrote: My father-in-law calls yer man bits "weasels"

                                      S Offline
                                      S Offline
                                      Steven Hicks n 1
                                      wrote on last edited by
                                      #28

                                      Well the browser translates the HTML "code" into something visual it could regenerate what was translated and displayed back into code (which would be smaller, and not resulting in <font color="blue">h</font>he<font color="black">ll</font>o world). -Steven "the yellow dart" Hicks

                                      CPA

                                      CodeProjectAddict

                                      Actual Linux Penguins were harmed in the creation of this message.

                                      More tutorials: Ltpb.8m.com: Tutorials |404Browser.com (Download Link)

                                      P 1 Reply Last reply
                                      0
                                      • S Steven Hicks n 1

                                        Well the browser translates the HTML "code" into something visual it could regenerate what was translated and displayed back into code (which would be smaller, and not resulting in <font color="blue">h</font>he<font color="black">ll</font>o world). -Steven "the yellow dart" Hicks

                                        CPA

                                        CodeProjectAddict

                                        Actual Linux Penguins were harmed in the creation of this message.

                                        More tutorials: Ltpb.8m.com: Tutorials |404Browser.com (Download Link)

                                        P Offline
                                        P Offline
                                        Paul Watson
                                        wrote on last edited by
                                        #29

                                        I hear what you are saying. When IE parses the HTML document it must convert style="color: red" into something which Windows can understand and display. Also I assume it builds up a "tree" of the code. Now, it would be nice to think IE optimises that tree, detects redudnant styles and bits and excludes them. But does it? Also if you have ever used the DOM in IE to access the innerHTML property of an element you will see the pure and utter tripe that IE returns. It even takes wonderfully coded HTML and spits out rubbish (not knocking Chris at all but go to the submission wizard and using the nice new WYSIWYG editor put in some good HTML. Then swap to WYSIWYG view, then swap back to HTML view... You see what it does? Uppercases elements, rips out quotes, puts in FONT tags and what not. That is IE doing it! Shocking.) Redundant tags are fine, even I could probably write a parser that figured out redundant tags. It is when someone uses a TABLE instead of a DIV or a SPAN instead of a LABEL that it must become damned hard to write an optimiser. Also how does an optimiser "see" what the developer is trying to produce and then re-write the code to make it happen? Really, I don't think currently that can be done.

                                        Paul Watson
                                        Bluegrass
                                        Cape Town, South Africa

                                        Robert Edward Caldecott wrote: My father-in-law calls yer man bits "weasels"

                                        S 1 Reply Last reply
                                        0
                                        • C Codin Carlos

                                          OOH! The Manchurian Candidate! Good one! ;-)

                                          J Offline
                                          J Offline
                                          Jack Puppy
                                          wrote on last edited by
                                          #30

                                          The scene where the GIs are brainwashed to think they're in New Jersey is one of my favorite scenes of any film.


                                          My 20 favorite films:
                                          http://www.ymdb.com/user_top20_view.asp?usersid=8912

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups