Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. word wrapping

word wrapping

Scheduled Pinned Locked Moved The Lounge
htmlcssquestion
60 Posts 20 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • U User 13269747

    The browser rendering engine already has good support for splitting words. I put "hypens: auto;" into my CSS and I am quite pleased with the result. [edit]: My css is as follows (lookup the hyphenate styles for what they do)

    body {
    font-family: 'Computer Modern Serif';
    margin: 40px auto;
    max-width: 640px;
    line-height: 1.6;
    font-size: 16px;
    padding: 0 10px;
    hyphens: auto;
    hyphenate-limit-lines: 2;
    hyphenate-limit-chars: 6 3 2;
    text-indent: 30px;
    }

    H Offline
    H Offline
    honey the codewitch
    wrote on last edited by
    #23

    Wouldn't that be great if I didn't have to implement my own web browser which is the whole point of this?

    Real programmers use butterflies

    U 1 Reply Last reply
    0
    • H honey the codewitch

      Oh, so you're volunteering to write the CSS engine for me right?

      Real programmers use butterflies

      Y Offline
      Y Offline
      YahiaEQ
      wrote on last edited by
      #24

      Totally - since CSS engines are my bread and butter... not really, I just thought EPUB displays content using something that is capable of CSS...

      H 1 Reply Last reply
      0
      • R Rage

        honey the codewitch wrote:

        In the real world, I'd just find a syllable and then hyphenate

        What about hyphenating at random places :-) ? How often do you need it ?

        Do not escape reality : improve reality !

        H Offline
        H Offline
        honey the codewitch
        wrote on last edited by
        #25

        I'm not sure. It depends on the size of the screen that's connected to the e-reader I'm making. It can use any size screen, though in the end I'll be settling on a 600x800 epaper display

        Real programmers use butterflies

        1 Reply Last reply
        0
        • H honey the codewitch

          Wouldn't that be great if I didn't have to implement my own web browser which is the whole point of this?

          Real programmers use butterflies

          U Offline
          U Offline
          User 13269747
          wrote on last edited by
          #26

          Quote:

          Wouldn't that be great if I didn't have to implement my own web browser which is the whole point of this?

          Well, your initial message didn't specify that as a point and my crystal ball isn't working today :-) But, isn't it great that I gave you a really nice introduction into the complexities of your problem over here: The Lounge[^]?

          H 1 Reply Last reply
          0
          • Y YahiaEQ

            Totally - since CSS engines are my bread and butter... not really, I just thought EPUB displays content using something that is capable of CSS...

            H Offline
            H Offline
            honey the codewitch
            wrote on last edited by
            #27

            Yes, it does. I wish it didn't. I'm making an epub reader on a device that can't even run a basic web browser.

            Real programmers use butterflies

            Y 1 Reply Last reply
            0
            • U User 13269747

              Disregard my pevious answer - that's for HTML only. If it's for an e-reader you have bigger problems: use a library to do this because there are multiple things involved in flowing text. If you've never used LaTeX you are probably not familiar with all the complexities involved: 1. Font kerning: changes the width of a line. 2. Inter-word spacing: algorithm must ensure no rivers run through paragraphs and edges line up. 3. Paragraph indentation: in literary prose all paragraphs have an indentation. 4. Long word wrapping: You need to use a table of pre-calculated breakpoints that are specific to a glyph. 5. Language: Some languages read right-to-left, and these may be in the middle of a sentence in a language which reads left-to-right. If you're implementing an E-reader then you need to know all of the following concepts: 1. Kerning 2. Ligatures 3. Unicode code-points 4. Unicode BMP 5. Unicode surrogates 6. Unicode characters 7. Unicode glyphs 8. Rivers/runs in text 9. Struts and rules in text 10. Baselines, Caplines 11. Ascenders, descenders 12. Subpixels ... and probably a hundred other typesetting things I forgot or don't know about. Flowing text for an E-reader (or PDF, or book or any typesetting) is an entire Phd topic on its own and can take years of work to implement. It does help if you've used LaTeX in the past, because it chooses good defaults for all of the above, and if you want to change anything you're forced to learn what all those things mean.

              H Offline
              H Offline
              honey the codewitch
              wrote on last edited by
              #28

              Find me a library for this that works on the Arduino framework. 1. Done 2. Not 3. Done 4. Not 5. Done 6. Done 7. Done (although i think you mean ttf/otf glyphs 8. Not 9. Not 10.Done 11.I don't know what that is 12. If you mean fractional pixels or anti-aliasing, done

              Real programmers use butterflies

              U R 2 Replies Last reply
              0
              • U User 13269747

                Quote:

                Wouldn't that be great if I didn't have to implement my own web browser which is the whole point of this?

                Well, your initial message didn't specify that as a point and my crystal ball isn't working today :-) But, isn't it great that I gave you a really nice introduction into the complexities of your problem over here: The Lounge[^]?

                H Offline
                H Offline
                honey the codewitch
                wrote on last edited by
                #29

                I also didn't say I was using a web browser. It's weird that you would assume I was. C# is the most commonly used language at CP, not HTML and CSS.

                Real programmers use butterflies

                1 Reply Last reply
                0
                • H honey the codewitch

                  Yes, it does. I wish it didn't. I'm making an epub reader on a device that can't even run a basic web browser.

                  Real programmers use butterflies

                  Y Offline
                  Y Offline
                  YahiaEQ
                  wrote on last edited by
                  #30

                  Do you know https://github.com/litehtml/litehtml ? this not a browser but just a parser for html and CSS... perhaps this can help...

                  H 1 Reply Last reply
                  0
                  • Y YahiaEQ

                    Do you know https://github.com/litehtml/litehtml ? this not a browser but just a parser for html and CSS... perhaps this can help...

                    H Offline
                    H Offline
                    honey the codewitch
                    wrote on last edited by
                    #31

                    I've got a parser, and that uses the STL and I cannot rely on the STL being available. :)

                    Real programmers use butterflies

                    1 Reply Last reply
                    0
                    • H honey the codewitch

                      Find me a library for this that works on the Arduino framework. 1. Done 2. Not 3. Done 4. Not 5. Done 6. Done 7. Done (although i think you mean ttf/otf glyphs 8. Not 9. Not 10.Done 11.I don't know what that is 12. If you mean fractional pixels or anti-aliasing, done

                      Real programmers use butterflies

                      U Offline
                      U Offline
                      User 13269747
                      wrote on last edited by
                      #32

                      Quote:

                      Find me a library for this that works on the Arduino framework.

                      Maybe you should tell your client that the project they contracted you for is not completely realisable on their choice of hardware. Sure, you can get part of the way there on an atmega platform but it may not match their expectations. Or maybe get them to scale their requirements down to exclude extended unicode characters and exclude any sort of typesetting rules; just render text instead of typesetting it. Most (if not all) times my clients let me choose the hardware anyway (they set price restrictions and I tell them what is capable at that price) so it shouldn't really be a problem having this discussion with your client.

                      H 1 Reply Last reply
                      0
                      • H honey the codewitch

                        Say I don't have room to wrap a single world like supercalifragilisticexpialidocious. In the real world, I'd just find a syllable and then hyphenate. I don't think I can do that - and i don't think i *should* do that in html and css. I don't know *what* to do. What would you do?

                        Real programmers use butterflies

                        M Offline
                        M Offline
                        Marc Clifton
                        wrote on last edited by
                        #33

                        Scrollbars! :laugh:

                        Latest Articles:
                        DivWindow: Size, drag, minimize, and maximize floating windows with layout persistence

                        1 Reply Last reply
                        0
                        • U User 13269747

                          Quote:

                          Find me a library for this that works on the Arduino framework.

                          Maybe you should tell your client that the project they contracted you for is not completely realisable on their choice of hardware. Sure, you can get part of the way there on an atmega platform but it may not match their expectations. Or maybe get them to scale their requirements down to exclude extended unicode characters and exclude any sort of typesetting rules; just render text instead of typesetting it. Most (if not all) times my clients let me choose the hardware anyway (they set price restrictions and I tell them what is capable at that price) so it shouldn't really be a problem having this discussion with your client.

                          H Offline
                          H Offline
                          honey the codewitch
                          wrote on last edited by
                          #34

                          I could tell my client that but it's not true, and I wouldn't believe myself (I'm the client) anyway. I've got unicode, i'm working on the typesetting. It's not that the device isn't capable - I think I've written enough of the code to prove that the ESP32 can do it. It's just that I don't know the behaviors of all of it - not that I'm trying to implement all of it - just enough to show most epubs mostly correctly.

                          Real programmers use butterflies

                          U 1 Reply Last reply
                          0
                          • H honey the codewitch

                            Say I don't have room to wrap a single world like supercalifragilisticexpialidocious. In the real world, I'd just find a syllable and then hyphenate. I don't think I can do that - and i don't think i *should* do that in html and css. I don't know *what* to do. What would you do?

                            Real programmers use butterflies

                            S Offline
                            S Offline
                            Stuart Dootson
                            wrote on last edited by
                            #35

                            How much (ROM,RAM) space do you have? Could a translation of a library like [this](https://github.com/mnater/Hyphenopoly) into C++ be of any use? The core hyphenation engine is 24kB of WASM, which would probably translate into a similar amount of assembly...

                            Java, Basic, who cares - it's all a bunch of tree-hugging hippy cr*p

                            H 1 Reply Last reply
                            0
                            • S Stuart Dootson

                              How much (ROM,RAM) space do you have? Could a translation of a library like [this](https://github.com/mnater/Hyphenopoly) into C++ be of any use? The core hyphenation engine is 24kB of WASM, which would probably translate into a similar amount of assembly...

                              Java, Basic, who cares - it's all a bunch of tree-hugging hippy cr*p

                              H Offline
                              H Offline
                              honey the codewitch
                              wrote on last edited by
                              #36

                              I might be able to translate it, but 24kB is significant. I have about 300kB of usable general purpose RAM, 4MB of NVS flash, and *sometimes* 4MB of PSRAM Basically how I've orchestrated this it works in 300kB but works much faster if you have the PSRAM as well. The 4MB of NVS storage is divvied up between my code and a data partition I use to store unpacked EPUB content. I'm not sure offhand how much program space I have. The bad part of using more program space is it starts to make the dev cycle turnaround longer because you have to upload the code via serial uart.

                              Real programmers use butterflies

                              1 Reply Last reply
                              0
                              • H honey the codewitch

                                Say I don't have room to wrap a single world like supercalifragilisticexpialidocious. In the real world, I'd just find a syllable and then hyphenate. I don't think I can do that - and i don't think i *should* do that in html and css. I don't know *what* to do. What would you do?

                                Real programmers use butterflies

                                B Offline
                                B Offline
                                BernardIE5317
                                wrote on last edited by
                                #37

                                What's wrong w/ hyphenating at vowels? Software can figure out where the vowels are w/o a dictionary Best Wishes - Cheerio

                                H 1 Reply Last reply
                                0
                                • B BernardIE5317

                                  What's wrong w/ hyphenating at vowels? Software can figure out where the vowels are w/o a dictionary Best Wishes - Cheerio

                                  H Offline
                                  H Offline
                                  honey the codewitch
                                  wrote on last edited by
                                  #38

                                  I mean, breaking bookkeeper on the first o seems kind of not good.

                                  Real programmers use butterflies

                                  B 1 Reply Last reply
                                  0
                                  • H honey the codewitch

                                    I could tell my client that but it's not true, and I wouldn't believe myself (I'm the client) anyway. I've got unicode, i'm working on the typesetting. It's not that the device isn't capable - I think I've written enough of the code to prove that the ESP32 can do it. It's just that I don't know the behaviors of all of it - not that I'm trying to implement all of it - just enough to show most epubs mostly correctly.

                                    Real programmers use butterflies

                                    U Offline
                                    U Offline
                                    User 13269747
                                    wrote on last edited by
                                    #39

                                    Quote:

                                    I could tell my client that but it's not true,

                                    You appaer to be in agreement with this statement of mine:

                                    Quote:

                                    Sure, you can get part of the way there on an atmega platform

                                    So, yeah, that's completely true and factual - you can definitely get part of the way there! I'm guessing that the easiest bits can be done (inter-word spacing and hyphenation at syllable boundaries), but anything else you add in is going to result in a ragged right-edge. If you are okay with a ragged right edge, then it all becomes doable on an ESP32 (that's like what, 80KB RAM?)

                                    H 1 Reply Last reply
                                    0
                                    • H honey the codewitch

                                      I mean, breaking bookkeeper on the first o seems kind of not good.

                                      Real programmers use butterflies

                                      B Offline
                                      B Offline
                                      BernardIE5317
                                      wrote on last edited by
                                      #40

                                      Why not break at the consonants then or simply rely on whatever the rule is for word wrapping w/ hyphenation I assume such a rule exist But in any case a dictionary is not needed - Cheerio

                                      1 Reply Last reply
                                      0
                                      • H honey the codewitch

                                        Say I don't have room to wrap a single world like supercalifragilisticexpialidocious. In the real world, I'd just find a syllable and then hyphenate. I don't think I can do that - and i don't think i *should* do that in html and css. I don't know *what* to do. What would you do?

                                        Real programmers use butterflies

                                        M Offline
                                        M Offline
                                        mdblack98
                                        wrote on last edited by
                                        #41

                                        Look at the CSS options which cover a lot https://tippingpoint.dev/pure-css-truncate-text Seem like simple wrap would be good for an ereader just splitting the word at the last char on the line...no ellipses or such needed.

                                        H 1 Reply Last reply
                                        0
                                        • U User 13269747

                                          Disregard my pevious answer - that's for HTML only. If it's for an e-reader you have bigger problems: use a library to do this because there are multiple things involved in flowing text. If you've never used LaTeX you are probably not familiar with all the complexities involved: 1. Font kerning: changes the width of a line. 2. Inter-word spacing: algorithm must ensure no rivers run through paragraphs and edges line up. 3. Paragraph indentation: in literary prose all paragraphs have an indentation. 4. Long word wrapping: You need to use a table of pre-calculated breakpoints that are specific to a glyph. 5. Language: Some languages read right-to-left, and these may be in the middle of a sentence in a language which reads left-to-right. If you're implementing an E-reader then you need to know all of the following concepts: 1. Kerning 2. Ligatures 3. Unicode code-points 4. Unicode BMP 5. Unicode surrogates 6. Unicode characters 7. Unicode glyphs 8. Rivers/runs in text 9. Struts and rules in text 10. Baselines, Caplines 11. Ascenders, descenders 12. Subpixels ... and probably a hundred other typesetting things I forgot or don't know about. Flowing text for an E-reader (or PDF, or book or any typesetting) is an entire Phd topic on its own and can take years of work to implement. It does help if you've used LaTeX in the past, because it chooses good defaults for all of the above, and if you want to change anything you're forced to learn what all those things mean.

                                          C Offline
                                          C Offline
                                          CosmoSpacely
                                          wrote on last edited by
                                          #42

                                          There is a pre baked library that covers many of these problems: http://site.icu-project.org/ Do not underestimate the weedy-ness of wrapping text. Check out some of the South Asian scripts. Thai, for example, has no spaces. Another strategy would be to scroll the too-long word. Think 1980's 16 character LED and LCD displays.

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups