Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. Parsing user input

Parsing user input

Scheduled Pinned Locked Moved The Lounge
pythoncomjsontutoriallearning
37 Posts 26 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • M Offline
    M Offline
    Marc Clifton
    wrote on last edited by
    #1

    Examples (#'s have been removed):

    P O BOX
    P.O. BOX
    PMB
    PO B0X
    PO BO X
    PO BOK
    PO BOS
    BOX

    :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

    Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

    B M T V J 15 Replies Last reply
    0
    • M Marc Clifton

      Examples (#'s have been removed):

      P O BOX
      P.O. BOX
      PMB
      PO B0X
      PO BO X
      PO BOK
      PO BOS
      BOX

      :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

      Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

      B Offline
      B Offline
      Bassam Abdul Baki
      wrote on last edited by
      #2

      Could have been a manager studying his PMBOK. :confused:

      Web - BM - RSS - Math - LinkedIn

      J 1 Reply Last reply
      0
      • M Marc Clifton

        Examples (#'s have been removed):

        P O BOX
        P.O. BOX
        PMB
        PO B0X
        PO BO X
        PO BOK
        PO BOS
        BOX

        :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

        Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

        M Offline
        M Offline
        Maximilien
        wrote on last edited by
        #3

        You really need to parse addresses ?If you start doing that, there will always be outliers that you will miss. :confused:

        I'd rather be phishing!

        N S M 3 Replies Last reply
        0
        • M Marc Clifton

          Examples (#'s have been removed):

          P O BOX
          P.O. BOX
          PMB
          PO B0X
          PO BO X
          PO BOK
          PO BOS
          BOX

          :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

          Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

          T Offline
          T Offline
          Tim Carmichael
          wrote on last edited by
          #4

          When we put our mail on vacation hold, it validates and 'normalizes' the address, so I do understand what you're working with. Where I grew up, our address was RR#1; it wasn't until I was in my teens that we had an address with a number and street name. So.. consider this.. are you only dealing with P.O. and its variants or do you have R.R. addresses as well?

          M B E K 4 Replies Last reply
          0
          • T Tim Carmichael

            When we put our mail on vacation hold, it validates and 'normalizes' the address, so I do understand what you're working with. Where I grew up, our address was RR#1; it wasn't until I was in my teens that we had an address with a number and street name. So.. consider this.. are you only dealing with P.O. and its variants or do you have R.R. addresses as well?

            M Offline
            M Offline
            Marc Clifton
            wrote on last edited by
            #5

            RR, CR, HC, etc., as well as regular street addresses (as best as those are). Perfect accuracy is not necessary, just best guess. :) Marc

            Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

            A 1 Reply Last reply
            0
            • T Tim Carmichael

              When we put our mail on vacation hold, it validates and 'normalizes' the address, so I do understand what you're working with. Where I grew up, our address was RR#1; it wasn't until I was in my teens that we had an address with a number and street name. So.. consider this.. are you only dealing with P.O. and its variants or do you have R.R. addresses as well?

              B Offline
              B Offline
              Bassam Abdul Baki
              wrote on last edited by
              #6

              Tim Carmichael wrote:

              RR#1

              Rolls Royce #1? Homeless billionaire? [sidebar] Reminds me of [The Bumpkin Billionaires](https://en.wikipedia.org/wiki/The\_Bumpkin\_Billionaires) which I used to read as a kid.

              Web - BM - RSS - Math - LinkedIn

              R 1 Reply Last reply
              0
              • M Marc Clifton

                Examples (#'s have been removed):

                P O BOX
                P.O. BOX
                PMB
                PO B0X
                PO BO X
                PO BOK
                PO BOS
                BOX

                :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

                Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                V Offline
                V Offline
                Vikram A Punathambekar
                wrote on last edited by
                #7

                Marc Clifton wrote:

                The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'.

                Maybe [this](https://xkcd.com/1530/) happened to your user? :-D Just kidding - O and K are nearby, so he probably hit K accidentally along with BO and missed the X :-\ Or maybe he went for BOKS and missed the S, who knows? :doh:

                Cheers, विक्रम "We have already been through this, I am not going to repeat myself." - fat_boy, in a global warming thread :doh:

                1 Reply Last reply
                0
                • B Bassam Abdul Baki

                  Could have been a manager studying his PMBOK. :confused:

                  Web - BM - RSS - Math - LinkedIn

                  J Offline
                  J Offline
                  Jeremy Falcon
                  wrote on last edited by
                  #8

                  :laugh: :thumbsup:

                  Jeremy Falcon

                  1 Reply Last reply
                  0
                  • M Marc Clifton

                    Examples (#'s have been removed):

                    P O BOX
                    P.O. BOX
                    PMB
                    PO B0X
                    PO BO X
                    PO BOK
                    PO BOS
                    BOX

                    :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

                    Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                    J Offline
                    J Offline
                    Jeremy Falcon
                    wrote on last edited by
                    #9

                    See, isn't programming fun!?

                    Jeremy Falcon

                    1 Reply Last reply
                    0
                    • M Marc Clifton

                      Examples (#'s have been removed):

                      P O BOX
                      P.O. BOX
                      PMB
                      PO B0X
                      PO BO X
                      PO BOK
                      PO BOS
                      BOX

                      :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

                      Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                      G Offline
                      G Offline
                      Gary Wheeler
                      wrote on last edited by
                      #10

                      I smell OCR in the mix - hence the BOK, BOS, B0X, etc.

                      Software Zen: delete this;

                      M V 2 Replies Last reply
                      0
                      • B Bassam Abdul Baki

                        Tim Carmichael wrote:

                        RR#1

                        Rolls Royce #1? Homeless billionaire? [sidebar] Reminds me of [The Bumpkin Billionaires](https://en.wikipedia.org/wiki/The\_Bumpkin\_Billionaires) which I used to read as a kid.

                        Web - BM - RSS - Math - LinkedIn

                        R Offline
                        R Offline
                        Roland M Smith
                        wrote on last edited by
                        #11

                        Rural Route

                        1 Reply Last reply
                        0
                        • G Gary Wheeler

                          I smell OCR in the mix - hence the BOK, BOS, B0X, etc.

                          Software Zen: delete this;

                          M Offline
                          M Offline
                          Marc Clifton
                          wrote on last edited by
                          #12

                          Gary Wheeler wrote:

                          I smell OCR in the mix - hence the BOK, BOS, B0X, etc.

                          Ah - excellent point! Marc

                          Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                          1 Reply Last reply
                          0
                          • M Marc Clifton

                            Examples (#'s have been removed):

                            P O BOX
                            P.O. BOX
                            PMB
                            PO B0X
                            PO BO X
                            PO BOK
                            PO BOS
                            BOX

                            :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

                            Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                            C Offline
                            C Offline
                            Chris Losinger
                            wrote on last edited by
                            #13

                            welcome to my life

                            H V 2 Replies Last reply
                            0
                            • C Chris Losinger

                              welcome to my life

                              H Offline
                              H Offline
                              Herbie Mountjoy
                              wrote on last edited by
                              #14

                              And I thought parsing dates that had been defined as strings or ints or decimals was a nightmare. Maybe there is a need for an AI to second guess what the user might have meant.

                              We're philosophical about power outages here. A.C. come, A.C. go.

                              1 Reply Last reply
                              0
                              • M Maximilien

                                You really need to parse addresses ?If you start doing that, there will always be outliers that you will miss. :confused:

                                I'd rather be phishing!

                                N Offline
                                N Offline
                                Nelek
                                wrote on last edited by
                                #15

                                Maximilien wrote:

                                If you start doing that, there will always be outliers that you will miss.

                                Software development is a constant war with the universe... Developers trying to do better idiot-proof software and the universe trying to do even dumber users... So far the universe is winning

                                M.D.V. ;) If something has a solution... Why do we have to worry about?. If it has no solution... For what reason do we have to worry about? Help me to understand what I'm saying, and I'll explain it better to you Rating helpful answers is nice, but saying thanks can be even nicer.

                                F B 2 Replies Last reply
                                0
                                • M Marc Clifton

                                  Examples (#'s have been removed):

                                  P O BOX
                                  P.O. BOX
                                  PMB
                                  PO B0X
                                  PO BO X
                                  PO BOK
                                  PO BOS
                                  BOX

                                  :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

                                  Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                                  D Offline
                                  D Offline
                                  dan sh
                                  wrote on last edited by
                                  #16

                                  Randomly throw them to various fields. They might not be bright enough to notice.

                                  "It is easy to decipher extraterrestrial signals after deciphering Javascript and VB6 themselves.", ISanti[^]

                                  1 Reply Last reply
                                  0
                                  • M Maximilien

                                    You really need to parse addresses ?If you start doing that, there will always be outliers that you will miss. :confused:

                                    I'd rather be phishing!

                                    S Offline
                                    S Offline
                                    sir_download_alot
                                    wrote on last edited by
                                    #17

                                    Fully agree! This is mission impossible. How can one know that "BOS" should be "BOX" and not "BOSS" or "BOSSA NOVA"? Keep it simple and no risk, no fun!

                                    1 Reply Last reply
                                    0
                                    • M Marc Clifton

                                      Examples (#'s have been removed):

                                      P O BOX
                                      P.O. BOX
                                      PMB
                                      PO B0X
                                      PO BO X
                                      PO BOK
                                      PO BOS
                                      BOX

                                      :sigh: The one with the 'K' is interesting. 'K' is on the opposite side of the keyboard -- I can understand the 'S'. The hardest part about parsing crap like this (there are 166,333 records) is determining what other variants I did not parse correctly (for example, considered as a street address, not a PO Box), not which ones I successfully accounted for. Marc

                                      Latest Article - Create a Dockerized Python Fiddle Web App Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny Artificial intelligence is the only remedy for natural stupidity. - CDP1802

                                      K Offline
                                      K Offline
                                      kalberts
                                      wrote on last edited by
                                      #18

                                      We have several times received paper mail where the entire name/address is no more than an alphabet soup - yet it is delivered to us no more than one day delayed. First time this happened we were really puzzled: How could the mailman know that the mail is intended for us? (It is!) Finally we realized that a keyboard "Left shift" operation would give our name and address correctly. Later, we have seen both right and left shifts, of one hand or both hands. I asked a mail guy about it, and he confirmed that is is well known: If name/address looks like alphabet soup, chances are 9 in 10 that a keyboard shift changes it to a sensible address. Maybe you should include full and partial (i.e. one-hand) right and left shifts in your user input parsing. But don't expect the shift machine instructions to be of great help for this task :-)

                                      1 Reply Last reply
                                      0
                                      • N Nelek

                                        Maximilien wrote:

                                        If you start doing that, there will always be outliers that you will miss.

                                        Software development is a constant war with the universe... Developers trying to do better idiot-proof software and the universe trying to do even dumber users... So far the universe is winning

                                        M.D.V. ;) If something has a solution... Why do we have to worry about?. If it has no solution... For what reason do we have to worry about? Help me to understand what I'm saying, and I'll explain it better to you Rating helpful answers is nice, but saying thanks can be even nicer.

                                        F Offline
                                        F Offline
                                        fmsalmeida
                                        wrote on last edited by
                                        #19

                                        Nelek wrote:

                                        Software development is a constant war with the universe... Developers trying to do better idiot-proof software and the universe trying to do even dumber users...

                                        You made my day with this phrase!

                                        N 1 Reply Last reply
                                        0
                                        • N Nelek

                                          Maximilien wrote:

                                          If you start doing that, there will always be outliers that you will miss.

                                          Software development is a constant war with the universe... Developers trying to do better idiot-proof software and the universe trying to do even dumber users... So far the universe is winning

                                          M.D.V. ;) If something has a solution... Why do we have to worry about?. If it has no solution... For what reason do we have to worry about? Help me to understand what I'm saying, and I'll explain it better to you Rating helpful answers is nice, but saying thanks can be even nicer.

                                          B Offline
                                          B Offline
                                          BryanFazekas
                                          wrote on last edited by
                                          #20

                                          The universe will always win.

                                          1 Reply Last reply
                                          0
                                          Reply
                                          • Reply as topic
                                          Log in to reply
                                          • Oldest to Newest
                                          • Newest to Oldest
                                          • Most Votes


                                          • Login

                                          • Don't have an account? Register

                                          • Login or register to search.
                                          • First post
                                            Last post
                                          0
                                          • Categories
                                          • Recent
                                          • Tags
                                          • Popular
                                          • World
                                          • Users
                                          • Groups