Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. What is the term for this area of research...

What is the term for this area of research...

Scheduled Pinned Locked Moved The Lounge
questioncomregex
13 Posts 6 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • C Offline
    C Offline
    chaiguy1337
    wrote on last edited by
    #1

    I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ

    “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

    C S Steve EcholsS D J 5 Replies Last reply
    0
    • C chaiguy1337

      I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ

      “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

      C Offline
      C Offline
      Chris Meech
      wrote on last edited by
      #2

      All of the above. :)

      Chris Meech I am Canadian. [heard in a local bar] In theory there is no difference between theory and practice. In practice there is. [Yogi Berra]

      C 1 Reply Last reply
      0
      • C Chris Meech

        All of the above. :)

        Chris Meech I am Canadian. [heard in a local bar] In theory there is no difference between theory and practice. In practice there is. [Yogi Berra]

        C Offline
        C Offline
        chaiguy1337
        wrote on last edited by
        #3

        Chris Meech wrote:

        All of the above.

        Great! That narrows it down. :P

        “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

        1 Reply Last reply
        0
        • C chaiguy1337

          I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ

          “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

          S Offline
          S Offline
          Steven A Lowe
          wrote on last edited by
          #4

          pattern matching google regex

          Best regards, Steven A. Lowe CEO, Innovator LLC www.nov8r.com

          C 1 Reply Last reply
          0
          • S Steven A Lowe

            pattern matching google regex

            Best regards, Steven A. Lowe CEO, Innovator LLC www.nov8r.com

            C Offline
            C Offline
            chaiguy1337
            wrote on last edited by
            #5

            Ah yeah, I know all about regular expressions; just wanted to know if there was a specific field that may not necessarily use regex's to do it. Though now that you remind me I recall once running into a huge "regex library" for all sorts of various kinds of patterns... that might be very useful. Alas I wonder how "powerful" regex is for matching more complex things like street addresses, or geographical coordinates, where more than just a pattern is involved. Then again, I suppose it would be a good start, at least to find potential matches, and then use some logic afterwards to verify them or what not.

            “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

            1 Reply Last reply
            0
            • C chaiguy1337

              I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ

              “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

              Steve EcholsS Offline
              Steve EcholsS Offline
              Steve Echols
              wrote on last edited by
              #6

              My gut goes with Pattern Recognition, but my head can't decide. What about "Parsing"?


              - S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!

              • S
                50 cups of coffee and you know it's on!
                Code, follow, or get out of the way.
              C 1 Reply Last reply
              0
              • Steve EcholsS Steve Echols

                My gut goes with Pattern Recognition, but my head can't decide. What about "Parsing"?


                - S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!

                C Offline
                C Offline
                chaiguy1337
                wrote on last edited by
                #7

                According to Wiki, pattern recognition is more like facial recognition and stuff, so that's not what I want. Not sure about parsing either, because that implies turning it into something else, whereas I'm really only (on the surface) interested in finding those segments. (Yes, I'll then turn them into something, so in that sense it's parsing, but I want to do that part myself.) I'm gonna look into regex's and see if I can find that library of patterns again. I was hoping someone might have had some experience or could recommend some places or books to read up on the subject. I'm especially interested in efficient algorithms, and as far as I know regex's are pretty quick, so they might fit the bill--but then again what if I have a hundred patterns I'm looking for--a regex might be quick for a single pattern, but can I scan for a hundred at the same time? These are the sorts of things I want to find out.

                “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                Steve EcholsS 1 Reply Last reply
                0
                • C chaiguy1337

                  According to Wiki, pattern recognition is more like facial recognition and stuff, so that's not what I want. Not sure about parsing either, because that implies turning it into something else, whereas I'm really only (on the surface) interested in finding those segments. (Yes, I'll then turn them into something, so in that sense it's parsing, but I want to do that part myself.) I'm gonna look into regex's and see if I can find that library of patterns again. I was hoping someone might have had some experience or could recommend some places or books to read up on the subject. I'm especially interested in efficient algorithms, and as far as I know regex's are pretty quick, so they might fit the bill--but then again what if I have a hundred patterns I'm looking for--a regex might be quick for a single pattern, but can I scan for a hundred at the same time? These are the sorts of things I want to find out.

                  “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                  Steve EcholsS Offline
                  Steve EcholsS Offline
                  Steve Echols
                  wrote on last edited by
                  #8

                  Pattern recognition isn't just facial recognition, it can also be patterns in data (which is essentially what images are, right?) Anyway, If you look at the Wiki for Regex, it has a broad range of terms that essentially mean the same thing, but I think "pattern matching" is probably the best for what you're doing. Regex is powerful, but not the most efficient if you're doing it over and over, or scanning large documents. There are optimizations you can make, such as precompiling your regex expression. Also if can limit the size of string you're scanning, it obviously will make a huge improvement. That's all I got.


                  - S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!

                  • S
                    50 cups of coffee and you know it's on!
                    Code, follow, or get out of the way.
                  C 1 Reply Last reply
                  0
                  • Steve EcholsS Steve Echols

                    Pattern recognition isn't just facial recognition, it can also be patterns in data (which is essentially what images are, right?) Anyway, If you look at the Wiki for Regex, it has a broad range of terms that essentially mean the same thing, but I think "pattern matching" is probably the best for what you're doing. Regex is powerful, but not the most efficient if you're doing it over and over, or scanning large documents. There are optimizations you can make, such as precompiling your regex expression. Also if can limit the size of string you're scanning, it obviously will make a huge improvement. That's all I got.


                    - S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!

                    C Offline
                    C Offline
                    chaiguy1337
                    wrote on last edited by
                    #9

                    Sounds good; I'll check it out. Thanks.

                    “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                    1 Reply Last reply
                    0
                    • C chaiguy1337

                      I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ

                      “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                      D Offline
                      D Offline
                      dwales
                      wrote on last edited by
                      #10

                      Sounds like you want Text Analytics or Text Mining (both in wikipedia)

                      C 1 Reply Last reply
                      0
                      • D dwales

                        Sounds like you want Text Analytics or Text Mining (both in wikipedia)

                        C Offline
                        C Offline
                        chaiguy1337
                        wrote on last edited by
                        #11

                        I'll check those out too, thanks!

                        “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                        1 Reply Last reply
                        0
                        • C chaiguy1337

                          I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ

                          “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                          J Offline
                          J Offline
                          Jonathan C Dickinson
                          wrote on last edited by
                          #12

                          Microsoft SmartTags do the same thing, so you might find the name for the technology in write-ups on the product.

                          He who asks a question is a fool for five minutes. He who does not ask a question remains a fool forever. [Chineese Proverb] Jonathan C Dickinson (C# Software Engineer)

                          C 1 Reply Last reply
                          0
                          • J Jonathan C Dickinson

                            Microsoft SmartTags do the same thing, so you might find the name for the technology in write-ups on the product.

                            He who asks a question is a fool for five minutes. He who does not ask a question remains a fool forever. [Chineese Proverb] Jonathan C Dickinson (C# Software Engineer)

                            C Offline
                            C Offline
                            chaiguy1337
                            wrote on last edited by
                            #13

                            Thanks.

                            “Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’)            Built with home-grown CodeProject components! -”-”-

                            1 Reply Last reply
                            0
                            Reply
                            • Reply as topic
                            Log in to reply
                            • Oldest to Newest
                            • Newest to Oldest
                            • Most Votes


                            • Login

                            • Don't have an account? Register

                            • Login or register to search.
                            • First post
                              Last post
                            0
                            • Categories
                            • Recent
                            • Tags
                            • Popular
                            • World
                            • Users
                            • Groups