What is the term for this area of research...
-
I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
All of the above. :)
Chris Meech I am Canadian. [heard in a local bar] In theory there is no difference between theory and practice. In practice there is. [Yogi Berra]
-
All of the above. :)
Chris Meech I am Canadian. [heard in a local bar] In theory there is no difference between theory and practice. In practice there is. [Yogi Berra]
Chris Meech wrote:
All of the above.
Great! That narrows it down. :P
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
pattern matching google regex
Best regards, Steven A. Lowe CEO, Innovator LLC www.nov8r.com
-
pattern matching google regex
Best regards, Steven A. Lowe CEO, Innovator LLC www.nov8r.com
Ah yeah, I know all about regular expressions; just wanted to know if there was a specific field that may not necessarily use regex's to do it. Though now that you remind me I recall once running into a huge "regex library" for all sorts of various kinds of patterns... that might be very useful. Alas I wonder how "powerful" regex is for matching more complex things like street addresses, or geographical coordinates, where more than just a pattern is involved. Then again, I suppose it would be a good start, at least to find potential matches, and then use some logic afterwards to verify them or what not.
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
My gut goes with Pattern Recognition, but my head can't decide. What about "Parsing"?
- S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!
-
My gut goes with Pattern Recognition, but my head can't decide. What about "Parsing"?
- S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!
According to Wiki, pattern recognition is more like facial recognition and stuff, so that's not what I want. Not sure about parsing either, because that implies turning it into something else, whereas I'm really only (on the surface) interested in finding those segments. (Yes, I'll then turn them into something, so in that sense it's parsing, but I want to do that part myself.) I'm gonna look into regex's and see if I can find that library of patterns again. I was hoping someone might have had some experience or could recommend some places or books to read up on the subject. I'm especially interested in efficient algorithms, and as far as I know regex's are pretty quick, so they might fit the bill--but then again what if I have a hundred patterns I'm looking for--a regex might be quick for a single pattern, but can I scan for a hundred at the same time? These are the sorts of things I want to find out.
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
According to Wiki, pattern recognition is more like facial recognition and stuff, so that's not what I want. Not sure about parsing either, because that implies turning it into something else, whereas I'm really only (on the surface) interested in finding those segments. (Yes, I'll then turn them into something, so in that sense it's parsing, but I want to do that part myself.) I'm gonna look into regex's and see if I can find that library of patterns again. I was hoping someone might have had some experience or could recommend some places or books to read up on the subject. I'm especially interested in efficient algorithms, and as far as I know regex's are pretty quick, so they might fit the bill--but then again what if I have a hundred patterns I'm looking for--a regex might be quick for a single pattern, but can I scan for a hundred at the same time? These are the sorts of things I want to find out.
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
Pattern recognition isn't just facial recognition, it can also be patterns in data (which is essentially what images are, right?) Anyway, If you look at the Wiki for Regex, it has a broad range of terms that essentially mean the same thing, but I think "pattern matching" is probably the best for what you're doing. Regex is powerful, but not the most efficient if you're doing it over and over, or scanning large documents. There are optimizations you can make, such as precompiling your regex expression. Also if can limit the size of string you're scanning, it obviously will make a huge improvement. That's all I got.
- S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!
-
Pattern recognition isn't just facial recognition, it can also be patterns in data (which is essentially what images are, right?) Anyway, If you look at the Wiki for Regex, it has a broad range of terms that essentially mean the same thing, but I think "pattern matching" is probably the best for what you're doing. Regex is powerful, but not the most efficient if you're doing it over and over, or scanning large documents. There are optimizations you can make, such as precompiling your regex expression. Also if can limit the size of string you're scanning, it obviously will make a huge improvement. That's all I got.
- S 50 cups of coffee and you know it's on! A post a day, keeps the white coats away!
Sounds good; I'll check it out. Thanks.
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
I'll check those out too, thanks!
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
-
I want to research some stuff but don't know what the proper term for it is. What is it called where you take a body of text and find information in it that is recognized as some certain form... like finding a street address, or a phone number, or a web or email address... stuff like that? Text processing? Pattern matching? Pattern recognition? Data mining? Furthermore, anyone know any good libraries--preferably free--for this kind of thing? Civ
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-
Microsoft SmartTags do the same thing, so you might find the name for the technology in write-ups on the product.
He who asks a question is a fool for five minutes. He who does not ask a question remains a fool forever. [Chineese Proverb] Jonathan C Dickinson (C# Software Engineer)
-
Microsoft SmartTags do the same thing, so you might find the name for the technology in write-ups on the product.
He who asks a question is a fool for five minutes. He who does not ask a question remains a fool forever. [Chineese Proverb] Jonathan C Dickinson (C# Software Engineer)
Thanks.
“Time and space can be a bitch.” –Gushie, Quantum Leap {o,o}.oO( Looking for a great RSS reader? Try FeedBeast! ) |)””’) Built with home-grown CodeProject components! -”-”-