How to process each word in a microsoft word document
-
-
I known 2 ways. 1 is regex 2 is use string.split(' ') splite the phrase to a string[]. wang
-
I assume that you mean that you want to get the text out of Word and manipulate it in some way. There is a way that doesn't involve using Microsoft Word at all, and relies instead on something called an IFilter. Basically, an IFilter is the mechanism by which the Search mechanism in Windows works, and allows you to get information out of sources such as Excel, Word, Adobe Acrobat, etc. What you need to do is read in the document using the appropriate IFilter and then parse it to extract the words. Here's a link to a C# version that shows what you need to do. http://www.codeproject.com/csharp/IFilter.asp[^]
the last thing I want to see is some pasty-faced geek with skin so pale that it's almost translucent trying to bump parts with a partner - John Simmons / outlaw programmer
Deja View - the feeling that you've seen this post before.