Text extraction
-
How do I extract text from an MS Word document? It would be better if the end user does not need to have Microsoft Office installed.
-
Well, try scanning the file for characters with an ASCII value between 32 and 126 (I think that's right: those are the printable characters you can type on a keyboard, but check it before you do).
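A rough sketch of that idea in Python, in case it helps. The function name and the minimum run length of 4 are my own assumptions, not anything Word defines. It also looks for the same characters stored as UTF-16LE (each byte followed by a zero byte), since newer .doc files often keep text that way. Expect noise: font names, metadata and other junk will come out along with the body text.

```python
import re

# Printable ASCII is 0x20 (space) through 0x7E (tilde); 0x7F is DEL, a control code.
ASCII_RUN = re.compile(rb"[\x20-\x7e]{4,}")
# The same characters stored as UTF-16LE: each byte is followed by a zero byte.
UTF16_RUN = re.compile(rb"(?:[\x20-\x7e]\x00){4,}")


def extract_printable_runs(data):
    """Return runs of 4 or more printable ASCII characters found in raw bytes.

    Hypothetical helper for illustration; it knows nothing about the
    real .doc structure and simply scans the bytes.
    """
    runs = [(m.start(), m.group().decode("ascii")) for m in ASCII_RUN.finditer(data)]
    runs += [(m.start(), m.group().decode("utf-16-le")) for m in UTF16_RUN.finditer(data)]
    runs.sort(key=lambda pair: pair[0])  # keep the order in which text appears in the file
    return [text for _, text in runs]


if __name__ == "__main__":
    import sys

    # Usage: python extract.py input.doc > output.txt
    with open(sys.argv[1], "rb") as f:
        for run in extract_printable_runs(f.read()):
            print(run)
```

This is a brute-force scan, not a converter: it cannot tell document text from anything else that happens to be printable.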
-
What I need is to convert a Microsoft Word document to a text file. How can I do this without having Microsoft Office? If that is not possible, using a library from Microsoft Office is fine, but how?
Geez. Open the document, parse it, but discard any formatting information - just keep the text. Write the text to a file. Done. WHY do YOU think "Office-to-anything-else" converters are so freaking expensive? Cheers, Sebastian -- Contra vim mortem non est medicamen in hortem.
-
From now on, questions are answered at a rate of 40 EUR/h. Answering your last question would take around 6-12 months. Should I go ahead? Seriously, if you really want to do it, you will have to analyze the file format yourself. I ain't got no clue how Word hides its content. Cheers, Sebastian -- Contra vim mortem non est medicamen in hortem.
-
How do I extract text from an MS Word document? It would be better if the end user does not need to have Microsoft Office installed.
I can recommend www.wordcnv.com - these guys have the fastest library, which can be supplied as a small (<50K!) lib file, and their support is great. I'm using their library myself. Igor Green http://www.grigsoft.com/ - files and folders comparison tools
-
How do I extract text from an MS Word document? It would be better if the end user does not need to have Microsoft Office installed.
Like others wrote, you need to do your own reverse engineering of the Word format to extract the text, which is no walk in the park. I'm sure that if you google enough you'll find something interesting on the subject.
Maximilien Lincourt Your Head A Splode - Strong Bad
-
How do I extract text from an MS Word document? It would be better if the end user does not need to have Microsoft Office installed.
Without using Word, you'll need to know the format of a .doc file. See http://www.wotsit.org/ for this.
"The greatest good you can do for another is not just to share your riches but to reveal to him his own." - Benjamin Disraeli
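As a first step toward "knowing the format", here is a small Python sketch that checks whether a file even looks like a binary .doc. Binary Word documents are stored in an OLE compound file, which starts with the 8-byte signature D0 CF 11 E0 A1 B1 1A E1. The function name is made up for illustration. Note that other OLE files (old Excel and PowerPoint files, for example) carry the same signature, so a match only says "OLE container", not "definitely Word".

```python
# Signature at the start of every OLE compound file, the container used by binary .doc.
OLE_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")


def looks_like_ole_container(data):
    """Return True if the bytes begin with the OLE compound file signature.

    Hypothetical helper: it checks the container signature only and does
    not verify that a Word document stream is actually inside.
    """
    return data[:8] == OLE_MAGIC


if __name__ == "__main__":
    import sys

    # Usage: python check.py input.doc
    with open(sys.argv[1], "rb") as f:
        header = f.read(8)
    print("OLE compound file" if looks_like_ole_container(header) else "not an OLE compound file")
```

Getting from the container to the actual text still means walking the compound file's streams and then the Word structures inside them, which is the hard part the posts above are talking about.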