Extracting text from HTML
-
Anyone know of a good method for extracting the text from an HTML document (loaded from an address). I would also be interested in knowing the size of the text as well. Any thoughts?
-
Anyone know of a good method for extracting the text from an HTML document (loaded from an address). I would also be interested in knowing the size of the text as well. Any thoughts?
StreamReader should be able to read the text.
-
Anyone know of a good method for extracting the text from an HTML document (loaded from an address). I would also be interested in knowing the size of the text as well. Any thoughts?
You can try applying a regex that replaces everything between < and > with empty strings. There is also this[^] SGML parser which you can use to extract the text. Regards Senthil _____________________________ My Blog | My Articles | WinMacro