What's a good FREE Windows-executable app that can scrape the text out of an HTML file?
-
I basically want to scrape out the text part of a webpage, leaving all the frilly stuff - including login screens - out. What I don't want is an app that just opens the HTML as a text file, the way Notepad does. I have also tried OpenOffice Writer: it worked once, putting the text at the bottom, but every subsequent time it just crashed.
-
Maybe one of these: ParseHub, or Diggernaut ("Turn website content into datasets").
-
I have written VBA in Access to do this before. Keep in mind that different sites use different tags, which is another small pain in the ass you have to handle.
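To illustrate the point about tags, here is a minimal sketch (not the VBA mentioned above) that uses only Python's standard `html.parser` module. The `SKIP_TAGS` set, the `TextExtractor` class, and the `html_to_text` function are names made up for this example, and the choice of which tags to drop is an assumption that would need adjusting per site.

```python
from html.parser import HTMLParser

# Tags whose contents are dropped entirely. This set is an assumption:
# different sites mark up navigation and login areas differently.
SKIP_TAGS = {"script", "style", "head", "nav", "form", "noscript"}


class TextExtractor(HTMLParser):
    """Collects visible text, ignoring anything nested inside SKIP_TAGS."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # how many skipped elements we are currently inside
        self._chunks = []     # text fragments kept so far

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            # Collapse runs of whitespace and drop empty fragments.
            text = " ".join(data.split())
            if text:
                self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html):
    """Return the text of an HTML string, one line per text fragment."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

For a saved page, something like `html_to_text(open("page.html", encoding="utf-8").read())` would print-ready the text, where `page.html` is a hypothetical file name. A login form wrapped in a `<div>` rather than a `<form>` would slip through, which is exactly the per-site tag handling mentioned above.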
-
What color should it be?
-
WebResourceProvider, perhaps? /ravi
-
Sounds like you want Lynx (see its Wikipedia article). It's a text-based web browser. It should work fine as long as you don't need automation.