Quite honestly, if I had the time I would not use IE at all. Better use a recompiled Mozilla or other programmable browsers with source code. And put web page filtering on top of it. What has stopped me many times already with programming IE is that there are many cases where the IE API doesn't work as expected. For instance, when you have frames in the web page, several events are not triggered at all, or you have to subscribe special things. None of this is documented. So it takes time, and there is no guarantee at all you succeed. Besides that, IE uses multiple threads. One thread for each picture being downloaded for instance, which means it is hard to stop the process as a whole. If you go start programming IE, then you may consider this scenario : - I assume you've got the target URL - use MSHTML as a COM object to download the html code from that URL. (you also could use the wininet library). Store it as a file. To do this, look for the MSDN sample called "Walkall" (C++ code). - filter the html code - add the <base href="URL"> tag inside the HTML code, in order to mimic the right base domain. - ask IE to load this html code in the browser : document.load(...)
MS quote (http://www.microsoft.com/ddk) : As of September 30, 2002, the Microsoft® Windows® 2000 DDK, the Microsoft Windows 98 DDK, and the Microsoft Windows NT® 4.0 DDK will no longer be available for purchase or download on this site.