Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. Web Development
  3. Linux, Apache, MySQL, PHP
  4. Need to capture raw HTML and write to file any page displayed on browser

Need to capture raw HTML and write to file any page displayed on browser

Scheduled Pinned Locked Moved Linux, Apache, MySQL, PHP
perlhtmldatabasesecuritytools
3 Posts 2 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • A Offline
    A Offline
    Antone Eason
    wrote on last edited by
    #1

    Greetings, I would like to know if anyone has any code or even VB script, Perl etc that can simply be run from a command line to download whatever page is on a browser and write the raw HTML to a file, it can also be MHT for that matter. I know about using - Navigate etc for a specific web page. This is different. The page I go to require authentication with word art and user name and password etc. Then once there you must click a few times to get where you are going for the data. Therefore running a script to direct me to a certain webpage will not work. The page has to be displayed first and then downloaded. So, I just want to take whatever is currently on the browser, mainly due to catch refreshes of new data. Understand the page I am getting data from is generated as from a database so I can not simply run a script file to download that page. The program must be able to simple be activated and write to disk what ever is displayed on the browser at the time the program is run. I have a seperate data extractor to run against the new file to find the data I need. This file can always have the same name as well and be overwritten. Thanks, Antone

    J A 2 Replies Last reply
    0
    • A Antone Eason

      Greetings, I would like to know if anyone has any code or even VB script, Perl etc that can simply be run from a command line to download whatever page is on a browser and write the raw HTML to a file, it can also be MHT for that matter. I know about using - Navigate etc for a specific web page. This is different. The page I go to require authentication with word art and user name and password etc. Then once there you must click a few times to get where you are going for the data. Therefore running a script to direct me to a certain webpage will not work. The page has to be displayed first and then downloaded. So, I just want to take whatever is currently on the browser, mainly due to catch refreshes of new data. Understand the page I am getting data from is generated as from a database so I can not simply run a script file to download that page. The program must be able to simple be activated and write to disk what ever is displayed on the browser at the time the program is run. I have a seperate data extractor to run against the new file to find the data I need. This file can always have the same name as well and be overwritten. Thanks, Antone

      J Offline
      J Offline
      jschell
      wrote on last edited by
      #2

      Antone Eason wrote:

      Understand the page I am getting data from is generated as from a database so I can not simply run a script file to download that page.

      Of course you can. If the user does it simply to get to that page then you can write a script to do the same thing. But if you really want to grab the browser then you must use something that interacts with the browser and then some other piece of software, presumably to save it. Autoit on windows would do that.

      1 Reply Last reply
      0
      • A Antone Eason

        Greetings, I would like to know if anyone has any code or even VB script, Perl etc that can simply be run from a command line to download whatever page is on a browser and write the raw HTML to a file, it can also be MHT for that matter. I know about using - Navigate etc for a specific web page. This is different. The page I go to require authentication with word art and user name and password etc. Then once there you must click a few times to get where you are going for the data. Therefore running a script to direct me to a certain webpage will not work. The page has to be displayed first and then downloaded. So, I just want to take whatever is currently on the browser, mainly due to catch refreshes of new data. Understand the page I am getting data from is generated as from a database so I can not simply run a script file to download that page. The program must be able to simple be activated and write to disk what ever is displayed on the browser at the time the program is run. I have a seperate data extractor to run against the new file to find the data I need. This file can always have the same name as well and be overwritten. Thanks, Antone

        A Offline
        A Offline
        Antone Eason
        wrote on last edited by
        #3

        Thank you everyone for all your replies. I have no updates on the issue as I do not have time to pursue this project any longer. See ya, Antone

        1 Reply Last reply
        0
        Reply
        • Reply as topic
        Log in to reply
        • Oldest to Newest
        • Newest to Oldest
        • Most Votes


        • Login

        • Don't have an account? Register

        • Login or register to search.
        • First post
          Last post
        0
        • Categories
        • Recent
        • Tags
        • Popular
        • World
        • Users
        • Groups