Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
CODE PROJECT For Those Who Code
  • Home
  • Articles
  • FAQ
Community
  1. Home
  2. General Programming
  3. Visual Basic
  4. Problem with listing the URLs from html source code

Problem with listing the URLs from html source code

Scheduled Pinned Locked Moved Visual Basic
databasehelpcsharphtml
12 Posts 2 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • S Stephen McGuire

    Easiest way would be to cut/paste them yourself... You want to create a function that strips URL's from web page source code? Are you sure that is legal/ethical? Steve

    A Offline
    A Offline
    Aaron128
    wrote on last edited by
    #3

    I don't know Steve, but I'm trying to write a search engine and need a program to index hyperlinks into a database, Google did it. I would much appreciate any help though, thanks. :laugh:

    Aaron

    S 1 Reply Last reply
    0
    • A Aaron128

      I don't know Steve, but I'm trying to write a search engine and need a program to index hyperlinks into a database, Google did it. I would much appreciate any help though, thanks. :laugh:

      Aaron

      S Offline
      S Offline
      Stephen McGuire
      wrote on last edited by
      #4

      Ok. You may want to have a look at the Split function using 'space' as the delimiter. You could put each word into an Array or ArrayList. Loop the Array/ArrayList and look for words beginning 'http' or containing 'www' using the InStr function or something similar. Steve

      A 1 Reply Last reply
      0
      • S Stephen McGuire

        Ok. You may want to have a look at the Split function using 'space' as the delimiter. You could put each word into an Array or ArrayList. Loop the Array/ArrayList and look for words beginning 'http' or containing 'www' using the InStr function or something similar. Steve

        A Offline
        A Offline
        Aaron128
        wrote on last edited by
        #5

        OK, I will give it a go. Thanks;)

        Aaron

        S 1 Reply Last reply
        0
        • A Aaron128

          OK, I will give it a go. Thanks;)

          Aaron

          S Offline
          S Offline
          Stephen McGuire
          wrote on last edited by
          #6

          You may want to find the first instance of a URL before assigning anything to your array. Just to keep the array size down. You will probably need to use an ArrayList since it's capacity will increase dynamically and you don't know what the size will be. You should probably strip as much out as you can before populating your ArrayList. Steve

          A 1 Reply Last reply
          0
          • S Stephen McGuire

            You may want to find the first instance of a URL before assigning anything to your array. Just to keep the array size down. You will probably need to use an ArrayList since it's capacity will increase dynamically and you don't know what the size will be. You should probably strip as much out as you can before populating your ArrayList. Steve

            A Offline
            A Offline
            Aaron128
            wrote on last edited by
            #7

            Sorry if I'm being a bit annoying but, I was just wondering if it would be easier to use RichTextBox1.DetectURLs, because I have no idea on how to use the ArrayList, I'm not the best at VB.NET in the world:laugh: But I'm trying. Thanks though

            Aaron

            S 1 Reply Last reply
            0
            • A Aaron128

              Sorry if I'm being a bit annoying but, I was just wondering if it would be easier to use RichTextBox1.DetectURLs, because I have no idea on how to use the ArrayList, I'm not the best at VB.NET in the world:laugh: But I'm trying. Thanks though

              Aaron

              S Offline
              S Offline
              Stephen McGuire
              wrote on last edited by
              #8

              Nice idea but I'm afraid that is a property not a method. Its purpose is to automatically format a Uniform Resource Locator (URL) when it is typed into the control. Steve

              A 1 Reply Last reply
              0
              • S Stephen McGuire

                Nice idea but I'm afraid that is a property not a method. Its purpose is to automatically format a Uniform Resource Locator (URL) when it is typed into the control. Steve

                A Offline
                A Offline
                Aaron128
                wrote on last edited by
                #9

                I think I will try again with your idea, thanks;)

                Aaron

                S 1 Reply Last reply
                0
                • A Aaron128

                  I think I will try again with your idea, thanks;)

                  Aaron

                  S Offline
                  S Offline
                  Stephen McGuire
                  wrote on last edited by
                  #10

                  Look up the Split function in your help file. You will see an example of how to split a string into an Array or ArrayList. Get the data from the textbox into a string (strip out unwanted text at beginning). Then split it into an array. Loop each item in the array to see if it contains 'www', if it does, add it to your DataSet or write it to your database. That would be a start. Then you can play with it to make it more efficient. Steve

                  A 1 Reply Last reply
                  0
                  • S Stephen McGuire

                    Look up the Split function in your help file. You will see an example of how to split a string into an Array or ArrayList. Get the data from the textbox into a string (strip out unwanted text at beginning). Then split it into an array. Loop each item in the array to see if it contains 'www', if it does, add it to your DataSet or write it to your database. That would be a start. Then you can play with it to make it more efficient. Steve

                    A Offline
                    A Offline
                    Aaron128
                    wrote on last edited by
                    #11

                    Thanks, your help is much appreciated, thanks, it looks like you have helped a lot of people to day, Well Done!! Thanks again:-D

                    Aaron

                    S 1 Reply Last reply
                    0
                    • A Aaron128

                      Thanks, your help is much appreciated, thanks, it looks like you have helped a lot of people to day, Well Done!! Thanks again:-D

                      Aaron

                      S Offline
                      S Offline
                      Stephen McGuire
                      wrote on last edited by
                      #12

                      You're welcome!:cool:

                      1 Reply Last reply
                      0
                      Reply
                      • Reply as topic
                      Log in to reply
                      • Oldest to Newest
                      • Newest to Oldest
                      • Most Votes


                      • Login

                      • Don't have an account? Register

                      • Login or register to search.
                      • First post
                        Last post
                      0
                      • Categories
                      • Recent
                      • Tags
                      • Popular
                      • World
                      • Users
                      • Groups