Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. Other Discussions
  3. IT & Infrastructure
  4. Problem with robots.txt Disallow

Problem with robots.txt Disallow

Scheduled Pinned Locked Moved IT & Infrastructure
questionhtmlcomagentic-aihelp
3 Posts 3 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • H Offline
    H Offline
    hasanali00
    wrote on last edited by
    #1

    Hi I have a problem with the robots.txt and google. I have this robots.txt file: User-agent: * Disallow: page1.html Disallow: dir_1/sub_dir_1/ Disallow: /data/ When I enter 'site:www.MySite.com' into Google search box, Goolge gets the content from the 'data' directory as well. Google should not have indexed the content of data directory. So why is google getting the results from 'data' directory, whereas I have disallowed it. How can I restrict everyone from accessing the data directory? Thanks

    E L 2 Replies Last reply
    0
    • H hasanali00

      Hi I have a problem with the robots.txt and google. I have this robots.txt file: User-agent: * Disallow: page1.html Disallow: dir_1/sub_dir_1/ Disallow: /data/ When I enter 'site:www.MySite.com' into Google search box, Goolge gets the content from the 'data' directory as well. Google should not have indexed the content of data directory. So why is google getting the results from 'data' directory, whereas I have disallowed it. How can I restrict everyone from accessing the data directory? Thanks

      E Offline
      E Offline
      Ed Poore
      wrote on last edited by
      #2

      You may need to remove the first /, as I think this takes it back to the root directory.


      If you're stuck in a rut: 1) Consult the documentation* 2) Google it 3) Ask a sensible question 4) Try an ancient ritualistic knowledge summoning (:badger::badger::badger:) dance :jig: 5) Phone :bob: * - If the documentation is MSDN > 6.0 then forget it!

      1 Reply Last reply
      0
      • H hasanali00

        Hi I have a problem with the robots.txt and google. I have this robots.txt file: User-agent: * Disallow: page1.html Disallow: dir_1/sub_dir_1/ Disallow: /data/ When I enter 'site:www.MySite.com' into Google search box, Goolge gets the content from the 'data' directory as well. Google should not have indexed the content of data directory. So why is google getting the results from 'data' directory, whereas I have disallowed it. How can I restrict everyone from accessing the data directory? Thanks

        L Offline
        L Offline
        Link2006
        wrote on last edited by
        #3

        Where did you put your robots.txt? It needs to be in root level. (ie. you should be able to read it from www.MySite.com/robots.txt) Also, the search result may be from google cache before you put the robots.txt up.

        1 Reply Last reply
        0
        Reply
        • Reply as topic
        Log in to reply
        • Oldest to Newest
        • Newest to Oldest
        • Most Votes


        • Login

        • Don't have an account? Register

        • Login or register to search.
        • First post
          Last post
        0
        • Categories
        • Recent
        • Tags
        • Popular
        • World
        • Users
        • Groups