Quote:
Originally Posted by Jacob
Doesnt work.
I can configure (or used to be able to configure) my webbroser to behave as a googlebot, and I could access any webpage I want: password or no password.
|
Browsers DON'T read the
robots.txt!!!
the idea of the robots.txt protocol is not that it blocks useragents ad-hoc, but the useragents that ARE bots can read the text file for the directives that pertain to them specifically, and act on those directives.
If the text files specifies that a certain files or folders should not be indexed by any bots
Code:
Useragent: *
Disallow: /folder/
or by a specific bot;
Code:
Useragent: Bot_UA
Disallow: /
the robots that honour the protocol should NOT index the files contained in the folder.