Jul 25, 2008

Robots.txt

osknockout
Robots.txt is a file to keep robots from indexing
certain portions of the site.

I was looking at trap17's robots.txt; here's a copy:

CODE

User-agent: *
Disallow: /stylesheets/

User-agent: Googlebot
Disallow: /stylesheets/


One question. If you specify '*', then why would one need to
specify the Googlebot?
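
One way to check whether the second group changes anything is to feed the rules to Python's standard urllib.robotparser. This is only a minimal sketch: the blocked() helper, the bot name SomeOtherBot and the file main.css are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# The rules quoted in this post, copied verbatim.
RULES = """\
User-agent: *
Disallow: /stylesheets/

User-agent: Googlebot
Disallow: /stylesheets/
"""


def blocked(agent: str, path: str) -> bool:
    """Return True if `agent` may not fetch `path` under RULES."""
    parser = RobotFileParser()
    parser.parse(RULES.splitlines())
    return not parser.can_fetch(agent, path)


if __name__ == "__main__":
    # SomeOtherBot is a hypothetical crawler that only matches the '*' group.
    for agent in ("Googlebot", "SomeOtherBot"):
        print(agent, blocked(agent, "/stylesheets/main.css"))
```

Both agents come back blocked for /stylesheets/main.css, so with these particular rules the Googlebot group gives the same result as the '*' group alone.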


alperuzi
Maybe it's just redundant, or maybe some bots don't recognize the asterisk.


OpaQue
Oh.. I was restricting Google from indexing some files, and later there wasn't any need for that.
I simply forgot to remove that group. However, I think I kept it because if I need to add something else, I won't have to search around for the exact syntax again. In short.. I was lazy :P
But Google does recognise more complex sets of instructions in robots.txt than other bots do. Examples are special wildcards that let you filter URLs, etc.
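
To give an idea of what those wildcards do, here is a rough sketch of how such a pattern could be turned into a regular expression. It assumes the semantics Google describes for Googlebot ('*' matches any run of characters, a trailing '$' anchors the end of the URL); the function names pattern_to_regex and matches are made up for this example and are not part of any crawler.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a wildcard path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, everything else is literal, and the
    pattern always matches from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' was.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matches(pattern: str, path: str) -> bool:
    """Return True if `path` is covered by the wildcard `pattern`."""
    return pattern_to_regex(pattern).match(path) is not None


if __name__ == "__main__":
    print(matches("/*.gif$", "/images/logo.gif"))
    print(matches("/stylesheets/", "/stylesheets/main.css"))
```

A pattern without any wildcard still behaves as a plain prefix match, which is all the original robots.txt convention asks of a bot.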


bjrn
QUOTE(OpaQue @ Jan 15 2005, 04:01 AM)
Oh.. I was restricting Google from indexing some files, and later there wasn't any need for that.
I simply forgot to remove that group. However, I think I kept it because if I need to add something else, I won't have to search around for the exact syntax again. In short.. I was lazy :P


Laziness is good. If you're lazy you'll always keep handy things around :P

I thought I could post a link to a quite nice robots.txt generator. It's quite simple; you enter whether bots are allowed or not, and then you can enter allowed/refused for a number of common bots if you want them to be different from the default. And then you can enter specific restricted directories.

Nothing complex, but nice. And there are examples and links to pre-made robots.txt files.
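
For anyone who would rather script it, the same idea fits in a few lines of Python. This is not the generator mentioned above, just a sketch of the inputs described (a default, per-bot exceptions, restricted directories); build_robots_txt and its parameter names are invented for the example.

```python
def build_robots_txt(default_allowed, per_bot=None, restricted_dirs=()):
    """Build robots.txt text from three simple inputs.

    default_allowed: whether bots in general may crawl the site.
    per_bot: mapping of bot name -> allowed (True/False) for exceptions.
    restricted_dirs: directories blocked for every bot that is allowed.
    """
    def group(agent, allowed):
        lines = [f"User-agent: {agent}"]
        if not allowed:
            lines.append("Disallow: /")      # refuse the whole site
        elif restricted_dirs:
            lines.extend(f"Disallow: {d}" for d in restricted_dirs)
        else:
            lines.append("Disallow:")        # empty value: nothing is blocked
        return "\n".join(lines)

    # Named bots first, then the catch-all '*' group.
    groups = [group(bot, ok) for bot, ok in (per_bot or {}).items()]
    groups.append(group("*", default_allowed))
    return "\n\n".join(groups) + "\n"


if __name__ == "__main__":
    print(build_robots_txt(True, {"Googlebot": False}, ["/stylesheets/"]))
```

Each bot gets its own group because a bot that finds a group naming it is generally expected to use that group rather than the '*' one, so the restricted directories have to be repeated per group.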


osknockout
Alright, just checking... I found almost the same thing at quite a few other
sites like cnn.com, microsoft.com, and howstuffworks.com.

Are there any robots that don't recognize the asterisk?


bjrn
QUOTE(osknockout @ Jan 16 2005, 01:50 AM)
Alright, just checking... I found almost the same thing at quite a few other
sites like cnn.com, microsoft.com, and howstuffworks.com

Are there any robots that don't recognize the asterisk?


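As far as I know, Googlebot is the only bot that recognises path patterns like "*.gif" and such. Other bots don't understand the asterisk inside a Disallow path; the asterisk in "User-agent: *" is a different matter and is part of the basic robots.txt convention.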



