Welcome Guest ( Log In | Register)



 
Reply to this topicStart new topic
> Robots.txt
osknockout
post Jan 14 2005, 11:07 PM
Post #1


Super Member
*********

Group: Members
Posts: 399
Joined: 14-November 04
From: Elysium
Member No.: 2,280



Robots.txt is a file to keep robots from indexing
certain portions of the site.

I was looking at trap17's robot.txt, here's a copy:

CODE

User-agent: *
Disallow: /stylesheets/

User-agent: Googlebot
Disallow: /stylesheets/


One question. If you specify '*', then why would one need to
specify the Googlebot?
Go to the top of the page
 
+Quote Post
alperuzi
post Jan 15 2005, 01:54 AM
Post #2


delete me
*********

Group: Members
Posts: 518
Joined: 18-September 04
From: delete me
Member No.: 1,185



maybe its just redundant, or maybe some bots don't recognize the asterisk
Go to the top of the page
 
+Quote Post
OpaQue
post Jan 15 2005, 02:01 AM
Post #3


Administrator
Group Icon

Group: Admin
Posts: 1,459
Joined: 11-June 04
From: Somewhere in Time & Space.
Member No.: 1



Oh.. I was restricting google from indexing some files and then there was not any need.
I simply forgot to remove that thing. However, I think I kept it because if I need to add somethign else, then I wont have to search everything back and look for the exact syntax. In short.. I was lazy tongue.gif
But Google does recognise more complex sets of instruction at robots.txt file than other bots. Examples are special wildcards which allows to filter urls .. etc.
Go to the top of the page
 
+Quote Post
bjrn
post Jan 15 2005, 11:43 PM
Post #4


Super Member
*********

Group: Members
Posts: 378
Joined: 8-January 05
Member No.: 3,174



QUOTE(OpaQue @ Jan 15 2005, 04:01 AM)
Oh.. I was restricting google from indexing some files and then there was not any need.
I simply forgot to remove that thing. However, I think I kept it because if I need to add somethign else, then I wont have to search everything back and look for the exact syntax. In short.. I was lazy tongue.gif
*


Laziness is good. If you're lazy you'll always keep handy things around tongue.gif

I thought I could post a link to a quite nice robots.txt generator. It's quite simple; you enter whether bots are allowed or not, and then you can enter allowed/refused for a number of common bots if you want them to be different from the default. And then you can enter specific restricted directories.

Nothing complex, but nice. And there are examples and links to pre-made robots.txt files.
Go to the top of the page
 
+Quote Post
osknockout
post Jan 15 2005, 11:50 PM
Post #5


Super Member
*********

Group: Members
Posts: 399
Joined: 14-November 04
From: Elysium
Member No.: 2,280



Alright, just checking... I found almost the same thing at quite a few other
sites like cnn.com, microsoft.com, and howstuffworks.com

Are there any robots that don't recognize the asterisk?
Go to the top of the page
 
+Quote Post
bjrn
post Jan 19 2005, 11:36 AM
Post #6


Super Member
*********

Group: Members
Posts: 378
Joined: 8-January 05
Member No.: 3,174



QUOTE(osknockout @ Jan 16 2005, 01:50 AM)
Alright, just checking... I found almost the same thing at quite a few other
sites like cnn.com, microsoft.com, and howstuffworks.com

Are there any robots that don't recognize the asterisk?
*


As far as I know Googlebot is the only bot that recognises "*.gif" and such. Other bots don't understand the asterix.
Go to the top of the page
 
+Quote Post

Reply to this topicStart new topic

Collapse

> Similar Topics

Topics Topics
  1. Us War Robots Go Bad(6)
  2. People Will Marry Robots In Future, Says Scientist!(21)
  3. Robots : Our Future Or Our End(75)
  4. Robots That Can Touch(19)
  5. Want To Know Anything About Robots Today?(8)
  6. Question About Robots.txt File(4)
  7. Dancing Robots(23)
  8. Robots And Sub-domains(2)
  9. And What Does Google Want With Microsoft?(11)
  10. 2 New Packages Angels And Robots(2)
  11. Modifying The Robosapien(4)
  12. From Passive Applications To Sentient Machines(3)
  13. New Generation Of Robots(13)
  14. Encrypt Your Email Address For Websites(15)
  15. Doctor's Becoming Robots(10)
  1. Robogen Free But Limited Edition(3)
  2. Robot's(1)
  3. The new Movie Robots(3)
  4. robots.txt for stopping some spambots (very long post)(7)


 



- Lo-Fi Version Time is now: 29th August 2008 - 08:27 PM