|
|
|
|
![]() ![]() |
Jan 14 2005, 11:07 PM
Post
#1
|
|
|
Super Member ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Group: Members Posts: 399 Joined: 14-November 04 From: Elysium Member No.: 2,280 |
Robots.txt is a file to keep robots from indexing
certain portions of the site. I was looking at trap17's robot.txt, here's a copy: CODE User-agent: * Disallow: /stylesheets/ User-agent: Googlebot Disallow: /stylesheets/ One question. If you specify '*', then why would one need to specify the Googlebot? |
|
|
|
Jan 15 2005, 01:54 AM
Post
#2
|
|
|
delete me ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Group: Members Posts: 518 Joined: 18-September 04 From: delete me Member No.: 1,185 |
maybe its just redundant, or maybe some bots don't recognize the asterisk
|
|
|
|
Jan 15 2005, 02:01 AM
Post
#3
|
|
|
Administrator ![]() Group: Admin Posts: 1,459 Joined: 11-June 04 From: Somewhere in Time & Space. Member No.: 1 |
Oh.. I was restricting google from indexing some files and then there was not any need.
I simply forgot to remove that thing. However, I think I kept it because if I need to add somethign else, then I wont have to search everything back and look for the exact syntax. In short.. I was lazy But Google does recognise more complex sets of instruction at robots.txt file than other bots. Examples are special wildcards which allows to filter urls .. etc. |
|
|
|
Jan 15 2005, 11:43 PM
Post
#4
|
|
|
Super Member ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Group: Members Posts: 378 Joined: 8-January 05 Member No.: 3,174 |
QUOTE(OpaQue @ Jan 15 2005, 04:01 AM) Oh.. I was restricting google from indexing some files and then there was not any need. I simply forgot to remove that thing. However, I think I kept it because if I need to add somethign else, then I wont have to search everything back and look for the exact syntax. In short.. I was lazy Laziness is good. If you're lazy you'll always keep handy things around I thought I could post a link to a quite nice robots.txt generator. It's quite simple; you enter whether bots are allowed or not, and then you can enter allowed/refused for a number of common bots if you want them to be different from the default. And then you can enter specific restricted directories. Nothing complex, but nice. And there are examples and links to pre-made robots.txt files. |
|
|
|
Jan 15 2005, 11:50 PM
Post
#5
|
|
|
Super Member ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Group: Members Posts: 399 Joined: 14-November 04 From: Elysium Member No.: 2,280 |
Alright, just checking... I found almost the same thing at quite a few other
sites like cnn.com, microsoft.com, and howstuffworks.com Are there any robots that don't recognize the asterisk? |
|
|
|
Jan 19 2005, 11:36 AM
Post
#6
|
|
|
Super Member ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Group: Members Posts: 378 Joined: 8-January 05 Member No.: 3,174 |
QUOTE(osknockout @ Jan 16 2005, 01:50 AM) Alright, just checking... I found almost the same thing at quite a few other sites like cnn.com, microsoft.com, and howstuffworks.com Are there any robots that don't recognize the asterisk? As far as I know Googlebot is the only bot that recognises "*.gif" and such. Other bots don't understand the asterix. |
|
|
|
![]() ![]() |
Similar Topics
|
Lo-Fi Version | Time is now: 29th August 2008 - 08:27 PM |