Posted

Hi everyone,

We are all doing our best to get visitors to our sites, so this might be a slightly odd question. Is there any way to prevent search engines (like Google) from indexing my site, short of not publishing it?

I am going to set up a "members area" for a friend, but we do not want the individual pages to show up in search engines, as I have found they do with my own site.

 

-Thomas

Posted

I can't speak with much authority on this, but one key point is not to have links to those pages anywhere. I have a family site (billmccorddotcom) that I made one reference to here at the forum, and now it has a PR 3 from Google. Your friends must not reference their pages in other forums, guestbooks, etc.

 

The other thing that I think you can do is add a nospider line to your .htaccess file (someone else will have to tell you how; I am not an expert on this one).

 

Good luck!

 

Mr. Bill :Nerd:

Posted (edited)

You change the robots.txt file. I found this

 

If you want to keep it a little more private (without having to password-protect it), add this to your robots.txt file:

 

User-agent: *
Disallow: /hidden.html

 

Boxturt on TCH forum

 

Obviously, replace hidden.html with the page(s) and dir(s) not to index.
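
For anyone who wants to check a rule before uploading it, here is a minimal sketch using Python's standard urllib.robotparser module. The rule text and the paths /hidden.html and /index.html are placeholders, and is_allowed is a helper name made up for this example. It only shows how a crawler that follows robots.txt would read the rule; it does not prove what any particular search engine will do.

```python
from urllib.robotparser import RobotFileParser

# Placeholder rules for illustration; swap in your own pages or directories.
RULES = """\
User-agent: *
Disallow: /hidden.html
"""


def is_allowed(rules_text: str, path: str, agent: str = "*") -> bool:
    """Return True if a crawler honoring these rules may fetch the path."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed.
    parser.parse(rules_text.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for path in ("/hidden.html", "/index.html"):
        print(path, "allowed" if is_allowed(RULES, path) else "blocked")
```

Note that the User-agent and Disallow lines sit directly under each other. A blank line between them ends the record in this parser, so the rule would be ignored.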

 

Also, I think you could password-protect the section, and then the spiders wouldn't be able to get in.

 

*** If you haven't put a robots.txt file on your site, then you probably don't have one. It's just a plain text file placed in the public_html directory of your site. I would advise reading a tutorial on it, because if you goof it up you might send all spiders away from all pages.
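
One way to catch that kind of goof before uploading is sketched below, again with Python's standard urllib.robotparser. The function name blocks_everything and the sample rule strings are made up for this example, and it only tests the site root, so treat it as a rough sanity check and not a full validator.

```python
from urllib.robotparser import RobotFileParser


def blocks_everything(rules_text: str, agent: str = "*") -> bool:
    """Return True if these rules keep the given crawler away from the site root."""
    parser = RobotFileParser()
    parser.parse(rules_text.splitlines())
    # "Disallow: /" matches every path, so the root is a quick test for it.
    return not parser.can_fetch(agent, "/")


if __name__ == "__main__":
    goofed = "User-agent: *\nDisallow: /\n"          # sends all spiders away
    intended = "User-agent: *\nDisallow: /members/\n"  # hides one directory only
    print("goofed blocks everything:", blocks_everything(goofed))
    print("intended blocks everything:", blocks_everything(intended))
```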

Edited by surefire
Posted

Here are a couple of good links on robots.txt:

 

A tutorial:

www.searchengineworld.com/robots/robots_tutorial.htm

 

A validator (to make sure everything is OK):

www.searchengineworld.com/cgi-bin/robotcheck.cgi

 

 

Just remember -

1) robots.txt MUST be in your public_html folder and

2) you must have a carriage return (new line) after your last "Disallow" line.
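
If you would rather generate the file than type it by hand, the two points above can be sketched in Python like this. The names build_robots_txt and write_robots_txt are invented for this example, and the public_html path is an assumption about your hosting layout, so adjust it to wherever your site's top-level folder really lives.

```python
from pathlib import Path


def build_robots_txt(disallowed_paths):
    """Build robots.txt text for all crawlers, ending with a newline."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in disallowed_paths]
    # Joining and adding "\n" guarantees a line break after the last Disallow.
    return "\n".join(lines) + "\n"


def write_robots_txt(doc_root, disallowed_paths):
    """Write robots.txt into the given folder (assumed to be public_html)."""
    target = Path(doc_root) / "robots.txt"
    target.write_text(build_robots_txt(disallowed_paths), encoding="ascii")
    return target


if __name__ == "__main__":
    # Prints the text only; call write_robots_txt("public_html", [...]) to save it.
    print(build_robots_txt(["/members/", "/hidden.html"]), end="")
```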

 

Make sure that you set up your robots.txt as soon as you publish a site. That way YOU control which parts of your site will get spidered. Once a page is in Google, it takes a long time for it to disappear!

 

Good luck,

Dave
