The Joomla! Forum ™



This topic is locked, you cannot edit posts or make further replies.  [ 34 posts ]  Go to page 1, 2  Next
 Post subject: Stop Googling cache
PostPosted: Sun Jan 22, 2006 4:07 pm 
Joomla! Apprentice

Joined: Sun Sep 04, 2005 7:36 am
Posts: 17
:)
Hello,
Where do I put this code (given by Google) to stop indexing the cache by Google?
They tell me to put it in the HTML, but the index.php looks like a bit of uncooked spaghetti to me.
This is the code:

I tried here and there, but as a result my site was unreachable,
so I need a bit of advice, I guess.
Thanks


Post subject: Re: Stop Googling cache
PostPosted: Mon Jan 23, 2006 3:07 am 
Joomla! Guru

Joined: Wed Aug 17, 2005 11:26 pm
Posts: 903
The robots.txt that came with your Joomla installation included the following:

Code:
User-agent: *
Disallow: /administrator/
Disallow: /cache/
Disallow: /components/
Disallow: /editor/
Disallow: /help/
Disallow: /images/
Disallow: /includes/
Disallow: /language/
Disallow: /mambots/
Disallow: /media/
Disallow: /modules/
Disallow: /templates/
Disallow: /installation/


Google respects the robots.txt and will not index your cache. 
Please note: robots.txt tells all search bots that respect it not to crawl those directories. "Crawl" is the key word here: compliant bots do not fetch anything under those paths, although a disallowed URL can still be listed (without its content) if other pages link to it.

The meta tag that you have given here should NOT be placed in your index.php. The cache just saves content so that it does not have to be regenerated every time a visitor loads it. You cannot specify separate meta tags for the cache. If you add tags to your index.php to block search bots, none of your site will be indexed by search engines.
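
For anyone who wants to check which paths the rules quoted above apply to, here is a minimal sketch using Python's standard urllib.robotparser. The bot name "ExampleBot" and the sample paths are made up for illustration, and only three of the Disallow lines are copied to keep it short.

```python
from urllib.robotparser import RobotFileParser

# A shortened copy of the default Joomla rules quoted above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /administrator/
Disallow: /cache/
Disallow: /templates/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if a compliant bot named `agent` may request `path`."""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    rules = build_parser(ROBOTS_TXT)
    # Hypothetical paths: one inside the cache directory, one normal page.
    for path in ("/cache/page.html", "/index.php"):
        print(path, "allowed" if is_allowed(rules, path) else "blocked")
```

This only shows what a well-behaved bot is asked to do; it says nothing about bots that ignore robots.txt.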

_________________
For Mambo assistance: http://forum.mambo-foundation.org
Open Source Research & Best Practice: http://osprojects.info


Post subject: Re: Stop Googling cache
PostPosted: Wed Jan 25, 2006 10:32 pm 
Joomla! Apprentice

Joined: Thu Aug 18, 2005 6:38 pm
Posts: 11
Location: Pelham, NH USA
Elpie, the question may not be to prevent Google from indexing the cache directory; it may be how to prevent the site from showing up in Google's cache. :) Here's the documentation from Google to prevent your site from getting cached:

http://www.google.com/webmasters/remove.html#uncache

_________________
Signature rules: Literal URLs only - viewtopic.php?f=8&t=65


Last edited by Sean on Wed Jan 25, 2006 10:38 pm, edited 1 time in total.

Post subject: Re: Stop Googling cache
PostPosted: Wed Jan 25, 2006 11:46 pm 
Joomla! Guru

Joined: Wed Aug 17, 2005 11:26 pm
Posts: 903
Sean,
The question was:
Quote:
Where do I put this code (given by Google) to stop indexing the cache by Google?


The Google cache should not ever contain links to any of the directories disallowed in robots.txt.

_________________
For Mambo assistance: http://forum.mambo-foundation.org
Open Source Research & Best Practice: http://osprojects.info


Post subject: Re: Stop Googling cache
PostPosted: Tue Mar 21, 2006 4:28 am 
Joomla! Enthusiast

Joined: Tue Jan 31, 2006 8:22 pm
Posts: 108
I put that noarchive tag in my template file...

It appears on each page, so it tells Google not to cache...


Post subject: Re: Stop Googling cache
PostPosted: Mon Nov 03, 2008 1:35 am 
Joomla! Apprentice

Joined: Wed Jun 04, 2008 6:14 pm
Posts: 44
You should probably put this

<meta name="robots" content="noarchive">

or

<meta name="googlebot" content="noarchive">

in the HEAD section of the index.php located in your /templates/your-joomla-template/ folder.

Make sure you put it in the right place.
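
One way to confirm the tag really ends up in the rendered page is to scan the HTML for it. The sketch below uses Python's standard html.parser; the class and function names, and the sample markup, are hypothetical and not taken from any Joomla template.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collect the content values of robots/googlebot meta tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values may be None (e.g. <meta name>), so default to "".
        attr = {k.lower(): (v or "") for k, v in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for item in attr.get("content", "").split(","):
                self.directives.add(item.strip().lower())


def has_noarchive(html: str) -> bool:
    """Return True if the markup carries a noarchive robots directive."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return "noarchive" in finder.directives


if __name__ == "__main__":
    # Hypothetical template output used only for this example.
    page = '<html><head><meta name="robots" content="noarchive"></head><body></body></html>'
    print(has_noarchive(page))
```

In practice you would feed it the source of a page from your own site (for example, saved from the browser's "view source") rather than the inline sample.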

_________________
http://balkan-online.com/ | http://users.teol.net/~santic/


Post subject: Re: Stop Googling cache
PostPosted: Thu Dec 11, 2008 9:30 am 
I've been banned!

Joined: Fri Oct 17, 2008 8:24 am
Posts: 27
Location: Shenzhen,China
The robots.txt file is the best choice for you.


Post subject: Re: Stop Googling cache
PostPosted: Sun Jan 11, 2009 8:24 am 
Joomla! Apprentice

Joined: Tue Dec 02, 2008 5:12 pm
Posts: 6
You need to add Disallow: /admin as well to stop spiders from indexing anything that starts with /admin, including:
/adminisrator
/administrator/
/adminibator
/admin/
/admin.html
/admin.php
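
A rough way to see which paths a single Disallow: /admin line would cover, assuming the plain prefix matching implemented by Python's standard urllib.robotparser (the bot name and the helper function are made up for illustration):

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt containing only the rule suggested above.
RULES = ["User-agent: *", "Disallow: /admin"]


def blocked_paths(paths, rules=RULES, agent="ExampleBot"):
    """Return the subset of `paths` that the rules tell `agent` not to request."""
    parser = RobotFileParser()
    parser.parse(rules)
    return [p for p in paths if not parser.can_fetch(agent, p)]


if __name__ == "__main__":
    sample = ["/admin", "/administrator/", "/admin.php", "/index.php"]
    print(blocked_paths(sample))
```

Because the rule has no trailing slash, it matches every path that begins with /admin, not only a directory of that name.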

_________________
Please read forum rules regarding signatures: viewtopic.php?f=8&t=65


Post subject: Re: Stop Googling cache
PostPosted: Fri Feb 13, 2009 2:58 am 
Joomla! Fledgling

Joined: Sun Dec 21, 2008 12:14 pm
Posts: 4
Location: australia
Is it better to use the above, the Google Webmaster Tools, or both?

_________________
Please read forum rules regarding signatures: viewtopic.php?t=65


Post subject: Re: Stop Googling cache
PostPosted: Mon Mar 02, 2009 4:40 am 
Joomla! Apprentice

Joined: Thu Jan 18, 2007 6:51 am
Posts: 7
I'm going to try that code

_________________
http://www.namc.us


Post subject: Re: Stop Googling cache
PostPosted: Mon Mar 16, 2009 4:19 pm 
Joomla! Intern

Joined: Mon Sep 15, 2008 6:54 pm
Posts: 51
Hi folks,
I found this thread whilst looking for a way to insert the noarchive tag.

I was trying to find a meta tag related component or plugin, or whatever, that would allow me to set site-wide meta tags. All the ones I tried (including the Metatag system in sh404sef) seemed to be geared towards setting tags on a URL-by-URL (page-by-page) basis.

I gather J!1.5.x has no built-in way to set site-wide meta tags. Correct?

Is the only way to set a site-wide meta tag (a robots tag with a value of noarchive) to hard-code it into the template index.php page?

Also, am I correct in thinking there is no tag I can put into the robots.txt file that will tell all search engines not to cache the site?
I have one that tells the Internet Archive robot not to cache the site... but I want to get rid of as many caches of my site as possible.

Any and all help is greatly appreciated.

With thanks,

Jonathan


Post subject: Re: Stop Googling cache
PostPosted: Wed Apr 08, 2009 11:39 am 
I've been banned!

Joined: Wed Feb 25, 2009 6:08 pm
Posts: 65
Location: Texas USA
Another way, of course, is to just set up spider traps for the search engines and hackers that go trolling for files on your hosting server.

Did you know that hackers will often try to backdoor or download everything in your website using some pretty funky programs such as backdoor browser etc.?

I simply turn all of the index.html files into index.php and make them execute LOTS OF CODE.

The reaction on the poor guy's face when his computer crashes depends on what kind of machine winds up searching the site.

Poploops, and of course the notorious references to hostile JavaScript, always do a number on the script kiddies who try to hack but have to use tools.

Now, this doesn't keep the REAL hackers away; if they want to get in, THEY WILL!!!!

_________________
Always Make Backups | Make a test site to test components on first
Always read the reviews on the extension before downloading. | Always read the manual before posting.
Signature rules - No more then 4 lines allowed


Post subject: Re: Stop Googling cache
PostPosted: Tue May 19, 2009 10:50 am 
Joomla! Apprentice

Joined: Tue May 19, 2009 1:52 am
Posts: 18
Well, back to the topic, my solution is:
1. Put the noarchive tag in the template index.php, so Google will not cache the site.
2. Use the webmaster tools to remove the data already in the cache.

As for website crawlers/spiders/downloaders, the only thing I am using is anti-flood protection; other measures may hurt you with the search engines' indexing bots.

eclark461 - what do you mean by making a page execute lots of code? On the client or the server side? How does that help? Does that make the page take longer to load?


Post subject: Re: Stop Googling cache
PostPosted: Thu Jun 11, 2009 6:40 pm 
Joomla! Fledgling

Joined: Sat Jun 06, 2009 7:38 am
Posts: 1
Can I use both robots.txt and a meta tag?

_________________
my homepage www.ebusiness05.com e-business knowledge & directory
Please read forum rules regarding signatures: viewtopic.php?t=65


Post subject: Re: Stop Googling cache
PostPosted: Fri Jun 12, 2009 12:29 pm 
Joomla! Enthusiast

Joined: Sat Apr 18, 2009 9:38 pm
Posts: 140
Location: UK
Hi, my robots.txt doesn't seem to be working. I can see in my cPanel that all pages are accessed by search bots. Which of the solutions above can help? Thanks

_________________
http://www.hoodiadietworks.com | http://www.findiet.com


Post subject: Re: Stop Googling cache
PostPosted: Fri Jun 12, 2009 6:00 pm 
Joomla! Master

Joined: Mon Mar 20, 2006 1:56 am
Posts: 11645
Location: The Girly Side of Joomla in Sussex
youryoru wrote:
Can I use both robots.txt and a meta tag?


In the simplest terms, without the robots.txt file, the search bot will just blindly follow all links it finds.
"Meta tag" is a broad term; I think you mean the description or the nofollow tag. There is no reason why you can't use both methods.

_________________
HU2HY- Poor questions = Poor answer
Un requested Help PM's will be added to the foe list and possibly just deleted
{Community.Connect Administrator }{ Showcase & Security Moderator}


Post subject: Re: Stop Googling cache
PostPosted: Sun Jun 28, 2009 9:00 pm 
Joomla! Apprentice

Joined: Sun Jun 28, 2009 8:29 pm
Posts: 19
Setting <meta name="robots" content="noarchive"> in your template (it goes in the head) will solve this. I would strongly recommend not disallowing in robots.txt unless you want search engines to totally skip your site.

_________________
Only literal URLs allowed. See forum rules for details.


Post subject: Re: Stop Googling cache
PostPosted: Sun Jun 28, 2009 9:29 pm 
Joomla! Master

Joined: Mon Mar 20, 2006 1:56 am
Posts: 11645
Location: The Girly Side of Joomla in Sussex
Vespre wrote:
Setting <meta name="robots" content="noarchive"> in your template (it goes in the head) will solve this. I would strongly recommend not disallowing in robots.txt unless you want search engines to totally skip your site.


Slightly off the mark, but Google should ignore the folders listed in robots.txt. Google will still have access to the main content, which is what you want - sometimes.

In my case I don't want Google anywhere near my admin or cache folders, as it will fall into a honeytrap and get itself banned; therefore the no-cache/noarchive directive is wrong, it must be nofollow.

_________________
HU2HY- Poor questions = Poor answer
Un requested Help PM's will be added to the foe list and possibly just deleted
{Community.Connect Administrator }{ Showcase & Security Moderator}


Post subject: Re: Stop Googling cache
PostPosted: Sun Jun 28, 2009 9:46 pm 
Joomla! Apprentice

Joined: Sun Jun 28, 2009 8:29 pm
Posts: 19
Hmm, yes, you are right of course. I misunderstood the original question; I thought it was how to stop Google from caching rather than keeping Google out of the cache. Sorry about that.

_________________
Only literal URLs allowed. See forum rules for details.


Post subject: Re: Stop Googling cache
PostPosted: Tue Jul 07, 2009 3:02 pm 
Joomla! Explorer

Joined: Wed Feb 28, 2007 2:46 am
Posts: 276
Location: Oslo, Norway
You can also use this for simple procedures:

<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
<META NAME="ROBOTS" CONTENT="NOARCHIVE">

_________________
My blog: http://seonomics.com
I'm also a Search Engine Land Columnist: http://searchengineland.com/author/datatrond


Post subject: Re: Stop Googling cache
PostPosted: Tue Jul 07, 2009 3:19 pm 
Joomla! Apprentice

Joined: Tue Jul 07, 2009 3:06 pm
Posts: 8
Use the noarchive in the head tag to prevent Google from caching your website. The sh404SEF extension can help you with this.

_________________
My dog has it's own website at: http://www.hundefeber.no


Post subject: Re: Stop Googling cache
PostPosted: Fri Jul 17, 2009 1:52 am 
Joomla! Fledgling

Joined: Fri Jul 17, 2009 1:20 am
Posts: 2
I found this link from Google, which explains how to do that:
http://www.google.com/support/webmaster ... swer=35306

Also, do not forget to go to the URL removal tool in Webmaster Tools to request expedited removal of the cached content for a minimum of six months.

Thanks
Raj

_________________
http://softwareqatestings.com Software Testing : "Quality is never an accident; it is always the result of intelligent effort."
Signature rules - Literal URL's Only


Post subject: Re: Stop Googling cache
PostPosted: Fri Jul 17, 2009 8:37 am 
Joomla! Explorer

Joined: Wed Feb 28, 2007 2:46 am
Posts: 276
Location: Oslo, Norway
Yes, I forgot to mention this. Google Webmastertools is absolutely valuable! :-) I use this so much that I simply forgot how unique this service is :-[

_________________
My blog: http://seonomics.com
I'm also a Search Engine Land Columnist: http://searchengineland.com/author/datatrond


Post subject: Re: Stop Googling cache
PostPosted: Sun Jul 19, 2009 4:12 pm 
Joomla! Fledgling

Joined: Fri Jul 17, 2009 1:20 am
Posts: 2
Yes, some pages may need it. Suppose a page contains download information for a product. Later on you decide to hide it from the public so that only registered users have access. But by that time Google has indexed and cached it.
You see what I mean...

Thanks
Raj

_________________
http://softwareqatestings.com Software Testing : "Quality is never an accident; it is always the result of intelligent effort."
Signature rules - Literal URL's Only


Post subject: Re: Stop Googling cache
PostPosted: Mon Jul 27, 2009 8:11 am 
Joomla! Apprentice

Joined: Fri Jul 24, 2009 8:35 am
Posts: 38
I would use google webmastertools. :-)

_________________
http://www.alutec.no/


Post subject: Re: Stop Googling cache
PostPosted: Sun Sep 06, 2009 7:29 pm 
Joomla! Apprentice

Joined: Fri Aug 21, 2009 5:18 pm
Posts: 29
Location: NY
If you want to stop Google from indexing your pages, a quick alternative is a noindex robots meta tag, <meta name="robots" content="noindex">, which keeps the page out of the index.
Meanwhile, what the posts above suggest is also good; just use both to play it safe.


Post subject: Re: Stop Googling cache
PostPosted: Sat Oct 10, 2009 3:47 am 
Joomla! Intern

Joined: Fri Feb 27, 2009 9:27 am
Posts: 90
Hi,

This is related to my problem with HTTPS: I want to place this code at the top of the page:
Code:
<?php
// Emit the tag only when the request arrives on the HTTPS port (443)
if ($_SERVER["SERVER_PORT"] == 443) { echo '<meta name="robots" content="noindex,nofollow">'; }
?>

But how can I do it? Please, can anyone help me?

Thank you

_________________
http://id.intogsm.com
http://www.intogsm.com


Post subject: Re: Stop Googling cache
PostPosted: Sun Dec 13, 2009 7:45 am 
Joomla! Fledgling

Joined: Sun Dec 13, 2009 7:35 am
Posts: 1
Code:
User-agent: *
Disallow: /

You can put this code in your robots.txt
and put the file in the root of your host. Note that Disallow: / keeps compliant bots away from your whole site, not just from the cache.


Or you can put this meta code in every page of your site:

<meta name="robots" content="noarchive">


But you have to know that Google will take some time to make this effective, maybe two weeks.


Post subject: Re: Stop Googling cache
PostPosted: Sun Dec 20, 2009 2:41 am 
Joomla! Apprentice

Joined: Thu Nov 23, 2006 11:14 am
Posts: 5
bt-t wrote:
I would use google webmastertools. :-)


I totally agree with this.. Google Webmaster Tools is the best solution ;)

_________________
Signature rules: Literal URLs only - viewtopic.php?f=8&t=65


Post subject: Re: Stop Googling cache
PostPosted: Wed Feb 10, 2010 11:45 am 
Joomla! Master

Joined: Mon Mar 20, 2006 1:56 am
Posts: 11645
Location: The Girly Side of Joomla in Sussex
lonor wrote:
2 weeks!!!! there is no faster way??? thanks :eek:

Yes: a big brown envelope containing your family's life savings and all your future earnings, sent to Google Inc.
Quote:
Build it and Google will come.
Or not, as the case may be, but the Google cache and the "Wayback Machine" will have a copy of it.

_________________
HU2HY- Poor questions = Poor answer
Un requested Help PM's will be added to the foe list and possibly just deleted
{Community.Connect Administrator }{ Showcase & Security Moderator}

