Page 1 of 1

Google strated to report strange URLs in my site

Posted: Tue Mar 24, 2015 8:13 am
by ldor
Hi everybody,

My site was migrated to another host recently. Right after that Google reported an increase in crawl errors.

The URLs Google reports are really strange, they look like this:
http://www.my_site_url.org/link1/link2/link2/link4.html

Each of these link1, link2, link3, etc. are meaningful on their own, I mean, I do have URLs like http://www.my_site_url.org/link1.html, http://www.my_site_url.org/link2.html, etc. But when they appear mixed together in a single URL, that is completely meaningless.

From the raw log files, I've noticed the same URLs crawled by Yandex, so this is not something specific just to Google.

Google reports such URLs with error codes 500, 503, 504. But how does it manage to find such URLs on my site? They should not exist at all?! This is the 2nd time I had to change the host in 5 years. On the first two hosts there was nothing like that. I did contact tech support but so far they have no idea...

Re: Google strated to report strange URLs in my site

Posted: Tue Mar 24, 2015 10:11 am
by raghumudaliar
Use robots.txt and disallow the components and plugins and themes folders which are unnecessary for Google

Re: Google strated to report strange URLs in my site

Posted: Tue Mar 24, 2015 1:02 pm
by ldor
Thank you. Actually, I do have such a robots.txt file but I've just noticed some new folders not mentioned in it. I've included those folders too (and erased some temporary folders). Hopefully that will help