Google Ranking Drop Because Of Duplicate Content

Sunday, October 12th, 2008

This is a follow up post to my previous posts about my friend’s ranking drop. As you may remember, his Google ranking was restored a few weeks after he blocked the proxy website from copying his entire website and submitted a reinclusion request. As you may have guessed, he was quite thrilled to see his SERP ranking shoot up again.

Well, as luck would have it, I received a phone call last night from my friend telling me that his website was bombing again. I Googled his favorite and they seemed to rank fine over at my end, but he explained that he traffic stats from was flat. They nosedived a day or two ago. I chalked up the results I was getting to adjusting the results.

This new twist got me thinking. What in the world could be making this website’s ranking bounce around like this? Looking back, the proxy website may not have been 100% at fault. There has to be something else.

I began doing a little research and learned about few things about duplicate content. The reason I looked at that particular area is because there is absolutely nothing else I can find wrong with this website. Duplicate content seems to be a rather popular culprit.

I came across a pretty well laid out website called “Google Rankings Diagnostics” that describes a whole heck of a lot of issues you might be having with your website. This website validated what I pretty much already knew…that if you have multiple URLs (on a domain) with the same exact content, has trouble figuring out which page is the original and may throw all of them out.

I took a very close look at my friend’s website. Again, I took a unique line of text from his homepage and searched for it in (inside quotes). A funny thing happened. I saw the homepage result, but there were a few extra results as well, all on his domain. There were about 5 extra pages in total.

Now, some of these extra results have been there for years, so I don’t attribute the issue to those pages being duplicate content. What struck me was one of the extra pages.

A few months ago, my friend moved one of his pages. He put a 301 redirect in his .htaccess file, which was the correct thing to do. So now, the old directory where the page was held forwarded to a new page. It looked something like this:

Redirect 301 /olddirectory/ http://www.hiswebsite.com/newpage.php

The redirect worked fine, but here is what that extra page in the search results looked like:

http://www.hiswebsite.com/newpage.phpoldpage.php

Guess what page was showing at that URL…yup, the homepage. The dynamic nature of his website sends unknown page results like this to the homepage. This was a fluke. My friend forgot that there were pages inside the old directory he redirected to the new page. Every old page in that old directory was tacked on to the new page, like you see above. To make matters worse, there were a bunch of links from other websites pointing to the old pages in the old directory.

I am not sure if this would cause the ranking drops that he is experiencing, but the timing certainly lines up with when the issue began. It is also certainly considered duplicate content.

So, here is what I did to deal with the issue this time. I deleted the redirects in the .htaccess file and blocked the URLs of all those extra results in the robots.txt file. Hopefully, this will tell to not spider or index those pages and it will also tell that those links into the site are dead.

Now, we have to wait. I am not going to submit another reinclusion request to because I want to see if the ranking returns naturally. If it does, this was the problem for sure.

Related posts

Tags: , , , , , , , , , , , , , , , , , , , , ,

Google Ranking Restored

Thursday, September 25th, 2008

This is a follow up post to my “Sudden Drop In Google Ranking” post.

This morning, I checked the ranking of the website in question. To my surprise, the site had again ranked number 4 in the Google Search Engine Results. This was most definitely good news. In fact, all key phrases now ranked on page one of the Google SERPs.

I can only hope this persists. So, what did we do? Here is a short list:

- Noticed the website had dropped in Google ranking.
- Took a unique phrase from the website and searched Google using quotes, “like this.”
- Found a direct copy of the website and discovered it had been “Proxy Hijacked.”
- Found IP address of website that Proxy Hijacked our website and blocked it using the .htaccess file.
- Submitted a “Reconsideration Request” to Google.

After about a week and a half, our website had regained its ranking in Google.

I read a long article about Proxy Hijacking and it mentioned that Google had fixed the problem. If this was the case with my friend’s website, this certainly isn’t true. While I can not be totally sure Proxy caused this case of Google ranking loss, the facts seem to lead down this path.

What is my advice to you? Check either Google or Copyscape once a month to see if someone has taken text or Proxy Hijacked your website.

Related posts

Tags: , , , , , , , , , , , , , , , , , , , , , ,

How To Check Your Web Page HTTP Headers & Response Codes

Tuesday, September 16th, 2008

There may be cases when you would like to see what your webpage HTTP headers look like. Why? Well, because they are kind of important. As Wikipedia states, the HTTP headers define what the returned data looks like.

Still you ask, “Why in the world do I care about that?” Ok, I’ll keep going. The main reason I look at the HTTP headers is to find out what the HTTP status code is. The reason the status code is important to me is because this is the code the use for a multitude of things.

Let me give you a little example, and this related to my previous post regarding the sudden drop in Google rankings. As I was doing research into what the problem may be for this particular website, I came across an issue where someone had recently put custom “404 Not Found” error pages up on some of their websites. Everyone knows that custom “404 Not Found” error pages are cool, but what some people don’t know is that if those 404 error pages show a “200 OK” (successful HTTP requests) code, the site may be in big trouble, SEO-wise. The reason for this is because there are going to be many “404 Not Found” error pages on a dynamic website. If you have your custom “404 Not Found” error page showing a “200 OK” response code, the will think that all the instances of this page are duplicate. You know as well as I do, that spells trouble.

What’s worse is if you set your as your “404 Not Found” page. Your is going to return a response code of “200 OK.” That’s not good, because now you have multiple instances of your …all duplicate content.

It’s my opinion that the are smart enough to figure this out. The page (such as your ) with the highest Pagerank will prevail. Still, I have some websites that I am working on that have multiple instances of the and they all have Pagerank, which isn’t good, because the duplicates are taking the Pagerank from the real page. Now, again, that’s my opinion.

Here are two tips:

- How to check your HTTP headers - visit this website or just Google “Website header check”

- How to set a particular page as your “404 Not Found” error page in your .htaccess file - Just place this code in the file: “ 404 /404.php” without the quotes. The 404.php file is the actual error page in this case.

Related posts

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

Sudden Google Ranking Drop - Proxy Hijack

Tuesday, September 16th, 2008

Do you remember my article from yesterday about the sudden drop in Google search ranking for my friend’s ? Well, I just can’t stop thinking about it.

From what I have been reading, it seems as though my conclusion may be correct. At least I am hoping it is. If I ever conclude anything semi-concrete while thinking about Google, it’s a good day for me.

Ok, I found this very helpful and thorough website that pretty much described the exact problem my friend is having. It’s titled “Google Proxy Hijacking” and tells the whole story.

Here is what struck me as I think about this some more.

- My friend’s has been live since 2004.
- The site seemed to be in the Google sandbox for the entire 4 years.
- For his most competitive keywords, he was ranking past page 20 on Google.
- About two months ago, he made some changes to the as well as an HTML overhaul.
- About a month after that, the site ranked number 3 for his most competitive keywords.
- The site ranked on page 1 of Google for about a month.
- The site now sits at page 25 for its most competitive keywords.

Here is my theory. I think the has been hijacked for a number of years. This is what caused the poor rankings for such a long time. When the homepage text and HTML changes were made about 2 months ago, Google visited the site and found it unique. Google ranked the site well, due to this new unique content. During the month, Google noticed the was now a of my friend’s once again and dropped the ’s ranking.

Does that make sense? From what I read on the I linked to, it does.

Here are the similarities with what we are experiencing and what the author wrote on the other :

- My friend’s has never been banned.
- We did a quoted Google for supposedly unique content on my friend’s and a showed in the results.
- The URL looked like this: proxysite.com/cgi-bin/pxy/nph-pxy.pl/000010A//www.friendssite.com/
- The site was an exact of my friend’s .

Now, I am not sure if this is what caused my friends ranking to drop, but all the factors are there. The keywords we are talking about are very competitive, but the fact that his site showed so well in the results for a month shows me that the potential is there.

I would appreciate your thoughts on this.

Related posts

Tags: , , , , , , , , , , , , , , , , , , , ,

What To Do If You Suspect Copyright Infringement

Tuesday, September 16th, 2008

I am talking about copyright infringement on your website here.

Say, one morning you wake up to find that someone has copy and pasted your homepage (among other pages) text onto their homepage. I am sure you would get rather peeved at the sight of that. I mean, it’s not a trivial matter here. Website copy costs good money. There are you have to think about and research and many, many more variables that led you to place that copy on your . For someone to just steal it like that is very frustrating, to say the least.

So, what can you do to deal with the situation? Well, according to Wikipedia, there are a few things you need to do:

1. Establish ownership
2. Establish actual copying
3. Establish misappropriation

These are the items you need to get squared away before anything else. Again, I am just getting these things from Wikipedia.

There is something out there called the Digital Millennium Copyright Act (DMCA) that kind of governs the whole online copyright issue. It was signed into law on October 28, 1998 and extends the reach of copyright to protect online assets as well.

With this new online , it makes handing the issue over to your attorney much easier and less expensive. They have something concrete to work with, and since we all know how attorneys get paid by the hour, that matters.

Once your attorney has the information they he/she needs, they can go ahead and send a notification claiming infringement to the website hosting provider of the website that copied your work. According to the attorney I work with, the hosting provider usually takes the site down rather quickly upon receiving a letter like this. I would think they don’t really want to get in the middle of this kind of thing.

The owner of the website can always go ahead and set up shop with a different hosting provider, but that shows a certain amount of audacity on their part and I would think you could go after someone like this personally. In my opinion, it is much easier to change the copy on their website than to fight you in court.

Related posts

Tags: , , , , , , , , , , , , , , , , , ,