Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
My website has been penalized by Google with no message in GWT.
-
On 26 October 2018 my website had around 1 million pages indexed on Google, but when I checked an hour later the site had been banned from Google and all pages had been removed. I checked my GWT and had not received any message. Can anyone tell me what the possible reasons are and how I can recover my website? My website link is https://www.whoseno.com
-
Would you be able to send me a DM with a copy of that email? I'm interested in larger automatically generated sites and am trying to figure out where the limit is (and why yours isn't allowed when others are).
-
Thank you for your responses. I just received an email from Google, after 3 days, with the reason. They are saying my website is generating automatic content.
-
This is a really fascinating question. It's highly irregular for Google to de-list a site with absolutely no reason given. Even if it's something really bad like serving malware to Google's users, you usually get a hacked content notification.
Your assertion that your site has been de-listed by Google, based on the data you are seeing in various analytics packages, is backed up by Google's front-end:
- https://www.google.co.uk/search?q=site%3Awhoseno.com
- https://www.google.com/search?q=site%3Awhoseno.com
- https://www.google.fr/search?q=site%3Awhoseno.com
- https://www.google.bg/search?q=site%3Awhoseno.com
I can't find any pages from your site in Google US, UK, France or Bulgaria. Whatever has happened they seem to have gone fairly thermonuclear!
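If you want to repeat that check across several Google country domains, the `site:` URLs above can be generated rather than typed by hand. This is only a rough sketch: the function name `site_query_urls` and the list of hosts are my own choices for illustration, and it only builds the links for you to open in a browser (it doesn't query Google).

```python
from urllib.parse import urlencode


def site_query_urls(domain, google_hosts):
    """Build one 'site:' search URL per Google host (hypothetical helper)."""
    # urlencode percent-encodes the colon, giving q=site%3Adomain
    query = urlencode({"q": f"site:{domain}"})
    return [f"https://{host}/search?{query}" for host in google_hosts]


# Hosts taken from the list above; add or remove as needed.
HOSTS = ["www.google.co.uk", "www.google.com", "www.google.fr", "www.google.bg"]

for url in site_query_urls("whoseno.com", HOSTS):
    print(url)
```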
I performed a 25% crawl of your site using Screaming Frog (rendering / JS enabled), using Google's user agent (Googlebot). Some pages returned a 404 error:
- https://www.whoseno.com/number-information
- https://www.whoseno.com/whose-number-is-this
- https://www.whoseno.com/track-location
- https://www.whoseno.com/get-mobile-number-details
- https://www.whoseno.com/phone-number-details
- https://www.whoseno.com/get-complete-details-of-your-ex
- https://www.whoseno.com/track-any-mobile-number
- https://www.whoseno.com/wrong-number
- https://www.whoseno.com/whose-number-is-this-calling-me
- https://www.whoseno.com/phone-number-search
- https://www.whoseno.com/recent-lookups-on-whoseno
- https://www.whoseno.com/get-details-of-any-mobile
- https://www.whoseno.com/get-details-of-any-phone-number-for-free
- https://www.whoseno.com/trace-location-on-map
- https://www.whoseno.com/reverse-directory
- https://www.whoseno.com/track-location-by-phone-number
- https://www.whoseno.com/reverse-phone-lookup
- https://www.whoseno.com/phone-number-lookup
- https://www.whoseno.com/reverse-phone-lookup-service
Although this seems like quite a few broken pages, there were many more which were rendering properly. This just looks like the kind of stuff which Google would flag as crawl errors, rather than taking a site down in its entirety when the majority of pages return 200 (OK).
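If you don't have Screaming Frog to hand, a rough version of the same status-code check can be done with Python's standard library. This is a minimal sketch, not what I actually ran: `fetch_status` and `summarize` are names I've made up, and sending a Googlebot user-agent string only imitates the header (a site or CDN that verifies Googlebot by IP may respond differently to you than to the real crawler).

```python
import urllib.error
import urllib.request
from collections import Counter

# Publicly documented Googlebot user-agent string; sending it does not make
# the request come from Google's IP ranges.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def fetch_status(url, user_agent=GOOGLEBOT_UA, timeout=10):
    """Return the final HTTP status code for url (redirects are followed)."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises 4xx and 5xx responses as HTTPError; keep the code.
        return error.code


def summarize(statuses):
    """Given {url: status_code}, return per-status counts and the share of 404s."""
    counts = Counter(statuses.values())
    total = len(statuses)
    share_404 = counts[404] / total if total else 0.0
    return counts, share_404
```

To use it, build a dict such as `{url: fetch_status(url) for url in urls}` from a list of URLs you care about and pass it to `summarize`. A small share of 404s among mostly 200 responses is the pattern described above, which would normally show up as crawl errors rather than a site-wide removal.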
Google may frown upon some of the URLs, such as the one about getting "complete details" of your ex. People shouldn't really be able to go on a site and get complete details for their ex-partner, as that promotes stalking (something which Google is firmly against, and which most first-world governments are moving to take more and more action on). Even if the name of the page is misleading and it doesn't (when working) really supply that functionality, that then makes it a spam page instead (as it looks to satisfy unscrupulous users looking for such information and then fails to deliver).
Out of the pages which are returning 200 (OK), most of them are individual phone number pages. An example might be this page: https://www.whoseno.com/US/2014623561 - the number has been publicly logged as spam. If you are logging phone numbers and keeping a public database of them (without the permission of each phone number's owner), then you may be in breach of the new European GDPR legislation (read about it here).
Google wants to continue operating in Europe, so although they are an American company, GDPR does heavily impact Google. They want to comply with GDPR.
I checked the technical indexation of your pages, and there don't seem to be any huge red flags.
- Robots.txt isn't blocking critical pages and resources
- Nor is the Meta no-index tag
- Canonical tags don't seem to be de-indexing real pages and pointing Google to broken ones
- Google's user-agent seems to be able to access most pages properly
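The first three checks in that list can be roughly reproduced per page with Python's standard library. Treat this as a sketch under assumptions: `IndexabilityParser`, `robots_allows` and `audit_page` are hypothetical names, you have to fetch the page HTML and robots.txt text yourself, and it ignores the `X-Robots-Tag` HTTP header, which can also carry a noindex.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class IndexabilityParser(HTMLParser):
    """Collects meta robots directives and the canonical URL from a page."""

    def __init__(self):
        super().__init__()
        self.robots_directives = []
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            content = attrs.get("content") or ""
            self.robots_directives += [
                d.strip().lower() for d in content.split(",") if d.strip()
            ]
        elif tag == "link" and "canonical" in (attrs.get("rel") or "").lower().split():
            self.canonical = attrs.get("href")


def robots_allows(robots_txt, url, user_agent="Googlebot"):
    """True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def audit_page(url, html, robots_txt):
    """Summarize the basic indexability signals for one page."""
    page = IndexabilityParser()
    page.feed(html)
    return {
        "allowed_by_robots_txt": robots_allows(robots_txt, url),
        "noindex": "noindex" in page.robots_directives,
        "canonical": page.canonical,
        # A missing canonical is treated as self-referencing here.
        "canonical_is_self": page.canonical in (None, url),
    }
```

A page that is blocked in robots.txt, carries `noindex`, or canonicalizes to a different (possibly broken) URL would show up in the returned dict; none of those turned up as a site-wide pattern here.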
I decided to search for your site on Bing to see if they had also de-indexed you:
Bing still holds pages and records of your domain.
One of the results really interested me. There's a Twitter profile listed on those search results, the SERP snippet reads like this:
"Hussɑin Aвduℓℓɑtif (@whoseno) | Twitter
The latest Tweets from Hussɑin Aвduℓℓɑtif (@whoseno_)"
The Twitter profile has been suspended. This may or may not be your Twitter profile. If it's not your Twitter profile, your digital identity may have accidentally been combined with that of this person, who may or may not have Twitter ToS or state-level action against them.
You need to go to Google's Webmaster support forum here and ask them what the deal is.
It's unlikely to be Penguin / link related and I don't think it's tech related either. It could be GDPR concerns, pollution of your digital identity (combined with a 3rd party who has state-level action against them), or it could be a basic 'Google glitch'.
-
OK, this one may be interesting. If it's none of the options below, I'd love to take a deeper look; send me a DM on Twitter: https://twitter.com/thomasharvey_me
So, I see that you're on Cloudflare, are you still being crawled by Google?
Have you looked in the old search console? Have you or anyone you work with done anything in the "remove urls" section?
Have you seen any change in crawl stats recently?
Any recent changes to the site that may have caused this?