Google DMCA team got their hands full on a daily basis, no doubt about it. They are really good if someone snips and puts up a copyrighted photo you just made, or copy pastes your original text article. What happens when you have 10 000, 50 000 or even 300 000 pages of your content, indexed through clone sites?
Well, you sure can submit them all, no one is stopping you. Mind you, they have a daily limit for number of DMCA requests you can submit, and since they request original url and infringing url, be ready to spend couple of years on submitting all of it. Compare that to the fact that a yet another clone site can be launched in only minutes, and you can figure out if it’s worthwhile to even bother with google DMCA submissions.
Fun facts about Google DMCA and clone site issue
- COMPLEXITY – submissions must be made on a page per page basis, with original url and infringing url for each page
- ONE PAGE ONLY – when submitting just the main link, that means if you remove your-site-clone.com, you’ve actually just removed the homepage of the clone site, all other subpages are still in the index
- DESCRIPTION IGNORED – it pretty much doesn’t matter what you write in “description” box of the form, I’ve tried to explain the situation and even requested that they check out site:clone.com query, to no luck
- ONLY VISUAL – when dealing with sneaky clones (googlebot sees your content, surfer sees something else), google dmca fellas will not bother to check cache, only visual confirmation is their weapon of choice
- MAILS IGNORED – every now and then you’ll get a mail from them, with request for more info – don’t bother replying, I’m suspecting that no one ever reads the reply
- CAPTCHA – last time I was doing some dmca requests, I marked more streets and traffic signs than I ever saw while driving my car, and I do drive a lot
I doubt that google is using their alpha team to respond to Google DMCA, but I wonder why can’t they help them a little bit.
Something simple enough like this could easily do the trick:
INPUT YOUR SITE: yoursite.com
INPUT CLONE SITE YOU’RE REPORTING: clone-site.com
Algorithm should be able to do two simple things:
1. compare sitewide data of both sites, % similarity
2. compare actual indexing dates of pages and age of both sites
If algorithm is in doubt of any kind –> pass along to manual review.
That kind of form would of course need more data (about the owner, electronic signature etc), and it doesn’t really have to be a part of dmca. New kind of submit a clone type of thing would work just fine.