Content Scraping: Is Someone Ripping Your Content? – Part 1
Image via
You won’t even know it if you don’t check regularly.
Content scrapers are simple programs that scrape content on topic from sites or blogs for posting to the content scraper’s site. The sole purpose of content scrapers is to rip content, post it to a series of junk sites that are slathered with PPC and paid advertising, and make money on clickthroughs.
Spamming Search Engines
Sites that are built on content scraping don’t cost a lot of money – just the low cost hosting fee. Scrapers search the web for sites that post content on the “subject” of the scraper’s website. So, all of the content is related to a topic – usually something about your health and well being, your family, your finances or some other topic that has wide appeal.
Scrapers register a dozen domain names (cost: less than $30 USD) and build a dozen duplicate sites. Then, the scraper takes all of these garbage sites, and links them together, in effect, spamming search engines who “think” that all of this low level linkage indicates sites that are visited frequently and further the search of site visitors to any of the dozen scraper sites.
Google has cracked down on scraper sites, but they’re still out there. And, (1) they may be scraping content for use on their ring of low-rent PR0 sites, or (2) they may be linking to your higher ranking site and dragging you down – without your knowledge.
Links Farms
Scrapers make their money on clickthroughs. No self respecting site owner would place a paid advert on one of these sites, knowing that the site’s sole objective is to generate revenues via clickthroughs.
Google hates links farms. The prime objective of any search engine is to deliver quality SERPs – the best possible search results based on the user’s query keywords. Links farms water down the quality of SERPs and, as a result, diminish the value of Google’s #1 product – links to relevant sites. It’s not nice to make Google mad.
Most links farms scrape content from blogs and web sites with impunity. Let’s face facts: copyright law on the W3 is non-existent. Do you think I’m going to take the time to sue some content scraper who ripped off an article from my site and posted it to her site without permission? How much do you think I’d collect in damages? Crossing international borders and dealing with copyright laws of another nation. It’s not worth the expenses and content scrapers know it.
So they rip you off. They post your content and content scraped from a dozen other sites, build a bunch of low-rent sites, populate these PR0 sites with ripped content, link the sites together and create the appearance of a series of quality sites.
How To Know If You’ve Been Scraped?
The easiest and least expensive way to see if you’re the victim of copy scraping is to use search engines, since they’re the target of content scrapers (so don’t take it personally).
Go back into your archives. Find a sentence in each post you’ve made. Choose a sentence that has a distinctive phrase or even anchor text in it.
Next, cut and paste that sentence into the Google search box. Then, add quotes around the sentence. This tells Google that you ONLY want SERPs that contain this exact sentence. Now you may get lucky.
However, if you get a “No results found” message, or your blog is the only one that shows up on SERPs, it still doesn’t mean you haven’t been scraped. Content scrapers are black hats so the rules fly right out the window when dealing with these clowns.
Another way of detecting it is by typing in search query link:website.com or install the free & mighty SEO for Firefox tool to have a look at your backlinks. If those domains consist of numerical figures with dash in it, most likely these are the sites that you would like to stay away from. Just click on it & if you see ads are flying all over the site, bingo you have just caught a thief with your own bare hands.
Some content scrapers will take your beautifully written piece and stuff it with their keywords, simply inserting keywords throughout your article. So a simple Google search using quotes may not turn up a content scraper.
Read Part 2 here
Ok but if we found that our blog has been attacked by content scrapers, then what to do.
How to take action against this.
Maish: This is something that will be covered in part 2 of the series
Ok Thank you.
I will be waiting.
True indeed. I got affected badly especially on my google pagerank. Thanks for the tips!
: Gain more high quality inbound links & you will be fine
Just wasting some free time on Digg and I found your entry. Not typically what I prefer to learn about, but it was definitely worth my time. Thanks.
Hello folks!
I am questioning if another person can assist me out! Really I desire to view this page on my own completely new iPad, nevertheless it does not present up accurately, So I used to be asking yourself if a person can propose me any optimal option? I don’t know but need to I try and discover out an update for my computer software plan or anything at all else? I know this can be some thing kinda off the subject, but please replace me and thank you upfront for that support! Sophie 
Hi fellows!
I’m pondering if a person might help me out! Basically I wish to look at this unique page on my own absolutely new iPad, however it doesn’t show up accurately, So I was questioning if another person can suggest me any optimum option? I don’t know but really should I strive and discover out an update for my computer software system or anything at all else? I realize this can be something kinda off the subject, but please update me and many thanks in advance for that assist! Sophie 
Hi there,To start with Happy New Yr to all!! I hope you had an excellent time partying!! Well, this is some thing not rather related to the topic here, but I have to have urgent aid….I got stuck with my iphone as I’m not in a position to access the internet over it. I’ll extremely value if any person could assist me out sorting out the situation!! Thanks upfront for ones kind aid!!Cheers!! Kattie
very good sharing article. Thanks for the tips