Mar 02, 2013 - 02:30 AM
Web scraping is when a site is "scraped" or mined of content to be reposted on another site. Read the glossary definition of Web scraping.
How Your Content is "Scraped"
There are really just two ways that your content will be scraped.
Manually - by simple copy and paste by one of your readers
Automatically - by a tool or program (commonly called a "bot") created to crawl the web and harvest all content that fits within certain parameters
How to Protect Your Content
Although there are a number of tools and applications to help limit or even prevent site scraping, there really is no way to stop it.
Technical Ways to Slow Down the Web Scraping Bots
Block an IP address
Block bots with tools like CAPTHCA services that verify a human is the operator
Commercial anti-bot services
The Problem: There is a way around every technical block. And there is no way to stop a reader from simply copying and pasting your carefully crafted blog post and publishing it on their own site.
The Only Real Way to Beat the Web Scrapers
The best thing to do, is include site links within the text copy, so when they copy it, it will actually send traffic back to your site. When they copy/paste the post, they almost never remove links ... so with in-copy links you'll actually benefit. Who can't benefit from new in-bound links and traffic? A little SEO help never hurt anyone.