What is "data scraping" and is it legal?

What is “data scraping” and is it legal?

Published by Catherine Tan Product Manager

Categorised as Articles

Published on 21 May 2021

Illustration of a computer with a net catching documents

Is data scraping legal?

Like most activities that deal with personal information, it can be. However there are also many ways of conducting data scraping that can heighten the risk of a privacy violation. For example, while it might seem that scraping data in the public domain is okay, some content might actually be subject to licensing or to a website’s Terms of Use and Privacy Policy.

Besides getting hit with a huge fine and legal action, you could be severely limited on what you’re allowed to do with scraped data. For instance, you won’t be able to reproduce, share, or sell this information without the owner’s consent or authorisation.

There has been much legal debate around what constitutes good, bad, legal or illegal scraping activities, but generally speaking, businesses who utilise scraping tools should be mindful of fair use laws, a website’s terms and conditions for use, the scraping method used (eg. a bot that circumvents login processes would be a big no-no), and whether they plan to use the scraped data in a way that is legal and ethical.

What are the privacy implications of web scraping?

One of the biggest concerns that privacy advocates have is mass harvesting of email addresses with intent to share or sell this information to third parties without the owner’s consent or knowledge. Often, this means more spam and malicious emails make their way into people’s inboxes, which violates multiple anti-spam and privacy laws around email marketing and unsolicited communications.

Another insidious threat is data scraped from people’s social media profiles. As we’ve seen in the past with Facebook’s Cambridge Analytica scandal, scraper bots have the ability to harvest vast amounts of personal information about us, build profiles, and weaponize it through political advertising and other unsavory ends, like fake profiles and online impersonation.

Keep this in mind: While data scraping offers a wealth of new opportunities in business intelligence, academic research, e-commerce, and other niche industries, it also generates a range of privacy and fair use challenges for everyday users and regulators.