Scraper site
A scraper site is a website that copies content from other websites using web scraping. The content is then mirrored with the goal of creating revenue, usually through advertising and sometimes by selling user data.
Scraper sites come in various forms. Some provide little, if any, material or information and are intended to collect user information, such as e-mail addresses, to be targeted with spam. Others, such as price aggregation and shopping sites, access multiple listings of a product and allow a user to rapidly compare prices.
Examples of scraper websites
Search engines such as Google could be considered a type of scraper site. Search engines gather content from other websites, save it in their own databases, index it and present the scraped content to the search engines' own users. The majority of content scraped by search engines is copyrighted.[1]
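The gather, save, index and present steps described above can be illustrated with a minimal sketch. The page texts, URLs and function names below are hypothetical, and a small in-memory dictionary stands in for a real crawler and database. It shows a simple inverted index, not how any particular search engine actually works.

```python
from collections import defaultdict

# Hypothetical "gathered" pages: URL -> text (stands in for crawled content).
PAGES = {
    "https://example.com/a": "cheap flights to Paris",
    "https://example.com/b": "Paris hotels and flights",
    "https://example.org/c": "garden pests and plant diseases",
}


def build_index(pages):
    """Map each lowercase word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index


def search(index, query):
    """Return the sorted URLs that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return []
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return sorted(results)


INDEX = build_index(PAGES)
print(search(INDEX, "paris flights"))
```

The query is matched against the stored index rather than against the original websites, which is why the results are drawn entirely from copied content.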
The scraping technique has been used on various dating websites as well. These sites often combine their scraping activities with facial recognition.[2][3][4][5][6][7][8][9][10][11]
Scraping is also used on general image analysis (recognition) websites, as well as websites specifically made to identify images of crops with pests and diseases.[12][13]
Made for advertising
Some scraper sites are created to make money by using advertising programs. In such a case, they are called Made for AdSense (MFA) sites.
Made for AdSense sites are considered search engine spam that dilutes search results with less-than-satisfactory entries. Their scraped content is redundant with the content the search engine would have shown under normal circumstances, had no MFA website appeared in the listings.
Some scraper sites link to other sites in order to improve their
Legality
Scraper sites may violate
Techniques
The methods used to target websites differ depending on the scraper's objective. For example, sites with large amounts of content, such as those of airlines, consumer electronics retailers and department stores, might be routinely targeted by their competitors just to stay abreast of pricing information.
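As an illustration of pricing-oriented scraping, the following sketch pulls prices out of a product listing. The HTML fragment, the `price` class name and the `PriceParser` and `extract_prices` names are all assumptions made for this example; real sites use their own markup, and a real scraper would first have to download the page.

```python
from html.parser import HTMLParser

# Hypothetical listing markup; real sites use their own structure.
SAMPLE_HTML = """
<ul>
  <li><span class="name">Widget</span> <span class="price">$19.99</span></li>
  <li><span class="name">Gadget</span> <span class="price">$5.50</span></li>
</ul>
"""


class PriceParser(HTMLParser):
    """Collect the text of every <span class="price"> element as a float."""

    def __init__(self):
        super().__init__()
        self.in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "span" and ("class", "price") in attrs:
            self.in_price = True

    def handle_endtag(self, tag):
        if tag == "span":
            self.in_price = False

    def handle_data(self, data):
        # Only text inside a price span is converted; the "$" is stripped.
        if self.in_price and data.strip():
            self.prices.append(float(data.strip().lstrip("$")))


def extract_prices(html):
    """Return the prices found in the given HTML, in document order."""
    parser = PriceParser()
    parser.feed(html)
    return parser.prices


print(extract_prices(SAMPLE_HTML))
```

Run periodically against a competitor's listings, output of this kind is what lets a scraper track pricing changes over time.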
Another type of scraper will pull snippets and text from websites that rank high for keywords they have targeted. This way they hope to rank highly in the
Other scraper sites consist of advertisements and paragraphs of words randomly selected from a dictionary. Often a visitor will click on a
Scrapers tend to be associated with link farms and are sometimes perceived as the same thing when multiple scrapers link to the same target site. A frequently targeted site might itself be accused of link-farm participation because of the artificial pattern of incoming links it receives from multiple scraper sites.
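How such an artificial pattern might show up in link data can be sketched as follows. The site names, the list of known scrapers and the `scraper_link_share` function are hypothetical, and the measure shown (the share of a site's distinct linking sites that are scrapers) is only an assumed illustration, not a method any search engine is known to use.

```python
from collections import defaultdict

# Hypothetical link data as (source_site, target_site) pairs.
LINKS = [
    ("scraper1.example", "victim.example"),
    ("scraper2.example", "victim.example"),
    ("scraper3.example", "victim.example"),
    ("blog.example", "victim.example"),
    ("scraper1.example", "other.example"),
]

# Hypothetical set of sites already identified as scrapers.
KNOWN_SCRAPERS = {"scraper1.example", "scraper2.example", "scraper3.example"}


def scraper_link_share(links, scrapers):
    """For each target, return the fraction of its distinct linking
    sites that are known scrapers."""
    sources = defaultdict(set)
    for source, target in links:
        sources[target].add(source)
    return {
        target: len(srcs & scrapers) / len(srcs)
        for target, srcs in sources.items()
    }


shares = scraper_link_share(LINKS, KNOWN_SCRAPERS)
print(shares)
```

A high share says nothing about whether the target site asked for the links, which is why a victim site can look like a link-farm participant.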
Domain hijacking
Some programmers who create scraper sites may purchase a recently expired
Some expired-domain registration agents offer services both to find these expired domains and to gather the HTML that the domain used to serve on its website.[citation needed]
See also
- Scraping
- Contact scraping
- Domain parking
- Web scraping
- Blog scraping
- Multi-protocol messengers: can connect to several networks but require an account on each of them, so they do not violate the networks' terms
- Content farm
- Search engine optimization (SEO)
References
- ^ Google 'illegally took content from Amazon, Yelp, TripAdvisor,' report finds
- ^ "This App Lets You Find People On Tinder Who Look Like Celebrities". BuzzFeed News. 20 June 2017. Archived from the original on 2023-05-08.
- ^ Dating app boss sees ‘no problem’ on face-matching without consent
- ^ Dating.ai App Matches You With Celebrity Look-alikes
- ^ Facial recognition app matches strangers to online profiles
- ^ NameTag: Facial recognition app criticized as creepy and invasive
- ^ Swipe Buster
- ^ Stalker-friendly app, NameTag, uses facial recognition to look you up online
- ^ This Smart (but Unsettling) App Lets You Point Your Phone at People to Find Out Who They Are
- ^ Truly.am Uses Facial Recognition To Help You Verify Your Online Dates
- ^ 3 Fascinating Search Engines That Search for Faces
- ^ "Wolfram has created a website that will identify any image you throw at it". The Verge. 2015-05-14. Archived from the original on 2023-06-03.
- ^ Machine Learning Helps Small Farmers Identify Plant Pests And Diseases
- ^ Made for AdSense
- ^ "Text of the GNU Free Documentation License".
- ^ "Creative Commons Attribution-ShareAlike 3.0 Unported License".
- ^ "Wikipedia:Reusing Wikipedia content".