- Original Poster
- #1
Hi all,
A new project I'm researching...
I want to build a directory listing site for a niche sector that is currently dominated by 3 established sites.
Initially I want to add several thousand free listings to my database, essentially an aggregate of businesses identified by scraping the existing listing sites, and also by automated general web searching for relevant contact and company details.
Once established, the businesses listed would individually be contacted to 'upgrade' their listing for a small charge. But initially, and for quite some time, the listings are free and also relatively worthless until/if the directory site gains some prominence.
So the question is: If I scrape sites that already have business profiles, and replicate only what is publicly available and not copyright protected, is that legal?
The data I would scrape and use would be:
-company name
-contact number
-email
-web address
-type of business
-address
(in a nutshell, everything you might find on a business card)
What I would not scrape is:
-business description (as this may be considered copyright to the person that wrote it)
-any ranking information or other content generated by the site that is being scraped
So basically all data that could have been submitted specifically to be shown on the original site will be left. Anything that can (or could) be gained simply by visiting the businesses own website or looking in a local business directory, will be scraped.
Thoughts?
A new project I'm researching...
I want to build a directory listing site for a niche sector that is currently dominated by 3 established sites.
Initially I want to add several thousand free listings to my database, essentially an aggregate of businesses identified by scraping the existing listing sites, and also by automated general web searching for relevant contact and company details.
Once established, the businesses listed would individually be contacted to 'upgrade' their listing for a small charge. But initially, and for quite some time, the listings are free and also relatively worthless until/if the directory site gains some prominence.
So the question is: If I scrape sites that already have business profiles, and replicate only what is publicly available and not copyright protected, is that legal?
The data I would scrape and use would be:
-company name
-contact number
-web address
-type of business
-address
(in a nutshell, everything you might find on a business card)
What I would not scrape is:
-business description (as this may be considered copyright to the person that wrote it)
-any ranking information or other content generated by the site that is being scraped
So basically all data that could have been submitted specifically to be shown on the original site will be left. Anything that can (or could) be gained simply by visiting the businesses own website or looking in a local business directory, will be scraped.
Thoughts?
