Viewing a single comment thread. View all comments

tyler1128 t1_jc27nqp wrote

I'm personally thinking about writing a service to sell the data at something like 1/10,000th the cost twitter is charging or less. It'd cache most of the tweet data in LRU form up to a specific data limit in a central database, and dynamically grab new data in the case it isn't already there. There's also be a constantly running scraper for new data to throw it in the central DB cache. Only think stopping me is understanding the legal ramifications. On-demand access to historical data is too slow for large cohorts.

3

FamousSuccess t1_jc2ry05 wrote

Well. Keep in mind that google effectively sells advertising based on user data, and their services/users depend entirely on content and data of non google entities.

So I’d say if google can build a business on other entities public data, so can you.

Not a perfect parallel but a parallel nonetheless

1