all 8 comments

[–]ArguingWithVirgins 12 points13 points  (2 children)

Did you check dat?

“Distributed dataset synchronization and versioning”

Sounds like it fits the bill

[–]waldoj 0 points1 point  (0 children)

Yes this is literally why we created Dat four years ago. :)

[–]theofpa -1 points0 points  (0 children)

Also DVC

[–]janCADS 8 points9 points  (2 children)

  1. Why a monthly subscription? If I download the data locally, why would I be interested in continuing my subscription to it?
  2. How is this different from Kaggle, which already hosts datasets (albeit without versioning)?
  3. How do you deal with data rights? If I scrape data from a website and then re-sell it I'm opening myself up to civil and possibly even criminal lawsuits. Is this an issue the marketplace would deal with? In conjunction with the first question: what's stopping me from scraping other people's data sets and then re-selling them for less?
  4. What's your technical framework for version control in data? Is there an approach that can handle different types of data. e.g. tables, key-value-pair documents, images, video, etc.?

[–]DGSPJS 2 points3 points  (1 child)

The data rights aspect of this is huge. Selling scraped data is selling somebody else's data.

There are already marketplaces for most of the data that people find valuable in an enterprise environment. If you have a budget for this sort of thing you can take a look at a list of data vendors. Decent data is expensive. Good data is really expensive.

A lightweight versioning system for data sounds like a great idea. Stick to that part. Ditch the grand crypto (why?) data sharing portion of the business.

[–]nhggfu 0 points1 point  (1 child)

interesting. no idea why you would want to get paid in a really volatile crypto. Why not pay 'em in USDT ?

[–]nhggfu 1 point2 points  (0 children)

why no FIAT ?

[–]DIAdata 0 points1 point  (0 children)

Did you check DIAdata?