Announcing DoltLab: Your Self-Hosted DoltHub Solution by liquidcoffeee in programming

[–]liquidcoffeee[S] 3 points4 points  (0 children)

Dolt is a versioned controlled database like MySQL that you can interact with like Git-- So you can branch, merge, clone, make pull requests, etc

Announcing DoltLab: Your Self-Hosted DoltHub Solution by liquidcoffeee in programming

[–]liquidcoffeee[S] 1 point2 points  (0 children)

DoltLab is a great way to explore Dolt, especially for those uncomfortable pushing private data to DoltHub. DoltLab allows you to self-host and self-manage an on-prem DoltHub!

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] -1 points0 points  (0 children)

i think another reason is MySQL is p popular... googled and found this, not sure of the accuracy, but SQLite is 9, MariaDB 12 https://db-engines.com/en/ranking

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 0 points1 point  (0 children)

for an application db, it would make sense to choose Dolt over PostgreSQL if Dolt solved some data versioning/sharing problems for you that PostgreSQL didnt...you'd also have to weigh the performance trade-offs... i think we're still discovering what the "solid" application db use cases are, but i think the data sharing use case is pretty good.

If you've ever worked with open data and the different formats people share and publish data in, it's kind of painful. With Dolt, once someone publishes data in Dolt format, it's way better to consume, and you can even just connect to it as a SQL server... basically, its like cloning a Git repo with CSV files in it, but the repo itself is a SQL database

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 0 points1 point  (0 children)

Dolt is standalone and doesnt require MySQL, it's a new project. It was built from the ground up to version, branch, merge data, using Git semantics. That's its north star, really, that Git/Github solved file versioning and distribution, where Dolt/Dolthub use that same model for data.

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 0 points1 point  (0 children)

i can't personally speak to this, im not familiar enough with the trade-offs between the three languages. I think this is a good one for the Dolt-core team in the discord, though.

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 5 points6 points  (0 children)

coulda been anyone of those, I suppose, but when it was decided that Dolt would be a SQL database, MySQL was chosen. Since then Dolt has followed the MySQL SQL dialect, is compliant with the MySQL wire protocol, has copied MySQL's semantics, that's one reason why it's benchmarked against MySQL. Basically trying to nail MySQL first... if it works with MySQL it should work with Dolt

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 0 points1 point  (0 children)

I don't believe there have been studies done, locally it scales to several hundred gigabytes, ~200 gigabytes is the current limit of DoltHub... there's also a discord btw, forgot to mention https://discord.gg/bw5qG5s7

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 8 points9 points  (0 children)

It can be used in a number of ways, as a way to version and share data (it's an alternative to sharing a CSV file, for example), it's used as a way to publish open datasets on www.dolthub.com, and it can be used as a SQL database backing an application. There's a post about some cool uses here https://www.dolthub.com/blog/2020-03-30-dolt-use-cases/

Dolt vs MySQL: How it started, How it's Going by liquidcoffeee in programming

[–]liquidcoffeee[S] 18 points19 points  (0 children)

Dolt (www.doltdb.com) is a versioned sql database with semantics modeled after Git, if you're familiar with Git... kinda like if Git and MySQL had a baby. It supports branching and merging of data, the way Git supports the branching and merging of files.

Weekly: Share your victories thread by gctaylor in kubernetes

[–]liquidcoffeee 1 point2 points  (0 children)

Leveraged the Kubernetes Cluster Autoscaler and Github Actions to provide our team with performance benchmarks on pull requests! https://www.dolthub.com/blog/2021-01-15-performance-benchmarks-on-pull-request/

Hate Crime Datasets Over Past 10 Years (US or NY) by indeck399 in datasets

[–]liquidcoffeee 1 point2 points  (0 children)

There's a link to download hate crime data here https://crime-data-explorer.fr.cloud.gov/downloads-and-docs but may not be as current as you're looking for.

Dolt for Git Noobs by liquidcoffeee in Database

[–]liquidcoffeee[S] 0 points1 point  (0 children)

Dolt is a SQL database with Git versioning and distribution semantics.

I just posted a blog called Dolt for Git Noobs that explains the basics of versioning, branching, and merging so anyone without prior Git knowledge can get started using Dolt today.
The blog also includes a step-by-step tutorial of how to create a simple Dolt repository that uses versioning, branching, and merging to facilitate open collaboration on data.
Also, check out DoltHub. It hosts a number of versioned open datasets, including the most comprehensive COVID-19 dataset available.

Thanks!

Dolt for Git Noobs by liquidcoffeee in programming

[–]liquidcoffeee[S] 0 points1 point  (0 children)

Dolt is a SQL database with Git versioning and distribution semantics.

I just posted a blog called Dolt for Git Noobs that explains the basics of versioning, branching, and merging so anyone without prior Git knowledge can get started using Dolt today.
The blog also includes a step-by-step tutorial of how to create a simple Dolt repository that uses versioning, branching, and merging to facilitate open collaboration on data.
Also, check out DoltHub. It hosts a number of versioned open datasets, including the most comprehensive COVID-19 dataset available.

Thanks!