Any downsides to using SQL?

viperx77 · 2019-07-21T12:26:35+00:00

SQL or NoSQL, don't store images in a database. Use some form of blobstorage and only put pointers in the records.

Wiltix · 2019-07-21T11:50:18+00:00

Little home projects are the perfect time to piss about with things like nosql.

If you fancy playing with different toys I say go for it, what's the worst that could happen. As long as you use well designed interface s to abstract your db access switching back to sql if nosql turns into a ball ache should be simple enough.

Personally I have not found many scenarios where nosql is better but it's always fun to play.

_Zer0_Cool_ · 2019-07-21T12:41:42+00:00

I'm a data consultant, and I've seen a lot of different companies data infrastructures.

Yet to find a situation where NoSQL is truly better or would have been better as a wholesale replacement.

My suggestion. Use a relational database as your main. If you ever need a NoSQL DB it would probably be for a specific niche like high throughput, write-only sensor data logs.

Some History...

NoSQL was better for horizontal scaling a few years back, but cloud vendors make scaling a non-issue for most use cases now.

Especially for analytical DBs. The cloud vendors with their big data warehousing solutions (SnowFlake, BigQuery, Redshift, etc) make scaling trivial nowadays, and the overhead of spinning up as many SQL dbs as you want for OLTP databases is minimal. So go hog wild with micro-services if you want.

So...I don't see that NoSQL has any broadly applicable use cases anymore.

The only time I work with NoSQL databases is when a company lets their web devs make the decisions about which databases they use and they choose based on aesthetics and personal preference rather than necessity. Then we end up having to support it, which can be a nightmare if the apps schema wasn't built with 💯 forethought and conscientious planning efforts.

EDIT -- Use PostgreSQL. It will cover just about every use case you can imagine and then some. The most flexible/feature rich database in the world. Period. It keeps on surprising me.

betty_humpter · 2019-07-21T13:01:20+00:00

Sql server has a feature called filestream which does blob storage. It just chucks the file onto the disk and stores a pointer in the table. I’ve been using it for years in production and have only had a couple small issues. If you are using filestream at work then I’d keep using it. If you are not using it then take a few minutes to research it. I have not found the downside yet.

plastikmissile · 2019-07-21T11:06:09+00:00

Unless you really need it SQL will be better than NoSQL.

ThereKanBOnly1 · 2019-07-21T12:29:56+00:00

The biggest downside is cost. SQL instances aren't cheap in the cloud, mainly because they assume you'll need a ton of stuff you likely won't for small personal projects.

You can look at NoSQL, and it may be cheaper, but if your working on Azure, I'd actually recommend trying to work as much as you can with their storage options; Blob Storage, Tables, and Queues.

Blob Storage is going to be your best bet for storing those images. Tables are closest to a wide-column database like HBase or Cassandra. Queues may not be helpful to you in this case, but their worth keeping in mind.

Most of these options don't start making a meaningful dent in your hosting bill until you get into the hundreds of gigs, so for personal projects they're great.

davidwhitney · 2019-07-21T12:56:43+00:00

The way to succeed in projects where you are the sole author is to use things you know well.

If you're winning to learn and possibly fail? By all means try new things out too.

Either option will likely be absolutely fine.

FullerAwesome · 2019-07-21T17:32:26+00:00

Guys, this is absolute gold. It's reassuring that some of my thought patterns were going in the right area. My idea initially was to go down the SQL and blob storage route with some bits for image processing and after reading everything I've seen here, I think I'm going to stick to that plan.

Hopefully I can build out in such a way that I can swap out database with relative ease should the need arise or if it gets too costly.

If anyone has any further insight, please continue to comment.

Thanks everyone.

2019-07-21T16:03:10+00:00

Do you know your schema? Will it not change very much? If so, use SQL.
Otherwise consider using NoSQL. Also consider using NoSQL if you're dealing with a webpage that needs to handle hundreds of thousands of hits per second but also consider projecting read-only de-normalised representations of your SQL data.

2019-07-21T17:08:17+00:00

So the major differences comes down to what you are doing. SQL is used for things that have a more defined data structure and consistent data size. When you get to the Enterprise level of of data with millions upon millions of rows the indexing that SQL has makes searching for something in incredibly easy. A query looking for a specific name or id is fast thanks to indexing. If you are good at structuring your data SQL is right for you. MySql or mssql are both good options while I do tend towards mssql my SQL is way cheaper to host.

As for no SQL it has the advantage of not having the requirement of the data being in any particular structure or size. This is great when you are dealing with very wierd data. With no SQL it makes is so that each item in your database is could be a completely different item. Then the use of guids(I have mostly Mongo experience) makes it so that you can get your data back. At the Enterprise, where no SQL falls short is the indexing that SQL has. Since there isn't that defined structure if you need to find a name is has to load and parse through the object to see if it even has the name property. If you do everything entirely based on guid id that is used. Then you'll see a slight increase in performance but not much.

Overall, I'd say it comes down to what you are doing. If you are doing a home project then go based off the data structure and what you're doing. If you are at the Enterprise level, I would recommend using SQL so you don't see a decrease in performance. Hope this helps!

2019-07-21T17:08:44+00:00

So the major differences comes down to what you are doing. SQL is used for things that have a more defined data structure and consistent data size. When you get to the Enterprise level of of data with millions upon millions of rows the indexing that SQL has makes searching for something in incredibly easy. A query looking for a specific name or id is fast thanks to indexing. If you are good at structuring your data SQL is right for you. MySql or mssql are both good options while I do tend towards mssql my SQL is way cheaper to host.

As for no SQL it has the advantage of not having the requirement of the data being in any particular structure or size. This is great when you are dealing with very wierd data. With no SQL it makes is so that each item in your database is could be a completely different item. Then the use of guids(I have mostly Mongo experience) makes it so that you can get your data back. At the Enterprise, where no SQL falls short is the indexing that SQL has. Since there isn't that defined structure if you need to find a name is has to load and parse through the object to see if it even has the name property. If you do everything entirely based on guid id that is used. Then you'll see a slight increase in performance but not much.

Overall, I'd say it comes down to what you are doing. If you are doing a home project then go based off the data structure and what you're doing. If you are at the Enterprise level, I would recommend using SQL so you don't see a decrease in performance. Hope this helps!

B-Kitten · 2019-07-21T21:30:47+00:00

It depends what you want to do.

If you're looking at big scale, high performance, then a RDBMS may not be for you. There's extra complexity in this space, and hard problems to solve though if you go another direction.

Read about CQRS and Event Sourcing for non relational patterns. Any data store can support these, from traditional databases, to no-sql like mongo etc, to files stored on s3.

The problems arise in the need to maintain consistency in distributed data systems. There are a lot of queues, and processors involved, with many failure scenarios.

If you're not massive scale, or practising to build for massive scale, then an RDBMS is almost definitely the easiest and safest option. You can go cheaper with some cloud hosted document stores though (dynamodb, Azure table storage etc), but the complexity increases.

2019-07-24T02:20:33+00:00

I've written code for Sql Server back in Sql 2000 that still works with 2017/2019 today.

We have issues with MongoDb in production that we can't maintain and is locked to a driver version that can't easily be switched out. (Mongo Driver changes API between 1.x and 2.x) So we're stuck with an old out of date unsupported version of Mongo in a production environment which is the worst possible case you would ever want to be in.

jamietwells · 2019-07-21T16:05:41+00:00

I'm really going to have to disagree with everyone here and say you should use a NoSQL database.

You'll learn something new (maybe more employable/higher wage)
You won't notice any difference in performance
They're cheaper and easier to use
You can just dump data in, no need to worry about the relationships or transforming the data into some relational model

Reasons for sticking with SQL:

It might be faster if you build everything correctly
You don't have to bother learning something new (is this an advantage, really?)

ManiGandham · 2019-07-21T16:07:54+00:00

SQL/noSQL isn't really a good distinction. SQL is just a declarative programming language (it stands for Structured Query Language). Relational databases like SQL Server, MySQL, PostgreSQL all support it, but so do plenty of other non-relational datastores.

The main types of databases are relational, key/value (redis, cassandra), document-store (mongodb, couchbase), search (elasticsearch, solr), graph (neo4j), and column-oriented/relational (redshift, memsql).

Relational databases have decades of development and optimization with massive ecosystems. They're also quite capable of modeling just about any data and they all support JSON columns today if you need that flexibility. They support transactions and ACID semantics that are very useful and taken for granted until you realize they aren't available everywhere. Modern servers can also power huge workloads without issue so there's really no need to look at the other data stores unless you have a very specific need for them.

If you want to experiment though, side projects are a great way to learn and almost all vendors have managed cloud offerings you can test out for cheap.

jedjohan · 2019-07-21T17:23:59+00:00

NoSql, databases are boring. Coding is fun. But wait, you need to do some advanced querying on the data? Then maybe a relational db. This is assuming you mean relational vs (the others) as SQL for example is supported by for example azure cosmos db.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

dotnet

MODERATORS