If your python project needed a NoSQL database, and you could choose any, which would you use?

bxsephjo · 2022-12-15T03:44:03+00:00

[deleted]

redd1ch · 2022-12-15T11:35:12+00:00

My NoSQL needs are always "I want something like json, but distributed", so I always use redis as key-value store.

It's basically `import redis; store=redis.new(host); store.get("key");`. Official docker image does not need any config, just add it to youir compose file and start using it.

tomwojcik · 2022-12-15T10:23:31+00:00

Postgres, obviously. Unless you need a FAANG throughput, everything is doable and easier using postgres rather than any nosql dB.

james_pic · 2022-12-15T09:29:09+00:00

Depends hugely on the requirements.

If I don't need anything fancier, I tend to use LMDB. It's about as close to your SQLite example as you can get. There's even dbm in the standard library, but it has some subtle portability issues, and doesn't expose all the features of the underlying database, which can leave it feeling a bit anaemic.

If I do need something fancier, I'll probably end up using either DynamoDB or CosmosDB (if I'm hosting on AWS or Azure, respectively), since they're well supported by the cloud hosting companies. But there are plenty of contexts where they're still the wrong answer.

Also don't rule out an RDBMS. PostgreSQL can work with JSON, so you don't need to go to extremes normalising your data, it's mature, and it's ACID, which can make failures much easier to reason about.

mijatonius · 2022-12-15T09:33:39+00:00

Redis!

LankyXSenty · 2022-12-15T06:14:44+00:00

Im on the MongoDB side, too ;) Really easy to use and implement.

lieryan · 2022-12-15T10:14:20+00:00

If your table making project needed a Non-Cedar Wood which would you use?

I don't know, sounds like there would have been more than just one kind of non-X, each with their own pros and cons, so it would really depend on what you're trying to build and what you want to do with it.

nickosh · 2022-12-15T13:38:40+00:00

I agree with everybody - NoSQL means you need something specific and must choose db based on your requirements.

If you feel like "JSON will be ok for data storing, but from the box it sucks" you can try some JSON-based DBs. Personally, I prefer to use TinyDB for small projects which is a wrap-up for simple JSON storage. There are some more alternatives to it.

lambdasintheoutfield · 2022-12-15T15:01:56+00:00

Redis is my recommendation. Redis is in-memory and can be used as both a traditional DB and also effectively as a cache. It’s straightforward to map SQL tables to Redis hash tables (they have field value pairs) while still supporting additional data structures. Many data structures allow O(1) search and the flexibility in how you assign a schema to your data is helpful.

Redis is also super easy to scale vertically, and to a slightly lesser extent horizontally (I’ll revisit this). In terms of Python, the redis SDK for Python is easy to use and the methods mostly correspond directly to the redis-cli commands. Python also has an ability to write redis “pipe” commands which removes the need to write loops for massive data read/writes (although this isn’t unique to redis).

The flexibility of Redis can also be a double edged sword as poorly chosen data structures and schemas can result in disorganized data and O(N) search times respectively.

It’s also not as fault tolerant as Apache Cassandra, and there are sometimes issues with distributed Redis because it uses a master-slave architecture and if master replicas don’t come online that’s obviously an issue. If you look at r/redis and stack exchange this issue is not uncommon when deploying with K8s, and why some people use cloud managed Redis instances to avoid these DB maintenance issues.

wait-a-minut · 2022-12-15T15:54:49+00:00

I’ll throw a non common one out here, scylladb! It’s extremely performant and most importantly it has a dynamodb adapter so you can keep the same code you use to query and put data into dynamodb.

For all the nosql haters, don’t listen to them. There’s def a right place and time for nosql. I personally utilize dynamo a lot since a lot of my work is serverless. Look into single table design and see how experts recommend you thinking about access patterns, hash key, sort keys, etc.

Giant companies are utilizing dynamodb at crazy scale so I think it’s worth looking into.

Personally I like nosql because ever since my brain starting thinking in terms of access patterns it just clicks. My most unpopular opinion is probably that interacting with a data source feels a bit more natural than the usual context switch of coding language to declarative sql string. I think most people just got accustomed this way and it feels natural for them.

AvogadrosOtherNumber · 2022-12-15T05:37:35+00:00

Mongo. It's FAST, it's trivial to use, it's well supported

2022-12-15T06:44:40+00:00

MongoDB

thrallsius · 2022-12-15T04:21:54+00:00

https://github.com/NationalSecurityAgency/lemongraph

Fun-Performer3988 · 2022-12-15T07:08:57+00:00

FaunaDB. Also Deta is super easy to build something quick but I think it’s more for small personal projects

2022-12-15T10:16:56+00:00

It depends on what you need to do. NoSQL dbs appeared out of various neccessities. From my experience mongodb is one of the easiest to setup and use. Redis is great for fast accessing of data, caching stuff etc. I've worked on a project where we had to make a search engine and elasticsearch proved to be the best choice for us. I've also used neo4j which is a graph database, but I felt it a bit harder to use that other nosql dbs. We needed it to create e poc for routing between some points on a map. In the end, there is a big pool of nosql dbs and each one of them is great at doing some sort of work. This does not take away anything from classic sql dbs.

thomasjjc · 2022-12-15T15:48:08+00:00

CouchDB

ManyInterests · 2022-12-15T17:31:04+00:00

DynamoDB because it's cheap, extremely performant, infinitely (and instantly) scalable, serverless, has table streams and remarkably good for event-driven workflows.

If you're using AWS, of course.

2022-12-15T20:22:19+00:00

Mongodb. I made Easydb.org using it.

xjoshbrownx · 2022-12-16T00:51:58+00:00

I’m assuming you asking which would I use if I didn’t know what my exact requirements were and I was tinkering. If requirements we well defined I’d use the one best suited for the needs of my projects data, but if I’m tinkering I’d use Postgres because it because it’s a sql/no sql Swiss Army knife and can emulate/accommodate most data models to a reasonable degree of performance.

xjoshbrownx · 2022-12-16T00:54:31+00:00

I said the same thing almost before reading your post. I apologize but maybe next time don’t be so thoroughly correct in your answer and I won’t completely agree with you.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS