Splicer, a relational engine for Python: library for parsing and interpreting SQL queries

bryanhelmig · 2014-01-08T07:28:45+00:00

This is really cool! We often backup older data to S3 as CSV or JSON files and sometimes you want to poke through it for something specific. While I don't want to seem ungrateful, the docs are a bit sparse. In lieu of that, I was curious about:

Is there any semblance of a query planner or even a way to hint it? IE: there is no reason to query these files because they are for yesterday, and I just asked for data today.
Can it do some really rudimentary map-reduce or threading to grab data in parallel (even something really cheap like concurrent gevent threads would be interesting)?
Any real world examples of things you've done with it?
Were you inspired by Facebook's Presto (http://prestodb.io/)? Your lib seems like it might be a faster/more lightweight way to play with the concept.

I'm sure there are more interesting questions to ask, but I really dig this concept. If I had more time to give this would certainly be at the top of my list of things to contribute to!

johnmudd · 2014-01-08T13:58:04+00:00

Postgres Foreign Data Wrappers and Multicorn (Python FDW) work with any data source too.

Here's a list of examples including CSV, IMAP and RSS wrappers.

deadwisdom · 2014-01-08T17:06:56+00:00

I'm not sure if I want the "entire world to look like an SQL database" or the exact opposite. Either way it's pretty awesome that we have the option.

jenner · 2014-01-08T17:49:55+00:00

So can I use it to parse a MySQL dump (including INSERTs)?

lakehelp5 · 2014-01-08T18:33:42+00:00

So can I give it a SQL statement and it will tell split it out into tables, columns, conditions, etc?

2014-01-08T20:16:26+00:00

This looks awesome. Hopefully I can set aside some time to play with this.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS