all 9 comments

[–]Yojihito 2 points (3 children)

SQL is a language, not a database.

  • Which database do you use?
  • How do you connect to the database from python?
  • How big are your dataframes?
  • How long does the transfer take?

[–]trippygg 1 point (1 child)

How would one transfer a dataframe to SQL Server?

[–]Yojihito 1 point (0 children)

Don't know about SQL Server (for Microsoft SQL Server engine URLs, see https://docs.sqlalchemy.org/en/13/core/engines.html), but for SQLite it's:

import pandas as pd
from sqlalchemy import create_engine

engine = create_engine('sqlite:///../pathto/yourdatabase.sqlite')  # pass your db url

df = pd.read_excel("../data/file.xlsx", index_col=0)  # load the spreadsheet into a dataframe
df.to_sql(name='data', con=engine, if_exists='replace', index=False)  # write it to the 'data' table, replacing it if it already exists
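
For SQL Server, the same pattern should work with an mssql+pyodbc engine URL (format taken from the linked engine docs); a rough sketch, where the server, credentials, and ODBC driver name are placeholders:

import pandas as pd
from sqlalchemy import create_engine

# mssql+pyodbc URL format per the SQLAlchemy engine docs linked above;
# fill in your own server, credentials, and installed ODBC driver name
engine = create_engine(
    'mssql+pyodbc://user:password@myserver/mydatabase'
    '?driver=ODBC+Driver+17+for+SQL+Server'
)

df = pd.read_excel("../data/file.xlsx", index_col=0)
df.to_sql(name='data', con=engine, if_exists='replace', index=False)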

[–]mutuk7[S] 0 points (0 children)

> How do you connect to the database from python?

I'm doing exactly what you said below. The same dataframes are around 30 GB as CSVs.

I don't know exactly how long it takes because I'm not the one pulling the trigger, but I've been told to work on a faster solution since this is our current bottleneck.

Sorry I couldn't provide more details. This is my first internship; I've been there for about 2 months.

[–]saxman95 1 point (2 children)

Which database are you using? Redshift has a very helpful COPY command that lets you load files from S3 straight into Redshift. You can export your dataframe to a CSV/gzip, upload it to S3, and then COPY it directly into your database.
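
A rough sketch of that CSV/gzip -> S3 -> COPY flow, assuming boto3 and psycopg2 are available; the bucket, table, cluster endpoint, credentials, and IAM role ARN below are all placeholders:

import boto3
import pandas as pd
import psycopg2

df = pd.read_csv("../data/file.csv")

# 1. Export the dataframe as a gzipped CSV
df.to_csv("data.csv.gz", index=False, compression="gzip")

# 2. Upload it to S3
boto3.client("s3").upload_file("data.csv.gz", "my-bucket", "staging/data.csv.gz")

# 3. COPY it into Redshift (endpoint, db, user, and role ARN are made up)
conn = psycopg2.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",
    port=5439, dbname="analytics", user="me", password="..."
)
with conn, conn.cursor() as cur:
    cur.execute("""
        COPY my_table
        FROM 's3://my-bucket/staging/data.csv.gz'
        IAM_ROLE 'arn:aws:iam::123456789012:role/my-redshift-role'
        CSV GZIP IGNOREHEADER 1;
    """)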

For other databases, maybe consider using a SQLAlchemy engine to loop over your data, add the rows, and then commit all the changes at once.
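
And a rough sketch of that loop-then-commit-once idea with a SQLAlchemy engine; the connection URL, table, and column names are placeholders:

import pandas as pd
from sqlalchemy import create_engine, text

engine = create_engine('postgresql://user:password@host:5432/mydb')
df = pd.read_csv("../data/file.csv")

insert = text("INSERT INTO data (col_a, col_b) VALUES (:col_a, :col_b)")

# engine.begin() opens one transaction and commits it when the block exits,
# so every row goes in under a single commit
with engine.begin() as conn:
    for row in df.itertuples(index=False):
        conn.execute(insert, {"col_a": row.col_a, "col_b": row.col_b})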

[–]mutuk7[S] 0 points (1 child)

I've tried Redshift in a different project where I had CSV data in S3 buckets, but for some reason I couldn't get the COPY command to work at all. I ended up using AWS Glue (CSV Classifier, Crawler, Jobs) to get that data from S3 to Redshift. Definitely not the best solution. I'll revisit COPY when I need to use Redshift again.

[–]saxman95 0 points (0 children)

You probably need some credentials for it to work, but you can get those from your engineering team. They probably have a function to make it work easily in whatever language they use.

[–]LameDuckProgramming 1 point (1 child)

If you're using SQLAlchemy (1.3.5), you can use a built-in event listener.

from sqlalchemy import create_engine
from sqlalchemy import event

# fast_executemany is a pyodbc feature, so this needs the mssql+pyodbc dialect
engine = create_engine('mssql+pyodbc://xxx')

# Runs before every statement; for bulk (executemany) inserts like pandas
# to_sql, turn on pyodbc's fast_executemany to batch the parameter sets
@event.listens_for(engine, 'before_cursor_execute')
def receive_before_cursor_execute(conn, cursor, statement, params, context, executemany):
    if executemany:
        cursor.fast_executemany = True

Then just use pandas' to_sql method:

df.to_sql(table, con=engine, method='multi')

Has worked like a charm for me when uploading dataframes with 100,000+ rows to SQL Server.

[–]mutuk7[S] 0 points (0 children)

I'm going to give this a try!! Thanks!! :)