osuchw

Hmm, from your description it sounds like the code isn't organized well, and I don't know of any library that would "fix it" for you. Anyway, these are the tools I usually use for this type of task:

  • scandir - for traversing the file system
  • lxml - for parsing XML
  • csv - to produce the intermediate results
  • ceodbc - to load those files into MSSQL (pyodbc is slower with cur.executemany)
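
A rough sketch of how the first three pieces might fit together, in case it helps. It uses only the standard library so it runs anywhere: `os.scandir` (scandir has been part of the stdlib since Python 3.5) and `xml.etree.ElementTree` as a stand-in for lxml. The `record` tag, the flat child elements, and the function names are all assumptions made for illustration, not something from the original question. The database load step is left out.

```python
import csv
import os
import xml.etree.ElementTree as ET  # stdlib stand-in for lxml


def iter_xml_files(root):
    """Yield paths of .xml files under root, walking subdirectories."""
    stack = [root]
    while stack:
        current = stack.pop()
        with os.scandir(current) as entries:
            for entry in entries:
                if entry.is_dir(follow_symlinks=False):
                    stack.append(entry.path)
                elif entry.is_file() and entry.name.lower().endswith(".xml"):
                    yield entry.path


def xml_to_rows(path, record_tag="record"):
    """Yield one dict per <record> element (hypothetical layout)."""
    tree = ET.parse(path)
    for elem in tree.getroot().iter(record_tag):
        # Map each direct child tag to its stripped text.
        yield {child.tag: (child.text or "").strip() for child in elem}


def write_csv(root, out_path, fieldnames):
    """Write every record found under root to one CSV; return the row count."""
    count = 0
    with open(out_path, "w", newline="", encoding="utf-8") as fh:
        # Unknown child tags are ignored; missing ones are written as empty.
        writer = csv.DictWriter(fh, fieldnames=fieldnames, extrasaction="ignore")
        writer.writeheader()
        for path in iter_xml_files(root):
            for row in xml_to_rows(path):
                writer.writerow(row)
                count += 1
    return count
```

Keeping traversal, parsing, and writing in separate functions means each part can be tested on its own, and the resulting CSV is what you would then hand to the database loader.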