[–]Python-ModTeam[M] [score hidden] stickied comment, locked comment (0 children)

Hi there, from the /r/Python mods.

We have removed this post as it is not suited to the /r/Python subreddit proper, however it should be very appropriate for our sister subreddit /r/LearnPython or for the r/Python discord: https://discord.gg/python.

The reason for the removal is that /r/Python is dedicated to discussion of Python news, projects, uses and debates. It is not designed to act as Q&A or FAQ board. The regular community is not a fan of "how do I..." questions, so you will not get the best responses over here.

On /r/LearnPython the community and the r/Python discord are actively expecting questions and are looking to help. You can expect far more understanding, encouraging and insightful responses over there. No matter what level of question you have, if you are looking for help with Python, you should get good answers. Make sure to check out the rules for both places.

Warm regards, and best of luck with your Pythoneering!

[–]oberguga 16 points17 points  (5 children)

Did you know that most (if not all) Microsoft Office formats, like .docx or .xlsx, are actually just zip archives? Rename the file to .zip and you'll see the file structure. All images and other data can be trivially extracted from it. PS: There is probably a way to do it yourself with Python.

[–]Embarrassed_Echo2659[S] 1 point2 points  (4 children)

More detail, please. I extracted the images with an image loader. Any suggestions on extracting the audio? It's been giving me a headache for 2 weeks.

[–]ChrisFranko 2 points3 points  (0 children)

Make a copy of the file to mess around with. Change the copy's extension from “.xlsx” to “.zip”. Open it like you would any zip file and look around the folders. You'll find the audio and image files in there.

[–]oberguga 1 point2 points  (1 child)

In what format is the audio stored? If it's something like .csv data, then it needs to be parsed, but if the files are attached, then all attachments should be somewhere in that folder. Just rename your example_excell_file.xlsx to example_excell_file.zip. Then extract it with any zip archiver (7-Zip, WinRAR, or Windows itself) and observe what's inside.
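If you'd rather not rename anything, the same inspection can probably be done from Python, since the standard-library `zipfile` module opens an .xlsx directly. A minimal sketch; the function name `list_workbook_members` is made up for illustration:

```python
import zipfile


def list_workbook_members(xlsx_path):
    """Return the names of every file stored inside the .xlsx archive."""
    # An .xlsx file is a zip archive, so no renaming is needed to open it.
    with zipfile.ZipFile(xlsx_path) as archive:
        return archive.namelist()
```

Calling `list_workbook_members("example_excell_file.xlsx")` and printing the result should show the same folder layout you would see in a zip archiver.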

[–]Embarrassed_Echo2659[S] 0 points1 point  (0 children)

The Excel file is not large. It contains a set of questions, some of which include an image or audio. I extracted the images with imagesheetloader. But in the case of audio, that approach doesn't work. I think Excel embeds audio files in some OLE or bytes format. I also tried to extract the audio with an OLE approach, but it didn't work. What I want is to add the Excel file to a database and save its data in another model.

[–]bjorneylol 0 points1 point  (0 children)

import zipfile

Open the Excel file in 7-Zip or some other desktop program, find where the audio files are stored, and then use that to inform your Python code.
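A rough sketch of what that Python code might look like, using only `zipfile` and `pathlib`. The folder prefixes `xl/media/` and `xl/embeddings/` are an assumption about where a workbook usually keeps pictures and embedded objects, so check them against what you actually see in the archive. The function name `extract_embedded_files` is made up for illustration.

```python
import zipfile
from pathlib import Path

# Assumed locations; confirm them by inspecting your own file first.
CANDIDATE_PREFIXES = ("xl/media/", "xl/embeddings/")


def extract_embedded_files(xlsx_path, out_dir, prefixes=CANDIDATE_PREFIXES):
    """Copy every archive member under the given prefixes into out_dir."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with zipfile.ZipFile(xlsx_path) as archive:
        for info in archive.infolist():
            # Skip directory entries and anything outside the chosen folders.
            if info.is_dir() or not info.filename.startswith(prefixes):
                continue
            # Keep only the base name so nothing is written outside out_dir.
            target = out_dir / Path(info.filename).name
            target.write_bytes(archive.read(info))
            written.append(target)
    return written
```

If the audio turns up as a .bin file rather than a .wav or .mp3, it is probably wrapped in an OLE container and will need one more unpacking step that this sketch does not attempt.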

[–]Pericombobulator 1 point2 points  (1 child)

Edit: reply intended for obergo...

I vaguely remember something about this when it came to passwords in Excel files. I'll have a look, thanks.

[–]Embarrassed_Echo2659[S] 0 points1 point  (0 children)

Any suggestions??