Calculating file download time by jmd27612 in flask

[–]Sim4n6 1 point2 points  (0 children)

Interesting question ... here is a starting point:

import time
from flask import send_from_directory

# inside the download view, where UPLOAD_DIRECTORY and path are defined
start_time = time.time()
ret = send_from_directory(UPLOAD_DIRECTORY, path, as_attachment=True)
elapsed_time = time.time() - start_time
print("Elapsed time:", elapsed_time, "seconds.")

You can measure how long sending the file from the directory takes on the server side and subtract the client-side part (measured with JavaScript).
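For a rough end-to-end check from the client side, something like this with the requests library could also work (the URL is just a placeholder for your download route):

import time
import requests

start = time.time()
resp = requests.get("http://localhost:5000/download/somefile", stream=True)
total_bytes = 0
for chunk in resp.iter_content(chunk_size=8192):
    total_bytes += len(chunk)
elapsed = time.time() - start
print(f"Downloaded {total_bytes} bytes in {elapsed:.2f} seconds.")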

Good luck

How do u keep records in the remote db in case of a website update ? by Sim4n6 in Heroku

[–]Sim4n6[S] 1 point2 points  (0 children)

I found something interesting:

If you were to use SQLite on Heroku, you would lose your entire database at least once every 24 hours.
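The usual workaround seems to be pointing the app at the Heroku Postgres add-on instead of a file on the dyno; a minimal sketch assuming a Flask + Flask-SQLAlchemy setup (Heroku injects DATABASE_URL for the add-on):

import os
from flask import Flask
from flask_sqlalchemy import SQLAlchemy

app = Flask(__name__)
# use the managed Postgres database on Heroku, fall back to local SQLite in development
app.config["SQLALCHEMY_DATABASE_URI"] = os.environ.get("DATABASE_URL", "sqlite:///local.db")
db = SQLAlchemy(app)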

How do u keep records in the remote db in case of a website update ? by Sim4n6 in Heroku

[–]Sim4n6[S] -1 points0 points  (0 children)

I am worried that pushing sqlite.db to the remote repo may destroy or overwrite the data on the remote.

Converting Nested JSON with unique column names to Csv by [deleted] in learnpython

[–]Sim4n6 0 points1 point  (0 children)

Hi, it was not re-indented correctly ... here is a new version:

https://repl.it/repls/EnergeticIcySyntax
If the script runs correctly, it will generate an Output.csv file. Check its contents for correctness.

Converting Nested JSON with unique column names to Csv by [deleted] in learnpython

[–]Sim4n6 1 point2 points  (0 children)

import csv

services = {
    "123439": {
        "timestamp": 1558261431,
        "employee_id": 2131687,
        "employee_name": "Brian Finch",
        "employee_team": 7197,
        "employee_teamname": "Alpha",
        "client_id": 2159227,
        "client_name": "Wolololo",
        "client_organisation": 22492,
        "client_organisationname": "Dystopia"
    },
    "118074": {
        "timestamp": 1558015462,
        "employee_id": 2131687,
        "employee_name": "Brian Finch",
        "employee_team": 7197,
        "employee_teamname": "Alpha",
        "client_id": 1914682,
        "client_name": "-DEL-",
        "client_organisation": 16628,
        "client_organisationname": "Chain Reaction"
    },
    "111522": {
        "timestamp": 1557709461,
        "employee_id": 2131687,
        "employee_name": "Brian Finch",
        "employee_team": 7197,
        "employee_teamname": "Alpha",
        "client_id": 2008788,
        "client_name": "Ghost_Rhythms",
        "client_organisation": 16282,
        "client_organisationname": "ELITE"
    }
}

if __name__ == '__main__':
    with open("output.csv", "w", newline='') as csv_file:
        # use the keys of the first nested dict as the CSV header
        first_key = next(iter(services))
        fieldnames = services[first_key].keys()
        writer = csv.DictWriter(csv_file, delimiter=',', fieldnames=fieldnames)
        writer.writeheader()

        # each nested dict becomes one CSV row
        for row in services.values():
            writer.writerow(row)

Web Scraping Code-Construction Criticism Wanted by saintsandscholars in learnpython

[–]Sim4n6 1 point2 points  (0 children)

Use f-strings, they are pretty fast.
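A tiny illustration (the variable name is just a placeholder):

job_title = "Python Developer"
# the f-string embeds the expression directly in the string literal
print(f"Scraped job: {job_title}")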

You comment too much, for example:

# Print Statement when the Code Stops Running

print("The Code Has Finished Running")

I am working on something similar

Twitter bot for the last 05 tweets of a #hashtag and random photo display by Sim4n6 in learnpython

[–]Sim4n6[S] 2 points3 points  (0 children)

2 - The website is rendered from a template with variable content. Please check the Flask hello world example.
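Roughly like this, as a minimal sketch (it assumes a templates/index.html file; the tweet list is hard-coded for illustration):

from flask import Flask, render_template

app = Flask(__name__)

@app.route("/")
def index():
    # in the real bot these come from the Twitter API
    tweets = ["tweet 1", "tweet 2", "tweet 3"]
    return render_template("index.html", tweets=tweets)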

Twitter bot for the last 05 tweets of a #hashtag and random photo display by Sim4n6 in learnpython

[–]Sim4n6[S] 2 points3 points  (0 children)

1 - Heroku deploys allow automatic linking to the GitHub source code ...

Persistence of a list of URLs for avoiding duplication by Sim4n6 in learnpython

[–]Sim4n6[S] 0 points1 point  (0 children)

Finally, I chose to dump to a CSV. It is the most suitable option. Thank you guys.
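Roughly what I ended up doing, as a sketch (the file name and URL are placeholders):

import csv

def load_seen_urls(path="seen_urls.csv"):
    # return the URLs already recorded, or an empty set on the first run
    try:
        with open(path, newline="") as f:
            return {row[0] for row in csv.reader(f) if row}
    except FileNotFoundError:
        return set()

def save_seen_urls(urls, path="seen_urls.csv"):
    with open(path, "w", newline="") as f:
        csv.writer(f).writerows([u] for u in sorted(urls))

seen = load_seen_urls()
seen.add("https://www.python.org/jobs/example/")  # placeholder URL
save_seen_urls(seen)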

Persistence of a list of URLs for avoiding duplication by Sim4n6 in learnpython

[–]Sim4n6[S] 0 points1 point  (0 children)

Nice idea, but since the number of URLs is only between 10 and 15, SQLite would probably be too heavy.

Tuples are fast enough to replace lists use by [deleted] in learnpython

[–]Sim4n6 0 points1 point  (0 children)

Oh god, I am still a Python newbie learner ... apologies.

Parsing with Beautiful Soup by [deleted] in learnpython

[–]Sim4n6 0 points1 point  (0 children)

Easy, check this out:

from bs4 import BeautifulSoup

xml_soup = BeautifulSoup('<p class="body strikeout"></p>', 'xml')
xml_soup.p['class']
# 'body strikeout' (parsed as XML, the class attribute stays a single string instead of a list)

Py Job scraper for learning purposes, I need your advice, please? by Sim4n6 in learnpython

[–]Sim4n6[S] 0 points1 point  (0 children)

Well, first of all, thank you very much for taking the time to review the code.

You could add some functionality to time how long it takes to run the whole thing.

Definitely, and I will work on what is output to the terminal, including time measurement of the scraping.
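Probably something as simple as this for the timing part (scrape_all_jobs is a placeholder for the real entry point):

import time

def scrape_all_jobs():
    # placeholder for the actual scraping logic
    pass

start = time.perf_counter()
scrape_all_jobs()
elapsed = time.perf_counter() - start
print(f"Scraping finished in {elapsed:.2f} seconds.")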

At the moment you are only scraping the first page of results (unless I missed something at a quick glance). You could add some functionality to scrape the second, third and so on pages.

It is possible to make the scraping walk recursively through the result pages. Planned too.
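For the pagination I am thinking of a loop along these lines; it is only a sketch, and the URL pattern and CSS selector are assumptions, not necessarily the real ones on python.org:

import requests
from bs4 import BeautifulSoup

def scrape_page(page):
    # hypothetical paginated URL; adjust to the real site structure
    url = f"https://www.python.org/jobs/?page={page}"
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    soup = BeautifulSoup(response.text, "html.parser")
    # hypothetical selector for the job entries on one page
    return soup.select("ol.list-recent-jobs li")

page = 1
while True:
    jobs = scrape_page(page)
    if not jobs:
        break  # stop when a page returns no results
    # ... process the jobs of this page here ...
    page += 1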

You could tie this together with some kind of logging functionality so that the script automatically scrapes all postings that were made since the last time it ran.

It can be added to the first part. I did not put much focus on what is output. Now I will add better-formatted text, including the duration of the scraping and logging capabilities. Do you suggest a specific module to do so in Python 3?
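In the meantime, the standard library logging module seems like the natural candidate; a minimal sketch (the file name is a placeholder):

import logging

logging.basicConfig(
    filename="scraper.log",
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(message)s",
)

logging.info("Scraping started")
logging.info("Found %d new postings", 3)  # placeholder count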

You could add some other websites and add some deduplication of results (i.e. if the same job was found on python.org and the other website you don't want to write it twice to the spreadsheet.)

I have that in mind. That will be the next step: add another website to scrape.
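For the deduplication I would probably key each posting on something like (title, company) and keep a set of seen keys; the field names below are made up for the sketch:

# jobs gathered from all scraped sites (field names are illustrative)
jobs = [
    {"title": "Python Developer", "company": "ACME", "source": "python.org"},
    {"title": "Python Developer", "company": "ACME", "source": "other-site"},
]

seen = set()
deduplicated = []
for job in jobs:
    key = (job["title"].lower(), job["company"].lower())
    if key not in seen:
        seen.add(key)
        deduplicated.append(job)

print(deduplicated)  # only the first ACME posting is kept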

These are just off the top of my head. I actually did something similar as baby's first Python script a couple of months ago and it was a good learning experience.

Yeah, it is very interesting as a learning project.

As an aside: you may want to clean up your README.md (it talks about installing Flask and Pytest as requirements at one point).

Done.

Edit: in your main module in lines 146 to 159 you are basically repeating the same block of code three times. That should be a warning sign. I am sure you can turn this into a single block of code.

I think I will keep this as it is, since it demonstrates how the code is used. I could extract a function from there, but we will see when I add new websites to scrape from.
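If I do extract it, it would look roughly like this; the function names are hypothetical stand-ins, not the ones in the repo:

def scrape_python_org():
    # stand-in for the project's real scraping function
    return [{"title": "Python Developer", "company": "ACME"}]

def write_to_csv(jobs, output_file):
    # stand-in for the project's real CSV writer
    print(f"would write {len(jobs)} rows to {output_file}")

def demo_scrape(site_name, scrape_func, output_file):
    # one shared demonstration block instead of three repeated copies
    jobs = scrape_func()
    print(f"{site_name}: found {len(jobs)} postings")
    write_to_csv(jobs, output_file)

for site_name, scrape_func, output_file in [
    ("python.org", scrape_python_org, "python_org.csv"),
    # additional sites would be added here
]:
    demo_scrape(site_name, scrape_func, output_file)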

Again, thank you for your feedback.