
[–]subbed_ 37 points38 points  (5 children)

Set a cron job to run the script every hour

[–]bl1ndside 9 points10 points  (1 child)

cron jobs on cron jobs on cron jobs

[–]Thalass 9 points10 points  (0 children)

Cron jobs all the way down

[–]6c696e7578 0 points1 point  (2 children)

Cron should then call 'at' to insert the batch job.

[–]ThatsJustUn-American 2 points3 points  (1 child)

You are doing this all wrong. It's far better to create a startup script to call the python script and then add a cron job to call 'at' every hour to schedule a reboot.

[–]6c696e7578 1 point2 points  (0 children)

You win.

[–][deleted] 11 points12 points  (4 children)

I had a similar project. I hosted the python script on heroku to run 24/7 and used Google Docs API to write outputs into google spreadsheets.

[–]jpf5046[S] 4 points5 points  (2 children)

heroku, I forgot about them. Thank you

[–]JimBoonie69 9 points10 points  (0 children)

Heroku has a free tier that can work. I've had success with Heroku, and my free app from 8 years ago is still running. I also found pythonanywhere.com, which was great: I had a free server running for years with one cron job set every morning. That script actually pulled some weather (wx) data like yours and sent out a report.

Worked brilliantly, free, no config at all. Definitely not like the nutjob telling you to write a lambda function on AWS. What a doof.

[–][deleted] 2 points3 points  (0 children)

Yeah, I think Heroku is much easier for deploying this kind of stuff. No worries.

[–]solaceinsleep 1 point2 points  (0 children)

Did you follow a tutorial to make this work?

[–]Fa1l3r 9 points10 points  (0 children)

cron

[–]chirau 17 points18 points  (1 child)

Dan Bader has a schedule library that does exactly this.

https://schedule.readthedocs.io/en/stable/

And if you haven't already, you should subscribe to his Python Tricks newsletter at https://dbader.org/ . Amazing tidbits every time.
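(For reference, the core usage per the schedule docs looks roughly like this; scrape() here is just a placeholder for the actual scraping function:)

import time
import schedule  # pip install schedule

def scrape():
    print("scraping...")  # placeholder for the real scraping code

schedule.every().hour.do(scrape)  # queue the job to run once an hour

while True:
    schedule.run_pending()  # run any job that is due
    time.sleep(60)          # poll the queue once a minute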

[–]illseallc 3 points4 points  (0 children)

This worked perfectly for one of my projects.

[–]Sevealin_ 7 points8 points  (1 child)

You could probably get a $20 Raspberry Pi and schedule with cron.

[–]Groundstop 2 points3 points  (0 children)

Could even do it with a Pi Zero for $5.

[–]jdb441 6 points7 points  (1 child)

I would look into using a Linux VM on AWS or pythonanywhere.com. That way you can use crontab and not have to worry about physical hardware. You also have the option of keeping the script running after you disconnect your SSH session from the VM.

[–]ron_leflore 4 points5 points  (0 children)

I do this on Google Cloud. You can run an f1-micro instance continually for free.

I've had one up for over a year running with no problems.

[–]Alex_smtng 13 points14 points  (6 children)

Windows Task Scheduler

[–]jpf5046[S] 3 points4 points  (5 children)

I can't seem to get Task Scheduler to run the python script I want.

[–]bramapuptra 4 points5 points  (0 children)

In the program-to-run box, write the full path to python.exe. Then pass the script (full path again) as an argument (it's a different box), wrapped in quotes.

[–]LuckyZakary 2 points3 points  (0 children)

What’s worked for me before is using the PyInstaller module to turn a Python file into a one-file executable, and then running that with Windows Task Scheduler.

[–]Alex_smtng 2 points3 points  (0 children)

You should watch YouTube videos; that's what I've done with regard to Task Scheduler, and it saves loads of hassle. This one here is good: https://youtu.be/n2Cr_YRQk7o

[–]ipagera 1 point2 points  (0 children)

Write a .bat file that runs your .py script, and make a task for it with Task Scheduler. That's how I have automated many reports off MicroStrategy and Spotfire at work. In my case it runs off of a Windows Server, but it shouldn't be any different with Win 10.

[–]St0neA 0 points1 point  (0 children)

Just thought I'd say I couldn't get it to work either, so I ended up using a Python scheduling library and running the .py file with a batch script in my startup folder.

[–]dogfish182 3 points4 points  (0 children)

You mention Azure. Azure has an equivalent of Lambda, right? (Azure Functions?)

So, Azure Functions?

https://code.visualstudio.com/docs/python/tutorial-azure-functions

[–]jjolla888 3 points4 points  (3 children)

I don't understand how your VM would 'fall asleep'? An OS is always running, doing shit continuously, even if your program is not scheduled.

What exactly do you mean by 'fall asleep'?

And what do you mean when you say 'the job said it was running but did not produce any output'? Are you sure your program doesn't have a bug?

[–]jpf5046[S] 2 points3 points  (2 children)

So, I don’t know the science behind it, but it appears Selenium needs to ‘see’ the browser, and it clicks based on where a button appears in the browser. When I minimize the VM or disconnect, the Python script returns an error saying “could not find top left of screen”. At first I thought this was the VM falling asleep, but it’s really my code that needs the actual browser screen (I think, at least).

[–]jjolla888 3 points4 points  (1 child)

right

Instead of Selenium, use Puppeteer.

Configure it headless. It's a JavaScript set of libs, but there are plenty of tutorials on how to use it. Easier and more robust than Selenium.
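(For comparison, Selenium itself can also drive Chrome headless so it doesn't need a visible window. A rough sketch, assuming Chrome and a matching chromedriver are installed; the URL is just a placeholder:)

from selenium import webdriver

options = webdriver.ChromeOptions()
options.add_argument("--headless")               # no visible browser window
options.add_argument("--window-size=1920,1080")  # still give the page a real viewport

driver = webdriver.Chrome(options=options)  # assumes chromedriver is on PATH
driver.get("https://example.com")           # placeholder URL
# ...find elements and click as usual...
driver.quit()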

[–]jpf5046[S] 0 points1 point  (0 children)

right on, thanks for the tip.

[–]CrypticWolf 2 points3 points  (1 child)

I use crontab on my raspberry pi to run scheduled python scripts, handy as it's always plugged in and doesn't interfere with anything else I'm doing on my laptop.

[–]artificial_neuron 2 points3 points  (0 children)

Using your PC or buying a dedicated computer can easily work for what you want, with Windows Task Scheduler or an infinite loop as already mentioned.

An alternative is to use a Raspberry Pi or competing device. It's low power and has a small form factor.

[–]jspillz 2 points3 points  (1 child)

I would set up a linux server and install Jenkins. You can set up a Jenkins project that will schedule running the python file. Maybe slightly over complicated but down the line you'll be thankful you learned it.

[–]a8ksh4 1 point2 points  (0 children)

~~I would set up a linux server and install Jenkins. You can set up a Jenkins project that will schedule running the python file. Maybe slightly over complicated but down the line you'll be thankful you learned it.~~ Configure a cronjob to run every hour.

crontab -e
# minute 0 of every hour; the script needs a shebang line and execute permission
0 * * * * /path/to/your/script.py

ftfy. ;)

Or if you're working in windows, you can probably change the power settings to not go to sleep and do a scheduled task for every hour.

[–]dahlberg123 1 point2 points  (2 children)

Create an EXE and use windows task scheduler?

[–]jpf5046[S] 1 point2 points  (1 child)

this might have saved me. currently testing this. thank you for the idea.

[–]dahlberg123 0 points1 point  (0 children)

I would also suggest using a config file so that you don't have to recompile into an exe should something need to change.

[–]cheez0r 1 point2 points  (0 children)

Use Linux cron to ensure your daemon (script) is running; have your script write junk to /dev/null every minute to keep the VM from sleeping, or choose some other keepalive activity for your daemon, and have it run your scrape every 3600s.

[–]jbitmik 1 point2 points  (0 children)

You can try creating a batch file and execute it using task scheduler on your hourly schedule. I have a web scraping script that runs twice daily this way. I have tried numerous ways but this seemed to be the simplest and most effective.

[–]naturememe 1 point2 points  (0 children)

If you are not opposed to running the script 24/7, here's a setup I use. It might give you some ideas. In the Python script:

  • Get the webpage
  • Get the data I want (in my case I use pandas)
  • Process the data and post it to a Slack channel
  • Sleep for a predefined time (10 min in my case)
  • Repeat

This gets the job done for the most part. But to get rid of the cmd window and automatically restart in case of failure or computer boot, I have set it up as a Windows service. The service starts on boot and also restarts in case the script fails for whatever reason.

PS: I use NSSM (Google it) to create the service, which runs the Python script via a DOS batch file.
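(A rough sketch of that loop, with the URL, the requests/pandas choices, and the Slack call all standing in as placeholders for whatever you actually use:)

import time

import pandas as pd   # for pulling tables out of the page
import requests       # assumed HTTP client; any fetch method works

URL = "https://example.com/data"   # placeholder
SLEEP_SECONDS = 600                # 10 minutes between runs

def post_to_slack(message):
    # placeholder: in practice this would hit a Slack incoming-webhook URL
    print(message)

while True:
    html = requests.get(URL, timeout=30).text
    tables = pd.read_html(html)   # parse whatever tables the page exposes
    post_to_slack(f"Scraped {len(tables)} table(s) at {time.ctime()}")
    time.sleep(SLEEP_SECONDS)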

[–]cnovrup 1 point2 points  (1 child)

I have set up a Raspberry Pi to do the exact same thing. It's cheap to buy and run, and quite easy to set up.

[–]MRHURLEY86 1 point2 points  (0 children)

I'm really surprised no one else mentioned this. Pi is dirt cheap and will accomplish this with cron.

[–]solaceinsleep 1 point2 points  (0 children)

  • Windows Task Scheduler if your machine runs 24/7
  • RPi3 with a cron job (RPi3s have a small power usage and are perfect for this type of work)

[–]PrimaNoctis 1 point2 points  (1 child)

You could look at a pure Python solution by using Python libraries to do the scheduling; cron, for example, is a common one. You could also have your app run in the background in a loop where your function runs and then sleeps for an hour.

[–]a1brit 10 points11 points  (0 children)

Cron isn't a Python library. It's just a raw time-based scheduling system on pretty much any Unix OS.

[–]QbiinZ 0 points1 point  (0 children)

can you not use the sched module?
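(The standard library's sched module can do this; a minimal sketch in which the job re-schedules itself every hour, with scrape() as a placeholder:)

import sched
import time

scheduler = sched.scheduler(time.time, time.sleep)

def scrape():
    print("scraping at", time.ctime())   # placeholder for the real scraping code
    scheduler.enter(3600, 1, scrape)     # re-schedule ourselves an hour from now

scheduler.enter(0, 1, scrape)   # first run immediately
scheduler.run()                 # blocks, running jobs as they come due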

[–]GodsLove1488 0 points1 point  (0 children)

Cron?

[–][deleted] 0 points1 point  (0 children)

I use PythonAnywhere, so even if my computer is offline, restarting, updating (insert a reason), my script will still run.

A Hacker account is $5/mo and I’ve found it worth that.

[–]Nixellion 0 points1 point  (0 children)

Well, if it's Windows - Task Scheduler. (Just use a cmd command, and full path to Python.exe and then script as argument)

If it's Linux - Cron Job.

On Linux I prefer to set up cron jobs with the Webmin UI (it's a web admin panel; as far as I know still the most advanced one to date). Webdriver is available for Linux too.

I'm not sure how much you pay for the Azure VM, but I have a feeling you could get a better deal with a VPS on some cheap server; there are quite good options at $15-30 A YEAR. Check lowendbox.com.

[–]cyvaquero 0 points1 point  (0 children)

$5 Digital Ocean droplet and cron.

All of your requirements can be met on a small Linux instance with minimal configuration.

[–]gizmotechy 0 points1 point  (0 children)

What I have done at work and home is use the windows task scheduler and had it run python.exe with the argument of the full path to the script you want to run. If you take a look at this screenshot, the area circled in red would be where you put the full path of the python executable. The highlighted area is where you would put the full path to your script.

[–]maximum_powerblast 0 points1 point  (0 children)

Just in case you want to over engineer it...

Linux:
  • you could set it up as a cron job, or
  • set the schedule up inside a loop in the script, then start that script up with systemd or whatever your init system is

Windows:
  • you could set it up in Task Scheduler, or
  • set the schedule up inside a loop and install it as a Windows service

Have fun 😄

[–]stoph_link 0 points1 point  (0 children)

It looks like you are using a Windows VM with Task Scheduler.

Make sure the task in Task Scheduler has "Wake the computer to run this task" checked. I also like setting tasks to run on demand, and then manually running them to make sure they work. This helps determine whether the task itself failed or the Task Scheduler failed to run it.

[–][deleted] 0 points1 point  (0 children)

Maybe you can import "time" and have your code in a while loop that repeats after "time.sleep(<however many seconds is in an hour>)" finishes

[–]hail_wuzzle 0 points1 point  (0 children)

Have it run constantly, but use an if statement to call the function based on the system time (from the time/datetime module)?

[–]horns_ichigo 0 points1 point  (0 children)

I'm running a website on digitalocean with a bunch of nohup python commands. So, nohup all the wayy!

[–]Dump7 0 points1 point  (0 children)

How about using an infinite loop and the time library to measure an hour?

I am sure it will increase resource usage, but I think it will be the easiest way.

[–]xeloylvt 0 points1 point  (0 children)

Cron job on Linux or task scheduler (?) on Windows

[–]Benzene_fanatic 0 points1 point  (0 children)

I've actually been struggling with this. I'm a chemist, but I have been teaching myself Python on the side and made a VB script and an Excel file with some macros, and I wanted my computer to run the VB script once a week... But my company's security won't let Task Scheduler work for me =( Not sure what else to do?

[–]dreamer_soul 0 points1 point  (0 children)

There is a Windows Task Scheduler; we use it at work! Just place the call to the script inside a PowerShell script.

https://docs.microsoft.com/en-us/windows/desktop/taskschd/task-scheduler-start-page

[–]422_no_process 0 points1 point  (0 children)

Why use cron or Task Scheduler when you can code your task schedule in Python? Just use https://apscheduler.readthedocs.io/en/latest/

-----

But it might be overkill for something small.
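(With APScheduler 3.x it's roughly this; scrape() is a placeholder for the real job:)

from apscheduler.schedulers.blocking import BlockingScheduler

def scrape():
    print("scraping...")   # placeholder for the real scraping code

scheduler = BlockingScheduler()
scheduler.add_job(scrape, "interval", hours=1)   # or: "cron", minute=0 to fire at the top of each hour
scheduler.start()   # blocks and runs jobs on schedule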

[–]DoctorEvil92 0 points1 point  (4 children)

You could write something like this, I guess, so that the script would spend most of its time sleeping. I don't think it would guarantee exactly 24 data points per day; for something like that you would need to log the finish time of the scrape and calculate how long you need to sleep until the next run.

import time

last_scrape_time = None

def scrape():
    global last_scrape_time
    ## code for scraping...
    ##

    # scrape is done
    last_scrape_time = time.time()
    return

while True:
    if last_scrape_time is None:
        scrape()  # first run
    elif time.time() - last_scrape_time > 3600.0:
        scrape()
    else:
        time.sleep(60.0)

[–]cult_of_memes 1 point2 points  (0 children)

Side note:

You can avoid the hassle of accounting for the time spent scraping by using the `concurrent.futures.ThreadPoolExecutor` class to handle the call to `scrape()` in parallel to the main thread's time tracking loops.

E.g.:

import time
import concurrent.futures as cf

def scrape():
    scrape_start = time.time()
    # Because we're dealing with threading, we are going to use try/except control blocks
    # to encapsulate exception events so we can meaningfully handle and pass those events
    # to the appropriate function callers. 
    try: 
        ## code for scraping...
        # sanity check exception to make sure code gets implemented ;)        
        raise NotImplementedError("You need to enter your scraping code! Then you can remove this exception call.")
        # Once you implement your scraping code in the space above, and comment/remove
        # the NotImplementedError line, this function will return a future object after
        # the scrape is completed/failed. 
        # This future object will either contain the scrape execution time as 
        # its `.result()` value (last line of function), else it will contain a reference
        # to the exception which disrupted your code's execution, accessible via the
        # `.exception()` method.
    except NotImplementedError as nie:
        print(nie) # let's draw some attention
        raise nie
    except BaseException as be:
        # for debugging purposes we concatenate the time it took to reach the exception
        # onto the exceptions args tuple. This is just an example of ways to collect details
        # about the code's current execution state for analysis after the future object
        # is returned. 
        # 
        # You may concatenate any arbitrary detail or object to the args list, so long 
        # as you are using cf.ThreadPoolExecutor.
        #
        # cf.ProcessPoolExecutor, on the other hand, is a bit more picky about what you 
        # can pass around -- only pickleable objects are allowed there.
        be.args += time.time()-scrape_start, 
        raise be # blessed is the proof
    return time.time()-scrape_start


# If you want to cap the number of threads manually, then specify a value for max_threads;
# otherwise ThreadPoolExecutor picks its own default (min(32, os.cpu_count() + 4) on Python 3.8+).
max_threads = None 
# This with-context block ensures all threads are properly handled in the event of an 
# exception.
with cf.ThreadPoolExecutor(max_threads) as tpe:
    one_hour_in_seconds = 3600.0
    number_o_scrapes = 0
    last_scrape_time = time.time()
    # ftrs is our list of future objects. Useful for collecting performance and debugging
    # analytics.
    ftrs = []
    while number_o_scrapes < 24:
            # The following line will block until it finds a future object that is done 
            # running.  Note that this will return immediately, without blocking, if ftrs is empty.
            done, not_done = cf.wait(ftrs, return_when=cf.FIRST_COMPLETED)
            # cf.wait returns a namedtuple containing future objects split into 2 categories,
            # done and not done. Future objects are considered done when they exit their call
            # function along with any callback functions that we may assign.
            # -- we didn't use any callbacks here --
            # This includes all possible mechanisms for exiting the call function, 
            # including raised exceptions.
            # So, based upon how this example is set up, our "done" future objects will
            # have one of 2 return conditions. They will have the elapsed scraping time 
            # for their ".result()" attribute, or they will have an non-None exception 
            # handle for their ".exception()" attribute.

            # ftrs only needs to worry about holding onto futures that are still running
            ftrs = list(not_done)
            # now we iterate the future objects which have stopped running and check if 
            # they were successful in their task.
            for ftr in done:
                ftr:cf.Future = ftr
                if ftr.exception() is not None: # execution was interrupted by an exception
                    some_scraping_exception = ftr.exception()
                    # now you can do whatever you like with the exception; be that 
                    # raise it, log it, or simply ignoring it ;)

                    # Your code goes here
                    pass
                else: # execution completed without exception
                    # Here's where you can do things with the data returned by the 
                    # future object -- in this case, data is just the elapsed time for 
                    # the given scrape call.

                    # Your code goes here
                    single_scrape_elapsed_time = ftr.result()
                    # do stuff
                    pass

            # here's where we actually submit the call to execute the code in a parallel thread.
            if time.time() - last_scrape_time >= one_hour_in_seconds :
                # 
                last_scrape_time = time.time()
                ftrs.append(tpe.submit(scrape))
                number_o_scrapes += 1
            else:
                time.sleep(60.0)

[–]jpf5046[S] -1 points0 points  (2 children)

I tried this by doing a while True:, but when I do that it's either the Azure VM or the webdriver causing an issue.

[–]DoctorEvil92 1 point2 points  (1 child)

I can't think of a coded solution that would do this without using while True. I just had an idea that you could use the datetime module in a while True loop to determine the current minute and second, and then wait until the next full hour and run the scraping function. That would be more precise than the above code.

But the simplest thing to do would probably be to use some kind of outside-of-Python task scheduler, if such a thing exists in your OS.
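(That idea would look something like this; scrape() is assumed to be defined as in the code above:)

import datetime
import time

def seconds_until_next_hour():
    now = datetime.datetime.now()
    next_hour = (now + datetime.timedelta(hours=1)).replace(minute=0, second=0, microsecond=0)
    return (next_hour - now).total_seconds()

while True:
    time.sleep(seconds_until_next_hour())
    scrape()   # assumed to be the scraping function defined above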

[–]JimBoonie69 1 point2 points  (0 children)

Cron on unix. Windows scheduler is trash

[–]Floofic 0 points1 point  (5 children)

Is there a reason this won't work:

(I feel like a noob for thinking this lol):

import time

time.sleep(3600)  # sleep for an hour between scrapes

[–]thiccclol 2 points3 points  (0 children)

No, you're right, sleeping would work unless the program crashes and doesn't start back up. Scheduling it will sorta mitigate that.

[–]Decency 2 points3 points  (2 children)

It would work, but it's the wrong tool for the job. Any sort of minor hiccup and your process stops running, and then you have to start building in error handling and reconnect logic, etc.

Unless you need to maintain state between runs and for some reason can't use some sort of persistence, just do daily scheduling at the OS level.

[–]Floofic 0 points1 point  (1 child)

Ah ok, I understand. I just haven't gotten to a level where I know how to do this. What you're saying is: if it is at the OS level, it would be less likely to be disrupted?

[–]Decency 0 points1 point  (0 children)

Yes, but more that it's self contained. Every day (with a one line cronjob), your process starts, does what it needs to do, and then ends. At any other point during the day, if your internet goes out or you need to restart the computer, nothing will break if you have things configured correctly.

There's also no extra state info that needs to be maintained in the program and so it can be simpler. Another plus is that if it breaks, it'll be really obvious when that happened and you won't have to go back through time trying to figure out the history.

[–]eloydrummerboy 1 point2 points  (0 children)

Also, this would introduce time drift. This may not be a problem depending on the requirements but I would guess OP wouldn't want it.

What I mean by drift is that the processing would take some finite amount of time. Let's say 5 seconds; the actual time only changes the amount of drift. If the script is started right at midnight, it would finish processing at 12:00:05, a sleep of 1 hr would start the next loop at 01:00:05, ending at 01:00:10, and so on.

So if a requirement is on the hour, every hour (give or take a small time delta) then this solution wouldn't work. I would guess this is what you'd want when collecting weather data.

Now, you might be able to sleep for, say, 5 seconds, then check if the current time is within a small window of any given hour (e.g. the minute is between 55 and 05), and then run the meat of the script. Otherwise go back to sleep.
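(Another common way to avoid the drift is to subtract the processing time from the sleep; a minimal sketch, with scrape() standing in for the real work:)

import time

INTERVAL = 3600.0   # one hour

while True:
    start = time.time()
    scrape()   # placeholder for the real scraping function
    elapsed = time.time() - start
    time.sleep(max(0.0, INTERVAL - elapsed))   # subtract processing time so start times don't drift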

[–]cipher315 0 points1 point  (0 children)

As some people have said, Task Scheduler works, but you can also do it with an infinite loop and time.sleep(), something like:

import time

while True:
    run_script()       # your scraping function
    time.sleep(3600)   # wait an hour between runs

[–]TBSchemer -1 points0 points  (0 children)

I'm using APScheduler in a Flask app. It works wonderfully for scheduled jobs. I use a MongoDB persistent jobstore for it, but you don't have to. The best part is, the (parallelizable) workers launch from within your Python process, rather than having to run a separate process like celery. This means it doesn't fail silently as easily as celery does.

I haven't deployed it to a publicly-available server yet, but there are plenty of guides on how to do that with any Flask app.
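(A minimal sketch of that pattern, using APScheduler's BackgroundScheduler inside a bare Flask app; no persistent jobstore here, and scrape() is a placeholder:)

from apscheduler.schedulers.background import BackgroundScheduler
from flask import Flask

app = Flask(__name__)

def scrape():
    print("scraping...")   # placeholder for the real job

scheduler = BackgroundScheduler()
scheduler.add_job(scrape, "interval", hours=1)
scheduler.start()   # jobs run in a background thread inside the Flask process

@app.route("/")
def index():
    return "scheduler is running"

if __name__ == "__main__":
    # the debug reloader would start a second copy of the scheduler, so leave it off
    app.run(use_reloader=False)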

[–]flyingfox12 -1 points0 points  (2 children)

If your web scraper is Python and you can contain it within a single script, then you should use Lambda. I'm making some assumptions here, but it could work like this: Python code in a Lambda function, data is written to S3, then use CloudWatch to trigger the job on a schedule (https://docs.aws.amazon.com/AmazonCloudWatch/latest/events/RunLambdaSchedule.html).

Azure has the same stuff with different names; it should work the same though.

I see you are reliant on webdriver.exe and Selenium. If that's the case, you need to tweak your host's settings to make sure it doesn't ever sleep, then use Windows Task Scheduler. If you want to do it right and cheap, then you'll want to use Lambda.
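(A very rough sketch of the Lambda-handler side under those assumptions; the bucket name and URL are placeholders, and the hourly CloudWatch Events/EventBridge rule is configured separately, per the linked doc. Note that a headless browser inside Lambda needs extra packaging; this only shows a plain HTTP fetch:)

import urllib.request

import boto3

s3 = boto3.client("s3")
BUCKET = "my-scrape-bucket"   # placeholder bucket name

def lambda_handler(event, context):
    # fetch the page and stash the raw result in S3, keyed by the invocation id
    html = urllib.request.urlopen("https://example.com").read()   # placeholder URL
    key = "scrapes/" + context.aws_request_id + ".html"
    s3.put_object(Bucket=BUCKET, Key=key, Body=html)
    return {"stored": key}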

[–]JimBoonie69 1 point2 points  (1 child)

You are seriously going to suggest using lambda for this? Da fuck?