web2pyTM cron

The web2py cron provides the ability for applications to execute tasks at preset times, in a platform independent manner (tested on Windows XP, Linux and MacOS X, but should work on any python 2.5 supported platform). Cron functionality is defined by a crontab file (in regular crontab syntax) in your application's cron directory. This also means that every application can have a separate cron configuration and that cron config can be changed from within web2py without affecting the host OS itself.

Example:
1.
2.
3.
4.
5.
6.
7.
8.
9.
 0-59/1 *       *       *       *       root python /path/to/python/script.py

*/30 * * * * root **applications/admin/cron/expire_sessions.py

30 3 * * * root *applications/admin/cron/db_vacuum.py

@reboot root *mycontroller/myfunction

@hourly root *applications/admin/cron/expire_sessions.py
As you might have noticed, some of the lines use extensions to regular cron syntax to provide additional web2py functionality.

Modes of use

Depending on how you are invoking web2py, there are four modes of operation for web2py cron. The default is hard cron if you are using the built-in web server, in all other cases the default is soft-cron.

Soft cron

Soft cron is the default if you are using CGI, FASTCGI or WSGI. Your tasks will be executed in the first call (page load) to web2py after the time specified in crontab (but AFTER processing the page, so no delay to the user is visible). Obviously, there is some uncertainty exactly when the task will be executed depending on the traffic the site receives. Also, the cron task may get interrupted if the web server has a page load timeout set. If these limitations are not acceptable, see external cron. Soft cron is a reasonable last resort, but if your web server allows other cron methods, they should be preferred over soft cron.

Hard cron

Hard cron is the default if you are using the built-in web server (either directly or via Apache mod_proxy). Hard cron is executed in a parallel thread, so unlike soft cron there are no limitations with regard to run time or execution time precision.

External cron

External cron is not default in any scenarios, but requires you to have access to the system cron facilities. It runs in a parallel process, so none of the limitations of soft cron apply. This is the recommended way of using cron under WSGI or FASTCGI.

Example of line to add to the system crontab, (usually /etc/crontab):
1.
0-59/1 * * * * web2py cd /var/www/web2py/ && python web2py.py -C -D 1 >> /tmp/cron.output 2>&1
If you are running external cron, make sure you add the -N command line parameter to your web2py startup script or config so there is no collision of multiple types of cron.

No cron

In case you do not need any cron functionality within a particular process, you can use the -N command line parameter to disable it. Note that this might disable some maintenance tasks (like the automatic cleaning of session dirs). The most common use of this function:

Cron extensions

Web2py cron has a some extra syntax to support web2py application specifics.

Calling scripts in the web2py environment

If the task/script is prefixed with an asterisk and ends with ".py", it will be executed in the web2py environment. This means you will have all the controllers and models at your disposal. If you use TWO asterisks, the MODELs will not be initialized. This is the recommended way of calling as it has less overhead and avoids potential locking problems.

CAVEATS




Example:
1.
2.
3.
*/30    *       *       *       *       root **applications/admin/cron/expire_sessions.py

30 3 * * * root *applications/my_app/cron/db_vacuum.py

Calling controller functions

Same as above, but a function from a controller is executed instead of a separate script file. Note that unlike above, there is no need to specify a path. The controller and function will be that of the invoking application. Take special care about the caveats listed above.

Example:
1.
*/30    *       *       *       *       root *mycontroller/myfunction

Application initialization

If you specify @reboot in the first field in the crontab file, the given task will be executed only ONCE, on web2py startup. You can use this feature if you want to precache, check or initialize data for an application on web2py startup. Note that cron tasks are executed in parallel with the application - if the application is not ready to serve requests until the cron task is finished, you should implement checks to reflect this. Example:
1.
@reboot      *       *       *       *       root *mycontroller/myfunction