Dissecting the Magento Cron System
Magento has a really neat cron system that lets developers run scheduled jobs at defined intervals, much like the cron service found on most Unix-like systems. I wrote a post a over two years ago covering how to use the Magento cron, but in this post I explore mechanism behind the service.
The cron, as a Magento module, is a simple example of how things should be done with Magento, which is why I think it makes it an interesting service to examine.
Kicking it all off
We put an entry in our Operating Systemβs crontab to handle Magentoβs crontab with something like: */5 * * * * www-data /usr/bin/php /var/www/magento/cron.php
. We generally set things off every 5 minutes, which is more than often enough for any store.
Crontab as an event
Magento makes heavy use of the Event-Observer pattern to allow modules to hook into key points, or events, through the course of normal program execution. The cron service takes advantage of this by defining a new event called the crontab in the file cron.php. This allows for the actual cron.php script to be very concise (only 8 lines, really) and easily extensible, so that if another module needs to hook into the raw loading of the cron.php script, then it can do so through this same event.
XML-based Cron Entries
Modules wishing to make use of the cron do not hook directly into this crontab event otherwise theyβd either have their event fired every time we hit cron.php, remember weβre running it every 5 minutes at the moment, or theyβd have to manage their own inhibition parameters, e.g. make sure that they only run every day, or every 3 hours, etc.
Instead they define a config.xml, discussed here, entry describing the model and method to be run, as well as a standard cron expression to define how frequent the method should be executed.
Managing Schedules
The method thatβs executed, by default when the crontab event is fired is cron/observer::dispatch
which handles the schedule and execution of the jobs defined in the various config.xml files. The dispatch function takes care of:
- Schedule creation
- Schedule cleanup
- Schedule execution
Magento converts the config.xml into pending jobs in the database. Once conditions are satisfied, i.e. the desired execution time is now or in the past and the deadline has not been missed, the job will be executed and itβs status updated to complete.
Cron instances could, quite easily, take longer than 5 minutes which, with our setup, would cause another call to the cron.php script. To avoid duplicate runs, Magento locks the job once the job has been started.
And thatβs it really. A really concise module, with most of the heavy lifting done by the cron/schedule model handling the cron expression parsing. A perfect example of a module that uses Event-Observers well, custom config.xml
entries and a database resource.