Batch Job Dependencies Using Open Source/Free Software

I run a large data warehouse plant where we have a lot of nightly jobs running concurrently; however, many of them depend on an extract or data load process finishing before they can start. We currently use an expensive scheduling system to schedule these jobs.

Is there any way to set up job dependencies using an open source or free Unix/Linux tool such as cron?

Moving to an open source solution would be great and save us lots!

Regards, Matt

Best answer

I asked a similar question last year (maybe Server Fault would be a better place these days?). Unfortunately, there doesn't seem to be a simple, install-and-go solution.

Answers

Cfengine can be made to do something like this. You can set it up as a cron replacement, running arbitrary commands at scheduled times, and you can set up "classes" so that certain actions are performed only if certain classes are enabled. Classes can be anything from "this is a Linux system" to "it's currently between 5 and 10 minutes after the hour" to "system load is above value x" to "this arbitrary shell command that I just specified returned true", so you could set up your classes to indicate your job dependencies.

I doubt that this would be as powerful as a dedicated scheduling system (dependencies would have to be set up manually by configuring classes, and running jobs concurrently would require extra scripting or configuration work), but it is free and open source.

Version 2 of Cfengine was not particularly pleasant to work with (in the words of Seth Vidal, "it's [sic] syntax kills kittens"). I haven't used Cfengine 3. Puppet has design goals similar to Cfengine's and may be easier to work with.

Cron doesn't handle this natively. Can the process that loads the data write out a status file upon completion? This would allow subsequent jobs to check for the status file before doing their real work. Obviously, this isn't an ideal solution (too many points of failure, I suspect), but perhaps it's good enough for what you're trying to accomplish.
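The status-file idea could be sketched roughly as follows. This is only an illustration, not a tested scheduler: the function names (`mark_done`, `wait_for`, `run_after`), the timeout and polling defaults, and any file paths are assumptions invented for the example. The upstream job would call `mark_done` as its last step, and each dependent job started by cron would go through `run_after`.

```python
import subprocess
import sys
import time
from pathlib import Path


def mark_done(status_file: Path) -> None:
    """Called by the upstream (extract/load) job as its final step."""
    status_file.parent.mkdir(parents=True, exist_ok=True)
    # Write to a temporary name first, then rename, so a dependent job
    # never sees a half-written status file.
    tmp = status_file.with_name(status_file.name + ".tmp")
    tmp.write_text(time.strftime("%Y-%m-%dT%H:%M:%S"))
    tmp.replace(status_file)


def wait_for(status_files, timeout_s=3600.0, poll_s=30.0) -> bool:
    """Poll until every status file exists; return False on timeout."""
    deadline = time.monotonic() + timeout_s
    while True:
        if all(Path(f).exists() for f in status_files):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_s)


def run_after(status_files, command, timeout_s=3600.0, poll_s=30.0) -> int:
    """Run `command` (an argument list) once all dependencies are marked done.

    Returns the command's exit code, or 1 if the dependencies never appeared.
    """
    if not wait_for(status_files, timeout_s, poll_s):
        print("dependencies not met, giving up", file=sys.stderr)
        return 1
    return subprocess.run(command).returncode
```

One design point to watch: a status file left over from the previous night would satisfy the check immediately, so the files need to be deleted before each run or given a per-run name (for instance, one that includes the date). A crashed upstream job simply never writes its file, and the dependent job times out instead of running on incomplete data.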

Schedulix is an open source workload automation solution for Linux: http://www.schedulix.org




