Softpanorama

May the source be with you, but remember the KISS principle ;-)
Contents Bulletin Scripting in shell and Perl Network troubleshooting History Humor

Job Post Mortem

SGE produces job post mortem, via setting execd_param KEEP_ACTIVE=ERROR

Set “KEEP_ACTIVE=TRUE” for execd_params in the global or hostspecific configuration disables the job spool dir deletion. That means that directory contents can be analyzed after job failed...

Job spool dir retention

Job spool dir file contents

Job spool dir retention

Trigger SGE Debug Output

  • It contains environment. 
    */active_jobs_dir/*/environment
    
    It's hard to debug when something is running as deamon. So you need to prevent daemonizing with SGE_ND=1.
    SGE_DEBUG_LEVEL defines the emount of information 
    
    execd_param -- important KEEP_ACTIVE=ERROR
    

    Recommended Links

    Softpanorama hot topic of the month

    Softpanorama Recommended

    Top articles

    Sites

    Top articles

    Sites

    Grid Engine Trouble Shooting HOWTO

    Daniel Templeton SGE developer, one of the DRMAA evangelists, frequent Sun blogger on SGE topics
    Site: http://blogs.sun.com/templedf/
    Permalink to post on debug output
    http://blogs.sun.com/templedf/entry/using_debugging_output
     

    Enabling Debugging Output (DanT's Grid Blog)