So I'm been running a c-program that takes rougly 25 to 60 hours to complete. It's purpose is to analyze data sets, and in this case rather large data sets (~1Gb). The Mac I'm running them on has OS X 10.6.6 and has 2 6-Core Intel Xeon Processors.
So I've written short shell scripts to run my program, and they seemed to run fine when I submitted my jobs last night. I get into work this morning and check on how they are doing.
I run "top" and I notice that all three programs are in the STATE "stuck". Every few screen updates they will momentarily go into the STATE "running". But most of the time they are "stuck". I'm concurrently running the same programs (different data), on a Red Hat machine, and they seem to be doing just fine. I've also successfully run my program on much smaller data sets many times in the past. I've also ran it successfully on similarly sized datasets.
I thought that perhaps that they got stuck do to memory issues. However I have 32Gb, with 2.5Gb active, 23Gb inactive, 27Gb used, and 5Gb free.
So my questions:
What are the causes to programs getting "stuck"?
How do I prevent programs from getting "stuck"?
Is there a solution to get programs 'unstuck'?
Killing the processes does not seem like a good option. If my program has run 20 hours and will take 50+hours to complete, it would be an enormous waste of time to kill them. What would guarantee them from not getting stuck again? Thanks everyone for your time.
~spooner
So I've written short shell scripts to run my program, and they seemed to run fine when I submitted my jobs last night. I get into work this morning and check on how they are doing.
I run "top" and I notice that all three programs are in the STATE "stuck". Every few screen updates they will momentarily go into the STATE "running". But most of the time they are "stuck". I'm concurrently running the same programs (different data), on a Red Hat machine, and they seem to be doing just fine. I've also successfully run my program on much smaller data sets many times in the past. I've also ran it successfully on similarly sized datasets.
I thought that perhaps that they got stuck do to memory issues. However I have 32Gb, with 2.5Gb active, 23Gb inactive, 27Gb used, and 5Gb free.
So my questions:
What are the causes to programs getting "stuck"?
How do I prevent programs from getting "stuck"?
Is there a solution to get programs 'unstuck'?
Killing the processes does not seem like a good option. If my program has run 20 hours and will take 50+hours to complete, it would be an enormous waste of time to kill them. What would guarantee them from not getting stuck again? Thanks everyone for your time.
~spooner