Project

General

Profile

Bug #1038

Add support for srun to charmrun script for gni layer

Added by Bilge Acun about 3 years ago. Updated over 2 years ago.

Status:
Merged
Priority:
Normal
Category:
Build & Test Automation
Target version:
Start date:
04/18/2016
Due date:
% Done:

90%

Spent time:

Description

Charmrun for gni layer calls either aprun or mpirun. Edison switched using srun therefore charmrun needs to test for srun as well.
Autobuild tests fail because of this problem.

History

#1 Updated by Eric Bohm almost 3 years ago

  • Assignee set to Michael Robson

#2 Updated by Michael Robson almost 3 years ago

From Core: Attack problem by just failing over to srun if aprun and mpirun both fail to run

#3 Updated by Michael Robson almost 3 years ago

  • % Done changed from 0 to 80
  • Status changed from New to In Progress

Implemented and testing a fix

#4 Updated by Michael Robson almost 3 years ago

  • Status changed from In Progress to Implemented
  • % Done changed from 80 to 90

#5 Updated by Phil Miller almost 3 years ago

  • Status changed from Implemented to In Progress

Getting there, but still not quite:

make[3]: Leaving directory `/scratch1/scratchdirs/acun/autobuild/gni-crayxc/charm/gni-crayxc/examples/collide/collidethread'
make[2]: Leaving directory `/scratch1/scratchdirs/acun/autobuild/gni-crayxc/charm/gni-crayxc/examples/collide'
make[1]: Leaving directory `/scratch1/scratchdirs/acun/autobuild/gni-crayxc/charm/gni-crayxc/examples'
Submitting batch job for> make test OPTS=
 using the command> sbatch /scratch1/scratchdirs/acun/autobuild/gni-crayxc/charmrun_script.19109.sh
Job enqueued under job ID Submitted batch job 722369
squeue: error: Invalid job id: Submitted
squeue: error: Invalid job id: Submitted
squeue: error: Invalid job id: Submitted
squeue: error: Invalid job id: Submitted
squeue: error: Invalid job id: Submitted
squeue: error: Invalid job id: Submitted
squeue: error: Invalid job id: Submitted
autobuild> too many errors, abort!
fatal> error code 1 during remote> ./instead_test.sh charm/gni-crayxc/tmp make  test 
Returned from executing scripts/gni-crayxc/test on remote host
fatal> Test on remote host failed with fatal error (0)

Looks like it's mis-parsing the job ID out of the sbatch output.

#6 Updated by Sam White over 2 years ago

  • Target version set to 6.8.0
  • Category set to Build & Test Automation

#7 Updated by Sam White over 2 years ago

Now the error is this:

Submitting batch job for> make test OPTS=
 using the command> sbatch /scratch1/scratchdirs/acun/autobuild/gni-crayxc/charmrun_script.17216.sh
Job enqueued under job ID Submitted batch job 1732462
squeue: error: Invalid job id: Submitted
cat: /scratch1/scratchdirs/acun/autobuild/gni-crayxc/charmrun_script.17216.stdout: No such file or directory

#8 Updated by Sam White over 2 years ago

Issue is fixed for gni-crayxc but gni-crayxc-smp has a parsing issue in ./instead_test.sh:

Submitting batch job for> make test OPTS=
 using the command> sbatch /scratch1/scratchdirs/acun/autobuild/gni-crayxc-smp/charmrun_script.7180.sh
./instead_test.sh: command substitution: line 71: syntax error near unexpected token `|'
./instead_test.sh: command substitution: line 71: `cat .status.$$ | tail -1 | | awk '{print $4}''

#9 Updated by Michael Robson over 2 years ago

  • Status changed from In Progress to Merged
  • translation missing: en.field_closed_date set to 2016-08-28 19:05:10.883024

Should be all good now

#10 Updated by Phil Miller over 2 years ago

  • Target version changed from 6.8.0 to 6.8.0-beta1

Also available in: Atom PDF