[Diagnostics] Add diagnostics suite with first tool to diagnose SLURM accounting setup #7336
Any specific reason for not keeping this script already as part of
echo "[INFO] Installing requirements on head node..."
ssh "${SSH_ARGS[@]}" "${DEFAULT_USER}@${HEAD_NODE_IP}" "pip install -r ~/${REMOTE_DIR}/requirements.txt"
[Non-blocking] We should create a virtual environment so that we do not install packages system-wide that could be affected by CVEs and flagged during a scan, especially since we are not baking them into the AMI.
Agreed, will do in a follow-up PR.
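The virtual environment suggested above could look roughly like the following. This is only a sketch, not what the PR implements: `build_venv_install_cmd` and the `.venv` directory name are hypothetical, and it assumes `python3` with the `venv` module is available on the head node. `REMOTE_DIR`, `SSH_ARGS`, `DEFAULT_USER` and `HEAD_NODE_IP` are the variables already used by the deployment script.

```shell
#!/usr/bin/env bash
# Sketch only: build_venv_install_cmd and the default ".venv" name are
# hypothetical and not part of this PR.

# Print the command to run on the head node: create a virtual environment
# inside the uploaded folder, then install the requirements into it using
# the environment's own pip (so nothing is installed system-wide).
build_venv_install_cmd() {
  local remote_dir="$1"
  local venv_dir="${2:-.venv}"
  printf 'python3 -m venv ~/%s/%s && ~/%s/%s/bin/pip install -r ~/%s/requirements.txt' \
    "${remote_dir}" "${venv_dir}" "${remote_dir}" "${venv_dir}" "${remote_dir}"
}

# Intended use from the deployment script (not executed here):
#   ssh "${SSH_ARGS[@]}" "${DEFAULT_USER}@${HEAD_NODE_IP}" \
#       "$(build_venv_install_cmd "${REMOTE_DIR}")"
```

The tools would then need to be started with the interpreter inside that environment (for example `~/${REMOTE_DIR}/.venv/bin/python`), which keeps the installed packages out of the system site-packages.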
We will not ship the diagnostics suite until it is stable and relevant enough. For now the goal is to use the suite internally (us and the support engineers), and I wanted the most immediate place to store it, which is the main pcluster package.
Description of changes
Add diagnostics suite with first tool to diagnose SLURM accounting setup.
Notes
User Experience
The user uploads the diagnostics suite to the head node with a one-click script.
The deployment script returns the command to log directly into the folder to execute the diagnosis.
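As an illustration of that last step, a deployment script could print a ready-to-paste login command along these lines. This is a minimal sketch under assumptions: `print_login_cmd` is a hypothetical helper, the folder is assumed to live under the remote user's home directory, and any extra SSH options (key file, proxy settings) that the real script passes are left out.

```shell
#!/usr/bin/env bash
# Sketch only: print_login_cmd is a hypothetical helper, not part of this PR.

# Print an ssh command that opens an interactive login shell on the head
# node, already positioned in the uploaded diagnostics folder.
# -t forces a pseudo-terminal so the remote shell is interactive.
print_login_cmd() {
  local user="$1" host="$2" remote_dir="$3"
  printf "ssh -t %s@%s 'cd ~/%s && exec \$SHELL -l'\n" \
    "${user}" "${host}" "${remote_dir}"
}

# Intended use at the end of the deployment script (not executed here):
#   print_login_cmd "${DEFAULT_USER}" "${HEAD_NODE_IP}" "${REMOTE_DIR}"
```

The single quotes in the printed command keep `~` and `$SHELL` from being expanded locally, so both are resolved on the head node.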
The user logs into the head node in the diagnostics folder:
This is the helper of the first diagnosis tool about SLURM accounting:
This is an example of diagnosis made for SLURM accounting:
Tests
References
Checklist
develop add the branch name as prefix in the PR title (e.g. [release-3.6]).
Please review the guidelines for contributing and Pull Request Instructions.
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.