Frequently Asked Questions/fr: Difference between revisions

Jump to navigation Jump to search
Created page with "Ce message et le message similaire ''slurmstepd: error: Exceeded job memory limit at some point'' peuvent être trompeurs car dans certains cas, ils indiquent une condition sa..."
(Created page with "== Message d'erreur ''slurmstepd: error: Exceeded step memory limit at some point'' ==")
(Created page with "Ce message et le message similaire ''slurmstepd: error: Exceeded job memory limit at some point'' peuvent être trompeurs car dans certains cas, ils indiquent une condition sa...")
Line 52: Line 52:
== Message d'erreur ''slurmstepd: error: Exceeded step memory limit at some point'' ==
== Message d'erreur ''slurmstepd: error: Exceeded step memory limit at some point'' ==


This and the similar message, "slurmstepd: error: Exceeded job memory limit at some point" are potentially misleading. In some, but not all, cases it signifies a harmless condition. If your job otherwise appears to have terminated normally, that is, if all expected output is present, then you should ignore these messages. Do not increase your memory requests simply to suppress these messages!
Ce message et le message similaire ''slurmstepd: error: Exceeded job memory limit at some point'' peuvent être trompeurs car dans certains cas, ils indiquent une condition sans gravité. Si la tâche semble s'être terminée normalement puisqu'elle a produit tous les résultats attendus, ne tenez pas compte de ces messages et ne demandez pas plus de mémoire pour les éviter.


If your job was actually killed for exceeding the requested memory, the key word "Killed" should appear in the standard error output of the job.  
If your job was actually killed for exceeding the requested memory, the key word "Killed" should appear in the standard error output of the job.  


However, if you are using job dependencies (<code>dependency=afterok:<jobid></code>), then either of the messages "Exceeded job memory limit" or "Exceeded step memory limit" probably means that the dependent job was cancelled. We are [https://bugs.schedmd.com/show_bug.cgi?id=3820 in discussion] with the Slurm development team about fixing this behaviour, as well as suppressing the misleading messages in non-fatal circumstances.
However, if you are using job dependencies (<code>dependency=afterok:<jobid></code>), then either of the messages "Exceeded job memory limit" or "Exceeded step memory limit" probably means that the dependent job was cancelled. We are [https://bugs.schedmd.com/show_bug.cgi?id=3820 in discussion] with the Slurm development team about fixing this behaviour, as well as suppressing the misleading messages in non-fatal circumstances.
rsnt_translations
53,464

edits

Navigation menu