Skip to content

Bump to 24.11.4 #75

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 273 commits into
base: 24.11.ug
Choose a base branch
from
Open

Bump to 24.11.4 #75

wants to merge 273 commits into from

Conversation

itkovian
Copy link
Member

@itkovian itkovian commented Apr 9, 2025

No description provided.

gaijin03 and others added 30 commits March 13, 2025 12:38
Continuation of a0a14da

Ticket: 21266
Ticket: 22340
Cherry-picked: db16925
Cherry-pick !743 into slurm-24.05

See merge request SchedMD/dev/slurm!751
Update slurm.spec and debian/changelog as well.
If stepd_connect() should jump to rwfail, then it will return a file
descriptor that was already close()ed.

Ticket: 22315
Changelog: slurmd - Avoid crash due when slurmd has a communications
  failure with slurmstepd.
Cherry-picked: 3c944ee
Cherry-pick !747 into slurm-24.05

See merge request SchedMD/dev/slurm!749
Cherry-pick !728 into slurm-24.11

See merge request SchedMD/dev/slurm!756
Cherry-pick !764 into slurm-24.11

See merge request SchedMD/dev/slurm!768
Entry had SLURM_COMMUNICATIONS_MISSING_SOCKET_ERROR when it should have
had SLURM_COMMUNICATIONS_INVALID_OUTGOING_FD.

Issue: 50321
Ticket: 22312
Cherry-picked: 9a6fa96
Cherry-pick !724 into slurm-24.11

See merge request SchedMD/dev/slurm!771
The allocated fields of the yaml_event_t in _yaml_to_data() were not being
freed.

Ticket: 22348
Changelog: Fix memory leak when parsing yaml input.
Cherry-picked: cd8303f
Continuation of commit d31cf03

Ticket: 21398
Cherry-picked: 2d3d07d
These plugins were removed previously.
Continuation of commit d31cf03

Ticket: 21398
Cherry-picked: 97218cb
Ticket: 21398
Cherry-picked: de20dc6
Cherry-pick !441 into slurm-24.11

See merge request SchedMD/dev/slurm!778
Cherry-pick !758 into slurm-24.11

See merge request SchedMD/dev/slurm!775
Cherry-pick !780 into slurm-24.11

See merge request SchedMD/dev/slurm!781
This is a regression from 2e60ebc. We should only validate and take
actions if part_desc->preempt_mode isn't set to NO_VAL16.

Changelog: Prevent slurmctld from showing error message about
 PreemptMode=GANG being a cluster-wide option for `scontrol update part`
 calls that don't attempt to modify partition PreemptMode.
Ticket: 22360
Cherry-picked: c8faf92
A partition without an explicit preempt_mode set is
NO_VAL16 which will test positive against PREEMPT_MODE_GANG. Only
preserve PREEMPT_MODE_GANG if the partition has an explicit preempt_mode
set.

See 509551c

Changelog: Fix setting GANG preemption on partition when updating
 PreemptMode with scontrol.
Ticket: 22360
Cherry-picked: e9a45ec
Cherry-pick !765 into slurm-24.11

See merge request SchedMD/dev/slurm!785
Cherry-pick !789 into slurm-24.11

See merge request SchedMD/dev/slurm!801
If the slurmstepd.scope/slurmd cgroup was created while having CoreSpec
or MemSpec limits in the node, and then the spec limits were removed in
slurm.conf and the slurmd restarted, the slurmd cgroup would remain with
the old limits.

This commit unsets the cpu and memory limits of slurmstepd.scope/slurmd
cgroup at slurmd initialization.

Changelog: Fix CoreSpec and MemSpec limits not being removed from previously
 configured slurmd.
Ticket: 20943
Cherry-picked: 9be763a
wickberg and others added 30 commits April 30, 2025 14:13
In case of 2nd takeover by backup slurmctld bit cache is already
initialized, but it's called with the same size. This shouldn't be an
issue and we can safely continue.

Ticket: 21670
Changelog: Fix backup slurmctld failure on 2nd takeover.
Cherry-picked: 3b62fec
If the query returns an empty result, the function _cluster_remove_wckeys()
returns early but does not free the query string or the mysql result.

Ticket: 20771
Cherry-picked: c4e4a62
vals is never xfree()-ed, leading to a definite leak.

Ticket: 20771
Cherry-picked: 26caa12
Cherry-pick !1069 into slurm-24.11

See merge request SchedMD/dev/slurm!1079
Cherry-pick !639 into slurm-24.11

See merge request SchedMD/dev/slurm!1076
This is only useful in a corner case used by the internal QA environment.

Cherry-picked: 4f2e6da
Cherry-pick !1092 into slurm-24.11

See merge request SchedMD/dev/slurm!1102
Cherry-pick !1083 into slurm-24.11

See merge request SchedMD/dev/slurm!1111
priority_g_thread_start() needs to be called to start the decay thread.

Ticket: 22722
Changelog: Testsuite - fix python test 130_2.
Cherry-picked: da5c1b7
Cherry-pick !1119 into slurm-24.11

See merge request SchedMD/dev/slurm!1122
Cherry-pick !1112 into slurm-24.11

See merge request SchedMD/dev/slurm!1126
Continuation of c05c957

The slurmstepd uses slingshot_stepd_init() to set
slingshot_config.destroy_retries. If destroy_retries was not configured in
SwitchParameters it was being left as 0. Now it will default to
SLINGSHOT_CXI_DESTROY_RETRIES, which is 5.

Ticket: 22715
Cherry-picked: 22b6c55
Cherry-pick !1118 into slurm-24.11

See merge request SchedMD/dev/slurm!1127
Changelog: Fix security issue where a coordinator could add a user with
 elevated privileges. CVE-2025-43904.
Issue: 50391
Fix user add coord issue - 24.11

See merge request SchedMD/dev/slurm!1132
Update slurm.spec and debian/changelog as well.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.