Clarify work-item scope atomics (and memory model in general) #849

Pennycook · 2025-06-25T16:18:46Z

This started as an attempt just to clarify work-item scope atomics, but I think the fix has ended up addressing some broader long-standing issues with the memory model.

I think figuring out the steps that a SYCL application needs to take to guarantee "sequential consistency" is the only remaining open issue in the memory model, and I'm afraid to touch it. 😆

Closes #665.

In OpenCL, these atomics are only required to support a very specific use-case involving images, and are forbidden in all other contexts. In SYCL, we would like a work-item to be viewed as a degenerate case of a group containing a single work-item. Work-item scope atomics should thus be permitted, and their effect should be equivalent to non-atomic operations.

An implementation of atomic_ref<T> that is not lock-free needs to know which work-items may access the lock in order to decide where to allocate the lock. This additionally serves as a clarification of the behavior of work-item scope; using an atomic_ref with work-item scope at the same time as an atomic_ref with broader scope is invalid.

Even when using two atomic_ref objects with the same DefaultScope, it's possible to encounter a data race by overriding the scope parameter of individual operations. This is a general clean-up but was motivated by work-item scope atomics: any potentially concurrent use of work-item scope atomics and atomics with a different scope results in undefined behavior.

The ISO C++ synchronizes-with relationship does not account for scopes. The scopes do not need to match exactly, but there are restrictions on which pairs of scopes are valid. This is the final part of the clarification for work-item scope atomics; a work-item scope atomic cannot synchronize with the atomic operations performed by other work-items, and so their effects are not guaranteed to be visible to other work-items without some other synchronization taking place.

The previously proposed wording suggested that any difference in scopes would lead to undefined behavior, which was inconsistent with the paragraph immediately afterwards about which atomics synchronize-with each other.

TApplencourt · 2025-06-26T14:41:37Z

Potentially concurrent conflicting actions with different memory scopes may lead
to a data race, resulting in undefined behavior.
An atomic operation _A_ with scope _S~1~_ operating on the same memory location
as atomic operation _B_ with scope _S~2~_ is a data race if:

* The work-items which executed _A_ and _B_ are not both in the same group of
 work-items associated with scope _S~1~_; or
* The work-items which executed _A_ and _B_ are not both in the same group of
 work-items associated with scope _S~2~_.

same group of work-items associated

We use the term "set"; should we replace "group" with "set" here?

The set of work-items and devices to which the memory ordering constraints of a given atomic operation apply is controlled)

I tried to rewrite this little section, but really not sure if it's better...

Let:

- An atomic operation _A1_ with scope _MS~1~_ operating on memory _M1_, where _MS~1~_ is associated with a set of work-items _S~1~_
- An atomic operation _A2_ with scope _MS~2~_ operating on memory _M2_, where _MS~2~_ is associated with a set of work-items _S~2~_

A data race exists if and only if _M1_ == _M2_ and _S~1~_ != _S~2~_

Pennycook · 2025-06-26T15:03:36Z

We use the term "set"; should we replace "group" with "set" here?
...
I tried to rewrite this little section, but really not sure if it's better...

I think I'll need to read (and re-read!) things a few times before I can determine which wording I prefer.

But to shed some light on why I chose the wording I did... One thing that is pretty subtle here is that it's not enough to just compare the memory_scope values themselves. As a concrete example of what I mean, this is a data race:

float* ptr = 0x42;

// Work-item 0, in Work-group 0
atomic_ref<float, memory_order::seq_cst, memory_scope::work_group>(*ptr) += 1;

// Work-item 0, in Work-group 1
atomic_ref<float, memory_order::seq_cst, memory_scope::work_group>(*ptr) += 1;

Even though these atomics use the same memory_scope value, the work-items themselves are in different work-groups. This is what I was trying to convey when talking about the "group of work-items associated with the scope".
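To make the "group of work-items associated with the scope" idea concrete, here is a minimal sketch in plain C++ (not SYCL code). It models only the rule quoted earlier in this thread, and every name in it (`Scope`, `WorkItem`, `same_group`, `is_data_race`) is hypothetical and invented for this illustration. It also assumes that system scope always covers both work-items, which is a simplification.

```cpp
// Plain C++ model, not SYCL. All names are hypothetical.
enum class Scope { work_item, sub_group, work_group, device, system };

// Position of a work-item in the execution hierarchy.
struct WorkItem {
    int device;      // which device runs the work-item
    int work_group;  // work-group index within the device
    int sub_group;   // sub-group index within the work-group
    int local_id;    // index within the sub-group
};

// True if a and b are both in the same group of work-items associated with s.
constexpr bool same_group(Scope s, const WorkItem& a, const WorkItem& b) {
    if (s == Scope::system) return true;  // simplifying assumption
    bool same = a.device == b.device;
    if (s == Scope::device) return same;
    same = same && a.work_group == b.work_group;
    if (s == Scope::work_group) return same;
    same = same && a.sub_group == b.sub_group;
    if (s == Scope::sub_group) return same;
    return same && a.local_id == b.local_id;  // Scope::work_item
}

// Two atomic operations on the same location race if either scope's
// group fails to contain both work-items.
constexpr bool is_data_race(const WorkItem& a, Scope sa,
                            const WorkItem& b, Scope sb) {
    return !same_group(sa, a, b) || !same_group(sb, a, b);
}

// Work-item 0 of work-group 0 and work-item 0 of work-group 1, as in the
// example above: identical memory_scope values, but still a data race.
static_assert(is_data_race({0, 0, 0, 0}, Scope::work_group,
                           {0, 1, 0, 0}, Scope::work_group),
              "work_group scope does not span two work-groups");
```

In this model, switching both operations to `Scope::device` makes `is_data_race` return false for the same pair of work-items, because the device-scope group contains both of them.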

The OpenCL version of this wording defines the concept of "inclusive scope" to try and explain this (see here) but I personally think their wording is quite unclear.

keryell

Thanks!

adoc/chapters/programming_interface.adoc

tomdeakin · 2025-08-21T15:14:33Z

WG approved merge as clarification to SYCL 2020

tomdeakin · 2025-08-27T15:08:13Z

@gmlueck Please can you add to your cherry-pick list. Thanks.

gmlueck · 2025-09-03T21:25:05Z

adoc/chapters/architecture.adoc

+* The work-items which executed _A_ and _B_ are not both in the same group of
+ work-items associated with scope _S~1~_; or
+* The work-items which executed _A_ and _B_ are not both in the same group of
+ work-items associated with scope _S~2~_.


John and I were having a side discussion about this part of the PR before he left. The question is whether operations A and B need to have the same scope, or whether it is sufficient for the scopes to include both work-items. To illustrate, consider the following example:

// Work-item A
sycl::atomic_ref<int, memory_order::release, memory_scope::work_group> a(mem);
a.store(1);

// Work-item B (in the same work-group as A)
sycl::atomic_ref<int, memory_order::acquire, memory_scope::device> a(mem);
int x = a.load();

Note that the two operations have different scopes, but each scope includes both A and B.
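The two candidate rules can be contrasted with a small sketch in plain C++ (not SYCL code). All names are hypothetical. The "matching" rule below is only a model of the stricter reading discussed in this comment, not a quotation of any specification. The sketch assumes scopes are totally ordered from narrowest to widest, and it takes the narrowest scope whose group contains both work-items as an input.

```cpp
// Plain C++ model, not SYCL. All names are hypothetical.
// Scopes are ordered from narrowest to widest so they can be compared.
enum class Scope : int { work_item = 0, sub_group, work_group, device, system };

// narrowest_common is the narrowest scope whose group contains both
// work-items, e.g. Scope::work_group for two work-items that share a
// work-group but not a sub-group.
constexpr bool includes_both(Scope s, Scope narrowest_common) {
    return static_cast<int>(s) >= static_cast<int>(narrowest_common);
}

// Rule as worded in this PR: each operation's scope must include both
// work-items, but the two scopes may differ.
constexpr bool race_free_inclusive(Scope sa, Scope sb, Scope narrowest_common) {
    return includes_both(sa, narrowest_common) &&
           includes_both(sb, narrowest_common);
}

// Stricter reading: the scopes must also be identical.
constexpr bool race_free_matching(Scope sa, Scope sb, Scope narrowest_common) {
    return sa == sb && includes_both(sa, narrowest_common);
}

// The example above: A uses work_group scope, B uses device scope, and
// both work-items are in the same work-group.
static_assert(race_free_inclusive(Scope::work_group, Scope::device,
                                  Scope::work_group),
              "inclusive rule accepts differing scopes");
static_assert(!race_free_matching(Scope::work_group, Scope::device,
                                  Scope::work_group),
              "matching rule rejects differing scopes");
```

The two rules only disagree when the scopes differ and both still cover the pair of work-items, which is exactly the case in question.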

The question is whether SYCL should guarantee that these operations are atomic even though the scopes are different. My first question to John was about Intel hardware. We think that Intel hardware is guaranteed to be atomic in this scenario, so we have no concerns from the standpoint of our own ability to implement the proposed SYCL wording.

However, then we realized that the OpenCL specification seems to not guarantee atomicity in this case. Instead, the OpenCL wording seems to require both operations to have the same scope in order to guarantee atomicity. There is some debate, though, about whether the OpenCL wording should be changed. There is an open internal issue against the OpenCL specification on this point:

https://gitlab.khronos.org/opencl/OpenCL-Docs/-/issues/367

The SYCL WG should consider whether we want to adopt the wording that John proposes in this PR, even though it guarantees atomicity in a case that OpenCL does not, or whether we should adopt the same language about atomicity that is currently in the OpenCL spec.

gmlueck · 2025-09-03T21:26:23Z

I realize that the WG approved this already, but I was wondering if we could reconsider. There are two points I would like to raise:

1. The review comment here did not get resolved. In fact, John's last response ends with "... so maybe it's better to take this out". I think this means that he intended to remove that paragraph before merging this PR.
2. John and I were having an internal side discussion about this PR, which never really got resolved before he left. That conversation was initially about Intel hardware (which is why it was internal), but then it branched out into OpenCL semantics. In retrospect, we should have made the conversation public at that point. I tried to capture our discussion in this comment.

tomdeakin · 2025-09-04T15:50:02Z

Further discussion required, and then a re-review.

We decided that this requirement doesn't actually help implementations.

Pennycook added 4 commits June 25, 2025 15:42

Pennycook added this to the SYCL 2020 milestone Jun 25, 2025

Pennycook added the "memory model" and "clarification" labels Jun 25, 2025

sycl-issue-bot bot mentioned this pull request Jun 25, 2025

[Spec change] Clarify work-item scope atomics (and memory model in general) KhronosGroup/SYCL-CTS#1119

Closed

Pennycook force-pushed the clarification/work-item-scope-atomics branch from ea207e8 to 7f05756 Compare June 26, 2025 08:16

Fix a bug in description of data races with scopes

eb0b73a


Pennycook force-pushed the clarification/work-item-scope-atomics branch from 7f05756 to eb0b73a Compare June 26, 2025 08:26

keryell approved these changes Jul 31, 2025


gmlueck reviewed Jul 31, 2025


TApplencourt approved these changes Aug 1, 2025


gmlueck reviewed Sep 3, 2025


gmlueck added the "Agenda" label (to be discussed during a SYCL committee meeting) Sep 3, 2025

Remove requirement on DefaultScope

04a8e80

