Thead-safe library initialization #37

yugr · 2025-01-18T15:38:14Z

@Artem-B you once worked on adding multi-threading support to Implib.so. Perhaps you could review this Pthread-based implementation? In particular I'm not sure I test this enough.

codecov-commenter · 2025-01-18T15:39:24Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 88.57%. Comparing base (c6ec69e) to head (d4b6bf5).

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##           master      #37      +/-   ##
==========================================
+ Coverage   88.49%   88.57%   +0.08%     
==========================================
  Files           1        1              
  Lines         391      394       +3     
==========================================
+ Hits          346      349       +3     
  Misses         45       45

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Artem-B

It's not clear, what exactly is intended to be protected by the mutex.

The patch appears to serialize the load_library call, the code seems to leave the door open for data races between calls modifying the data used by load_library(). E.g
nothing stops user from calling *_tramp_reset while load_library in another thread has already set lib_handle, but didn't set do_dlclose yet. It will reset lib_handle to NULL,
and we'll end up with the library never dlclosed.

I don't think we care much about making locking particularly fine-grained, so I would consider locking all API calls and guarantee thread safety for both code execution and the internal implib state.

As for the testing, thread sanitizer does a very good job catching threading issues, but it relies on comprehensive enough set of tests. I do not have much experience in creating such tests. That's part of the reason I'd rather rely on everything touching shared state being under lock so we have less risk of encountering the race conditions we didn't think of.

One useful way to think whether the code is thread safe is "if I stop this thread here, can I break things from other threads?". implib's call is small enough to make it feasible to do that as a thought experiment.

yugr · 2025-01-18T18:48:56Z

The patch appears to serialize the load_library call, the code seems to leave the door open for data races between calls modifying the data used by load_library(). E.g nothing stops user from calling *_tramp_reset while load_library in another thread has already set lib_handle, but didn't set do_dlclose yet. It will reset lib_handle to NULL, and we'll end up with the library never dlclosed.

Right. This PR just touches the library initialization, not Implib's APIs which remain thread-unsafe as you noticed. I will support them in separate PR as they are less critical for the users (most don't even use them).

As for the testing, thread sanitizer does a very good job catching threading issues, but it relies on comprehensive enough set of tests. I do not have much experience in creating such tests. That's part of the reason I'd rather rely on everything touching shared state being under lock so we have less risk of encountering the race conditions we didn't think of.

Right, the added test already runs under Tsan (I even had to get rid double-checked locking because of this).

One useful way to think whether the code is thread safe is "if I stop this thread here, can I break things from other threads?". implib's call is small enough to make it feasible to do that as a thought experiment.

+1

Artem-B

LGTM for the stated goal of serializing library loading only.

Artem-B · 2025-01-18T19:25:07Z

tests/thread/main.c

+  }
+
+  for (int i = 0; i < N; ++i) {
+    if (0 != pthread_create(&tids[i], 0, run, &args[i]))


The run is very short-lived. It's possible for the launched thread to run and finish before the main thread manages to launch the next one.

A more robust way to ensure a race would be to make all threads block on something locked by the main thread, and then unlock all of them at once. E.g use pthread_barrier_wait in run to wait until the last thread has been launched.

yugr added 2 commits January 18, 2025 18:12

Thread-safe initialization.

62afdd2

FreeBSD fixes.

cb534a1

yugr added the enhancement label Jan 18, 2025

yugr self-assigned this Jan 18, 2025

yugr temporarily deployed to secrets January 18, 2025 15:38 — with GitHub Actions Inactive

Update README.

1e39bf6

yugr temporarily deployed to secrets January 18, 2025 17:25 — with GitHub Actions Inactive

Artem-B reviewed Jan 18, 2025

View reviewed changes

yugr changed the title ~~Thead-safe initialization~~ Thead-safe library initialization Jan 18, 2025

Remove volatile.

b3ba7e9

yugr temporarily deployed to secrets January 18, 2025 18:44 — with GitHub Actions Inactive

Disable thread test on mipsel (can not repro fail locally).

d4b6bf5

yugr temporarily deployed to secrets January 18, 2025 19:03 — with GitHub Actions Inactive

yugr deployed to secrets January 18, 2025 19:03 — with GitHub Actions Active

Artem-B approved these changes Jan 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thead-safe library initialization #37

Thead-safe library initialization #37

yugr commented Jan 18, 2025

codecov-commenter commented Jan 18, 2025 •

edited

Loading

Artem-B left a comment

yugr commented Jan 18, 2025

Artem-B left a comment

Artem-B Jan 18, 2025

Thead-safe library initialization #37

Are you sure you want to change the base?

Thead-safe library initialization #37

Conversation

yugr commented Jan 18, 2025

codecov-commenter commented Jan 18, 2025 • edited Loading

Codecov Report

Artem-B left a comment

Choose a reason for hiding this comment

yugr commented Jan 18, 2025

Artem-B left a comment

Choose a reason for hiding this comment

Artem-B Jan 18, 2025

Choose a reason for hiding this comment

codecov-commenter commented Jan 18, 2025 •

edited

Loading