-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
crypt format occasionally fails a --test-full=0 --format=cpu
run on Ubuntu 22 powerpc64le
#5491
Comments
Oh wow, could be a thread-safety bug in libxcrypt. What do you mean by "Canonical hardware"? |
The owner is Canonical (Ubuntu company). It might be the first time I've seen this, but it might not be either. |
Does this occur during snap package build? Does the build fail when this happens? I suppose we have no easy way to try and trigger the bug on its own (e.g., running just this one test)? Maybe we should try on Compile Farm's hardware. |
Yes
Only if I want it to fail.
It's probably random, so it will be extremely difficult to get more information about it. |
Reviewed the code in libxcrypt and our |
It wouldn't be needed this time, but in general I wonder if we want and can easily add libxcrypt version in here, but somehow only when we know we're linking against libxcrypt (tricky, since we don't do that explicitly - we just do |
--test-full=0 --format=cpu_
run--test-full=0 --format=cpu
run on Ubuntu 22 powerpc64le
I can't reproduce this on cfarm29 (Raptor Blackbird ppc64le POWER9 Debian 12.5 bookworm 6.1.0-21-powerpc64le) running: while :; do OMP_NUM_THREADS=4 ../run/john --test-full=0 --format=crypt || break; done
|
On the same system, I also couldn't reproduce this with: for n in `seq 0 99`; do time OMP_NUM_THREADS=4 ../run/john --test-full=0 -form=cpu || break; done which took over 10 hours. |
Still couldn't reproduce with 32, 33, 333 threads (this system has 32 hardware threads). However, after a while I got this:
which could be a bug in our format, in OpenSSL, in the OpenMP implementation, in the kernel, or below. |
The
and on another occasion:
It usually fails on On this system, it auto-tunes to OMP_SCALE 2, which isn't something we commonly test on x86_64 (where we have pre-tuned OMP_SCALE of 1 for this format). |
Looks like a red herring. After a little while, I also got it to fail here with
|
I took further comments on the gpg issue to #3543 as we already had that issue opened and its cause is quite likely separate from what Claudio observed with the crypt format. |
If anyone wants to proceed to debug this further, a next step could be to identify and take Ubuntu's exact libxcrypt version and binary package where the issue was triggered and experiment with that on cfarm29. I think the system is similar enough that the same libxcrypt binary could be loaded there via LD_LIBRARY_PATH. @claudioandre-br maybe you? |
This is Ubuntu 22, Canonical's hardware. Nothing related changed in
john
itself recently.This is probably a hard to reproduce issue.
In all cases, documenting. Well, "a close look" at the format can always make it better.
As I recall, crypt is a format that I have seen fail in some context(s).
Full log at https://launchpadlibrarian.net/733306772/buildlog_snap_ubuntu_jammy_ppc64el_john-the-ripper_BUILDING.txt.gz
The text was updated successfully, but these errors were encountered: