-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature/reproducible #1446
base: develop
Are you sure you want to change the base?
Feature/reproducible #1446
Commits on Jul 18, 2023
-
Configuration menu - View commit details
-
Copy full SHA for ccb1c73 - Browse repository at this point
Copy the full SHA ccb1c73View commit details
Commits on Jul 19, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2049be6 - Browse repository at this point
Copy the full SHA 2049be6View commit details -
Remove use of zero function and fix caxpyxmazMR functor for when the …
…reduction type is changed
Configuration menu - View commit details
-
Copy full SHA for 81566c8 - Browse repository at this point
Copy the full SHA 81566c8View commit details -
Configuration menu - View commit details
-
Copy full SHA for ce5d396 - Browse repository at this point
Copy the full SHA ce5d396View commit details -
Add doubledouble support for host, add complex-number support, remove…
… legacy zero helper function
Configuration menu - View commit details
-
Copy full SHA for 7a4e04f - Browse repository at this point
Copy the full SHA 7a4e04fView commit details
Commits on Jul 20, 2023
-
Modify reduction kernels to use device_reduce_t and not double for in…
…ternal computation
Configuration menu - View commit details
-
Copy full SHA for 2d67d97 - Browse repository at this point
Copy the full SHA 2d67d97View commit details -
Configuration menu - View commit details
-
Copy full SHA for feccf89 - Browse repository at this point
Copy the full SHA feccf89View commit details -
Configuration menu - View commit details
-
Copy full SHA for d70303a - Browse repository at this point
Copy the full SHA d70303aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a7061a - Browse repository at this point
Copy the full SHA 4a7061aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7e40280 - Browse repository at this point
Copy the full SHA 7e40280View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2a80b2f - Browse repository at this point
Copy the full SHA 2a80b2fView commit details -
Commence the slog that generizes the host-side scalar precision: intr…
…oduce new CMake set types: real_t (QUDA_SCALAR_TYPE) - the host side scalar precision, complex_t the complex version of this (replaces Complex), device_reduce_t (QUDA_REDUCTION_TYPE). Eventually we will be able to set these to non-double types, but we're there yet....
Configuration menu - View commit details
-
Copy full SHA for e9089e1 - Browse repository at this point
Copy the full SHA e9089e1View commit details
Commits on Jul 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for a4e8f76 - Browse repository at this point
Copy the full SHA a4e8f76View commit details
Commits on Jul 22, 2023
-
Configuration menu - View commit details
-
Copy full SHA for a7cc5f7 - Browse repository at this point
Copy the full SHA a7cc5f7View commit details -
Updated ReduceArg::complete function to work when real_t and device_r…
…educe_t are different types, e.g., double vs doubledouble
Configuration menu - View commit details
-
Copy full SHA for 008c632 - Browse repository at this point
Copy the full SHA 008c632View commit details -
Configuration menu - View commit details
-
Copy full SHA for dc62b01 - Browse repository at this point
Copy the full SHA dc62b01View commit details
Commits on Jul 23, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 3324b05 - Browse repository at this point
Copy the full SHA 3324b05View commit details
Commits on Jul 25, 2023
-
Add missing cast operator to deviation_t::operator= when copying from…
… a different type (needed when copying from deviation_t<doubledouble> to deviation_t<double> for example
Configuration menu - View commit details
-
Copy full SHA for a16ff6c - Browse repository at this point
Copy the full SHA a16ff6cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b5bac8 - Browse repository at this point
Copy the full SHA 2b5bac8View commit details -
Update CUDA block_reduce_helper.h atomic types to work with doubledou…
…ble (need to split into 64-bit words) and small generic cleanup
Configuration menu - View commit details
-
Copy full SHA for 9d69abd - Browse repository at this point
Copy the full SHA 9d69abdView commit details -
Add support for doubledouble heterogeneous-atomic reductions. This al…
…so updates the coalesced writing to sysmem to work with large reduce_t types, such that sizeof(device_reduce_t) / sizeof(atomic_type<device_reduce_t>) > warp_size, which previously was a restriction: we now use a warp-stride loop to do a coalesced write to sysmem
Configuration menu - View commit details
-
Copy full SHA for 7e21a5b - Browse repository at this point
Copy the full SHA 7e21a5bView commit details -
Configuration menu - View commit details
-
Copy full SHA for d5f914d - Browse repository at this point
Copy the full SHA d5f914dView commit details -
Add initial support for multi-process doubledouble reductions: only Q…
…MP at present and just a simple gather method for now
Configuration menu - View commit details
-
Copy full SHA for 1a73132 - Browse repository at this point
Copy the full SHA 1a73132View commit details -
Multi-process reduction now uses device_reduce_t with the conversion …
…to real_t done after the multi-process reduction
Configuration menu - View commit details
-
Copy full SHA for d76e57c - Browse repository at this point
Copy the full SHA d76e57cView commit details -
Updates for blas_test: use same basis for host and device to allow fo…
…r direct comparisons, use max error not error sum when multiple norms are used to check correctness, print out the deviation when verbosity >= QUDA_VERBOSE
Configuration menu - View commit details
-
Copy full SHA for 27ba8de - Browse repository at this point
Copy the full SHA 27ba8deView commit details
Commits on Jul 26, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 4b5aa52 - Browse repository at this point
Copy the full SHA 4b5aa52View commit details
Commits on Jul 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for bcde6ad - Browse repository at this point
Copy the full SHA bcde6adView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2ee73a6 - Browse repository at this point
Copy the full SHA 2ee73a6View commit details
Commits on Aug 15, 2023
-
Initial version of reproduction reductions, fully works but a few lim…
…itations representing this being WIP (bin bounds LUT repeatadly recomputed on the host, bin bounds LUT presently in explicit constant, CG reduction not supported, warp reductions rather register heavy, etc.)
Configuration menu - View commit details
-
Copy full SHA for 9789820 - Browse repository at this point
Copy the full SHA 9789820View commit details -
Merge branch 'feature/gaugefield_unity' of github.com:lattice/quda in…
…to feature/reproducible
Configuration menu - View commit details
-
Copy full SHA for 67514d0 - Browse repository at this point
Copy the full SHA 67514d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for d455000 - Browse repository at this point
Copy the full SHA d455000View commit details -
Configuration menu - View commit details
-
Copy full SHA for 030836d - Browse repository at this point
Copy the full SHA 030836dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 08b9776 - Browse repository at this point
Copy the full SHA 08b9776View commit details -
Configuration menu - View commit details
-
Copy full SHA for 64ed607 - Browse repository at this point
Copy the full SHA 64ed607View commit details -
Configuration menu - View commit details
-
Copy full SHA for ba96720 - Browse repository at this point
Copy the full SHA ba96720View commit details -
Fix gauge_path_test loop trace test when using doubledouble reduction…
… variables (remove bad cast)
Configuration menu - View commit details
-
Copy full SHA for b7687b4 - Browse repository at this point
Copy the full SHA b7687b4View commit details
Commits on Aug 23, 2023
-
Rework of reproducible reductions to pre-compute the bins when initia…
…lizing the reduction. All tests passing, but seemingly break when a SANITIZE build is used
Configuration menu - View commit details
-
Copy full SHA for bc74e7b - Browse repository at this point
Copy the full SHA bc74e7bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6a60bc3 - Browse repository at this point
Copy the full SHA 6a60bc3View commit details -
Configuration menu - View commit details
-
Copy full SHA for a8085dc - Browse repository at this point
Copy the full SHA a8085dcView commit details -
Merge branch 'feature/gaugefield_unity' of github.com:lattice/quda in…
…to feature/reproducible
Configuration menu - View commit details
-
Copy full SHA for a413153 - Browse repository at this point
Copy the full SHA a413153View commit details