Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/reproducible #1446

Open
wants to merge 40 commits into
base: develop
Choose a base branch
from
Open

Feature/reproducible #1446

wants to merge 40 commits into from

Commits on Jul 18, 2023

  1. Configuration menu
    Copy the full SHA
    ccb1c73 View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2023

  1. Configuration menu
    Copy the full SHA
    2049be6 View commit details
    Browse the repository at this point in the history
  2. Remove use of zero function and fix caxpyxmazMR functor for when the …

    …reduction type is changed
    maddyscientist committed Jul 19, 2023
    Configuration menu
    Copy the full SHA
    81566c8 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    ce5d396 View commit details
    Browse the repository at this point in the history
  4. Add doubledouble support for host, add complex-number support, remove…

    … legacy zero helper function
    maddyscientist committed Jul 19, 2023
    Configuration menu
    Copy the full SHA
    7a4e04f View commit details
    Browse the repository at this point in the history

Commits on Jul 20, 2023

  1. Configuration menu
    Copy the full SHA
    2d67d97 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    feccf89 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d70303a View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    4a7061a View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    7e40280 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    2a80b2f View commit details
    Browse the repository at this point in the history
  7. Commence the slog that generizes the host-side scalar precision: intr…

    …oduce new CMake set types: real_t (QUDA_SCALAR_TYPE) - the host side scalar precision, complex_t the complex version of this (replaces Complex), device_reduce_t (QUDA_REDUCTION_TYPE). Eventually we will be able to set these to non-double types, but we're there yet....
    maddyscientist committed Jul 20, 2023
    Configuration menu
    Copy the full SHA
    e9089e1 View commit details
    Browse the repository at this point in the history

Commits on Jul 21, 2023

  1. Configuration menu
    Copy the full SHA
    a4e8f76 View commit details
    Browse the repository at this point in the history

Commits on Jul 22, 2023

  1. Configuration menu
    Copy the full SHA
    a7cc5f7 View commit details
    Browse the repository at this point in the history
  2. Updated ReduceArg::complete function to work when real_t and device_r…

    …educe_t are different types, e.g., double vs doubledouble
    maddyscientist committed Jul 22, 2023
    Configuration menu
    Copy the full SHA
    008c632 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    dc62b01 View commit details
    Browse the repository at this point in the history

Commits on Jul 23, 2023

  1. Fix some issues

    maddyscientist committed Jul 23, 2023
    Configuration menu
    Copy the full SHA
    3324b05 View commit details
    Browse the repository at this point in the history

Commits on Jul 25, 2023

  1. Add missing cast operator to deviation_t::operator= when copying from…

    … a different type (needed when copying from deviation_t<doubledouble> to deviation_t<double> for example
    maddyscientist committed Jul 25, 2023
    Configuration menu
    Copy the full SHA
    a16ff6c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2b5bac8 View commit details
    Browse the repository at this point in the history
  3. Update CUDA block_reduce_helper.h atomic types to work with doubledou…

    …ble (need to split into 64-bit words) and small generic cleanup
    maddyscientist committed Jul 25, 2023
    Configuration menu
    Copy the full SHA
    9d69abd View commit details
    Browse the repository at this point in the history
  4. Add support for doubledouble heterogeneous-atomic reductions. This al…

    …so updates the coalesced writing to sysmem to work with large reduce_t types, such that sizeof(device_reduce_t) / sizeof(atomic_type<device_reduce_t>) > warp_size, which previously was a restriction: we now use a warp-stride loop to do a coalesced write to sysmem
    maddyscientist committed Jul 25, 2023
    Configuration menu
    Copy the full SHA
    7e21a5b View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    d5f914d View commit details
    Browse the repository at this point in the history
  6. Add initial support for multi-process doubledouble reductions: only Q…

    …MP at present and just a simple gather method for now
    maddyscientist committed Jul 25, 2023
    Configuration menu
    Copy the full SHA
    1a73132 View commit details
    Browse the repository at this point in the history
  7. Multi-process reduction now uses device_reduce_t with the conversion …

    …to real_t done after the multi-process reduction
    maddyscientist committed Jul 25, 2023
    Configuration menu
    Copy the full SHA
    d76e57c View commit details
    Browse the repository at this point in the history
  8. Updates for blas_test: use same basis for host and device to allow fo…

    …r direct comparisons, use max error not error sum when multiple norms are used to check correctness, print out the deviation when verbosity >= QUDA_VERBOSE
    maddyscientist committed Jul 25, 2023
    Configuration menu
    Copy the full SHA
    27ba8de View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2023

  1. Minor comment clean up

    maddyscientist committed Jul 26, 2023
    Configuration menu
    Copy the full SHA
    4b5aa52 View commit details
    Browse the repository at this point in the history

Commits on Jul 28, 2023

  1. Configuration menu
    Copy the full SHA
    bcde6ad View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2ee73a6 View commit details
    Browse the repository at this point in the history

Commits on Aug 15, 2023

  1. Initial version of reproduction reductions, fully works but a few lim…

    …itations representing this being WIP (bin bounds LUT repeatadly recomputed on the host, bin bounds LUT presently in explicit constant, CG reduction not supported, warp reductions rather register heavy, etc.)
    maddyscientist committed Aug 15, 2023
    Configuration menu
    Copy the full SHA
    9789820 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    67514d0 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d455000 View commit details
    Browse the repository at this point in the history
  4. Fix compiler warning

    maddyscientist committed Aug 15, 2023
    Configuration menu
    Copy the full SHA
    030836d View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    08b9776 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    64ed607 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    ba96720 View commit details
    Browse the repository at this point in the history
  8. Fix gauge_path_test loop trace test when using doubledouble reduction…

    … variables (remove bad cast)
    maddyscientist committed Aug 15, 2023
    Configuration menu
    Copy the full SHA
    b7687b4 View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2023

  1. Rework of reproducible reductions to pre-compute the bins when initia…

    …lizing the reduction. All tests passing, but seemingly break when a SANITIZE build is used
    maddyscientist committed Aug 23, 2023
    Configuration menu
    Copy the full SHA
    bc74e7b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6a60bc3 View commit details
    Browse the repository at this point in the history
  3. Fix compiler warning

    maddyscientist committed Aug 23, 2023
    Configuration menu
    Copy the full SHA
    a8085dc View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    a413153 View commit details
    Browse the repository at this point in the history