@joyalbin

Fix to support different data types for the LinearCombination
output and compute.
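
For readers unfamiliar with the epilogue, a minimal sketch of what "different data types for output and compute" means may help. This is a hypothetical stand-in, not the CUTLASS `LinearCombination` class: the name `LinearCombinationSketch`, its template parameters, and the helper `example_linear_combination` are assumptions made for illustration. The idea is that `D = alpha * acc + beta * C` is evaluated in the compute type and converted to the output type only once, at the end.

```cpp
#include <cassert>

// Hypothetical stand-in for an epilogue functor; NOT the CUTLASS class.
// ElementCompute is the type the arithmetic runs in, ElementOutput is the
// type that D is stored as. The two are allowed to differ.
template <class ElementOutput,
          class ElementCompute,
          class ElementAccumulator = ElementCompute,
          class ElementSource = ElementOutput>
struct LinearCombinationSketch {
  ElementCompute alpha;
  ElementCompute beta;

  // D = alpha * acc + beta * C, evaluated in ElementCompute and
  // converted to ElementOutput as the final step.
  ElementOutput operator()(ElementAccumulator acc, ElementSource c) const {
    ElementCompute result = alpha * static_cast<ElementCompute>(acc) +
                            beta * static_cast<ElementCompute>(c);
    return static_cast<ElementOutput>(result);
  }
};

// Illustrative usage: compute in double, store as float.
inline float example_linear_combination() {
  LinearCombinationSketch<float, double> op{2.0, 0.5};
  float d = op(3.0, 4.0f);
  assert(d == 8.0f);  // 2.0 * 3.0 + 0.5 * 4.0
  return d;
}
```

In the PR's setting the output would presumably be F16 or F32 and the compute type F32; `float` and `double` are used here only because standard C++ has no portable half type.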
@joyalbin joyalbin marked this pull request as draft October 13, 2025 07:59
@Antonyvance Antonyvance added this to the 0.6 milestone Oct 17, 2025
@Antonyvance Antonyvance added the bug (Something isn't working) and urgent (PR requires urgent attention, for release or blocking another PR) labels Oct 17, 2025

@jiyang1011 jiyang1011 left a comment

LGTM

@joyalbin joyalbin changed the title from "[WIP] Support different dtype for LinearCombination" to "Support different dtype for LinearCombination" Oct 22, 2025
@tdeng5 tdeng5 requested a review from taozha2 October 23, 2025 01:30
  typename TiledMMAHelper<MMA_Atom<XE_8x16x16_F32F16F16F32_TT>, Layout<Shape<_32, _128, _32>>,
  Layout<Shape<_1, _4, _1>, Stride<_4, _1, _0>>>::TiledMMA,
- XE_2D_U16x32x32_LD_N, XE_2D_U4x32x16_LD_T, XE_2D_U16x8x16_ST_N,
+ XE_2D_U16x32x32_LD_N, XE_2D_U4x32x16_LD_T, XE_2D_U32x8x16_ST_N,

  1. I think it's better to add copy_traits for both C and D so they can use different data types.
    XE_2D_U16x8x16_ST_N here means storing D as F16, so you shouldn't change it. If you want to store
    D as F32, you should add a new GemmConfiguration for it.
  2. Can you update the data type name "FP16U4FP16F16FP16S4" to match your changes?
    The data types in the name "FP16U4FP16F16FP16S4" are, in order: A, B, C, Mma, Scale, Zero.
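
To make point 1 concrete, here is a minimal sketch of the constraint being described, assuming the `U<bits>` prefix in a store atom name denotes the element bit width (consistent with the statement above that `XE_2D_U16x8x16_ST_N` stores D as F16). `StoreAtomSketch`, `Half16`, and `store_width_matches` are hypothetical names for illustration only and are not part of CUTLASS.

```cpp
#include <climits>
#include <cstdint>

// Hypothetical tag mirroring the "U<bits>" prefix of a store atom name.
template <int Bits>
struct StoreAtomSketch {
  static constexpr int element_bits = Bits;
};

// Stand-in for a 16-bit float, used only for its size.
struct Half16 {
  std::uint16_t storage;
};

// True when the store atom's element width equals the width of the D element type.
template <class Atom, class Element>
constexpr bool store_width_matches() {
  return Atom::element_bits == static_cast<int>(sizeof(Element) * CHAR_BIT);
}

// A 16-bit store pairs with a 16-bit D, and a 32-bit store with a 32-bit D.
static_assert(store_width_matches<StoreAtomSketch<16>, Half16>(),
              "16-bit store atom matches a 16-bit D element");
static_assert(store_width_matches<StoreAtomSketch<32>, float>(),
              "32-bit store atom matches a 32-bit D element");
// Swapping only the store atom while D stays 16-bit would be a mismatch.
static_assert(!store_width_matches<StoreAtomSketch<32>, Half16>(),
              "32-bit store atom does not match a 16-bit D element");
```

Under that assumption, keeping the F16 configuration untouched and adding a separate configuration for F32 output keeps each store atom paired with the D type it was written for.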

using CollectiveEpilogue = cutlass::epilogue::collective::CollectiveEpilogue<
EpilogueDispatchPolicy,
TileShape,
ElementAccumulator,

This should be ElementC, right?

5 participants