Commit b39aa3b

minor upd
Signed-off-by: Siyuan Fu <[email protected]>
1 parent f79b5b0 commit b39aa3b

1 file changed: +0 -6 lines changed

vllm/v1/attention/backends/mla/common.py

Lines changed: 0 additions & 6 deletions
@@ -2043,16 +2043,10 @@ def forward(
     )
     decode_q0[..., : ql_nope_shape[2]].copy_(decode_ql_nope)
     decode_q0[..., ql_nope_shape[2] :].copy_(decode_q_pe)
-    decode_q = torch.empty(
-        decode_q_shape,
-        device=decode_ql_nope.device,
-        dtype=torch.float8_e4m3fn,
-    )
 
     decode_q, _ = ops.scaled_fp8_quant(
         decode_q0.view(decode_q_shape[0], -1),
         layer._q_scale,
-        output=decode_q.view(decode_q_shape[0], -1),
     )
     decode_q = decode_q.view(decode_q_shape)
 else:
