https://github.com/itayhubara/CalibTIP/blob/69077c92611b079234706784c344e8c9156f3283/main.py#L481
The `[0]` on this line indexes into the first batch only.
Isn't sequential AdaQuant supposed to update the input cache of all batches to the quantized values?
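
To make the concern concrete, here is a minimal sketch of the two behaviours. It assumes the cache is a list with one entry per calibration batch; the names `cached_inputs`, `quantized_layer`, `refresh_first_batch_only` and `refresh_all_batches` are hypothetical and not taken from `main.py`.

```python
# Hypothetical stand-ins; none of these names come from CalibTIP's main.py.
def quantized_layer(batch):
    """Toy 'quantized' layer: rounds each activation to the nearest integer."""
    return [float(round(x)) for x in batch]


def refresh_first_batch_only(cached_inputs):
    """Overwrite only entry 0, which is what a bare [0] index would do."""
    updated = list(cached_inputs)
    updated[0] = quantized_layer(cached_inputs[0])
    return updated


def refresh_all_batches(cached_inputs):
    """Overwrite every cached batch with the quantized layer's output."""
    return [quantized_layer(batch) for batch in cached_inputs]


# Three calibration batches of full-precision activations (made-up values).
cached_inputs = [[0.2, 1.7], [2.4, 3.6], [4.1, 5.9]]
first_only = refresh_first_batch_only(cached_inputs)
all_batches = refresh_all_batches(cached_inputs)
```

In the first variant, batches 1 and 2 still hold full-precision activations, so the next layer would be calibrated on a mix of quantized and unquantized inputs. Whether the linked line actually behaves this way depends on how the cache is structured at that point.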