Test results different between logging in test_step and logging in test_epoch_end #10517
-
Why do these two test codes result in different test results (both average acc and average loss)?

```python
# Variant 1: log per-step values and let Lightning aggregate over the epoch.
def test_step(self, batch, batch_idx):
    input_ids, labels = batch
    outs = self(input_ids)
    loss = self.loss_fn(outs, labels)
    acc = self.acc_fn(outs, labels)
    self.log_dict({'test_loss': loss, 'test_acc': acc}, on_step=False, on_epoch=True, logger=False)
    return loss, acc
```

```python
# Variant 2: collect step outputs and average them manually in test_epoch_end.
def test_step(self, batch, batch_idx):
    input_ids, labels = batch
    outs = self(input_ids)
    loss = self.loss_fn(outs, labels)
    acc = self.acc_fn(outs, labels)
    return loss, acc

def test_epoch_end(self, step_outputs):
    avg_loss = torch.stack([x[0] for x in step_outputs]).mean()
    avg_acc = torch.stack([x[1] for x in step_outputs]).mean()
    self.log_dict({'test_loss': avg_loss, 'test_acc': avg_acc}, logger=False)
```

I only use one GPU for testing.
Replies: 2 comments
-
It seems to be because the last batch may have a different number of samples.
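To make that concrete, here is a small self-contained sketch (with made-up accuracy values and batch sizes) showing how a plain mean over per-batch accuracies drifts away from the sample-weighted mean when the last batch is smaller:

```python
import torch

# Hypothetical per-batch accuracies: three full batches of 32 samples
# and one final partial batch of 8 samples.
batch_accs = torch.tensor([0.90, 0.80, 0.85, 0.50])
batch_sizes = torch.tensor([32., 32., 32., 8.])

# Plain mean over batches, as in the manual test_epoch_end variant above.
plain_mean = batch_accs.mean()                                         # 0.7625

# Sample-weighted mean, i.e. the true per-sample accuracy.
weighted_mean = (batch_accs * batch_sizes).sum() / batch_sizes.sum()   # ~0.8231

print(plain_mean.item(), weighted_mean.item())
```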
-
Just in case anyone else sees this discussion, adding more context to @Bowen-n's answer: within Lightning, a weighted average is used to accumulate the results at the end of the epoch, where the weights are the batch size of each batch seen inside `test_step`.
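If you want the manual `test_epoch_end` variant to reproduce that behaviour, one option is to carry the batch size through the step outputs and do the weighting yourself. This is only a sketch, assuming the batch size can be read from `labels.size(0)`:

```python
import torch

def test_step(self, batch, batch_idx):
    input_ids, labels = batch
    outs = self(input_ids)
    loss = self.loss_fn(outs, labels)
    acc = self.acc_fn(outs, labels)
    # Also return the batch size so the epoch-end hook can weight correctly.
    return loss, acc, labels.size(0)

def test_epoch_end(self, step_outputs):
    losses = torch.stack([x[0] for x in step_outputs])
    accs = torch.stack([x[1] for x in step_outputs])
    sizes = torch.tensor([x[2] for x in step_outputs],
                         dtype=torch.float, device=losses.device)
    # Weight each batch by its number of samples, mirroring the weighted
    # average that on_epoch=True logging applies inside test_step.
    avg_loss = (losses * sizes).sum() / sizes.sum()
    avg_acc = (accs * sizes).sum() / sizes.sum()
    self.log_dict({'test_loss': avg_loss, 'test_acc': avg_acc}, logger=False)
```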