Question regarding discrete model prediction layer activation function and model loss function

For the case of the discrete model, specifically the model definition in the file kdd99_model.py; why is the prediction layer activation function sigmoid and not softmax as the KDD99 problem is a multi-class classification problem?

https://github.com/tensorflow/tcav/blob/218a4cddc4eb8d76f37b7e12b70bdb44dc1ec8f1/tcav/tcav_examples/discrete/kdd99_model.py#L82

Also, why is the from_logits parameter set to True in the SparseCategoricalCrossentropy loss function, if the prediction layer of the model already has a sigmoid activation function?

https://github.com/tensorflow/tcav/blob/218a4cddc4eb8d76f37b7e12b70bdb44dc1ec8f1/tcav/tcav_examples/discrete/kdd99_model.py#L85-L87

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question regarding discrete model prediction layer activation function and model loss function #121

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	model_full.compile(loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True), \
	metrics=['accuracy'],
	optimizer='adam')

Question regarding discrete model prediction layer activation function and model loss function #121

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions