Draft: SSL-mean teacher on segmentation
Implement the mean teacher model on retinal image segmentation.
assigned to @andre.anjos
added 1 commit
- b9e850cf - add dataloader when labeled data is too large for batchsize, fix some bugs
added 1 commit
- 1132e95d - found loss to be Nan during training, will check loss function later
added 1 commit
- c519660e - Add test data of labeled dataset to be valid data
Here is the result. I will increase the learning rate and train for more epochs. mt-result.pdf
added 1 commit
- 4ed19a8e - add plot of consistency and segmentation loss
@andre.anjos, @mguenther, attached is the result of a new experiment. The result is not very good. I think one reason is that the validation set is the original test set of the labeled dataset, so model selection picks the checkpoint with the lowest loss on the labeled dataset, which does not achieve the best performance on the test data. If we want to add images from the unlabeled dataset to the validation set (for example, the HRF dataset), the original HRF training set has only 15 images, so carving out a validation subset would leave too few training images. If we instead select from the test set, it may bias the comparison between the supervised and semi-supervised models. Looking forward to your advice! comparison.rst trainlog.pdf
I changed the validation set to half of the original unlabeled test set, and used the other half as the test set. I also increased the adjust_contrast parameter. The result is better now, but I am not sure whether it is valid to use the validation and test sets this way. comparison.rst trainlog.pdf
Edited by Tan Xiao
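The exact splitting code is not in this thread; below is a minimal sketch of the half/half split described above, assuming the held-out set is a list of image filenames (the `hrf_*.png` names are hypothetical). The fixed seed keeps the split reproducible, so the checkpoint selected on the validation half is always evaluated on the same unseen test half.

```python
import random

def split_holdout(filenames, seed=42):
    """Split a held-out set into disjoint validation and test halves.

    Sorting before shuffling with a fixed seed makes the split
    deterministic across runs and machines.
    """
    files = sorted(filenames)
    rng = random.Random(seed)
    rng.shuffle(files)
    mid = len(files) // 2
    return files[:mid], files[mid:]  # (validation half, test half)

# Example with hypothetical HRF-style filenames:
val_set, test_set = split_holdout([f"hrf_{i:02d}.png" for i in range(30)])
```

One caveat of this scheme (worth flagging for the reviewers): the effective test set shrinks by half, so the final numbers are noisier than those reported against the full original test set.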
@Txiao: I must confess I don't have the details in my head regarding which loss goes where in this SSL setup. Can you attach a diagram indicating where each loss component is taken from? That would help in evaluating why it doesn't work as expected.
I hope this image helps. The labeled data has a combined loss, which is the sum of the segmentation loss and a weighted consistency loss. The unlabeled data has only the consistency loss. The validation loss is a segmentation loss computed on predictions from the teacher model (which is our final model), with non-augmented input. I compared setting the validation set to the labeled data versus the unlabeled data: the unlabeled set gives better performance, 0.645, but is still worse than the baseline.
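To make the loss wiring above concrete, here is a minimal numeric sketch of the mean teacher scheme as described: an EMA update for the teacher weights, an MSE consistency term between student and teacher predictions, and the combined labeled loss. This is an illustration, not the actual training code from this MR; the function names, the MSE choice of consistency loss, and the `alpha` value are assumptions.

```python
import numpy as np

def ema_update(teacher_w, student_w, alpha=0.999):
    """Teacher weights are an exponential moving average of the student's."""
    return alpha * teacher_w + (1.0 - alpha) * student_w

def consistency_loss(student_pred, teacher_pred):
    """Mean squared error between student and teacher predictions
    (one common choice of consistency cost; assumed here)."""
    return float(np.mean((student_pred - teacher_pred) ** 2))

def combined_loss(seg_loss, cons_loss, weight):
    """Labeled batches: segmentation loss plus weighted consistency loss.
    Unlabeled batches contribute only the weighted consistency term."""
    return seg_loss + weight * cons_loss
```

Note that because the consistency weight multiplies a term that can be large early in training, ramping `weight` up from zero is a common way to avoid the NaN losses mentioned in commit 1132e95d; whether that was the actual cause here is not confirmed by this thread.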
Edited by Tan Xiao
added 2 commits