Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification

Amruta Parulekar, Utkarsh Kanwat, Ravi Gupta, Medha Chippa, Thomas Jacob, Tripti Bameta, Swapnil Rane, Amit Sethi

2024

Abstract

Segmentation and classification of cell nuclei using deep neural networks (DNNs) can save pathologists’ time for diagnosing various diseases, including cancers. The accuracy of DNNs increases with the sizes of annotated datasets available for training. The available public datasets with nuclear annotations and labels differ in their class label sets. We propose a method to train DNNs on multiple datasets where the set of classes across the datasets are related but not the same. Our method is designed to utilize class hierarchies, where the set of classes in a dataset can be at any level of the hierarchy. Our results demonstrate that segmentation and classification metrics for the class set used by the test split of a dataset can improve by pre-training on another dataset that may even have a different set of classes due to the expansion of the training set enabled by our method. Furthermore, generalization to previously unseen datasets also improves by combining multiple other datasets with different sets of classes for training. The improvement is both qualitative and quantitative. The proposed method can be adapted for various loss functions, DNN architectures, and application domains.

Download


Paper Citation


in Harvard Style

Parulekar A., Kanwat U., Gupta R., Chippa M., Jacob T., Bameta T., Rane S. and Sethi A. (2024). Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification. In Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOIMAGING; ISBN 978-989-758-688-0, SciTePress, pages 281-288. DOI: 10.5220/0012380800003657


in Bibtex Style

@conference{bioimaging24,
author={Amruta Parulekar and Utkarsh Kanwat and Ravi Gupta and Medha Chippa and Thomas Jacob and Tripti Bameta and Swapnil Rane and Amit Sethi},
title={Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification},
booktitle={Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOIMAGING},
year={2024},
pages={281-288},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012380800003657},
isbn={978-989-758-688-0},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 1: BIOIMAGING
TI - Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification
SN - 978-989-758-688-0
AU - Parulekar A.
AU - Kanwat U.
AU - Gupta R.
AU - Chippa M.
AU - Jacob T.
AU - Bameta T.
AU - Rane S.
AU - Sethi A.
PY - 2024
SP - 281
EP - 288
DO - 10.5220/0012380800003657
PB - SciTePress