Week 1: Coding Period
This week I began with exploratory data analysis. I downloaded the RSNA Pneumonia Detection dataset. Since it is a DICOM image dataset annotated for segmentation, I discarded the segmentation annotations and kept only the classification labels.
I noticed a class imbalance in the dataset: the number of positive pneumonia cases was extremely low compared to the number of negative cases. This can be addressed with undersampling and oversampling techniques.
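As a rough sketch (not the final project code), random oversampling and random undersampling could look like the following. The function names, the toy patient IDs, and the 8-to-2 label split are made up for illustration and are not figures from the RSNA dataset.

```python
import random
from collections import Counter


def _group_by_label(samples, labels):
    """Group samples into a dict keyed by class label."""
    groups = {}
    for sample, label in zip(samples, labels):
        groups.setdefault(label, []).append(sample)
    return groups


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen samples until every class is as large as the biggest one."""
    rng = random.Random(seed)
    groups = _group_by_label(samples, labels)
    target = max(len(items) for items in groups.values())
    out_samples, out_labels = [], []
    for label, items in groups.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        out_samples.extend(items + extra)
        out_labels.extend([label] * target)
    return out_samples, out_labels


def random_undersample(samples, labels, seed=0):
    """Randomly drop samples until every class is as small as the smallest one."""
    rng = random.Random(seed)
    groups = _group_by_label(samples, labels)
    target = min(len(items) for items in groups.values())
    out_samples, out_labels = [], []
    for label, items in groups.items():
        out_samples.extend(rng.sample(items, target))
        out_labels.extend([label] * target)
    return out_samples, out_labels


# Toy data (hypothetical): 8 negative cases (0) and 2 positive cases (1).
patient_ids = [f"patient_{i}" for i in range(10)]
targets = [0] * 8 + [1] * 2

over_ids, over_targets = random_oversample(patient_ids, targets)
under_ids, under_targets = random_undersample(patient_ids, targets)
print(Counter(over_targets))
print(Counter(under_targets))
```

Oversampling keeps every image but repeats positive cases, while undersampling throws negative cases away, so the choice depends on how much data one can afford to lose. Resampling should be applied to the training split only, so that the validation data keeps the original class distribution.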
Because the dataset is huge, with over 26k patients, my system occasionally ran out of memory while preprocessing it. I moved my code to Colab but ran into similar errors. To tackle this, I will train with mini-batches so that only a small part of the dataset has to be in memory at a time, which should make the training process more efficient.
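The mini-batch idea can be sketched with a plain Python generator that loads only one batch of images at a time. This is a minimal illustration under assumptions: `iter_minibatches`, the `load_fn` parameter, the file names, and the stand-in loader are all hypothetical, and a real pipeline would pass a function that actually reads a DICOM file.

```python
def iter_minibatches(paths, batch_size, load_fn):
    """Yield lists of loaded items, reading at most `batch_size` files per step."""
    if batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    for start in range(0, len(paths), batch_size):
        # Only this slice is loaded, so memory use is bounded by the batch size.
        yield [load_fn(path) for path in paths[start:start + batch_size]]


def fake_load(path):
    """Stand-in loader (hypothetical); a real one would read and decode the image."""
    return {"path": path, "pixels": None}


# Hypothetical file names, used only to show the batching behaviour.
image_paths = [f"image_{i}.dcm" for i in range(10)]

for batch in iter_minibatches(image_paths, batch_size=4, load_fn=fake_load):
    print(len(batch), [item["path"] for item in batch])
```

Because the generator is lazy, each batch can be garbage-collected after its training step, which is what keeps memory use flat regardless of how many patients the dataset contains.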
By the end of this weekend, I will finish training one model on this dataset. Next week, I will train the rest of the models.