Week 1 : Coding Period

This week I began with Exploratory Data Analysis. I downloaded the RSNA-Pneumonia Detection Dataset. Being a dataset of DICOM image segmentation, I purged all the information regarding segmentation and kept only classification information.

I noticed a dataset imbalance in the images as the number of positive pneumonia cases were extremely low compared to the negative cases. This problem gets solved using undersampling and oversampling techniques.

Being a huge dataset of over 26k+ patients, my system occasionally ran out of memory while preprocessing it. I shifted my code to Colab and similar errors were encountered. To tackle this, I will use Mini Batches for training so that the training process becomes more efficient.

By then end of this weekend, I will complete training a model on this dataset. Next week, I will train the rest of the models.

Comments

Popular posts from this blog

GSoC 2020 with LibreHealth : Final Report

Week 1 : Acceptance and Community Bonding

Week 2 : Coding Period