Kidney Lesion Detection In Computer Tomograph Scans: A Pre-Transformer Project

Client:

XYZ Company

Services:

Web Design

Duration:

2 Weeks

Project overview

Goal:
  • Goal: Develop a reliable kidney lesion detector, able to handle sequences of data coming from CT scans. 5 kidney lesions were included for detection. Problem: Long waiting times for the patient before receiving a diagnostic. Time consuming work for the radiologists, manually detecting the kidney lesions and omissions because of high workload.
  • Problem: Long waiting times for the patient before receiving a diagnostic. Time consuming work for the radiologists, manually detecting the kidney lesions and omissions because of high workload.

Challenges

Machine Learning and Predictive Analytics:
  • Data imbalance and having significantly less lesions in the underrepresented classes.
  • Data leakage was a topic of concern given the sequential structure of the data, therefore special attention was required when creating the train – validation – test sets and in cross validation.
  • The annotations coming from multiple sources (radiologists) had to be manipulated to meet a common ground. For example a nodule cluster could be annotated with one, large bounding box or with individual bounding boxes around each nodule.
  • Difficulty in detecting lesions on the edge of the organ.

Algorithms

Machine Learning and Predictive Analytics:
  • The problem was tackled as an object detection problem, therefore optimizing for two outputs: the coordinates of the bounding box and the correctness of the label assigned to a bounding box.
  • The approach was to use, as a backbone, a very deep neural network and to adapt its pre-trained parameters through fine-tuning such that it accommodates medical images.
  • For performance evaluation, both bounding box overlap and label matching were assessed.
  • The metrics under observation were: IoU (Intersection over Union), precision, recall, f1 score and MAP (mean average precision).
  • This project had a strong hyperparameter tuning component, requiring extensive experimentation with advanced concepts: one cycle learning rate, learning rate schedulers, optimizers, kernel size and activation functions.

Results

The solution was able to detect the lesions in under 30 seconds per patient (on a sequence of 40 CT slices), with a precision and recall of >85%, making it suitable for internal hospital use.

Impact

  • Improve the patient experience by reducing the diagnostic waiting times.
  • Streamline the radiologist’s workflow, empower novice radiologists in diagnosing and improve their ramp-up process.