Skip to content

ML & Modeling

Step 6 – ML & Modeling: training models on those features

Section titled “Step 6 – ML & Modeling: training models on those features”

Once you have a clean table of features and labels, you can finally train models. In this step you apply machine-learning or deep-learning algorithms (for example logistic regression, random forests, CNNs, or MIL models) to answer a specific question: classification, prediction, grading, risk stratification, or similar tasks.

Technical name: ML & Modeling

Train models using image patches, features, and labels to:

  • Classify regions or slides.
  • Predict outcomes.
  • Support triage or risk stratification.
  • “Can a model flag suspicious areas on a slide?”
  • “Can we predict recurrence risk from histology?”
  • “Do image features improve beyond standard clinical variables?”
  • Patch‑level classification (tumor vs normal, pattern, grade).
  • Slide‑level predictions (risk score, biomarker status, response likelihood).
  • Combine image features with clinical/genomic data.
  • Train, validate, and tune models.
  • PyTorch / TensorFlow / Keras — deep learning frameworks.
  • scikit‑learn — logistic regression, random forest, SVM.
  • MONAI — medical imaging ML framework.
  • TIAToolbox — WSI components (tiling, aggregating tile predictions).

Move from measurements and labeled examples to algorithms that generalize and make predictions on new slides.

  • ML-01: Classical + deep starts — Random Forest on tabular morphometrics (Path A) plus PyTorch ResNet transfer learning and inference (Path B) to cover both feature-based and end-to-end image modeling.