Project Log 1
Defining HistoHelper AI
The Genesis
Every good tool starts with a problem that is too annoying to ignore. In clinical pathology, the problem is volume and visual fatigue. Pathologists are tasked with scanning gigapixel SVS (Whole Slide Images) to find microscopic anomalies. It is finding a needle in a digital haystack, and visual fatigue is a real threat to diagnostic accuracy.
The Problem
Beyond basic anomaly detection, there is a massive, unanswered question in oncology known as Organ Tropism. My specific research focus is this: Can an AI model differentiate between a Melanoma that metastasised to the Lung vs. one that metastasised to the Brain based solely on the morphology of the Primary Skin Lesion? Currently, predicting where a cancer will spread is largely reactive. If we can find the morphological “seeds” of organ-specific spread at the primary biopsy stage, we change the game.
The Solution: HistoHelper
HistoHelper is being built as a digital “second pair of eyes” for the lab. It is not designed to replace a pathologist. It is designed to flag subtle inconsistencies and highlight regions of interest, allowing the human expert to focus their energy where it matters most.
The Proposed Tech Stack
To handle gigapixel images and complex neural networks, the stack needs to be robust:
- Image Processing: Python with OpenSlide and OpenCV for tiling and normalising the massive SVS files.
- Machine Learning: PyTorch for building and training the computer vision models (likely starting with a ResNet or Vision Transformer architecture).
- Data Pipeline: A robust ETL process to handle image annotations and metadata.
- Frontend/Viewer: A lightweight web interface to display the SVS files with AI-generated heatmaps overlaid.
Roadmap to MVP
Getting to a working prototype requires a strict, phased approach:
- Data Acquisition: Source open-access annotated melanoma datasets (like TCGA).
- Preprocessing Pipeline: Build the logic to chop gigapixel images into digestible, analyzable tiles.
- Baseline Model: Train a simple classifier to differentiate basic tissue types and normal vs. abnormal cells.
- The Tropism Experiment: Feed the model annotated primary lesion data mapped to known metastatic outcomes (Lung vs. Brain) and look for predictive patterns.
- The Interface: Build the viewer so a human can actually interact with the model’s findings.
This is an active research project. Updates will be posted here as the build progresses.