Metabiomics

METABIOMICS

Updated July 2020

Metabiomics: Pioneering Early Detection of Colorectal Cancer through Advanced Data Science

Microscopic view of colon polyps

Microscopic view of colon polyps

🏢 Company Overview

Metabiomics is an early stage, private equity backed start-up focused on developing a non-invasive stool test for early detection of colon polyps and colorectal cancer using multi-omic biomarkers derived from the human gut microbiome.

🔬 My Role: Leading Metagenomics Data Science

In my role as the lead Metagenomics Data Scientist, I orchestrate the development and refinement of sophisticated algorithms essential for our innovative diagnostic tools. While the specifics of our algorithms are proprietary, I can share insights into the overarching strategies and methodologies we employ:

⚙️ Innovative Feature Engineering

My team and I employ advanced techniques to interpret both unassembled and assembled metagenomics datasets. Our analytical approach integrates diverse omic data sources including 16S rRNA gene surveys, comprehensive shotgun sequencing data, proteomics, RNA sequencing, metabolomics, and extensive literature insights to uncover novel biomarkers for early disease detection.

🗃️ Databasing and Knowledge Graphs for Robust ML/AI Applications

Once features are available, we organize them in a structure that is amenable for machine learning and/or artificial intelligence prediction workflows.

🤖 Advanced ML/AI Predictive Models

We used custom and off-the-shelf solutions to accurately predict very early stage colon cancer. The type of data going into the training sets, how we validated the data, and how the predictions were executed is part of our proprietary technology. At a high level, it implements many of the cutting-edge AI/ML algorithms used in related and adjacent fields.

🔄 Continuous Model Validation and Optimization

As we integrate new data, we continuously refine our models through automated systems and custom dashboards, enabling straightforward interpretation of results for our stakeholders.

📈 Scaling with Precision

Handling the vast scale of omics data, we implement tailored data processing and storage solutions, adhering to AWS best practices to maintain efficiency and reliability in our operations.