
End-to-End Diabetes Diagnosis Predictor

Machine Learning | MLOps | PyTorch | scikit-learn

Project

Overview

I wanted to build a complete pipeline for a reasonably large dataset: one that trains and tests several model families using grid search and stratified cross-validation, then outputs the best model for use in a simple app. For this project I used a diabetes dataset containing 100,000 data points.
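The model-selection step described above can be sketched roughly as follows. This is a minimal illustration with scikit-learn, not the project's actual code: the synthetic data from `make_classification`, the two candidate families, their parameter grids, the `roc_auc` metric, and the helper name `select_best_model` are all assumptions made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the diabetes data (assumption: binary, imbalanced target).
X, y = make_classification(
    n_samples=400, n_features=8, weights=[0.9, 0.1], random_state=0
)

# Hypothetical model families, each paired with a small hyperparameter grid.
candidates = {
    "logistic_regression": (
        Pipeline([("scale", StandardScaler()),
                  ("clf", LogisticRegression(max_iter=1000))]),
        {"clf__C": [0.1, 1.0, 10.0]},
    ),
    "random_forest": (
        Pipeline([("clf", RandomForestClassifier(random_state=0))]),
        {"clf__n_estimators": [25, 50], "clf__max_depth": [3, None]},
    ),
}

# Stratified folds keep the class ratio roughly constant in every split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)


def select_best_model(X, y, candidates, cv, scoring="roc_auc"):
    """Grid-search each family and return the name and search object of the winner."""
    best_name, best_search = None, None
    for name, (estimator, grid) in candidates.items():
        search = GridSearchCV(estimator, grid, cv=cv, scoring=scoring)
        search.fit(X, y)
        if best_search is None or search.best_score_ > best_search.best_score_:
            best_name, best_search = name, search
    return best_name, best_search


best_name, best_search = select_best_model(X, y, candidates, cv)
print(best_name, best_search.best_params_)
```

With the default `refit=True`, `best_search.best_estimator_` is retrained on all of the data passed in, so it is the object one would serialize and hand to the app.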

Technologies

Python

PyTorch

scikit-learn

AWS EC2

AWS ECR

GitHub Actions
