Below is a logistic regression model that uses some dummy data to predict whether people are at risk of diabetes. Of course, this model can’t actually determine whether or not someone has diabetes; it’s just a demonstration.
As I expand this model to take on additional features and larger datasets, I expect its accuracy to improve. I will check the fit of this model (whether it’s underfitted or overfitted) and update my findings in this article, focusing on these questions:
- Is it underfitted or overfitted?
- Is there a bias?
- Does more data make it more accurate?
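
One rough way to approach the first and last of these questions is to compare accuracy on the training data with accuracy on held-out data, then repeat at different dataset sizes. The sketch below is not the model from this article. It is a dependency-free illustration on synthetic data, and the feature meanings, coefficients, learning rate, 80/20 split, and helper names (`make_dummy_data`, `fit`, `accuracy`, `train_test_report`) are all assumptions made for the example. In practice a library such as scikit-learn would likely replace the hand-written training loop.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def make_dummy_data(n, rng):
    # Hypothetical dummy data: two made-up, standardised features
    # (think "glucose" and "BMI") and a noisy 0/1 risk label.
    rows = []
    for _ in range(n):
        x = [rng.gauss(0, 1), rng.gauss(0, 1)]
        p = sigmoid(2.0 * x[0] + 1.0 * x[1])  # assumed "true" relationship
        y = 1 if rng.random() < p else 0
        rows.append((x, y))
    return rows


def fit(rows, lr=0.1, epochs=200):
    # Plain batch gradient descent on the log loss.
    n_features = len(rows[0][0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in rows:
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) - y
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * g / len(rows) for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / len(rows)
    return w, b


def accuracy(rows, w, b):
    # Fraction of rows where the 0.5-thresholded prediction matches the label.
    correct = 0
    for x, y in rows:
        prob = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
        correct += int((1 if prob >= 0.5 else 0) == y)
    return correct / len(rows)


def train_test_report(n, seed=0):
    # Train on the first 80% of the rows, evaluate on the remaining 20%.
    rng = random.Random(seed)
    rows = make_dummy_data(n, rng)
    split = int(0.8 * n)
    train, test = rows[:split], rows[split:]
    w, b = fit(train)
    return accuracy(train, w, b), accuracy(test, w, b)


if __name__ == "__main__":
    for n in (50, 500, 2000):
        train_acc, test_acc = train_test_report(n)
        gap = train_acc - test_acc
        print(f"n={n}: train={train_acc:.3f} test={test_acc:.3f} gap={gap:+.3f}")
```

Training accuracy well above test accuracy hints at overfitting, while low accuracy on both hints at underfitting. With small samples the gap is noisy, so a single split is only a rough signal; averaging over several seeds or splits would give a steadier picture.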