CoCalc provides the best real-time collaborative environment for Jupyter Notebooks, LaTeX documents, and SageMath, scalable from individual users to large groups and classes!
CoCalc provides the best real-time collaborative environment for Jupyter Notebooks, LaTeX documents, and SageMath, scalable from individual users to large groups and classes!
Path: blob/main/003_TensorFlow_Classification.ipynb
Views: 47
Dataset import and exploration
Drop missing values:
Encode string data:
All data is numeric now:
Convert to a binary classification problem
This is not a binary classification problem by default
We can make it one by declaring wines above some quality point good wines and rest of them bad wines
So we'll have 63.3% good wines and the rest are bad
Train/Test split
Data scaling
Input features aren't on the same scale, so we'll fix it quickly:
Model training
This is a completely random neural network architecture
Use
sigmoid
as the activation function in the last layer when working with binary classification problemsUse
binary_crossentropy
as a loss function when working with binary classification problemsWe'll track accuracy, precision, and recall and train for 100 epochs
Model performance visualization
You could keep training the model, as accuracy, precision, and recall seem to grow slightly
Making predictions
These are probabilities - here's how to convert them to classes (threshold = 0.5)
Model evaluation
Evaluation on the test set:
383 True Negatives, 597 True positives, 214 False negatives, 99 False positives
Further evaluation: