Posts

In Gradient Descent, Is the Gradient a Vector?

In Gradient Descent - Is the Gradient (Slope) a Vector, and Related Questions

Question 1: Is the Gradient a Vector?

Yes, absolutely. In multi-dimensional space, the gradient is a vector with both magnitude and direction.

The Gradient Vector

For a function f(w₁, w₂, ..., wₙ), the gradient is:

       ┌ ∂f/∂w₁ ┐
       │ ∂f/∂w₂ │
∇f  =  │   ⋮    │
       └ ∂f/∂wₙ ┘

Each component tells you: "How much does the loss change if I nudge this particular weight?"

Direction and Magnitude

Direction: Points toward steepest ascent (we move opposite for descent)
Magnitude: How steep the slope is (larger = steeper terrain)

Question 2: One Shot or One-at-a-Time?

One shot — all dimensions simultaneously. This is crucial: standard gradient descent updates ALL parameters together in a single step, not sequentially.

Concrete Example: 2D Landscape

Consider a simple loss function with two weights: L(w₁, w₂) = w₁² + 4w₂². This creates an elliptical bow...
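The simultaneous ("one shot") update on that 2D example can be sketched in a few lines of Python. This is a minimal illustration, not the post's own code; the learning rate of 0.1 and the 50-step budget are assumptions chosen so the toy example converges.

```python
# Gradient descent on L(w1, w2) = w1**2 + 4*w2**2.
# The gradient is the vector (2*w1, 8*w2); note that BOTH
# components are updated together in a single step.

def gradient(w1, w2):
    return 2 * w1, 8 * w2

def descend(w1, w2, lr=0.1, steps=50):
    for _ in range(steps):
        g1, g2 = gradient(w1, w2)
        # Update all parameters simultaneously, opposite to the gradient.
        w1, w2 = w1 - lr * g1, w2 - lr * g2
    return w1, w2

w1, w2 = descend(5.0, 3.0)
print(w1, w2)  # both weights approach the minimum at (0, 0)
```

Because the w₂ direction is 4× steeper, its gradient component is larger and it shrinks faster, which is exactly what the elliptical landscape predicts.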
Recent posts

How to Handle Training Data for ML Models

ML — Machine Learning: How do we handle data?

1. Don't jump straight to Neural Networks or Deep Learning!
2. How to split data into Training, Validation and Test sets
3. What is a Data Leak?
4. What are Forward Propagation and Backward Propagation?
5. Dropout in Neural Networks

This applies to both families of methods:

Non-Neural Network Methods — Traditional ML algorithms (e.g., SVM, Random Forest, Logistic Regression, KNN, Naive Bayes)
Neural Network Methods (Deep Learning) — Multi-layered networks that learn complex patterns

Important Note: Don't jump straight to Neural Networks or Deep Learning! Always check first whether the problem can be solved using traditional machine learning models. Neural networks are powerful but come with added complexity, longer training times, and a need for more data.

Use Neural Networks when:
You have large amounts of data
The problem involves images, audio, video, or text
Traditional methods aren't giving good results
The patterns are hig...
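The train/validation/test split mentioned above can be sketched with the standard library alone. The 70/15/15 ratio and the fixed seed are illustrative assumptions, not values the post prescribes.

```python
# Splitting a dataset into training, validation, and test parts.
# Shuffling first avoids ordering bias (e.g., data sorted by class).
import random

def train_val_test_split(data, train=0.70, val=0.15, seed=42):
    rng = random.Random(seed)          # fixed seed for reproducibility
    shuffled = data[:]
    rng.shuffle(shuffled)
    n = len(shuffled)
    n_train = int(n * train)
    n_val = int(n * val)
    return (shuffled[:n_train],                 # fit the model here
            shuffled[n_train:n_train + n_val],  # tune hyperparameters here
            shuffled[n_train + n_val:])         # touch only once, at the end

train_set, val_set, test_set = train_val_test_split(list(range(100)))
print(len(train_set), len(val_set), len(test_set))  # → 70 15 15
```

Splitting before any preprocessing or feature selection is one simple guard against the data-leak problem the post lists.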

KNN (K-Nearest Neighbors) - Classification/Regression

KNN (K-Nearest Neighbors) is primarily a classification algorithm, though it can also be used for regression.

How KNN Classification works:
1. Choose a value for K (the number of neighbors to consider)
2. For a new data point, find the K closest points in the training data (using a distance measure such as Euclidean distance)
3. Assign the class by majority voting among those K neighbors

Example: If K=5 and 3 of the 5 nearest neighbors are "Cat" and 2 are "Dog," the new point is classified as "Cat."

Key characteristics:
Lazy learner — it doesn't build a model during training; it just stores the data and does all the work at prediction time
Non-parametric — makes no assumptions about the underlying data distribution
Instance-based — uses actual training instances to make predictions

KNN for Regression: When used for regression, instead of voting, it takes the average (or weighted average) of the K neighbors' values to predict a continuous...
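The three steps above fit in a few lines of Python. This is a minimal sketch, not the post's code; the toy 2-D points and the Cat/Dog labels are made-up illustrations.

```python
# Minimal KNN classifier: compute Euclidean distances, take the
# K closest training points, and majority-vote on their labels.
import math
from collections import Counter

def knn_classify(train_points, train_labels, query, k=5):
    # Step 2: distance from the query to every stored training point
    dists = sorted(
        (math.dist(p, query), label)
        for p, label in zip(train_points, train_labels)
    )
    # Step 3: majority vote among the K nearest neighbors
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]

points = [(0, 0), (0, 1), (1, 0), (5, 5), (6, 5), (5, 6)]
labels = ["Cat", "Cat", "Cat", "Dog", "Dog", "Dog"]
print(knn_classify(points, labels, (0.5, 0.5), k=3))  # → Cat
```

Note there is no training step at all, which is exactly the "lazy learner" property: the function just stores the data and does all the work at prediction time.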

Quiz: Underfitting, Overfitting, Bias & Variance

Quiz: Underfitting, Overfitting, Bias & Variance

Questions

Q1. A model performs poorly on both training data and test data. What is this called?
A) Overfitting  B) Underfitting  C) High variance  D) Good generalization

Q2. A model achieves 99% accuracy on training data but only 60% on test data. What problem does this indicate?
A) Underfitting  B) Overfitting  C) High bias  D) Low variance

Q3. Which of the following best describes bias in machine learning?
A) Random fluctuations in model predictions  B) Systematic error from oversimplified model assumptions  C) Error caused by noisy data  D) Difference between training and test accuracy

Q4. High variance in a model means:
A) The model is too simple  B) The model's predictions are stable across different training sets  C) The model is overly sensitive to the training data  D) The model has high bias

Q5. A linear regression model is used to fit a highly nonlinear dataset. This will likely result in: ...

Underfitting & Overfitting - What is Bias and Variance

https://www.geeksforgeeks.org/machine-learning/underfitting-and-overfitting-in-machine-learning/

A machine learning model aims to perform well on both training data and new, unseen data, and is considered "good" if:

It learns patterns effectively from the training data.
It generalizes well to new, unseen data.
It avoids memorizing the training data (overfitting) or failing to capture relevant patterns (underfitting).

To evaluate how well a model learns and generalizes, we monitor its performance on both the training data and a separate validation or test dataset, often measured by its accuracy or prediction errors. However, achieving this balance can be challenging. Two common issues that affect a model's performance and generalization ability are overfitting and underfitting. These problems are major contributors to poor performance in machine learning models. Let us understand what they are and how they contribute to ML mo...

Classification Metrics - Confusion Matrix, Precision, Recall to ROC Curves

Topics:
A. Classification Metrics
B. Class Imbalance

A. Classification Metrics

Precision and recall are key metrics used to evaluate a machine learning model's performance, calculated from a confusion matrix. Precision measures the ratio of correctly predicted positive observations to the total number of positive predictions, answering "Of all the times the model predicted 'yes,' how often was it correct?" Recall measures the ratio of correctly predicted positive observations to all actual positive observations, answering "Of all the actual positive cases, how many did the model find?"

Let's cover these topics:

"The Building Blocks: Understanding TP, TN, FP, and FN"
Start with the foundation
Use real examples (email spam, medical tests)

"The Confusion Matrix: Your Performance Dashboard"
Visual representation of the building blocks
How to read and interpret it

"Accuracy: The Misleading Metric"
Why everyone starts here
Why...
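The two definitions above translate directly into code on the confusion-matrix counts. This is a minimal sketch; the counts below are made-up illustrative numbers, not data from the post.

```python
# Precision and recall from confusion-matrix counts.
# precision = TP / (TP + FP): of all "yes" predictions, how many were right?
# recall    = TP / (TP + FN): of all actual positives, how many were found?

def precision_recall(tp, fp, fn):
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return precision, recall

p, r = precision_recall(tp=80, fp=20, fn=40)
print(p, r)  # → precision 0.8, recall ≈ 0.667
```

Notice that true negatives never appear in either formula, which is why precision and recall stay informative under class imbalance while plain accuracy can mislead.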