So! I’m taking this course to make sure my fundamentals in ML are solid. I’ve done project work and independent study here and there, but I longed to formalize it!
Table of Contents
Open Table of Contents
What is Machine Learning (ML)
Distinctions
-
Artificial Intelligence (AI):
The theory and development of computer systems that can perform tasks normally requiring human intelligence, such as visual perception, speech recognition, and decision-making. -
Machine Learning (ML):
An application of AI that provides systems with the ability to automatically learn tasks and improve from experience without explicit programming. ML focuses on developing architectures that can process data in new ways and learn for themselves. -
Deep Learning (DL):
A subset of ML that uses hierarchical layers of artificial neural networks to carry out learning.
A machine learning algorithm is said to learn from data if it improves its performance on a task with experience:
“A computer program is said to learn from experience , with respect to some class of tasks and performance measure , if its performance at tasks in , as measured by , improves with experience .” (Mitchell, 1997)
- Philosophically: understanding ML means probing the principles that underlie intelligence.
- ML tasks are usually described in terms of how ML systems should process examples.
Common ML Tasks
-
Classification:
-
Classification with missing inputs:
Either learn all possible mappings or the joint distribution of all inputs, which can then be marginalized over missing inputs. -
Regression:
-
Other tasks:
- Transcription
- Structured outputs
- Anomaly detection (e.g., fraud detection: profile of user is built and monitored)
- Synthesis and sampling (e.g., text-to-speech, procedural texture generation in video games)
- Imputation of missing values
- Denoising
- Density or probability mass function estimation
Performance Measures
Performance is task-specific. For classification, common metrics include:
-
Accuracy / Precision:
-
Recall / Sensitivity:
-
Specificity:
-
Error rate / similarity measures.
Types of Learning
- Supervised learning: labeled outputs exist.
- Unsupervised learning: no labels, discover hidden structure.
- Semi-supervised learning: some labeled, some unlabeled data.
- Weakly supervised learning: coarse-grained labels.
- Self-supervised learning: labels generated automatically from pretext tasks, often used in transfer learning.
General Workflow of Training an Algorithm
- Collect data
- Design features
- Train the model
- Test the model (on unseen data)
Common Applications
- Targeted advertising (recommend ads based on user data)
- Recommender systems (matrix completion, rank minimization)
- Object detection and recognition (driver-assist and self-driving cars)
- Other examples:
- Inpainting / imputation of missing values
- Image super-resolution
- Robust face recognition