0% found this document useful (0 votes)

6 views

Assignment No 1

The document discusses various types of learning in artificial intelligence, including supervised, unsupervised, reinforcement, self-supervised, and transfer learning. Each type is explained with key concepts, processes, and example algorithms, highlighting their applications and workflows. The document serves as an overview of machine learning methodologies and their significance in AI development.

Uploaded by

shoaibcomputerscience

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Assignment No 1

Uploaded by

shoaibcomputerscience

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Assignment No:1

Subject Name:
Artificial Inteligence

Submitted To:
Sir Shafiqur Rahman

Submitted By:

Muhammad Shoaib

Roll No:
2021-cs-36

University:
MCUT DG Khan
Learning

Learning, in the context of artificial intelligence (AI) and machine learning (ML), refers to the process by
which algorithms improve their performance on a task over time with experience or data. There are
several types of learning, each using different algorithms and techniques to achieve their goals. The
main types of learning are:

1. Supervised Learning

Supervised learning is a type of machine learning where an algorithm is trained on labeled data.
In this context, "labeled data" means that each training example is paired with an output label.
The goal of supervised learning is to learn a mapping from inputs to outputs based on the
example input-output pairs.

Here's a step-by-step explanation of how supervised learning works:

1. Data Collection

 Training Data: The first step is to collect a dataset that includes input-output pairs. The
input data can be anything from images, text, or numerical values, while the output data
(labels) corresponds to the desired prediction for each input.

2. Data Preprocessing

 Cleaning: This step involves removing any noise or irrelevant information from the data.
 Normalization: The data might need to be normalized or standardized to ensure that
different features contribute equally to the learning process.
 Splitting: The dataset is usually split into a training set and a test set. The training set is
used to train the model, while the test set is used to evaluate its performance.

3. Model Selection

 Choosing an Algorithm: Select a suitable machine learning algorithm based on the

problem at hand. Common algorithms include linear regression, logistic regression,
decision trees, support vector machines, and neural networks.

4. Training

 Learning Process: The selected algorithm is used to learn from the training data. The
model adjusts its parameters to minimize the difference between its predictions and the
actual labels in the training data.
 Loss Function: A loss function is used to measure the difference between the predicted
outputs and the actual outputs. The goal of the training process is to minimize this loss
function.

5. Evaluation

 Testing: After training, the model is evaluated on the test set to see how well it
generalizes to new, unseen data. Metrics like accuracy, precision, recall, and F1-score are
often used to evaluate the performance.
 Validation: Sometimes a validation set is used in addition to the training and test sets to
fine-tune the model and avoid overfitting.

6. Prediction

 Making Predictions: Once the model is trained and validated, it can be used to make
predictions on new, unseen data.

Example Algorithms in Supervised Learning

1. Linear Regression

 Use Case: Predicting a continuous output (e.g., predicting house prices).

 Description: Models the relationship between input features and output using a linear
equation.

2. Logistic Regression

 Use Case: Binary classification problems (e.g., spam detection).

 Description: Models the probability that an input belongs to a particular class using a
logistic function.

3. Decision Trees

 Use Case: Both classification and regression tasks.

 Description: Splits the data into subsets based on feature values, forming a tree-like
structure where each leaf represents a class or a value.

4. Support Vector Machines (SVM)

 Use Case: Classification problems with clear margin of separation.

 Description: Finds the hyperplane that best separates the classes in the feature space.

5. Neural Networks

 Use Case: Complex tasks such as image and speech recognition.

 Description: Consists of layers of interconnected nodes (neurons) that learn to extract
features and patterns from the input data.

Example Workflow with a Decision Tree Classifier

1. Data Collection: Collect a labeled dataset, such as customer data with features like age,
income, and a label indicating whether they bought a product.
2. Data Preprocessing: Clean the data, handle missing values, and split it into training and
test sets.
3. Model Selection: Choose a decision tree algorithm.
4. Training: Train the decision tree on the training data, where the algorithm learns the
rules for making decisions.
5. Evaluation: Test the decision tree on the test data to evaluate its performance.
6. Prediction: Use the trained decision tree to predict whether new customers will buy the
product based on their features.

2. Un Supervised Learning

Unsupervised learning is a type of machine learning where the algorithm is trained on unlabeled
data. This means that the input data has no corresponding output labels. The goal of
unsupervised learning is to identify hidden patterns or intrinsic structures in the input data.

Key Concepts of Unsupervised Learning

1. Data Collection
o Unlabeled Data: Collect a dataset that contains only input data without any
output labels. This data can come from various sources like images, text, or
numerical records.
2. Data Preprocessing
o Cleaning: Remove noise and irrelevant information from the data.
o Normalization: Normalize or standardize the data to ensure that different features
contribute equally to the learning process.
3. Model Selection
o Choosing an Algorithm: Select an unsupervised learning algorithm based on the
problem at hand. Common algorithms include clustering algorithms like K-
means, hierarchical clustering, and density-based spatial clustering of applications
with noise (DBSCAN), as well as dimensionality reduction algorithms like
Principal Component Analysis (PCA) and t-distributed Stochastic Neighbor
Embedding (t-SNE).
4. Training
o Learning Process: The selected algorithm processes the input data to learn its
structure. The algorithm tries to find patterns, groupings, or other structures in the
data.
5. Evaluation
o Validation: Evaluate the model based on internal metrics such as the silhouette
score for clustering or the explained variance for dimensionality reduction. There
is no straightforward way to evaluate the model as there are no ground truth
labels.
6. Interpretation and Utilization
o Interpreting Results: Analyze the patterns or structures identified by the
algorithm.
o Using the Model: Apply the model to new data to identify similar patterns or
structures.

Example Algorithms in Unsupervised Learning

1. K-Means Clustering

 Use Case: Grouping similar items into clusters (e.g., customer segmentation).
 Description: Divides the data into K clusters, where each data point belongs to the
cluster with the nearest mean.

2. Hierarchical Clustering

 Use Case: Creating a hierarchy of clusters (e.g., organizing documents into a hierarchy of
topics).
 Description: Builds a tree of clusters by either merging small clusters into larger ones
(agglomerative) or splitting large clusters into smaller ones (divisive).

3. DBSCAN (Density-Based Spatial Clustering of Applications with Noise)

 Use Case: Clustering data with noise and varying density (e.g., identifying clusters of
spatial data).
 Description: Groups together points that are close to each other based on a distance
metric and a minimum number of points, and marks points in low-density regions as
outliers.

4. PCA (Principal Component Analysis)

 Use Case: Reducing the dimensionality of data (e.g., visualizing high-dimensional data in
2D or 3D).
 Description: Transforms the data into a lower-dimensional space by finding the
directions (principal components) that maximize the variance.

5. t-SNE (t-distributed Stochastic Neighbor Embedding)

 Use Case: Visualizing high-dimensional data (e.g., exploring clusters in large datasets).
 Description: Reduces the dimensionality of the data while preserving the local structure,
often used for visualization in 2D or 3D space.
Example Workflow with K-Means Clustering

1. Data Collection: Collect a dataset, such as customer data with features like age, income,
and spending habits.
2. Data Preprocessing: Clean the data, handle missing values, and normalize it.
3. Model Selection: Choose the K-means clustering algorithm.
4. Training: Apply the K-means algorithm to partition the data into K clusters, where K is a
predefined number.
5. Evaluation: Use metrics like the silhouette score to evaluate the quality of the clusters.
6. Interpretation and Utilization: Analyze the clusters to understand the different
segments of customers and use this information for targeted marketing or other business
strategies.

3.Reinforcement Learning
Reinforcement learning (RL) is a type of machine learning where an agent learns to make
decisions by taking actions in an environment to maximize cumulative reward. The agent
interacts with the environment in discrete time steps, observing the current state, taking an
action, receiving a reward, and transitioning to a new state. The goal is to learn a policy that
maximizes the total expected reward over time.

Key Concepts of Reinforcement Learning

1. Agent: The learner or decision-maker.

2. Environment: Everything the agent interacts with.
3. State (S): A representation of the current situation of the agent in the environment.
4. Action (A): Choices made by the agent that affect the state.
5. Reward (R): Immediate return received by the agent after taking an action.
6. Policy (π): Strategy used by the agent to determine the next action based on the current
state.
7. Value Function (V): Measures the long-term return of being in a state under a policy.
8. Q-Value (Q): Measures the long-term return of taking an action in a state under a policy.

Process of Reinforcement Learning

1. Initialization: Initialize the policy, value function, or Q-value function.

2. Interaction: The agent observes the current state and selects an action based on the
policy.
3. Transition: The action is executed, resulting in a transition to a new state and receiving a
reward.
4. Learning: The agent updates the policy, value function, or Q-value function based on the
reward and the new state.
5. Iteration: Repeat the process to improve the policy over time.

Example Algorithms in Reinforcement Learning

1. Q-Learning

 Type: Model-free, off-policy.

 Description: Learns the value of the action in a state directly. It updates the Q-values
using the Bellman equation: Q(s,a)←Q(s,a)+α[r+γmax⁡aQ(s′,a)−Q(s,a)]Q(s, a) \leftarrow
Q(s, a) + \alpha [r + \gamma \max_a Q(s', a) - Q(s, a)]Q(s,a)←Q(s,a)+α[r+γamaxQ(s′,a)
−Q(s,a)] where α\alphaα is the learning rate, γ\gammaγ is the discount factor, rrr is the
reward, sss is the current state, aaa is the action taken, and s′s's′ is the next state.

2. Deep Q-Networks (DQN)

 Type: Model-free, off-policy.

 Description: Extends Q-learning by using a neural network to approximate the Q-value
function. It uses experience replay and a target network to stabilize training.

3. Policy Gradient Methods

 Type: Model-free, on-policy.

 Description: Directly optimizes the policy by adjusting its parameters to maximize

gradient is computed using: ∇θJ(θ)=Eπθ[∇θlog⁡πθ(a∣s)Qπ(s,a)]\nabla_\theta J(\theta) = \

expected rewards. The policy is often represented by a neural network. The policy

mathbb{E}_{\pi_\theta} \left[ \nabla_\theta \log \pi_\theta(a|s) Q^\pi(s, a) \right]∇θ

J(θ)=Eπθ[∇θlogπθ(a∣s)Qπ(s,a)] where θ\thetaθ are the policy parameters.

4. Actor-Critic Methods

 Type: Hybrid of policy-based and value-based methods.

 Description: Consists of two components: the actor (which updates the policy) and the
critic (which estimates the value function). The actor updates the policy in the direction
suggested by the critic.

Example Workflow with Q-Learning

1. Initialization: Initialize the Q-table with zeros, set the learning rate (α\alphaα) and
discount factor (γ\gammaγ).
2. Interaction:
o State Observation: Observe the current state sss.
o Action Selection: Select an action aaa using an exploration strategy (e.g., ε-
greedy).
3. Transition:
o Execute Action: Execute the action aaa and observe the reward rrr and next state
s′s's′.
4. Learning:
o Update Q-Value: Update the Q-value for the state-action pair using the Bellman
equation.
o Q(s,a)←Q(s,a)+α[r+γmax⁡aQ(s′,a)−Q(s,a)]Q(s, a) \leftarrow Q(s, a) + \alpha [r + \
gamma \max_a Q(s', a) - Q(s, a)]Q(s,a)←Q(s,a)+α[r+γamaxQ(s′,a)−Q(s,a)]
5. Iteration: Repeat the process for a set number of episodes or until convergence.

Example: Reinforcement Learning in a Grid World

1. Initialization: Initialize the Q-table with zeros for all state-action pairs in a grid world.
2. Interaction: The agent starts at a random position in the grid.
3. Transition: The agent takes an action (move up, down, left, right), receives a reward
(e.g., -1 for each move, +10 for reaching the goal), and transitions to a new state.
4. Learning: The agent updates the Q-table based on the reward and the new state.
5. Iteration: The agent continues to explore and update the Q-table until it learns the
optimal policy to reach the goal.

4. Self supervised Learning

Concept

Self-supervised learning (SSL) is an advanced machine learning approach where models learn
patterns from data without needing labeled datasets.

Method

SSL algorithms create their own labels by exploiting the data's inherent structure.

Examples

 Computer Vision: An SSL algorithm might mask parts of an image and train the model
to predict the missing sections.
 Natural Language Processing: SSL methods can mask words within a sentence,
requiring the model to predict the missing words based on the surrounding context.

Benefits

These techniques enable models to learn useful representations and features from large amounts
of unlabeled data, reducing the need for costly and time-consuming data annotation.

Impact

SSL enhances the efficiency and scalability of training sophisticated AI systems, making it a
powerful tool in modern machine learning.
5. Transfer Learning

Transfer learning involves taking a pre-trained model and adapting it to a new but related
problem. This approach is particularly useful when there is a limited amount of data available for
the new problem.

 Algorithms: Typically involve fine-tuning pre-trained models like neural networks (e.g.,
fine-tuning BERT for a specific NLP task).

Gear Teq
100% (2)
Gear Teq
135 pages
ml unit 2
No ratings yet
ml unit 2
23 pages
Machine Learning Is A Branch of Artificial Intelligence (AI)
No ratings yet
Machine Learning Is A Branch of Artificial Intelligence (AI)
80 pages
ML Doc1
No ratings yet
ML Doc1
14 pages
Ai Unit-4 ML
No ratings yet
Ai Unit-4 ML
4 pages
Notes Unit 1
No ratings yet
Notes Unit 1
13 pages
Data Science Process and Machine Learning
No ratings yet
Data Science Process and Machine Learning
6 pages
AI Module III
No ratings yet
AI Module III
14 pages
Practical # 9
No ratings yet
Practical # 9
4 pages
Machine Learning concise notes
No ratings yet
Machine Learning concise notes
7 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
LECTURE-2
No ratings yet
LECTURE-2
36 pages
MLT Unit 1
No ratings yet
MLT Unit 1
15 pages
Machine Learning: BE Sixth Semester 20CS610
No ratings yet
Machine Learning: BE Sixth Semester 20CS610
211 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
Rohit Unit 1 ML Notes
No ratings yet
Rohit Unit 1 ML Notes
27 pages
Unit 6 Learning and Knowledge Acquisition
No ratings yet
Unit 6 Learning and Knowledge Acquisition
9 pages
THEORY FILE - Machine Learning (6th Sem)!!
No ratings yet
THEORY FILE - Machine Learning (6th Sem)!!
26 pages
Machine Learning IAI
No ratings yet
Machine Learning IAI
94 pages
Basic of Machine Learning
No ratings yet
Basic of Machine Learning
7 pages
Unit-1
No ratings yet
Unit-1
24 pages
ML UNIT 1
No ratings yet
ML UNIT 1
20 pages
21cs743 Solutions
No ratings yet
21cs743 Solutions
19 pages
UNit 1 Introduction To ML
No ratings yet
UNit 1 Introduction To ML
225 pages
book of 843_AI_Student_HandbookXI-104-127
No ratings yet
book of 843_AI_Student_HandbookXI-104-127
24 pages
Machine Learning Notes
100% (1)
Machine Learning Notes
8 pages
ML Unit 1
No ratings yet
ML Unit 1
21 pages
AI lab6 (1)
No ratings yet
AI lab6 (1)
7 pages
ML notes
No ratings yet
ML notes
16 pages
machine learning and AI
No ratings yet
machine learning and AI
13 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
19 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
ML Unit-2
No ratings yet
ML Unit-2
17 pages
Chapter 01 machine learning
No ratings yet
Chapter 01 machine learning
22 pages
ML Unit-1
No ratings yet
ML Unit-1
28 pages
Full Notes
No ratings yet
Full Notes
37 pages
Null 5
No ratings yet
Null 5
16 pages
Unit-5 Machine Learning
No ratings yet
Unit-5 Machine Learning
25 pages
ML Unit 1
No ratings yet
ML Unit 1
27 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
74 pages
Unit-4object Segmentation Regression Vs Segmentation Supervised and Unsupervised Learning Tree Building Regression Classification Overfitting Pruning and Complexity Multiple Decision Trees
No ratings yet
Unit-4object Segmentation Regression Vs Segmentation Supervised and Unsupervised Learning Tree Building Regression Classification Overfitting Pruning and Complexity Multiple Decision Trees
25 pages
MAchineLearningNotes
No ratings yet
MAchineLearningNotes
6 pages
Machine Learning INTRO
No ratings yet
Machine Learning INTRO
12 pages
Chap 10-Machine Learning
No ratings yet
Chap 10-Machine Learning
25 pages
Datascience Notes
No ratings yet
Datascience Notes
16 pages
Deep Learnng IA
No ratings yet
Deep Learnng IA
69 pages
Machine Learning.
No ratings yet
Machine Learning.
50 pages
Kaleb ML
No ratings yet
Kaleb ML
20 pages
Unit 1 - Machine Learning - NOTES1 - ML
No ratings yet
Unit 1 - Machine Learning - NOTES1 - ML
52 pages
ML Unit1
No ratings yet
ML Unit1
25 pages
Machine Learning Is The Branch of
No ratings yet
Machine Learning Is The Branch of
12 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Machine learning
No ratings yet
Machine learning
12 pages
15. Machine Learning Classification, Regression and Clustering
No ratings yet
15. Machine Learning Classification, Regression and Clustering
77 pages
MachineLearning
No ratings yet
MachineLearning
16 pages
Module 1 ML
No ratings yet
Module 1 ML
8 pages
Supervised and Deep Learning
No ratings yet
Supervised and Deep Learning
83 pages
DS-unit2
No ratings yet
DS-unit2
23 pages
FDS Assignment
No ratings yet
FDS Assignment
76 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Envio Correo Foxypreviewer
No ratings yet
Envio Correo Foxypreviewer
2 pages
Deep Dive SOAR 2
No ratings yet
Deep Dive SOAR 2
22 pages
Jetweigh Brochure
No ratings yet
Jetweigh Brochure
7 pages
Teste
No ratings yet
Teste
1 page
Uplift_modelling
No ratings yet
Uplift_modelling
6 pages
A13C Manual of AI Plugin V2.0
No ratings yet
A13C Manual of AI Plugin V2.0
7 pages
trellix-mcafee-mvision-standard-1-1bz-5-250-user-1yr-subscription-with-1yr-business-software-support-bz-manual
No ratings yet
trellix-mcafee-mvision-standard-1-1bz-5-250-user-1yr-subscription-with-1yr-business-software-support-bz-manual
509 pages
Sample DOC Files Download - Get Examples Instantly
No ratings yet
Sample DOC Files Download - Get Examples Instantly
2 pages
Number System for the CAT 2nd Edition Nishit K. Sinha pdf download
100% (3)
Number System for the CAT 2nd Edition Nishit K. Sinha pdf download
65 pages
PhotoPrint Installation Manual
No ratings yet
PhotoPrint Installation Manual
3 pages
VISHAL ROHIT Project
No ratings yet
VISHAL ROHIT Project
35 pages
Introduction of Et2019
No ratings yet
Introduction of Et2019
24 pages
Rajalakshmi Engineering College
100% (2)
Rajalakshmi Engineering College
3 pages
Soc Analyst Responsibilities
No ratings yet
Soc Analyst Responsibilities
3 pages
COMPUTER STUDIES F3T2 PP1 MS
No ratings yet
COMPUTER STUDIES F3T2 PP1 MS
7 pages
Cessna 500 Citation I IPC
No ratings yet
Cessna 500 Citation I IPC
3,468 pages
1 - ER Series Industrial Robot Three Dimension Visual Debugging Manual
No ratings yet
1 - ER Series Industrial Robot Three Dimension Visual Debugging Manual
40 pages
Ashok
No ratings yet
Ashok
5 pages
Strategy
No ratings yet
Strategy
2 pages
RJFS KeyboardShortcuts
No ratings yet
RJFS KeyboardShortcuts
5 pages
Ethical Hacking
No ratings yet
Ethical Hacking
3 pages
Practicals PBL I
No ratings yet
Practicals PBL I
38 pages
AI^1
No ratings yet
AI^1
12 pages
Ex2 Answers PDF
No ratings yet
Ex2 Answers PDF
4 pages
Splash PRO: User Manual
No ratings yet
Splash PRO: User Manual
28 pages
8051 Instruction Set: Facebooktwitterlinkedinredditpinterestshare
No ratings yet
8051 Instruction Set: Facebooktwitterlinkedinredditpinterestshare
13 pages
10.2.6-Packet-Tracer - Use-Lldp-To-Map-A-Network
No ratings yet
10.2.6-Packet-Tracer - Use-Lldp-To-Map-A-Network
6 pages
Cse8 ch02
No ratings yet
Cse8 ch02
25 pages
Computer Science A Structured Approach Using C Behrouz A. Forouzan download
100% (2)
Computer Science A Structured Approach Using C Behrouz A. Forouzan download
60 pages

Uploaded by

Uploaded by

Assignment No:1

Here's a step-by-step explanation of how supervised learning works:

 Choosing an Algorithm: Select a suitable machine learning algorithm based on the

Example Algorithms in Supervised Learning

 Use Case: Predicting a continuous output (e.g., predicting house prices).

 Use Case: Binary classification problems (e.g., spam detection).

 Use Case: Both classification and regression tasks.

4. Support Vector Machines (SVM)

 Use Case: Classification problems with clear margin of separation.

 Use Case: Complex tasks such as image and speech recognition.

Example Workflow with a Decision Tree Classifier

Key Concepts of Unsupervised Learning

Example Algorithms in Unsupervised Learning

3. DBSCAN (Density-Based Spatial Clustering of Applications with Noise)

4. PCA (Principal Component Analysis)

5. t-SNE (t-distributed Stochastic Neighbor Embedding)

Key Concepts of Reinforcement Learning

1. Agent: The learner or decision-maker.

Process of Reinforcement Learning

1. Initialization: Initialize the policy, value function, or Q-value function.

Example Algorithms in Reinforcement Learning

 Type: Model-free, off-policy.

2. Deep Q-Networks (DQN)

 Type: Model-free, off-policy.

3. Policy Gradient Methods

 Type: Model-free, on-policy.

gradient is computed using: ∇θJ(θ)=Eπθ[∇θlog⁡πθ(a∣s)Qπ(s,a)]\nabla_\theta J(\theta) = \

mathbb{E}_{\pi_\theta} \left[ \nabla_\theta \log \pi_\theta(a|s) Q^\pi(s, a) \right]∇θ

 Type: Hybrid of policy-based and value-based methods.

Example Workflow with Q-Learning

Example: Reinforcement Learning in a Grid World

4. Self supervised Learning

You might also like