0% found this document useful (0 votes)

25 views

Intorduction of ML

Uploaded by

priyankabhatele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views

Intorduction of ML

Uploaded by

priyankabhatele

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 14

Unit -1

Machine learning (ML):

It is the scientific study of algorithms and statistical models that computer systems use to
perform a specific task without using explicit instructions, relying on patterns
and inference instead.
It is seen as a subset of artificial intelligence. Machine learning algorithms build a mathematical
model based on sample data, known as "training data", in order to make predictions or decisions
without being explicitly programmed to perform the task. [1][2]:2 Machine learning algorithms are
used in a wide variety of applications, such as email filtering and computer vision, where it is
difficult or infeasible to develop a conventional algorithm for effectively performing the task.
Although a machine learning model may apply a mix of different techniques, the
methods for learning can typically be categorized as three general types:

 Supervised learning: The learning algorithm is given labeled data and the
desired output. For example, pictures of dogs labeled “dog” will help the
algorithm identify the rules to classify pictures of dogs.
 Unsupervised learning: The data given to the learning algorithm is
unlabeled, and the algorithm is asked to identify patterns in the input data. For
example, the recommendation system of an e-commerce website where the
learning algorithm discovers similar items often bought together.
 Reinforcement learning: The algorithm interacts with a dynamic
environment that provides feedback in terms of rewards and punishments. For
example, self-driving cars being rewarded to stay on the road.1
Supervised Learning
 Supervised learning do the work of function approximation, where basically we train an
algorithm and in the end of the process we pick the function that best describes the input
data, the one that for a given X makes the best estimation of y (X -> y). Most of the time
we are not able to figure out the true function that always make the correct predictions and
other reason is that the algorithm rely upon an assumption made by humans about how the
computer should learn and this assumptions introduce a bias, Bias is topic I’ll explain
in another post.

 Here input dataset acts as a teacher where we feed the computer with training data containing
the input/predictors and we show it the correct answers (output or the label of input
predictors ). Form the training dataset, the model learns the mapping function between the
input predictors and the output variable.

 Supervised learning algorithms try to model relationships and dependencies between the
target prediction output and the input features such that we can predict the output values for
new data based on those relationships which it learned from the previous data sets.

 Supervised learning based models are the predictive models that predict either the value of a
continuous variable ( like temperature, stock price etc) which we calls regression, another is
the prediction of class ( like input image is of dog or cat) which we call classification models.

List of Common Algorithms of Supervised learning

 Nearest Neighbor
 Naive Bayes
 Decision Trees
 Linear Regression
 Support Vector Machines (SVM)
 Neural Networks

Classification and Regression in supervised Learning:

Classification algorithms and regression algorithms are types of supervised learning.
Classification algorithms are used when the value of the output variable is restricted to a limited
set of values i.e. class numbers. For a classification algorithm that filters emails, the input would
be an incoming email, and the output would be the name of the folder in which to file the email.
For an algorithm that identifies spam emails, the output would be the prediction of either " spam"
or "not spam", represented by the Boolean values true and false.
Regression algorithms are named for their continuous outputs, meaning they may have any value
within a range. Examples of a continuous value are the temperature, length, or price of an object.

.
In the case of semi-supervised learning algorithms, some of the training examples are missing
training labels, but they can nevertheless be used to improve the quality of a model. In weakly
supervised learning, the training labels are noisy, limited, or imprecise; however, these labels are
often cheaper to obtain, resulting in larger effective training sets.

Unsupervised learning
Unsupervised learning algorithms take a set of data that contains only inputs, and find structure
in the data, like grouping or clustering of data points. The algorithms, therefore, learn from test
data that has not been labeled, classified or categorized. Instead of responding to feedback,
unsupervised learning algorithms identify commonalities in the data and react based on the
presence or absence of such commonalities in each new piece of data. A central application of
unsupervised learning is in the field of density estimation in statistics, though unsupervised
learning encompasses other domains involving summarizing and explaining data features.
Cluster analysis is the assignment of a set of observations into subsets (called clusters) so that
observations within the same cluster are similar according to one or more predesignated criteria,
while observations drawn from different clusters are dissimilar. Different clustering techniques
make different assumptions on the structure of the data, often defined by some similarity
metric and evaluated, for example, by internal compactness, or the similarity between members
of the same cluster, and separation, the difference between clusters. Other methods are based
on estimated density and graph connectivity.

Semi-supervised learning
Semi-supervised learning falls between unsupervised learning (without any labeled training data)
and supervised learning (with completely labeled training data). Many machine-learning
researchers have found that unlabeled data, when used in conjunction with a small amount of
labeled data, can produce a considerable improvement in learning accuracy.

Reinforcement learning
Reinforcement learning is an area of machine learning concerned with how software
agents ought to take actions in an environment so as to maximize some notion of cumulative
reward. Due to its generality, the field is studied in many other disciplines, such as game
theory, control theory, operations research, information theory, simulation-based
optimization, multi-agent systems, swarm intelligence, statistics and genetic algorithms. In
machine learning, the environment is typically represented as a Markov Decision
Process (MDP). Many reinforcement learning algorithms use dynamic
programming techniques. Reinforcement learning algorithms do not assume knowledge of an
exact mathematical model of the MDP, and are used when exact models are infeasible.
Reinforcement learning algorithms are used in autonomous vehicles or in learning to play a game
against a human opponent.
Application of Machine Learning:

Limitations of Machine Learning:

Lack of Data : Many machine learning algorithms require large amounts of data
before they begin to give useful results. A good example of this is a neural
network. Neural networks are data-eating machines that require copious amounts
of training data. The larger the architecture, the more data is needed to produce
viable results. Reusing data is a bad idea, and data augmentation is useful to some
extent, but having more data is always the preferred solution. If you can get the
data, then use it.

Lack of Good Data: Despite the appearance, this is not the same as the above
comment. Let’s imagine you think you can cheat by generating ten thousand fake
data points to put in your neural network. What happens when you put it in?

1. It will train itself, and then when you come to test it on an unseen data set, it
will not perform well. You had the data but the quality of the data was not up
to scratch.
2. In the same way that having a lack of good features can cause your algorithm
to perform poorly, having a lack of good ground truth data can also limit the
capabilities of your model. No company is going to implement a machine
learning model that performs worse than human-level error.
3. Similarly, applying a model that was trained on a set of data in one situation
may not necessarily apply as well to a second situation. The best example of
this I have found so far is in breast cancer prediction.
4. Mammography databases have a lot of images in them, but they suffer from
one problem that has caused significant issues in recent years — almost all of
the x-rays are from white women. This may not sound like a big deal, but
actually, black women have been shown to be 42 percent more likely to die
from breast cancer due to a wide range of factors that may include
differences in detection and access to health care. Thus, training an algorithm
primarily on white women adversely impacts black women in this case.
5. What is needed in this specific case is a larger number of x-rays of black
patients in the training database, more features relevant to the cause of this 42
percent increased likelihood, and for the algorithm to be more equitable by
stratifying the dataset along the relevant axes.

Data Augmentation

Data augmentation is a method by which you can virtually increase the number of samples in
your dataset using data you already have. For image augmentation, it can be achieved
by performing geometric transformations, changes to color, brightness, contrast or by adding
some noise. Currently there are ongoing studies on interesting new methods in data
augmentation using Generative Adversarial Networks or by pairing samples.

Data Augmentation in image Processing:

 Position augmentation
 Scaling
 Cropping
 Flipping
 Padding
 Rotation
 Translation
 Affine transformation

 Color augmentation
 Brightness
 Contrast
 Saturation
 Hue

Scaling
In scaling or resizing, the image is resized to the given size e.g. the width of the image can be
doubled.

Cropping
In cropping, a portion of the image is selected e.g. in the given example the center cropped image is
returned
Flipping
In flipping, the image is flipped horizontally or vertically.

Padding
In padding, the image is padded with a given value on all sides.

Rotation
The image is rotated randomly in rotation.
Translation
In translation, the image is moved either along the x-axis or y-axis.

Color augmentation
Color augmentation or color jittering deals with altering the color properties of an image by changing
its pixel values.

Brightness
One way to augment is to change the brightness of the image. The resultant image becomes darker
or lighter compared to the original one.

Contrast
The contrast is defined as the degree of separation between the darkest and brightest areas of an
image. The contrast of the image can also be changed.

Saturation
Saturation is the separation between colors of an image.

Hue
Hue can be described of as the shade of the colors in an image
Topic: Eigen vector and Eigen value
Eigenvectors and eigenvalues have many important applications in computer vision and machine
learning in general. Well known examples are PCA (Principal Component Analysis) for
dimensionality reduction or EigenFaces for face recognition. An interesting use of eigenvectors and
eigenvalues is also illustrated in my post about error ellipses. Furthermore, eigendecomposition
forms the base of the geometric interpretation of covariance matrices, discussed in an more recent
post. In this article, I will provide a gentle introduction into this mathematical concept, and will show
how to manually obtain the eigendecomposition of a 2D square matrix.

An eigenvector is a vector whose direction remains unchanged when a linear transformation is

applied to it. Consider the image below in which three vectors are shown. The green square is only
drawn to illustrate the linear transformation that is applied to each of these three vectors.

Eigenvectors (red) do not change direction when a linear transformation (e.g. scaling) is applied to
them. Other vectors (yellow) do.

The transformation in this case is a simple scaling with factor 2 in the horizontal direction and factor
0.5 in the vertical direction, such that the transformation matrix is defined as:

A vector is then scaled by applying this transformation as . The above

figure shows that the direction of some vectors (shown in red) is not affected by this linear
transformation. These vectors are called eigenvectors of the transformation, and uniquely define the
square matrix . This unique, deterministic relation is exactly the reason that those vectors are
called ‘eigenvectors’ (Eigen means ‘specific’ in German).

In general, the eigenvector of a matrix is the vector for which the following holds:
where is a scalar value called the ‘eigenvalue’. This means that the linear transformation on
vector is completely defined by .

We can rewrite equation (1) as follows:

where is the identity matrix of the same dimensions as .

However, assuming that is not the null-vector, equation (2) can only be defined if is
not invertible. If a square matrix is not invertible, that means that its determinant must equal zero.
Therefore, to find the eigenvectors of , we simply have to solve the following equation:

In the following sections we will determine the eigenvectors and eigenvalues of a matrix , by
solving equation (3). Matrix in this example, is defined by:

Calculating the eigenvalues

To determine the eigenvalues for this example, we substitute in equation (3) by equation (4) and
obtain:

Calculating the determinant gives:

(6)

To solve this quadratic equation in , we find the discriminant:

Since the discriminant is strictly positive, this means that two different values for exist:
We have now determined the two eigenvalues and . Note that a square matrix of
size always has exactly eigenvalues, each with a corresponding eigenvector. The
eigenvalue specifies the size of the eigenvector.

Calculating the first eigenvector

We can now determine the eigenvectors by plugging the eigenvalues from equation (7) into equation
(1) that originally defined the problem. The eigenvectors are then found by solving this system of
equations.

We first do this for eigenvalue , in order to find the corresponding first eigenvector:

Since this is simply the matrix notation for a system of equations, we can write it in its equivalent
form:

and solve the first equation as a function of , resulting in:

Since an eigenvector simply represents an orientation (the corresponding eigenvalue represents the
magnitude), all scalar multiples of the eigenvector are vectors that are parallel to this eigenvector,
and are therefore equivalent (If we would normalize the vectors, they would all be equal). Thus,
instead of further solving the above system of equations, we can freely chose a real value for
either or , and determine the other one by using equation (9).

For this example, we arbitrarily choose , such that . Therefore, the

eigenvector that corresponds to eigenvalue is
Calculating the second eigenvector
Calculations for the second eigenvector are similar to those needed for the first eigenvector;
We now substitute eigenvalue into equation (1), yielding:

Written as a system of equations, this is equivalent to:

Solving the first equation as a function of resuls in:

We then arbitrarily choose , and find . Therefore, the eigenvector that

corresponds to eigenvalue is
Topic : Gradient Descent Based Linear Regression

It is a kind of Supervised Learning which can be used to predict the value of a

continuous variable like temperature, pressure, stock price.

The training dataset will be divided into two sections, one is the set of independent
variables(set of independent features) and another is the dependent variable which
is to be predicted. For example: In dataset of mobile price prediction, the input
dataset will be divided, input feature set ( CPU speed, ram, pixels for camera,
battery ) and output feature will be price which is dependent on the input feature
set.

Error
Function

Steps to calculate Linear Function

1. Assume Random values for m and b (slope and intercept)

2. For each ith Iteration or epoch, repeat the process from step 3 to 7
3. Evaluate gradient(Gm, Gb) for m and b using error from each ith sample
according to eq.1

4. Update M and C
5. m=m-(learning rate*Gm)
6. b= b-(Learning rate*Gb)

Applied ML notes
No ratings yet
Applied ML notes
123 pages
Immediate Download Foundation Mathematics 1st Edition K.A. Stroud Ebooks 2024
100% (11)
Immediate Download Foundation Mathematics 1st Edition K.A. Stroud Ebooks 2024
60 pages
Numerical Methods With MATLAB - Recktenwald PDF
100% (1)
Numerical Methods With MATLAB - Recktenwald PDF
85 pages
SCSA3015 Deep Learning Unit 1 Notes PDF
No ratings yet
SCSA3015 Deep Learning Unit 1 Notes PDF
30 pages
Machine Learning Lab Viva
100% (1)
Machine Learning Lab Viva
9 pages
(Michael Farber) Invitation To Topological Robotic
100% (3)
(Michael Farber) Invitation To Topological Robotic
145 pages
MA20218 Analysis 2A: Lecture Notes
No ratings yet
MA20218 Analysis 2A: Lecture Notes
62 pages
machine learning notes
No ratings yet
machine learning notes
20 pages
ML Doc1
No ratings yet
ML Doc1
14 pages
UNIT II deep learning
No ratings yet
UNIT II deep learning
42 pages
ML1
No ratings yet
ML1
33 pages
PDF&Rendition=1 2
No ratings yet
PDF&Rendition=1 2
27 pages
CHP 1
No ratings yet
CHP 1
47 pages
Machine Learning Notes
100% (1)
Machine Learning Notes
8 pages
Intro Machine Learning
No ratings yet
Intro Machine Learning
4 pages
Machine Learning
No ratings yet
Machine Learning
5 pages
Null 5
No ratings yet
Null 5
16 pages
Machine Learning Lecture Notes
No ratings yet
Machine Learning Lecture Notes
19 pages
Machine Learning Practical File
No ratings yet
Machine Learning Practical File
41 pages
INTRODUCTION TO MACHINE LEARNING
No ratings yet
INTRODUCTION TO MACHINE LEARNING
31 pages
Machine Learning(MCA)
No ratings yet
Machine Learning(MCA)
5 pages
Session 3 Types of Machine Learning (1)
No ratings yet
Session 3 Types of Machine Learning (1)
22 pages
ML Lecture - 1
No ratings yet
ML Lecture - 1
33 pages
6CS4 AI Unit-4 @zammers
No ratings yet
6CS4 AI Unit-4 @zammers
129 pages
02 ML Supervised Learning
No ratings yet
02 ML Supervised Learning
32 pages
Unit 4 AI LASK
No ratings yet
Unit 4 AI LASK
7 pages
unit 1
100% (1)
unit 1
13 pages
Machine Learning Techniques-bcds062!01!01[1]
No ratings yet
Machine Learning Techniques-bcds062!01!01[1]
66 pages
Chapter1
No ratings yet
Chapter1
30 pages
Machine Learning Lecture
No ratings yet
Machine Learning Lecture
10 pages
An Overview of Machine Learning
No ratings yet
An Overview of Machine Learning
20 pages
Unit 3-Introduction to Machine Learning
No ratings yet
Unit 3-Introduction to Machine Learning
44 pages
UNIT-1 DLL
No ratings yet
UNIT-1 DLL
73 pages
Chapter 1 Introduction To Machine Learning
No ratings yet
Chapter 1 Introduction To Machine Learning
29 pages
There Are Key Areas in The Process of Machine Learning, Like
No ratings yet
There Are Key Areas in The Process of Machine Learning, Like
45 pages
Introduction to ML
No ratings yet
Introduction to ML
17 pages
Machine Learning Unit-I
No ratings yet
Machine Learning Unit-I
41 pages
1.Machine Learning Basics
No ratings yet
1.Machine Learning Basics
74 pages
Unit5_ML_introduction
No ratings yet
Unit5_ML_introduction
32 pages
ML Unit 1
No ratings yet
ML Unit 1
19 pages
UNIT-I
No ratings yet
UNIT-I
38 pages
9e27d2e7-5dfa-4b8b-b760-d1fb4a21abd0
No ratings yet
9e27d2e7-5dfa-4b8b-b760-d1fb4a21abd0
24 pages
Machine Learnning
No ratings yet
Machine Learnning
17 pages
Python UNIT-5
100% (1)
Python UNIT-5
67 pages
Engineer Being Machine Learning Notes
No ratings yet
Engineer Being Machine Learning Notes
95 pages
ML Notes UT-1
No ratings yet
ML Notes UT-1
21 pages
AI using Python
No ratings yet
AI using Python
26 pages
lksk ML typesToStudents
No ratings yet
lksk ML typesToStudents
18 pages
Machine Learning L1
No ratings yet
Machine Learning L1
34 pages
1 ML Landscape, ML Categories
No ratings yet
1 ML Landscape, ML Categories
3 pages
Machine Learning Essentials
No ratings yet
Machine Learning Essentials
19 pages
Introduction to Machine Learing
No ratings yet
Introduction to Machine Learing
4 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
Data Science Vijay1
No ratings yet
Data Science Vijay1
88 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
21 pages
1 What is Machine
No ratings yet
1 What is Machine
15 pages
ML Chapter 1
No ratings yet
ML Chapter 1
37 pages
FDS Assignment
No ratings yet
FDS Assignment
76 pages
Unit 1
No ratings yet
Unit 1
21 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
unit V
No ratings yet
unit V
67 pages
Module1 And2
No ratings yet
Module1 And2
122 pages
UNit 1 Introduction To ML
No ratings yet
UNit 1 Introduction To ML
225 pages
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
From Everand
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
Elaine Tate
No ratings yet
Linear Algebra: Part B
No ratings yet
Linear Algebra: Part B
15 pages
2A Vectors - Notes
No ratings yet
2A Vectors - Notes
24 pages
Course Brochure - JEE Main 2024 January Crash Course (Batch 1)
No ratings yet
Course Brochure - JEE Main 2024 January Crash Course (Batch 1)
10 pages
Vector Space - Wikipedia
No ratings yet
Vector Space - Wikipedia
24 pages
Ggsipu Catalogue
No ratings yet
Ggsipu Catalogue
76 pages
Instant ebooks textbook Harmonic Vector Fields Variational Principles and Differential Geometry 1st Edition Sorin Dragomir download all chapters
100% (2)
Instant ebooks textbook Harmonic Vector Fields Variational Principles and Differential Geometry 1st Edition Sorin Dragomir download all chapters
55 pages
Dharmsinh Desai University, Nadiad: Faculty of Information Science
No ratings yet
Dharmsinh Desai University, Nadiad: Faculty of Information Science
39 pages
Mathematics For Data Science 2 - Week 3 GA
0% (1)
Mathematics For Data Science 2 - Week 3 GA
14 pages
Horace Lamb - Statics: Including Hydrostatics and The Elements of The Theory of Elasticity
No ratings yet
Horace Lamb - Statics: Including Hydrostatics and The Elements of The Theory of Elasticity
364 pages
CAAM 402/502 Spring 2013 Homework 2 Solutions
No ratings yet
CAAM 402/502 Spring 2013 Homework 2 Solutions
5 pages
Statics-M L Khanna PDF
No ratings yet
Statics-M L Khanna PDF
242 pages
ND Syllabus
No ratings yet
ND Syllabus
179 pages
Lab 01: IT1005: Name: Lee Zhi Kang Matric No.: A0088205E
No ratings yet
Lab 01: IT1005: Name: Lee Zhi Kang Matric No.: A0088205E
5 pages
MSc Mathematics Part-I_Part-II
No ratings yet
MSc Mathematics Part-I_Part-II
16 pages
Thomson 2006
No ratings yet
Thomson 2006
5 pages
Csir Net Mathematics Info
No ratings yet
Csir Net Mathematics Info
22 pages
Slidingrod Revised 4 Format JSV
No ratings yet
Slidingrod Revised 4 Format JSV
30 pages
Lattices: Hendrik W. Lenstra, JR
No ratings yet
Lattices: Hendrik W. Lenstra, JR
56 pages
Multilinear Algebra
100% (1)
Multilinear Algebra
142 pages
DSP Lab Manual New 2018-19
No ratings yet
DSP Lab Manual New 2018-19
45 pages
Sy - Integral Calculus
No ratings yet
Sy - Integral Calculus
12 pages
Pixel Purity Index-Based Algorithms For Endmember Extraction From Hyperspectral Imagery
No ratings yet
Pixel Purity Index-Based Algorithms For Endmember Extraction From Hyperspectral Imagery
34 pages
dg1-3 Surfaces in E 3
No ratings yet
dg1-3 Surfaces in E 3
5 pages
Chapter (Nhon Van Do)
No ratings yet
Chapter (Nhon Van Do)
25 pages
Partial Differential Equations A Unified Hilbert Space Approach 1st Edition Rainer Picard instant download
100% (2)
Partial Differential Equations A Unified Hilbert Space Approach 1st Edition Rainer Picard instant download
36 pages

Uploaded by

Uploaded by

Unit -1

Machine learning (ML):

List of Common Algorithms of Supervised learning

Classification and Regression in supervised Learning:

Limitations of Machine Learning:

Data Augmentation in image Processing:

An eigenvector is a vector whose direction remains unchanged when a linear transformation is

A vector is then scaled by applying this transformation as . The above

We can rewrite equation (1) as follows:

where is the identity matrix of the same dimensions as .

Calculating the eigenvalues

Calculating the determinant gives:

To solve this quadratic equation in , we find the discriminant:

Calculating the first eigenvector

and solve the first equation as a function of , resulting in:

For this example, we arbitrarily choose , such that . Therefore, the

Written as a system of equations, this is equivalent to:

Solving the first equation as a function of resuls in:

We then arbitrarily choose , and find . Therefore, the eigenvector that

It is a kind of Supervised Learning which can be used to predict the value of a

Steps to calculate Linear Function

1. Assume Random values for m and b (slope and intercept)

You might also like