0% found this document useful (0 votes)

6 views

GettingStartedwithMachineLearningML-DataScience365

The article 'Getting Started with Machine Learning' introduces the concept of machine learning (ML) as a collection of algorithms that learn from data to make predictions, contrasting it with traditional programming. It discusses the importance of ML in solving complex problems, adapting to new data, and handling large-scale issues, while also outlining the types of ML, including supervised and unsupervised learning. The author emphasizes the need for practical implementation using programming languages like Python or R and invites feedback from readers.

Uploaded by

Syed Wasif

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

GettingStartedwithMachineLearningML-DataScience365

Uploaded by

Syed Wasif

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/339031674

Getting Started with Machine Learning (ML)

Article · February 2020

CITATION READS

1 3,747

1 author:

Rukshan Manorathna
University of Colombo
16 PUBLICATIONS 9 CITATIONS

SEE PROFILE

All content following this page was uploaded by Rukshan Manorathna on 04 February 2020.

The user has requested enhancement of the downloaded file.

Getting Started with Machine Learning (ML)
A disruptive technology which sweeps away traditional programming in certain cases…

Rukshan Pramoditha
Dec 25, 2019 · 9 min read

After a very long time since the first article Getting Started with Data Science was
published in Data Science 365, today, I meet you with a new topic in the field of Data
Science. If you haven’t read my first article yet, I recommend you to read it before reading
this one. Without further delay, welcome to Getting Started with Machine Learning (ML)!

Introduction
Machine Learning (ML) is a collection of algorithms & techniques used to build
systems that learn from data. These systems are then able to perform predictions by
finding patterns in data. Machine learning is a disruptive technology which sweeps away
traditional programming in certain cases. What is the difference between machine learning
& traditional programming?
In traditional programming, the data & the program produce the output. For example, when
performing some accounting tasks, the program takes in the data (sales records, inventory
lists, etc.) & calculates your profits or losses. It will also give some nice & fanciful charts
showing your sales performance.

Traditional programming is a manual process — meaning a person (programmer) creates the

program by manually formulating or coding the rules.

In machine learning, the data & the output produce the program. You take both data &
output & use them to derive a set of rules to make predictions. It means that you may use
this model to predict the most popular items that will sell next year.

In machine learning, the algorithm automatically formulates the rules from the data.

ML consists of the following disciplines:

• Scientific computing (e.g. Python or R programming)

• Mathematics
• Statistics

When using ML, a data scientist needs to know about the followings:

• Which method of machine learning will best help in completing the given task
• How to apply that method

The knowledge that how that method works is optional.

Why machine learning?
Consider an event of creating a spam e-mail filter program without using ML. The
programmer writes that program by following the steps in a certain order.

Step 1: Identify how spam e-mails look like. (Words: “debit card”, “free”, “for you”, etc.)

Step 2: Write an algorithm to detect the patterns that you’ve seen.

Step 3: Finally, you’d test the program & then make changes to the program until the results
are good enough.

After that, the software would flag emails as spam if a certain number of those patterns are
detected.

In your algorithm, you write rules in your program to detect the patterns. If this is a case
that requires a long list of rules to find the solution, it soon becomes difficult for a human to
accurately code the rules. You can use ML to effectively solve this problem.

What will happen if the e-mail senders change their e-mail templates so that a word like
“4U” now instead of “for you”? The program using traditional techniques would need to be
updated manually. On the other hand, a program using ML techniques will automatically
detect this change & will adapt to new data.
When should you use machine learning?
ML is not a solution for every type of problem. There are certain cases where solutions can
be developed without using ML techniques. For example, you don’t need ML if you can
determine a target value by using simple rules & computations. If you implement ML
techniques for such problems, they will be getting more complex & you will need more
computer power to get the solutions.

However, there are some situations where you need to use ML.

• Very complex problems for which there is no solution with a traditional

approach: You need to use a data-driven approach to get the solution. For example,
speech recognition: When you say “one” or “two”, the program should distinguish
the difference. You will need to develop an algorithm that measures sound.

• Non-stable environments: Machine learning software can adapt to new data.

• When you have a problem that requires many long lists of rules to find the
solution: When rules depend on too many factors, it soon becomes difficult for a
human to accurately code the rules. You can use ML to effectively solve this
problem.
• You cannot scale: ML solutions are effective at handling large-scale problems. For
example, a spam e-mail filter program: There are millions of emails which are needed
to check in a single day! It cannot be done manually.

A few of ML terminology
There is a long list of ML terminology. Here I list some terms which are mostly used. Others
will be given in relevant parts as we progress.

• In ML, a target is called a label. In statistics, a target is called a dependent

variable.
• A variable in statistics is called a feature in ML.
• A transformation in statistics is called a feature creation in ML.

Types of machine learning

There are 2 main types of Machine Learning which are based on their learning styles. They
are:

• Supervised Learning
• Unsupervised Learning

In addition to these types, there are other types as well:

• Semi-supervised Learning
• Reinforcement Learning
• Batch Learning
• Online Learning
• Instance-based Learning
• Model-based Learning
The criteria for the above classification are:

• According to the type & amount of human supervision needed during the
training: Supervised Learning, Unsupervised Learning, Semi-supervised Learning &
Reinforcement Learning are classified based on this criterion.
• If they can learn incrementally
• If they work simply by comparing new data points to find data points or can
detect new patterns in the data & then will build a model

Supervised vs unsupervised learning

Supervised Learning
In supervised learning, you supervise the learning process. You train the algorithm using
labelled data. You can correct your algorithm if it makes a mistake in giving you the answer
in the learning process. This can be compared to learning which takes place in the presence
of a supervisor or a teacher.

In supervised learning, both input & output variables are given. This can be explained
mathematically. You have input variables (called x) & an output variable (called y) & you
use an algorithm to learn the mapping function of from the input to the output, y=f(x). In
supervised learning, the goal is to determine the mapping function so well that when
you have new input data (x), you can predict the output for that data. If the mapping is
correct, the algorithm has successfully learned. Else, you make the necessary changes to the
algorithm so that it can learn correctly. In simple words, what we do in supervise learning is
that we use a labelled dataset to obtain a new label for unlabelled data.
There are two phases: Training Phase & Testing Phase. In the training phase, you take a
randomly selected specimen of geometric shapes (training data) & label them accurately.
Then you make a table of all the characteristics (features) of each shape. You feed this data
to the machine learning algorithm & it learns a model (called prediction model). In the
testing phase, you input a shape (test data) which was also labelled. The prediction model
which was created earlier will give the correct label for that shape. If the output is correct,
the algorithm has successfully learned. Else, you make the necessary changes to the
algorithm so that it can learn correctly.

Classification and Regression are two types of supervised learning. Supervised learning is
a highly accurate method. The main drawback is that classifying big data can be a real
challenge.

Unsupervised Learning
In unsupervised learning, you do not need to supervise the learning process. Instead, you
need to allow the model to work on its own to discover hidden patterns in data. This can be
compared to the learning process of a student who has textbooks and all the required
material to study but has no teacher to guide so that he will have to learn by himself.
In unsupervised learning, only input data will be given. That data does not have any sort of
labels. The goal is to find the hidden patterns or the underlying structure in the given
input data in order to learn about the data. Unsupervised learning helps you to find
features which can be useful for categorization. It can also help to detect anomalies and
defects in the data which can be taken care of by us.

Clustering and Association are two types of unsupervised learning. Unsupervised learning
is less accurate & computationally complex when compared to supervised learning.

As a summary, we can compare supervised & unsupervised learning methods as follows.

Machine learning algorithms

An algorithm is a set of rules that a machine follows to achieve a particular goal. These
rules are in a certain order. A learner or machine learning algorithm is a set of rules used
to learn a machine learning model from data. ML algorithms are given general guidelines
that define the model, along with data while classical algorithms are given exact & complete
rules to finish a task. An ML algorithm can accomplish its task when the model has been
adjusted with respect to the data. We have to fit the model on the data or the model has to
be trained on the data. ML algorithms are different from classical algorithms in a way that
they automatically learn from the data you provide.

Some useful definitions of commonly used terms related to machine learning algorithms are:

Dataset: A table with the data from which the machine learns. The dataset contains the
features and the target to predict.

Instance: A row in the dataset. Other names for “instance” are data point, observation.
An instance consists of the feature values x(i) & if known, the target outcome y(i).

Feature: An input used for the machine learning algorithm. A feature is a column in the
dataset. The matrix with all the features is called X for a single instance. The vector of a
single feature for all instances is x(j).

Target: The information that the machine learns to predict. In mathematical formulas,
the target is usually called y. In statistics, it is called a dependent variable.

Prediction: The target value that the machine learning model “guesses” based on the
given features.

Machine learning algorithms fall into two broad categories which are supervised learning
algorithms & unsupervised learning algorithms, although there are other categories such
as semi-supervised learning algorithms, reinforcement learning algorithms, etc. The
following chart shows some of the commonly used algorithms which are classified under
supervised & unsupervised. Note that some of them can be in both supervised &
unsupervised categories, although they are listed under one category.
What is next?
This is just an introductory article for ML. This article lays a good foundation for ML &
motivates you to learn more about ML. The next big part is to learn machine learning
algorithms. Learning theoretical parts of these algorithms is not just enough. We also want
to learn how to implement these algorithms using Python or R programming. Here I use
Python. (Selecting Python or R for doing Data Science & ML should be done by conducting
thorough research. I personally selected Python by comparing many factors between the
two languages.)

The next article will be logistic regression under classification algorithms in ML. An article
series about Python programming should also be written parallelly with the ML article
series.
One last key point
My success will not be possible without your feedback. So please don’t hesitate to give me
feedback. Write them in the comment section of this article or just drop a message at
[email protected].

Thank you for reading! Next time, I will meet you with another ML article. Goodbye for
now!

Written by: Rukshan Pramoditha

Data Science 365,

Bring data into actionable insights.

View publication stats

Tie Computer Science Form Six
No ratings yet
Tie Computer Science Form Six
401 pages
Machine Learning PPT For Students
70% (10)
Machine Learning PPT For Students
18 pages
Security With AI and Machine Learning PDF
No ratings yet
Security With AI and Machine Learning PDF
71 pages
Test Dump MD100
No ratings yet
Test Dump MD100
378 pages
ISR4331-SEC/K9 Datasheet: Quick Specs
No ratings yet
ISR4331-SEC/K9 Datasheet: Quick Specs
6 pages
c01 Cat v5r18
No ratings yet
c01 Cat v5r18
34 pages
1_AML _Manish
No ratings yet
1_AML _Manish
72 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Overview of machine learning
No ratings yet
Overview of machine learning
60 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
6 pages
Module 1
No ratings yet
Module 1
34 pages
22wj8a6630ml ppt
No ratings yet
22wj8a6630ml ppt
12 pages
Machine Learning: Bilal Khan
No ratings yet
Machine Learning: Bilal Khan
26 pages
ETI microproject
No ratings yet
ETI microproject
11 pages
Introduction
No ratings yet
Introduction
18 pages
Unit-1 Part-1 Material
No ratings yet
Unit-1 Part-1 Material
45 pages
A Beginner's Guide To Machine Learning Fundamentals (Compressed)
No ratings yet
A Beginner's Guide To Machine Learning Fundamentals (Compressed)
10 pages
An Enlightenment To Machine Learning
100% (1)
An Enlightenment To Machine Learning
16 pages
Overview of Machine Learning PDF
100% (1)
Overview of Machine Learning PDF
57 pages
Machine Learning With Python Programming: - Presentation by Uplatz - Contact Us: - Email: - Phone
No ratings yet
Machine Learning With Python Programming: - Presentation by Uplatz - Contact Us: - Email: - Phone
22 pages
Unit-1
No ratings yet
Unit-1
55 pages
UNIT-1-Intro of ML
No ratings yet
UNIT-1-Intro of ML
33 pages
DAIOT UNIT 5 (1) Own
No ratings yet
DAIOT UNIT 5 (1) Own
13 pages
Machine Learning: Presentation
100% (2)
Machine Learning: Presentation
23 pages
ML
No ratings yet
ML
19 pages
Big-Data Unit-3
100% (1)
Big-Data Unit-3
54 pages
Basics of Machine Learning
No ratings yet
Basics of Machine Learning
20 pages
Introduction to Machine Learning Basics
No ratings yet
Introduction to Machine Learning Basics
12 pages
Study On Machine Learning Research Paper
No ratings yet
Study On Machine Learning Research Paper
17 pages
5_6095834670757318868
No ratings yet
5_6095834670757318868
62 pages
ML-Unit 1 Merged
No ratings yet
ML-Unit 1 Merged
151 pages
ML-Unit 1
No ratings yet
ML-Unit 1
43 pages
Unit 3 - DS - 1st year
No ratings yet
Unit 3 - DS - 1st year
5 pages
unit V
No ratings yet
unit V
67 pages
Intro Machine Learning
No ratings yet
Intro Machine Learning
4 pages
Elements of Machine Learning
No ratings yet
Elements of Machine Learning
116 pages
Unit 1
No ratings yet
Unit 1
62 pages
Fundamentals of ML 1
No ratings yet
Fundamentals of ML 1
38 pages
MLT UINT1
No ratings yet
MLT UINT1
26 pages
ML_Module_4
No ratings yet
ML_Module_4
25 pages
ai faheem
No ratings yet
ai faheem
16 pages
Unit-1 new
No ratings yet
Unit-1 new
48 pages
Machine Learning- UNIT I (1)
No ratings yet
Machine Learning- UNIT I (1)
70 pages
Introduction To ML
No ratings yet
Introduction To ML
3 pages
An Enlightenment To Machine Learning - Resp
No ratings yet
An Enlightenment To Machine Learning - Resp
22 pages
Null 5
No ratings yet
Null 5
16 pages
Introduction to ML Unit-1 PPT
No ratings yet
Introduction to ML Unit-1 PPT
90 pages
IDS Unit - 3
No ratings yet
IDS Unit - 3
4 pages
Introduction to ML
No ratings yet
Introduction to ML
17 pages
Aiba-Module 1 Machine Learning
No ratings yet
Aiba-Module 1 Machine Learning
23 pages
u 1
No ratings yet
u 1
12 pages
ML Chapter 1
No ratings yet
ML Chapter 1
37 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
78 pages
ENG6500 1 IntroductionToMLDL Part1
No ratings yet
ENG6500 1 IntroductionToMLDL Part1
63 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
68 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
4 pages
Aws ML
No ratings yet
Aws ML
125 pages
Machinelearning Unit-1
No ratings yet
Machinelearning Unit-1
29 pages
Unit 1 - Machine Learning - NOTES1 - ML
No ratings yet
Unit 1 - Machine Learning - NOTES1 - ML
52 pages
Machine Learning: Louis Fippo Fitime
No ratings yet
Machine Learning: Louis Fippo Fitime
37 pages
MLP Unit-I
No ratings yet
MLP Unit-I
62 pages
Firoz Topic 0 Ppt
No ratings yet
Firoz Topic 0 Ppt
24 pages
Ch7 Introduction to Machine Learning
No ratings yet
Ch7 Introduction to Machine Learning
29 pages
Mastering Machine Learning: A Comprehensive Guide to Success
From Everand
Mastering Machine Learning: A Comprehensive Guide to Success
Rick Spair
No ratings yet
Javascript Cheatsheet-Coders_Section
No ratings yet
Javascript Cheatsheet-Coders_Section
13 pages
Pandas Top 30 Layman Guide Final
No ratings yet
Pandas Top 30 Layman Guide Final
3 pages
Pandas_Top_30_With_Code_Clean
No ratings yet
Pandas_Top_30_With_Code_Clean
3 pages
Week 11 Expository (Descriptive) Paragraph
No ratings yet
Week 11 Expository (Descriptive) Paragraph
10 pages
Week 11 Compare and Contrast Paragraph
No ratings yet
Week 11 Compare and Contrast Paragraph
13 pages
Data Independence
No ratings yet
Data Independence
3 pages
Alarm Annunciator
No ratings yet
Alarm Annunciator
10 pages
Simulation and Modeling
No ratings yet
Simulation and Modeling
15 pages
Experience Cloud Consultant 0
No ratings yet
Experience Cloud Consultant 0
15 pages
Free Instagram Followers Generator (No Human Verification/No Survey/Real Legit Giveaway Codes)
No ratings yet
Free Instagram Followers Generator (No Human Verification/No Survey/Real Legit Giveaway Codes)
2 pages
WARM GPT python script
No ratings yet
WARM GPT python script
5 pages
Role of AI in Cyber Security Through Anomaly Detection and Predictive Analysis
No ratings yet
Role of AI in Cyber Security Through Anomaly Detection and Predictive Analysis
4 pages
How To Insert Sim Card Cellphone
No ratings yet
How To Insert Sim Card Cellphone
2 pages
Data Management
No ratings yet
Data Management
8 pages
Computer Science Paper 1 HL Markscheme Nov 2017
No ratings yet
Computer Science Paper 1 HL Markscheme Nov 2017
16 pages
Interaction Design Lab - Week 01
No ratings yet
Interaction Design Lab - Week 01
19 pages
rcms2802 120lfe BL
No ratings yet
rcms2802 120lfe BL
2 pages
Formal Definition of Turing Machine
No ratings yet
Formal Definition of Turing Machine
5 pages
IEM-ICDC_2025_poster
No ratings yet
IEM-ICDC_2025_poster
1 page
10 Holiday Homework
No ratings yet
10 Holiday Homework
9 pages
( OG) Script Mobile Legends (INSTANT LEVEL 300)
No ratings yet
( OG) Script Mobile Legends (INSTANT LEVEL 300)
3 pages
Google Services
No ratings yet
Google Services
7 pages
Oracle General Ledger Interview Questions
No ratings yet
Oracle General Ledger Interview Questions
3 pages
Form 2 Final Exam 2015
100% (1)
Form 2 Final Exam 2015
11 pages
Curriculum Vitae: Career Objective
No ratings yet
Curriculum Vitae: Career Objective
3 pages
Affiliate Marketing Tools BloggingCage
100% (1)
Affiliate Marketing Tools BloggingCage
17 pages
Bakson Homeopathy
No ratings yet
Bakson Homeopathy
2 pages
MS Access Basics PDF
100% (1)
MS Access Basics PDF
346 pages
Introduction To Git: Dr. Noman Islam
No ratings yet
Introduction To Git: Dr. Noman Islam
36 pages
HashiCorp Test-King Terraform-Associate v2021-02-02 by Wangping 20q
No ratings yet
HashiCorp Test-King Terraform-Associate v2021-02-02 by Wangping 20q
11 pages

Uploaded by

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Getting Started with Machine Learning (ML)

Article · February 2020

The user has requested enhancement of the downloaded file.

Traditional programming is a manual process — meaning a person (programmer) creates the

ML consists of the following disciplines:

• Scientific computing (e.g. Python or R programming)

The knowledge that how that method works is optional.

Step 2: Write an algorithm to detect the patterns that you’ve seen.

• Very complex problems for which there is no solution with a traditional

• Non-stable environments: Machine learning software can adapt to new data.

• In ML, a target is called a label. In statistics, a target is called a dependent

Types of machine learning

In addition to these types, there are other types as well:

Supervised vs unsupervised learning

As a summary, we can compare supervised & unsupervised learning methods as follows.

Machine learning algorithms

Written by: Rukshan Pramoditha

Data Science 365,

View publication stats

You might also like