Lec 1,2
Navneet Goyal
Department of Computer Science, BITS-Pilani, Pilani Campus,
India
PC - GeeksforGeeks
Introduction
• Machine Learning is the most visible face of AI!
• Now, Deep Learning has become the most visible face of
Machine Learning!
• Some new developments in AI/ML
– Privacy preserving ML/Federated Learning
– Edge AI/Private AI
– Explainable AI (XAI)
– Multimodal Learning
– Bias in AI Systems
• Some new application domains
– Biodiversity/Bioacoustics
– Earth Observation
– Geo Tagging
– Social Good
– Autonomous Cars
Autonomous Cars
• What role does Machine Learning have to play?
Bias In AI
• A child wearing sunglasses is labeled as a “failure,
loser, nonstarter, unsuccessful person.” This is just
one of the many systemic biases exposed by
ImageNet Roulette, an art project that applies labels
to user submitted photos by sourcing its
identification system from the original ImageNet
database.
• ImageNet, which has been one of the instrumental
datasets for advancing AI, has deleted more than half
a million images from its “person” category since this
instance was reported in late 2019.
Bias In AI
• Earlier in 2019, researchers showed how
Facebook’s ad-serving algorithm for deciding who
is shown a given ad exhibits discrimination based
on race, gender, and religion of users.
• There have been reports of commercial facial-
recognition software (notably Amazon’s
Rekognition, among others) being biased against
darker-skinned women.
Introduction
• What exactly is Machine Learning?
• Why do we need it?
• If we need it at all, how can we make machines learn?
– in the same way humans (animals) learn?
• What is Deep Learning?
Introduction: Early Days of ML
• During World War II, the noted British computer scientist Alan
Turing worked to crack the ‘Enigma’ code which was
used by German forces to send messages securely.
• Turing and his team created the Bombe machine that
was used to decipher Enigma’s messages.
• The Enigma and Bombe Machines laid the foundations
for Machine Learning.
• Turing Test - a machine that could converse with
humans without the humans knowing that it is a
machine would win the “imitation game” and could be
said to be “intelligent”.
Recent Developments in ML
• Google open sourced TensorFlow
• Microsoft open sourced CNTK (Cognitive Toolkit)
• Baidu open sourced its Deep Learning Platform - PaddlePaddle
• Amazon will back MXNet (Apache) – a Deep Learning Framework
in their new AWS ML platform
• Facebook supporting the development of 2 Deep Learning
frameworks: Torch (Open Source ML Lib. – Sci. Comp. framework
based on Lua PL) & Caffe (Deep Learning framework by Berkeley
AI Research – BAIR)
• Google is also supporting Keras (NN API which can run on
TensorFlow, CNTK, Theano)
• WaveNet’s audio generation using Deep Learning
– Outperforms Google’s TTS (Text 2 Speech)
• Lip reading – application of video recognition
• Machine Translation
Source: Article in Forbes – Best ML breakthroughs of 2016
Author - Xavier Amatriain, VP Engg. @ Quora
Former Netflix Recommendations Researcher & Professor
Recent Developments in ML
Deep Learning has taken ML to the next level
HPC & AI
Recent Developments in ML: AlphaGo Zero
• Mastering the game of Go without human knowledge*
– Much progress towards AI has been made using
supervised learning systems that are trained to replicate
the decisions of human experts
– Expert data sets are often expensive, unreliable or simply
unavailable.
– Even when reliable data sets are available, they may
impose a ceiling on the performance of systems trained
in this manner
– Reinforcement Learning systems learn from their own
experience, allowing them to exceed human capabilities
– AlphaGo was the first program to achieve superhuman
performance in Go
Disappointed??
Prerequisites for ML
Introduction
List down the tasks which we humans can do better than
machines!!
Introduction
Let’s look at these incredible things that humans can do:
1. Identifying a song by just listening to a very small part of it
2. Identifying a movie by looking at a very short clip
3. Identifying a person
4. Identifying a person even when you see them again after many years
5. Recollecting memories
6. Identifying a person from a distance
7. Identifying a person by just listening to his/her voice
8. Identifying a person by his chat/message signature
9. Our own GPS!
10. Identifying spam mails
11. Object identification in images/videos
12. Image/document tagging
13. Suspicious activity or person
14. Medical diagnosis
15. Handwriting recognition
16. Conversation/discussion
17. …
Introduction
Ever wondered how we can do all this with such accuracy
and efficiency?
1. Pattern recognition
2. Information retrieval
Human Brain!!
Neurons!!
• Course Objective
– Study of approaches and algorithms that can make a
machine learn
Introduction
• Machine Learning
– Subarea of AI that is concerned with
algorithms/programs that can make a machine learn
– Improve automatically with experience
– For example: doctors learning from experience
– Faculty learning how to control the class and be effective
– We all learn from experience
Definition:
A computer program is said to learn from experience
E with respect to some class of tasks T and
performance measure P, if its performance at tasks in
T, as measured by P, improves with experience E.
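The definition can be made concrete with a toy experiment. The following is only a minimal sketch under made-up assumptions: the task T is separating two classes of 1-D points, the experience E is a set of labelled training points, and the performance measure P is accuracy on a held-out test set. The function names (`make_examples`, `learn_threshold`, `accuracy`) and the synthetic data are hypothetical and not part of the lecture.

```python
import random

def make_examples(n, rng):
    """Draw n labelled points: class 0 centred at 0.0, class 1 centred at 4.0."""
    examples = []
    for _ in range(n):
        label = rng.randint(0, 1)
        x = rng.gauss(4.0 * label, 1.0)
        examples.append((x, label))
    return examples

def learn_threshold(train):
    """Use experience E: put the threshold midway between the two class means."""
    means = []
    for label in (0, 1):
        xs = [x for x, y in train if y == label]
        # Fall back to 0.0 if a class never appears in the training set.
        means.append(sum(xs) / len(xs) if xs else 0.0)
    return (means[0] + means[1]) / 2.0

def accuracy(threshold, test):
    """Performance measure P: fraction of test points classified correctly."""
    correct = sum((x > threshold) == bool(y) for x, y in test)
    return correct / len(test)

rng = random.Random(0)
test_set = make_examples(2000, rng)
# Print P for growing amounts of experience E.
for n in (2, 10, 100, 1000):
    train_set = make_examples(n, rng)
    print(n, round(accuracy(learn_threshold(train_set), test_set), 3))
```

With very few training points the learned threshold can be far off; with more experience it typically settles near the midpoint between the classes and the accuracy stabilises at a high value.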
What is Machine Learning?
• To solve a problem, we need an algorithm!
• For example: sorting a list of numbers
• Input: list of numbers
• Output: sorted list of numbers
• For some tasks, like filtering spam mails
• Input: an email
• Output: Y/N
• We do not know how to transform Input to Output
• Definition of spam changes with time and from one
individual to another
• What to DO?
Reference: E Alpaydin’s Machine Learning Book, 2010 (MIT Press)
What is Machine Learning?
• Collect lots of emails (both genuine and spam)
• “Learn” what constitutes a spam mail (or for that
matter a genuine mail)
• Learn from DATA!!
• For many similar problems, we may not have
algorithm(s), but we do have example data (called
Training Data)
• Ability to process training data has been made
possible by advances in computer technology
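As an illustration of learning from data instead of hand-writing rules, here is a minimal sketch of a spam filter. It assumes a naive Bayes word-count model with add-one smoothing, which is one common choice and not necessarily the method intended in these slides. The six training mails are made up, and the helper names `train` and `classify` are hypothetical.

```python
import math
from collections import Counter

def train(examples):
    """Count word occurrences per class from (text, label) pairs."""
    word_counts = {"spam": Counter(), "ham": Counter()}
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts[label].update(text.lower().split())
    return word_counts, class_counts

def classify(text, word_counts, class_counts):
    """Return the label with the higher naive Bayes log-probability."""
    vocab = set(word_counts["spam"]) | set(word_counts["ham"])
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in ("spam", "ham"):
        total_words = sum(word_counts[label].values())
        score = math.log(class_counts[label] / total_docs)
        for word in text.lower().split():
            # Add-one smoothing keeps unseen words from zeroing the probability.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Made-up toy training data, purely for illustration.
TRAINING_DATA = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("cheap loans win big", "spam"),
    ("meeting agenda for tomorrow", "ham"),
    ("lecture notes for tomorrow", "ham"),
    ("project meeting notes attached", "ham"),
]

model = train(TRAINING_DATA)
print(classify("claim your free money", *model))   # spam
print(classify("notes for the meeting", *model))   # ham
```

Nothing in the code states what spam is; the decision comes entirely from the word counts in the training data, so retraining on new mails updates the filter as the notion of spam shifts.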
What is Machine Learning?
• Face Recognition!!!
• We humans are so good at it!!!
• Ever thought how we do it, despite
– Different light conditions, pose, hair style, make up,
glasses, ageing etc..
• Since we do not know how we do it, we cannot
write a program to do it
• ML is about making inference from a sample
Machine Learning Applications
• What kind of data would I require for learning?
– Credit card transactions
– Face Recognition
– Spam filter
– Handwriting/Character Recognition
Handwriting Recognition
• Task T
– recognizing and classifying handwritten words within images
• Performance measure P
– percent of words correctly classified
• Training experience E
– a database of handwritten words with given classifications
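The performance measure P above is simple to compute once predictions are available. A minimal sketch, assuming the classifier's outputs and the database labels are given as parallel lists of strings; the function name `percent_correct` and the sample words are hypothetical.

```python
def percent_correct(predicted, actual):
    """Performance measure P: percentage of words classified correctly."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("need at least one labelled example")
    hits = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * hits / len(actual)

# Hypothetical classifier output versus the labels stored in the database (E).
actual_words = ["cat", "dog", "tree", "house"]
predicted_words = ["cat", "dog", "free", "house"]
print(percent_correct(predicted_words, actual_words))  # 75.0
```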
Handwriting Recognition
Pattern Recognition Example
• Handwritten Digit Recognition
Reference: Christopher M Bishop: Pattern Recognition & Machine Learning, 2006 Springer
Curse of Dimensionality
Source: https://www.opendatascience.com/blog/curse-of-dimensionality-explained/
Curse of Dimensionality
o As the number of dimensions tends to infinity, the
volume of a unit hypersphere tends to zero!!
(think about it!!)
o Data becomes increasingly sparse with increasing
dimensions
o As the number of dimensions in a dataset increases,
distance measures become increasingly meaningless
o In very high dimensions, points are almost equidistant from each
other (the relative difference between the nearest and farthest
distances tends to zero)
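Both effects can be checked numerically. The sketch below uses the standard closed form for the volume of a radius-1 ball in n dimensions, V_n = pi^(n/2) / Gamma(n/2 + 1), and measures the relative contrast (d_max - d_min) / d_min of Euclidean distances from one random query point to a set of random points. The uniform data in the unit cube, the sample size of 200 and the function names are assumptions made for illustration only.

```python
import math
import random

def unit_ball_volume(n):
    """Volume of the radius-1 ball in n dimensions: pi^(n/2) / Gamma(n/2 + 1)."""
    # Work in log space so large n underflows to 0.0 instead of overflowing.
    return math.exp((n / 2) * math.log(math.pi) - math.lgamma(n / 2 + 1))

def relative_contrast(n_points, dim, rng):
    """(d_max - d_min) / d_min over distances from one query point to the rest."""
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))
    return (max(dists) - min(dists)) / min(dists)

rng = random.Random(0)
for dim in (2, 5, 10, 20, 100, 1000):
    print(dim, unit_ball_volume(dim), relative_contrast(200, dim, rng))
```

The volume first rises (it peaks at n = 5) and then falls rapidly towards zero, while the relative contrast shrinks as the dimension grows, which is the sense in which nearest and farthest neighbours become hard to tell apart.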
Different Aspects of the “Curse”*
o Optimization Problem
o Concentration Effect of Lp-Norms
o Irrelevant Attributes
o Correlated Attributes
o Intrinsic dimensionality < Embedding dimensionality
o Varying relative volume of an ε-hypersphere
[Figure: axes Var2 and Constructed Feature 1. Find a function F(x) to map the data to a different space.]
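One way to picture such a mapping is sketched below. It is not taken from the original figure: the mapping F(x) = (x, x^2), the toy data and the name `construct_feature` are all hypothetical, chosen only to show how a constructed feature can make classes separable by a single threshold.

```python
def construct_feature(x):
    """Hypothetical mapping F(x) = (x, x**2): adds a squared feature."""
    return (x, x * x)

# Toy 1-D data: class 1 lies inside [-1, 1], class 0 outside, so no single
# threshold on x separates the two classes.
points = [-3.0, -2.0, -0.5, 0.0, 0.5, 2.0, 3.0]
labels = [0, 0, 1, 1, 1, 0, 0]

mapped = [construct_feature(x) for x in points]

# In the mapped space a single threshold on the constructed feature works.
predictions = [1 if feature < 1.0 else 0 for _, feature in mapped]
print(predictions == labels)  # True
```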
Getting Started…