Hashing

The document discusses various searching algorithms, including Sequential Search, Binary Search, and Hashing, highlighting their efficiency and applications. It explains the concept of hash tables, their operations, and methods for handling collisions, such as chaining and linear probing. Additionally, it covers the characteristics of good hash functions and provides examples of hash function methods and their implementations.

Data Structure Algorithms and

Applications
CT-157

Course Instructor
Engr. Vanesh Kumar
SEARCHING
Introduction
 Searching is the process of finding an element within a list of elements, whether the list is ordered or unordered.
 Retrieval: a successful search.
 A table of records in which a key is used for retrieval is often called a SEARCH TABLE or DICTIONARY.
 Internal searching – all data is held in main memory.
 External searching – most data is kept in auxiliary memory.
Searching Methods
 Sequential or Linear Searching.

 Binary Search.

 Hashing.
Sequential Search
 Searches ordered or unordered tables sequentially until the desired record is found or the end of the table is reached.
 It is simple and works well for small arrays.
 Mostly used when data is not sorted.
 Efficiency:
  Best – O(1)
  Average – O(n/2)
  Worst – O(n)
 Less efficient when the array size is large.
 Not efficient on sorted arrays.
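As a sketch, sequential search can be written in a few lines of Python (the list and keys below are made-up example data):

```python
# A minimal sketch of sequential (linear) search over a plain Python list.
def sequential_search(items, target):
    """Return the index of target in items, or -1 if it is absent."""
    for i, value in enumerate(items):
        if value == target:
            return i  # best case: O(1), when the target is first
    return -1         # worst case: O(n), when the target is last or absent

print(sequential_search([7, 3, 9, 4], 9))   # found at index 2
print(sequential_search([7, 3, 9, 4], 5))   # not found: -1
```

Note that the loop never assumes the data is sorted, which is why the method applies to unordered tables.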
Binary Search
 This technique works on sorted arrays and can be applied only to sorted arrays.
 Not applicable to linked lists, which do not support O(1) random access.
 Requires fewer comparisons than linear search.
 Efficiency: O(log₂ n).
 Logic behind the technique:

        First Half                 Second Half
First Value         Mid Value                  Last Value
low = 0             mid = (low + high)/2       high = n - 1
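The halving logic above can be sketched directly in Python (the sorted list below is made-up example data):

```python
# A minimal sketch of binary search on a sorted Python list.
def binary_search(items, target):
    """Return the index of target in the sorted list items, or -1 if absent."""
    low, high = 0, len(items) - 1
    while low <= high:
        mid = (low + high) // 2
        if items[mid] == target:
            return mid
        elif items[mid] < target:
            low = mid + 1   # target can only be in the second half
        else:
            high = mid - 1  # target can only be in the first half
    return -1

print(binary_search([3, 4, 7, 9], 7))   # index 2
print(binary_search([3, 4, 7, 9], 5))   # -1
```

Each iteration halves the search interval, giving the O(log₂ n) bound.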


HASHING
Introduction
 The search operation on a sorted array using the binary search method takes O(log₂ n).
 We can improve the search time by using an approach called hashing.
 Hashing is usually used to implement dictionaries.
 Many applications need to support ONLY the operations INSERT, SEARCH, and DELETE. These are known as “dictionary” operations.
 Hashing can support these operations in O(1) time on average and is quite fast in practice.
Dictionary
 A dictionary is a collection of elements
 Each element has a field called key
– (key, value)
 Every key is usually distinct.
 Typical dictionary operations are:
– Insert a pair into the dictionary
– Search the pair with a specified key
– Delete the pair with a specified key
 Example: a collection of student records in a class
– (key, value) = (student-number, a list of assignment and exam marks)
– All keys are distinct
Dictionary as an Ordered Linear List
 L = (e1, e2, e3, …, en)
 Each ei is a pair (key, value)
 Array or chain representation
– unsorted array: O(n) search time
– sorted array: O(log n) search time
– unsorted chain: O(n) search time
– sorted chain: O(n) search time
Hash Table
 A hash table is a data structure that stores elements and
allows insertions, lookups, and deletions to be performed in
𝑂(1) time.
 A hash table is an alternative method for representing a
dictionary
 In a hash table, a hash function is used to map keys into
positions in a table. This act is called hashing
 Hash Table Operations
– Search: compute f(k) and see if a pair exists
– Insert: compute f(k) and place it in that position
– Delete: compute f(k) and delete the pair in that position
 In the ideal situation, a hash table search, insert, or delete takes O(1) time.
Why we need Hash Tables
 Internet routers are a good example of why hash tables are required.
 A router table (especially in the routers in the backbone networks of internet operators) may contain hundreds of thousands or millions of entries.
 When a packet has to be routed to a specific IP address, the router has to determine the best route by querying the router table in an efficient manner. Hash tables are used as an efficient lookup structure, having the IP address as the key and the path that should be followed for that address as the value.
How Does it Work
 The table part is just an ordinary array; it is the hash that we are interested in.
 The hash is a function that transforms a key into an address, or index, of the array (table) where the record will be stored. If the size of the table is N, then the integer will be in the range 0 to N-1. The integer is used as an index into the array. Thus, in essence, the key itself indexes the array.
 If h is a hash function and k is a key, then h(k) is called the hash of the key and is the index at which a record with the key k should be placed.
 The hash function generates this address by performing some simple arithmetic or logical operations on the key.
Ideal Hashing Example
 Pairs are: (22,a), (33,c), (3,d), (72,e), (85,f) — (key, value) pairs
 Hash table is ht[0:7], m = 8 (where m is the number of positions in the hash table)
 Hash function h is k % m = k % 8
 Where are the pairs stored?

  [0]     [1]    [2]    [3]    [4]    [5]     [6]    [7]
 (72,e)  (33,c)        (3,d)        (85,f)  (22,a)
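The placement above can be reproduced with a short Python sketch of the same example:

```python
# The ideal-hashing example: m = 8 positions, h(k) = k % 8, no collisions.
pairs = [(22, 'a'), (33, 'c'), (3, 'd'), (72, 'e'), (85, 'f')]
m = 8
ht = [None] * m                 # ht[0:7], all positions initially empty

for key, value in pairs:
    ht[key % m] = (key, value)  # each key hashes to a distinct position

print(ht)
# (72,e) at index 0, (33,c) at 1, (3,d) at 3, (85,f) at 5, (22,a) at 6
```

Because all five keys hash to distinct indexes, this is the ideal case: every operation is a single array access.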
Hashing Function Methods
(Hashing Methods)
 Division Hash Method
 The key k is divided by some number m and the remainder is used as the hash address of k:
 h(k) = k mod m
 This gives indexes in the range 0 to m-1, so the hash table should be of size m.
 This is an example of a uniform hash function if the value of m is chosen carefully.
 Generally, a prime number is the best choice, as it will spread the keys evenly.
 A uniform hash function is designed to distribute the keys roughly evenly into the available positions within the array (or hash table).
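A minimal sketch of the division method, assuming a table size of m = 13 (a prime chosen for illustration, not taken from the slides):

```python
# Division hash method: h(k) = k mod m, with m an assumed prime table size.
def division_hash(k, m=13):
    return k % m  # always in the range 0..m-1

print(division_hash(2501))  # 2501 mod 13 = 5
```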
Hashing Function Methods
 The Folding Method
 The key k is partitioned into a number of parts, each of which has the same length as the required address, with the possible exception of the last part.
 The parts are then added together, ignoring the final carry, to form an address.
 Example: If key = 356942781 is to be transformed into a three-digit address:
 P1 = 356, P2 = 942, P3 = 781 are added to yield 2079; ignoring the final carry gives 079.
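The folding steps can be sketched in Python, reproducing the 079 result from the example:

```python
# Folding method: split the key into address-sized chunks, add them,
# and drop the final carry by keeping only the last `digits` digits.
def folding_hash(key, digits=3):
    s = str(key)
    parts = [int(s[i:i + digits]) for i in range(0, len(s), digits)]
    return sum(parts) % (10 ** digits)

print(folding_hash(356942781))  # 356 + 942 + 781 = 2079 -> address 79 (i.e. 079)
```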
Hashing Function Methods
 The Mid-Square Method
 The key k is multiplied by itself and the address is obtained by selecting an appropriate number of digits from the middle of the square.
 The number of digits selected depends on the size of the table.
 Example: If key = 123456 is to be transformed:
 (123456)² = 15241383936
 If a three-digit address is required, digit positions 5 to 7 could be chosen, giving address 138.
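The mid-square selection can be sketched as string slicing on the square, using the digit positions from the example:

```python
# Mid-square method: square the key, then select `width` digits starting
# at (1-indexed) digit position `start` of the square.
def mid_square_hash(key, start=5, width=3):
    square = str(key * key)
    return int(square[start - 1:start - 1 + width])

print(mid_square_hash(123456))  # 123456^2 = 15241383936 -> digits 5..7 = 138
```

A real implementation would normally pick positions relative to the length of the square; the fixed positions here simply mirror the worked example.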
Hashing a string key
 Table size: [0..99]
 A..Z ---> 1, 2, …, 26
 0..9 ---> 27, …, 36
 Key: CS1 ---> 3, 19, 28 (concatenated) = 31,928
 (31,928)² = 1,019,397,184 – 10 digits
 Extract the middle 2 digits (5th and 6th), as the table size is 0..99.
 This gives 39, so H(CS1) = 39.
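The whole string scheme — encode, concatenate, then mid-square — can be sketched as:

```python
# String-key hashing as in the slides: A..Z -> 1..26, digits 0..9 -> 27..36,
# concatenate the codes, square, and take the middle 2 digits (table 0..99).
def char_code(c):
    if c.isalpha():
        return ord(c.upper()) - ord('A') + 1   # A..Z -> 1..26
    return int(c) + 27                         # 0..9 -> 27..36

def hash_string(key):
    number = int(''.join(str(char_code(c)) for c in key))  # CS1 -> 31928
    square = str(number * number)                          # 31928^2 = 1019397184
    mid = len(square) // 2
    return int(square[mid - 1:mid + 1])                    # middle 2 digits

print(hash_string("CS1"))  # 39
```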
Characteristics of a Good Hash Function
 The hash value is fully determined by the data being hashed.
 The hash function uses all the input data.
 The hash function "uniformly" distributes the data across the entire set of possible hash values.
 The hash function generates very different hash values for similar strings.
Hash Function Examples
Let h(k) = k % 15. Then,
if k =   25  129  35  2501  47  36
h(k) =   10    9   5    11   2   6

Storing the keys in the array is straightforward:

 0  1   2  3  4   5   6  7  8    9   10    11  12 13 14
 _  _  47  _  _  35  36  _  _  129   25  2501   _  _  _

Thus, delete and find can be done in O(1), and also insert, except…
Hash Function
What happens when you try to insert k = 65?
k = 65
h(k) = 65 % 15 = 5

 0  1   2  3  4   5   6  7  8    9   10    11  12 13 14
 _  _  47  _  _  35  36  _  _  129   25  2501   _  _  _
               65(?)

This is called a collision.
Handling Collisions
 Chaining (Hashing with Chaining)

Open Addressing
– Linear Probing
Handling Collisions

Chaining
Separate Chaining
Let each array element be the head of a chain.

 0  1   2  3  4   5   6  7  8    9   10    11  12 13 14
        47       65  36       129   25  2501
                  |
                 35

Where would you store: 29, 16, 14, 99, 127?


Separate Chaining
Let each array element be the head of a chain:

Where would you store: 29, 16, 14, 99, 127?
[29%15=14, 16%15=1, 14%15=14, 99%15=9, 127%15=7]

 0   1   2  3  4   5   6    7  8    9   10    11  12 13  14
    16  47       65  36  127      99   25  2501         14
                  |                |                     |
                 35              129                    29

New keys go at the front of the relevant chain.
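The chained table above can be sketched with Python lists standing in for the chains (a production table would use linked nodes, as the slides describe):

```python
# A minimal sketch of a hash table with separate chaining, h(k) = k % 15.
class ChainedHashTable:
    def __init__(self, size=15):
        self.table = [[] for _ in range(size)]  # one (initially empty) chain per slot

    def insert(self, key):
        chain = self.table[key % len(self.table)]
        chain.insert(0, key)        # new keys go at the front of the chain

    def search(self, key):
        return key in self.table[key % len(self.table)]  # scan one chain only

ht = ChainedHashTable()
for k in [25, 129, 35, 2501, 47, 36, 65, 29, 16, 14, 99, 127]:
    ht.insert(k)
print(ht.table[5])   # [65, 35] -- both hash to 5; 65 was inserted later
print(ht.search(99)) # True
```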


Separate Chaining: Disadvantages
• Parts of the array might never be used.
• As chains get longer, search time increases to O(n) in the worst case.
• Constructing new chain nodes is relatively expensive.
• Is there a way to use the "unused" space in the array instead of using chains to make more space?
Handling Collisions

Linear Probing
Linear Probing
Let key k be stored in element h(k) = t of the array.

 0  1   2  3  4   5   6  7  8    9   10    11  12 13 14
        47       35  36       129   25  2501
               65(?)

What do you do in case of a collision?

If the hash table is not full, attempt to store the key in the next array element (in this case (t+1)%N, (t+2)%N, (t+3)%N, …) until you find an empty slot.
Linear Probing
Where do you store 65? [Here N is 15.]
65 % 15 = 5
 0  1   2  3  4   5   6   7  8    9   10    11  12 13 14
        47       35  36  65     129   25  2501
                  ^   ^   ^
               attempts

Where would you store: 29?


Linear Probing
If the hash table is not full, attempt to store the key in array elements (t+1)%N, (t+2)%N, …
[29%15=14]
 0  1   2  3  4   5   6   7  8    9   10    11  12 13  14
        47       35  36  65     129   25  2501         29
                                                        ^
                                                   attempts

Where would you store: 16?


Linear Probing

• If the hash table is not full, attempt to store the key in array elements (t+1)%N, (t+2)%N, …
• [16%15=1]
 0   1   2  3  4   5   6   7  8    9   10    11  12 13  14
    16  47       35  36  65      129   25  2501         29

Where would you store: 14?
[14%15=14]
Linear Probing

• If the hash table is not full, attempt to store the key in array elements (t+1)%N, (t+2)%N, …
• [14%15=14]
 0   1   2  3  4   5   6   7  8    9   10    11  12 13  14
14  16  47       35  36  65      129   25  2501         29
 ^                                                       ^
                  attempts (14 was taken, so wrap to 0)

Where would you store: 99?


Linear Probing

• If the hash table is not full, attempt to store the key in array elements (t+1)%N, (t+2)%N, …
• [99%15=9]
 0   1   2  3  4   5   6   7  8    9   10    11   12 13  14
14  16  47       35  36  65      129   25  2501   99     29
                                   ^    ^     ^    ^
                                       attempts

Where would you store: 127?


Linear Probing

• If the hash table is not full, attempt to store the key in array elements (t+1)%N, (t+2)%N, …
• [127%15=7]
 0   1   2  3  4   5   6   7    8    9   10    11   12 13  14
14  16  47       35  36  65  127  129   25  2501   99     29
                          ^    ^
                       attempts
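The whole walkthrough can be reproduced with a short Python sketch of linear probing:

```python
# A minimal sketch of open addressing with linear probing: N = 15,
# h(k) = k % 15, probing (t+1)%N, (t+2)%N, ... until a slot is free.
def insert_linear_probe(table, key):
    n = len(table)
    t = key % n
    for i in range(n):                 # probe t, t+1, t+2, ... (mod N)
        slot = (t + i) % n
        if table[slot] is None:
            table[slot] = key
            return slot
    raise RuntimeError("hash table is full")

table = [None] * 15
for k in [47, 35, 36, 129, 25, 2501, 65, 29, 16, 14, 99, 127]:
    insert_linear_probe(table, k)
print(table.index(127))  # 127 collides at 7 (taken by 65) and lands at 8
```

Inserting the keys in the slide order reproduces the final array: 14 wraps around to index 0, 99 settles at 12, and 127 at 8.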
Linear Probing
• Eliminates the need for separate data structures (chains), and the cost of constructing nodes.

• Leads to the problem of clustering: elements tend to cluster in dense intervals in the array.

• The search-efficiency problem remains.
