What is Hashing¶

Hashing is a method of sorting and indexing data. The idea behind hashing is to allow large amounts of data to be indexed using keys commonly created by formulas.

Who do we need Hashing?¶

It's time efficient:

Data structures	Time complexity for search operation
Array	O(log n)
Linked List	O(n)
Tree	O(log n)
Hashing	O(1) best case / O(n) worst case

Hash function - a hash function is any function that can be used to map data of arbitrary size to data of fixed size.
Key - input data given by user
Hash value - the values returned by a hash function are called hash values, hash codes, digests, or simply hashes.
Hash table - it is a data structure which implements an associative array abstract data type, a structure that can map keys to values.
Collision - a collision occurs when to different keys produce the same output as the hash value.

There are two types of collision resolution techniques:

Direct chaining:
- Situation will never arise
Open addressing:
- Need to create 2x size of current hash table and redo hashing for existing keys.

Direct chaining
- No fear of exhausting hash table buckets
- Fear of big linked lists (can effect performance big time)
Open addressing
- Easy implementation
- Fear of exhausting hash table buckets
If input size is known then always use "open addressing", else can use any of two.
If deletion is very high, we should always go with direct chaining.

Pros: - On an average insertion/deletion/search operation takes O(1) time

Cons: - In the worst case insertion/deletion/search might take O(n) time (when hash functions are not good enough).