Open In App

How to handle duplicates in Binary Search Tree?

Last Updated : 11 Oct, 2024
Summarize
Comments
Improve
Suggest changes
Like Article
Like
Share
Report
News Follow

In a Binary Search Tree (BST), all keys in the left subtree of a key must be smaller and all keys in the right subtree must be greater. So a Binary Search Tree by definition has distinct keys. How can duplicates be allowed where every insertion inserts one more key with a value and every deletion deletes one occurrence?

[Naive approach] Insert duplicate element on right subtree – O(h) Time and O(h) Space

A Simple Solution is to allow the same keys on the right side (we could also choose the left side). For example consider the insertion of keys 12, 10, 20, 9, 11, 10, 12, 12 in an empty Binary Search Tree: 

How-to-handle-duplicates-in-Binary-Search-Tree

[Expected approach] By storing count of occurrence – O(h) Time and O(h) Space

A Better Solution is to augment every tree node to store count together with regular fields like key, and left and right pointers. 

Insertion of keys 12, 10, 20, 9, 11, 10, 12, 12 in an empty Binary Search Tree would create following: Count of a key is shown in bracket.

How-to-handle-duplicates-in-Binary-Search-Tree-copy

This approach has following advantages over above simple approach.

  • Height of tree is small irrespective of number of duplicates. Note that most of the BST operations (search, insert and delete) have time complexity as O(h) where h is height of BST. So if we are able to keep the height small, we get advantage of less number of key comparisons.
  • Search, Insert and Delete become easier to do. We can use same insert, search and delete algorithms with small modifications (see below code). 
  • This approach is suited for self-balancing BSTs (AVL Tree, Red-Black Tree, etc) also. These trees involve rotations, and a rotation may violate BST property of simple solution as a same key can be in either left side or right side after rotation.

Follow the steps below to solve the problem:

Algorithm for Insert in BST:

  • If the tree is empty, create a new node with the given key.
  • Traverse the tree:
    • If the key is smaller than the current node’s key, move to the left child, otherwise, move to the right child.
    • If the key matches the current node, increment the count.
    • Repeat the process recursively until the correct position is found, then insert the new node.

Algorithm for Delete in BST:

  • Traverse the tree to find the node with the given key.
    • If found, and the node’s count is greater than 1, decrement the count and return.
    • If the node has no children or one child, replace it with the child.
    • If the node has two children, find its inorder successor, copy its key and count, then delete the successor node.

Below is implementation of normal Binary Search Tree with count with every key. This code basically is taken from code for insert and delete in BST. The changes made for handling duplicates are highlighted, rest of the code is same. 

C++
// C++ program to implement basic operations 
// (search, insert, and delete) on a BST that
// handles duplicates by storing count with 
// every node

#include<bits/stdc++.h>
using namespace std;

class Node {
  
  public:
    int key;
    int count;
    Node *left;
 	Node *right;
  
    Node(int x) {
        key = x;
        count = 1;
        left = nullptr;
        right = nullptr;
    }
};

// A utility function to do inorder traversal of BST
void inorder(Node *root) {
    if (root != nullptr) {
        inorder(root->left);
        cout << root->key << "(" << root->count << ") ";
        inorder(root->right);
    }
}

// A utility function to insert a new 
// node with given key in BST 
Node* insert(Node* node, int key) {
  
    // If the tree is empty, return a new node
    if (node == nullptr) 
        return new Node(key);

    // If key already exists in BST, 
    // increment count and return
    if (key == node->key) {
        node->count++;
        return node;
    }

    // Otherwise, recur down the tree
    if (key < node->key)
        node->left = insert(node->left, key);
    else
        node->right = insert(node->right, key);

    // return the (unchanged) node pointer 
    return node;
}

// Given a non-empty binary search tree, return 
// the node with minimum key value found in that 
// tree. Note that the entire tree does not need
// to be searched. 
Node* minValueNode(Node* node) {
    Node* current = node;

    // loop down to find the leftmost leaf 
    while (current->left != nullptr)
        current = current->left;

    return current;
}

// Given a binary search tree and a key, 
// this function deletes a given key and 
// returns root of modified tree 
Node* deleteNode(Node* root, int key) {
  
    // base case
    if (root == nullptr) return root;

    // If the key to be deleted is smaller than the
    // root's key, then it lies in left subtree
    if (key < root->key)
        root->left = deleteNode(root->left, key);

    // If the key to be deleted is greater than 
    // the root's key, then it lies in right subtree
    else if (key > root->key)
        root->right = deleteNode(root->right, key);

    // if key is same as root's key
    else {
      
        // If key is present more than once, 
        // simply decrement count and return
        if (root->count > 1) {
            root->count--;
            return root;
        }

        // else, delete the node
        // node with only one child or no child
        if (root->left == nullptr) {
            Node *curr = root->right;
            delete root;
            return curr;
        }
        else if (root->right == nullptr) {
            Node *curr = root->left;
            delete root;
            return curr;
        }

        // node with two children: Get the inorder 
        // successor (smallest in the right subtree)
        Node* mn = minValueNode(root->right);

        // Copy the inorder successor's 
        // content to this node
        root->key = mn->key;
        root->count = mn->count;

        // To ensure successor gets deleted by 
        // deleteNode call, set count to 0.
        mn->count = 0;

        // Delete the inorder successor
        root->right = deleteNode(root->right, mn->key);
    }
    return root;
}

int main() {
  
    // Let us create following BST
    //        12(3)
    //       /     \
    //    10(2)     20(1)
    //   /   \
    // 9(1) 11(1) 
    Node *root = nullptr;
    root = insert(root, 12);
    root = insert(root, 10);
    root = insert(root, 20);
    root = insert(root, 9);
    root = insert(root, 11);
    root = insert(root, 10);
    root = insert(root, 12);
    root = insert(root, 12);

    cout << "Inorder traversal of the given tree " << endl;
    inorder(root);

    cout << "\nDelete 20\n";
    root = deleteNode(root, 20);
    cout << "Inorder traversal of the modified tree \n";
    inorder(root);

    cout << "\nDelete 12\n";
    root = deleteNode(root, 12);
    cout << "Inorder traversal of the modified tree \n";
    inorder(root);

    cout << "\nDelete 9\n";
    root = deleteNode(root, 9);
    cout << "Inorder traversal of the modified tree \n";
    inorder(root);

    return 0;
}
C Java Python C# JavaScript

Output
Inorder traversal of the given tree 
9(1) 10(2) 11(1) 12(3) 20(1) 
Delete 20
Inorder traversal of the modified tree 
9(1) 10(2) 11(1) 12(3) 
Delete 12
Inorder traversal of the modified tree 
9(1) 10(2) 11(1) 12(2) 
Delete 9
Inorder traversal of the modified tree 
10(2) 11(1) 12(2) 

Time Complexity: O(h) for every operation, h is height of BST.
Auxiliary Space: O(h) which is required for the recursive function calls.



Next Article

Similar Reads

three90RightbarBannerImg