Working of HashSet in Java

Java Set Interface

A Java Set interface represents a group of elements arranged like an array. It does not allow duplicate elements. When we try to pass the same element that is already available in the Set, then it will not store into the Set. It is used to model the mathematical set abstraction.

Java HashSet class

A Java HashSet class represents a set of elements (objects). It does not guarantee the order of elements. It constructs a collection that uses a hash table for storing elements. It contains unique elements. It inherits the AbstractSet class. It also implements the Set interface. It uses a technique to store elements is called hashing. HashSet uses HashMap internally in Java.

Suppose, we want to create a HashSet to store a group of Strings, then create the object as:

Where <String> is the generic type parameter. It represents the type of element storing in the HashSet.

HashSet implements Set interface. It guarantees uniqueness. It is achieved by storing elements as keys with the same value always. HashSet does not have any method to retrieve the object from the HashSet. There is only a way to get objects from the HashSet via Iterator. When we create an object of HashSet, it internally creates an instance of HashMap with default initial capacity 16.

HashSet uses a constructor HashSet(int capacity) that represents how many elements can be stored in the HashSet. The capacity may increase automatically when more elements to be store.

HashSet uses another constructor HashSet(int capacity, float loadfactor). Here, loadfactor demines the point where the capacity of HashSet would be increased internally. For example, the product of capacity and loadfactor is 101*0.5=50.5. It means that after storing 50th element into the HashSet; its capacity will be internally increased to store more elements. The initial default capacity of HashSet is 16. The default load factor is 0.75.

HashSet Implementation

In the following, we are implementing add() method which adds element into HashSet.

import java.util.*;
public class HashsetDemo
{
public static void main(String[] args)
{
HashSet<String> hs= new HashSet<String>();
hs.add(?India?);
hs.add(?America?);
hs.add(?Russia?);
System.out.println(?Set is ?+hs);  	   //view HashSet
Iterator it=hs.iterator();         		 //add an iterator to hs
System.out.println("Elements using iterator:");
while(it.hasNext())                	            //display elements by using iterator
{
String s=(String)it.next();
System.out.println(s);
}
}
}

Test it Now

Output:

Set is [America, India, Russia]
Elements using iterator:
America
India
Russia

In the following example we are trying to add some duplicate values.

import java.util.*;
public class HashsetDemo
{
public static void main(String[] args)
{
HashSet<String> hs= new HashSet<String>();
hs.add("India");
hs.add("America");
hs.add("Russia");
hs.add("China");
hs.add("India");					      //duplicate value
hs.add("Russia");		                	     //duplicate value
System.out.println("Set is "+hs);  		    //view HashSet
Iterator it=hs.iterator();        			  //add an iterator to hs
System.out.println("Elements using iterator:");
while(it.hasNext())                          //display elements by using iterator
{
String s=(String)it.next();
System.out.println(s);
}
}
}

Test it Now

Output:

Set is [China, America, India, Russia]
Elements using iterator:
China
America
India
Russia

In the above example we have added some duplicate values. We can observe that duplicate values are not stored in the HashSet. When we pass duplicate elements in the add() method of the Set object, it internally returns false.

Here, a question arises that how it returns false. When we open the HashSet implementation of the add() method in Java APIs i.e. rt.jar, we find the following code in it:

public class HashSet<E> extends AbstractSet<E>
{
private transient HashMap<E,Object> map;
// Dummy value to associate with an Object in the backing Map
private static final Object PRESENT = new Object();
public HashSet()
{
map = new HashMap<>();
}
public boolean add(E e) 
{
return map.put(e, PRESENT)==null;
}
}

In the above code a call to add(object) is delegated to put(key, value) internally. Where key is the object we have passed and the value is another object, called PRESENT. It is a constant in java.util.HashSet.

We are achieving uniqueness in Set internally through HashMap. When we create an object of HashSet, it will create an object of HashMap. We know that each key is unique in the HashMap. So, we pass the argument in the add(E e) method. Here, we need to associate some value to the key. It will associate with Dummy value that is (new Object()) which is referred by Object reference PRESENT.

When we add an element in HashSet like hs.add("India"), Java does internally is that it will put that element E here "India" as a key into the HashMap (generated during HashSet object creation). It will also put some dummy value that is Object's object is passed as a value to the key.

put method of HashMap

put(Key k, Value v)
{
//some code
}

The important points about put(key, value) method is that:

If the Key is unique and added to the map, then it will return null
If the Key is duplicate, then it will return the old value of the key.

When we invoke add() method in HashSet, Java internally checks the return value of map.put(key, value) method with the null value.

public boolean add(E e)
{
return map.put(e, PRESENT==null);
}

If the method map.put(key, value) returns null, then the method map.put(e, PRESENT)==null will return true internally, and the element added to the HashSet.
If the method map.put(key, value) returns the old value of the key, then the method map.put(e, PRESENT)==null will return false internally, and the element will not add to the HashSet.

Retrieving Object from the HashSet

We use iterator() method to retrieve object from the HashSet. It is a method of java.util.HashSet class. It returns iterator for backup Map returned by map.keySet().iterator() method.

public Iterator<E> iterator()
{
return map.keySet().iterator();
}

Next Topic#

← prev next →

For Videos Join Our Youtube Channel: Join Now

Feedback

Send your Feedback to [email protected]

Help Others, Please Share

Learn Latest Tutorials

Splunk

SPSS

Swagger

Transact-SQL

Tumblr

ReactJS

Regex

Reinforcement Learning

R Programming

RxJS

React Native

Python Design Patterns

Python Pillow

Python Turtle

Keras

Preparation

Aptitude

Reasoning

Verbal Ability

Interview Questions

Company Questions

Trending Technologies

Artificial Intelligence

AWS

Selenium

Cloud Computing

Hadoop

ReactJS

Data Science

Angular 7

Blockchain

Git

Machine Learning

DevOps

B.Tech / MCA

DBMS

Data Structures

DAA

Operating System

Computer Network

Compiler Design

Computer Organization

Discrete Mathematics

Ethical Hacking

Computer Graphics

Software Engineering

Web Technology

Cyber Security

Automata

C Programming

C++

Java

.Net

Python

Programs

Control System

Data Mining

Data Warehouse

^{Like/Subscribe us for latest updates or newsletter}

Java Collections