Cataract Detection Using Machine Learning

A lot of individuals get cataracts, which is a common eye problem, especially as they age. Because of this disorder, the eye's lens becomes clouded, impairing vision, sensitivity to light, and reduced nighttime vision. It is essential to identify cataracts as soon as possible in order to enable prompt treatment and stop future vision loss. By examining photographs of the eye, machine learning algorithms have the potential to help in the early diagnosis of cataracts.

Working of Machine Learning Algorithm for Detecting Cataracts

A part of artificial intelligence called machine learning involves teaching algorithms to spot patterns in data. Machine learning algorithms may be trained on a huge dataset of eye pictures in the context of cataract diagnosis to learn how to distinguish between healthy and cataract-affected eyes.

Researchers frequently utilize a collection of labeled eye photos to train a machine-learning system to identify cataracts. The algorithm is trained to identify patterns in the photos that separate cataract-affected eyes from normal eyes by categorizing each image in the dataset as either having a cataract or not.

The algorithm may then be put to the test on a different set of eye photos to see how well it works. The sensitivity and specificity of the algorithm are determined to assess the accuracy of cataract detection.

The following steps are commonly involved in the machine learning method for cataract detection:

STEP 1: It is necessary to first get excellent ocular pictures. A range of tools, including cameras and optical coherence tomography (OCT) equipment, can be used to record these pictures. The pictures must be preprocessed when they are acquired to make sure they are correctly aligned, normalized, and enhanced for analysis.
STEP 2: Regions of interest must be found after the preprocessed pictures have been recognized. These Regions of interest correlate to regions of the eye, such as the lens or cornea, where cataracts are more likely to develop.
STEP 3: It is necessary to extract features from the Regions of interest after they have been discovered. These characteristics could include details like texture, shape, color, or intensity. The success of the algorithm in identifying cataracts depends on the feature extraction process.
STEP 4: The machine learning algorithm must then be trained once the features have been retrieved. In order to do this, a series of labeled photos, with each image classified as either having a cataract or not, must be fed to the algorithm. The program gains the ability to spot trends in the characteristics that set cataract photos apart from non-cataract ones.
STEP 5: The machine learning algorithm must then be trained once the features have been retrieved. In order to do this, a series of labeled photos, with each image classified as either having a cataract or not, must be fed to the algorithm. The program gains the ability to spot trends in the characteristics that set cataract photos apart from non-cataract ones.
STEP 6: The technique can be implemented in clinical settings to help with early cataract detection when it has been validated and tested.

Benefits of Cataract Detection Using Machine Learning

The technique can be implemented in clinical settings to help with early cataract detection when it has been validated and tested.
By examining vast datasets of eye photos and learning to recognize tiny patterns that may be challenging for people to see, machine learning algorithms have the potential to increase the accuracy of cataract diagnosis.
By automating the analytical process and removing the demand for human examination by qualified specialists, machine learning algorithms can lower the cost of cataract diagnosis.
By facilitating diagnosis in rural or underserved places where access to qualified experts is constrained, machine learning algorithms may enhance access to cataract diagnosis.

Limitations of Cataract Detection Using Machine Learning

Algorithms for machine learning are only as good as the data used to train them. The performance of the algorithm may not be as good as it might be if the dataset used to train it is biased or not representative of the population.
It might be tricky to comprehend how machine learning algorithms arrive at their diagnoses and interpret them. The absence of transparency may act as a deterrent to widespread adoption.
The use of machine learning in healthcare raises ethical concerns, such as patient privacy, informed consent, and bias in decision-making.
Machine learning algorithms might demand a substantial amount of processing power and storage, which may be difficult for organizations with low funding.

Code Implementation

We will try to distinguish the images of people, whether they have normal or cataract eyes. Here we will use two datasets: the Cataract dataset and Oculur Disease Recognition.

Importing Libraries

A Python package called "glob" is used to expand the paths to files and directories. Users may utilize wildcard patterns to search for files, and it will return a list of file paths that satisfy the search criteria.

The "glob" library offers a simple way to find files using patterns like file names, file extensions, or directory names. In data science and machine learning projects, reading numerous files with identical names or esxtensions is a typical practice.

import os, glob, cv2
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
import tensorflow as tf
from tensorflow.keras.layers import *
from tensorflow.keras import backend as K
from tensorflow.keras.models import Sequential
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from tensorflow.keras.utils import get_custom_objects
import efficientnet.tfkeras as efn
from tqdm import tqdm

Reading Metadata and Setting Configuration

Two crucial machine learning operations, that are setting configurations and reading metadata, can boost the model's performance and can help in precise data handling.

Setting configuration entails adjusting numerous settings or parameters for a model. These parameters may include hyperparameters like epochs, batch size, and learning rate, as well as those that have an impact on model behavior, such as regularisation techniques or optimizer algorithms.

Data containing information about other data is known as metadata. Metadata in machine learning might include details on a dataset's size, the kinds of features or labels it contains, and other details about the data collection process, etc.

In order to guarantee that the data is handled and processed correctly, reading metadata is a crucial step in machine learning. Developers can use suitable methods, such as imputation, to manage missing values, for instance, if the metadata shows some characteristics in the dataset that it has missing values.

SEED = 42
EPOCHS = 100
SIZE_BATCH = 32
HEIGHT_IMG = 192
WIDTH_IMG = 256

# Dataset: cataract
ROOT_IMG = '../input/cataractdataset/dataset/'
DIR_IMG = [ROOT_IMG+'1_normal',
           ROOT_IMG+'2_cataract',
           ROOT_IMG+'2_glaucoma',
           ROOT_IMG+'3_retina_disease']

#Dataset: ocular-disease-recognition dataset
ROOT_IMG_OCU = '../input/ocular-disease-recognition-odir5k/ODIR-5K/Training Images/'
dataframe_ocu = pd.read_excel('../input/ocular-disease-recognition-odir5k/ODIR-5K/data.xlsx')

def seed_everything(seed):
    np.random.seed(seed)
    os.environ['PYTHONHASHSEED'] = str(seed)
    tf.random.set_seed(seed)

seed_everything(SEED)
#The seeding step includes giving the model's training-phase random number generator a fixed value. This guarantees that the model's outcomes are consistent over several runs and on various computers.

Processing Cataract Dataset

Data must be gathered, cleaned, integrated, transformed, reduced, separated, and visualized as part of the processing of a dataset. To guarantee that the data is precise, consistent, and appropriate for analysis or machine learning, each step is crucial. Any data science or machine learning project must include data processing, which necessitates careful planning and close attention to detail. You may get the most out of your data by using these methods to prepare it for analysis or machine learning.

Here we will be processing the Cataract Dataset.

dataframe_cataract = pd.DataFrame(0,
                  columns=['paths',
                           'cataract'],
                  index=range(601))

filepaths = glob.glob(ROOT_IMG + '*/*')


for i, path_file in enumerate(filepaths):
    path_file = os.path.split(path_file)
    dataframe_cataract.iloc[i, 0] = path_file[0] + '/' + path_file[1]
   
    if path_file[0] == DIR_IMG[0]:    # normal
        dataframe_cataract.iloc[i, 1] = 0
    elif path_file[0] == DIR_IMG[1]:  # cataract
        dataframe_cataract.iloc[i, 1] = 1
    elif path_file[0] == DIR_IMG[2]:  # glaucoma
        dataframe_cataract.iloc[i, 1] = 2
    elif path_file[0] == DIR_IMG[3]:  # retine_disease
        dataframe_cataract.iloc[i, 1] = 3


# Only sampling normal and cataract from the dataset    
dataframe_cataract = dataframe_cataract.query('0 <= cataract < 2')
dataframe_cataract

Output:

print('Number of normal and cataract images')
print(dataframe_cataract['cataract'].value_counts())

Output:

Here we can see that there are 300 normal eye images and 100 images that are of cataracts.

Processing Oculur Disease Recognition Dataset

Now we will be processing the Oculur Disease Recognition Dataset.

Output:

def has_cataract_mentioned(text):
    if 'cataract' in text:
        return 1
    else:
        return 0
   
dataframe_ocu['left_eye_cataract'] = dataframe_ocu['Left-Diagnostic Keywords']\
                                 .apply(lambda x: has_cataract_mentioned(x))
dataframe_ocu['right_eye_cataract'] = dataframe_ocu['Right-Diagnostic Keywords']\
                                 .apply(lambda x: has_cataract_mentioned(x))

We made a function that will label the data whether the person has a cataract or not.

dataframe_lefteye = dataframe_ocu.loc[:, ['Left-Fundus', 'left_eye_cataract']]\
        .rename(columns={'left_eye_cataract':'cataract'})
dataframe_lefteye['paths'] = ROOT_IMG_OCU + dataframe_lefteye['Left-Fundus']
dataframe_lefteye = dataframe_lefteye.drop('Left-Fundus', axis=1)


dataframe_righteye = dataframe_ocu.loc[:, ['Right-Fundus', 'right_eye_cataract']]\
        .rename(columns={'right_eye_cataract':'cataract'})
dataframe_righteye['paths'] = ROOT_IMG_OCU + dataframe_righteye['Right-Fundus']
dataframe_righteye = dataframe_righteye.drop('Right-Fundus', axis=1)

dataframe_lefteye.head()

Output:

Here we get the file path of the left-eye images.

Output:

Here we get the file path of the right-eye images.

print('Number of left eye images')
print(dataframe_lefteye['cataract'].value_counts())
print('\nNumber of right eye images')
print(dataframe_righteye['cataract'].value_counts())

Output:

Here we obtained the number of left eye and right eye that has cataracts or not.

def downsample(dfa):
    dfa = pd.concat([
        dfa.query('cataract==1'),
        dfa.query('cataract==0').sample(sum(dfa['cataract']),
                                       random_state=SEED)
    ])
    return dfa


dataframe_lefteye = downsample(dataframe_lefteye)
dataframe_righteye = downsample(dataframe_righteye)

print('Number of left eye images')
print(dataframe_lefteye['cataract'].value_counts())
print('\nNumber of right eye images')
print(dataframe_righteye['cataract'].value_counts())

Output:

dataframe_ocu = pd.concat([dataframe_lefteye, dataframe_righteye])
dataframe_ocu.head()

Output:

Here we get the file path, where all the ocular disease-based images are present.

Combining Datasets

Here we will combine two metadata so that we can use it to load image data and to create a new dataset.

dataframe = pd.concat([dataframe_cataract, dataframe_ocu], ignore_index=True)
dataframe

Output:

Now we will split the dataset into a training and testing set.

dataframe_train, dataframe_test = train_test_split(dataframe,
                                     test_size=0.2,
                                     random_state=SEED,
                                     stratify=dataframe['cataract'])

dataframe_train, dataframe_value = train_test_split(dataframe_train,
                                    test_size=0.15,
                                    random_state=SEED,
                                    stratify=dataframe_train['cataract'])

def create_datasets(dfa, widht_img, height_img):
    imgs = []
    for path in tqdm(dfa['paths']):
        img = cv2.imread(path)
        img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
        img = cv2.resize(img, (widht_img, height_img))
        imgs.append(img)
       
    imgs = np.array(imgs, dtype='float32')
    dfa = pd.get_dummies(dfa['cataract'])
    return imgs, dfa


train_imgs, dataframe_train = create_datasets(dataframe_train, WIDTH_IMG, HEIGHT_IMG)
val_imgs, dataframe_value = create_datasets(dataframe_value, WIDTH_IMG, HEIGHT_IMG)
test_imgs, dataframe_test = create_datasets(dataframe_test, WIDTH_IMG, HEIGHT_IMG)

train_imgs = train_imgs / 255.0
val_imgs = val_imgs / 255.0
test_imgs = test_imgs / 255.0

Output:

Now we will go for the training of our model.

# plot the first 25 sheets of image data for training

f, ax = plt.subplots(5, 5, figsize=(15,15))
norm_list = list(dataframe_train[0][:25])
for i, img in enumerate(train_imgs[:25]):
    ax[i//5, i%5].imshow(img)
    ax[i//5, i%5].axis('off')
    if norm_list[i] == 1:
        ax[i//5, i%5].set_title('TrainData: Normal')
    else:
        ax[i//5, i%5].set_title('TrainData: Cataract')
plt.show()

Output:

Considering testing, we still go for the 25 sheets for it.

# plot the first 25 sheets of image data for Test
f, ax = plt.subplots(5, 5, figsize=(15,15))
norm_list = list(dataframe_train[0][:25])
for a, img in enumerate(test_imgs[:25]):
    ax[a//5, a%5].imshow(img)
    ax[a//5, a%5].axis('off')
    if norm_list[a] == 1:
        ax[a//5, a%5].set_title('TestData: Normal')
    else:
        ax[a//5, a%5].set_title('TestData: Cataract')
plt.show()

Output:

Modeling

Modeling is the process of using a dataset to train a mathematical or statistical algorithm to find patterns or make predictions. Building a machine learning model with good generalizability to fresh data and effective prediction capabilities is the aim of modeling.

class Mish(tf.keras.layers.Layer):

    def __init__(self, **kwargs):
        super(Mish, self).__init__(**kwargs)
        self.supports_masking = True

    def call(self, inputs):
        return inputs * K.tanh(K.softplus(inputs))

    def get_config(self):
        base_config = super(Mish, self).get_config()
        return dict(list(base_config.items()) + list(config.items()))

    def compute_output_shape(self, input_shape):
        return input_shape
def mish(x):
    return tf.keras.layers.Lambda(lambda x: x*K.tanh(K.softplus(x)))(x)
 
get_custom_objects().update({'mish': Activation(mish)})

input_shape = (HEIGHT_IMG, WIDTH_IMG, 3)

model = Sequential()
model.add(Conv2D(16, kernel_size=3, padding='same',
                 input_shape=input_shape, activation='mish'))
model.add(Conv2D(16, kernel_size=3, padding='same', activation='mish'))
model.add(BatchNormalization())
model.add(MaxPool2D(3))
model.add(Dropout(0.3))
model.add(Conv2D(16, kernel_size=3, padding='same', activation='mish'))
model.add(Conv2D(16, kernel_size=3, padding='same', activation='mish'))
model.add(BatchNormalization())
model.add(MaxPool2D(3))
model.add(Dropout(0.3))
model.add(Flatten())
model.add(Dense(2, activation='softmax'))
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])


model.summary()

Output:

Now we will be creating randomly enhanced picture data from the ImageDataGenerator Object using some image data augmentation.

generator = ImageDataGenerator(horizontal_flip=True,
                               height_shift_range=0.1,
                               fill_mode='reflect')



es_callback = tf.keras.callbacks.EarlyStopping(patience=20,
                                               verbose=1,
                                               restore_best_weights=True)
reduce_lr = tf.keras.callbacks.ReduceLROnPlateau(factor=0.1, patience=10, verbose=1)

history = model.fit(generator.flow(train_imgs,
                                   dataframe_train,
                                   batch_size=SIZE_BATCH),
                    epochs=EPOCHS,
                    steps_per_epoch=len(train_imgs)/SIZE_BATCH,
                    callbacks=[es_callback, reduce_lr],
                    validation_data=(val_imgs, dataframe_value))


pd.DataFrame(history.history)[['accuracy', 'val_accuracy']].plot()
pd.DataFrame(history.history)[['loss', 'val_loss']].plot()
plt.show()

Output:

Ealuating the Model

Now we will be evaluating the model.

Output:

Well, the accuracy of the model is 88%, and the loss is 36%, which seems good.

But we will apply a model that has been previously trained on the dataset. Several characteristics and trends in the data have already been identified by this pre-trained model, which might be useful for carrying out specific tasks.

def build_model(height_img, width_img, n):
    inp = Input(shape=(height_img,width_img,n))
    efnet = efn.EfficientNetB0(
        input_shape=(height_img,width_img,n),
        weights='imagenet',
        include_top=False
    )
    x = efnet(inp)
    x = GlobalAveragePooling2D()(x)
    x = Dense(2, activation='softmax')(x)
    model = tf.keras.Model(inputs=inp, outputs=x)
    opt = tf.keras.optimizers.Adam(learning_rate=0.000003)
    loss = tf.keras.losses.CategoricalCrossentropy(label_smoothing=0.01)
    model.compile(optimizer=opt, loss=loss, metrics=['accuracy'])
    return model

model = build_model(HEIGHT_IMG, WIDTH_IMG, 3)
model.summary()

Output:

generator = ImageDataGenerator(horizontal_flip=True,
                               height_shift_range=0.1,
                               fill_mode='reflect')



es_callback = tf.keras.callbacks.EarlyStopping(patience=20,
                                               verbose=1,
                                               restore_best_weights=True)
reduce_lr = tf.keras.callbacks.ReduceLROnPlateau(factor=0.1, patience=10, verbose=1)

history = model.fit(generator.flow(train_imgs,
                                   dataframe_train,
                                   batch_size=SIZE_BATCH),
                    epochs=EPOCHS,
                    steps_per_epoch=len(train_imgs)/SIZE_BATCH,
                    callbacks=[es_callback, reduce_lr],
                    validation_data=(val_imgs, dataframe_value))


pd.DataFrame(history.history)[['accuracy', 'val_accuracy']].plot()
pd.DataFrame(history.history)[['loss', 'val_loss']].plot()
plt.show()

Output:

Re-Evaluating the model

Output:

Looking at the accuracy of the model while testing it, was 93% and with a loss of 21%, which is quite good.

Considering it's an optical-based machine learning model, the accuracy is superb.

Conclusion

Machine learning-based cataract detection has the potential to enhance early diagnosis and treatment of cataracts, enabling individualized and effective patient care. Although this method has drawbacks and restrictions, continuous research and development are aimed at resolving these problems and enhancing the functionality and usability of machine learning algorithms for cataract detection. The future of cataract detection and treatment is bright with ongoing technological advancements and more researcher-clinician collaboration.

Next TopicPlacement Prediction Using Machine Learning

← prev next →