International Journal of Image, Graphics and Signal Processing (IJIGSP)

ISSN: 2074-9074 (Print)

ISSN: 2074-9082 (Online)

DOI: https://doi.org/10.5815/ijigsp

Website: https://www.mecs-press.org/ijigsp

Published By: MECS Press

Frequency: 6 issues per year

Number(s) Available: 135

(IJIGSP) in Google Scholar Citations / h5-index

IJIGSP is committed to bridge the theory and practice of images, graphics, and signal processing. From innovative ideas to specific algorithms and full system implementations, IJIGSP publishes original, peer-reviewed, and high quality articles in the areas of images, graphics, and signal processing. IJIGSP is a well-indexed scholarly journal and is indispensable reading and references for people working at the cutting edge of images, graphics, and signal processing applications.

 

IJIGSP has been abstracted or indexed by several world class databases: Scopus, Google Scholar, Microsoft Academic Search, CrossRef, Baidu Wenku, IndexCopernicus, IET Inspec, EBSCO, JournalSeek, ULRICH's Periodicals Directory, WorldCat, Scirus, Academic Journals Database, Stanford University Libraries, Cornell University Library, UniSA Library, CNKI Scholar, ProQuest, J-Gate, ZDB, BASE, OhioLINK, iThenticate, Open Access Articles, Open Science Directory, National Science Library of Chinese Academy of Sciences, The HKU Scholars Hub, etc..

Latest Issue
Most Viewed
Most Downloaded

IJIGSP Vol. 17, No. 1, Feb. 2025

REGULAR PAPERS

Disorder Facial Emotion Recognition with Grey Wolf Optimized Gefficient-Net for Autism Spectrum

By Ramachandran Vedantham Ravisankar Malladi Sivaiah Bellamkonda Edara Sreenivasa Reddy

DOI: https://doi.org/10.5815/ijigsp.2025.01.01, Pub. Date: 8 Feb. 2025

Autism spectrum disorder (ASD) is a neurological issue that impacts brain function at an earlier stage. The autistic person realizes several complexities in communication or social interaction. ASD detection from face images is complicated in the field of computer vision. In this paper, a hybrid GEfficient-Net with a Gray-Wolf (GWO) optimization algorithm for detecting ASD from facial images is proposed. The proposed approach combines the advantages of both EfficientNet and GoogleNet. Initially, the face image from the dataset is pre-processed, and the facial features are extracted with the VGG-16 feature extraction technique. It extracts the most discriminative features by learning the representation of each network layer. The hyperparameters of GoogleNet are optimally selected with the GWO algorithm. The proposed approach is uniformly scaled in all directions to enhance performance. The proposed approach is implemented with the Autistic children’s face image dataset, and the performance is computed in terms of accuracy, sensitivity, specificity, G-mean, etc. Moreover, the proposed approach improves the accuracy to 0.9654 and minimizes the error rate to 0.0512. The experimental outcomes demonstrate the proposed ASD diagnosis has achieved better performance.

[...] Read more.
Method of Diagnostics of Multichannel Data Transmission System

By Anatolii Taranenko Yevhen Gabrousenko Oleksii Holubnychyi Oleksandr Lavrynenko Maksym Zaliskyi

DOI: https://doi.org/10.5815/ijigsp.2025.01.02, Pub. Date: 8 Feb. 2025

The redundancy of a multichannel data transmission system increases its reliability. During the operation of the system, it is necessary to diagnose and switch failed channels. To solve this problem, the set of input signals of the system is considered as a vector signal, whose scalar components are the coordinates of the vector in a given dimensional space. The diagnosis is performed using a scalar criterion, whose relative simplicity is ensured by the linearity of the signal transformations applied. To minimize the total probability of diagnostic error, the task of optimizing the tolerance on the diagnostic parameter is solved. The possibility of technical implementation of the proposed method is shown based on matrix transformations of the system's input and output signals. The system efficiency was assessed according to the "reliability-cost" criterion. Scientific novelty of the work consists in the fact that analytical expressions for matrix transformations of input and output vector signals of a multichannel data transmission system have been developed. Realization of these transformations provides diagnostics of the system according to the developed scalar criterion both in the test mode and in the mode of functioning as intended. 

[...] Read more.
Multimodal Image Analysis Based Pedestrian Detection Using Optimization with Classification by Hybrid Machine Learning Model

By Johnson Kolluri Ranjita Das

DOI: https://doi.org/10.5815/ijigsp.2025.01.03, Pub. Date: 8 Feb. 2025

In recent times People commonly display substantial intra-class variability in both appearance and position, making pedestrian recognition difficult. Current computer vision techniques like object identification as well as object classification has given deep learning (DL) models a lot of attention and this application is based on supervised learning, which necessitates labels. Multimodal imaging enables examining more than one molecule at a time, so that cellular events may be examined simultaneously or the progression of these events can be followed in real-time. Purpose of this study is to propose and construct a hybrid machine learning (ML) pedestrian identification model based on multimodal datasets. For pedestrian detection, the input is gathered as multimodal pictures, which are then processed for noise reduction, smoothing, and normalization. Then, the improved picture was categorized using metaheuristic salp cross-modal swarm optimization and optimized using naive spatio kernelized extreme convolutional transfer learning. We thoroughly evaluated the proposed approach on three benchmark datasets for multimodal pedestrian identification that are made accessible to the general public. For several multimodal image-based pedestrian datasets, experimental analysis is done in terms of average precision, log-average miss rate, accuracy, F1 score, and equal error rate. The findings of the studies show that our method is capable of performing cutting-edge detection on open datasets. proposed technique attained average precision of 95%, log-average miss rate of 81%, accuracy of 61%, F1 score of 51%, equal error rate of 59%.

[...] Read more.
An Autoencoder-Based Deep Learning Model for Fractographic Characterization of Tungsten Heavy Alloys

By Sudipta Pal Triparna Sarkar Sourav Saha Priya Ranjan Sinha Mahapatra

DOI: https://doi.org/10.5815/ijigsp.2025.01.04, Pub. Date: 8 Feb. 2025

Fracture surface analysis is crucial in investigating manufacturing failures and material characterization. Traditional manual inspection methods are slow and subjective, prompting the need for efficient automated tools using advanced computer vision techniques. Recent machine learning models for classifying surface fractures show potentials but struggle due to the lack of large, labeled datasets. This study explores the potential application of autoencoders, a self-supervised neural network, to identify unintended fracture surfaces from anomalous manufacturing of tungsten-heavy alloys. The proposed autoencoder-based model achieves 97% accuracy in distinguishing undesirable fracture patterns by analyzing the reconstruction loss of the images, surpassing existing methods. This high accuracy highlights the autoencoder's ability to automatically extract and reduce dimensional features from fracture surfaces effectively. The experimental result obtained on tungsten-heavy alloys demonstrate the model's potential towards developing autoencoder-based automated tools for fractographic analyses across various materials and operational scenarios. 

[...] Read more.
A Comprehensive Evaluation of Spectral Unmixing Methods in Hyperspectral Imaging

By Archana Chaudhari Samrudhi S. Wath Tushar P. Zanke Stuti N. Jagtap Snehashish S. Mulgir

DOI: https://doi.org/10.5815/ijigsp.2025.01.05, Pub. Date: 8 Feb. 2025

This study explores hyperspectral image classification, particularly focusing on spectral unmixing techniques applied to the widely used "PaviaU" dataset. Nine distinct endmembers, representing materials such as Water, Trees, and Shadows, serve as the foundation for our investigation. Introducing a novel linear regression model meticulously tailored for hyperspectral image reconstruction, we aim to address the complexities inherent in such datasets. Our approach leverages a fusion of non-negative least squares (NNLS) and a sum-to-one constraint, employing the Sequential Least Squares Quadratic Programming (SLSQP) method to seek optimal coefficients.  Through rigorous experimentation and analysis, our model achieves a mean reconstruction error of 1152.318. The efficacy of our approach lies in its seamless integration of NNLS and SLSQP, customizing a solution to the intricate nuances of hyperspectral data. By significantly reducing reconstruction errors, our method represents a substantial advancement in spectral unmixing techniques. Furthermore, this study produces nine abundance maps for each endmember using least squares with constraints, lasso, and averaging the squared differences between observed and reconstructed spectra for pixels with nonzero class labels to determine reconstruction error. Emphasizing the importance of abundance maps and reconstruction errors, we compare the results obtained through our proposed spectral unmixing methods with those of alternative approaches. This comprehensive analysis not only sheds light on the performance superiority of our proposed methods but also provides valuable insights for practitioners and researchers working with hyperspectral imaging data. By offering enhanced accuracy and efficiency in spectral unmixing, our approach holds significant promise for applications ranging from environmental monitoring to precision agriculture and beyond.

[...] Read more.
SIFT-BZM: Pixel Based Forgery Image Detection Using Scale-Invariant Feature Transform and Block Based Zernike Moments

By Kshipra Ashok Tatkare Manoj Devare

DOI: https://doi.org/10.5815/ijigsp.2025.01.06, Pub. Date: 8 Feb. 2025

New area of image processing termed "digital image forensics" aims to gather quantifiable proof of a digital image's authenticity and place of origin. Detection of forgery images to look for copied and pasted portions; however, depending on whether the copied portion underwent post-processing before being transferred to another party, the detection method may differ. Zernike Moments and Scale-Invariant Feature Transform (SIFT) combined are unique techniques that aid in the identification of textured and smooth regions. But compared to SIFT separately, this combination is the slowest. So in the proposed work, Block based image division and SIFT based key point detection model is developed to detect forgery images. The gathered images are poor visual quality and various dimension, so it is resized and converter grayscale conversion. In addition, pixel values of images are improved using optimal Gaussian filter and adaptive histogram equalization which remove noise and blurring based on sigma value. Then, using the SIFT key point extraction algorithm to extract the image's key point and compute the feature vector of each key-points. In that using a block based matching technique to split the pre-images into blocks, and each blocks are diagonally subdivide. Length of the feature vector is computed using Zernike moments of each blocks. Both SIFT features and Zernike moments features are matched to identify the manipulated image from the given data. The proposed model provides 100% recall, 98.2% precision, and 99.09% F1_score. Thus provide the proposed model was effectively detects forgery image in the given data. 

[...] Read more.
Denoising of Non-Small Cell Lung Cancer CT-scans through Fractional Fourier Transform for a Non-invasive Diagnostic Model

By Manika Jha Richa Gupta Rajiv Saxena

DOI: https://doi.org/10.5815/ijigsp.2025.01.07, Pub. Date: 8 Feb. 2025

Non-Small Cell Lung Cancer (NSCLC) represents a significant health challenge globally, with high mortality rates largely attributed to late-stage diagnosis. This paper details a novel approach for denoising computed tomography (CT) scans through 2-dimensional Fractional Fourier transform (2D-FrFT), which has been reported to be effective for time-frequency signal/image processing applications. To establish a foundation for the FrFT filtering of the original and corrupt dataset, a variable fractional-order image processing technique was used. Based on the derived pre-processing of CT scans, a classification model was developed with hand-crafted features and a 2-layer neural network to classify 4834 CT scans collected from the Lung Image Database Consortium image (LIDC-IDRI) dataset into classes of normal lungs and NSCLC infected lungs. This work presents an approach to improving the performance of NSCLC detection through a lightweight neural network that attains 1.00 accuracy, 1.00 sensitivity, and 1.00 AUC. An additional real-time lung cancer dataset from PGI Rohtak, Haryana, has been considered to validate the model and prove its performance against overfitting.  The experimental analysis showed better results than the existing methods for both LIDC-IDRI and hospital datasets and could be a competent assistant to clinicians in detecting NSCLC.

[...] Read more.
Intelligent Processing Censoring Inappropriate Content in Images, News, Messages and Articles on Web Pages Based on Machine Learning

By Oleksiy Tverdokhlib Victoria Vysotska Olena Nagachevska Yuriy Ushenko Dmytro Uhryn Yurii Tomka

DOI: https://doi.org/10.5815/ijigsp.2025.01.08, Pub. Date: 8 Feb. 2025

This project aims to enhance online experiences quality by giving users greater control over the content they encounter daily. The proposed solution is particularly valuable for parents seeking to safeguard their children, educational institutions striving to foster a more conducive learning environment, and individuals prioritising ethical internet usage. It also supports users who wish to limit their exposure to misinformation, including fake news, propaganda, and disinformation. Through the implementation of a browser extension, this system will contribute to a safer internet, reducing users' vulnerability to harmful content and promoting a more positive and productive online environment. The primary objective of this work is to develop a browser extension that automatically detects and censors inappropriate text and images on web pages using artificial intelligence (AI) technologies. The extension will enable users to personalise censorship settings, including the ability to define custom prohibited words and toggle the filtering of text and images. Accuracy estimates for various classifiers such as Random Forest (0.879), Logistic Regression (0.904), Decision Tree (0.878), Naive Bayes (0.315), and KNN (0.832) were performed.

[...] Read more.
Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are the most familiar delicious food which is cholesterol free as well as rich in vitamins and minerals. Though nearly 45,000 species of mushrooms have been known throughout the world, most of them are poisonous and few are lethally poisonous. Identifying edible or poisonous mushroom through the naked eye is quite difficult. Even there is no easy rule for edibility identification using machine learning methods that work for all types of data. Our aim is to find a robust method for identifying mushrooms edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: Bagging, Boosting, and random forest. By using the most significant features, five feature sets are made for making five base models of each ensemble method. The accuracy is measured for ensemble methods using five both fixed feature set-based models and randomly selected feature set based models, for two types of test sets. The result shows that better performance is obtained for methods made of fixed feature sets-based models than randomly selected feature set-based models. The highest accuracy is obtained for the proposed model-based random forest for both test sets.

[...] Read more.
Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often explored to eliminate noise in an image to restore its proper characteristic formation and clarity. Unfortunately, noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique that possesses the capacity to optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast is proposed. The proposed method, herein referred to as the modified wavelet-based universal thresholding (MWUT), compared to three state-of-the-art denoising techniques, was employed to denoise five noisy images. In order to appraise the qualities of the images obtained, seven performance indicators comprising the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Method (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Blind Spatial Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) were employed. The first five indicators – RMSE, MAE, SC, PSNR, SSIM, and SRER- are reference indicators, while the remaining two – NIQE and BRISQUE- are referenceless. For the superior performance of the proposed wavelet threshold algorithm, the SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. A higher and better value of PSNR, SSIM, and SRER in the final results shows the superior performance of our proposed MWUT denoising technique over the preliminaries. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate higher and better image quality results using the proposed modified wavelet-based universal thresholding technique over the existing schemes. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.

[...] Read more.
Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image Processing is the art of examining, identifying and judging the significances of the Images. Image enhancement refers to attenuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures utilize the computers to provide good and improved images for study by the human interpreters. In this paper we proposed a novel method that uses the Genetic Algorithm with Multi-objective criteria to find more enhance version of images. The proposed method has been verified with benchmark images in Image Enhancement. The simple Genetic Algorithm may not explore much enough to find out more enhanced image. In the proposed method three objectives are taken in to consideration. They are intensity, entropy and number of edgels. Proposed algorithm achieved automatic image enhancement criteria by incorporating the objectives (intensity, entropy, edges). We review some of the existing Image Enhancement technique. We also compared the results of our algorithms with another Genetic Algorithm based techniques. We expect that further improvements can be achieved by incorporating linear relationship between some other techniques.

[...] Read more.
A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, while these methods rely on large labeled datasets. Self-Supervised learning (SSL) provides a new strategy to pre-train a neural network with unlabeled data. This is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. So, more and more researchers are trying to utilize SSL methods for medical image analysis, to meet the challenge of assembling large medical datasets. To our knowledge, so far there still a shortage of reviews of self-supervised learning methods in the field of medical image analysis, our work of this article aims to fill this gap and comprehensively review the application of self-supervised learning in the medical field. This article provides the latest and most detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in the field of medical imaging. These methods are divided into three categories: context-based, generation-based, and contrast-based, and then show the pros and cons of each category and evaluates their performance in downstream tasks. Finally, we conclude with the limitations of the current methods and discussed the future direction.

[...] Read more.
Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the area of computer vision and pattern recognition. These areas are also a part of digital image processing, where researchers have a great attention in the area of content retrieval information from various types of images having complex background, low contrast background or multi-spectral background etc. These contents may be found in any form like texture data, shape, and objects. Text Region Extraction as a content from an mage is a class of problems in Digital Image Processing Applications that aims to provides necessary information which are widely used in many fields medical imaging, pattern recognition, Robotics, Artificial intelligent Transport systems etc. To extract the text data information has becomes a challenging task. Since, Text extraction are very useful for identifying and analysis the whole information about image, Therefore, In this paper, we propose a unified framework by combining morphological operations and Genetic Algorithms for extracting and analyzing the text data region which may be embedded in an image by means of variety of texts: font, size, skew angle, distortion by slant and tilt, shape of the object which texts are on, etc. We have established our proposed methods on gray level image sets and make qualitative and quantitative comparisons with other existing methods and concluded that proposed method is better than others.

[...] Read more.
Radio Receiver with Internal Compression of Input Signals Using a Dispersive Delay Line with Bandpass Filters

By Roman Pantyeyev Felix Yanovsky Andriy Mykolushko Volodymyr Shutko

DOI: https://doi.org/10.5815/ijigsp.2023.06.01, Pub. Date: 8 Dec. 2023

This article proposes a receiving device in which arbitrary input signals are subject to pre-detector processing for the subsequent implementation of the idea of compressing broadband modulated pulses with a matched filter to increase the signal-to-noise ratio and improve resolution. For this purpose, a model of a dispersive delay line is developed based on series-connected high-frequency time delay lines with taps in the form of bandpass filters, and analysis of this model is performed as a part of the radio receiving device with chirp signal compression. The article presents the mathematical description of the processes of formation and compression of chirp signals based on their matched filtering using the developed model and proposes the block diagram of a radio receiving device using the principle of compression of received signals. The proposed model can be implemented in devices for receiving unknown signals, in particular in passive radar. It also can be used for studying signal compression processes based on linear frequency modulation in traditional radar systems.

[...] Read more.
Mobile-Based Skin Disease Diagnosis System Using Convolutional Neural Networks (CNN)

By M.W.P Maduranga Dilshan Nandasena

DOI: https://doi.org/10.5815/ijigsp.2022.03.05, Pub. Date: 8 Jun. 2022

This paper presents a design and development of an Artificial Intelligence (AI) based mobile application to detect the type of skin disease. Skin diseases are a serious hazard to everyone throughout the world. However, it is difficult to make accurate skin diseases diagnosis. In this work, Deep learning algorithms Convolution Neural Networks (CNN) is proposed to classify skin diseases on the HAM10000 dataset. An extensive review of research articles on object identification methods and a comparison of their relative qualities were given to find a method that would work well for detecting skin diseases. The CNN-based technique was recognized as the best method for identifying skin diseases. A mobile application, on the other hand, is built for quick and accurate action. By looking at an image of the afflicted area at the beginning of a skin illness, it assists patients and dermatologists in determining the kind of disease present. Its resilience in detecting the impacted region considerably faster with nearly 2x fewer computations than the standard MobileNet model results in low computing efforts. This study revealed that MobileNet with transfer learning yielding an accuracy of about 85% is the most suitable model for automatic skin disease identification. According to these findings, the suggested approach can assist general practitioners in quickly and accurately diagnosing skin diseases using the smart phone.

[...] Read more.
Enhancement of Mammographic Images Based on Wavelet Denoise and Morphological Contrast Enhancement

By Toan Le Van Liet Van Dang

DOI: https://doi.org/10.5815/ijigsp.2023.06.03, Pub. Date: 8 Dec. 2023

Breast cancer can be detected by mammograms, but not all of them are of high enough quality to be diagnosed by physicians or radiologists. Therefore, denoising and contrast enhancement in the image are issues that need to be addressed. There are numerous techniques to reduce noise and enhance contrast; the most popular of which incorporate spatial filters and histogram equalization. However, these techniques occasionally result in image blurring, particularly around the edges. The purpose of this article is to propose a technique that uses wavelet denoising in conjunction with top-hat and bottom-hat morphological transforms in the wavelet domain to reduce noise and image quality without distorting the image. Use five wavelet functions to test the proposed method: Haar, Daubechies (db3), Coiflet (coif3), Symlet (sym3), and Biorthogonal (bior1.3); each wavelet function employs levels 1 through 4 with four types of wavelet shrinkage: Bayer, Visu, SURE, and Normal. Three flat structuring elements in the shapes of a disk, a square, and a diamond with sizes 2, 5, 10, 15, 20, and 30 are utilized for top-hat and bottom-hat morphological transforms. To determine optimal parameters, the proposed method is applied to mdb001 mammogram (mini MIAS database) contaminated with Gaussian noise with SD, ? = 20. Based on the quality assessment quantities, the Symlet wavelet (sym3) at level 3, with Visu shrinkage and diamond structuring element size 5 produced the best results (MSE = 50.020, PSNR = 31.140, SSIM = 0.407, and SC = 1.008). The results demonstrate the efficacy of the proposed method.

[...] Read more.
Fast Encryption Scheme for Secure Transmission of e-Healthcare Images

By Devisha Tiwari Bhaskar Mondal Anil Singh

DOI: https://doi.org/10.5815/ijigsp.2023.05.07, Pub. Date: 8 Oct. 2023

E-healthcare systems (EHSD), medical communications, digital imaging (DICOM) things have gained popularity over the past decade as they have become the top contenders for interoperability and adoption as a global standard for transmitting and communicating medical data. Security is a growing issue as EHSD and DICOM have grown more usable on any-to-any devices. The goal of this research is to create a privacy-preserving encryption technique for EHSD rapid communication with minimal storage. A new 2D logistic-sine chaotic map (2DLSCM) is used to design the proposed encryption method, which has been developed specifically for peer-to-peer communications via unique keys. Through the 3D Lorenz map which feeds the initial values to it, the 2DLSCM is able to provide a unique keyspace of 2544 bits (2^544bits) in each go of peer-to-peer paired transmission. Permutation-diffusion design is used in the encryption process, and 2DLSCM with 3DLorenz system are used to generate unique initial values for the keys. Without interfering with real-time medical transmission, the approach can quickly encrypt any EHSD image and DICOM objects. To assess the method, five distinct EHSD images of different kinds, sizes, and quality are selected. The findings indicate strong protection, speed, and scalability when compared to existing similar methods in literature.

[...] Read more.
An Efficient Brain Tumor Detection Algorithm Using Watershed & Thresholding Based Segmentation

By Anam Mustaqeem Engr Ali Javed Tehseen Fatima

DOI: https://doi.org/10.5815/ijigsp.2012.10.05, Pub. Date: 28 Sep. 2012

During past few years, brain tumor segmentation in magnetic resonance imaging (MRI) has become an emergent research area in the ?eld of medical imaging system. Brain tumor detection helps in finding the exact size and location of tumor. An efficient algorithm is proposed in this paper for tumor detection based on segmentation and morphological operators. Firstly quality of scanned image is enhanced and then morphological operators are applied to detect the tumor in the scanned image.

[...] Read more.
Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image Processing is the art of examining, identifying and judging the significances of the Images. Image enhancement refers to attenuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures utilize the computers to provide good and improved images for study by the human interpreters. In this paper we proposed a novel method that uses the Genetic Algorithm with Multi-objective criteria to find more enhance version of images. The proposed method has been verified with benchmark images in Image Enhancement. The simple Genetic Algorithm may not explore much enough to find out more enhanced image. In the proposed method three objectives are taken in to consideration. They are intensity, entropy and number of edgels. Proposed algorithm achieved automatic image enhancement criteria by incorporating the objectives (intensity, entropy, edges). We review some of the existing Image Enhancement technique. We also compared the results of our algorithms with another Genetic Algorithm based techniques. We expect that further improvements can be achieved by incorporating linear relationship between some other techniques.

[...] Read more.
Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are the most familiar delicious food which is cholesterol free as well as rich in vitamins and minerals. Though nearly 45,000 species of mushrooms have been known throughout the world, most of them are poisonous and few are lethally poisonous. Identifying edible or poisonous mushroom through the naked eye is quite difficult. Even there is no easy rule for edibility identification using machine learning methods that work for all types of data. Our aim is to find a robust method for identifying mushrooms edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: Bagging, Boosting, and random forest. By using the most significant features, five feature sets are made for making five base models of each ensemble method. The accuracy is measured for ensemble methods using five both fixed feature set-based models and randomly selected feature set based models, for two types of test sets. The result shows that better performance is obtained for methods made of fixed feature sets-based models than randomly selected feature set-based models. The highest accuracy is obtained for the proposed model-based random forest for both test sets.

[...] Read more.
Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often explored to eliminate noise in an image to restore its proper characteristic formation and clarity. Unfortunately, noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique that possesses the capacity to optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast is proposed. The proposed method, herein referred to as the modified wavelet-based universal thresholding (MWUT), compared to three state-of-the-art denoising techniques, was employed to denoise five noisy images. In order to appraise the qualities of the images obtained, seven performance indicators comprising the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Method (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Blind Spatial Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE) were employed. The first five indicators – RMSE, MAE, SC, PSNR, SSIM, and SRER- are reference indicators, while the remaining two – NIQE and BRISQUE- are referenceless. For the superior performance of the proposed wavelet threshold algorithm, the SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. A higher and better value of PSNR, SSIM, and SRER in the final results shows the superior performance of our proposed MWUT denoising technique over the preliminaries. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate higher and better image quality results using the proposed modified wavelet-based universal thresholding technique over the existing schemes. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.

[...] Read more.
A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, while these methods rely on large labeled datasets. Self-Supervised learning (SSL) provides a new strategy to pre-train a neural network with unlabeled data. This is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. So, more and more researchers are trying to utilize SSL methods for medical image analysis, to meet the challenge of assembling large medical datasets. To our knowledge, so far there still a shortage of reviews of self-supervised learning methods in the field of medical image analysis, our work of this article aims to fill this gap and comprehensively review the application of self-supervised learning in the medical field. This article provides the latest and most detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in the field of medical imaging. These methods are divided into three categories: context-based, generation-based, and contrast-based, and then show the pros and cons of each category and evaluates their performance in downstream tasks. Finally, we conclude with the limitations of the current methods and discussed the future direction.

[...] Read more.
Breast Cancer Classification from Ultrasound Images using VGG16 Model based Transfer Learning

By A. B. M. Aowlad Hossain Jannatul Kamrun Nisha Fatematuj Johora

DOI: https://doi.org/10.5815/ijigsp.2023.01.02, Pub. Date: 8 Feb. 2023

Ultrasound based breast screening is gaining attention recently especially for dense breast. The technological advancement, cancer awareness, and cost-safety-availability benefits lead rapid rise of breast ultrasound market. The irregular shape, intensity variation, and additional blood vessels of malignant cancer are distinguishable in ultrasound images from the benign phase. However, classification of breast cancer using ultrasound images is a difficult process owing to speckle noise and complex textures of breast. In this paper, a breast cancer classification method is presented using VGG16 model based transfer learning approach. We have used median filter to despeckle the images. The layers for convolution process of the pretrained VGG16 model along with the maxpooling layers have been used as feature extractor and a proposed fully connected two layers deep neural network has been designed as classifier. Adam optimizer is used with learning rate of 0.001 and binary cross-entropy is chosen as the loss function for model optimization. Dropout of hidden layers is used to avoid overfitting. Breast Ultrasound images from two databases (total 897 images) have been combined to train, validate and test the performance and generalization strength of the classifier. Experimental results showed the training accuracy as 98.2% and testing accuracy as 91% for blind testing data with a reduced of computational complexity. Gradient class activation mapping (Grad-CAM) technique has been used to visualize and check the targeted regions localization effort at the final convolutional layer and found as noteworthy. The outcomes of this work might be useful for the clinical applications of breast cancer diagnosis.

[...] Read more.
Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the area of computer vision and pattern recognition. These areas are also a part of digital image processing, where researchers have a great attention in the area of content retrieval information from various types of images having complex background, low contrast background or multi-spectral background etc. These contents may be found in any form like texture data, shape, and objects. Text Region Extraction as a content from an mage is a class of problems in Digital Image Processing Applications that aims to provides necessary information which are widely used in many fields medical imaging, pattern recognition, Robotics, Artificial intelligent Transport systems etc. To extract the text data information has becomes a challenging task. Since, Text extraction are very useful for identifying and analysis the whole information about image, Therefore, In this paper, we propose a unified framework by combining morphological operations and Genetic Algorithms for extracting and analyzing the text data region which may be embedded in an image by means of variety of texts: font, size, skew angle, distortion by slant and tilt, shape of the object which texts are on, etc. We have established our proposed methods on gray level image sets and make qualitative and quantitative comparisons with other existing methods and concluded that proposed method is better than others.

[...] Read more.
Retinal Image Segmentation for Diabetic Retinopathy Detection using U-Net Architecture

By Swapnil V. Deshmukh Apash Roy Pratik Agrawal

DOI: https://doi.org/10.5815/ijigsp.2023.01.07, Pub. Date: 8 Feb. 2023

Diabetic retinopathy is one of the most serious eye diseases and can lead to permanent blindness if not diagnosed early. The main cause of this is diabetes. Not every diabetic will develop diabetic retinopathy, but the risk of developing diabetes is undeniable. This requires the early diagnosis of Diabetic retinopathy. Segmentation is one of the approaches which is useful for detecting the blood vessels in the retinal image. This paper proposed the three models based on a deep learning approach for recognizing blood vessels from retinal images using region-based segmentation techniques. The proposed model consists of four steps preprocessing, Augmentation, Model training, and Performance measure. The augmented retinal images are fed to the three models for training and finally, get the segmented image. The proposed three models are applied on publically available data set of DRIVE, STARE, and HRF. It is observed that more thin blood vessels are segmented on the retinal image in the HRF dataset using model-3. The performance of proposed three models is compare with other state-of-art-methods of blood vessels segmentation of DRIVE, STARE, and HRF datasets.

[...] Read more.
Real-Time Video based Human Suspicious Activity Recognition with Transfer Learning for Deep Learning

By Indhumathi .J Balasubramanian .M Balasaigayathri .B

DOI: https://doi.org/10.5815/ijigsp.2023.01.05, Pub. Date: 8 Feb. 2023

Nowadays, the primary concern of any society is providing safety to an individual. It is very hard to recognize the human behaviour and identify whether it is suspicious or normal. Deep learning approaches paved the way for the development of various machine learning and artificial intelligence. The proposed system detects real-time human activity using a convolutional neural network. The objective of the study is to develop a real-time application for Activity recognition using with and without transfer learning methods. The proposed system considers criminal, suspicious and normal categories of activities. Differentiate suspicious behaviour videos are collected from different peoples(men/women). This proposed system is used to detect suspicious activities of a person. The novel 2D-CNN, pre-trained VGG-16 and ResNet50 is trained on video frames of human activities such as normal and suspicious behaviour. Similarly, the transfer learning in VGG16 and ResNet50 is trained using human suspicious activity datasets. The results show that the novel 2D-CNN, VGG16, and ResNet50 without transfer learning achieve accuracy of 98.96%, 97.84%, and 99.03%, respectively. In Kaggle/real-time video, the proposed system employing 2D-CNN outperforms the pre-trained model VGG16. The trained model is used to classify the activity in the real-time captured video. The performance obtained on ResNet50 with transfer learning accuracy of 99.18% is higher than VGG16 transfer learning accuracy of 98.36%. 

[...] Read more.
A Review on Image Reconstruction through MRI k-Space Data

By Tanuj Kumar Jhamb Vinith Rejathalal V.K. Govindan

DOI: https://doi.org/10.5815/ijigsp.2015.07.06, Pub. Date: 8 Jun. 2015

Image reconstruction is the process of generating an image of an object from the signals captured by the scanning machine. Medical imaging is an interdisciplinary field combining physics, biology, mathematics and computational sciences. This paper provides a complete overview of image reconstruction process in MRI (Magnetic Resonance Imaging). It reviews the computational aspect of medical image reconstruction. MRI is one of the commonly used medical imaging techniques. The data collected by MRI scanner for image reconstruction is called the k-space data. For reconstructing an image from k-space data, there are various algorithms such as Homodyne algorithm, Zero Filling method, Dictionary Learning, and Projections onto Convex Set method. All the characteristics of k-space data and MRI data collection technique are reviewed in detail. The algorithms used for image reconstruction discussed in detail along with their pros and cons. Various modern magnetic resonance imaging techniques like functional MRI, diffusion MRI have also been introduced. The concepts of classical techniques like Expectation Maximization, Sensitive Encoding, Level Set Method, and the recent techniques such as Alternating Minimization, Signal Modeling, and Sphere Shaped Support Vector Machine are also reviewed. It is observed that most of these techniques enhance the gradient encoding and reduce the scanning time. Classical algorithms provide undesirable blurring effect when the degree of phase variation is high in partial k-space. Modern reconstructions algorithms such as Dictionary learning works well even with high phase variation as these are iterative procedures.

[...] Read more.
Radio Receiver with Internal Compression of Input Signals Using a Dispersive Delay Line with Bandpass Filters

By Roman Pantyeyev Felix Yanovsky Andriy Mykolushko Volodymyr Shutko

DOI: https://doi.org/10.5815/ijigsp.2023.06.01, Pub. Date: 8 Dec. 2023

This article proposes a receiving device in which arbitrary input signals are subject to pre-detector processing for the subsequent implementation of the idea of compressing broadband modulated pulses with a matched filter to increase the signal-to-noise ratio and improve resolution. For this purpose, a model of a dispersive delay line is developed based on series-connected high-frequency time delay lines with taps in the form of bandpass filters, and analysis of this model is performed as a part of the radio receiving device with chirp signal compression. The article presents the mathematical description of the processes of formation and compression of chirp signals based on their matched filtering using the developed model and proposes the block diagram of a radio receiving device using the principle of compression of received signals. The proposed model can be implemented in devices for receiving unknown signals, in particular in passive radar. It also can be used for studying signal compression processes based on linear frequency modulation in traditional radar systems.

[...] Read more.