Two-Stage Feature Selection Method Created for 20 Neurons Artificial Neural Networks for Automatic Breast Cancer Detection

Authors

  • Jalpa J. Patel V.T. Patel Department of Electronics and Communication Engineering, Chandubhai S. Patel Institute of Technology, Charotar University of Science and Technology, Gujarat, India
  • Sarman K. Hadia Gujarat Technological University, Gujarat, India

DOI:

https://doi.org/10.48048/tis.2023.4027

Keywords:

Multi-level threshold, Information Gain, Pearson correlation, Heat map, Boosted-ANN, Automatic Breast Cancer Detection (ABCD), Digital Database for Screening Mammography (DDSM)

Abstract

Breast cancer is a common deadly diseases in women. Initial recognition of breast cancer using mammogram images is a challenging task. Hence, this paper proposed a unique automatic diagnosis model for breast cancer. Initially, the mammogram images are preprocessed with a median filter and contrast limited adaptive histogram equalization (CLAHE). The pre-processed image is automatically segmented using the multilevel threshold method. Subsequently, statistical, texture, shape, and geometric features are extracted from the segmented image. So, the length of the feature vector is high, and it is important to identify optimum features. In this paper, the dimension of the feature vector is reduced by 2-stage feature selection methods. Initially, the feature vector is applied to the best first search method information gain (IG) with rank feature method, and then secondly, apply the Pearson correlation method (PCM). Artificial neural networks (ANNs) are used to increase the classification accuracy of a breast cancer diagnosis. In this model, the selection of appropriate neurons in a single hidden layer is used to avoid overfitting problems in an ANN model. Based on optimum feature selection, the appropriate number of neurons chosen in the hidden layer is 20, which was applied for the proposed IG+PCM+Boosted-ANN model. The proposed model is applied on 2 regular datasets mini-Mammographic Image Analysis Society (mini-MIAS) and Digital Database for Screening Mammography (DDSM). The proposed model was superior to other exiting models and the model in this study achieves the accuracy of 99 and 98.80 % for mini-MIAS and DDSM datasets, respectively.

 

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

H Sung, J Ferlay, RL Siegel, M Laversanne, I Soerjomataram, A Jemal and F Bray. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Canc. J. Clinicians 2021; 71, 209-49.

American Cancer Society. Breast cancer facts & figures 2017-2018. American Cancer Society, Inc., Georgia, 2017.

H Cheng, X Cai, X Chen, L Hu and X Lou. Computer-aided detection and classification of microcalcifications in mammograms: A survey. Pattern Recogn. 2003; 36, 2967-91.

S Gupta, PF Chyn and MK Markey. Breast cancer cadx based on bi-rads descriptors from two mammographic views. Med. Phys. 2006; 33, 1810-17.

TN Lal, O Chapelle, J Weston and A Elisseeff. Embedded methods. Feature Extraction. Springer, Berlin, Germany, 2006, p. 137-65.

MB Wadhonkar, PA Tijare and SN Sawalkar. Artificial neural network approach for classification of heart disease dataset. Int. J. Appl. Innovat. Eng. Manag. 2014; 3, 388-92.

N Elkhani and RC Muniyandi. Review of the effect of feature selection for microarray data on the classification accuracy for cancer data sets. Int. J. Soft Comput. 2016; 11, 334-42.

R Gururajan, XH Tao and PD Barua. A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recogn. Lett. 2020; 132, 123-31.

J Parker, D Dance, S Astley, I Hutt, C Boggis, I Ricketts, E Stamatakis, N Cerneaz, S Kok, P Taylor, D Betal and J Savage. Mammographic image analysis society (MIAS) database v1.21. Apollo - University of Cambridge Repository, Trinity Ln, Cambridge, 2015.

M Heath, K Bowyer, D Kopans, P Kegelmeyer, R Moore and K Chang. Current status of the digital database for screening mammography. Digital Mammography. Springer, Berlin, Germany, 1998, p. 457-60.

E Naeimeh and MR Chandren. Membrane computing inspired feature selection model for microarray cancer data. Intell. Data Anal. 2017; 21, S137-57.

MA Rahman and RC Muniyandi. An enhancement in cancer classification accuracy using a two-step feature selection method based on artificial neural networks with 15 neurons. Symmetry 2020; 12, 271.

L Gao, M Ye, X Lu and D Huang. Hybrid method based on information gain and support vector machine for gene selection in cancer classification. Dev. Reprod. Biol. 2017; 15, 389-95.

A Qayyum and A Basit. Automatic breast segmentation and cancer detection via svm in mammograms. In: Proceedings of the 2016 International Conference on Emerging Technologies, Islamabad, Pakistan. 2016.

BA Mohamed, NM Salem, MM Hadhoud and AF Seddik. Automatic segmentation and classification of masses from digital mammograms. Eur. J. Appl. Sci. 2016; 4, 17.

MF Ak. A comparative analysis of breast cancer detection and diagnosis using data visualization and machine learning applications. Healthcare 2020; 8, 111.

P Kaur, G Singh and P Kaur. Intellectual detection and validation of automated mammogram breast cancer images by multi-class SVM using deep learning classification. Informat. Med. Unlocked 2019; 16, 100239.

C Shravya, K Pravalika and S Subhani. Prediction of breast cancer using supervised machine learning techniques. Int. J. Innovat. Tech. Exploring Eng. 2019; 8, 1106-10.

F Mohanty, S Rup, B Dash, B Majhi and MNS Swamy. Mammogram classification using contourlet features with forest optimization-based feature selection approach. Multimed. Tool. Appl. 2019; 78, 12805-34.

VRE Christo, HK Nehemiah, B Minu and A Kannan. Correlation-based ensemble feature selection using bioinspired algorithms and classification using backpropagation neural network. Comput. Math. Meth. Med. 2019; 2019, 7398307.

R Chtihrakkannan, P Kavitha, T Mangayarkarasi, R Karthikeyan. Breast cancer detection using machine learning. Int. J. Innovat. Tech. Exploring Eng. 2019; 8, 3123-6.

R Alyami, J Alhajjaj, B Alnajrani, I Elaalami, A Alqahtani, N Aldhafferi, TO Owolabi and SO Olatunji. Investigating the effect of correlation based feature selection on breast cancer diagnosis using artificial neural network and support vector machines. In: Proceedings of the 2017 International Conference on Informatics, Health & Technology, Riyadh, Saudi Arabia. 2017.

S Thawkar and R Ingolikar. Classification of masses in digital mammograms using Biogeography-based optimization technique. J. King Saud Univ. Comput. Inform. Sci. 2020; 32, 1140-8.

R Erika, EDN Alam, A Oliver and R Zwiggelaar. Automatic segmentation of microcalcification clusters. In: Proceedings of the Annual Conference on Medical Image Understanding and Analysis, Southampton. 2018.

N Kharel, A Alsadoon, PWC Prasad and A Elchouemi. Early diagnosis of breast cancer using contrast limited adaptive histogram equalization (CLAHE) and morphology methods. In: Proceedings of the 2017 8th International Conference on Information and Communication Systems, Irbid, Jordan. 2017.

Y Zhang, TY Ji, MS Li and QH Wu. Identification of power disturbances using generalized morphological open-closing and close-opening undecimated wavelet. IEEE Trans. Ind. Electron. 2018; 63, 2330-9.

Q Chen, L Zhao, J Lu, G Kuang, N Wang and Y Jiang. Modified two-dimensional Otsu image segmentation algorithm and fast realization. IET Image Process. 2012, 6, 426-33.

KU Sheba and SG Raj. An approach for automatic lesion detection in mammograms. Cogent Eng. 2018; 5, 1444320.

JJ Patel and SK Hadia. An enhancement of mammogram images for breast cancer classification using artificial neural networks. IAES Int. J. Artif. Intell. 2021; 10, 332-45.

AK Mohanty, S Beberta and SK Lenka. Classifying benign and malignant mass using GLCM and GLRLM based on textures from mammogram. Int. J. Eng. Res. Appl. 2011; 1, 687-93.

P Zarbakhsh and A Addeh. Breast cancer tumor type recognition using graph feature selection technique and radial basis function neural network with optimal structure. J. Canc. Res. Therapeut. 2018; 14, 625-33.

PT Bharathi and P Subashini. Texture based color segmentation for infrared river ice images using K-means clustering. In: Proceedings of the 2013 International Conference on Signal Processing, Image Processing & Pattern Recognition, Coimbatore, India. 2013.

F Amiri, MR Yousefi and C Lcas. Mutual information based feature selection for intrusion detection system. J. Netw. Comput. Appl. 2011; 34, 1184-99.

M Sepandi, M Taghdir, A Rezaianzadeh and RS Assessing. Breast cancer risk with an artificial neural network. Asian Pac. J. Canc. Prev. 2018; 4, 1017-9.

MA Rahman, R Muniyandi, D Albashish, MM Rahman and OL Usman. Artificial neural network with Taguchi method for robust classification model to improve classification accuracy of breast cancer. PeerJ Comput. Sci. 2021; 7, e344.

SZ Ramadan. Using convolutional neural network with cheat sheet and data augmentation to detect breast cancer in mammograms. Comput. Math. Meth. Med. 2020; 2020, 9523404.

J Dheeba, NA Singh and ST Selvi. Computer-aided detection of breast cancer on mammograms: a swarm intelligence optimized wavelet neural network approach. J. Biomed. Informat. 2014; 49, 45-52.

DA Omondiagbe, S Veeramani and AS Sidhu. Machine learning classification techniques for breast cancer diagnosis. IOP Conf. Mater. Sci. Eng. 2019; 495, 012033.

M Pratiwi, Alexander, J Harefa and S Nanda. Mammograms classification using gray-level co-occurrence matrix and radial basis function neural network. Procedia Comput. Sci. 2015; 59, 83-91.

H Pezeshki, M Rastgarpour, A Sharifi and S Yazdani. Extraction of spiculated parts of mammogram tumors to improve accuracy of classification. Multimed. Tool. Appl. 2019; 78, 19979-20003.

P Gupta and S Garg. Breast cancer prediction using varying parameters of machine learning models. Procedia Comput. Sci. 2016; 171, 593-601.

S Chakraborty, MK Bhowmik, AK Ghosh and T Pal. Automated edge detection of breast masses on mammograms. In: Proceedings of the 2016 IEEE Region 10 Conference, Singapore. 2016.

KC Tatikonda, CM Bhuma and SK Samayamantula. The analysis of digital mammograms using HOG and GLCM features. In: Proceedings of the 2018 9th International Conference on Computing, Communication and Networking Technologies, Bengaluru, India. 2018.

Downloads

Published

2022-12-20

How to Cite

Patel, J. J. ., & Hadia, S. K. . (2022). Two-Stage Feature Selection Method Created for 20 Neurons Artificial Neural Networks for Automatic Breast Cancer Detection. Trends in Sciences, 20(2), 4027. https://doi.org/10.48048/tis.2023.4027