Please use this identifier to cite or link to this item:
http://dspace.azjhpc.org/xmlui/handle/123456789/509| Title: | DISTRIBUTED FEATURE EXTRACTION WITH AUTOENCODERS FOR BREAST CANCER DETECTION |
| Authors: | Naghizade, Elshan |
| Keywords: | Autoencoder;Feature Extraction;Breast Cancer;XGBoost;CatBoost |
| Issue Date: | 11-Nov-2025 |
| Publisher: | Azerbaijan Journal of High Performance Computing |
| Abstract: | In this study, a distributed feature extraction pipeline was developed for breast cancer detection us-ing autoencoders. Mammogram images were first derived from the RSNA dataset by converting DICOM files into PNG format. Twelve convolutional autoencoders were trained, where all layers were convolutional except for the bottleneck layer, which was defined as a fully connected layer. This layer served as the compressed feature vector. Variations across models were introduced by modifying the number of convolutional layers in encoders/decoders (3, 4, or 5) and the dimension-ality of the feature vector (128, 256, 512, or 1024). Training was conducted using mean squared error loss in a synchronous multi-worker setup on a four-node CPU cluster utilizing the Keras framework. After training, the extracted features were evaluated using three classification models: logistic regression, XGBoost, and CatBoost. The performance of each feature vector configuration was assessed based on accuracy, precision, recall, and F1-score. Through comparative analysis, the effectiveness of different vector sizes and model complexities in representing diagnostic features was determined. This approach demonstrated the feasibility of scalable, distributed feature extrac-tion for high-resolution medical imaging tasks, offering a practical framework for future breast cancer detection systems. |
| URI: | http://dspace.azjhpc.org/xmlui/handle/123456789/509 |
| ISSN: | 2616-6127 2617-4383 |
| Journal Title: | Azerbaijan Journal of High Performance Computing |
| Volume: | Volume 7 |
| Issue: | e2025.03 |
| First page number: | 1 |
| Last page number: | 10 |
| Number of pages: | 10 |
| Appears in Collections: | Azerbaijan Journal of High Performance Computing |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| doi.org.10.32010.26166127.2025.03.pdf | 241.99 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.