CNN googlenet and alexnet architecture deep learning for diabetic retinopathy image processing and classification

Naeem Oleiwi Al-Mahdi, Israa

Advanced Search

View/Open

Yüksek Lisans Tezi / Master's Thesis (2.122Mb)

Date

2023

Author

Naeem Oleiwi Al-Mahdi, Israa

Metadata

Show full item record

Abstract

The objective of this study is to analyze the performance of the two widely used convolutional neural network (CNN) architectures, namely AlexNet and GoogLeNet, in order to determine their relative merits and demerits with regard to the classification of photographic images. The purpose of this test is to evaluate how effectively they can recognize and prepare for a variety of various kinds of pictures. Experiments and analyses include a broad variety of subjects, such as the implementation of algorithms, various pre-processing methods, the continuation of algorithms, and various strategies for fine-tuning. Investigation is conducted into a variety of training period settings, and the findings and graphs obtained from these investigations are compared. The findings indicate that both AlexNet and GoogLeNet have the capability of being utilized in the process of photo classification. AlexNet already has a performance advantage over its rivals after only six rounds of training. In the beginning, GoogleNet was not as accurate as Caffe, but it quickly caught up by engaging in enormous training repetitions. In general, the findings of the study imply that selecting the appropriate CNN architecture should be driven more by necessity than by personal taste. While AlexNet offers high accuracy with fewer epochs, GoogLeNet shows potential for higher performance with further training. This research not only helps academics and practitioners make more educated judgments when selecting models for image classification tasks, but it also increases our understanding of CNN architectures.

Bu çalışmanın amacı, yaygın olarak kullanılan iki evrişimli sinir ağı (CNN) mimarisinin, yani AlexNet ve GoogLeNet'in performansını, fotoğrafik görüntülerin sınıflandırılmasına ilişkin göreli avantajlarını ve dezavantajlarını belirlemek için analiz etmektir. Bu testin amacı, çocukların çeşitli türden resimleri ne kadar etkili bir şekilde tanıyabildiklerini ve bunlara hazırlanabildiklerini değerlendirmektir. Deneyler ve analizler, algoritmaların uygulanması, çeşitli ön işleme yöntemleri, algoritmaların devamı ve çeşitli ince ayar stratejileri gibi çok çeşitli konuları içerir. Çeşitli eğitim periyodu ortamlarında inceleme yapılır ve bu incelemelerden elde edilen bulgular ve grafikler karşılaştırılır. Bulgular, hem AlexNet hem de GoogLeNet'in fotoğraf sınıflandırma sürecinde kullanılma yeteneğine sahip olduğunu göstermektedir. AlexNet, yalnızca altı turluk bir eğitimden sonra rakiplerine göre şimdiden bir performans avantajına sahip. Başlangıçta, GoogleNet, Caffe kadar doğru değildi, ancak muazzam eğitim tekrarlarıyla uğraşarak kısa sürede yakalandı. Genel olarak, çalışmanın bulguları, uygun CNN mimarisinin seçilmesinin kişisel zevkten çok zorunluluk tarafından yönlendirilmesi gerektiğini ima etmektedir. AlexNet daha az dönemle yüksek doğruluk sunarken, GoogLeNet daha fazla eğitimle daha yüksek performans potansiyeli gösteriyor. Bu araştırma, akademisyenlerin ve uygulayıcıların görüntü sınıflandırma görevleri için model seçerken daha eğitimli kararlar vermelerine yardımcı olmakla kalmıyor, aynı zamanda CNN mimarileri hakkındaki anlayışımızı da artırıyor.

URI

https://hdl.handle.net/11363/7133

Collections

Yüksek Lisans Tezleri [1219]