Juardo committed on
Commit a449177 · verified · 1 Parent(s): c2a4366

Create README.md

Files changed (1)
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
+ ---
+ library_name: PyTorch
+ tags:
+ - cnn
+ - lenet
+ - cifar100
+ - image-classification
+ datasets:
+ - uoft-cs/cifar100
+ language:
+ - en
+ metrics:
+ - accuracy
+ ---
+
+ # CIFAR10 LeNet5 Variation 2: GELU + Dropout Layer
+
+ This repository contains our best variation of the LeNet5 architecture that we originally adapted for CIFAR-10; here we reuse that architecture and train it on CIFAR-100. The model consists of two convolutional layers followed by two fully connected layers, a dropout layer (p=0.5), and a final fully connected layer. It extends variation 1 and uses GELU (Gaussian Error Linear Unit) activations with Kaiming uniform initialization. It is trained with a batch size of 32 using the Adam optimizer (learning rate 0.001) and CrossEntropyLoss. In our experiments, this model achieved a test loss of 0.0572 and a top-1 accuracy of 43.08% on CIFAR-100.
+
+ ## Model Details
+
+ - **Architecture:** 2 Convolutional Layers, 2 Fully Connected Layers, 1 Dropout Layer, 1 Final Fully Connected Layer.
+ - **Activations:** GELU.
+ - **Weight Initialization:** Kaiming Uniform.
+ - **Optimizer:** Adam (lr=0.001).
+ - **Loss Function:** CrossEntropyLoss.
+ - **Dataset:** CIFAR-100.
+
+ ## Usage
+
+ Load this model in PyTorch to fine-tune or evaluate on CIFAR-100 using your training and evaluation scripts.
+
+ ---
+
+ Feel free to update this model card with further training details, benchmarks, or usage examples.
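
The README added in this commit describes the architecture and training setup but includes no code. The following is a minimal sketch of what such a model might look like in PyTorch, not the repository's actual implementation. The class name `LeNet5GELUDropout`, the channel and layer widths (6, 16, 120, 84, taken from classic LeNet-5), the 5x5 kernels, the max pooling, and the `nonlinearity="relu"` gain used for Kaiming initialization are all assumptions; only the layer counts, GELU, dropout with p=0.5, Adam with lr=0.001, CrossEntropyLoss, and the batch size of 32 come from the README. Random tensors stand in for a CIFAR-100 batch.

```python
import torch
import torch.nn as nn


class LeNet5GELUDropout(nn.Module):
    """Hypothetical reconstruction: 2 conv layers, 2 FC layers, dropout, final FC.

    Layer widths, kernel sizes, and pooling follow classic LeNet-5 and are
    assumptions; the README does not state them.
    """

    def __init__(self, num_classes: int = 100, p_drop: float = 0.5):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 6, kernel_size=5),   # 3x32x32 -> 6x28x28
            nn.GELU(),
            nn.MaxPool2d(2),                  # -> 6x14x14 (pooling type assumed)
            nn.Conv2d(6, 16, kernel_size=5),  # -> 16x10x10
            nn.GELU(),
            nn.MaxPool2d(2),                  # -> 16x5x5
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120),
            nn.GELU(),
            nn.Linear(120, 84),
            nn.GELU(),
            nn.Dropout(p=p_drop),             # dropout before the final FC layer
            nn.Linear(84, num_classes),
        )
        # Kaiming uniform initialization. PyTorch's gain table has no GELU
        # entry, so the ReLU gain is used here as an assumption.
        for module in self.modules():
            if isinstance(module, (nn.Conv2d, nn.Linear)):
                nn.init.kaiming_uniform_(module.weight, nonlinearity="relu")
                nn.init.zeros_(module.bias)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x))


model = LeNet5GELUDropout()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
criterion = nn.CrossEntropyLoss()

# Random stand-in for one CIFAR-100 batch of size 32.
images = torch.randn(32, 3, 32, 32)
labels = torch.randint(0, 100, (32,))

# One training step.
model.train()
optimizer.zero_grad()
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()

# Evaluation: eval() disables dropout; top-1 accuracy on the same batch.
model.eval()
with torch.no_grad():
    logits = model(images)
    accuracy = (logits.argmax(dim=1) == labels).float().mean().item()
```

To evaluate on real data, the random tensors could be replaced with batches from a CIFAR-100 `DataLoader`, and published weights (if the repository provides a state dict with matching layer names, which is not confirmed here) could be loaded with `model.load_state_dict(...)`.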