Juardo commited on
Commit
21093f1
·
verified ·
1 Parent(s): d75de34

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -15,12 +15,12 @@ metrics:
15
 
16
  # CIFAR10 LeNet5 Variation 2: GELU + Dropout Layer
17
 
18
- This repository contains our second variation of the original LeNet5 architecture adapted for CIFAR-10. The model consists of two convolutional layers followed by two fully connected layers a dropout layer (p=0.5) and a final fully connected layer, using linear (GELU) activations, extending variation 1, and Kaiming uniform initialization. It is trained with a batch size of 32 using the Adam optimizer (learning rate 0.001) and CrossEntropyLoss. In our experiments, this model achieved a test loss of 0.0539 and a top-1 accuracy of 58.52% on CIFAR-10.
19
 
20
  ## Model Details
21
 
22
  - **Architecture:** 2 Convolutional Layers, 2 Fully Connected Layers, 1 Dropout Layer, 1 Final Fully Connected Layer.
23
- - **Activations:** ReLU (might switch to tanh).
24
  - **Weight Initialization:** Kaiming Uniform.
25
  - **Optimizer:** Adam (lr=0.001).
26
  - **Loss Function:** CrossEntropyLoss.
 
15
 
16
  # CIFAR10 LeNet5 Variation 2: GELU + Dropout Layer
17
 
18
+ This repository contains our second variation of the original LeNet5 architecture adapted for CIFAR-10. The model consists of two convolutional layers followed by two fully connected layers a dropout layer (p=0.5) and a final fully connected layer, using linear (GELU) activations, extending variation 1, and Kaiming uniform initialization. It is trained with a batch size of 32 using the Adam optimizer (learning rate 0.001) and CrossEntropyLoss. In our experiments, this model achieved a test loss of 0.0316 and a top-1 accuracy of 64.71% on CIFAR-10.
19
 
20
  ## Model Details
21
 
22
  - **Architecture:** 2 Convolutional Layers, 2 Fully Connected Layers, 1 Dropout Layer, 1 Final Fully Connected Layer.
23
+ - **Activations:** GELU.
24
  - **Weight Initialization:** Kaiming Uniform.
25
  - **Optimizer:** Adam (lr=0.001).
26
  - **Loss Function:** CrossEntropyLoss.