Update README.md
Browse files
README.md
CHANGED
@@ -15,12 +15,12 @@ metrics:
|
|
15 |
|
16 |
# CIFAR10 LeNet5 Variation 2: GELU + Dropout Layer
|
17 |
|
18 |
-
This repository contains our second variation of the original LeNet5 architecture adapted for CIFAR-10. The model consists of two convolutional layers followed by two fully connected layers a dropout layer (p=0.5) and a final fully connected layer, using linear (GELU) activations, extending variation 1, and Kaiming uniform initialization. It is trained with a batch size of 32 using the Adam optimizer (learning rate 0.001) and CrossEntropyLoss. In our experiments, this model achieved a test loss of 0.
|
19 |
|
20 |
## Model Details
|
21 |
|
22 |
- **Architecture:** 2 Convolutional Layers, 2 Fully Connected Layers, 1 Dropout Layer, 1 Final Fully Connected Layer.
|
23 |
-
- **Activations:**
|
24 |
- **Weight Initialization:** Kaiming Uniform.
|
25 |
- **Optimizer:** Adam (lr=0.001).
|
26 |
- **Loss Function:** CrossEntropyLoss.
|
|
|
15 |
|
16 |
# CIFAR10 LeNet5 Variation 2: GELU + Dropout Layer
|
17 |
|
18 |
+
This repository contains our second variation of the original LeNet5 architecture adapted for CIFAR-10. The model consists of two convolutional layers followed by two fully connected layers a dropout layer (p=0.5) and a final fully connected layer, using linear (GELU) activations, extending variation 1, and Kaiming uniform initialization. It is trained with a batch size of 32 using the Adam optimizer (learning rate 0.001) and CrossEntropyLoss. In our experiments, this model achieved a test loss of 0.0316 and a top-1 accuracy of 64.71% on CIFAR-10.
|
19 |
|
20 |
## Model Details
|
21 |
|
22 |
- **Architecture:** 2 Convolutional Layers, 2 Fully Connected Layers, 1 Dropout Layer, 1 Final Fully Connected Layer.
|
23 |
+
- **Activations:** GELU.
|
24 |
- **Weight Initialization:** Kaiming Uniform.
|
25 |
- **Optimizer:** Adam (lr=0.001).
|
26 |
- **Loss Function:** CrossEntropyLoss.
|