Multi-Domain-Expert-Learning
/

expert-philpapers

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

tosin commited on Jun 18, 2023

Commit

9a3cf82

·

1 Parent(s): 139ee8a

update model card README.md

Files changed (1) hide show

README.md +11 -26

README.md CHANGED Viewed

@@ -1,23 +1,18 @@
 ---
 license: apache-2.0
-datasets:
-- Multi-Domain-Expert-Layers/philpapers
-language:
-- en
-metrics:
-- accuracy
-library_name: transformers
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# layer_9,10,11,12,13
-This model is a fine-tuned version of [EleutherAI/pythia-1b-deduped](https://huggingface.co/EleutherAI/pythia-1b-deduped) on the philpapers dataset.
-It achieves the following results on the evaluation set:
-- Loss: 2.8991
-- Accuracy: 0.4548
 ## Model description
@@ -38,31 +33,21 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
 - train_batch_size: 1
-- eval_batch_size: 1
 - seed: 42
-- distributed_type: multi-GPU
-- num_devices: 8
 - gradient_accumulation_steps: 8
-- total_train_batch_size: 64
-- total_eval_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - training_steps: 1000
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 2.6694        | 0.14  | 200  | 2.9416          | 0.4486   |
-| 2.6174        | 0.29  | 400  | 2.9312          | 0.4502   |
-| 2.611         | 0.43  | 600  | 2.9167          | 0.4519   |
-| 2.576         | 0.57  | 800  | 2.9057          | 0.4537   |
-| 2.5796        | 0.72  | 1000 | 2.8991          | 0.4548   |
 ### Framework versions
 - Transformers 4.28.1
-- Pytorch 2.0.0+cu117
 - Datasets 2.11.0
-- Tokenizers 0.13.3

 ---
 license: apache-2.0
+tags:
+- generated_from_trainer
+model-index:
+- name: expert-philpapers
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# expert-philpapers
+This model is a fine-tuned version of [EleutherAI/pythia-1b-deduped](https://huggingface.co/EleutherAI/pythia-1b-deduped) on an unknown dataset.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
 - train_batch_size: 1
+- eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 8
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - training_steps: 1000
 ### Training results
 ### Framework versions
 - Transformers 4.28.1
+- Pytorch 2.0.1+cu117
 - Datasets 2.11.0
+- Tokenizers 0.13.3