cedricbonhomme commited on
Commit
3e6075d
·
verified ·
1 Parent(s): c7a1f8d

End of training

Browse files
Files changed (3) hide show
  1. README.md +10 -12
  2. emissions.csv +1 -1
  3. model.safetensors +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.4995
22
- - Accuracy: 0.8421
23
 
24
  ## Model description
25
 
@@ -39,24 +39,22 @@ More information needed
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 3e-05
42
- - train_batch_size: 32
43
- - eval_batch_size: 32
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
- - num_epochs: 7
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:-----:|:------:|:---------------:|:--------:|
53
- | 0.6798 | 1.0 | 14844 | 0.6302 | 0.7400 |
54
- | 0.5937 | 2.0 | 29688 | 0.6037 | 0.7617 |
55
- | 0.5045 | 3.0 | 44532 | 0.5406 | 0.7846 |
56
- | 0.5463 | 4.0 | 59376 | 0.4999 | 0.8103 |
57
- | 0.3192 | 5.0 | 74220 | 0.4894 | 0.8257 |
58
- | 0.2919 | 6.0 | 89064 | 0.4923 | 0.8384 |
59
- | 0.3553 | 7.0 | 103908 | 0.4995 | 0.8421 |
60
 
61
 
62
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.5255
22
+ - Accuracy: 0.8184
23
 
24
  ## Model description
25
 
 
39
 
40
  The following hyperparameters were used during training:
41
  - learning_rate: 3e-05
42
+ - train_batch_size: 16
43
+ - eval_batch_size: 16
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 5
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:-----:|:------:|:---------------:|:--------:|
53
+ | 0.6959 | 1.0 | 29819 | 0.6482 | 0.7331 |
54
+ | 0.6137 | 2.0 | 59638 | 0.5869 | 0.7681 |
55
+ | 0.48 | 3.0 | 89457 | 0.5544 | 0.7899 |
56
+ | 0.4657 | 4.0 | 119276 | 0.5210 | 0.8109 |
57
+ | 0.3649 | 5.0 | 149095 | 0.5255 | 0.8184 |
 
 
58
 
59
 
60
  ### Framework versions
emissions.csv CHANGED
@@ -1,2 +1,2 @@
1
  timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2
- 2025-12-16T12:21:26,codecarbon,2f52b7fd-7978-4654-8d58-6102e50694c2,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,20689.402613584185,0.8893559856050955,4.2986064035564027e-05,42.5,212.93509257980364,755.7507977485657,0.24405337297347032,3.865553222162319,4.339287941454435,8.448894536590212,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0
 
1
  timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2
+ 2025-12-19T15:50:47,codecarbon,606219ce-3144-49b8-a9d0-92eede4b6daf,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,31881.930166431703,1.3961973144618327,4.379274740184585e-05,42.5,583.3473472300943,755.7507977485657,0.3761313160684887,6.199754455077368,6.688008612083131,13.263894383228985,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:02c737ebf27e494466114d7215088ee58e26265b061109f0da083a1c7afa24f5
3
  size 498618976
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9be19b9a22e6855a76c546488f173f0c58a7c9242cd3f5b17d8e91922af10580
3
  size 498618976