End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4995
-- Accuracy: 0.8421
 ## Model description
@@ -39,24 +39,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 7
 ### Training results
 | Training Loss | Epoch | Step   | Validation Loss | Accuracy |
 |:-------------:|:-----:|:------:|:---------------:|:--------:|
-| 0.6798        | 1.0   | 14844  | 0.6302          | 0.7400   |
-| 0.5937        | 2.0   | 29688  | 0.6037          | 0.7617   |
-| 0.5045        | 3.0   | 44532  | 0.5406          | 0.7846   |
-| 0.5463        | 4.0   | 59376  | 0.4999          | 0.8103   |
-| 0.3192        | 5.0   | 74220  | 0.4894          | 0.8257   |
-| 0.2919        | 6.0   | 89064  | 0.4923          | 0.8384   |
-| 0.3553        | 7.0   | 103908 | 0.4995          | 0.8421   |
 ### Framework versions

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5255
+- Accuracy: 0.8184
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step   | Validation Loss | Accuracy |
 |:-------------:|:-----:|:------:|:---------------:|:--------:|
+| 0.6959        | 1.0   | 29819  | 0.6482          | 0.7331   |
+| 0.6137        | 2.0   | 59638  | 0.5869          | 0.7681   |
+| 0.48          | 3.0   | 89457  | 0.5544          | 0.7899   |
+| 0.4657        | 4.0   | 119276 | 0.5210          | 0.8109   |
+| 0.3649        | 5.0   | 149095 | 0.5255          | 0.8184   |
 ### Framework versions

emissions.csv CHANGED Viewed

	@@ -1,2 +1,2 @@
1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	- 2025-12-~~16T12~~:21:26,codecarbon,~~2f52b7fd~~-~~7978~~-~~4654~~-~~8d58~~-~~6102e50694c2~~,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,~~20689~~.~~402613584185~~,0.~~8893559856050955~~,4.~~2986064035564027e~~-05,42.5,~~212~~.~~93509257980364~~,755.7507977485657,0.~~24405337297347032~~,3.~~865553222162319~~,4.~~339287941454435~~,8.~~448894536590212~~,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0


1	timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	+ 2025-12-19T15:50:47,codecarbon,606219ce-3144-49b8-a9d0-92eede4b6daf,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,31881.930166431703,1.3961973144618327,4.379274740184585e-05,42.5,583.3473472300943,755.7507977485657,0.3761313160684887,6.199754455077368,6.688008612083131,13.263894383228985,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,N,1.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:02c737ebf27e494466114d7215088ee58e26265b061109f0da083a1c7afa24f5
 size 498618976

 version https://git-lfs.github.com/spec/v1
+oid sha256:9be19b9a22e6855a76c546488f173f0c58a7c9242cd3f5b17d8e91922af10580
 size 498618976