Malgorzata Siudek commited on
Commit
721437d
·
1 Parent(s): d56d7f5

Add VIS_NISP model checkpoint and metadata

Browse files
Files changed (2) hide show
  1. astropt/090M/ckpt.pt +3 -0
  2. astropt/090M/hparams.txt +41 -0
astropt/090M/ckpt.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:28734593a17ca4b6491882ac610681e4c81a12f7faa4e8a599c65844c896601f
3
+ size 1075685355
astropt/090M/hparams.txt ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ AstroPT-0089.5M
2
+ time: 1732276382
3
+ log_via_wandb: False
4
+ log_emissions: False
5
+ out_dir: logs/astropt090M_euclid_4chan
6
+ eval_interval: 100
7
+ log_interval: 10
8
+ checkpoint_interval: 500
9
+ eval_iters: 200
10
+ eval_only: False
11
+ always_save_checkpoint: True
12
+ init_from: scratch
13
+ use_hf: False
14
+ stream_hf_dataset: False
15
+ gradient_accumulation_steps: 20
16
+ batch_size: 32
17
+ spiral: True
18
+ block_size: 196
19
+ image_size: 224
20
+ num_workers: 64
21
+ n_layer: 12
22
+ n_head: 12
23
+ n_embd: 768
24
+ n_chan: 4
25
+ dropout: 0.0
26
+ bias: False
27
+ patch_size: 16
28
+ learning_rate: 0.0006
29
+ max_iters: 5000
30
+ weight_decay: 0.1
31
+ beta1: 0.9
32
+ beta2: 0.95
33
+ grad_clip: 1.0
34
+ decay_lr: True
35
+ warmup_iters: 2000
36
+ lr_decay_iters: 4950.0
37
+ min_lr: 5.9999999999999995e-05
38
+ backend: nccl
39
+ device: cuda
40
+ dtype: bfloat16
41
+ compile: False