Hamid Shojanazeri
|
a955ed1999
added checks for dist barrier and commented cuda exapnadable segements and dist_dbug
|
1 year ago |
Hamid Shojanazeri
|
a2403c7c1a
clean up
|
1 year ago |
Hamid Shojanazeri
|
e9559d2669
fixing the train/eval_loss calcualtion
|
1 year ago |
Hamid Shojanazeri
|
4ba4400a75
adding dist barrier before and after checkpointing
|
1 year ago |
Hamid Shojanazeri
|
a49a2c2804
adding PT cuda allocation expand flag
|
1 year ago |
Hamid Shojanazeri
|
442c1ccf7c
adding barrier to end of trainer loop
|
1 year ago |
Hamid Shojanazeri
|
f74d57dc08
printing scores based on fsdp usage or single gpu
|
1 year ago |
Hamid Shojanazeri
|
3d887ea483
update with active memory and removing rank0 for eval score
|
1 year ago |
Hamid Shojanazeri
|
bedb96b78a
fixing the full state path in checkpoint handler
|
1 year ago |
Geeta Chauhan
|
74bde65a62
Adding Supporting Files For link and Spell Check (#26)
|
1 year ago |
Hamid Shojanazeri
|
83fde7b94b
Fix cuda id for using quantization (#40)
|
1 year ago |
Hamid Shojanazeri
|
bd01f64cbd
Merge branch 'main' into fix-cuda_id
|
1 year ago |
Geeta Chauhan
|
e5970e2e1f
Improve FSDP LoRA Memory Usage (#41)
|
1 year ago |
Andrew Gu
|
71fdc4920a
Save memory and fix typos
|
1 year ago |
Hamid Shojanazeri
|
a7156dfb5d
fixing the cuda id
|
1 year ago |
Hamid Shojanazeri
|
707af7ea24
adding cuda:0 for non-fsdp situations
|
1 year ago |
sekyonda
|
f93d4c891e
Updates per Geeta's request
|
1 year ago |
sekyondaMeta
|
226a10df75
Merge branch 'facebookresearch:main' into spellCheck
|
1 year ago |
Geeta Chauhan
|
1e0f8a1fb7
fixing scaler for both fsdp and non fsdp (#34)
|
1 year ago |
Hamid Shojanazeri
|
6678be75ad
fixing identation
|
1 year ago |
Hamid Shojanazeri
|
6a84e9e4d5
fixing scaler for both fsdp and non fsdp
|
1 year ago |
Geeta Chauhan
|
1838378e0a
fixing the condition for moving to cuda (#33)
|
1 year ago |
Hamid Shojanazeri
|
065ddaa77b
fixing the condition for moving to cuda
|
1 year ago |
Geeta Chauhan
|
0493768dc1
fix typos and spelling errors (#20)
|
1 year ago |
Geeta Chauhan
|
c8cd2f40b1
modify to steping the lr scheduler each epoch (#28)
|
1 year ago |
Hamid Shojanazeri
|
20b061e01c
modify to steping the lr scheduler each epoch
|
1 year ago |
sekyonda
|
1da98b14f0
Update README.md
|
1 year ago |
sekyonda
|
07991603ff
Update inference.md
|
1 year ago |
sekyonda
|
eb2ed73bb3
Update README.md
|
1 year ago |
sekyonda
|
7af5cc4292
Adding Supporting Files For link and Spell Check
|
1 year ago |