Hamid Shojanazeri
|
4ba4400a75
adding dist barrier before and after checkpointing
|
1 year ago |
chauhang
|
95d59afcb8
Update PR template
|
1 year ago |
chauhang
|
857a3ade4e
Add PR template
|
1 year ago |
chauhang
|
9f9532d34c
comm
|
1 year ago |
Christian Miller
|
9b2f72e1f5
update README: python 3.8 rec + fix formatting
|
1 year ago |
Hamid Shojanazeri
|
a49a2c2804
adding PT cuda allocation expand flag
|
1 year ago |
Geeta Chauhan
|
905f633dab
adding issue tempalte (#57)
|
1 year ago |
Hamid Shojanazeri
|
b814704b5f
adding issue tempalte
|
1 year ago |
Hamid Shojanazeri
|
442c1ccf7c
adding barrier to end of trainer loop
|
1 year ago |
Hamid Shojanazeri
|
f74d57dc08
printing scores based on fsdp usage or single gpu
|
1 year ago |
Hamid Shojanazeri
|
3d887ea483
update with active memory and removing rank0 for eval score
|
1 year ago |
sekyonda
|
0d9c1a909f
Update markdown_link_check_config.json
|
1 year ago |
Hamid Shojanazeri
|
bedb96b78a
fixing the full state path in checkpoint handler
|
1 year ago |
sekyondaMeta
|
b625dceb9b
Create spellcheck.yml
|
1 year ago |
Kaiser Pister
|
b61c45d31d
Fix broken links in Dataset.md
|
1 year ago |
Hamid Shojanazeri
|
569f8b7976
fixed arg names
|
1 year ago |
Hamid Shojanazeri
|
9e3b1b7f01
fixed arg names
|
1 year ago |
Hamid Shojanazeri
|
4b18e49f44
added steps for conversion of fsdp to Hf
|
1 year ago |
Geeta Chauhan
|
74bde65a62
Adding Supporting Files For link and Spell Check (#26)
|
1 year ago |
Hamid Shojanazeri
|
a977145a9b
change bf16 default to false
|
1 year ago |
Hamid Shojanazeri
|
563e572f7c
adding active mem stat
|
1 year ago |
Hamid Shojanazeri
|
83fde7b94b
Fix cuda id for using quantization (#40)
|
1 year ago |
Hamid Shojanazeri
|
bd01f64cbd
Merge branch 'main' into fix-cuda_id
|
1 year ago |
Hamid Shojanazeri
|
4320528d40
clean up
|
1 year ago |
Hamid Shojanazeri
|
171c3589a6
add converter script
|
1 year ago |
Hamid Shojanazeri
|
ceaf6301e9
remove the unused code
|
1 year ago |
Geeta Chauhan
|
e5970e2e1f
Improve FSDP LoRA Memory Usage (#41)
|
1 year ago |
Andrew Gu
|
71fdc4920a
Save memory and fix typos
|
1 year ago |
Hamid Shojanazeri
|
a7156dfb5d
fixing the cuda id
|
1 year ago |
Hamid Shojanazeri
|
707af7ea24
adding cuda:0 for non-fsdp situations
|
1 year ago |