Commit History

Author SHA1 Message Date
  Hamid Shojanazeri 31d6ce8bf6 adding expnadable sgement and dist debug flag info 1 year ago
  Hamid Shojanazeri a955ed1999 added checks for dist barrier and commented cuda exapnadable segements and dist_dbug 1 year ago
  Hamid Shojanazeri a2403c7c1a clean up 1 year ago
  Hamid Shojanazeri e9559d2669 fixing the train/eval_loss calcualtion 1 year ago
  Geeta Chauhan 2243b962fa Create spellcheck.yml (#50) 1 year ago
  Geeta Chauhan 3cc2b3787f Fix broken links in Dataset.md (#49) 1 year ago
  Geeta Chauhan 021ed8e312 adding active mem stat (#44) 1 year ago
  Geeta Chauhan 09db361d23 Templates updates (#67) 1 year ago
  Hamid Shojanazeri 4ba4400a75 adding dist barrier before and after checkpointing 1 year ago
  chauhang 95d59afcb8 Update PR template 1 year ago
  chauhang 857a3ade4e Add PR template 1 year ago
  chauhang 9f9532d34c comm 1 year ago
  Christian Miller 9b2f72e1f5 update README: python 3.8 rec + fix formatting 1 year ago
  Hamid Shojanazeri a49a2c2804 adding PT cuda allocation expand flag 1 year ago
  Geeta Chauhan 905f633dab adding issue tempalte (#57) 1 year ago
  Hamid Shojanazeri b814704b5f adding issue tempalte 1 year ago
  Hamid Shojanazeri 442c1ccf7c adding barrier to end of trainer loop 1 year ago
  Hamid Shojanazeri f74d57dc08 printing scores based on fsdp usage or single gpu 1 year ago
  Hamid Shojanazeri 3d887ea483 update with active memory and removing rank0 for eval score 1 year ago
  sekyonda 0d9c1a909f Update markdown_link_check_config.json 1 year ago
  Hamid Shojanazeri bedb96b78a fixing the full state path in checkpoint handler 1 year ago
  sekyondaMeta b625dceb9b Create spellcheck.yml 1 year ago
  Kaiser Pister b61c45d31d Fix broken links in Dataset.md 1 year ago
  Hamid Shojanazeri 569f8b7976 fixed arg names 1 year ago
  Hamid Shojanazeri 9e3b1b7f01 fixed arg names 1 year ago
  Hamid Shojanazeri 4b18e49f44 added steps for conversion of fsdp to Hf 1 year ago
  Geeta Chauhan 74bde65a62 Adding Supporting Files For link and Spell Check (#26) 1 year ago
  Hamid Shojanazeri a977145a9b change bf16 default to false 1 year ago
  Hamid Shojanazeri 563e572f7c adding active mem stat 1 year ago
  Hamid Shojanazeri 83fde7b94b Fix cuda id for using quantization (#40) 1 year ago