Historique des commits

Auteur SHA1 Message Date
  Hamid Shojanazeri 5b916114eb merge main branch il y a 1 an
  Hamid Shojanazeri 668c364f6b add rank to save_train_params il y a 1 an
  Hamid Shojanazeri 231c9e7da9 adding train_param.yaml saving for fsdp checkpoint loading for inference il y a 1 an
  Hamid Shojanazeri 475e67b4ec clean up il y a 1 an
  Hamid Shojanazeri 50e9d17045 add the default option for find the HF model_name/path from train_param.yaml il y a 1 an
  Hamid Shojanazeri 41dd7ff1cb Merge branch 'main' into checkpoint_handler_path_fix il y a 1 an
  Hamid Shojanazeri 31d6ce8bf6 adding expnadable sgement and dist debug flag info il y a 1 an
  Hamid Shojanazeri a955ed1999 added checks for dist barrier and commented cuda exapnadable segements and dist_dbug il y a 1 an
  Hamid Shojanazeri a2403c7c1a clean up il y a 1 an
  Hamid Shojanazeri e9559d2669 fixing the train/eval_loss calcualtion il y a 1 an
  Geeta Chauhan 2243b962fa Create spellcheck.yml (#50) il y a 1 an
  Geeta Chauhan 3cc2b3787f Fix broken links in Dataset.md (#49) il y a 1 an
  Geeta Chauhan 021ed8e312 adding active mem stat (#44) il y a 1 an
  Geeta Chauhan 09db361d23 Templates updates (#67) il y a 1 an
  Hamid Shojanazeri 4ba4400a75 adding dist barrier before and after checkpointing il y a 1 an
  chauhang 95d59afcb8 Update PR template il y a 1 an
  chauhang 857a3ade4e Add PR template il y a 1 an
  chauhang 9f9532d34c comm il y a 1 an
  Christian Miller 9b2f72e1f5 update README: python 3.8 rec + fix formatting il y a 1 an
  Hamid Shojanazeri a49a2c2804 adding PT cuda allocation expand flag il y a 1 an
  Geeta Chauhan 905f633dab adding issue tempalte (#57) il y a 1 an
  Hamid Shojanazeri b814704b5f adding issue tempalte il y a 1 an
  Hamid Shojanazeri 442c1ccf7c adding barrier to end of trainer loop il y a 1 an
  Hamid Shojanazeri f74d57dc08 printing scores based on fsdp usage or single gpu il y a 1 an
  Hamid Shojanazeri 3d887ea483 update with active memory and removing rank0 for eval score il y a 1 an
  sekyonda 0d9c1a909f Update markdown_link_check_config.json il y a 1 an
  Hamid Shojanazeri bedb96b78a fixing the full state path in checkpoint handler il y a 1 an
  sekyondaMeta b625dceb9b Create spellcheck.yml il y a 1 an
  Kaiser Pister b61c45d31d Fix broken links in Dataset.md il y a 1 an
  Hamid Shojanazeri 569f8b7976 fixed arg names il y a 1 an