Commit History

Autor SHA1 Mensaxe Data
  gaopengzhi c7d410725b Merge branch 'main' into grad_clip hai 1 ano
  gaopengzhi e2797abe9b Add gradient_clipping and gradient_clipping_threshold parameters hai 1 ano
  gaopengzhi bb7c6c1e33 Support FSDP scenario hai 1 ano
  gaopengzhi b1d9efd155 Refactor gradient clipping feature hai 1 ano
  Jeremy Howard eca8410b32 Use bf16 parameters in bf16 mixed prec hai 1 ano
  gaopengzhi 04befdef69 Add gradient clipping feature hai 1 ano
  Matthias Reso e8bb7fbabc Merge remote-tracking branch 'origin/main' into feature/length_based_batch_sampling hai 1 ano
  Matthias Reso 33925f71e6 Add missing amp context if use_fp16 is enabled hai 1 ano
  Matthias Reso 10f9367e56 fix missing labels in datasets hai 1 ano
  hongbo.mo 6217635e87 Fix tqdm bar not change length after terminal is resized hai 1 ano
  Matthias Reso c33ea3cacb Fix pbar update hai 1 ano
  hongbo.mo 5e910e6a42 Fix typo hai 1 ano
  hongbo.mo 0bc6a07a80 bugfix: update tqdm bar with the fixed gradient_accumulation_steps hai 1 ano
  Matthias Reso 72a9832571 Merge branch 'main' into feature/package_distribution hai 1 ano
  Matthias Reso ce9501f22c remove relative imports hai 1 ano
  Matthias Reso 5b58afc754 Fix div by zero if run_validation=False hai 1 ano
  Matthias Reso cf678b9bf0 Adjust imports to package structure + cleaned up imports hai 1 ano
  Matthias Reso 4c9cc7d223 Move modules into separate src folder hai 1 ano