Commit history

Author SHA1 Message Date
  Matthias Reso cf678b9bf0 Adjust imports to package structure + cleaned up imports 1 year ago
  Matthias Reso 02428c992a Adding vllm as dependency; fix dep install with hatchling 1 year ago
  Matthias Reso c8522eb0ff Remove peft install from src 1 year ago
  Matthias Reso 4c9cc7d223 Move modules into separate src folder 1 year ago
  Geeta Chauhan fbc513ec47 adding notes how to get the HF models (#151) 1 year ago
  Hamid Shojanazeri bcfafd9a0b adding notes how to get the HF models 1 year ago
  Geeta Chauhan cfba150311 adding llama code inference (#144) 1 year ago
  Hamid Shojanazeri 6105a3f886 clarifying the infilling use-case 1 year ago
  Hamid Shojanazeri 8b0008433c fix typos 1 year ago
  Hamid Shojanazeri 564ef2f628 remove padding logic 1 year ago
  Hamid Shojanazeri 277a292fbc adding autotokenizer 1 year ago
  Hamid Shojanazeri 3f2fb9167e adding notes to model not supporting infilling 1 year ago
  Hamid Shojanazeri c62428b99c setting defaults of temp and top_p 1 year ago
  Hamid Shojanazeri c014ae7cb8 setting BT option to true 1 year ago
  Hamid Shojanazeri 4fa44e16d9 add note for python llama not suited for llama infilling 1 year ago
  Hamid Shojanazeri b18a186385 removing the option to take prompt from cli 1 year ago
  Hamid Shojanazeri 75991d8795 fix the extra line added and remove take prompt from cli 1 year ago
  Hamid Shojanazeri d28fc9898a addressing doc comments 1 year ago
  Hamid Shojanazeri a234d1fe0c fix typos 1 year ago
  Hamid Shojanazeri 2d9f4796e8 fixing the output format 1 year ago
  Hamid Shojanazeri 1e8ea70b26 adding llama code inference 1 year ago
  Geeta Chauhan 82e05c46e0 fix a bug in the config for use_fast_kernels (#121) 1 year ago
  Hamid Shojanazeri 971c079aa6 bugfix: remove duplicate load_peft_model (#124) 1 year ago
  hongbo.mo fcc817e923 bugfix: remove duplicate load_peft_model 1 year ago
  Brian Vaughan 3faf005226 fix a bug in the config for use_fast_kernels 1 year ago
  Geeta Chauhan 03faba661f Update paddings (#85) 1 year ago
  Geeta Chauhan 205e5a4b81 save cpu mem by leveraging FSDP rank0 broadcasting (#77) 1 year ago
  Hamid Shojanazeri 85a4ed1b65 Merge branch 'main' into update_paddings 1 year ago
  lchu feaa344af3 resolve conflicts 1 year ago
  Geeta Chauhan 3f1fef7a00 adding flash attention and xformer memory efficient through PT SDPA (#97) 1 year ago