marx/llama-recipes

Auteur	SHA1 Message	Date
Matthias Reso	cf678b9bf0 Adjust imports to package structure + cleaned up imports	il y a 1 an
Matthias Reso	02428c992a Adding vllm as dependency; fix dep install with hatchling	il y a 1 an
Matthias Reso	c8522eb0ff Remove peft install from src	il y a 1 an
Matthias Reso	4c9cc7d223 Move modules into separate src folder	il y a 1 an
Geeta Chauhan	fbc513ec47 adding notes how to get the HF models (#151)	il y a 1 an
Hamid Shojanazeri	bcfafd9a0b adding notes how to get the HF models	il y a 1 an
Geeta Chauhan	cfba150311 adding llama code inference (#144)	il y a 1 an
Hamid Shojanazeri	6105a3f886 clarifying the infilling use-case	il y a 1 an
Hamid Shojanazeri	8b0008433c fix typos	il y a 1 an
Hamid Shojanazeri	564ef2f628 remove padding logic	il y a 1 an
Hamid Shojanazeri	277a292fbc adding autotokenizer	il y a 1 an
Hamid Shojanazeri	3f2fb9167e adding notes to model not supporting infilling	il y a 1 an
Hamid Shojanazeri	c62428b99c setting defaults of temp and top_p	il y a 1 an
Hamid Shojanazeri	c014ae7cb8 setting BT option to true	il y a 1 an
Hamid Shojanazeri	4fa44e16d9 add note for python llama not suited for llama infilling	il y a 1 an
Hamid Shojanazeri	b18a186385 removing the option to take prompt from cli	il y a 1 an
Hamid Shojanazeri	75991d8795 fix the extra line added and remove take prompt from cli	il y a 1 an
Hamid Shojanazeri	d28fc9898a addressing doc comments	il y a 1 an
Hamid Shojanazeri	a234d1fe0c fix typos	il y a 1 an
Hamid Shojanazeri	2d9f4796e8 fixing the output format	il y a 1 an
Hamid Shojanazeri	1e8ea70b26 adding llama code inference	il y a 1 an
Geeta Chauhan	82e05c46e0 fix a bug in the config for use_fast_kernels (#121)	il y a 1 an
Hamid Shojanazeri	971c079aa6 bugfix: remove duplicate load_peft_model (#124)	il y a 1 an
hongbo.mo	fcc817e923 bugfix: remove duplicate load_peft_model	il y a 1 an
Brian Vaughan	3faf005226 fix a bug in the config for use_fast_kernels	il y a 1 an
Geeta Chauhan	03faba661f Update paddings (#85)	il y a 1 an
Geeta Chauhan	205e5a4b81 save cpu mem by leveraging FSDP rank0 broadcasting (#77)	il y a 1 an
Hamid Shojanazeri	85a4ed1b65 Merge branch 'main' into update_paddings	il y a 1 an
lchu	feaa344af3 resolve conflicts	il y a 1 an
Geeta Chauhan	3f1fef7a00 adding flash attention and xformer memory efficient through PT SDPA (#97)	il y a 1 an

Récemment Précédemment

Historique des commits Trouver

Historique des commits