Matthias Reso
|
113ea18bf1
Replace LlamaTokenizer with AutoTokenizer
|
7 months ago |
Beto
|
5979dbe996
Merging local with remote
|
7 months ago |
Beto
|
d4cbfa1cc1
Merging upstream llama-recipes to current repo
|
7 months ago |
Hamid Shojanazeri
|
aaa9e2c863
Adding a feature that will stop the training/eval process after reaching some max_steps (#428)
|
7 months ago |
Kai Wu
|
e6f69f84ad
add max_steps_reached to reduce redundancy
|
7 months ago |
rahul-sarvam
|
0efb8bd31e
Update README.md
|
7 months ago |
rahul-sarvam
|
687c2dc5d8
Update README.md
|
7 months ago |
Rahul A R
|
2fa8e69b62
add new argument: tokenizer_name
|
7 months ago |
Rahul A R
|
f8183b96fe
use new tokenizer_name argument and resize embeddings if required
|
7 months ago |
Rahul A R
|
1e4e3e00fc
adding new multilingual recipe
|
7 months ago |
Kai Wu
|
362cda0fa6
fixing test_gradient_accumulation and test_save_to_json
|
7 months ago |
Kai Wu
|
fa0a389f74
add max_step feature for training and eval
|
7 months ago |
Hamid Shojanazeri
|
37c8f72211
Update location and name of llm.py example notebook (#417)
|
7 months ago |
Thomas Robinson
|
79266217ef
Update location and name of llm.py example notebook
|
7 months ago |
Hamid Shojanazeri
|
f7aa02af9f
only save training params on rank 0 (#415)
|
7 months ago |
jpgard
|
6954b16b3b
only save training params on rank 0
|
7 months ago |
varunfb
|
a404c9249c
Notebook to demonstrate using llama and llama-guard together using OctoAI
|
8 months ago |
Beto
|
18d76ed36f
merging into private llama recipes repo
|
8 months ago |
Hamid Shojanazeri
|
64e189914f
update due to peft new release (#407)
|
8 months ago |
Hamid Shojanazeri
|
11f51db28c
adding the kbit prep in the code
|
8 months ago |
Hamid Shojanazeri
|
f058ff6ccd
update due to peft new release
|
8 months ago |
Hamid Shojanazeri
|
6a7478a6aa
Reorg inference throughput folder structure (#404)
|
8 months ago |
Chester Hu
|
367e4869ac
Reorg inference throughput folder structure
|
8 months ago |
Hamid Shojanazeri
|
d6eb83f6c5
Add llm class so that externally-hosted models can be called (#398)
|
8 months ago |
Thomas Robinson
|
0346d0d5b8
Add documentation and examples
|
8 months ago |
Hamid Shojanazeri
|
43a1e5cdb0
Fix dead links after directory structure refactor (#397)
|
8 months ago |
Suraj Subramanian
|
e2a35420c0
Remove octoai link that is 401-ing
|
8 months ago |
Suraj Subramanian
|
12602f32e2
Merge branch 'main' into subramen-patch-deadlinks
|
8 months ago |
Hamid Shojanazeri
|
c8f4bdac41
Adding open in colab option for notebook (#395)
|
8 months ago |
Thomas Robinson
|
81984a9a44
Remove unnecessary spec format
|
8 months ago |