Hamid Shojanazeri
|
df03fd4b12
Recipe to add a new language to Llama2 (#429)
|
7 bulan lalu |
Rahul A R
|
cc1029bcf1
update wordlist.txt
|
7 bulan lalu |
Rahul A R
|
664429b726
Merge branch 'main' of github.com:rahul-sarvam/llama-recipes
|
7 bulan lalu |
Rahul A R
|
e98f6de80d
typo
|
7 bulan lalu |
Hamid Shojanazeri
|
e3d750f49f
Update wordlist.txt
|
7 bulan lalu |
Rahul A R
|
09028bf893
addressing Hamid's comments
|
7 bulan lalu |
rahul-sarvam
|
eb7ef4225f
Update recipes/multilingual/README.md
|
7 bulan lalu |
rahul-sarvam
|
f1f335a591
Update recipes/multilingual/README.md
|
7 bulan lalu |
rahul-sarvam
|
47556ce0a6
Update recipes/multilingual/README.md
|
7 bulan lalu |
Hamid Shojanazeri
|
aaa9e2c863
Adding a feature that will stop the training/eval process after reaching some max_steps (#428)
|
7 bulan lalu |
Kai Wu
|
e6f69f84ad
add max_steps_reached to reduce redundancy
|
7 bulan lalu |
rahul-sarvam
|
0efb8bd31e
Update README.md
|
7 bulan lalu |
rahul-sarvam
|
687c2dc5d8
Update README.md
|
7 bulan lalu |
Rahul A R
|
2fa8e69b62
add new argument: tokenizer_name
|
7 bulan lalu |
Rahul A R
|
f8183b96fe
use new tokenizer_name argument and resize embeddings if required
|
7 bulan lalu |
Rahul A R
|
1e4e3e00fc
adding new multilingual recipe
|
7 bulan lalu |
Kai Wu
|
362cda0fa6
fixing test_gradient_accumulation and test_save_to_json
|
7 bulan lalu |
Kai Wu
|
fa0a389f74
add max_step feature for training and eval
|
7 bulan lalu |
Hamid Shojanazeri
|
37c8f72211
Update location and name of llm.py example notebook (#417)
|
7 bulan lalu |
Thomas Robinson
|
79266217ef
Update location and name of llm.py example notebook
|
7 bulan lalu |
Hamid Shojanazeri
|
f7aa02af9f
only save training params on rank 0 (#415)
|
7 bulan lalu |
jpgard
|
6954b16b3b
only save training params on rank 0
|
7 bulan lalu |
Hamid Shojanazeri
|
64e189914f
update due to peft new release (#407)
|
8 bulan lalu |
Hamid Shojanazeri
|
11f51db28c
adding the kbit prep in the code
|
8 bulan lalu |
Hamid Shojanazeri
|
f058ff6ccd
update due to peft new release
|
8 bulan lalu |
Hamid Shojanazeri
|
6a7478a6aa
Reorg inference throughput folder structure (#404)
|
8 bulan lalu |
Chester Hu
|
367e4869ac
Reorg inference throughput folder structure
|
8 bulan lalu |
Hamid Shojanazeri
|
d6eb83f6c5
Add llm class so that externally-hosted models can be called (#398)
|
8 bulan lalu |
Thomas Robinson
|
0346d0d5b8
Add documentation and examples
|
8 bulan lalu |
Hamid Shojanazeri
|
43a1e5cdb0
Fix dead links after directory structure refactor (#397)
|
8 bulan lalu |