Hamid Shojanazeri 44ef280d31 adding flash attention and xformer memory efficient through PT SDPA 1 éve
..
__init__.py 4767f09ecd Initial commit 1 éve
datasets.py 4767f09ecd Initial commit 1 éve
fsdp.py a977145a9b change bf16 default to false 1 éve
peft.py 4767f09ecd Initial commit 1 éve
training.py 44ef280d31 adding flash attention and xformer memory efficient through PT SDPA 1 éve