Hamid Shojanazeri 44ef280d31 adding flash attention and xformer memory efficient through PT SDPA 1 ano atrás
..
__init__.py 4767f09ecd Initial commit 1 ano atrás
datasets.py 4767f09ecd Initial commit 1 ano atrás
fsdp.py a977145a9b change bf16 default to false 1 ano atrás
peft.py 4767f09ecd Initial commit 1 ano atrás
training.py 44ef280d31 adding flash attention and xformer memory efficient through PT SDPA 1 ano atrás