Hamid Shojanazeri 44ef280d31 adding flash attention and xformer memory efficient through PT SDPA 1 year ago
__init__.py 4767f09ecd Initial commit 1 year ago
config_utils.py 4767f09ecd Initial commit 1 year ago
dataset_utils.py 4767f09ecd Initial commit 1 year ago
fsdp_utils.py 4767f09ecd Initial commit 1 year ago
memory_utils.py 41dd7ff1cb Merge branch 'main' into checkpoint_handler_path_fix 1 year ago
train_utils.py 44ef280d31 adding flash attention and xformer memory efficient through PT SDPA 1 year ago