Create README.md

This commit is contained in:
Junya Morioka
2024-10-28 09:48:49 +09:00
committed by GitHub
parent d25708aaa9
commit 5195dde3eb
+10
View File
@@ -0,0 +1,10 @@
# flash-attention pre-build wheels
This repository provides wheels for the pre-build [flash-attention](https://github.com/Dao-AILab/flash-attention).
Since building flash-attention takes a **very long time** and is resource-intensive,
I also build and provide combinations of CUDA and PyTorch that are not officially distributed.
The building Github Actions Workflow can be found [here](./.github/workflows/build.yml).
The built packages is available on the [release page](https://github.com/mjun0812/flash-attention-prebuild-wheels/releases).