flash-attention pre-build wheels
This repository provides pre-built wheels for flash-attention.
Because building flash-attention takes a very long time and is resource-intensive,
I also build and provide wheels for combinations of CUDA and PyTorch that are not officially distributed.
The GitHub Actions workflow that builds the wheels can be found here.
The built packages are available on the release page.
Install
- Select the versions of Python, CUDA, PyTorch, and flash_attn.
- Find the corresponding wheel in the tables below and on the release page.
- Install it directly, or download it and install it locally.
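The selection step above can be sketched in Python. This is a minimal sketch, not the repository's tooling: `python_tag` and `wheel_filename` are hypothetical helpers, and the filename pattern they produce is an assumption about how the wheels are named. Check the actual asset names on the release page before relying on it.

```python
import sys


def python_tag(version_info=sys.version_info) -> str:
    """Return the CPython wheel tag, e.g. 'cp312' for Python 3.12."""
    return f"cp{version_info[0]}{version_info[1]}"


def wheel_filename(flash_attn: str, cuda: str, torch: str, python: str) -> str:
    """Build a candidate wheel filename from the four selected versions.

    ASSUMPTION: the naming pattern is hypothetical and only illustrates
    how the four versions combine into one identifier. Compare the result
    with the real file names listed on the release page.
    """
    cuda_tag = "cu" + "".join(cuda.split(".")[:2])         # "12.4.1" -> "cu124"
    torch_tag = "torch" + ".".join(torch.split(".")[:2])   # "2.5.1" -> "torch2.5"
    py_tag = "cp" + "".join(python.split(".")[:2])         # "3.12" -> "cp312"
    return (
        f"flash_attn-{flash_attn}+{cuda_tag}{torch_tag}"
        f"-{py_tag}-{py_tag}-linux_x86_64.whl"
    )


if __name__ == "__main__":
    # Tag for the interpreter running this script.
    print(python_tag())
    # Candidate filename for one combination listed in the tables.
    print(wheel_filename("2.6.3", "12.4.1", "2.5.1", "3.12"))
```

Once the matching file is found on the release page, it can be passed to `pip install` either as a URL (direct install) or as a path to the downloaded file (local install).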
Packages
v0.0.9
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.4.3, 2.5.9, 2.6.3 | 3.10, 3.11, 3.12 | 2.7.0 | 12.8.1 |
v0.0.8
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.4.3, 2.5.9, 2.6.3, 2.7.4.post1 | 3.10, 3.11, 3.12 | 2.4.1, 2.5.1, 2.6.0, 2.7.0 | 11.8.0, 12.4.1, 12.6.3 |
v0.0.7
Skipped for experimental reasons.
v0.0.6
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.4.3, 2.5.9, 2.6.3, 2.7.4.post1 | 3.10, 3.11, 3.12 | 2.2.2, 2.3.1, 2.4.1, 2.5.1, 2.6.0 | 12.4.1, 12.6.3 |
v0.0.5
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.6.3, 2.7.4.post1 | 3.10, 3.11, 3.12 | 2.0.1, 2.1.2, 2.2.2, 2.3.1, 2.4.1, 2.5.1, 2.6.0 | 12.4.1, 12.6.3 |
v0.0.4
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.7.3 | 3.10, 3.11, 3.12 | 2.0.1, 2.1.2, 2.2.2, 2.3.1, 2.4.1, 2.5.1 | 11.8.0, 12.1.1, 12.4.1 |
v0.0.3
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.7.2.post1 | 3.10, 3.11, 3.12 | 2.0.1, 2.1.2, 2.2.2, 2.3.1, 2.4.1, 2.5.1 | 11.8.0, 12.1.1, 12.4.1 |
v0.0.2
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.4.3, 2.5.6, 2.6.3, 2.7.0.post2 | 3.10, 3.11, 3.12 | 2.0.1, 2.1.2, 2.2.2, 2.3.1, 2.4.1, 2.5.1 | 11.8.0, 12.1.1, 12.4.1 |
v0.0.1
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 1.0.9, 2.4.3, 2.5.6, 2.5.9, 2.6.3 | 3.10, 3.11, 3.12 | 2.0.1, 2.1.2, 2.2.2, 2.3.1, 2.4.1, 2.5.0 | 11.8.0, 12.1.1, 12.4.1 |
v0.0.0
Release
| Flash-Attention | Python | PyTorch | CUDA |
| --- | --- | --- | --- |
| 2.4.3, 2.5.6, 2.5.9, 2.6.3 | 3.11, 3.12 | 2.0.1, 2.1.2, 2.2.2, 2.3.1, 2.4.1, 2.5.0 | 11.8.0, 12.1.1, 12.4.1 |
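The tables above can also be treated as a lookup when narrowing down which release to search. The sketch below transcribes only the v0.0.9 and v0.0.8 rows; `RELEASES` and `candidate_releases` are hypothetical names introduced for illustration. It assumes that a version appearing in a row is necessary but not sufficient: the tables list versions per column, so not every combination within a row is guaranteed to have a wheel.

```python
# Rows transcribed from the v0.0.9 and v0.0.8 tables; extend as needed.
RELEASES = {
    "v0.0.9": {
        "flash_attn": {"2.4.3", "2.5.9", "2.6.3"},
        "python": {"3.10", "3.11", "3.12"},
        "torch": {"2.7.0"},
        "cuda": {"12.8.1"},
    },
    "v0.0.8": {
        "flash_attn": {"2.4.3", "2.5.9", "2.6.3", "2.7.4.post1"},
        "python": {"3.10", "3.11", "3.12"},
        "torch": {"2.4.1", "2.5.1", "2.6.0", "2.7.0"},
        "cuda": {"11.8.0", "12.4.1", "12.6.3"},
    },
}


def candidate_releases(flash_attn: str, python: str, torch: str, cuda: str) -> list[str]:
    """Return release tags whose table row lists all four requested versions.

    A match only means the release is worth checking; the wheel itself
    still has to be confirmed on the release page.
    """
    wanted = {"flash_attn": flash_attn, "python": python, "torch": torch, "cuda": cuda}
    return [
        tag
        for tag, row in RELEASES.items()
        if all(wanted[key] in row[key] for key in wanted)
    ]


if __name__ == "__main__":
    print(candidate_releases("2.6.3", "3.12", "2.7.0", "12.8.1"))
```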
Original
repo