Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
1.98k
kernel
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
3c8bb73
quantization
1.08 GB
2 contributors
History:
30 commits
danieldk
HF Staff
Add support for ROCm
3c8bb73
12 months ago
build
Build (Torch 2.6)
about 1 year ago
compressed_tensors
Sync with vLLM
about 1 year ago
core
Sync with vLLM
about 1 year ago
cutlass_extensions
Sync with vLLM
about 1 year ago
cutlass_w8a8
Sync with vLLM
about 1 year ago
fp8
Sync with vLLM
about 1 year ago
gptq_marlin
Sync with vLLM
about 1 year ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS
over 1 year ago
tests
Add full Marlin support and tests for Marlin/CUTLASS
over 1 year ago
torch-ext
Add support for ROCm
12 months ago
.gitattributes
Safe
1.56 kB
Build
over 1 year ago
LICENSE
Safe
11.4 kB
Add cutlass_w8a8
over 1 year ago
README.md
Safe
195 Bytes
Update README.md (#1)
about 1 year ago
build.toml
3.14 kB
Add support for ROCm
12 months ago
dispatch_utils.h
Safe
1.49 kB
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
over 1 year ago
flake.lock
3.03 kB
Add support for ROCm
12 months ago
flake.nix
Safe
335 Bytes
Add support for ROCm
12 months ago
vectorization.cuh
Safe
778 Bytes
Sync with vLLM
about 1 year ago