arxiv:2411.02355
Eldar Kurtić
ekurtic
AI & ML interests
Efficient inference
Recent Activity
updated
a model
about 16 hours ago
ekurtic/Llama-Guard-4-12B-quantized.w8a8
published
a model
about 17 hours ago
ekurtic/Llama-Guard-4-12B-quantized.w8a8
updated
a model
about 17 hours ago
ekurtic/Llama-Guard-4-12B-FP8-dynamic