Mengzhao Chen's picture

Mengzhao Chen

ChenMnZ

·

https://chenmnz.github.io/

ChenMnZ

AI & ML interests

model compression

Recent Activity

upvoted an article about 2 months ago

The Optimal Architecture for Small Language Models

upvoted a paper about 2 months ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper 3 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

View all activity

Organizations

None yet

ChenMnZ 's models 129

ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ

Updated Aug 6, 2024 • 5 • 25

ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-BitBLAS

Text Generation • 276B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-BitBLAS

Text Generation • 276B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-BitBLAS

Text Generation • 29B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-BitBLAS

Text Generation • 29B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-BitBLAS

Text Generation • 29B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w4g128-BitBLAS

Text Generation • 29B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w2g64-BitBLAS

Text Generation • 29B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-8b-EfficientQAT-w2g128-BitBLAS

Text Generation • 29B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-BitBLAS

Text Generation • 276B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-BitBLAS

Text Generation • 276B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-8b-EfficientQAT-w2g128-GPTQ

Text Generation • 8B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-BitBLAS

Text Generation • 276B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-GPTQ

Text Generation • 8B • Updated Jul 22, 2024 • 4

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-GPTQ

Text Generation • 8B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-BitBLAS

Text Generation • 276B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-GPTQ

Text Generation • 8B • Updated Jul 22, 2024 • 4 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w4g128-GPTQ

Text Generation • 8B • Updated Jul 22, 2024 • 2 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w2g64-GPTQ

Text Generation • 8B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128

Text Generation • 11B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-BitBLAS

Text Generation • 26B • Updated Jul 22, 2024

ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-BitBLAS

Text Generation • 26B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-BitBLAS

Text Generation • 26B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS

Text Generation • 275B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 4

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128

Text Generation • 2B • Updated Jul 22, 2024

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w3g128

Text Generation • 2B • Updated Jul 22, 2024 • 5