Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mengfanxu's picture
3 5 126

mengfanxu

fxmeng
niguanwo's profile picture 21world's profile picture PeepDaSlan9's profile picture
·
https://fxmeng.github.io
  • fxmeng

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago
fxmeng/TransMLA-llama3-8b-32k
updated a model about 1 month ago
fxmeng/TransMLA-llama3-8b-8k
updated a collection about 2 months ago
TransMLA-base
View all activity

Organizations

None yet

authored a paper 4 months ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference

Paper • 2508.15881 • Published Aug 21 • 9
authored 2 papers 11 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 58

CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy

Paper • 2411.17426 • Published Nov 26, 2024
authored a paper over 1 year ago

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models

Paper • 2404.02948 • Published Apr 3, 2024 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs