YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

FocusUI-3B

This model was introduced in the paper:

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Model Zoo

Model Backbone ๐Ÿค— HuggingFace
FocusUI-3B Qwen2.5-VL-3B https://huggingface.co/yyyang/FocusUI-3B
FocusUI-7B Qwen2.5-VL-7B https://huggingface.co/yyyang/FocusUI-7B
FocusUI-2B Qwen3-VL-2B https://huggingface.co/yyyang/FocusUI-Qwen3-VL-2B

Dataset & Benchmarks

For the training and evaluation data, see FocusUI-Training-Data and UI-Grounding-Benchmarks.

Citation

@article{ouyang2025focusui,
  title   = {FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection},
  author  = {Ouyang, Mingyu and Lin, Kevin Qinghong and Shou, Mike Zheng and Ng, Hwee Tou},
  year    = {2025},
  journal = {arXiv preprint},
}
Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Paper for yyyang/FocusUI-3B