27 101 271

Yinxu Pan

cppowboy

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper 3 days ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

liked a model 10 days ago

FutureLivingLab/iFlow-ROME

liked a model 10 days ago

openbmb/MiniCPM-o-4_5

View all activity

Organizations

upvoted a paper 3 days ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published 9 days ago • 24

liked 2 models 10 days ago

FutureLivingLab/iFlow-ROME

Text Generation • 31B • Updated 10 days ago • 232 • 34

openbmb/MiniCPM-o-4_5

Any-to-Any • Updated 9 days ago • 77.6k • 870

liked a model 11 days ago

openbmb/MiniCPM-SALA

Text Generation • Updated 11 days ago • 4.66k • 472

liked a dataset 11 days ago

XXHStudyHard/EnvScaler-SFT-Traj-9K

Viewer • Updated Jan 15 • 9.02k • 89 • 6

liked a dataset 12 days ago

r2e-edits/SweSmith-RL-Dataset

Viewer • Updated Jun 20, 2025 • 8.65k • 69 • 4

liked a dataset 13 days ago

openbmb/UltraData-Math

Viewer • Updated 1 day ago • 181M • 40.8k • 241

liked 3 datasets 15 days ago

upvoted 3 papers 15 days ago

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Paper • 2602.03411 • Published 19 days ago • 37

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Paper • 2602.03419 • Published 19 days ago • 39

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published 18 days ago • 253

upvoted 4 papers 19 days ago

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Paper • 2602.02486 • Published 19 days ago • 18

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 20 days ago • 60

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 20 days ago • 238

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published 19 days ago • 32

liked a dataset 19 days ago

GAIR/daVinci-Agency

Viewer • Updated 18 days ago • 240 • 440 • 2

upvoted a paper 20 days ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published 23 days ago • 35

upvoted a paper 21 days ago

DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents

Paper • 2601.20975 • Published 24 days ago • 9

Yinxu Pan

AI & ML interests

Recent Activity

Organizations

cppowboy's activity