Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

rednote-hilab

company
https://github.com/rednote-hilab
rednote-hilab
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

floyed  submitted a paper 23 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
debingzhang  authored a paper 3 months ago
PretrainZero: Reinforcement Active Pretraining
ygfrancois  updated a model 5 months ago
rednote-hilab/dots.ocr.base
View all activity

Papers

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

View all Papers

redmoe-ai-v1's profile pictureshamyji's profile picturejay's profile picturelizichao's profile pictureLei Zhang's profile picturezdb's profile pictureqyinyourmind's profile pictureGuang Yang's profile pictureliuhao's profile picturecolin zhang's profile pictureXiaoming Shi's profile picture
rednote-hilab 's Papers 7
Submitted by
HuangMeow
196

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

rednote-hilab rednote-hilab
12 3
Submitted by
Xiao Wang
26

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

rednote-hilab rednote-hilab
64 2
Submitted by
floyed shen
220

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

rednote-hilab rednote-hilab
29 9
2

dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model

rednote-hilab rednote-hilab
8.07k 1
Submitted by
taesiri
46

DeepEyesV2: Toward Agentic Multimodal Model

rednote-hilab rednote-hilab
1.16k 2
Submitted by
Qingyu_Yin
7

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?

rednote-hilab rednote-hilab
9 2
4

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

rednote-hilab rednote-hilab
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs