Preference Datasets for DPO Collection This collection contains curated preference datasets for DPO fine-tuning, aimed at intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 46
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 27 days ago • 74
InteractComp: Evaluating Search Agents With Ambiguous Queries Paper • 2510.24668 • Published Oct 28, 2025 • 97
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science Paper • 2510.16872 • Published Oct 19, 2025 • 106
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published Oct 7, 2025 • 106
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels Paper • 2510.06499 • Published Oct 7, 2025 • 31
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents Paper • 2509.06501 • Published Sep 8, 2025 • 79
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9, 2025 • 101
TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning Paper • 2508.20374 • Published Aug 28, 2025 • 21
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published Aug 21, 2025 • 46
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published Aug 14, 2025 • 60
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training Paper • 2508.00414 • Published Aug 1, 2025 • 93