9 14 72

Yang

jacklanda

AI & ML interests

Reasoning, Mech Interp, Semantics

Recent Activity

upvoted a paper about 23 hours ago

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

updated a collection 1 day ago

Semantics

commented on a paper 1 day ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

View all activity

Organizations

None yet

Collections 2

Papers 11

spaces 1

Distinct

👀

Create a static web page by editing HTML

models 1

jacklanda/Qwen-2.5-1.5B-Simple-RL

Updated Feb 17, 2025

datasets 1

jacklanda/LexBench

Preview • Updated May 21, 2024 • 2

Yang

AI & ML interests

Recent Activity

Organizations

Collections 2

LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

Papers 11

spaces 1

Distinct

models 1

jacklanda/Qwen-2.5-1.5B-Simple-RL

datasets 1

jacklanda/LexBench