Asankhaya Sharma
Precisely. In fact, this quest started by trying to see whether attention can implement and track a simple hash table, which would be sufficient to solve HashHop. One reason we haven't heard from Magic labs may be that tool calling + reasoning has made long context less relevant.
Good question! Let me clarify - the MALM and Qwen models don't share tokens directly.
MALM has its own tokenizer where each function name becomes a single token. It encodes the query, searches its memory bank, and returns the top matching functions as plain text (function name, signature, docstring, code).
This retrieved text is then simply concatenated with the user query as a prompt to Qwen. Qwen tokenizes this combined prompt using its own tokenizer and generates code.
So the flow is:
- User query goes to MALM
- MALM retrieves relevant functions as text
- Text prompt = query + retrieved code
- Qwen tokenizes this prompt with its own tokenizer
- Qwen generates output
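The flow above can be sketched in a few lines. This is a hypothetical stand-in, not the real MALM or Qwen APIs: retrieval is faked with simple word-overlap scoring, and the function names and memory bank contents are made up for illustration.

```python
# Hypothetical sketch of the MALM -> Qwen text-passing flow: MALM-style
# retrieval returns matching functions as plain text, which is simply
# concatenated with the user query to form Qwen's prompt.

def retrieve(query, memory_bank, top_k=2):
    """Stand-in for MALM's retrieval: rank stored function texts by
    word overlap with the query and return the top matches as text."""
    def score(entry):
        return len(set(query.lower().split()) & set(entry.lower().split()))
    return sorted(memory_bank, key=score, reverse=True)[:top_k]

def build_prompt(query, retrieved):
    # Retrieved code is just concatenated with the query; the generator
    # (Qwen) would tokenize this combined text with its own tokenizer.
    return "\n\n".join(retrieved) + "\n\n# Task: " + query

# A toy memory bank of function texts (name, docstring, body elided).
memory_bank = [
    'def parse_json(s):\n    """Parse a JSON string."""\n    ...',
    'def read_file(path):\n    """Read a file into a string."""\n    ...',
]

query = "read a file and parse it as json"
prompt = build_prompt(query, retrieve(query, memory_bank))
print(prompt)
```

The prompt that comes out the other end is ordinary text, which is the whole point: the generator needs no special integration with the retriever.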
There's no token-level integration - just text passing between the two models. MALM acts as a retrieval layer that provides relevant context, and Qwen does the generation with that context in its prompt.
The single-token-per-key insight applies only inside MALM, where it enables perfect retrieval. Qwen just sees regular text.
I wrote a deep dive into how Magic AI's 100M token context window might work, starting from their HashHop benchmark and building up to MALM - a Memory-Augmented Language Model.
Key insight: treating each key as a single token enables perfect retrieval at unlimited context lengths.
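A minimal sketch of that insight, with made-up key generation and lookup helpers (not the article's actual solver): when every whole key is a single vocabulary token, a hop is an exact lookup with no subword ambiguity, so accuracy stays at 100% no matter how many pairs the "context" holds.

```python
# Hypothetical single-token-per-key sketch: each hash key maps to exactly
# one token id, so multi-hop retrieval reduces to exact dictionary lookups.
import random
import string

random.seed(0)

def random_key(n=8):
    """Generate a random lowercase key, standing in for a hash string."""
    return "".join(random.choices(string.ascii_lowercase, k=n))

# A chain of unique random keys: keys[i] -> keys[i + 1].
keys = list(dict.fromkeys(random_key() for _ in range(10_000)))
pairs = {keys[i]: keys[i + 1] for i in range(len(keys) - 1)}

# "Tokenize": every whole key becomes a single token id, so there is no
# subword splitting for the model to resolve.
vocab = {key: i for i, key in enumerate(keys)}

def hop(key, hops=1):
    """Follow the hash chain `hops` steps via exact single-token lookup."""
    for _ in range(hops):
        key = pairs[key]
    return key

print(hop(keys[0], hops=3) == keys[3])  # exact multi-hop retrieval
```

Because lookup is exact rather than approximate attention over subword pieces, the size of `pairs` is irrelevant to accuracy, which is why this framing scales to very long contexts.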
The article covers:
- How HashHop works and why its perfect accuracy is suspicious
- Building a tokenized solver that achieves 100% accuracy
- Scaling to MALM for real code search tasks
- Why this approach could handle 100M+ tokens
Read the full article: https://huggingface.co/blog/codelion/reverse-engineering-magic-hashhop
Try the model: codelion/malm-165m
Code: https://github.com/codelion/hash-hop