Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
iliass ayaou
datalyes
Follow
ValeKnappich's profile picture
dkapt's profile picture
cbobed's profile picture
10 followers
Ā·
2 following
iliass-ayaou-7bb481151
AI & ML interests
information retrieval, patent retrieval, knowledge management, data engineering and architecture, NLP
Recent Activity
posted
an
update
1 day ago
I am happy to share that three models from PATENTEB are now publicly available: patembed-large, patembed-base, and patembed-base_long_4096. The models can be found here: https://huggingface.co/collections/datalyes/patembed-models-collection Feedback, issues, and use cases are very welcome.
new
activity
about 1 month ago
datalyes/DAPFAM_patent:
Add task category: text-retrieval
reacted
to
nouamanetazi
's
post
with š
3 months ago
After training šš¦šØš„ššš on ššš ššššš¬ for nearly a month, I've come to realize something most people overlook: š¢š§šš«šš¬šš«š®ššš®š«š š¢š¬ šš”š š¦šš¤š-šØš«-šš«ššš¤ šššššØš« š¢š§ ššš šš«šš¢š§š¢š§š . š„ Everyone talks about model architecture and data quality. And yes, those matter immensely. But here's what nobody tells you: when your training run fails at 2 AM because of mysterious šššš šš«š«šØš«š¬, or when your expensive GPU cluster is running at šš% šššš¢šš¢šš§šš², the problem isn't your model. It's most probably a š¦š¢š¬š®š¬š šØš šš”š š”šš«šš°šš«š. š ļø Questions that seemed simple but had no clear answers: Why is ššØš šš«šš¢š§š¢š§š š¬š„šØš°šš« šš”šš§ ššš§š¬š š¦šØššš„š¬? Which šššš šš„šš š¬ should we actually set? How often should we checkpoint without killing throughput? That's why we built šš”š šš¦šØš„ šš«šš¢š§š¢š§š šš„šš²ššØšØš¤ š: a complete guide covering everything from model architecture and data curation to the SmolLM3 training marathon, post-training techniques, and crucially, the š¢š§šš«šš¬šš«š®ššš®š«š š„šš²šš« that most teams get wrong. We validated real vs theoretical bandwidth across the entire stack: šššš š”š¢ššš¢š§š š šš/š¬, šššš¢š§š¤ š.š š«šššš”š¢š§š ššš šš/š¬, šššš ššš§š šš šš.š šš/š¬. Then we ran collective operations across ššš šššš¬ (16 nodes, 8xH100s each) and measured how performance degrades at scale: all-reduce drops from ššš šš/š¬ on a single node to ššš-ššš šš/š¬ across 16 nodes. If you've ever wondered why your training runs are slower than they should be, or you're planning to scale up and want to avoid expensive mistakes, this guide might save you weeks of debugging. šš”š šš¦šØš„ šš«šš¢š§š¢š§š šš„šš²ššØšØš¤: https://lnkd.in/e5MKXUHS Shared with ā¤ļø by the HuggingFace team
View all activity
Organizations
datalyes
's datasets
16
Sort:Ā Recently updated
datalyes/title2full
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
18.7k
ā¢
1
datalyes/retrieval_OUT
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
15.5k
ā¢
2
ā¢
1
datalyes/retrieval_MIXED
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
15.6k
ā¢
1
ā¢
1
datalyes/retrieval_IN
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
15.8k
ā¢
6
ā¢
1
datalyes/problem2solution
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
18.7k
ā¢
2
datalyes/problem2full
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
18.7k
ā¢
19
datalyes/para_solution
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
18.7k
ā¢
12
datalyes/para_problem
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
18.7k
ā¢
1
datalyes/effect2substance
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
2.04k
ā¢
7
datalyes/effect2full
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
2.04k
ā¢
2
datalyes/clusters_inventor
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
86.8k
ā¢
10
datalyes/clusters_ext_full_ipc
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
47.2k
ā¢
15
datalyes/class_text2ipc3
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
48.7k
ā¢
5
datalyes/class_nli_oldnew
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
37.9k
ā¢
2
datalyes/class_bloom
Viewer
ā¢
Updated
Oct 28, 2025
ā¢
19k
ā¢
4
datalyes/DAPFAM_patent
Viewer
ā¢
Updated
Sep 10, 2025
ā¢
96.5k
ā¢
184
ā¢
6