·
AI & ML interests
NLP
Recent Activity
Organizations
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr3e4_bs256
3B
•
Updated
•
8
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_full_lr2e4_bs256
3B
•
Updated
•
6
RefalMachine/ruadapt_qwen2.5_3B_unigram_32000_full_lr2e4_bs256
3B
•
Updated
•
5
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_full_lr2e4_bs256
3B
•
Updated
•
4
RefalMachine/ruadapt_qwen2.5_3B_bpe_32000_full_lr2e4_bs256
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_darulm_bpe_64000_full_lr3e4_bs256
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_unigram_32000_mean_init
3B
•
Updated
•
5
RefalMachine/ruadapt_qwen2.5_3B_ext_cl100k_bpe_32000_mean_init
3B
•
Updated
•
8
RefalMachine/ruadapt_qwen2.5_3B_unigram_32000_mean_init
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_bpe_32000_mean_init
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_darulm_cl100k_extended_u60k_full_lr3e4_bs256
3B
•
Updated
•
6
RefalMachine/ruadapt_qwen2.5_3B_darulm_cl100k_extended_u60k_full_lr2e4_bs256
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_darulm_cl100k_extended_u60k_mean_init
3B
•
Updated
•
6
RefalMachine/ruadapt_qwen2.5_3B_darulm_bpe_128000_full_lr2e4_bs256
3B
•
Updated
•
4
RefalMachine/ruadapt_qwen2.5_3B_darulm_unigram_128000_full_lr2e4_bs256
3B
•
Updated
•
6
RefalMachine/ruadapt_qwen2.5_3B_darulm_unigram_128000_mean_init
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_darulm_bpe_128000_mean_init
3B
•
Updated
•
7
RefalMachine/ruadapt_qwen2.5_3B_darulm_bpe_64000_full_lr2e4_bs256
3B
•
Updated
•
5
RefalMachine/ruadapt_qwen2.5_3B_darulm_bpe_64000_mean_init
3B
•
Updated
•
5
RefalMachine/ruadapt_llama3_instruct_lep_saiga_kto_ablitirated
8B
•
Updated
•
17
•
2
RefalMachine/ruadapt_mistral_7b_openchat_extended_lep_ft
7B
•
Updated
•
8
RefalMachine/ruadapt_llama3_8b_instruct_extended_lep_ft
8B
•
Updated
•
12
•
4
RefalMachine/Meta-Llama-3-8B_eval
Updated
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_unigram_full_lr2e4_bs256_v2
Text Generation
•
8B
•
Updated
•
11
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_bpe_full_lr2e4_bs256_v2
Text Generation
•
8B
•
Updated
•
8
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_full_lr2e4_bs256
Text Generation
•
8B
•
Updated
•
11
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_bpe_full_lr2e4_bs256
Text Generation
•
8B
•
Updated
•
7
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_unigram_full_lr2e4_bs256
Text Generation
•
8B
•
Updated
•
9
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_full_lr2e4_bs256
Text Generation
•
8B
•
Updated
•
10
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_full_lr1e4_bs256
Text Generation
•
8B
•
Updated
•
10