Delta-Vector commited on
Commit
387e29f
·
verified ·
1 Parent(s): c697ea8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +337 -28
README.md CHANGED
@@ -1,35 +1,344 @@
1
  ---
2
  base_model:
3
- - Delta-Vector/Austral-Xgen-9B-Winton
4
  library_name: transformers
5
  tags:
6
- - mergekit
7
- - merge
8
-
 
 
 
 
 
 
9
  ---
10
- # xgen-grpo
11
-
12
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
-
14
- ## Merge Details
15
- ### Merge Method
16
-
17
- This model was merged using the Passthrough merge method using [Delta-Vector/Austral-Xgen-9B-Winton](https://huggingface.co/Delta-Vector/Austral-Xgen-9B-Winton) + /home/Ubuntu/Mango/verifiers/outputs/Fenrisúlfr-500-1e-4-clip-8e-6-9b/checkpoint-150 as a base.
18
-
19
- ### Models Merged
20
-
21
- The following models were included in the merge:
22
-
23
-
24
- ### Configuration
25
-
26
- The following YAML configuration was used to produce this model:
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
 
28
- ```yaml
29
- base_model: Delta-Vector/Austral-Xgen-9B-Winton+/home/Ubuntu/Mango/verifiers/outputs/Fenrisúlfr-500-1e-4-clip-8e-6-9b/checkpoint-150
30
- dtype: bfloat16
31
- merge_method: passthrough
32
- models:
33
- - model: Delta-Vector/Austral-Xgen-9B-Winton+/home/Ubuntu/Mango/verifiers/outputs/Fenrisúlfr-500-1e-4-clip-8e-6-9b/checkpoint-150
 
34
 
35
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  base_model:
3
+ - Delta-Vector/Francois-PE
4
  library_name: transformers
5
  tags:
6
+ - fine-tuning
7
+ - prose
8
+ - GRPO
9
+ - axolotl
10
+ - finetune
11
+ - roleplaying
12
+ - creative-writing
13
+ datasets:
14
+ - PocketDoc/Dans-Systemmaxx
15
  ---
16
+ <style>
17
+ body {
18
+ font-family: 'Quicksand', sans-serif;
19
+ background: linear-gradient(135deg, #f9ffd1 0%, #e2fab5 100%);
20
+ color: #000000;
21
+ margin: 0;
22
+ padding: 0;
23
+ font-size: 16px;
24
+ }
25
+ .container {
26
+ margin: 20px;
27
+ background-color: rgba(255, 255, 255, 0.9);
28
+ padding: 20px;
29
+ border-radius: 12px;
30
+ box-shadow: 0 4px 20px rgba(0, 0, 0, 0.3);
31
+ border: 3px solid #000000;
32
+ position: relative;
33
+ }
34
+ .header h1 {
35
+ font-size: 28px;
36
+ color: #000000;
37
+ margin: 0 0 20px 0;
38
+ text-align: center;
39
+ text-decoration: underline;
40
+ }
41
+ .section {
42
+ margin-top: 30px;
43
+ }
44
+ .section h2 {
45
+ font-size: 24px;
46
+ color: #000000;
47
+ text-align: center;
48
+ text-decoration: underline;
49
+ }
50
+ .info p {
51
+ color: #000000;
52
+ line-height: 1.6;
53
+ font-size: 16px;
54
+ }
55
+ .info img {
56
+ width: 85%;
57
+ border-radius: 10px;
58
+ margin: 0 auto 15px;
59
+ display: block;
60
+ box-shadow: 0 0 20px rgba(0, 0, 0, 0.3);
61
+ border: 1px solid #000000;
62
+ }
63
+ a {
64
+ color: #000000;
65
+ text-decoration: none;
66
+ transition: color 0.2s ease;
67
+ }
68
+ a:hover {
69
+ color: #538125;
70
+ }
71
+ .button {
72
+ display: inline-block;
73
+ background-color: rgba(106, 168, 79, 0.8);
74
+ color: #000000;
75
+ padding: 10px 20px;
76
+ border-radius: 5px;
77
+ cursor: pointer;
78
+ text-decoration: none;
79
+ transition: background-color 0.2s ease, box-shadow 0.2s ease;
80
+ }
81
+ .button:hover {
82
+ background-color: #538125;
83
+ box-shadow: 0 0 15px rgba(106, 168, 79, 0.5);
84
+ }
85
+ pre {
86
+ background-color: rgba(240, 248, 225, 0.95);
87
+ padding: 15px;
88
+ border-radius: 5px;
89
+ overflow-x: auto;
90
+ border: 1px solid #000000;
91
+ }
92
+ code {
93
+ font-family: 'Courier New', monospace;
94
+ color: #000000;
95
+ }
96
+ .info-card {
97
+ background: rgba(249, 255, 235, 0.95);
98
+ border: 1px solid #000000;
99
+ border-radius: 8px;
100
+ overflow: hidden;
101
+ }
102
+ .info-header {
103
+ background: rgba(106, 168, 79, 0.1);
104
+ padding: 20px;
105
+ border-bottom: 1px solid #000000;
106
+ }
107
+ .info-header h3 {
108
+ color: #000000;
109
+ margin: 0 0 10px 0;
110
+ font-size: 20px;
111
+ text-align: center;
112
+ text-decoration: underline;
113
+ }
114
+ .model-tags {
115
+ display: flex;
116
+ gap: 8px;
117
+ flex-wrap: wrap;
118
+ }
119
+ .model-tag {
120
+ background: rgba(106, 168, 79, 0.1);
121
+ color: #000000;
122
+ padding: 4px 8px;
123
+ border-radius: 4px;
124
+ font-size: 12px;
125
+ border: 1px solid #000000;
126
+ }
127
+ .model-composition {
128
+ padding: 20px;
129
+ border-bottom: 1px solid #000000;
130
+ }
131
+ .model-composition h4 {
132
+ color: #000000;
133
+ margin: 0 0 15px 0;
134
+ font-size: 16px;
135
+ text-align: center;
136
+ text-decoration: underline;
137
+ }
138
+ .composition-list {
139
+ list-style: none;
140
+ padding: 0;
141
+ margin: 0;
142
+ display: grid;
143
+ gap: 10px;
144
+ }
145
+ .composition-list li {
146
+ color: #000000;
147
+ display: flex;
148
+ align-items: baseline;
149
+ gap: 8px;
150
+ }
151
+ .model-component {
152
+ font-weight: 500;
153
+ min-width: 120px;
154
+ }
155
+ .model-description {
156
+ padding: 20px;
157
+ background: rgba(255, 255, 255, 0.5);
158
+ }
159
+ .metrics-section {
160
+ margin-bottom: 30px;
161
+ }
162
+ .metrics-section details {
163
+ background: rgba(249, 255, 235, 0.95);
164
+ border: 1px solid #000000;
165
+ border-radius: 8px;
166
+ padding: 15px;
167
+ margin-bottom: 15px;
168
+ }
169
+ .metrics-section summary {
170
+ color: #000000;
171
+ font-size: 18px;
172
+ cursor: pointer;
173
+ outline: none;
174
+ padding: 5px 0;
175
+ text-align: center;
176
+ }
177
+ .creator-section {
178
+ margin: 20px 0;
179
+ }
180
+ .creator-badge {
181
+ display: inline-flex;
182
+ align-items: center;
183
+ background: rgba(249, 255, 235, 0.95);
184
+ border: 1px solid #000000;
185
+ border-radius: 8px;
186
+ padding: 10px 15px;
187
+ }
188
+ .creator-label {
189
+ color: #000000;
190
+ font-size: 14px;
191
+ margin-right: 8px;
192
+ }
193
+ .creator-link {
194
+ display: flex;
195
+ align-items: center;
196
+ gap: 5px;
197
+ color: #000000;
198
+ text-decoration: none;
199
+ transition: all 0.2s ease;
200
+ }
201
+ .creator-name {
202
+ font-weight: 600;
203
+ }
204
+ .creator-arrow {
205
+ font-size: 16px;
206
+ transition: transform 0.2s ease;
207
+ }
208
+ .creator-link:hover .creator-arrow {
209
+ transform: translateX(3px);
210
+ }
211
+ .link-arrow {
212
+ display: inline-block;
213
+ transition: transform 0.2s ease;
214
+ }
215
+ a:hover .link-arrow {
216
+ transform: translateX(3px);
217
+ }
218
+ .axolotl-container {
219
+ text-align: center;
220
+ margin: 30px 0;
221
+ }
222
+ .axolotl-container img {
223
+ max-width: 200px;
224
+ width: 200px;
225
+ height: auto;
226
+ border-radius: 8px;
227
+ box-shadow: 0 2px 10px rgba(0, 0, 0, 0.3);
228
+ border: 1px solid #000000;
229
+ background-color: #000000;
230
+ padding: 15px;
231
+ object-fit: contain;
232
+ }
233
+ </style>
234
+ <html lang="en">
235
+ <head>
236
+ <meta charset="UTF-8">
237
+ <meta name="viewport" content="width=device-width, initial-scale=1.0">
238
+ <title>Nanuq-R1 9B</title>
239
+ <link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@400;500;600&display=swap" rel="stylesheet">
240
+ </head>
241
+ <body>
242
+ <div class="container">
243
+ <div class="header">
244
+ <h1>Nanuq-R1 9B</h1>
245
+ </div>
246
+ <div class="info">
247
+ <img src="https://files.catbox.moe/i77ubj.webp" alt="Model banner">
248
+ <div style="text-align: center;">
249
+ <div class="creator-section">
250
+ <div class="creator-badge">
251
+ <span class="creator-label">Created by</span>
252
+ <a href="https://huggingface.co/Delta-Vector" target="_blank" class="creator-link">
253
+ <span class="creator-name">Delta-Vector</span>
254
+ <span class="creator-arrow">→</span>
255
+ </a>
256
+ </div>
257
+ </div>
258
+ <div class="model-info">
259
+ <h2>Model Information</h2>
260
+ <div class="info-card">
261
+ <div class="info-header">
262
+ <h3>Nanuq-R1 9B</h3>
263
+ <div class="model-tags">
264
+ <span class="model-tag">GRPO Experiment</span>
265
+ <span class="model-tag">Austral finetune</span>
266
+ <span class="model-tag">Creative & Refreshing Prose with deep IF.</span>
267
+ </div>
268
+ </div>
269
+ <div class="model-description">
270
+ <p>A sequel! The new Nanuq series is meant to be as a testing grounds for my GRPO experiments, Built ontop of Austral Xgen 9B, I made an RL env using PrimeIntellect-ai/verifiers and implemented InternLM/POLAR in said env, then using Pocketdoc's Systemmax dataset, I finetuned the model for 150 steps and this was the result.</p>
271
+ </div>
272
+ </div>
273
+ </div>
274
+ <div class="section">
275
+ <h2>Quantized Versions</h2>
276
+ <div class="info-card">
277
+ <div class="model-composition">
278
+ <h4>Available Downloads</h4>
279
+ <ul class="composition-list">
280
+ <li><span class="model-component"><a href="" target="_blank">GGUF Format</a></span>For use with LLama.cpp & Forks(Coming Soon!)</li>
281
+ <li><span class="model-component"><a href="" target="_blank">EXL2 Format</a></span>For use with TabbyAPI (Coming soon!)</li>
282
+ </ul>
283
+ </div>
284
+ </div>
285
+ </div>
286
+ <div class="section">
287
+ <h2>Prompting</h2>
288
+ <p>Model has been tuned with the ChatML formatting. A typical input would look like this:</p>
289
+ <pre><code>"""&lt;|im_start|&gt;user
290
+ Hi there!&lt;|im_end|&gt;
291
+ &lt;|im_start|&gt;assistant
292
+ Nice to meet you!&lt;|im_end|&gt;
293
+ &lt;|im_start|&gt;user
294
+ Can I ask a question?&lt;|im_end|&gt;
295
+ &lt;|im_start|&gt;assistant
296
+ """</code></pre>
297
+ </div>
298
+ <div class="section">
299
+ <h2>System Prompting</h2>
300
+ <p>I would highly recommend using either Euryale's system prompt or the EVA system prompt with the model.</p>
301
+ <div class="metrics-section">
302
+ <details>
303
+ <summary>See Sao10k's Euryale System Prompt</summary>
304
+ <pre><code>Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}.
305
+ &lt;Guidelines&gt;
306
+ • Maintain the character persona but allow it to evolve with the story.
307
+ • Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant.
308
+ • All types of outputs are encouraged; respond accordingly to the narrative.
309
+ • Include dialogues, actions, and thoughts in each response.
310
+ • Utilize all five senses to describe scenarios within {{char}}'s dialogue.
311
+ • Use emotional symbols such as "!" and "~" in appropriate contexts.
312
+ • Incorporate onomatopoeia when suitable.
313
+ • Allow time for {{user}} to respond with their own input, respecting their agency.
314
+ • Act as secondary characters and NPCs as needed, and remove them when appropriate.
315
+ • When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}.
316
+ &lt;/Guidelines&gt;
317
 
318
+ &lt;Forbidden&gt;
319
+ Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona.
320
+ Writing for, speaking, thinking, acting, or replying as {{user}} in your response.
321
+ Repetitive and monotonous outputs.
322
+ • Positivity bias in your replies.
323
+ Being overly extreme or NSFW when the narrative context is inappropriate.
324
+ &lt;/Forbidden&gt;
325
 
326
+ Follow the instructions in &lt;Guidelines&gt;&lt;/Guidelines&gt;, avoiding the items listed in &lt;Forbidden&gt;&lt;/Forbidden&gt;.</code></pre>
327
+ </details>
328
+ </div>
329
+ </div>
330
+ <div class="section">
331
+ <h2>Training</h2>
332
+ <p>The training was done for 150 steps using 8 x <a href="https://www.nvidia.com/en-us/data-center/h200/">H200s</a> GPUs for the fine-tuning of the model.</p>
333
+ <p style="text-align: center; margin-top: 20px;">
334
+ <div class="axolotl-container">
335
+ <a href="https://github.com/PrimeIntellect-ai/verifiers" target="_blank">
336
+ <img src="https://files.catbox.moe/j36qbe.png" alt="Built with Verifiers">
337
+ </a>
338
+ </div>
339
+ <div class="section">
340
+ <h2>Credits</h2>
341
+ <p>Thank you to <a href="https://huggingface.co/intervitens">Intervitens</a>, <a href="https://huggingface.co/cgato">Cgato</a>, <a href="https://huggingface.co/kubernetes-bad">Kubernetes Bad</a>, <a href="https://huggingface.co/cgato">Cgato</a>, <a href="https://huggingface.co/snwy">Snwy</a>, <a href="https://huggingface.co/AuriAetherwiing">Auri</a>, <a href="https://x.com/willccbb">Will Brown</a> and most of all: <a href="https://huggingface.co/kalomaze">Kalomaze</a> </p>
342
+ </div>
343
+ </div>
344
+ </div>