Spaces:

donato11
/

Capibara

Sleeping

donato11 commited on Dec 4, 2025

Commit

16ae4d8

1 Parent(s): ee73437

Final changes

Files changed (2) hide show

docs/dataset_card.md CHANGED Viewed

@@ -84,12 +84,14 @@ The NLBSE24 dataset was created to foster research and development in natural la
 #### Data Collection and Processing
-The original data consists of GitHub issue reports collected from various open-source repositories. The preprocessing for our "soft-cleaned" version includes:
 1.  **Column Identification:** Identifying 'label' and 'issue' columns for target and text data respectively.
 2.  **Basic Structural Cleaning:**
     *   Removal of rows with inconsistent data types in columns.
     *   Removal of rows containing missing values (NaN) or empty strings across relevant fields.
-    *   Removal of exact duplicate rows.
 #### Who are the source data producers?

 #### Data Collection and Processing
+The original data consists of GitHub issue reports collected from various open-source repositories.
+<!--
+The preprocessing for our "soft-cleaned" version includes:
 1.  **Column Identification:** Identifying 'label' and 'issue' columns for target and text data respectively.
 2.  **Basic Structural Cleaning:**
     *   Removal of rows with inconsistent data types in columns.
     *   Removal of rows containing missing values (NaN) or empty strings across relevant fields.
+    *   Removal of exact duplicate rows. -->
 #### Who are the source data producers?

docs/model_card.md CHANGED Viewed

@@ -221,26 +221,6 @@ If you use this model in your research, consider citing the relevant SetFit and
 **BibTeX:**
-```bibtex
-@article{setfit2022,
-  title={{SetFit: Efficient Few-Shot Learning with Sentence Transformers}},
-  author={Hofst{\"a}tter, Philipp and Reimers, Nils and {de Jong}, Henri and van der Vegt, Wouter and van der Velde, Maarten and Rausch, Andreas and Aken, Bob van and Pietsch, Stefan and Godey, Julien and van der Goot, Rob and de Jong, Iryna and Gurevych, Iryna and de Rijke, Maarten},
-  journal={arXiv preprint arXiv:2209.11055},
-  year={2022}
-}
-@inproceedings{wang-etal-2020-mini,
-    title = "{MiniLM}: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers",
-    author = "Wang, Wenhui  and
-      Gao, Furu  and
-      Chen, Bin  and
-      Chen, Yaojie  and
-      Li, Shijian  and
-      Han, Shuzhou",
-    booktitle = "Advances in Neural Information Processing Systems",
-    volume = "33",
-    year = "2020"
-}
-```
 **APA:**