Matthias Reso před 1 rokem
rodič
revize
1cfd709e26
2 změnil soubory, kde provedl 4 přidání a 2 odebrání
  1. 1 1
      docs/Dataset.md
  2. 3 1
      scripts/spellcheck_conf/wordlist.txt

+ 1 - 1
docs/Dataset.md

@@ -5,7 +5,7 @@ The provided fine tuning script allows you to select between three datasets by p
 * [grammar_dataset](https://huggingface.co/datasets/jfleg) contains 150K pairs of english sentences and possible corrections.
 * [alpaca_dataset](https://github.com/tatsu-lab/stanford_alpaca) provides 52K instruction-response pairs as generated by `text-davinci-003`.
 * [samsum_dataset](https://huggingface.co/datasets/samsum) contains about 16k messenger-like conversations with summaries.
-* [OpenAssistent/oaast1](https://huggingface.co/datasets/OpenAssistant/oasst1/) contains about 88k messages from assistant-style conversations.
+* [OpenAssistant/oasst1](https://huggingface.co/datasets/OpenAssistant/oasst1/) contains about 88k messages from assistant-style conversations.
 
 ## Using custom datasets
 

+ 3 - 1
scripts/spellcheck_conf/wordlist.txt

@@ -1147,4 +1147,6 @@ HuggingFace's
 LoRA
 bitsandbytes
 CLA
-dialogs
+dialogs
+OpenAssistant
+oasst1