Datasets processed from MOSS for Fine-Tuning
Matthew Khoriaty
AMindToThink
AI & ML interests
None yet
Recent Activity
published a dataset 8 days ago
AMindToThink/olmo-2-1124-7b-four-stage-samples-rlhf-diversity updated a dataset 8 days ago
AMindToThink/olmo-2-1124-7b-four-stage-samples-rlhf-diversity liked a model 2 months ago
codefuse-ai/F2LLM-4BOrganizations
None yet
RMU Unlearning Gemmas
RMU: bio forget corpora. cyber: cyber forget. both wikitext retain
https://arxiv.org/abs/2410.19278.
The code https://github.com/AMindToThink/wmdp.
-
AMindToThink/gemma-2-2b-it_RMU_s100_a300_layer3
Text Generation • 3B • Updated • 5 • -
AMindToThink/gemma-2-2b-it_RMU_s100_a100_layer3
Text Generation • 3B • Updated • 3 -
AMindToThink/gemma-2-2b-it_RMU_s100_a500_layer3
Text Generation • 3B • Updated • 2 • -
AMindToThink/gemma-2-2b-it_RMU_s100_a1200_layer3
Text Generation • 3B • Updated • 4 •
MOSS Processed Data
Datasets processed from MOSS for Fine-Tuning
RMU Unlearning Gemmas
RMU: bio forget corpora. cyber: cyber forget. both wikitext retain
https://arxiv.org/abs/2410.19278.
The code https://github.com/AMindToThink/wmdp.
-
AMindToThink/gemma-2-2b-it_RMU_s100_a300_layer3
Text Generation • 3B • Updated • 5 • -
AMindToThink/gemma-2-2b-it_RMU_s100_a100_layer3
Text Generation • 3B • Updated • 3 -
AMindToThink/gemma-2-2b-it_RMU_s100_a500_layer3
Text Generation • 3B • Updated • 2 • -
AMindToThink/gemma-2-2b-it_RMU_s100_a1200_layer3
Text Generation • 3B • Updated • 4 •
models 124
AMindToThink/folder_structure_test
Updated
AMindToThink/ppo_with_value15
Updated
AMindToThink/ppo_with_value14
Updated
AMindToThink/ppo_push_main_13
Text Generation • 0.2B • Updated • 3
AMindToThink/ppo_push_main_12
0.2B • Updated • 2
AMindToThink/ppo_on_value_no_value11
0.2B • Updated • 3
AMindToThink/pushed_value_model
Text Classification • 0.1B • Updated • 4
AMindToThink/value_model_0
Text Classification • 0.1B • Updated • 1
AMindToThink/ppo
Text Generation • 0.2B • Updated • 2
AMindToThink/ppo9
0.2B • Updated • 2
datasets 8
AMindToThink/olmo-2-1124-7b-four-stage-samples-rlhf-diversity
Preview • Updated • 23
AMindToThink/moss-002-sft-data-instruction-output-ascii-only
Viewer • Updated • 1.75M • 4
AMindToThink/openthoughts-as-assistant-for-sft
Viewer • Updated • 114k • 7
AMindToThink/common_bugs
Viewer • Updated • 45 • 30
AMindToThink/moss-002-sft-data-instruction-output
Viewer • Updated • 3.53M • 4 • 1
AMindToThink/mmlu_wmdp_combined
Viewer • Updated • 4.74k • 12
AMindToThink/mmlu_wmdp_bio_combined
Viewer • Updated • 1.43k • 12
AMindToThink/wmdp-cyber-corpus_unpaired-preference
Viewer • Updated • 5.47k • 15