[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
A curated list of human preference datasets for LLM fine-tuning, RLHF, and evaluation.
A Toolkit for Distributional Control of Generative Models
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).
Models, data, and code for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
Fine-tuning Language Models with Conditioning on Two Human Preferences
Fine-tuning LLMs with conditional training to learn two human preferences. UCL module project: Statistical Natural Language Processing (COMP0087).