Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.
Survey of preference alignment algorithms
An intelligent AI chatbot that can learn from the user
Library built on TextRL for easy training and usage of fine-tuned models using RLHF, a rewards model, and PPO
Code for my thesis titled "Eliciting latent knowledge from language reward models" for the MPhil in Machine Learning and Machine Intelligence at the University of Cambridge
Fine-tuning LLMs (FLAN-T5) for a text summarisation task using the PEFT approach; evaluation is computed with ROUGE scores, and LoRA is used for parameter-efficient fine-tuning.
An alternative RLHF reward model formulation from a social choice perspective
Implemented the Proximal Policy Optimization (PPO) algorithm to fine-tune a large language model for generating consistently positive reviews
Projects and models built in Python with PyTorch, implementing reinforcement learning algorithms for reward-based tasks.
This repository collects small projects and some theoretical material that I used to get into NLP and LLMs in a practical and efficient way.
Improving LLM truthfulness via reporting confidence
Aligning a GPT-2 model to generate non-toxic text
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction
A Comparison of LLM Chat Bot Implementation Methods with Travel Use Case
Codebase and experiments for large language modeling (LLM)
Summaries of papers related to the alignment problem in NLP
Research on the reinforcement learning algorithm behind ChatGPT
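Several of the repositories above train a reward model on human preference pairs before running PPO. The standard objective for that step is the Bradley-Terry pairwise loss, -log σ(r_chosen − r_rejected). Here is a minimal sketch in plain Python; the function name and scalar-reward inputs are illustrative, not taken from any listed repo:

```python
import math

def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Bradley-Terry pairwise loss: -log sigmoid(r_chosen - r_rejected).

    Low when the reward model scores the human-preferred response
    well above the rejected one; log(2) when the two rewards tie.
    """
    margin = reward_chosen - reward_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# A tie costs log(2); a large positive margin drives the loss toward zero.
assert abs(preference_loss(0.0, 0.0) - math.log(2.0)) < 1e-9
assert preference_loss(5.0, -5.0) < preference_loss(1.0, -1.0)
```

In practice, libraries such as TRL compute this same loss over batches of reward-model logits; minimizing it pushes the model to rank preferred responses above rejected ones.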