- Shanghai
- www.vectortheta.com
Block or Report
Block or report michaelnny
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned Loading
-
Llama3-FunctionCalling
Llama3-FunctionCalling PublicFine-tune Llama3 model to support function calling
Jupyter Notebook 13
-
InstructLLaMA
InstructLLaMA PublicImplements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…
-
alpha_zero
alpha_zero PublicA PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
-
deep_rl_zoo
deep_rl_zoo Public archiveA collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
If the problem persists, check the GitHub status page or contact support.