Reinforcement Learning from Human Feedback

Instructor: Nikita Namjoshi

What you'll learn

  •   Get a conceptual understanding of Reinforcement Learning from Human Feedback (RLHF), as well as the datasets needed for this technique.
  •   Fine-tune the Llama 2 model using RLHF with the open-source Google Cloud Pipeline Components Library.
  •   Evaluate the tuned model's performance against the base model.

Skills you'll practice

  •   Google Cloud Platform
  •   Prompt Engineering
  •   Large Language Modeling
  •   Performance Tuning
  •   Reinforcement Learning