Ementorhub

Reinforcement Learning from Human Feedback

Instructor: Nikita Namjoshi

What you'll learn

Get a conceptual understanding of Reinforcement Learning from Human Feedback (RLHF), as well as the datasets needed for this technique.

Fine-tune the Llama 2 model using RLHF with the open source Google Cloud Pipeline Components Library.

Evaluate tuned model performance against the base model with evaluation methods.

Skills you'll practice

Google Cloud Platform

Prompt Engineering

Large Language Modeling

Performance Tuning

Reinforcement Learning

Generative AI Advance Fine-Tuning for LLMs

Finetuning Large Language Models

Efficiently Serving LLMs