← Back to glossary
RLHF (Reinforcement Learning from Human Feedback)
Training models with human feedback to improve their responses.
Advanced rlhf feedback_humano
Full definition
Training models with human feedback to improve their responses.
Example in a business context
Improving the quality and safety of a conversational assistant.