← Back to glossary
Safety filters
Additional rules and models that block or modify LLM responses to prevent harmful, illegal or inappropriate content.
Intermediate seguridad moderacion contenido
Full definition
Additional rules and models that block or modify LLM responses to prevent harmful, illegal or inappropriate content.
Example in a business context
Preventing a customer support chatbot from generating insults, personal data or dangerous medical advice.