Safety filters

Additional rules and models that block or modify LLM responses to prevent harmful, illegal or inappropriate content.

Intermediate security moderation content

Full definition

Additional rules and models, layered on top of an LLM, that block or modify its responses to prevent harmful, illegal, or inappropriate content.

Example: preventing a customer support chatbot from generating insults, personal data, or dangerous medical advice.
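
The block-or-modify behavior described above can be sketched as a small rule-based filter applied to a chatbot response before it reaches the user. This is a minimal illustration, not a production design: the word list, the regular expressions, the refusal message, and the names `apply_safety_filter` and `FilterResult` are all hypothetical, and real systems typically combine such rules with trained classifier models.

```python
import re
from dataclasses import dataclass

# Hypothetical rules, chosen only for illustration.
INSULTS = {"idiot", "stupid"}                           # toy insult list
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")      # rough email pattern
MEDICAL_RE = re.compile(r"\b(dosage|dose|overdose)\b", re.IGNORECASE)

REFUSAL = "Sorry, I can't help with that."


@dataclass
class FilterResult:
    text: str    # the text that is safe to show the user
    action: str  # "allow", "modify", or "block"


def apply_safety_filter(response: str) -> FilterResult:
    """Block or modify an LLM response according to simple rules."""
    words = set(re.findall(r"[a-z']+", response.lower()))

    # Block: insults or risky medical advice replace the whole response.
    if words & INSULTS or MEDICAL_RE.search(response):
        return FilterResult(REFUSAL, "block")

    # Modify: redact personal data but keep the rest of the response.
    redacted = EMAIL_RE.sub("[redacted]", response)
    if redacted != response:
        return FilterResult(redacted, "modify")

    # Allow: nothing matched, pass the response through unchanged.
    return FilterResult(response, "allow")
```

Blocking is checked before redaction so that a response containing both an insult and an email address is refused outright rather than partially cleaned.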