GlossarIA — Open AI glossary
A curated, bilingual glossary of AI concepts for teams and companies.
A/B testing
Controlled comparison of two versions of a model or system to see which performs better.
experimentos a_b
Intermediate
Accountability
Obligation to assign and track who is responsible for AI systems' decisions and errors.
responsabilidad etica gobernanza
Intermediate
Activation function
Mathematical function applied to the output of an artificial neuron that determines if it should be activated, introducing non-linearity to the network.
redes matematicas neurona
Intermediate
Adversarial attacks
Techniques that manipulate inputs in ways imperceptible to humans but that fool the model.
seguridad ataques robustez
Advanced
Agent orchestrator
System that coordinates multiple specialized models or tools to solve complex tasks autonomously.
agentes flujo automatizacion
Advanced
Agentic AI
AI systems capable of planning, reasoning and executing complex tasks autonomously with minimal human supervision.
agentes autonomia razonamiento
Advanced
Agentic Workflow
Design pattern where the AI iterates, searches for tools, and reflects to complete a task.
agentes procesos automatizacion
Advanced
AI API
Interface that allows integrating AI models into external applications and systems.
api integracion backend
Intermediate
AI framework
Set of libraries and tools for developing, training and deploying AI models.
framework desarrollo
Intermediate
AI governance
Framework of policies, processes and controls to manage responsible AI development and use in the organization.
gobernanza estrategia control
Intermediate
AI impact assessment
Prior analysis of risks and effects before deploying an AI system.
impacto riesgo evaluacion
Intermediate
AI pipeline
Set of steps that define a model life cycle, from data collection to deployment.
proceso mlops
Intermediate
AI slop
Low-quality, repetitive, generic or valueless content mass-produced by AI that floods websites, social media and documents.
calidad generativa contenido etica
Intermediate
Algorithmic bias
Distortion in model results caused by biased data or design choices.
sesgo etica
Intermediate
Algorithmic transparency
Degree to which the functioning of an algorithmic decision system can be explained and understood.
transparencia modelo explicacion
Intermediate
Alignment
Process of ensuring AI system goals and behaviors match human values and intentions.
etica alineacion valores
Advanced
API Gateway
Centralized entry point that manages, routes and controls access to multiple AI APIs and backend services.
api arquitectura integracion
Intermediate
Artificial General Intelligence (AGI)
Hypothetical AI system capable of understanding, learning and applying knowledge across any intellectual task at or above human level.
agi inteligencia_general futuro
Advanced
Artificial intelligence (AI)
Field that aims to make machines perform tasks that normally require human intelligence, such as learning, reasoning or communicating.
fundamentos empresa
Basic
Attention mechanism
Technique that allows the model to focus on the most relevant parts of the input when processing information, fundamental in transformer architectures.
fundamentos transformer arquitectura
Intermediate
Audio generator
Model that synthesizes voice or music from text or parameters.
audio voz sintesis
Intermediate
Autonomous agent
Model or system that chains actions to perform complex tasks with little human supervision.
agentes automatizacion
Advanced
Backpropagation
Fundamental algorithm that calculates the gradient of the loss function with respect to neural network weights, enabling learning.
fundamentos entrenamiento optimizacion
Intermediate
Batch inference
Process of running predictions on multiple examples simultaneously instead of one-by-one, optimizing resources and time.
inferencia produccion optimizacion
Intermediate
Batch size
Number of training examples processed simultaneously before updating model weights.
fundamentos entrenamiento parametro
Basic
Benchmarking
Evaluating models by comparing their results with standard metrics or reference datasets.
metricas comparacion
Intermediate
BLEU / ROUGE
Metrics to evaluate generated text quality by comparing with references: BLEU for translation, ROUGE for summaries.
metricas evaluacion texto
Intermediate
Catastrophic forgetting
Phenomenon where a neural network completely forgets what it learned in a previous task upon being trained on a new task.
entrenamiento memoria problema
Advanced
Chain-of-Thought (CoT)
Prompting technique that forces the model to reason step-by-step before giving the final answer, dramatically improving accuracy on complex tasks.
prompt razonamiento generativa
Intermediate
Chatbot
Virtual assistant that holds automated conversations with users.
chatbot soporte conversacion
Basic
Clustering
Unsupervised learning technique that groups similar data into clusters without prior labels, revealing hidden patterns.
no_supervisado agrupacion segmentacion
Basic
Computer Vision
AI area that enables machines to interpret and understand images and videos.
vision imagen analisis
Basic
Confusion matrix
Table showing true positives, false positives, true negatives and false negatives of a classifier.
metricas evaluacion clasificacion
Basic
Constitutional AI
Anthropic's approach to align models using explicit ethical principles instead of only human feedback.
etica alineacion seguridad
Advanced
Context
Additional information provided to the model (usually at the beginning of the conversation or in the system prompt) so it understands the role, domain, rules or specific reality in which it must operate.
prompt contexto dominio personalizacion
Intermediate
Context Window
Maximum number of tokens (words or subwords) that a language model can process and keep in memory in a single interaction.
llm token memoria contexto
Intermediate
Convolutional Neural Network (CNN)
Neural network architecture designed to process grid-structured data like images using convolutions and pooling.
arquitectura vision imagen
Intermediate
Cost per token
Price charged by LLM providers for each processed token (input + output).
coste facturacion token
Basic
Cross-Validation
Method to evaluate model performance by splitting data into subsets and rotating between training and testing.
evaluacion modelo validacion
Intermediate
Data augmentation
Generating additional or synthetic data to improve model training.
datos aumento sintetico
Intermediate
Data cleaning
Process of removing errors, duplicates and inconsistent values from data.
datos calidad
Basic
Data drift
Gradual change in the distribution of real-world data compared to training data, causing performance degradation.
monitorizacion calidad produccion
Intermediate
Data governance
Strategic management of the data lifecycle in an organization, including quality, access and security.
gobernanza datos estrategia
Intermediate
Data labeling
Process of annotating data with correct labels to train supervised models.
datos anotacion supervisado
Basic
Data lake
Centralized repository where raw data from multiple sources is stored.
datos almacen nube
Intermediate
Data warehouse
Structured repository optimized for analytical queries that stores processed, clean and organized data from multiple sources.
datos almacen analitica
Intermediate
Dataset
Structured collection of data used to train AI models.
datos fundamentos
Basic
Deep learning
Technique based on neural networks with many layers that can process complex data such as images, speech or text.
fundamentos redes vision
Intermediate
Deployment
Process of putting a trained AI model into production so it can be used by real applications, users or systems.
mlops produccion despliegue
Intermediate
Differential privacy
Technique that protects individual identity by adding controlled noise to aggregated data.
privacidad datos anonimizacion
Advanced
Diffusion models
Generative architecture that creates images by progressively adding noise and then learning to remove it.
arquitectura generativa imagen
Advanced
Digital rights
Set of principles and rules that protect people in the digital environment, including AI systems.
derechos ciudadania datos
Intermediate
Dropout
Regularization technique that randomly deactivates neurons during training to prevent overfitting, forcing the model to be more robust.
regularizacion entrenamiento overfitting
Intermediate
Edge AI
Running AI models directly on local devices without always relying on the cloud.
edge dispositivos iot
Intermediate
Embedding
Numeric representation of text, image or audio that allows measuring semantic similarity.
vector busqueda semantica
Intermediate
Emergent abilities
Abilities that appear in large models but not in smaller versions, without being explicitly trained.
llm escalado capacidades
Advanced
Ensemble Methods
Techniques that combine multiple models to improve accuracy and robustness, such as Random Forest or Boosting.
ensamble modelo combinacion
Intermediate
Epoch
One complete pass of the model through the entire training dataset.
fundamentos entrenamiento proceso
Basic
ETL (Extract-Transform-Load)
Process of extracting data from multiple sources, transforming it to a common format and loading it into a destination for analysis or training.
datos pipeline integracion
Intermediate
EU AI Act
European Union legal framework that classifies AI systems by risk levels and defines obligations.
regulacion ue ley
Intermediate
Explainability
Ability of a model to justify or explain its decisions in an understandable way.
explicabilidad modelo
Intermediate
Fairness
Ethical principle ensuring AI models do not discriminate against groups based on protected attributes like gender or ethnicity.
equidad etica sesgo
Intermediate
Feature
Variable or attribute in the data used to train a model.
datos feature
Intermediate
Feature engineering
Selection and transformation of variables that improve model learning.
features diseno
Advanced
Feature Store
Centralized repository that stores, documents and serves features for model training and inference.
mlops datos features
Intermediate
Federated Learning
Distributed training technique where the model learns from decentralized data without it leaving the original devices, preserving privacy.
distribuido privacidad entrenamiento
Advanced
Few-shot learning
Learning from only a small number of labeled examples.
pocos_datos ejemplos
Intermediate
Fine-tuning
Adapting a base model with specific data for a concrete use case.
ajuste modelo dominio
Intermediate
Future of work with AI
Changes in professional roles and skills driven by the adoption of artificial intelligence.
futuro_trabajo competencias empresa
Basic
General Data Protection Regulation (GDPR)
EU legal framework for personal data protection, applicable to AI systems processing sensitive information.
regulacion privacidad ue
Intermediate
Generative Adversarial Network (GAN)
Model consisting of a generator and discriminator that compete to create realistic data, such as synthetic images.
arquitectura generativa adversarial
Advanced
Gradient Descent
Optimization algorithm that iteratively adjusts model parameters to minimize the loss function.
fundamentos optimizacion entrenamiento
Intermediate
Green AI
Approach that prioritizes energy efficiency and carbon footprint reduction in AI projects.
sostenibilidad energia huella
Intermediate
Grounding
Technique to anchor model responses to real, up-to-date data (usually via RAG) to prevent hallucinations.
rag precision actualizacion
Intermediate
Guardrails
Set of rules, validations and controls that limit and guide AI model behavior to prevent dangerous or inappropriate outputs.
seguridad control validacion
Intermediate
Hallucination
Phenomenon in generative models where plausible but false or invented responses are produced.
generativa error llm
Intermediate
Human-in-the-loop
Approach in which a person reviews, corrects or validates the model's decisions.
human_in_the_loop revision
Intermediate
Hyperparameters
Parameters set manually before training that control the model's learning process.
fundamentos modelo ajuste
Intermediate
Image generator
Model that creates images from natural language descriptions.
imagen generativa creatividad
Basic
Imbalanced data
Situation where some classes have many more examples than others, causing model bias.
datos calidad sesgo
Intermediate
Inference
Use of a trained model to generate results or predictions.
proceso produccion
Basic
IoT with AI
Integration of AI into connected devices for real-time analysis and automation.
iot dispositivos integracion
Intermediate
Jailbreaking
Techniques to bypass an LLM's safety restrictions and make it generate prohibited content.
seguridad prompt vulnerabilidad
Intermediate
Knowledge Graph
Data structure that represents entities and their relationships as a graph, enabling complex reasoning and semantic searches.
grafo conocimiento relaciones
Advanced
Language model (LLM)
Algorithm trained on large volumes of text to generate natural language.
texto generativa empresa
Intermediate
Latency
Time the model takes to generate a response after receiving the request.
rendimiento produccion tiempo
Intermediate
Latent space
Abstract and compressed mathematical representation of data, where nearby points represent semantically similar concepts.
representacion matematicas generativa
Advanced
LoRA (Low-Rank Adaptation)
Efficient fine-tuning technique that trains only low-rank matrices instead of all model parameters.
fine_tuning eficiencia optimizacion
Intermediate
Loss function
Mathematical function that measures the difference between model predictions and actual values, guiding training.
fundamentos entrenamiento optimizacion
Intermediate
Low-code / No-code AI
Platforms that allow building AI solutions with little or no code.
low_code no_code automatizacion
Basic
Machine learning
Subfield of AI that trains algorithms to learn from data without being explicitly programmed.
fundamentos modelo empresa
Intermediate
MCP (Model Context Protocol)
Open protocol that standardizes how AI models connect with data sources and tools.
estandar datos herramientas
Advanced
Mixture of Experts (MoE)
Architecture that divides the model into multiple specialized experts, activating only the relevant ones for each input.
arquitectura escalabilidad eficiencia
Advanced
MLOps
Practice that combines development and operations to manage AI models in production.
mlops devops produccion
Intermediate
Model collapse
Phenomenon where models trained on data generated by other AI models progressively degrade in quality, losing diversity and accuracy.
calidad degradacion sintetico
Advanced
Model distillation
Technique to transfer knowledge from a large expensive model to a smaller faster one while keeping nearly the same accuracy.
optimizacion eficiencia modelo
Advanced
Model evaluation
Measuring model accuracy and quality using metrics such as accuracy, recall or F1-score.
modelo metricas
Intermediate
Model monitoring
Continuous tracking of performance, quality and behavior of models in production.
mlops produccion calidad
Intermediate
Model poisoning
Attack that corrupts training data so the model learns malicious behaviors.
seguridad ataques datos
Advanced
Model registry
Centralized repository that stores, versions and manages metadata for all ML models in the organization.
mlops gestion versionado
Intermediate
Model security
Measures to protect AI models from data leaks, adversarial attacks and misuse.
seguridad ataques proteccion
Intermediate
Model traceability
Ability to know how, with which data and with which versions a model was trained and changed.
trazabilidad auditoria
Intermediate
Model versioning
Practice of maintaining records of different model versions with their changes and metadata.
mlops gestion control
Basic
Multiagent systems
Architecture where multiple AI agents collaborate, negotiate or compete to solve complex problems.
agentes colaboracion coordinacion
Advanced
Multimodality
Ability of a model to understand and combine different data types such as text, image or audio.
multimodal imagen texto
Intermediate
Named Entity Recognition (NER)
NLP task that identifies and classifies entities in text such as people, organizations, locations or dates.
nlp texto extraccion
Intermediate
Natural Language Processing (NLP)
AI subfield focused on interactions between computers and human language, including text analysis and translation.
texto nlp lenguaje
Basic
Natural Language Understanding (NLU)
NLP subfield focused on making the machine understand meaning, intent, and entities in human text.
texto nlu entendimiento
Intermediate
Neural network
Mathematical model inspired by the human brain, made of interconnected nodes that learn relationships between data.
fundamentos redes
Basic
Normalization
Adjusting data values so they follow a common comparable scale.
datos escalado
Basic
OCR (Optical Character Recognition)
Technology that converts images of text into editable digital text.
vision texto documento
Basic
Open-source model
AI model whose weights and architecture are public and can be freely downloaded and modified (Llama-3, Mistral, Gemma, etc.).
modelo open_source licencia
Intermediate
Overfitting
When a model learns the training data too well and fails to generalize to new data.
modelo calidad
Intermediate
Parameters
Internal model variables (weights and biases) that are automatically adjusted during training from the data.
pesos entrenamiento modelo
Intermediate
PEFT (Parameter-Efficient Fine-Tuning)
Family of techniques that allow fine-tuning large models by modifying only a small fraction of their parameters.
fine_tuning eficiencia optimizacion
Intermediate
Perplexity
Metric that measures how surprised a language model is by new data; lower values indicate better performance.
metricas evaluacion llm
Intermediate
Personally Identifiable Information (PII)
Information that can be used to directly identify a person (name, ID, email, phone, etc.).
privacidad datos cumplimiento
Intermediate
Pre-training
Initial phase where a language model is trained on massive amounts of public internet text to learn general language patterns and world knowledge.
entrenamiento llm fundacion
Advanced
Precision and Recall
Classification metrics: precision measures accuracy of positive predictions, recall measures ability to find all positives.
metricas evaluacion clasificacion
Basic
Prompt
Instruction or text that guides the model's response.
prompt instrucciones
Basic
Prompt engineering
Design and optimization of prompts to improve generative model outputs.
prompt diseno plantillas
Intermediate
Prompt Injection
Vulnerability where a malicious user manipulates inputs to alter model behavior.
seguridad prompt ataque
Intermediate
Pruning
Optimization technique that removes unimportant connections or neurons from the model to reduce size and speed up inference.
optimizacion eficiencia compresion
Advanced
Quantization
Reduction of numerical precision of model weights (from 32-bit to 8-bit or 4-bit) to use less memory and run faster.
optimizacion hardware eficiencia
Advanced
RAG (Retrieval-Augmented Generation)
Technique that combines information retrieval with text generation.
rag busqueda generativa
Intermediate
Reasoning Model
AI model designed to 'think' before responding, breaking down complex problems into logical steps.
razonamiento llm complejidad
Advanced
Recommendation System
Tool that suggests items or content based on user preferences and historical data.
recomendacion personalizacion
Intermediate
Recurrent Neural Network (RNN)
Architecture for processing sequential data, maintaining memory of previous inputs; includes variants like LSTM for long dependencies.
arquitectura secuencias tiempo
Intermediate
Red-teaming
Process of intentionally testing an AI model with malicious or tricky prompts to uncover vulnerabilities and improve robustness.
seguridad pruebas ataques etica
Intermediate
Reinforcement learning
Model learns by trial and error, receiving rewards for correct decisions.
refuerzo politicas
Advanced
Responsible AI
Ethical, transparent and safe use of artificial intelligence in products and processes.
etica riesgo gobernanza
Intermediate
Retrieval
Process of searching and extracting relevant information from a knowledge base before generating a response.
rag busqueda informacion
Intermediate
RLAIF (Reinforcement Learning from AI Feedback)
Variant of RLHF where feedback to train the model comes from another AI system instead of humans, scaling the alignment process.
rlhf feedback_ia alineacion
Advanced
RLHF (Reinforcement Learning from Human Feedback)
Training models with human feedback to improve their responses.
rlhf feedback_humano
Advanced
Safety filters
Additional rules and models that block or modify LLM responses to prevent harmful, illegal or inappropriate content.
seguridad moderacion contenido
Intermediate
Semantic search
Search technique that understands the meaning and intent of the query, rather than just matching literal keywords.
busqueda embeddings nlp
Intermediate
Sentiment analysis
NLP technique that identifies emotions or attitudes (positive, negative, neutral) in text.
nlp texto analisis
Basic
Shadow AI
Unauthorized use of AI tools (ChatGPT, Gemini, etc.) by employees without IT or compliance approval.
gobernanza riesgo control
Intermediate
Skill
A specific function or capability that an AI agent can execute to perform a task.
agentes herramientas capacidades
Intermediate
Speech-to-text / Text-to-speech
Converting speech to text (STT) and text to speech (TTS) using AI models.
audio voz transcripcion
Basic
Supervised learning
Model learns from labeled data where the correct answer is known.
supervisado etiquetas
Basic
Synthetic data
Artificially generated data that mimics the statistical properties of real data, used when real data is scarce or to protect privacy.
datos sinteticos privacidad
Intermediate
System prompt
Fixed initial instruction that defines the model’s role, tone, rules and personality throughout the entire conversation.
prompt contexto personalizacion
Intermediate
Temperature
Hyperparameter that controls randomness in model responses (0 = deterministic, 1 = very creative).
parametro creatividad control
Intermediate
Text generator
Model that writes text based on instructions or prompts.
texto generativa contenido
Basic
Token
Smallest unit of text processed by a model, which can be a word or a fragment.
texto coste tokens
Basic
Tokenization
Process of splitting text into smaller units (tokens) that the model can process, such as words or subwords.
texto procesamiento nlp
Basic
Tool calling / Function calling
Model ability to decide when and how to call external tools (APIs, databases, calculators) instead of trying to answer everything itself.
funciones api agentes
Intermediate
Top-p (Nucleus sampling)
Sampling method that only considers tokens whose cumulative probability exceeds p (usually 0.9–0.95).
parametro muestreo nucleus
Intermediate
Training
Process by which a model learns from data and adjusts its internal parameters.
proceso modelo
Basic
Transfer Learning
Technique that reuses a pre-trained model on a related task to speed up training on a new problem.
aprendizaje transferencia modelo
Intermediate
Transformer
Architecture that allows efficient parallel processing of sequences such as text.
arquitectura text transformer
Intermediate
Underfitting
Situation where a model is too simple and fails to capture underlying patterns in the data, leading to poor performance on both training and test data.
fundamentos modelo calidad
Intermediate
Unsupervised learning
Model identifies patterns in data without prior labels.
no_supervisado clusters
Intermediate
Variational Autoencoder (VAE)
Model that learns to compress data into a latent space and then reconstruct it, useful for generation and dimensionality reduction.
arquitectura generativa compresion
Advanced
Vector store
Database optimized to store embeddings and perform similarity search.
vector busqueda almacen
Intermediate
Video generator
System that produces video clips or animations from text, images or templates.
video generativa
Intermediate
Vision Transformer (ViT)
Transformer-based architecture applied to images by splitting them into patches and processing them as sequences, often outperforming CNNs.
vision transformer arquitectura
Advanced
Zero-shot learning
Ability of a model to handle tasks without specific prior examples.
generalizacion tareas
Intermediate