GlossarIA
Open AI glossary for companies

Pruning

Optimization technique that removes unimportant connections or neurons from the model to reduce size and speed up inference.

Advanced optimization efficiency compression

Full definition

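Pruning is a model compression technique that removes the connections (weights), neurons, or larger structures such as channels that contribute least to a model's output. The result is a smaller model and, when the hardware or runtime can take advantage of the removed parameters, faster inference. Importance is often estimated from weight magnitude, and the pruned model is usually fine-tuned afterwards to recover any lost accuracy.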
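As a rough illustration of removing unimportant connections, here is a minimal sketch of magnitude-based pruning in plain Python. It assumes that "unimportant" means "smallest absolute value", which is one common criterion but not the only one. The function name `magnitude_prune`, the `sparsity` parameter, and the sample weights are hypothetical and chosen only for this example.

```python
def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude fraction set to 0.0.

    `sparsity` is the fraction of weights to remove (an assumption of this
    sketch: importance is approximated by absolute value).
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be between 0 and 1")

    # Number of weights to zero out.
    n_prune = round(len(weights) * sparsity)

    # Indices ordered from smallest to largest absolute value.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    to_zero = set(order[:n_prune])

    return [0.0 if i in to_zero else w for i, w in enumerate(weights)]


# Made-up weights for one tiny layer.
weights = [0.8, -0.05, 0.3, 0.01, -0.6, 0.02, 0.4, -0.03, 0.9, 0.07]

pruned = magnitude_prune(weights, sparsity=0.7)
remaining = sum(1 for w in pruned if w != 0.0)

print(pruned)
print(f"{remaining} of {len(weights)} weights kept")
```

Zeroing weights alone does not shrink a file or speed anything up. The savings appear only when the model is stored in a sparse format or the pruned structures are physically removed.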

Example in a business context

Reducing a 1 GB model to 300 MB (a 70% reduction) by removing the weights that have minimal impact on accuracy.
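The arithmetic behind this example can be checked with a short sketch. It assumes decimal units (1 GB = 1000 MB), and the helper name `size_reduction` is hypothetical.

```python
def size_reduction(original_mb, pruned_mb):
    """Return (fraction of size saved, compression ratio)."""
    saved_fraction = 1 - pruned_mb / original_mb
    compression_ratio = original_mb / pruned_mb
    return saved_fraction, compression_ratio


# 1 GB taken as 1000 MB (assumption: decimal units).
saved, ratio = size_reduction(1000, 300)
print(f"{saved:.0%} smaller, {ratio:.2f}x compression")
```

Both figures describe file size only. They say nothing about how much accuracy the pruned model keeps, which has to be measured separately.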