🤖 Machine Learning / AI
Intermediate
What is fine-tuning a pre-trained model?
Answer
Fine-tuning is the process of taking a model pre-trained on a large dataset and continuing training on a smaller, task-specific dataset. Instead of training from scratch, you initialize the model with pre-trained weights (which encode general knowledge) and run a few more epochs with a low learning rate to adapt to the new task. Common strategies: full fine-tuning (update all weights), feature extraction (freeze pre-trained layers, only train the new head), and layer-wise learning rate decay (lower LR for earlier layers). For LLMs, LoRA (Low-Rank Adaptation) enables efficient fine-tuning with few trainable parameters.
Previous
What is a ResNet (Residual Network)?
Next
What is the difference between parametric and non-parametric models?
More Machine Learning / AI Questions
View all →- Intermediate What is a convolutional neural network (CNN)?
- Intermediate What is a Recurrent Neural Network (RNN)?
- Intermediate What is an LSTM and how does it solve the vanishing gradient problem?
- Intermediate What is the attention mechanism in neural networks?
- Intermediate What is the Transformer architecture?