What is t-SNE?
Answer
t-SNE (t-distributed Stochastic Neighbor Embedding) is a non-linear dimensionality reduction technique used primarily for visualization of high-dimensional data in 2D or 3D. It models pairwise similarities between points in high-dimensional space using a Gaussian distribution, and in the low-dimensional space using a Student t-distribution (heavy tails prevent crowding). t-SNE preserves local structure and reveals clusters visually. Limitations: non-deterministic, computationally expensive for large datasets (O(n²)), hyperparameter-sensitive (perplexity matters), and distances between clusters are not meaningful. UMAP is a faster, more scalable alternative.
Previous
What is a recommendation system?
Next
What is the difference between discriminative and generative models?
More Machine Learning / AI Questions
View all →- Intermediate What is a convolutional neural network (CNN)?
- Intermediate What is a Recurrent Neural Network (RNN)?
- Intermediate What is an LSTM and how does it solve the vanishing gradient problem?
- Intermediate What is the attention mechanism in neural networks?
- Intermediate What is the Transformer architecture?