scikit-learn

Machine Learning in Python

Getting StartedRelease Highlights for 1.6
  • Simple and efficient tools for predictive data analysis
  • Accessible to everybody, and reusable in various contexts
  • Built on NumPy, SciPy, and matplotlib
  • Open source, commercially usable - BSD license

Classification

Identifying which category an object belongs to.

Applications: Spam detection, image recognition.
Algorithms:Gradient boosting, nearest neighbors, random forest, logistic regression, and more...

Classifier comparison
Examples

Regression

Predicting a continuous-valued attribute associated with an object.

Applications: Drug response, stock prices.
Algorithms:Gradient boosting, nearest neighbors, random forest, ridge, and more...

Decision Tree Regression with HGBT
Examples

Clustering

Automatic grouping of similar objects into sets.

Applications: Customer segmentation, grouping experiment outcomes.
Algorithms:k-Means, HDBSCAN, hierarchical clustering, and more...

A demo of K-Means clustering on the handwritten digits data
Examples

Dimensionality reduction

Reducing the number of random variables to consider.

Applications: Visualization, increased efficiency.
Algorithms:PCA, feature selection, non-negative matrix factorization, and more...

PCA example with Iris Data-set
Examples

Model selection

Comparing, validating and choosing parameters and models.

Applications: Improved accuracy via parameter tuning.
Algorithms:Grid search, cross validation, metrics, and more...

Demonstration of multi-metric evaluation on cross_val_score and GridSearchCV
Examples

Preprocessing

Feature extraction and normalization.

Applications: Transforming input data such as text for use with machine learning algorithms.
Algorithms:Preprocessing, feature extraction, and more...

Demonstrating the different strategies of KBinsDiscretizer
Examples

News

  • On-going development:scikit-learn 1.7 (Changelog).
  • January 2025. scikit-learn 1.6.1 is available for download (Changelog).
  • December 2024. scikit-learn 1.6.0 is available for download (Changelog).
  • September 2024. scikit-learn 1.5.2 is available for download (Changelog).
  • July 2024. scikit-learn 1.5.1 is available for download (Changelog).
  • May 2024. scikit-learn 1.5.0 is available for download (Changelog).
  • April 2024. scikit-learn 1.4.2 is available for download (Changelog).
  • February 2024. scikit-learn 1.4.1.post1 is available for download (Changelog).
  • January 2024. scikit-learn 1.4.0 is available for download (Changelog).
  • All releases:What's new (Changelog).

Community

Help us, donate!Cite us!

Who uses scikit-learn?

More testimonials...

close