Python packages for data mining and machine learning

The following are the best Python modules for data mining, according to KDnuggets.

Basics

  • numpy - numerical library
  • scipy - advanced math, signal processing,
    optimization, statistics
  • matplotlib - Python plotting
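
As a rough illustration of the kind of numerical work these basic libraries support, here is a minimal sketch using numpy only. The toy data and variable names are assumptions made up for this example; they do not come from the KDnuggets list.

```python
import numpy as np

# Hypothetical toy data: five points lying exactly on the line y = 2x + 1.
x = np.arange(5, dtype=float)
y = 2.0 * x + 1.0

# Basic descriptive statistics on an array.
mean_y = y.mean()
std_y = y.std()

# Least-squares fit of a degree-1 polynomial.
# np.polyfit returns coefficients from highest degree down: [slope, intercept].
slope, intercept = np.polyfit(x, y, 1)

print(mean_y, std_y, slope, intercept)
```

scipy and matplotlib build on the same array type, so an array like `y` can be passed to their routines without conversion.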

Machine Learning and Data Mining

  • MDP - a collection of supervised and unsupervised learning algorithms
  • mlpy - Machine Learning Python
  • NetworkX - for graph analysis
  • Orange - Data Mining Fruitful & Fun
  • pandas - Python Data Analysis Library
  • pybrain
  • scikit-learn - classic machine learning algorithms; provides simple and efficient solutions to learning problems

Natural Language

  • NLTK - Natural Language Toolkit

For web scraping

  • Scrapy - an open source web scraping framework for Python

Deep learning and text mining: