功能 | 技术 | 网址 |
核心库和统计数据 | NumPy | http://www.numpy.org/ |
SciPy | https://scipy.org/scipylib/ | |
Pandas | https://pandas.pydata.org/ | |
StatsModels | http://www.statsmodels.org/devel/ | |
可视化 | Matplotlib | https://matplotlib.org/index.html |
Seaborn | https://seaborn.pydata.org/ | |
Plotly | https://plot.ly/python/ | |
Bokeh | https://bokeh.pydata.org/en/latest/ | |
Pydot | https://pypi.org/project/pydot/ | |
机器学习 | Scikit-learn | http://scikit-learn.org/stable/ |
XGBoost / LightGBM / CatBoost(梯度增强算法) | https://xgboost.readthedocs.io/en/latest/ https://lightgbm.readthedocs.io/en/latest/Python-Intro.html https://github.com/catboost/catboost |
|
Eli5 | https://eli5.readthedocs.io/en/latest/ | |
深度学习 | TensorFlow | https://www.tensorflow.org/ |
PyTorch | https://pytorch.org/ | |
Keras | https://keras.io/ | |
分布式深度学习 | Dist-keras / elephas / spark-deep-learning | https://joerihermans.com/work/distributed-keras/ https://pypi.org/project/elephas/ https://databricks.github.io/spark-deep-learning/site/index.html |
自然语言处理 | NLTK | https://www.nltk.org/ |
SpaCy | https://spacy.io/ | |
Gensim | https://radimrehurek.com/gensim/ | |
数据采集 | Scrapy | https://scrapy.org/ |