PyCaret is end-to-end open source machine learning library for python programming language. Its primary objective is to reduce the cycle time of hypothesis to insights by providing an easy to use high level unified API. PyCaret's vision is to become defacto standard for teaching machine learning and data science. Our strength is in our easy to use unified interface for both supervised and unsupervised learning. It saves time and effort that citizen data scientists, students and researchers spent on coding or learning to code using different interfaces, so that now they can focus on business problem.
## Current Release
The current release is beta 0.0.34 (as of 05/02/2020). A full release is targetted in the first week of February 2020.
The current release is beta 0.0.35 (as of 07/02/2020). A full release is targetted in the first week of February 2020.
## Features Currently Available
As per beta 0.0.34 following modules are generally available:
As per beta 0.0.35 following modules are generally available:
* pycaret.datasets <br/>
* pycaret.classification (binary and multiclass) <br/>
* pycaret.regression <br/>
...
...
@@ -31,7 +31,7 @@ pip install pycaret
```
## Quick Start
As of beta 0.0.34 classification, regression, nlp, arules, anomaly and clustering modules are available.
As of beta 0.0.35 classification, regression, nlp, arules, anomaly and clustering modules are available.
data_t1=data_t1[['comp1','comp2']].agg(['mean','median','min','max','std'])#this gives us a df with only numeric columns (min , max ) and level as index
# some time if a level has only one record its std will come up as NaN, so convert NaN to 1
data_t1.fillna(1,inplace=True)
# now number of clusters cant be more than the number of samples in aggregated data , so