Automatically visualize any dataset, of any size, with a single line of code.
AutoViz performs automatic visualization of any dataset with one line of code. Give it any input file (CSV, TXT or JSON) and AutoViz will visualize it.
To install AutoViz, it is best to create a new environment and install the required dependencies there.
To install from PyPI:
    conda create -n <your_env_name> python=3.7 anaconda
    # with conda older than 4.4, use `source activate <your_env_name>` (Linux/macOS) or `activate <your_env_name>` (Windows)
    conda activate <your_env_name>
    pip install autoviz
To install from source:
    cd <AutoViz_Destination>
    git clone git@github.com:AutoViML/AutoViz.git
    # or download and unzip https://github.com/AutoViML/AutoViz/archive/master.zip
    conda create -n <your_env_name> python=3.7 anaconda
    # with conda older than 4.4, use `source activate <your_env_name>` (Linux/macOS) or `activate <your_env_name>` (Windows)
    conda activate <your_env_name>
    cd AutoViz
    pip install -r requirements.txt
Read this Medium article to know how to use AutoViz.
In the AutoViz directory, open a Jupyter Notebook and use these lines to instantiate the library:
    from autoviz.AutoViz_Class import AutoViz_Class
    AV = AutoViz_Class()
Load a dataset (any CSV or text file) into a Pandas dataframe, or give the path and filename of the file you want to visualize.
If you are passing a dataframe and have no filename, simply assign an empty string ("") to the filename argument.
Call AutoViz using the filename (or dataframe) along with the separator and the name of the target variable in the input. AutoViz will do the rest. You will see charts and plots on your screen.
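    filename = ""
    sep = ","
    # target (the name of the target variable) and df (a Pandas dataframe)
    # must be defined before this call
    dft = AV.AutoViz(
        filename,
        sep,
        target,
        df,
        header=0,
        verbose=0,
        lowess=False,
        chart_format="svg",
        max_rows_analyzed=150000,
        max_cols_analyzed=30,
    )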
AV.AutoViz is the main calling program in AV. It calls all the load, display and save programs that currently live outside AV.
It draws scatter and other plots for the input dataset, then calls the add_plots function with the matching variable name and passes in the chart created by that plotting program (for example, scatter).
Make sure the name passed to add_plots exactly matches the name of the variable defined in the class AV. If it does not, you will get an error.
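The naming rule for add_plots can be pictured with a small stand-in class. This is only a sketch of the idea, not AutoViz's own code: the class name `ChartRegistry`, the attribute names, and the choice of `AttributeError` are all assumptions made for illustration.

```python
class ChartRegistry:
    """Hypothetical stand-in for the AV class: one list per known plot type."""

    def __init__(self):
        # Assumed attribute names; the real class may define different ones.
        self.scatter = []
        self.pair_scatter = []
        self.heat_map = []

    def add_plots(self, plotname, chart):
        """Store `chart` in the list whose attribute name equals `plotname`."""
        if plotname not in vars(self):
            # A misspelled or unknown name fails loudly instead of being dropped.
            raise AttributeError(f"no chart slot named {plotname!r}")
        getattr(self, plotname).append(chart)


registry = ChartRegistry()
# The string below is a placeholder for a real chart object.
registry.add_plots("scatter", "<scatter chart>")
```

Calling `registry.add_plots("scater", ...)` with a misspelled name raises the error in this sketch, which mirrors the requirement that the name match the class variable exactly.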
A comma is assumed as the default separator in the file, but you can change it.
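If you are unsure which separator a file uses, one option is to guess it with Python's standard `csv.Sniffer` before passing it as `sep`. The helper name `guess_separator` and the candidate list below are assumptions for this sketch and are not part of AutoViz.

```python
import csv


def guess_separator(sample_text, candidates=",;\t|"):
    """Return the delimiter that csv.Sniffer infers from a text sample.

    Raises csv.Error if no consistent delimiter can be determined.
    """
    dialect = csv.Sniffer().sniff(sample_text, delimiters=candidates)
    return dialect.delimiter


# In practice the sample would be the first few lines read from the file.
sample = "id;height;weight\n1;170;65\n2;182;80\n"
sep = guess_separator(sample)
```

The resulting `sep` can then be passed to the AutoViz call in place of the default comma.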
max_rows_analyzed: limits the maximum number of rows used to display charts
max_cols_analyzed: limits the number of continuous variables that can be analyzed
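To make the effect of these two limits concrete, here is a minimal sketch of capping a table before plotting. It assumes a simple random sample of rows and keeps the first columns in order; how AutoViz actually selects rows and columns is not described here, so the function `limit_table` and its strategy are illustrative only.

```python
import random


def limit_table(rows, numeric_columns, max_rows_analyzed=150000,
                max_cols_analyzed=30, seed=0):
    """Return (rows, columns) capped to the given limits.

    rows: list of dicts, one per record.
    numeric_columns: names of the continuous variables.
    """
    # Keep at most max_cols_analyzed continuous variables (assumed: first ones).
    kept_columns = list(numeric_columns)[:max_cols_analyzed]
    # Sample rows only when the table exceeds the row limit (assumed: random sample).
    if len(rows) > max_rows_analyzed:
        rng = random.Random(seed)
        rows = rng.sample(rows, max_rows_analyzed)
    return rows, kept_columns


table = [{"a": i, "b": i * 2, "c": i * 3} for i in range(1000)]
sampled, columns = limit_table(table, ["a", "b", "c"],
                               max_rows_analyzed=100, max_cols_analyzed=2)
```

Here the 1000-row table is reduced to 100 rows and only the first two of the three numeric columns are kept for analysis.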
Apache License, Version 2.0
This project is not an official Google project. It is not supported by Google and Google specifically disclaims all warranties as to its quality, merchantability, or fitness for a particular purpose.