Subido por geosolucionesgg

eda

Anuncio
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
1 of 13
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
2 of 13
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
3 of 13
pip install pandas_ui
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
4 of 13
jupyter nbextension enable — py qgrid — sys-prefix
jupyter nbextension enable — py widgetsnbextension — sys-prefix
from pandas_ui import *
pdf =pandas_ui(“../input/chocolate-bar-2020/chocolate.csv”)
pdf.to_file(output_file=”pandas_ui1.html”)
get_df() # to get the data frame
#get_meltdf() or get_pivotdf() # to get melt or pivot dataframes if
you have created any.
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
5 of 13
pip install pandas-profiling
or
conda install -c anaconda pandas-profiling
from pandas_profiling import ProfileReport
df = pd.read_csv(‘../input/chocolate-bar-2020/chocolate.csv’)
pr = ProfileReport(df)
pr.to_file(output_file=”pandas_profiling1.html”)
pr
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
6 of 13
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
7 of 13
甜心之枪
df = pd.read_csv(‘../input/chocolate-bar-2020/chocolate.csv’)
from sklearn.model_selection import train_test_split
train, test = train_test_split(df, test_size=0.3)
!pip install sweetviz
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
8 of 13
import sweetviz as sv
sweetviz_report = sv.analyze([df,”data”],target_feat=’rating’)
sweetviz_report.show_html(‘viz.html’)
df1 = sv.compare(train, test)
df1.show_html(‘Compare.html’)
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
9 of 13
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
10 of 13
!pip install autoviz
from autoviz.AutoViz_Class import AutoViz_Class
AV = AutoViz_Class()
dft = AV.AutoViz(filename = “”, sep= ‘,’ , depVar=’rating’, dfte= df,
header=0, verbose=2, lowess=False, chart_format=”svg”,
max_rows_analyzed=2500, max_cols_analyzed= 21)
dft.to_file(output_file=”autoviz_profiling.html”)
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
11 of 13
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
12 of 13
Better, Faster, Stronger Python Exploratory Data Analysis (EDA) | by Philippe Bouaziz, PhD | Jul, 2020 | Towards Data Science
13 of 13
Descargar