geochemistrypi.data_mining.plot package

Submodules

map_projected_by_basemap(col: Series, name_column: str, longitude: DataFrame, latitude: DataFrame) → None

Project the data of an element onto a world map using Basemap.

Parameters:

map_projected_by_cartopy(col: Series, name_column: str, longitude: DataFrame, latitude: DataFrame) → None

Project the data of an element onto a world map using Cartopy.

Parameters:

process_world_map(data: DataFrame, name_column: str) → None

Project the data onto the world map.

basic_statistic(data: DataFrame) → None

Basic statistical information about the designated data set.
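
As a rough illustration of what "basic statistical information" can mean for a DataFrame, the sketch below uses pandas' `describe()`. Whether `basic_statistic` reports exactly these figures is an assumption; the sample data and column names are hypothetical.

```python
import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
data = pd.DataFrame(
    {
        "SiO2": [45.2, 50.1, 48.7, 52.3],
        "MgO": [8.1, 7.4, 9.0, 6.8],
    }
)

# Count, mean, standard deviation, min, quartiles and max per numeric column.
summary = data.describe()
print(summary)
```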

check_missing_value(data: DataFrame) → bool

Check whether the data set contains any null values.
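
A minimal sketch of such a check with plain pandas is shown below. The helper name `has_missing_value` is hypothetical and is not part of this package; the actual implementation of `check_missing_value` may differ.

```python
import numpy as np
import pandas as pd


def has_missing_value(data: pd.DataFrame) -> bool:
    """Return True if any cell of the data set is null (hypothetical helper)."""
    return bool(data.isnull().values.any())


# Hypothetical sample data: one complete frame, one with a gap.
complete = pd.DataFrame({"SiO2": [45.2, 50.1], "MgO": [8.1, 7.4]})
with_gap = pd.DataFrame({"SiO2": [45.2, np.nan], "MgO": [8.1, 7.4]})

print(has_missing_value(complete))
print(has_missing_value(with_gap))
```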

correlation_plot(col: Index, df: DataFrame, name_column: str) → None

A heatmap describing the correlation between the required columns.

Parameters:
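
The quantity such a heatmap displays is the pairwise correlation matrix of the selected columns. The sketch below computes it with pandas, assuming Pearson correlation (the pandas default); the method used by `correlation_plot` is not stated here, and the sample data is hypothetical.

```python
import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
df = pd.DataFrame(
    {
        "SiO2": [45.2, 50.1, 48.7, 52.3],
        "MgO": [8.1, 7.4, 9.0, 6.8],
        "CaO": [10.5, 9.8, 11.2, 9.1],
    }
)

# The columns required for the plot, passed as a pandas Index.
col = pd.Index(["SiO2", "MgO"])

# Pairwise Pearson correlation between the selected columns.
corr = df[col].corr()
print(corr)
```

The resulting square matrix could then be handed to a heatmap routine such as `seaborn.heatmap`, if that library is available.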

distribution_plot(col: Index, df: DataFrame, name_column: str) → None

Histograms showing the distribution of each required column, one subplot per column.

Parameters:

is_null_value(data: DataFrame) → None

Check whether the data set contains any null values.

log_distribution_plot(col: Index, df: DataFrame, name_column: str) → None

Histograms showing the distribution of each required column after log transformation, one subplot per column.

Parameters:
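
To make the transformation step concrete, the sketch below applies a base-10 logarithm to the selected columns before any histogram is drawn. The logarithm base and the handling of zero or negative values are not stated in this reference, so both are assumptions; the sample data is hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical trace-element concentrations spanning several orders of magnitude.
df = pd.DataFrame(
    {
        "Ni": [10.0, 100.0, 1000.0],
        "Cr": [1.0, 10.0, 100.0],
    }
)
col = pd.Index(["Ni", "Cr"])

# Assumption: base-10 logarithm, with strictly positive input values.
df_log = np.log10(df[col])
print(df_log)
```

With matplotlib installed, `df_log.hist()` would draw one histogram per column of the transformed data.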

probability_plot(col: Index, df_origin: DataFrame, df_impute: DataFrame, name_column: str) → None

A large figure containing the probability plots (original vs. imputed) of the required columns.

Parameters:

col (pd.Index) – The columns to plot.
df_origin (pd.DataFrame (n_samples, n_components)) – The original dataset with missing values.
df_impute (pd.DataFrame (n_samples, n_components)) – The dataset after imputation.
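
A probability plot pairs the sorted observations with the quantiles of a reference distribution. The sketch below computes those pairs for an original and an imputed column, assuming a standard normal reference, Hazen plotting positions, and mean imputation. None of these choices is confirmed by this reference, and the helper `normal_probability_points` is hypothetical.

```python
from statistics import NormalDist

import numpy as np
import pandas as pd


def normal_probability_points(values: pd.Series) -> pd.DataFrame:
    """Pair sorted observations with standard-normal quantiles (hypothetical helper)."""
    # Null values cannot be ranked, so they are dropped first.
    ordered = np.sort(values.dropna().to_numpy())
    n = len(ordered)
    # Assumption: Hazen plotting positions (i - 0.5) / n.
    positions = (np.arange(1, n + 1) - 0.5) / n
    theoretical = [NormalDist().inv_cdf(p) for p in positions]
    return pd.DataFrame({"theoretical": theoretical, "observed": ordered})


# Hypothetical original data with one missing value.
df_origin = pd.DataFrame({"SiO2": [45.2, np.nan, 48.7, 52.3, 50.1]})
# Assumption: mean imputation, used here only for illustration.
df_impute = df_origin.fillna(df_origin.mean())

col = pd.Index(["SiO2"])
for name in col:
    origin_points = normal_probability_points(df_origin[name])
    impute_points = normal_probability_points(df_impute[name])
    print(name, len(origin_points), len(impute_points))
```

Plotting `observed` against `theoretical` for both frames side by side shows how imputation changes the shape of each column's distribution.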

ratio_null_vs_filled(data: DataFrame) → None

The ratio of the null values in each column.
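
One way to obtain such a per-column ratio with plain pandas is sketched below. It takes the mean of the boolean null mask, which equals the fraction of null cells in each column. How `ratio_null_vs_filled` computes or presents the ratio is an assumption; the sample data is hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: SiO2 is half empty, MgO is complete.
data = pd.DataFrame(
    {
        "SiO2": [45.2, np.nan, 48.7, np.nan],
        "MgO": [8.1, 7.4, 9.0, 6.8],
    }
)

# The mean of a boolean mask is the fraction of True values, here the null ratio.
null_ratio = data.isnull().mean()
print(null_ratio)
```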