Plotting of Example Data

Imports

First we need to point Python to the project folder. The path can be given as a relative path, as shown below, or as an absolute system path. Data is plotted with the cloud_plotter module, which is imported via the import cloud_plotter command.

[32]:

import sys
sys.path.append('../cloud_classifier')
import cloud_plotter
import importlib
importlib.reload(cloud_plotter)

[32]:

<module 'cloud_plotter' from '/home/squidy/tropos/CTyPyTool/notebooks/../cloud_classifier/cloud_plotter.py'>
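
If the notebook may be started from different working directories, an absolute path can be more robust than a relative one. The following is a minimal sketch of that idea; the helper name add_to_sys_path is hypothetical and not part of the project.

```python
import sys
from pathlib import Path


def add_to_sys_path(folder):
    """Resolve `folder` to an absolute path and put it on sys.path once."""
    resolved = str(Path(folder).resolve())
    if resolved not in sys.path:
        sys.path.append(resolved)
    return resolved


# Same folder as in the cell above, resolved against the current working directory.
module_dir = add_to_sys_path("../cloud_classifier")
print(module_dir)
```

Calling the helper a second time with the same folder leaves sys.path unchanged, so re-running the cell does not pile up duplicate entries.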

Initialization

Our first step is to create a plotter object and load a previously created cloud classifier project. Loading the project is necessary in order to import all project settings, such as the location of auxiliary files.

[33]:

cp = cloud_plotter.cloud_plotter()

path = "../classifiers/ForestClassifier"
cp.load_project(path)

Next we specify some label files we want to plot. In this example the data consists of one original label file and two files of predicted labels, one from the Decision Tree and the other from the Random Forest Classifier.

[41]:

orig_file = "../data/example_data/nwcsaf_msevi-medi-20190317_1800.nc"
tree_prediction = "../classifiers/TreeClassifier/labels/nwcsaf_msevi-medi-20190317_1800_predicted.nc"
forest_prediction = "../classifiers/ForestClassifier/labels/nwcsaf_msevi-medi-20190317_1800_predicted.nc"
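
Since all three paths are relative, it can help to check that they resolve to existing files before plotting. This is only a sketch: missing_files is a hypothetical helper, and the path used here is a placeholder rather than one of the example files.

```python
from pathlib import Path


def missing_files(paths):
    """Return the entries of `paths` that do not point to an existing file."""
    return [p for p in paths if not Path(p).is_file()]


# Placeholder path; in the notebook one would pass
# [orig_file, tree_prediction, forest_prediction] instead.
candidates = ["no_such_label_file.nc"]
print(missing_files(candidates))
```

An empty result means every path points to a file that exists relative to the current working directory.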

Plotting

Individual Plots

Using the plot_data method, we can plot each of these datasets individually.

[35]:

cp.plot_data(label_file = orig_file, colorbar = True)

../_images/source_Plotting_of_example_data_9_0.png

[36]:

cp.plot_data(label_file = tree_prediction, colorbar = True)

../_images/source_Plotting_of_example_data_10_0.png

[37]:

cp.plot_data(label_file = forest_prediction, colorbar = True)

../_images/source_Plotting_of_example_data_11_0.png

Combined Plots

Using the plot_multiple method, we can plot multiple datasets next to each other and evaluate the prediction performance with respect to the original labels.

[38]:

titles = ["Tree Classifier", "Forest Classifier"]
cp.plot_multiple(label_files = [tree_prediction, forest_prediction], truth_file = orig_file, plot_titles = titles)

../_images/source_Plotting_of_example_data_13_0.png
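
The visual comparison can be complemented by a simple number, for instance the fraction of points where predicted and original labels agree. The sketch below assumes the labels are available as equally shaped NumPy arrays; label_agreement is a hypothetical helper, and the arrays are made-up toy values, not data from the example files.

```python
import numpy as np


def label_agreement(predicted, truth):
    """Fraction of points where predicted and original labels coincide."""
    predicted = np.asarray(predicted)
    truth = np.asarray(truth)
    if predicted.shape != truth.shape:
        raise ValueError("label arrays must have the same shape")
    return float(np.mean(predicted == truth))


# Toy label grids (made-up values).
truth = np.array([[1, 1, 2], [3, 3, 2]])
tree = np.array([[1, 2, 2], [3, 1, 2]])
forest = np.array([[1, 1, 2], [3, 1, 2]])

print(label_agreement(tree, truth))
print(label_agreement(forest, truth))
```

In this toy case the second prediction matches the original labels at five of six points and the first at four of six.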

Probability Plots

The labels predicted with the Random Forest Classifier come with a probability score. That is, for each data point there is also a measure of how certain the classifier is about its choice of label. These certainties are also stored in the label files and can be plotted using the plot_probas method.

[39]:

titles = ["Certainty", "Forest Classifier"]

cp.plot_probas(label_file = forest_prediction, truth_file = orig_file, plot_titles = titles)

../_images/source_Plotting_of_example_data_15_0.png
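
One common way to derive such a certainty from a probabilistic classifier is to take the largest class probability for each data point. Whether the project computes its certainty in exactly this way is an assumption; the sketch below uses a made-up probability array to illustrate the idea.

```python
import numpy as np

# Made-up class probabilities: one row per data point, one column per class.
probas = np.array([
    [0.7, 0.2, 0.1],
    [0.4, 0.4, 0.2],
    [0.1, 0.1, 0.8],
])

# Index of the most probable class and the probability assigned to it.
predicted_class = probas.argmax(axis=1)
certainty = probas.max(axis=1)

print(predicted_class)
print(certainty)
```

Points where two classes are almost equally likely, like the second row, end up with a low certainty value.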

Correlation Matrix Plots

Given predicted labels and the original label file, we can also compute and plot a correlation (co-occurrence) matrix via the plot_coocurrence_matrix method.

[40]:

cp.plot_coocurrence_matrix(forest_prediction, orig_file)

../_images/source_Plotting_of_example_data_17_0.png
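
To illustrate what such a matrix contains, the sketch below counts how often each pair of original and predicted label occurs. It is not the project's implementation: cooccurrence_matrix is a hypothetical helper, the labels are made-up, and whether plot_coocurrence_matrix normalizes the counts is not covered here.

```python
import numpy as np


def cooccurrence_matrix(truth, predicted, labels):
    """Count how often each (original, predicted) label pair occurs."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = np.zeros((len(labels), len(labels)), dtype=int)
    for t, p in zip(np.ravel(truth), np.ravel(predicted)):
        matrix[index[int(t)], index[int(p)]] += 1
    return matrix


# Made-up labels; rows are original labels, columns are predicted labels.
truth = [1, 1, 2, 3, 3, 2]
predicted = [1, 2, 2, 3, 1, 2]
matrix = cooccurrence_matrix(truth, predicted, labels=[1, 2, 3])
print(matrix)
```

Entries on the diagonal count points where prediction and original label agree; off-diagonal entries show which labels are confused with each other.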