Tutorial 3

Predicting Recidivism

This tutorial is based on models trained on the COMPAS dataset ☍. Models trained on this dataset are intended to predict the potential risk of recidivism.

As in the first tutorial, we assume that a model is already trained for use in FixOut. The first steps regarding the necessary imports are the same as the first tutorial, so please check it out (Tutorial 1 – Credit Risk) before continue this one.

You can import the data directly from the FixOut package, as COMPAS is a toy dataset, then call importCompasData.

from demo_data import importCompasData

We initialize the class responsible for running FixOut (FixOutRunner ☍) and assign it a name (which will be the title displayed on the web interface). We also indicate the sensitive features. In this case, they are “sex” and “race”.

⚠

FixOut is a tool centered on sensitive features and their proxies. Not providing this information correctly beforehand will compromise the obtained results.

fxo = FixOutRunner("Income Assessment (Census data)") 

sensitive_features = ["sex","race"]

fxa = FixOutArtifact(model=model,
                    training_data=(X_train,y_train), 
                    testing_data=[(X_test,y_test,"Testing")],
                    features_name=features_name,
                    sensitive_features=sensitive_features,
                    dictionary=dic)

As in the first tutorial, from now on, you rely on different functions depending on the development environment you use.

Using a Jupyter Notebook

Begin with the runJ function to launch FixOut.

fxo.runJ(fxa, show=False)

If you need to check if there are non-sensitive feature that are highly correlated with sensitive features, make use of the method correlation.

fxo.correlation()

⚠ This website is under construction

Tutorial 3

Predicting Recidivism

Using a Jupyter Notebook

References