Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Say No to the Discrimination: Learning Fair Graph Neural Networks with Limited Sensitive Attribute Information

About

Graph neural networks (GNNs) have shown great power in modeling graph structured data. However, similar to other machine learning models, GNNs may make predictions biased on protected sensitive attributes, e.g., skin color and gender. Because machine learning algorithms including GNNs are trained to reflect the distribution of the training data which often contains historical bias towards sensitive attributes. In addition, the discrimination in GNNs can be magnified by graph structures and the message-passing mechanism. As a result, the applications of GNNs in sensitive domains such as crime rate prediction would be largely limited. Though extensive studies of fair classification have been conducted on i.i.d data, methods to address the problem of discrimination on non-i.i.d data are rather limited. Furthermore, the practical scenario of sparse annotations in sensitive attributes is rarely considered in existing works. Therefore, we study the novel and important problem of learning fair GNNs with limited sensitive attribute information. FairGNN is proposed to eliminate the bias of GNNs whilst maintaining high node classification accuracy by leveraging graph structures and limited sensitive information. Our theoretical analysis shows that FairGNN can ensure the fairness of GNNs under mild conditions given limited nodes with known sensitive attributes. Extensive experiments on real-world datasets also demonstrate the effectiveness of FairGNN in debiasing and keeping high accuracy.

Enyan Dai, Suhang Wang• 2020

Related benchmarks

TaskDatasetResultRank
Node ClassificationNBA
BAcc62.35
23
Node ClassificationBail
BACC78.31
23
Node ClassificationGerman (test)
F1 Score78.57
22
Node ClassificationPokec-n (test)
F1 Score63.2
16
Node ClassificationCredit Dataset--
14
Working environment predictionPokec-n (test)
Accuracy70.1
9
User categorizationPokec-z (test)
Accuracy70
9
Node Classificationgerman dataset
Balanced Accuracy59.71
9
Node ClassificationBail (test)
ACC84.65
8
Node ClassificationNBA
Training Time (s)5
8
Showing 10 of 17 rows

Other info

Follow for update