Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess

About

The advent of machine learning models that surpass human decision-making ability in complex domains has initiated a movement towards building AI systems that interact with humans. Many building blocks are essential for this activity, with a central one being the algorithmic characterization of human behavior. While much of the existing work focuses on aggregate human behavior, an important long-range goal is to develop behavioral models that specialize to individual people and can differentiate among them. To formalize this process, we study the problem of behavioral stylometry, in which the task is to identify a decision-maker from their decisions alone. We present a transformer-based approach to behavioral stylometry in the context of chess, where one attempts to identify the player who played a set of games. Our method operates in a few-shot classification framework, and can correctly identify a player from among thousands of candidate players with 98% accuracy given only 100 labeled games. Even when trained on amateur play, our method generalises to out-of-distribution samples of Grandmaster players, despite the dramatic differences between amateur and world-class players. Finally, we consider more broadly what our resulting embeddings reveal about human style in chess, as well as the potential ethical implications of powerful methods for identifying individuals from behavioral data.

Reid McIlroy-Young, Russell Wang, Siddhartha Sen, Jon Kleinberg, Ashton Anderson• 2022

Related benchmarks

TaskDatasetResultRank
Behavioral StylometryLichess 10k-40k+ (all)
Accuracy98.2
20
Behavioral StylometryLichess High-ranked subsets
Accuracy0.569
16
Behavioral StylometryLichess High-ranked ∪ 10K-40K+
Accuracy80
12
Behavioral StylometryLichess 20k-40k+ players
Accuracy100
8
Behavioral StylometryLichess 30k-40k+ players
Accuracy100
8
Behavioral StylometryLichess 40k+ players
Accuracy100
8
Behavioral StylometryLichess 10k-40k+ (unseen)
Accuracy0.244
8
Behavioral StylometryLichess 10k+ games (unseen)
P@197.9
6
Behavioral StylometryLichess 10k+ games (seen)
P@198.2
6
Behavioral StylometryLichess 10k+ games (all)
P@198.2
6
Showing 10 of 28 rows

Other info

Code

Follow for update