Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Evaluating LLM-based Personal Information Extraction and Countermeasures

About

Automatically extracting personal information -- such as name, phone number, and email address -- from publicly available profiles at a large scale is a stepstone to many other security attacks including spear phishing. Traditional methods -- such as regular expression, keyword search, and entity detection -- achieve limited success at such personal information extraction. In this work, we perform a systematic measurement study to benchmark large language model (LLM) based personal information extraction and countermeasures. Towards this goal, we present a framework for LLM-based extraction attacks; collect four datasets including a synthetic dataset generated by GPT-4 and three real-world datasets with manually labeled eight categories of personal information; introduce a novel mitigation strategy based on prompt injection; and systematically benchmark LLM-based attacks and countermeasures using ten LLMs and five datasets. Our key findings include: LLM can be misused by attackers to accurately extract various personal information from personal profiles; LLM outperforms traditional methods; and prompt injection can defend against strong LLM-based attacks, reducing the attack to less effective traditional ones.

Yupei Liu, Yuqi Jia, Jinyuan Jia, Neil Zhenqiang Gong• 2024

Related benchmarks

TaskDatasetResultRank
Email address extractionSynthetic dataset--
70
PII PredictionSynthPAI (test)
Accuracy (Age)71.9
19
Personal Information ExtractionSynthetic Personal Profile Dataset
Accuracy (Email)54
10
Personal Information ExtractionSynthetic--
10
Personal Information ExtractionCelebrity--
10
Personal Information ExtractionPhysician--
10
Personal Information ExtractionProfessor--
10
Personal Information ExtractionCourt--
10
Showing 8 of 8 rows

Other info

Follow for update