Share your thoughts, 1 month free Claude Pro on usSee more

Knowledge-Driven Structured EHR Understanding and Reasoning on Synthea

58.7K-R1 AUC

Gemini 2.5

Updated 3mo ago

Evaluation Results

Method	Links
Gemini 2.5 2025.11		58.7	-	54.1	-
Qwen-32B 2025.11		58.3	-	51	-
GPT-3.5 Turbo 2025.11		58.1	-	55.4	52.9
Gemini-2.0 2025.11		57.7	52	56.2	51.6
GPT-4o 2025.11		55.6	55	53.2	51
Gemini 1.5 2025.11		55.6	-	-	-
DeepSeek-V3 2025.11		52.8	-	-	-
DeepSeek-V2.5 2025.11		-	51	-	-
Qwen-72B 2025.11		-	-	-	52.2