Insight Generation on Internal non-scientific document collections Finance - Investment 2
[Chart: Set-level Score (Gemini 2.5 Flash) and Set-level Score (Claude 4 Sonnet) over time. Highlighted point: INSIGHTGEN, 4.73 on Set-level Score (Gemini 2.5 Flash), Apr 21, 2026.]
Evaluation Results
| Method     | Base Model     | Date    | Set-level Score (Gemini 2.5 Flash) | Set-level Score (Claude 4 Sonnet) |
|------------|----------------|---------|------------------------------------|-----------------------------------|
| INSIGHTGEN | GPT-4o         | 2026.04 | 4.73                               | 3.5                               |
| INSIGHTGEN | Claude-3.5-... | 2026.04 | 4.41                               | 3.41                              |
| FAISS+CoT  | Claude-3.5-... | 2026.04 | 4.09                               | 2.68                              |
| GPT+CoT    | Claude-3.5-... | 2026.04 | 3.82                               | 2.73                              |
| FAISS+CoT  | GPT-4o         | 2026.04 | 3.46                               | 2.23                              |
| GPT+CoT    | GPT-4o         | 2026.04 | 3.41                               | 2.41                              |
| FAISS      | GPT-4o         | 2026.04 | 3.14                               | 2.14                              |
| Direct GPT | GPT-4o         | 2026.04 | 2.64                               | 2.05                              |
| Direct GPT | Claude-3.5-... | 2026.04 | 2.46                               | 1.68                              |
| FAISS      | Claude-3.5-... | 2026.04 | 1.82                               | 1.5                               |