Insight Generation on Internal Non-Scientific Document Collections: Hotel Sales Strategies
[Chart: Set-level Score (Gemini-2.5-Flash) over time. Best result: INSIGHTGEN, 4.53, Apr 21, 2026. Alternate metric: Set-level Score (Claude-4-Sonnet).]
Evaluation Results

Method      Base Model      Date     Set-level Score     Set-level Score
                                     (Gemini-2.5-Flash)  (Claude-4-Sonnet)
INSIGHTGEN  GPT-4o          2026.04  4.53                3.32
INSIGHTGEN  Claude-3.5-...  2026.04  4.53                3.21
FAISS+CoT   Claude-3.5-...  2026.04  4.32                2.79
FAISS+CoT   GPT-4o          2026.04  3.9                 2.68
FAISS       GPT-4o          2026.04  3.63                2.74
GPT+CoT     Claude-3.5-...  2026.04  3.63                2.42
Direct GPT  Claude-3.5-...  2026.04  3.21                2
GPT+CoT     GPT-4o          2026.04  3.11                2.53
Direct GPT  GPT-4o          2026.04  3                   2.26
FAISS       Claude-3.5-...  2026.04  2.37                2.05