Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Probabilistic Program Generation on Dugongs PosteriorDB
Loading...
23.4
ELPD LOO
BoxLM
7.748
11.8115
15.875
19.9385
Sep 1, 2025
ELPD LOO
Updated 1mo ago
Evaluation Results
Method
Method
Links
ELPD LOO
BoxLM
Base Model=GPT-4
2025.09
23.4
OpenAI-o3
Base Model=OpenAI-o3
2025.09
22.83
Expert
Base Model=Expert stan...
2025.09
22.43
REFINESTAT
Base Model=DQ-7B
2025.09
8.35
Feedback
Search any
task
Search any
task