Dialogue Agent Evaluation on DEMO

7.238 Element Awareness - Goal

GPT-4o

Dec 6, 2024

Evaluation Results

Method  Links
2024.12  7.238  4.272  6.646  7.86   6.504  8.183  9.146  8.565  8.832  8.682  7.23
2024.12  7.09   4.46   6.772  6.534  6.213  7.734  9.141  8.404  8.815  8.523  6.979
2024.12  6.835  4.359  6.188  7.188  6.142  8.575  9.228  8.566  8.557  8.732  7.005
2024.12  6.774  3.907  6.349  7.394  6.106  7.051  7.828  7.077  6.79   7.187  6.466
2024.12  6.655  4.025  5.582  6.925  5.797  7.161  8.557  7.304  7.376  7.6    6.398
2024.12  6.624  4.105  6.869  8.028  6.406  7.08   7.967  7.207  6.926  7.295  6.703
2024.12  6.564  3.741  5.78   6.882  5.741  6.148  6.619  5.809  5.571  6.037  5.84
2024.12  6.313  4.145  6.885  7.964  6.326  7.624  8.948  8.184  8.343  8.275  6.976
2024.12  6.086  3.638  5.896  7.047  5.667  7.455  9.107  8.355  8.357  8.319  6.551
2024.12  5.39   3.991  6.066  6.329  5.443  7.439  9.209  8.542  8.25   8.36   6.417