Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
galileo-ai
/
agent-leaderboard
like
259
Running
on
CPU Upgrade
App
Files
Files
Community
2
Fetching metadata from the HF Docker repository...
137096d
agent-leaderboard
/
output
/
gpt-4.5-preview-2025-02-27
5 contributors
History:
1 commit
pratikbhavsar
added gpt 4.5 and flash lite
ef63014
about 1 month ago
BFCL_v3_irrelevance.parquet
Safe
49 kB
added gpt 4.5 and flash lite
about 1 month ago
BFCL_v3_multi_turn_base_multi_func_call.parquet
Safe
23.8 kB
added gpt 4.5 and flash lite
about 1 month ago
BFCL_v3_multi_turn_base_single_func_call.parquet
Safe
22.7 kB
added gpt 4.5 and flash lite
about 1 month ago
BFCL_v3_multi_turn_composite.parquet
Safe
44.3 kB
added gpt 4.5 and flash lite
about 1 month ago
BFCL_v3_multi_turn_long_context.parquet
Safe
37.9 kB
added gpt 4.5 and flash lite
about 1 month ago
BFCL_v3_multi_turn_miss_func.parquet
Safe
39.5 kB
added gpt 4.5 and flash lite
about 1 month ago
BFCL_v3_multi_turn_miss_param.parquet
Safe
41.7 kB
added gpt 4.5 and flash lite
about 1 month ago
tau_long_context.parquet
Safe
43.5 kB
added gpt 4.5 and flash lite
about 1 month ago
toolace_single_func_call_1.parquet
Safe
16.1 kB
added gpt 4.5 and flash lite
about 1 month ago
toolace_single_func_call_2.parquet
Safe
11.7 kB
added gpt 4.5 and flash lite
about 1 month ago
xlam_multiple_tool_multiple_call.parquet
Safe
101 kB
added gpt 4.5 and flash lite
about 1 month ago
xlam_multiple_tool_single_call.parquet
Safe
41.4 kB
added gpt 4.5 and flash lite
about 1 month ago
xlam_single_tool_multiple_call.parquet
Safe
30.4 kB
added gpt 4.5 and flash lite
about 1 month ago
xlam_single_tool_single_call.parquet
Safe
43.9 kB
added gpt 4.5 and flash lite
about 1 month ago
xlam_tool_miss.parquet
Safe
54 kB
added gpt 4.5 and flash lite
about 1 month ago