Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Reset Other
benchmark
art
Synthetic
code
medical
biology
finance
legal
chemistry
agent
music
climate
Apply filters
Datasets
1,691
Full-text search
Edit filters
Sort: Trending
Active filters:
benchmark
Clear all
Trelis/tricky-tts-public
Viewer
•
Updated
2 days ago
•
4
•
22
•
2
michaljunczyk/pl-asr-bigos
Updated
Jan 8, 2024
•
29
•
4
nguha/legalbench
Viewer
•
Updated
4 days ago
•
91.8k
•
536k
•
169
jamendolyrics/jam-alt
Viewer
•
Updated
Jul 1, 2025
•
158
•
1.38k
•
15
LLM-Tuning-Safety/HEx-PHI
Preview
•
Updated
Aug 19, 2024
•
858
•
61
ikala/tmmluplus
Viewer
•
Updated
Sep 4, 2025
•
22.7k
•
1.98k
•
130
EpicPinkPenguin/procgen
Viewer
•
Updated
25 days ago
•
160M
•
326k
•
6
Salesforce/GiftEval
Preview
•
Updated
Jan 21, 2025
•
7.58k
•
21
yesilhealth/Health_Benchmarks
Viewer
•
Updated
Apr 20, 2025
•
7.54k
•
1.29k
•
9
jazasyed/musdb-alt
Viewer
•
Updated
Jun 27, 2025
•
39
•
94
•
4
Salesforce/APIGen-MT-5k
Viewer
•
Updated
Oct 10, 2025
•
5k
•
622
•
97
qcri-ai/HCTQA
Viewer
•
Updated
Jan 22
•
77.6k
•
85
•
2
MatSciBench/MatSciBench
Viewer
•
Updated
Oct 14, 2025
•
1.34k
•
119
•
2
Kunbyte/ROSE-Dataset
Viewer
•
Updated
Oct 11, 2025
•
50.3k
•
15.4k
•
8
ai-hyz/MemoryAgentBench
Viewer
•
Updated
Oct 7, 2025
•
146
•
22.9k
•
31
zai-org/CC-Bench-trajectories
Viewer
•
Updated
Sep 30, 2025
•
260
•
759
•
93
AI-companionship/INTIMA
Viewer
•
Updated
Aug 29, 2025
•
380
•
90
•
28
FudanCVL/MOSEv2
Updated
Aug 15, 2025
•
621
•
7
Tevatron/browsecomp-plus
Viewer
•
Updated
Dec 20, 2025
•
830
•
5.19k
•
31
meta-agents-research-environments/gaia2
Viewer
•
Updated
Sep 25, 2025
•
963
•
28.7k
•
39
allenai/molmospaces
Viewer
•
Updated
3 days ago
•
772k
•
4.17k
•
41
AIDC-AI/HSCodeComp
Viewer
•
Updated
Oct 27, 2025
•
632
•
376
•
10
chaofanma/Fantastic-Beasts
Viewer
•
Updated
Oct 25, 2025
•
251
•
66
•
3
stepfun-ai/AndroidDaily
Viewer
•
Updated
Dec 19, 2025
•
235
•
152
•
15
google/deepsearchqa
Viewer
•
Updated
Dec 17, 2025
•
900
•
7.73k
•
112
MiniMaxAI/VIBE
Viewer
•
Updated
Dec 23, 2025
•
200
•
320
•
274
hi-paris/FakeParts
Viewer
•
Updated
Jan 9
•
82.2k
•
1.4k
•
2
OpenMOSS-Team/ABC-Bench
Viewer
•
Updated
Jan 20
•
224
•
166
•
4
TalentZHOU/hle_material_science
Viewer
•
Updated
Jan 25
•
106
•
1.03k
•
1
tencent/CL-bench
Viewer
•
Updated
Feb 6
•
1.9k
•
1.21k
•
140
Previous
1
2
3
4
...
57
Next