All Task Types
129 task families benchmarked across 26 models and 7 difficulty tiers.
Classification
11 tasks
Binary Sentiment Analysis text->class
Content Moderation text->class
Document Type Classification text->class
Emotion Detection text->class
Fine-Grained Sentiment Analysis text->class
Intent Detection text->class
Language Detection text->class
Phishing Detection text->class
Spam Detection text->class
Support Ticket Classification text->class
Topic Categorization text->class
Code Generation
14 tasks
Bash Script Generation text->code
Bug Fixing text->code
CSS Generation text->code
Code Refactoring text->code
HTML Generation text->code
JavaScript Function Generation text->code
Python Function Generation text->code
Regex Generation text->code
SQL Aggregation Queries text->sql
SQL JOIN Queries text->sql
SQL SELECT Queries text->sql
SQL Stored Procedures text->code
TypeScript Function Generation text->code
Unit Test Generation text->code
Data Transformation
9 tasks
Document Extraction
42 tasks
1099-DIV Extraction text->json
1099-INT Extraction text->json
1099-NEC Extraction text->json
Bank Statement Extraction text->json
Calendar Event Extraction text->json
Citation Parsing text->json
Clinical Note Extraction text->json
Contract Clause Extraction text->json
Court Filing Extraction text->json
Credit Card Statement Extraction text->json
Customs Declaration Extraction text->json
Database Record Extraction text->json
EOB (Explanation of Benefits) Extraction text->json
Event Extraction text->json
Expense Report Extraction text->json
Form Filling text->json
Insurance Claim Extraction text->json
Insurance Policy Extraction text->json
Invoice Extraction text->json
Job Posting Extraction text->json
K-1 Extraction text->json
Key-Value Block Parsing text->json
Lab Result Extraction text->json
Lease Extraction text->json
Log Line Parsing text->json
Medical Bill Extraction text->json
Mortgage Application Extraction text->json
NDA Extraction text->json
NER: Financial Entities text->json
NER: Person, Org, Location text->json
Natural Language to JSON text->json
Patent Extraction text->json
Pay Stub Extraction text->json
Prescription Extraction text->json
Purchase Order Extraction text->json
Receipt Extraction text->json
Resume Extraction text->json
Search Query Construction text->json
State Tax Form Extraction text->json
Ticket Creation text->json
W-2 Extraction text->json
W-9 Extraction text->json
Image & Video
16 tasks
Business Card Image Extraction image->json
Chart Image Extraction image->json
Form Checkbox Image Extraction image->json
Handwritten Note Extraction image->json
Invoice Image Extraction image->json
Receipt Image Extraction image->json
Table Image Extraction image->json
Video: Dashboard Reading video->json
Video: Data Entry video->json
Video: Dropdown & Modal video->json
Video: Error Handling video->json
Video: Form Fill video->json
Video: Multi-Step Tasks video->json
Video: Search & Filter video->json
Video: Simple Navigation video->json
W-2 Image Extraction image->json
Math & Reasoning
9 tasks
Text Transformation
8 tasks
Translation & Summarization
20 tasks
Article Summarization text->text
Definition Extraction text->text
Document Translation (EN→ES) text->text
Email Summarization text->text
English to Chinese Translation text->text
English to French Translation text->text
English to German Translation text->text
English to Japanese Translation text->text
English to Portuguese Translation text->text
English to Spanish Translation text->text
FAQ Matching text->text
Formalization text->text
Legal Summarization text->text
Length Reduction text->text
Meeting Action Items text->text
Number Preservation text->text
Reading Comprehension text->text
Table Question Answering text->text
Technical Translation (EN→ES) text->text
Text Simplification text->text