Ga naar hoofdinhoud

Pagina laden...

Expertise AI Tools Kennisbank Blog Nieuwsbrief Over ons

Plan een consult

Home
/Blog
/Data Platforms
/Zero-shot learning abilities of language models.

← Terug naar blog2021-09-15

Zero-shot learning abilities of language models.

Data PlatformsPremium

This paper explores a simple method for improving the zero-shot learning abilities of language models. We show that instruction tuning, finetuning language models on a collection of tasks described via instructions, substantially boosts zero-shot performance on unseen tasks.

We take a 137B parameter pretrained language model and instruction-tune it on over 60 NLP tasks verbalized via natural language instruction templates. We evaluate this instruction- tuned model, which we call FLAN, on unseen task types. FLAN substantially improves the performance of its unmodified counterpart and surpasses zero-shot 175B GPT-3 on 19 of 25 tasks that we evaluate. FLAN even outperforms few-shot GPT-3 by a large margin on ANLI, RTE, BoolQ, AI2-ARC, OpenbookQA, and StoryCloze. Ablation studies reveal that number of tasks and model scale are key components to the success of instruction tuning.

arxiv.org/pdf/2109.01652.pdf

Premium content

Zero-shot learning abilities of language models.

Dit artikel is exclusief beschikbaar voor nieuwsbrief-abonnees. Schrijf je in voor toegang tot 880+ artikelen.

Geen spam. Uitschrijven op elk moment.

AI & Security Intelligence

Wekelijkse nieuwsbrief met AI updates, security alerts en compliance inzichten, direct in uw inbox.

Security & AI Operating Model

Advisory met executiekracht

Van BIO2 en NIS2 tot EU AI Act, embedded in uw operating model, niet als extern project. Maandelijks opzegbaar, met assessments als bewijsvoering.

Bekijk advisory niveaus →Plan een intake

Gerelateerde artikelen

De AI-levenscyclus een benadering voor gefaseerde innovatie.

2024-11-28 Data Platforms

Who is the data owner?

2024-11-17 Data Platforms

Unlocking the Power of Language From Roman Jakobson to Large Language Models (LLMs)

Van Data naar Doen, AI, security en compliance consultancy voor de Nederlandse markt. Djimit is de publieke merknaam van DjimIT B.V., met Dennis Landman als oprichter en adviseur.

Content

Blog
Nieuwsbrief
Kennisbank

Expertise

AI Governance
EU AI Act
NIS2 Compliance
BIO2 + NORA + AI Act
AI Security
Cloud Soevereiniteit
AI Agents & MCP

Proof

Publieke sector proof
AI security proof
NIS2 roadmap proof
AI evidence sample
NIS2 board sample

Contact

Kennismaking
Contact
LinkedIn
GitHub

© 2026 DjimIT B.V. Alle rechten voorbehouden.

Privacy Voorwaarden

KvK-nummer op aanvraag·BTW-nummer op aanvraag·Kantooradres op aanvraag