How Many Spanish Words Do You Need? Vocabulary Targets for A1, A2 and B1

"How many words do I need to know?" is one of the first questions Spanish learners ask — and one of the least helpfully answered. You'll find estimates ranging from 250 to 10,000 depending on who you ask and what they mean by "know". Some of those estimates are based on research; most are not.

This post gives you the research-backed answer, explains why the number you need depends entirely on what you want to do with Spanish, and tells you something more useful than a target number: how vocabulary size connects to what you can actually read and understand.

The number that actually matters: coverage

Vocabulary researchers don't think primarily in terms of "how many words do you know." They think in terms of coverage — what percentage of the words in a given text you recognise.

Why does coverage matter? Because comprehension isn't binary. Reading a text where you know 60% of the words is a very different experience from reading one where you know 95% of the words. At 60%, you're guessing at meaning, losing the thread, and working so hard to decode individual words that comprehension of the overall meaning suffers. At 95%, you can read comfortably, infer the meaning of unfamiliar words from context, and actually acquire language rather than just decode it.

Paul Nation's vocabulary research established two key coverage thresholds that are now widely cited in applied linguistics:

  • 95% coverage — the point at which reading with a dictionary becomes feasible and reasonably efficient, first proposed by Laufer (1989)
  • 98% coverage — the point at which genuinely unassisted reading comprehension becomes possible, established by Hu and Nation (2000)

Below 95%, the gaps are too frequent for comfortable reading. Above 98%, the occasional unknown word can be inferred from context without breaking comprehension. Those two thresholds frame everything else in this post.

The vocabulary curve

Here is where the research on frequency distributions becomes important. Not all Spanish words are equally useful. The most frequent words in any natural language appear far more often than less frequent ones — a pattern described mathematically as Zipf's Law, after linguist George Kingsley Zipf.

The practical consequence is dramatic: a small number of very common words accounts for a disproportionately large share of any text you'll encounter. This means the first 500 words you learn give you far more coverage per word than the next 500, which give you more than the 500 after that. The gains are front-loaded — and they diminish sharply as vocabulary grows.

~50% Text coverage from the 100 most frequent Spanish words — articles, prepositions, common verbs
~72% Coverage from the most frequent 1,000 word families — roughly A2 vocabulary
~87% Coverage from the most frequent 2,000 word families — roughly B1 vocabulary

These figures — drawn from Nation's corpus research and applicable across natural languages following Zipfian distributions, including Spanish — illustrate the curve. Going from zero to 1,000 words gains you roughly 72 percentage points of coverage. Going from 1,000 to 2,000 words gains you only another 15. The law of diminishing returns applies throughout vocabulary learning, which is why intermediate progress feels slower than beginner progress even when you're working just as hard.

Vocabulary size and what you can actually read

The table below maps vocabulary size to approximate text coverage and the reading experience that corresponds to it. The vocabulary figures are widely used approximations for CEFR levels in Spanish; the coverage percentages reflect the Zipfian distribution established in corpus linguistics research.

Vocabulary size CEFR level Approx. text coverage Reading experience
~500 words A1 ~55% Nearly 1 in 2 words unfamiliar; comprehension requires heavy context support
~1,000 words A2 ~70% 1 in 3 words unfamiliar; adapted texts manageable with effort
~2,000–2,500 words B1 ~80–85% 1 in 5–6 words unfamiliar; adapted texts comfortable, native content challenging
~4,000–5,000 words B2 ~92–95% Approaching the comfortable reading threshold; most native content accessible with occasional dictionary use
~8,000+ words C1–C2 ~98%+ Unassisted reading of native content; rare dictionary use needed

Two things stand out from this table. First, the jump from B1 (80–85% coverage) to B2 (92–95%) is the single hardest stretch — it requires roughly doubling your vocabulary while each new word provides progressively less coverage. This is why learners who sail through A1 and A2 often hit a wall at B1. The intermediate plateau is, in large part, a vocabulary coverage problem.

Second, even B1 leaves you below the 95% comfortable reading threshold. At B1, roughly 1 in 5 or 6 words in a native text is unfamiliar. That's too many gaps for comfortable unassisted reading — which is why adapted content at the right level is so valuable at this stage. See our guide to what CEFR levels actually mean for the full picture.

Word families, not individual words

One important nuance: the vocabulary research above uses word families, not individual words. A word family groups together a headword and its inflected and derived forms: hablar, habla, hablé, hablando, hablado, hablante would all count as one family.

This distinction matters because learners often overcount their vocabulary. If you've learned rápido and rápidamente separately, that's one word family, not two. Conversely, if you learn a verb well — its conjugation patterns, its common collocations — you get the whole family for roughly the same effort as learning the stem.

The practical implication: focus on high-frequency word families rather than memorising isolated word forms. Learning the verb tener properly, including its irregular conjugations and common expressions (tener que, tener ganas de, tener razón), is a far better investment than learning ten obscure adjectives you'll rarely encounter.

Receptive versus productive vocabulary

A second distinction that matters: receptive vocabulary (words you recognise when reading or listening) is always larger than productive vocabulary (words you can actively use in speech or writing). Research suggests productive vocabulary is typically 50–70% of receptive vocabulary for language learners.

The vocabulary targets above refer to receptive vocabulary — the relevant metric for reading, which is what coverage measures. If your goal is to speak Spanish confidently, your productive vocabulary needs to be substantially above the coverage level required for comfortable reading. A B1 speaker with 2,000 receptive words might only have 1,000–1,400 words they can actively deploy.

For reading-based language learning — which the research strongly supports as the most efficient route to vocabulary growth — focusing on receptive vocabulary first is the right strategy. Productive vocabulary follows naturally as you encounter and reuse words across many contexts. It cannot be efficiently force-fed through drilling.

Why frequency lists are necessary but not sufficient

Knowing the top 2,000 most frequent Spanish words will get you to ~87% text coverage — but that remaining 13% is not random. It tends to cluster around specific topics. An article about Spanish politics will use political vocabulary. An article about sport will use sport vocabulary. That topic-specific vocabulary sits in the long tail of frequency, below the first 2,000 most common words.

This is why vocabulary building has two components that work differently:

  1. High-frequency core vocabulary — the first 2,000–3,000 most common Spanish words. These are worth deliberate study because they appear everywhere. Spaced repetition (Anki or similar) works well for this layer because the investment pays off in every text you'll ever read.
  2. Topic vocabulary — words specific to the subjects you read about. These are best acquired through reading, not drilling. You'll encounter them repeatedly in context if you read about your interest areas, which is far more efficient than trying to memorise topic lists in advance.

The mistake most learners make is treating all vocabulary as the same problem. It isn't. High-frequency core vocabulary responds well to deliberate study. Topic vocabulary grows fastest through extensive reading in subjects you care about.

What vocabulary coverage looks like at each level
A2 (~1,000 words, ~70% coverage)
El gobierno ha anunciado nuevas medidas para reducir la inflación. Los precios de los alimentos han subido mucho este año. Muchas familias tienen dificultades para llegar a fin de mes.
B1 (~2,000 words, ~82% coverage)
El gobierno ha anunciado un paquete de medidas económicas para hacer frente a la inflación persistente. El aumento de los precios de los alimentos y la energía está afectando especialmente a las familias con menos recursos, según el último informe del Banco de España.

In the examples above, underlined words represent vocabulary that falls outside the most common 1,000 words (A2 panel) or common 2,000 words (B1 panel). Notice that even at B1, only a small number of words per sentence are unfamiliar — but those words often carry significant meaning. That's the 13% gap that adapted content helps bridge.

How to build vocabulary efficiently at each level

At A1 (0–500 words)

Focus entirely on the highest-frequency words. A structured app (Duolingo, Babbel) or a top-500 frequency list will serve you well here. Prioritise: common verbs (ser, estar, tener, hacer, ir, poder, querer, saber), articles, pronouns, question words, numbers, days, months, and the most common nouns. Don't study obscure vocabulary — every minute spent on a low-frequency word is a minute not spent on high-frequency words that appear on every page.

At A2 (500–1,500 words)

Continue drilling the high-frequency core through apps or spaced repetition, but start introducing reading. Adapted texts at A1 and A2 level expose you to known words in new contexts (which reinforces them) and introduce new words in context (which is how they're best acquired). From this point, reading is more efficient than drilling for vocabulary growth. Begin filtering your reading by topic — what subjects do you want to be able to discuss in Spanish?

At B1 (1,500–2,500 words)

Reading should be your primary vocabulary-building method by now. Extensive reading at B1 level — the kind where you understand most of the text and encounter new words in meaningful context — is the most efficient route to the vocabulary growth that gets you past the intermediate plateau. See our post on extensive reading in language learning for the research behind this. Topic vocabulary will build naturally through reading; continue with spaced repetition only for core vocabulary you're not encountering naturally in your reading.

Reading at your level is the most efficient vocabulary builder from A2 onwards. Lectura adapts real Spanish news articles to A1, A2 and B1 — so you always have text at the right coverage level for acquisition. A1 Spanish · A2 Spanish · B1 Spanish

Frequently asked questions

How many Spanish words do I need to be fluent?

It depends on what you mean by fluent. For comfortable everyday conversation and reading of most native content without a dictionary — roughly B2 level — you need around 4,000–5,000 word families receptively. This gets you to ~95% text coverage, the comfortable reading threshold. For near-native comprehension across virtually all contexts (C1), the figure is closer to 8,000–10,000 words. "Fluent" is often used to mean B2; the vocabulary target for that is around 4,000–5,000 word families.

Is it better to learn vocabulary from lists or through reading?

Both, but at different stages and for different vocabulary types. The first 1,000–2,000 most frequent word families are worth deliberate study through frequency lists and spaced repetition — the investment pays off in every text you read. Beyond that, reading in topics you care about is more efficient than drilling lists. Topic-specific vocabulary (words you'll use in particular subject areas) is best acquired in context rather than memorised cold, because context both aids initial learning and consolidates retention.

How many new words do you learn per hour of reading?

Research estimates vary, but a commonly cited figure for extensive reading is around 8–12 new words per hour of reading at an appropriate level (roughly 70–80% comprehension). This sounds modest, but at one hour of reading per day that's 2,900–4,380 new words per year — enough to move through a full CEFR level annually if the reading is consistent and well-targeted. The key variable is reading at the right level: too easy and you encounter few new words; too hard and you encounter too many to absorb.

Why does progress feel slower as I advance in Spanish?

Because it is slower, in terms of coverage gain per word learned. At A1, each new word you learn covers vocabulary that appears frequently — so your coverage jumps noticeably. At B1, each new word tends to be less frequent — so the same learning effort produces a smaller coverage gain. This is the mathematical consequence of Zipf's Law and it affects every learner at every level. It's not a sign that you're doing something wrong; it's a structural feature of vocabulary distribution. The intermediate plateau is partly this effect — real progress continuing at a slower rate, misread as stagnation.

About Lectura

Lectura is a reading-based language learning platform for Spanish and French. We adapt real news articles to your CEFR level — A1, A2 or B1 — ensuring you're always reading at the right vocabulary coverage level for acquisition: enough familiar words to understand the context, enough new words to keep building. Choose your topics, choose your level, and switch up as your vocabulary grows.

Browse A1, A2 and B1 Spanish articles, or explore the full Spanish library. Start with a 7-day free trial.

Try the reading workflow.

Paste any article and read it at A1, A2, or B1 in minutes.

Start free