Cantonese 9 Tones Explained: How to Hear and Produce Them All
Why Nine Tones, Not Six?
You will hear both six tones and nine tones cited for Cantonese, and both are correct depending on the framework. The standard modern Cantonese phonology (used in Hong Kong) recognises six basic tones: high level (1), high rising (2), mid level (3), low falling (4), low rising (5), and low level (6). The nine-tone framework adds three 'entering tones' (入聲, jap6 sing1) — short, clipped syllables that end in a stop consonant (p, t, or k). These entering tones are heard as variants of tones 1, 3, and 6 but feel distinctly different in rhythm and length. For practical learning, start with six tones and treat the entering tones as a variation you will absorb naturally through listening.
The Six Core Tones: A Listening Map
Tone 1 (high level): the pitch stays high and flat, like a sustained musical note at the top of your range. Example: si1 (詩, poem). Tone 2 (high rising): starts high and rises further, like a surprised question in English. Example: si2 (史, history). Tone 3 (mid level): a comfortable mid-pitch, steady and flat — the most neutral-sounding tone. Example: si3 (試, to try). Tone 4 (low falling): starts at a low pitch and drops or stays low with a slight fall, often sounding heavy or somber. Example: si4 (時, time). Tone 5 (low rising): starts low and rises, similar to tone 2 but from a lower starting point — like an incredulous question. Example: si5 (市, city). Tone 6 (low level): very low, short, and flat. Often described as the softest and most relaxed tone. Example: si6 (事, matter, affair).
The Entering Tones: Short and Clipped
Entering tones are syllables that end in a stop consonant (-p, -t, or -k) and are pronounced short and clipped — no trailing vowel sound. There are three entering tones corresponding to the upper, mid, and lower pitch levels. High entering tone (Tone 7): short and high, like a sharp tap. Example: sik1 (識, to know). Mid entering tone (Tone 8): short and mid-pitched. Example: sik3 (色, colour). Low entering tone (Tone 9): short and low. Example: sik6 (食, to eat). The key to hearing entering tones correctly is training your ear to expect the abrupt stop at the end of the syllable. The stop itself is not voiced — it is the absence of sound that defines the tone.
Common Minimal Pairs: Training Your Ear
The fastest way to build tonal discrimination is through minimal pairs — pairs of words that differ only in tone. The word 'fu' illustrates five tones clearly: fu1 (夫, husband), fu2 (虎, tiger), fu3 (苦, bitter), fu4 (父, father), fu6 (負, to bear / negative). Similarly, 'ma': ma1 (媽, mother), ma3 (馬, horse), ma4 (罵, to scold), ma5 (吖, a particle). Practice by listening to each pair five times without looking at the romanisation, then checking. Your ear will start categorising automatically after 20–30 sessions of this kind of focused listening.
Production: Getting Your Voice to Do What You Hear
The gap between hearing tones correctly and producing them is significant. Here is a three-stage production drill. Stage 1 — Hum without words: produce each tone as a pure hum at the right pitch contour without attaching a syllable. This trains pitch memory. Stage 2 — Anchor to a word: take a single word for each tone and say only that word until the pitch contour feels automatic. Stage 3 — Contrast pairs: say two adjacent tones back to back (1-2, 2-3, 3-4, 4-5, 5-6) until the transition feels clean. Record yourself and compare with a native model. The most common production errors are merging tones 4, 5, and 6 (all low-pitch tones) and flattening the rising contour of tone 2.
A Realistic Practice Timeline
Most adult learners can reliably distinguish all six Cantonese tones in listening within 4–8 weeks of daily focused practice (15 minutes per day). Production accuracy — where a native speaker can correctly identify your intended tone — typically takes 3–6 months of regular practice with feedback. The entering tones, because they are rhythm-based rather than pitch-contour-based, often click faster than expected once you spend time with authentic Cantonese speech. The biggest mistake learners make is drilling tone charts without ever listening to real connected speech. Cantonese tones in natural conversation are affected by speech rate, emotion, and context. Chart practice builds a scaffold; immersion builds the intuition.
Finding the Right Teacher for Tone Feedback
Tone accuracy in Cantonese is one skill where self-study has clear limits. A teacher who is a native Cantonese speaker can hear the exact pitch error you are making and correct it in real time in a way that no app or chart can replicate. On Unox, you can filter for Cantonese-specific teachers and book short pronunciation-focused sessions to get feedback on your tonal output. Many learners find that a few concentrated teacher sessions on tones — interspersed with independent listening practice — accelerate their tonal accuracy far more than months of solo drilling.
You might also like
Chinese Tones Made Easy: A Practical Guide for Beginners
If Chinese tones feel intimidating, this guide breaks them down into simple listening and speaking h…
Read more →Mandarin Tones: The Complete Beginner's Guide
In Chinese, the same syllable with different tones means completely different words. Here is everyth…
Read more →Chinese Tones for Beginners: Master All 4 Tones
Tones are the first thing that scares new Chinese learners, but they follow clear rules. Here is eve…
Read more →Start practicing Chinese for free on Unox
Conversation practice, anytime. No credit card required.
Learn Chinese Free