###Cantonese TOCFL(繁體) Vocabulary List, indexed, with audio and example sentences (粵語)
okay i give up with this editor (it doesn't let me edit without reuploading the entire deck) and will just leave stuff in raw Markdown
##Deck highlights in brief
* covers [TOCFL 2-6](
https://www.tw.org/tocfl/)
* see the [supplementary deck](
https://ankiweb.net/shared/info/2101132829) i also shared for spoken Cantonese phrases
* is mostly sorted from least to most difficult
* has both characters and compound words
* has voiced sample sentences for about 80% of cards
* has audio for all words and sentences
* has English translation and clozed version of sentence
* includes cards to check for reading, listening, and production
* mobile friendly
This is based on
https://ankiweb.net/shared/info/1893881321 (which in turn was based on
https://www.roc-taiwan.org/at_de/post/634.html). Most of the changes were QoL improvements, such as:
* Redid CSS so that it is not tacky, works on AnkiDroid, and uses perfect black theme for OLED battery consumption considerations
* Added example sentences from [HSK Tingli decks](
https://ankiweb.net/shared/decks/tingli) which covered about half of the deck.
* tidied up definitions to use commas instead of slashes to avoid word wrap display issues
* Added explicit field for difficulty sequencing (see caveats), according to:
*in the Essentials deck:
* firstly, some bare essentials (numbers, common words that are also radicals, basic pronouns)
* then, with a few exceptions, the 250 most common words according to
https://github.com/ruddfawcett/hanziDB.csv
* then by the TOCFL grade level as per original deck, split into subdecks
* within each subdecks, cards are then sorted be HSK index according to this series of decks
https://ankiweb.net/shared/info/132435921
* then by words that have a separate Cantonese meaning
* then by frequency in use, according to
https://github.com/ruddfawcett/hanziDB.csv (converted to traditional)
* then by HSK level, according to
https://github.com/ruddfawcett/hanziDB.csv (converted to traditional)
* the remaining words are randomized to avoid repitition
* 3 card types in total, feel free to adjust/remove:
* *NOTE* In the deck options, you probably want to enable the two "bury related card" options.
* Recognition:
* Front: Chinese character, no jyutping
* Back: Full entry + audio
* Listening: (see caveats)
* Front: audio, jyutping, and spoilered memo and english sentence fields (for disambiguation, etc.)
* Back: Full entry
* Production: (see caveats)
* Front: no character nor jyutping, English definition. clozed sentences (if available)
* Rear: Full entry + audio
* Changed Cantonese voices to Microsoft Azure's neural generated voices (the original had many pronunciation errors, plus these ones sound much more natural(though this one still isn't perfect))
Fields:
*NOTE* I have added additional fields, so you will need to do some conversion prior to adding this deck if you want to continue your progress from the original deck.
* 漢字: The Chinese characters for the word, in Traditional
* 注音: Bopomofo (not shown in card style)
* 粵拼: Cantonese - jyutping
* Alternative: Alternative character(s) to a particular character(s) in the word, e.g. 部分 can alternatively be written as 部份.
* final_optional: character(s) that can optionally appear at the end of a word e.g. (兒) in 一點兒, (子) in 繩子 etc...
* 意思(華): Meaning in Chinese
* 意思(粵): Meaning Exclusive to Cantonese, e.g. 得意 means cute in Cantonese but complacent in Mandarin.
* jyutping_audio: Cantonese reading
* zyujam_audio: Mandarin reading (unused with default cards, media files are not in this deck, but I kept this field intact in case you decide to merge with the original)
* Example_ch: An example sentence in Chinese of the word being used
* Example_clozed: Clozed version of example Sentence
* Example_audio: Cantonese TTS recording of sample sentence
* Example_Eng: Translation of the example sentence
* Example_2_*: for a second example sentence
* Memo: self-note that shows in full entry. Also shows in listening, in case you need it to disambiguate between homophones. Renamed from "mnemonic" in original.
* level:
* 基礎級 - all vocab required up to level 2 in TOCFL;
* 進階級 - all vocab required in level 3 TOCFL;
* 高階級 - all vocab required in level 4 TOCFL;
* 流利級 - all vocab required in level 5&6 TOCFL
* order: Intended order this deck should be studied in (from least to most difficult)
default_tts_voice: The voice that I set Azure TTS to use. Sometimes one voice will pronounce things slightly wrong. Changing this field does nothing, It is just for my sorting/batch processing.
Caveats:
* This is based off another deck, so any mistakes in that deck are likely to be carried over unless I have corrected them manually.
* Sharing to AnkiWeb doesn't seem to preserve the order of the cards exactly, so it is recommended to reschedule them based on the "index" field before you start using the deck. If the first card you get is not what is listed below, please rearrange your deck by the "order" field.
* Fortunately, it seems like the default import order is *roughly* correct so it should be good enough if you are unfamiliar with editing decks.
* The first card of the Essentials deck should be 一
* The first card of the Level 2 deck should be 風
* The first card of the Level 3 deck should be 它
* The first card of the Level 4 deck should be 軍
* The first card of the Level 5-6 deck is 賓館
* Some sentences are in spoken style, some are in written style.
* With exeptions, a sentence that contains any word from the supplementary cantonese deck are in spoken style, those without such words are in written style.
* There might be rare bad formatting
* By default this deck has listening and production cards for everything, and which isn't always applicable. Recommended to suspend such cards.
* the "Bury related card" option is disabled by default, I recommend enabling it for these decks.
* Due to multiple readings, some the audio pronunciation isn't 100%. e.g. I just can't get the TTS to pronounce 飲料 without sounding really awkward. If you notice any such mispronunciations, let me know and I'll try with the other voices as that usually corrects it.
* There are a few tones that are exclusive to a single character which the TTS pronounces wrong, thus cannot be corrected.
* The sorting isn't perfect, there are still some common terms that come well after less common terms
* I'm using this deck so you might see my personal notes and tags, have random suspended cards, etc.
If you notice any errors, let me know in the comments and I may tidy it up.
Here is a pastebin of all the words in their intended sequence:
https://pastebin.com/AYYfMAzQ (I sometimes make slight variations manually, so this list is not 100% correct)
##Changelog:
2021-01-09
* Initial Release
2021-01-14
* Incremental updates and additions to sample sentences/audio
* Minor CSS fixes
* Added 5 second timer before displaying hints on listening checks
* Surname definitions are hidden on the front side of clozed cards
* Removed simplified Chinese and pinyin from "Variant of" definitions
After the file is downloaded, double-click on it to open it in the desktop
program.
At this time, it is not possible to add shared decks directly to your
AnkiWeb account - they need to be added from the desktop then synchronized
to AnkiWeb.