The Statistics View¶
The statistics view shows you statistics about the currently open document.
There are five main groups of statistics:
You can expand or collapse each section by clicking on the header of the relevant section.
You can also copy information from the statistics view to the clipboard by selecting the rows of interest and then copying from the main menu (Edit->Copy) or by using standard ‘Copy’ shortcut keys.
Word Statistics¶
This section displays information about the total and unique number of words in the currently open document. This includes:
- Total
- The total number of words in the document.
- Total Known
- The total number of words in the document that you know (based on the currently active vocabulary profile).
- Total Percent Known
- The percentage of known words in the document compared to the total number of words.
- Total Unknown
- The total number of unknown words that appear in the document.
- Total Percent Unknown
- The percentage of unknown words, compared to the total number of words in the document.
- Unique
- The total number of unique words in the document.
- Unique Known
- The total number of known unique words.
- Unique Percent Known
- The percentage of known unique words, compared to the total number of unique words in the document.
- Unique Unknown
- The number of unknown unique words.
- Unique Percent Unknown
- The percentage of unknown unique words, compared to the total number of unique words in the document.
HSK Statistics¶
This section shows the percentage of both the total and unique number of words in the document for each HSK level (1-6), as well as the percentage of total and unique words in the document that are not defined in the HSK vocabulary lists.
Levels 2-6 also include the cumulative percentage of all preceding levels, displayed in brackets after the raw percentage for that level.
TOCFL Statistics¶
This section shows the percentage of both the total and unique number of words in the document for each TOCFL level (1-5), as well as the percentage of total and unique words in the document that are not defined in the TOCFL vocabulary lists.
Levels 2-5 also include the cumulative percentage of all preceding levels, displayed in brackets after the raw percentage for that level.
Character Statistics¶
This section show the total number, and the total number of unique Chinese characters in the entire document.
This number does not include punctuation or whitespace.
File Statistics¶
The section shows general information about the currently open text document, including:
- The total number of bytes in the text file.
- The total number of Unicode codepoints in the file.
- The total number of lines in the file.
- The total time it took to segment and analyse the entire file.