Character Frequency Counter


Service “Character Frequency Counter” is designed to produce statistical and background information about the characters in the text. Service interface is on Figure 1. The input of the service can be any text file or sequence of characters. For information about the symbols need to press the “Get Information on Characters! / Атрымаць інфармацыю пра сімвалы!”.

characterInfo_GUI_2015-03-26Figure 1 – Information on Characters interface

Service counts the total number of characters of the input text and the number of unique characters in the text. For each unique character information is gathered about its code (Unicode standard), name (for those characters that are in the database), the number of occurrences of the character in the input text and information about the context in which the character in the text was found for the first time. The names of characters are available in one of three languages – Belarusian, Russian and English – by selecting the desired language from the drop-down list. The resulting data is displayed to the user in the form of a table. An example of the resulting data is presented on Figure 2.

characterInfo_usage_2015-03-26Figure 2 – Results of text processing by Information on Characters

Service allows to sort the outputs for each of the columns. To do this, click on the column header for which you want to sort the list. Pressing the same heading for the second time you will get the list in reverse order. Figure 3 shows an example of the resulting data, sorted by the number of occurrences in the text.

characterInfo_usageWithSort_2015-03-27Figure 3 – The resulting data sorted by the number of occurrences in the text.

Service pagehttp://corpus.by/CharacterFrequencyCounter/?lang=en

If you have found a spelling error, please, notify us by selecting that text and pressing Ctrl+Enter.