{"id":1279,"date":"2014-03-27T10:05:12","date_gmt":"2014-03-27T07:05:12","guid":{"rendered":"http:\/\/ssrlab.by\/?p=1279"},"modified":"2018-11-21T14:20:53","modified_gmt":"2018-11-21T11:20:53","slug":"132","status":"publish","type":"post","link":"https:\/\/ssrlab.by\/en\/1279","title":{"rendered":"Belarusian and Russian text-to-speech synthesizer for stationary, mobile and web-based platforms"},"content":{"rendered":"<p>&nbsp;<\/p>\n<p><span id=\"result_box\" lang=\"en\" style=\"text-align: justify; line-height: 1.6em;\"><span class=\"hps\">Regular<\/span> <\/span><span style=\"text-align: justify; line-height: 1.6em;\">people-to-people communication is performed through hearing and voice. <\/span><span id=\"result_box\" lang=\"en\" style=\"text-align: justify; line-height: 1.6em;\"><span class=\"hps\">This method<\/span> <span class=\"hps\">of interaction<\/span> <span class=\"hps\">is also desirable for<\/span> <span class=\"hps\">the human-machine<\/span> <span class=\"hps\">relationship. 
Such a technology as <\/span><\/span><b style=\"text-align: justify; line-height: 1.6em;\">Speech Synthesis <\/b><span style=\"text-align: justify; line-height: 1.6em;\">allows your<\/span><b style=\"text-align: justify; line-height: 1.6em;\"> <\/b><span id=\"result_box\" lang=\"en\" style=\"text-align: justify; line-height: 1.6em;\"><span class=\"hps\">mobile or<\/span> <span class=\"hps\">stationary computer to convert\u00a0<\/span><\/span><span style=\"text-align: justify; line-height: 1.6em;\">separate words, sentences and other <\/span><span id=\"result_box\" lang=\"en\" style=\"text-align: justify; line-height: 1.6em;\"><span class=\"hps\">text fragments<\/span><\/span> <span id=\"result_box\" lang=\"en\" style=\"text-align: justify; line-height: 1.6em;\"><span class=\"hps\">in one of <\/span><span class=\"hps\">two<\/span> <span class=\"hps\">official languages of<\/span> <span class=\"hps\">the Republic of Belarus,<\/span> <span class=\"hps\">Belarusian or <\/span><span class=\"hps\">Russian<\/span><\/span><span style=\"text-align: justify; line-height: 1.6em;\">, into speech.<\/span><\/p>\n<p style=\"text-align: justify;\">TTS-based systems may take the form of such\u00a0<span id=\"result_box\" class=\"short_text\" lang=\"en\"><span class=\"hps\">multimedia products as<\/span><\/span> <strong>talking <span id=\"result_box\" lang=\"en\"><span class=\"hps\">electronic <\/span><\/span>answering machines<\/strong> (voicing of fault and warning messages, SMS, e-mails, and chat messages; TTS voicing in <span dir=\"auto\">queue management systems<\/span>); <strong>audiobooks<\/strong> (sequential reading, educational dialogues, audio guides); <strong>Internet radio<\/strong> (RSS readers and website readers); and <strong>multimedia presentations<\/strong> of the \u201ctext-image-sound\u201d type.<\/p>\n<p style=\"text-align: justify;\">The intended audience covers almost\u00a0<span id=\"result_box\" lang=\"en\"><span class=\"hps\">all the people<\/span> 
<span class=\"hps\">of the country: <\/span>any attentive <span class=\"hps\">listener<\/span> <span class=\"hps\">in a particular<\/span> <span class=\"hps\">room or<\/span> a <span class=\"hps\">passerby<\/span> <span class=\"hps atn\">of any age (<\/span>children, adults <span class=\"hps\">and senior citizens<\/span> in <span class=\"hps\">the street,<\/span> <span class=\"hps\">in a bank,<\/span> <span class=\"hps\">at school,<\/span> <span class=\"hps\">in an office or apartment<\/span>, or <span class=\"hps\">in a vehicle<\/span>). <span class=\"hps\">People<\/span> <span class=\"hps\">with disabilities<\/span> <span class=\"hps atn\">(<\/span>visually impaired, <span class=\"hps\">with weakened<\/span> <span class=\"hps\">vocal cords<\/span> <span class=\"hps\">or<\/span> <span class=\"hps\">hearing) may also become users of a <\/span><\/span>TTS-based system<span style=\"text-align: center; line-height: 1.6em;\">.<\/span><\/p>\n<p style=\"text-align: justify;\"><strong>Text-to-Speech Synthesizer for Stationary Platforms<\/strong><img style=\"opacity: 0.9; text-align: center; width: 500px; height: 301px; float: right; margin-left: 7px; margin-right: 7px;\" src=\"http:\/\/ssrlab.by\/wp-content\/uploads\/2014\/03\/image008.png\" alt=\"\" \/><\/p>\n<p style=\"text-align: justify;\">The interface for stationary platforms asks the user which language the text will be entered in. The entered text is then passed to the specialized processors (linguistic, prosodic (intonational), phonetic, and acoustic). Finally, the processed text is converted into an audio signal.<\/p>\n<p style=\"text-align: justify;\"><strong>The novelty and originality of the TTS design<\/strong> lie in the following: the text-to-speech synthesizer uses the same algorithms for both languages, with implementations only slightly adapted to the language-dependent linguistic resources. 
As a result, considerable computing resources are saved.<span style=\"text-align: center; line-height: 1.6em;\">\u00a0<\/span><\/p>\n<p style=\"text-align: justify;\">The developed algorithms for converting numbers into ordinal or cardinal numerals, and their modifications, make it possible to voice dates (e.g., 25.10.2011), times, temperatures (e.g., 212\u00a0\u00b0F), scales, technical names (e.g., the\u00a0Yakovlev\u00a0Yak-15), abbreviations, and acronyms (e.g., UNIX, Android\u00a04.3). Unlike existing algorithms, they take into account the declension of ordinal numerals by gender, number, and case, and can therefore make synthesized speech sound more natural.<\/p>\n<div>\n<p style=\"text-align: justify;\"><img loading=\"lazy\" style=\"line-height: 1.6em; opacity: 0.9; text-align: center; float: left; margin-left: 10px; margin-right: 10px;\" src=\"http:\/\/ssrlab.by\/wp-content\/uploads\/2014\/03\/image010.png\" alt=\"\" width=\"119\" height=\"238\" \/><\/p>\n<p style=\"text-align: justify;\">\n<p style=\"text-align: justify;\">\n<p style=\"text-align: justify;\"><strong>Text-to-Speech Synthesizer for Mobile Platforms<\/strong><\/p>\n<div>\n<p style=\"text-align: justify;\">The text-to-speech synthesis system has been implemented on the J2ME platform, which is used in a wide variety of mobile phones. The synthesizer places low demands on this platform, so it can run on the majority of mobile phones. Another distinctive feature in favour of the TTS synthesizer is its built-in technology for placing accents in words, which, unlike known analogues, takes heuristic and statistical accent characteristics into account. 
Thanks to this, the size of the grammatical dictionary is significantly reduced without loss of accent placement accuracy.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Text-to-Speech Synthesizer for the Internet<\/strong><\/p>\n<p>The system has been implemented in PHP, a free scripting language that is considered the most popular one on the web. Users can visit the website <a href=\"http:\/\/corpus.by\/\">http:\/\/corpus.by\/<\/a> at any time and synthesize speech for any text in the appropriate language. Once the speech has been generated, the user can play the resulting sound file, download and save it, or share a link to it with friends via e-mail or a social network. For example, at <a href=\"http:\/\/corpus.by\/\">http:\/\/corpus.by\/<\/a> it is possible to quickly create language riddle tests.<\/p>\n<p><strong>The Text-to-Speech Synthesizer Is a Widely Used Tool<\/strong><\/p>\n<p>The text-to-speech synthesizer can voice texts in the Belarusian and Russian languages for a wide range of users. Thanks to the language-independent architecture, it can speak in the same voice (male or female) in different languages. The ability to run the TTS synthesizer on different platforms makes it usable almost universally.<\/p>\n<div style=\"font-size: 13px;\"><\/div>\n<\/div>\n<\/div>\n<p><!--more--><\/p>","protected":false},"excerpt":{"rendered":"<p>Regular people-to-people communication is performed through hearing and voice. This method of interaction is also desirable for the human-machine relationship. 
Such a technology as Speech Synthesis allows your mobile or stationary computer to convert separate words, sentences and other text fragments in one of two official languages of the Republic of Belarus, Belarusian or Russian, into speech.<\/p>\n<a class = \"excerpt\" href=\"https:\/\/ssrlab.by\/en\/1279\">Read more...<\/a>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[5,290,291,294,304,305,311],"tags":[],"_links":{"self":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts\/1279"}],"collection":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/comments?post=1279"}],"version-history":[{"count":28,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts\/1279\/revisions"}],"predecessor-version":[{"id":3905,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts\/1279\/revisions\/3905"}],"wp:attachment":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/media?parent=1279"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/categories?post=1279"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/tags?post=1279"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}