{"id":3231,"date":"2015-03-19T08:00:47","date_gmt":"2015-03-19T05:00:47","guid":{"rendered":"http:\/\/ssrlab.by\/?p=3231"},"modified":"2018-11-21T14:20:50","modified_gmt":"2018-11-21T11:20:50","slug":"sintezatar-mauliennja-dlja-belaruskaj-movy","status":"publish","type":"post","link":"https:\/\/ssrlab.by\/en\/3231","title":{"rendered":"Academy of Sciences set up Speech Synthesis for Belarusian"},"content":{"rendered":"<p>Download (PDF, 1.41MB)<\/p>\n<p style=\"text-align: justify;\">United Institute of Informatics Problems of the National Academy of Sciences of Belarus for more than 40 years\u00a0<span style=\"font-weight: 400;\">has been engaged<\/span>\u00a0in speech technology. A new direction proposed by the former head of the laboratory, and now &#8211; the main researcher, Boris Lobanov\u00a0Ph.D. &#8211; is computer person voice cloning.<\/p>\n<p style=\"text-align: justify;\">This technology allows you to play any text with the manners of reading of a particular person and his voice, to recreate the voices of well-known personalities.<\/p>\n<p style=\"text-align: justify;\">Lilija CIRU\u0139NIK, the acting head of the speech synthesis and recognition laboratory of the United Institute of Informatics Problems of the National Academy of Sciences tells us about prospects for the development of speech technologies.<\/p>\n<p style=\"text-align: justify;\">\u2014\u00a0What is the speech recognition?<\/p>\n<p style=\"text-align: justify;\">\u2014 The ultimate goal of speech recognition to make computer program\u00a0understand the meaning of statements and perform some action. There are two tasks. The first one \u2014 separate voice recognition commands.For example, instead of entering these or other commands using the keyboard or mouse, you can give them by voice. The system will respond \u2014 select text, copy, move to the line above.<\/p>\n<p style=\"text-align: justify;\">The system can be used in manufacture when working with complex equipment, where instead of using mechanical levers voice commands can be utilized.<\/p>\n<p style=\"text-align: justify;\">The second problem \u2014\u00a0a so-called recognition of continuous speech. It&#8217;s like stenography. So the computer will be able to\u00a0give you our conversation\u00a0in a view of\u00a0a text file.<\/p>\n<p style=\"text-align: justify;\">\u2014 Speech synthesis\u00a0\u2014 vice versa?<\/p>\n<p style=\"text-align: justify;\">\u2014 Yes. <span style=\"font-weight: 400;\">Speech Synthesizer is a computer program, which according to the entered text by voice output information, creates audio files corresponding to the input text.<\/span> Do you want to &#8211; the program will read to you Leo Tolstoy or a newspaper article. The main thing is an original\u00a0text file.<\/p>\n<p style=\"text-align: justify;\">\u2014 Then whose voice will it be?<\/p>\n<p style=\"text-align: justify;\">\u2014 Any text of any size can be read by male or female voice. With an original technology, we can create a personal voice of any man. When playing, you can change the tone of voice, the speed and the playback volume. The resulting voice recording can be saved in different formats, for example, in the popular MP3 format.<\/p>\n<p style=\"text-align: justify;\">\u2014 How can you use it in practice?<\/p>\n<p style=\"text-align: justify;\">\u2014 With it, for example, we can create audiobooks. Of course, a professional actor will voice audiobook much better than a computer program. However, with the use of the program &#8211; a speech synthesizer, you can choose to listen to any book and to create on its basis the sound file.<\/p>\n<p style=\"text-align: justify;\"><span style=\"font-weight: 400;\">The speech synthesizer is important for blind and visually impaired people and, in particular, information kiosks, which are now used in banks, airports, railway stations.\u00a0<\/span>Information kiosks give not only visual information (granted on the screen), but voice\u00a0information as well. This information is now usually pre-recorded and played back when necessary. However, after\u00a0any change it should be rewritten. If you use a speech synthesizer, it will simplify and reduce the cost of the task.<\/p>\n<p style=\"text-align: justify;\">Another example is to inform customers on the phone. For instance, some organizations have to report the debt for the rent or telephone. It would also be wise to use a speech synthesizer.<\/p>\n<p style=\"text-align: justify;\"><span style=\"font-weight: 400;\">By embedding a speech synthesizer in the<\/span><span style=\"font-weight: 400;\"> work<\/span><span style=\"font-weight: 400;\">\u00a0with e-mail,<\/span><span style=\"font-weight: 400;\"> you can listen to the incoming mail while doing something else<\/span><span style=\"font-weight: 400;\">.\u00a0<\/span>You can, for example, convert\u00a0the form of an electronic newspaper into the audio file and listen to it on the way to work.<\/p>\n<p style=\"text-align: justify;\">\u2014 Are there many such programs in the world?<\/p>\n<p style=\"text-align: justify;\">\u2014 Yes, of course, Programs exist for the majority of modern languages. There are several systems for the Russian language, the quality of which is comparable with the system we have created. The development of a speech synthesizer for each language has its own characteristics.<\/p>\n<p style=\"text-align: justify;\">\u2014 You are working on the creation of speech synthesis for the Belarusian language, aren&#8217;t you?<\/p>\n<p style=\"text-align: justify;\">\u2014 Yes. But the quality of the program does not satisfy us.<\/p>\n<p style=\"text-align: justify;\">\u2014 What&#8217;s the problem?<\/p>\n<p style=\"text-align: justify;\">\u2014 <span style=\"font-weight: 400;\">One of the main features <\/span><span style=\"font-weight: 400;\">is the development of linguistic and acoustic information resources while creating a speech synthesis system.<\/span>\u00a0In the synthesis speech text you need to know where to put the emphasis on\u00a0each word. Belarusian and Russian does not have stable stresses, so you need to create an electronic dictionary of stresses, containing the largest possible number of words. Another problem is the intonation of speech.\u00a0<span style=\"font-weight: 400;\">To make synthesized speech \u201cright\u201d a database<\/span><span style=\"font-weight: 400;\"> should be created<\/span><span style=\"font-weight: 400;\"> \u00a0for the intonations of the Belarusian language.\u00a0<\/span><span style=\"font-weight: 400;\">It is necessary to have a voice database for scoring arbitrary text, which contains all the sounds of the language and basic shades.\u00a0<\/span>Such a framework for the Russian language contains about 800 short audio segments. It is necessary to replenish the sounds specific to the Belarusian language<\/p>\n<p style=\"text-align: justify;\">\u2014 Are your developments available for users?<\/p>\n<p style=\"text-align: justify;\">\u2013\u00a0<span style=\"font-weight: 400;\">We offer the developed system of creating and scoring audiobooks \u201caBookForge\u201d \u00a0as software product that can be purchased by any user.\u00a0<\/span>The Institute has concluded a license agreement with a private firm, which sells it.<\/p>\n<p style=\"text-align: justify;\">\u2014 Tell us about the project &#8220;Talking Head&#8221;.<\/p>\n<p style=\"text-align: justify;\">\u2013 It\u00a0is an audio-visual speech synthesis. Speech synthesis audiovisual technology includes not only the scoring of voice, text comments, but also it displays the head and articulatory organs (lips, cheeks, jaw, etc.) during text pronunciation.\u00a0There are two approaches of formation\u00a0an audio-visual speech synthesizer: the creation of stylized three-dimensional model of &#8220;talking heads&#8221;, as well as creating a personal two-dimensional &#8220;talking head&#8221; of a particular person on the basis of photographs of his face in the pronunciation of certain sounds. T<span style=\"font-weight: 400;\">he system of audiovisual speech synthesis on the text<\/span><span style=\"font-weight: 400;\"> is in demand<\/span><span style=\"font-weight: 400;\"> not only for people with sight problems, but also <\/span><span style=\"font-weight: 400;\">for<\/span><span style=\"font-weight: 400;\"> hard of hearing, as they can read the \u201ctalking head\u2019s\u201d lips<\/span><\/p>\n<p style=\"text-align: justify;\">\u2014 In your opinion, what prospects has the development of speech technologies in Belarus?<\/p>\n<p style=\"text-align: justify;\">\u2013\u00a0During the last 15-20 years the speech technologies got rapid development. Speech recognition, speech synthesis of text, voice identification and verification of identity has now achieved a high quality and are used in many practical applications. However, existing systems are used in many new practical fields, new ways of their improving are developing. <span style=\"font-weight: 400;\">The speech technology development has high potential in Belarus. TTS<\/span><span style=\"font-weight: 400;\">\u00a0systems can be further developed and implemented on the scoring systems of public transport, teaching of Russian \/ Belarusian, self-service terminals.<\/span><\/p>\n<p><strong>Source:<\/strong>\u00a0<span style=\"text-decoration: underline;\"><a href=\"http:\/\/baranovichi.by\/belarus-news\/14374-v-akademii-nauk-sozdali-sintezator-rechi-dlja-belorusskogo-jazyka.html\">http:\/\/baranovichi.by\/belarus-news\/14374-v-akademii-nauk-sozdali-sintezator-rechi-dlja-belaruskogo-jazyka.html<\/a><\/span><\/p>\n<p><iframe src=\"\/\/docs.google.com\/viewer?url=http%3A%2F%2Fssrlab.by%2Fwp-content%2Fuploads%2F2015%2F03%2Fbaranovichi-by-v-akademii-nauk-sozdali-sintezator-rechi-dlja-belorusskogo-jazyka.pdf&hl=en_US&embedded=true\" class=\"gde-frame\" style=\"width:100%; height:500px; border: none;\" scrolling=\"no\"><\/iframe>\n<p class=\"gde-text\"><a href=\"http:\/\/ssrlab.by\/wp-content\/uploads\/2015\/03\/baranovichi-by-v-akademii-nauk-sozdali-sintezator-rechi-dlja-belorusskogo-jazyka.pdf\" class=\"gde-link\">Download (PDF, 1.41MB)Download (PDF, 1.41MB)<\/p>","protected":false},"excerpt":{"rendered":"<p>United Institute of Informatics Problems of the National Academy of Sciences of Belarus for more than 40 years engaged in speech technology. A new direction proposed by the former head of the laboratory, and now &#8211; the main researcher, Boris Lobanov Ph.D. &#8211; computer person voice cloning.<\/p>\n<a class = \"excerpt\" href=\"https:\/\/ssrlab.by\/en\/3231\">Read more...<\/a>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[268],"tags":[],"_links":{"self":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts\/3231"}],"collection":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/comments?post=3231"}],"version-history":[{"count":20,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts\/3231\/revisions"}],"predecessor-version":[{"id":3780,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/posts\/3231\/revisions\/3780"}],"wp:attachment":[{"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/media?parent=3231"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/categories?post=3231"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ssrlab.by\/en\/wp-json\/wp\/v2\/tags?post=3231"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}