Vietnam.vn - Nền tảng quảng bá Việt Nam

Human-Engineer “GenZ” of Zalo AI presents research at world’s leading scientific conference

Việt NamViệt Nam11/09/2024


The research project to increase the accuracy of real-time speech recognition models (Streaming Automatic Speech Recognition) by Le Duy Khanh - "GenZ" engineer of Zalo AI - will be announced for the first time at the International Scientific Conference, taking place in Greece in September 2024.

With the topic " Improving Streaming Speech Recognition With Time-Shifted Contextual Attention And Dynamic Right Context Masking " , the research paper of the Zalo AI engineer born in 2000 achieved an almost perfect score - 11/12 points, passing the rigorous review round with more than 2,000 participating papers to be presented at the Interspeech Conference in the form of an oral session.


I am very proud that my first scientific article was recognized by a prestigious scientific conference and I have the opportunity to introduce Vietnam's research achievements to big-tech, experts and the international community ,” Le Duy Khanh shared.

Under the guidance of Dr. Chau Thanh Duc - Head of Research and Development Department at Zalo AI, Lecturer at University of Science (Ho Chi Minh City National University), this research project is expected to make an important contribution to upgrading speech recognition models, increasing the accuracy of voice dictation and voice-to-text on Zalo application.

Synthesizing Zalo AI’s highly practical research into scientific papers and presenting them at prestigious international conferences is of great significance. It not only demonstrates the capacity of Vietnamese engineers, but also demonstrates the desire to share experiences and contribute to the development of the global AI community,” said Dr. Chau Thanh Duc.

Previously, Zalo integrated this research into its messaging application since the end of 2023, significantly improving the accuracy of the "voice message composition" feature. This feature allows users to compose messages by voice instead of typing, saving time and making it more convenient in many usage situations. At the same time, the accuracy of this feature has reached 95% in practice; the rate of needing to edit text after composing by voice has decreased from 6.4% to only 4.8%.


According to Zalo statistics, although the feature is still in the testing phase, it has generated nearly 4.5 million messages per day and attracted about 3.2 million monthly users (data updated to June 2024).

Since starting its pioneering journey in AI research in 2017, Zalo has always believed in “empowering” the younger generation. Currently, up to 31% of Zalo employees belong to the GenZ generation. In 2021, two other research topics of the Zalo AI engineering team related to speech processing technology were also recognized at the Asia-Pacific International Conference on Artificial Intelligence (PRICAI 2021). Notably, the authors of these two topics are all young researchers under the age of 30.

Interspeech is a long-standing, comprehensive and prestigious international conference on Speech Processing organized by the International Speech Communication Association. This year, the conference with the theme “Speech and beyond takes place from September 1-5, 2024 on the island of Kos (Greece).

Source: https://www.vng.com.vn/news/people/ky-su-genz-cua-zalo-ai-gioi-thieu-nghien-cuu-tai-hoi-nghi-khoa-hoc-hang-dau-the-gioi.html


Comment (0)

No data
No data

Same tag

Same category

View of Nha Trang beach city from above
Check-in point of Ea H'leo wind farm, Dak Lak causes a storm on the internet
Images of Vietnam "Bling Bling" after 50 years of national reunification
More than 1,000 women wearing Ao Dai parade and form a map of Vietnam at Hoan Kiem Lake.

Same author

Heritage

Figure

Business

No videos available

News

Political System

Local

Product