As of its launch on July 4, 2022, ISCSLP 2022 Conversational Short-phrase Speaker Diarization Challenge has received more than 40 registration. On July 24, the committee releases the baseline and training datasets for all participants.
In the recently held 2022 World Artificial Intelligence Conference, the WAIC 2022 Data Element Circulation Technology Frontier Exploration Forum was one of the major theme forums of the conference. With the theme of "Open Symbiosis, Integration of Data and Reality", the forum focused on the important economic and strategic value of data as a key production factor driving economic and social innovation and development, as well as the corresponding security threats and privacy challenges.
Recently, Dogecoin DOGE/USD founder Billy Marcus tweeted, "Would you be friends if you could upload your brain to the cloud and talk to a virtual version of yourself?" Musk replied, "I've already done it".
Since the epidemic in 2020, the most popular is not a popular star, but a "virtual human". From the Japanese fashionista IMMA, the domestic AYAYI, the virtual singer Getong, to the dimensional virtual person A-SOUL combination, to CCTV's virtual person Xiao C and the live-action virtual people Teresa Teng and Gong Jun... They are from the fashion industry, From the singing and dancing world, the dimension world to reporters, actors and other industries, there are many fans who shine.
With the development of artificial intelligence, many people are no strangers to voiceprint recognition. Voiceprint recognition is to convert sound signals into electrical signals, and then use a computer for identification. Different tasks and applications will use different voiceprint recognition technologies. For example, identification technology may be required when narrowing the scope of criminal investigations, while verification technology may be required for banking transactions.
In June 2022, Google engineer, Blake Lemonine, claimed that Google's large-scale language model LaMDA had human "self-awareness", and in Blake Lemoine's view, even the artificial intelligence created by GPT-3, the largest language neural model constructed in the form of OpenAI open source architecture, the consciousness of "human" may also appear. According to Blake Lemoine's interview quotes, LaMDA can access services such as YouTube, Google Search, Google Maps, and Google Books, which means that it can continue to accumulate "knowledge" through Google's services, thereby becoming smarter, that is, it can continue to imitate the human brain. Learning to evolve. As soon as the news came out, discussions about whether LaMDA had a "personality" spread across social platforms at home and abroad.
Since 2013, with the development of deep neural networks, the effect of machine translation has improved significantly, but it has not yet reached the point where it can "understand" the language that needs to be translated. There are more than 7,000 languages identified in the world. Among them, Chinese, English, Spanish, Russian, Arabic and French are the main languages in the world and the main working languages of the United Nations. The top ten most spoken languages in the world are, in order: Chinese, English, Russian, Spanish, Hindi, Arabic, Portuguese, Bengali, German and Japanese.
The 23rd INTERSPEECH (INTERSPEECH 2022) Conference is going to take place from September 18 to 22, 2022 at Songdo ConvensiA, in Incheon, Korea and virtually, under the theme “Human and Humanizing Speech Technology”. INTERSPEECH is the world’s largest and most comprehensive conference on the science and technology of spoken language processing. The conference formulates the vision of the scientific and industrial community to commit endeavors to continue the effort in speech science toward humanizing the spoken language technology. Magic Data is proud to be Silver Sponsor of INTERSPEECH 2022 and building a stronger connection with the community.
On August 11, Lei Jun, founder and CEO of Xiaomi, unveiled the company’s latest AI product: CyberOne, a full-scale humanoid robot. It is reported that CyberOne is 177CM tall and weighs 52KG. The humanoid—whose nicknamed "Metal Bro", can perceive human emotions, has keen vision, and can achieve bipedal posture balance. This full-scale humanoid bionic robot can also perceive 45 kinds of human semantic emotion and has a depth information accuracy of 1% within 8 meters. Lei Jun said that CyberOne takes artificial intelligence as the core and standard humanoid as the carrier. What amazes me most about this robot is its 45 kinds of human semantic emotion perception ability, which makes this product no longer a splicing of cold metal materials, but a "warm" and perceptible mind.
In 2014, the Amazon Echo speaker first combined voice interaction with the speaker, allowing the speaker to realize functions such as making calls, setting alarm clocks, and checking the weather. Voice interaction brings users a new experience, and Amazon has always occupied the largest market share in the smart speaker industry by virtue of being the first to enter the game.