Recently, the IEEE ASRU 2023 Automatic Speech Recognition and Understanding Symposium was successfully concluded in Taipei, Taiwan. Top experts, scientific research teams and famous technology companies from global academia and industry gathered together to share the current development trend and latest research results of the voice industry. As a silver sponsor, Databaker was invited to appear at the conference and showed the guests the rich multilingual datasets and all-round data solutions of Databaker.
According to reports, the ASRU Symposium is the flagship technical event of the IEEE Speech and Language Processing Technical Committee (SLTC) and the top conference in the speech and language processing academic circle. It is held every two years and has a long history and wide influence. The conference brings together top experts and researchers from academia and industry to work together on a wide range of speech recognition and comprehension issues.
As an industry-leading AI full-scenario data service provider, Databaker has always been at the forefront of technological innovation. Based on large-scale pre-trained language models and automatic annotation capabilities, it provides customers with more efficient standard collection services for all types of data such as speech, text, and images.
At present, Databaker has accumulated rich experience in AI data services, and has built a large-scale, multilingual, and high-quality AI database, with hundreds of thousands of hours of effective audio data. It covers dozens of foreign languages such as Japanese, Korean, Arabic, Indonesian, Pakistani Portuguese, Spanish, Russian, English, French, German, Italian, and more than 10 dialects such as Taiwanese, Cantonese, Hokkien, Uyghur, Tibetan, Sichuan, Tianjin, and Northeast Chinese.
At the ASRU conference, Databaker Technology demonstrated full-stack AI data solutions and rich data products such as voice, text, and images to the guests.
The commercialization of AI has driven the rapid growth of basic data services
According to the "2022 Artificial Intelligence Basic Data Service*** statistics" released by professional consulting service agency Deloitte, the market size of Chinese artificial intelligence basic data service in 2022 will be 4.5 billion yuan, and the market size is expected to reach 13 billion to 16 billion yuan in 2027. At the same time, with the realization of complex intelligent scenarios such as intelligent manufacturing, metaverse, autonomous driving, and generative AI, higher requirements are put forward for AI basic data services. Automated annotation, professional data collection, and full-stack services have become the three core capabilities of AI basic data.
Excerpted from Deloitte's 2022 Artificial Intelligence Basic Data Services***
Databaker Technology has accumulated many years of AI technical capabilities and business processes, launched an integrated data collection tool AI data platform, built-in large model, and comprehensively supports the collection and annotation of multi-modal data such as voice, text, image and point cloud through intelligent multi-domain data annotation and processing capabilities, so as to solve the diverse and complex data needs of AI landing scenarios, and realize the double improvement of data production efficiency and data quality.
In terms of project delivery capabilities, Databaker has established a number of professional-level data bases in Tianjin, Changchun, Yichang and other places, based on advanced data acquisition equipment, professional data annotation teams and strict quality control systems and other software and hardware conditions, to create a full-stack service plan from scheme design to collection and annotation customization project implementation to the final high data delivery, in-depth layout of vertical field scene applications, to meet the differentiated customization needs of customers, and provide comprehensive and efficient data services.
Up to now, Databaker has successfully provided high-quality data services for more than 600 enterprises and research institutions around the world, with a total of more than 1,000 service projects, covering smart home, smart finance, smart **, social entertainment, smart cockpit, smart city, e-commerce and many other fields.
With the advancement of technology and the expansion of application scenarios in the future, data elements will play a more important role in economic and social development. Databaker will further improve data products and services, provide more professional and efficient data solutions for the AI industry, and promote the vigorous development of the artificial intelligence industry.