On January 30, iFLYTEK held the Spark Cognitive Model V35. At the upgrade conference, the iFLYTEK Xinghuo v3. was officially launched based on the first national computing power training5。
With the sweep of the wave of large models, major manufacturers have begun to expand their layout in the field of large models, and iFLYTEK has also responded positively. On October 24, 2023, iFLYTEK and Huawei jointly announced the official launch of the first Vanka domestic computing platform "Feixing-1" to support the training of trillion-parameter large models. In the more than 90 days since its launch, iFLYTEK Xinghuo has continued to increase its R&D investment, and based on the "Feixing No. 1", it has launched a large-scale model training with larger parameters against GPT-4, which is iFLYTEK Xinghuo V3 on January 305. The foundation for the upgrade release has been laid.
At the latest iFLYTEK press conference, the company highlighted a series of eye-catching keywords, including "beyond GPT-4" and "domestic independent and controllable computing platform". The Spark model v3 released this time5 After a comprehensive upgrade, its performance is not only close to the level of GPT-4 Turbo, but also has made significant breakthroughs in several key areas.
It is understood that the Spark large model V3After a full upgrade, its performance is already close to the level of the GPT-4 Turbo. Specifically, it has surpassed GPT-4 Turbo in language comprehension and math, with 96% of GPT-4 Turbo capable and 91% of GPT-4V capable of multimodal comprehension.
Liu Qingfeng, chairman of iFLYTEK, said: "iFLYTEK Spark V35 has reached a critical turning point in the ability to improve. By 2024, the iFLYTEK Xinghuo cognitive model will show excellent performance in more scenarios and fields.
This series of developments has raised people's attention to the core of the development of large models.
First of all, it is worth noting that the Spark model v35 have surpassed GPT-4 Turbo in language comprehension and math. This indicates that the development of large models is moving towards a more comprehensive and in-depth direction. Language understanding has always been one of the core concerns in the development of large models, and iFLYTEK's new model has performed impressively in this regard. This means that in the future of natural language processing tasks, iFLYTEK's large model is expected to play a more important role, providing more accurate and intelligent solutions for various application scenarios.
Secondly, in terms of ability, the Spark model v35 has reached 4% of GPT-96 Turbo. This reflects a significant improvement in the ability of large models to understand and generate**. With the continuous development of information technology, the demand for large models with strong understanding and generation capabilities is also increasing. The superior performance of iFLYTEK's new model in this field indicates the application potential of large models in promoting software development and automated programming.
Finally, the multimodal comprehension capability is the iFLYTEK Xinghuo large model v3Another highlight of the 5, reaching 91% of GPT-4V. This means that the model is better able to understand and process multiple input data, including text, images, sound, and other modalities. This is of great significance for realizing more intelligent and integrated human-computer interaction, information processing and other applications.
In general, iFLYTEK presented the Spark large model v3 in this press conferenceNot only does the 5 surpass the GPT-4 Turbo in terms of performance, but it also makes significant progress in several key areas. These breakthroughs mark the evolution of the core of the development of large models, providing more powerful and intelligent solutions for all walks of life. At the same time, as a large model of domestic independent and controllable, iFLYTEK's achievements also highlight China's increasingly strong position in the field of artificial intelligence.
However, it is worth noting that more independent assessment and verification of the objectivity and reliability of these claims is still needed. In the highly competitive tech sector, objective data and evaluations will help better understand the real value of this new model.
According to reports, Spark V35's seven core competencies have been comprehensively improved, including text generation improvement73% and 7% improvement in language comprehension6%, knowledge quiz increased by 47%, logical reasoning increased by 95%, math ability improved by 98%, *Ability increase 80%, multimodal capability increased by 66%。
iFLYTEK launched the Spark v35. It has achieved comprehensive improvement in text generation, language comprehension, knowledge question and answer, logical reasoning, mathematical ability, advanced ability and multimodal ability.
First of all, the improvement of text generation is one of the core capabilities of large models. In the age of information, it will be an important task for models to generate more accurate and expressive texts. Starfire v35 Implemented in terms of text generation 7The 3% increase provides strong support for the model to better understand and generate natural language. This is closely related to the development trend of large models in natural language processing tasks.
Secondly, the improvement of language comprehension is also an important direction for the development of large models. Starfire v35. Achieved 7 in language comprehensionThe 6% increase shows that the ability of large models to understand context and reason about semantic relationships is constantly increasing. This has a positive effect on the realization of tasks such as more intelligent dialogue systems, sentiment analysis, etc.
Knowledge question answering and logical reasoning are the other two key directions in the development of large models. Starfire v35 achieved 4 in these two aspects7% and 9The 5% increase indicates that the performance of large models in dealing with complex problems and performing logical reasoning is constantly improving. This is of great significance for solving complex problems in the real world, such as intelligent customer service, legal consultation, etc.
The improvement of mathematical ability and advanced ability provides a broader space for the application of large models in the field of science and technology. Starfire v35 achieved 9 in each of these two areas8% and 8The 0% improvement provides more reliable support for the model to better handle mathematical problems and generate**. This is of great significance for promoting the application of large models in the field of engineering.
Finally, the improvement of multimodal capability provides a better solution for large models to process multiple information sources such as images and voices. Starfire v35. Achieved 6The 6% increase provides strong support for the model to better understand and process multimodal information. This is an important impetus for the realization of a more comprehensive and complex human-computer interaction system.
Overall, Starfire v3The seven core competencies of 5 demonstrate the wide application potential of large models in different fields. The future development trend of large models will mainly focus on text generation, language understanding, knowledge question answering, logical reasoning, mathematical ability, advanced ability, and multimodal ability. However, the field of large models still needs to pay attention to issues such as transparency, fairness, and data privacy, so as to balance technological innovation and ethical responsibility, and promote AI technology to better serve society.
With the rapid development of science and technology, open large models are becoming one of the important engines to promote innovation in the field of artificial intelligence.
The large model of openness for all makes AI technology more popular and democratized. Ordinary users can use AI technology more conveniently and enjoy more intelligent services and experiences through stronger voice interaction, text understanding, and multimodal capabilities. At the same time, the use of open large models promotes innovation in various industries, especially in the fields of customer service, education, healthcare, and entertainment. More powerful model capabilities mean more efficient and personalized services, which further promotes the digital and intelligent development of the industry.
First of all, in terms of industry applications, the launch of the national open model will have a far-reaching impact on all walks of life. In the fields of customer service, automobiles, and robots, human-computer interaction will be more intelligent and natural. The upgrade of language understanding, text generation, and knowledge Q&A of the open large model will help the industry achieve more efficient and intelligent services and communication, and promote intelligent transformation.
Secondly, in the field of education, the application of open large models in the field of education will provide students with a more personalized and efficient learning experience. Through voice interaction, knowledge questions and answers, students can obtain knowledge more conveniently and improve learning efficiency. Educational institutions and platforms can use open models to provide customized teaching content to facilitate the popularization and dissemination of knowledge.
Then, in the research field, researchers will benefit from the improvement of open large models in text generation, logical reasoning, etc. This will help accelerate scientific discovery and innovation, making it easier for researchers to access and process large volumes of literature and information. Open models provide a more powerful tool for interdisciplinary research and promote deeper progress in various fields of science.
Finally, in terms of social communication, in terms of multilingual support, the upgrade of the open model will promote cross-cultural communication and understanding. The upgrade of the iFLYTEK translator will provide users with a more free and natural language communication experience, which is expected to reduce language and cultural differences and promote the process of a globally interconnected society.
Overall, the launch of the Open for All model marks the popularization and application of AI technology around the world. The wide application in different fields will promote industrial upgrading, promote educational innovation, accelerate scientific research, promote social exchanges, and bring more possibilities to human society. By continuously improving the performance of the open model, iFLYTEK is making an important contribution to building a smarter and more interconnected future society.
However, its development has not only brought many positive meanings, but also faced a series of challenges.
Data privacy and security: As the use of models becomes more widespread, more attention needs to be paid to the privacy and security of user data. It is an important challenge to ensure that users' personal information is adequately protected during the use of the open model.
Computing power and energy requirements: Training large-scale models requires a huge amount of computing power and energy investment, which may have a negative impact on the environment. Developers and researchers need to focus on the sustainability and environmental friendliness of model training while pursuing performance.
Transparency and explanatory: As models become more complex, their decision-making processes become more difficult to understand. For open large models, improving their transparency and explainability is key to ensuring user trust and control.
Legal and ethical issues: In the use of the large model of openness for all, legal and ethical issues may be involved, such as intellectual property rights, allocation of responsibilities, etc. Relevant regulations and ethical standards need to be further improved to ensure the legal and compliant use of the model.
The Open for All model has provided a huge boost to the advancement of AI technology, but in the process of solving these challenges, all parties need to work together to ensure the healthy development of this technology and bring more benefits to society.
In the field of artificial intelligence, iFLYTEK has always been one of the leading companies that has attracted much attention. The company has made remarkable achievements in speech recognition, natural language processing, and other fields. Among them, its large model technology has always been an important part of the trendset. However, to fully evaluate the overall strength of the iFLYTEK model, it is necessary to conduct a comprehensive analysis of its performance in multiple aspects.
In terms of scientific research strength, iFLYTEK has strong strength in artificial intelligence research, and continues to promote cutting-edge research in the field by presenting at top international conferences. In terms of R&D investment, iFLYTEK is also not stingy, according to the data released by it in the third quarter of 2023, iFLYTEK recorded 126 in the first three quartersThe revenue of 1.4 billion yuan, the scale of revenue compared with 126 in the same period last year6.1 billion yuan compared to 037% slightly**. The net profit attributable to shareholders of listed companies also declined sharply, and the net profit attributable to shareholders of listed companies of 99.36 million yuan decreased by 76 percent year-on-year36%。
iFLYTEK explained in the financial report that the main reason is that the company actively seized the new historical opportunities of general artificial intelligence and firmly invested in general artificial intelligence cognitive models.
In terms of technical strength, iFLYTEK's large model has a deep background in neural networks and machine learning, and uses a large amount of data for training to continuously improve the generalization ability and adaptability of the model. This makes it excellent for dealing with a wide range of language variants and accents. However, the basic ability is slightly weaker, in August last year, the Xinhua News Agency Research Institute released a large model experience report showing that Wenxin Yiyan is the leading level in China in terms of the basic ability of the large model, and the advantages of the Spark large model are manifested in work efficiency and commercial application.
In terms of application, although the Xinghuo model is a general model, iFLYTEK also anchored many application scenarios for it at the press conference, but it did not fall into the commercialization circle of the general model, but implanted it into the consumer products represented by AI learning machines for the first time.
As more and more players run into the market, large models may not be able to support high premiums, and the profits of intelligent education hardware are bound to return to a reasonable range, and may even roll out the Internet genre that does not sell hardware but only software. At that time, iFLYTEK, which has shallow Internet genes, may suffer a lot of impact on its education fundamentals.
On the other hand, there is no so-called "technical myth" in the large model track, and many scenarios and applications need the support of underlying computing power. Although iFLYTEK is not afraid of players such as Good Future and Job Gang in the short term, from a long-term perspective, if giants such as Alibaba, and Tencent go deep into the battle, it may be difficult for iFLYTEK to have the ability to confront them head-on.
In the future, with the continuous development of technology, it is believed that the iFLYTEK large model will be further improved in continuous iteration to provide better support for a wider range of application scenarios.