HashData is committed to lowering the barrier to entry for big data analytics and democratizing data

Mondo Technology Updated on 2024-02-01

Investment and financing projects·hashdata

This project was submitted by Hashdata and participated in the selection of the "Data Ape Annual Golden Ape Planning Activity - 2023 Big Data Industry Annual Most Investment Value List Award".

Founded in February 2016, Hashdata is a startup focusing on cloud data warehousing, which is a typical representative of the integrated development of the digital economy and the real economy. The core team is mainly composed of Pivotal, Teradata, IBM, Yahoo!, Oracle, Huawei, and other companies are composed of senior experts in cloud computing, distributed database, and big data. With profound technical accumulation and forward-looking product concepts, it has received multiple rounds of financing from well-known investment institutions including Matrix Partners, Guoke Jiahe, GSR Venture Capital, and Wuyuan Capital since its inception, with a cumulative financing amount of hundreds of millions of yuan.

With the rapid development of information technology, all walks of life are generating and accumulating the most advanced growth of data. How to analyze this growing scale and complexity of data and mine the value of the data to provide business decision support for companies has become a key factor in the success of enterprises. Based on the mature and open-source database ecosystem and the power of the cloud-native architecture, Hashdata has developed HashData Cloud Data Warehouse, an analytical database that can provide nearly unlimited scale, concurrency and performance.

HashData can help businesses consolidate internal data silos, easily share this regulated data, and perform a variety of data analysis workloads. Deliver a seamless data analytics experience across multiple public and private clouds. As the core engine of enterprise data analysis, it provides overall solutions for data warehouse, data lake, data engineering, data science, data application development, and data sharing.

Hashdata Product Design Concept:

1. Innovative cloud-native architecture.

HashData cloud data warehouse adopts the architecture design of complete separation of metadata, computing and storage, which can give full play to the advantages of cloud native architecture, can efficiently respond to high concurrency and complex queries, and the architecture can be dynamically scaled in various dimensions with business needs, providing high-performance data warehouse services while achieving the optimal allocation of resources.

2. Open source and open cloud ecosystem.

Derived from the mainstream open-source databases Postgres and Greenplum, the HashData Cloud Data Warehouse is 100% compatible with analytics interfaces, and supports data analysis functions such as stream computing, full-text search, machine learning, and scientific computing, and can seamlessly integrate with mainstream ETL and BI tools in the market. At the same time, it can seamlessly connect with mainstream public cloud and private cloud platforms to provide users with maximum convenience.

3. Perfect management tools.

HashData Cloud Data Warehouse is designed to provide users with fully managed data analysis services, and through a well-functioning management console, enterprises can easily deploy data warehouse clusters containing dozens or even hundreds of nodes, and quickly start data analysis tasks after loading data. Complex and error-prone O&M tasks such as cluster resource allocation, data backup, monitoring and auditing, error recovery, high availability, and upgrades are completed by the product itself, achieving "zero O&M".

4. Out-of-the-box AI analysis module.

In order to lower the threshold for the application of advanced analytics and AI technology, Kuke Data has created a next-generation in-database advanced analytics and data science toolbox HashML based on HashData.

HashML is one of the first batch of in-library intelligent analysis tools launched in China, providing one-stop multi-level data analysis and AI capabilities from data query processing and advanced analysis to machine learning and deep learning. For large language models, HashML provides support for the entire process from high-quality data mining and model fine-tuning to model deployment and inference. Based on hashdata's built-in distributed parallel vector data storage, indexing and retrieval functions, hashml provides the ability to build and retrieve vector knowledge bases, which greatly reduces the application development cost of large language models. HashML inherits the cloud-native advantages of HashData, which can achieve on-demand elastic scaling from model training to model deployment, and provides support for Python and SQL languages, lowering the threshold for use.

HashML Product Features:

Ease of use: Out of the box, we strive to standardize the API design of all modules, and keep in line with popular third-party libraries in the data science community to ensure ease of use to the greatest extent.

Excellent performance: The concurrency of parallel processing is determined according to the complexity of the task, and multi-machine and multi-card can be used to achieve efficient training and fine-tuning to ensure the timeliness of operations.

Rich algorithms: It supports machine learning algorithms, deep neural networks, and pre-trained large models, provides a vector knowledge base for large language model applications, and efficiently supports the storage and retrieval of massive semantic vector data.

5. Extensive and leading industry application cases.

Kuke Data currently has more than 50 enterprise customers. In the financial industry, HashData provides services to nearly 10 customers, including large state-owned banks, policy banks, joint-stock commercial banks, top 10 companies, and regulatory authorities, supporting the data of hundreds of millions of users and tens of millions of enterprises. In the fields of telecommunications, energy, transportation, and the Internet, Hashdata provides data computing and analysis services for dozens of leading customers, including carriers, PetroChina, large airlines and ports, multinational environmental protection groups, large distance education institutions, and Internet medical enterprises, helping partners in digital transformation.

Hashdata has always adhered to technology-oriented, continued to invest in innovation and R&D, and the technical team accounts for more than 70%.

At present, Hashdata has more than 30 independent intellectual property rights, and its products have passed ISO9001 and other international quality management system certifications. The company is a national high-tech enterprise, a high-tech enterprise in Beijing, and a "specialized, refined, special and new" enterprise in Beijing.

The Hashdata cloud data warehouse solution maximizes the advantages of cloud computing by separating metadata, computing, and storage, and sharing a unified data storage layer architecture among multiple clusters. Taking advantage of the elasticity and distributed characteristics of the cloud platform, it can achieve rapid deployment, on-demand scaling, and non-stop delivery, greatly reducing the threshold for enterprises to conduct big data analysis, and meeting customers' all-round needs for high security, high reliability, high scalability, and intelligence. At present, it has provided enterprise-level database services with full functions, stability, reliability, scalability and superior performance for leading customers in important industries related to the lifeline of the national economy, such as finance, telecommunications, energy, transportation and Internet.

In the past three years, hashdata's revenue has grown at an annual rate of 100-200%, and it is expected to achieve tens of millions of yuan in revenue in 2023 and more than 100% in 2024.

Lirong Jian, co-founder and CEO of HashData

Lirong Jian, graduated from Tsinghua University and Hong Kong University of Science and Technology, Apache Hawq Committer, Greenplum Database Contributor, and is currently the co-founder and CEO of Kuke Data. He has been engaged in distributed computing research and development in IBM China Research Institute, Yahoo Beijing R&D Center, and Pivotal China R&D Center, and has published a number of international conference journals** (including SigMod and Infocom) and obtained more than 10 international patents, involving wireless networks, cloud computing, Hadoop and distributed databases, etc., maintaining deep thinking and forward-looking practice in the database industry.

In December 2020, round A++, Wuyuan Capital - nearly 100 million yuan.

December 2019, Series A+, GSR Ventures - tens of millions of dollars.

In October 2017, Series A, Guoke Jiahe - tens of millions of yuan.

February 2016, Angel Round, Matrix Partners - tens of millions of yuan.

The localization of core basic software is one of the mainstream of China's science and technology in the future. With the acceleration of cloud transformation for government and enterprise users, cloud-native technology has gradually become the mainstream development trend. As a leader in the field of localized cloud-native data warehouse software, hashdata is undergoing high-speed technical product changes, continuously enhancing its development advantages, replacing the traditional data warehouse products provided by a large number of foreign companies in the market, and continuing to create great value for the digital transformation of traditional Chinese enterprises. ”

Guoke Jiahe. Managing Partner: Hongwu Chen.

In the cloud-native market, HashData is very similar to Snowflake, a Silicon Valley data warehouse manufacturer that achieved the largest software company IPO in the history of the U.S. stock market. As the cloud computing market matures in China, we firmly believe that Hashdata will lead the industry change and become a world-class cloud-native data warehouse manufacturer independently developed in China."

Wuyuan Capital. Partner: Kai Liu.

The HashData team has in-depth knowledge and experience in the field of cloud-native data warehousing, and is able to quickly and accurately understand our needs and provide customized solutions. They not only continue to innovate in technology, but also continue to improve in service concept, in the early communication and implementation of the project, the team has always maintained a positive working attitude and professional working methods, can quickly respond to various challenges, and finally ensure the smooth launch of our project.

A joint-stock bank.

Head of Product Department: Mr. Zhao.

HashData's cloud-native architecture innovation separates metadata, computing, and storage, significantly improving the concurrency, availability, and scalability of the system, and ensuring the efficient and stable operation of the system. At the same time, their professional service team provided a full range of support to ensure the smooth implementation and smooth launch of the project. Our technical team spoke highly of hashdata's products and services, and unanimously agreed that it is a strong guarantee to improve our business capabilities. We look forward to continuing to work with Hashdata in the future to drive the development and progress of the business together."

A large central enterprise.

Director of the Information Department, Mr. Wang.

Related Pages