HashData Unified AI Big Data Platform

AI Database

Lakehouse architecture with multi-engine synergy — China's first large-scale cloud DW commercialization, serving top finance and telecom clients

100PB+Data Scale
50+Enterprise Clients
100M+Daily Queries
25KCluster Nodes

Core Capabilities

Lakehouse, multi-engine synergy, AI-native

Lakehouse Architecture

Storage-compute separation, unified metadata management — no data movement needed, multi-engine direct data sharing.

Separation · Unified Metadata

Multi-Engine Synergy

MPP/OLAP/AI/ML/Flink/Spark/ES heterogeneous compute engine clusters — optimal performance for every workload.

Heterogeneous · Best Performance

HashML AI Engine

Built-in next-gen data science and AI dev tools — LLM training/inference, vector search, knowledge base Q&A (ChatKB/ChatData).

LLM · Vector Search · AI-Native

Architecture Advantages

Cloud-native, polymorphic storage, AI-native, elastic scaling

Polymorphic Storage

Heap/AO/PAX/disaggregated storage/directory tables — Hudi/Iceberg open data formats

Full-Spectrum Analytics

PG14.4 kernel with PostgreSQL ecosystem plugins, built-in HashML — one dataset, multi-model compute

AI Corpus Processing

Directory table for unstructured knowledge, pgVector + ES retrieval — complete AI training/inference pipeline

Elastic Scaling

Cloud-native with tenant resource isolation, dynamic elastic scaling, intelligent operations

Key Clients

Serving top finance, telecom, and government clients

Finance

CBIRC, China Construction Bank, Bank of China, EXIM Bank, Hengfeng Bank — largest single-client dataset: 38PB.

Telecom

China Mobile, China Unicom, China Telecom — big data analytics platforms for top carriers.

Government & SOEs

Chinese Academy of Sciences, PetroChina, CITIC Group, China Merchants Group — top government and SOE clients.

Get Your HashData Big Data Platform Solution

Our team will tailor a big data solution for you