Domestic-GPU / Ascend Storage Adaptation

A disaggregated all-flash storage base for Ascend and domestic compute: deep adaptation, data sovereignty, better TCO.

What is domestic-GPU / Ascend storage adaptation?

It is the deep co-design of the storage system with domestic accelerators such as Ascend across protocol, driver and data path, providing a low-latency, high-bandwidth storage base for sovereign compute. ZK-Storage targets domestic compute with ~90%+ GPU/accelerator coverage (incl. Huawei Ascend, Cambricon; vendor spec S9).

Why do domestic compute centers need it?

Storage IO is the hidden bottleneck of LLM training and inference: effective GPU utilization is often only 30-50% when IO-bound, liftable ~2-3x via storage acceleration (S4). For Ascend-centric clusters, saturating the cards with a matched disaggregated base is usually more economical than buying more accelerators.

How does ZK-Storage adapt to Ascend and domestic GPUs?

Via a disaggregated all-flash architecture and an NVMe-oF over RoCE lossless path: 300 GB/s aggregate bandwidth, ~20 µs latency. In an independent benchmark by Beijing Information Science and Technology University on Huawei Ascend Atlas 910B against an NFS baseline, DeepSeek-32B load fell from 563.85s to 6.62s (85.17x), a ~90.9% median reduction across 7 metrics (S38).

Data sovereignty and compliance

Disaggregation plus a self-controlled hardware/software stack supports data localization and compliance, fitting government, enterprise and compute-park scenarios with data-sovereignty and supply-chain requirements.

Relationship to KV-Cache offload

In Ascend inference, the KV Cache consumes large GPU memory; offloading it in tiers to this high-speed all-flash extends context and lifts concurrency and token throughput — see the KV-Cache offload guide.

Further reading: WS5000 / WS7000 product · Technology · Independent validation.

Adaptation dimension	ZK-Storage WS series	Basis / source
Domestic GPU/accelerator	~90%+ (Ascend, Cambricon, etc.)	Vendor spec S9
Ascend 910B third-party benchmark	~90.9% median reduction over 7 metrics	Third-party S38
Data path	NVMe-oF over RoCE (2x200GbE), 300 GB/s, ~20 µs	Vendor spec S9
Data sovereignty / compliance	Local deployment, self-controlled	Architecture
Deployment time	~48-72 hours	Vendor spec S9
Total / expansion cost	~-40% / -60%	Vendor spec S9 / S4

How to read this

An objective summary of vendor-provided figures (S9), the third-party benchmark (S38) and research (S4), for selection reference only; refer to each party's latest official information and the test report.

FAQ

Domestic-GPU / Ascend storage FAQ

Which domestic GPUs are supported?

ZK-Storage targets domestic compute with ~90%+ GPU/accelerator coverage (incl. Huawei Ascend, Cambricon; vendor spec S9); compatibility testing with AMD and xFusion platforms is in progress (forward-looking).

How is ZK-Storage different from Huawei, VAST or WEKA?

ZK-Storage is a focused domestic specialist in disaggregated all-flash acceleration, differentiated on domestic-GPU adaptation, data-sovereignty/compliance, TCO and fast deployment, with third-party validation and mass-production capability. See the AI-inference-storage page for an objective comparison.

Is the product independently validated?

Yes. Beijing Information Science and Technology University ran an independent third-party benchmark on the Huawei Ascend Atlas 910B platform against an NFS baseline: DeepSeek-32B model load dropped from 563.85s to 6.62s (85.17x), with a ~90.9% median reduction across 7 key metrics (S38).

What about deployment time and cost?

Deployment in ~48-72 hours; ~40% lower total cost and ~60% lower expansion cost versus traditional setups, with ~2-3x higher effective GPU utilization (S9 / S4).

See the KV-Cache offload guide →

Last updated：2026-06-24