# Traditional SEO Won't Get You Into Chinese AI Answers — The Off-Site Signals That Will

*By Eastbound Research · 9 May 2026*

Most foreign-brand audits for Chinese AI visibility check the wrong surface. They polish the brand's own website — fast load times, clean structured data, professional layout — and call the visibility risk addressed. Then a Mandarin tester asks DeepSeek for a recommendation in the brand's category, and the answer set names competitors.

The on-site checklist that wins on Google barely predicts brand recall in Mainland AI. We audited the brands DeepSeek, Qwen, and Doubao actually surface — and found the highest-recall ones often have broken sites, near-empty structured data, no Wikipedia entry. Their visibility lives off-site, in community discourse, KOL coverage, and e-commerce depth. Off-site SEO for AI search is the missing half of most GEO playbooks.

**Off-site substrate** is the body of third-party evidence — Zhihu threads, KOL posts, Tmall reviews, Wikipedia entries — that an AI engine reaches for when answering a brand-recommendation prompt. It sits outside the brand's own website. For Mainland-Chinese consumer queries, it's where most of the brand-recall signal lives.

## Schema density doesn't predict AI mention rate

If you plotted every brand we audited with on-site schema density on the x-axis and AI mention rate on the y-axis, you wouldn't get a positive slope. You'd get a near-flat scatter: the highest-density site in our audit is among the lowest-mentioned, and the lowest-density sites are among the most-mentioned.

The winning brands averaged roughly half the schema markup of the controls we picked as the most aggressively schema-tagged sites in the same niches. Heaviest-schema control: eleven distinct schema types, seven `sameAs` links, mention rate 20%. Most-mentioned brands in the panel: zero or one schema types, zero `sameAs`, mention rates above 80%.

Stated tightly: **on-site schema density is necessary baseline work, but it is not sufficient on its own to drive AI brand recall.** Several of the most-mentioned brands have a near-empty on-site signal, meaning something else off-site is doing the lifting. The control sample is small enough that we can't claim schema is useless. We can claim it isn't enough by itself. [See methodology.](#methodology)

## Why brands with thin websites still surface in Chinese AI

The cleanest way to make the contradiction concrete is to look at three brands whose on-site footprint reads as a near-empty checklist. All three are mentioned in 60–92% of responses across the AIs we tested.

**Kailh (凯华).** A Guangdong electronics manufacturer. The homepage runs on a small-business website-builder template; the HTTPS configuration is broken (expired certificate for the wrong domain); plain HTTP works fine. On-site signal: no structured data, no `sameAs` links, no Open Graph, no canonical, no hreflang. AI mention rate for the keyboard niche: **92%**. The recall isn't anchored on the website. It's anchored on community discourse — long-form Zhihu switch-comparison threads, Bilibili reviewer videos, Tmall product-detail pages with hundreds of reviews per SKU.

**Gateron (佳达隆).** The second 92%-mention keyboard brand, and the more interesting on-site case. Its homepage returns clean HTML with full Open Graph, a clean canonical, 6,500+ words of product copy. By every "is this a real brand site?" check, Gateron passes. On-site GEO signal: one schema type (Organization), zero `sameAs`. Substrate: identical to Kailh — Tmall + community forum.

**Naturehike (挪客).** Mainland indie outdoor brand. Almost no structured data; no English Wikipedia, no Chinese Wikipedia, no Wikidata entry. AI mention rate on DeepSeek for the indie-camping niche: **64%**, ahead of multiple international brands with full encyclopedic and schema presence. Substrate: Tmall + Xiaohongshu + Zhihu + Bilibili.

None of these brands are running an AI-visibility playbook. None ran a schema-markup project. Their AI mention rates are a side-effect of the substrate they sit in.

## The unit of recall is the flagship-product phrase, not the brand name

The most counter-intuitive finding in the panel: brands at high recall don't surface as a bare brand name. They surface alongside a small set of named, repeatable product phrases.

Across 125 prompt × rep cells in the keyboard niche: Kailh appears with "Kailh Box红軸" (Kailh Box Red Switch) 14 times. Gateron with "性價比之王 軸體" ("Cost-Performance King" Switch) 11 times. TTC with "金粉軸" (Gold Powder Switch) 10 times. In sunscreen, Winona with "清透防曬乳" (Clear Sunscreen) 6 times.

The interpretation: flagship-product names act as **repeatable evidence anchors** in community discourse. Gateron is not surfaced because its site is heavily marked up; it's surfaced because community discourse repeatedly co-occurs *Gateron* with *"Cost-Performance King" Switch*. The named flagship is the unit of recall, not the bare brand name. Optimization strategies built around bare brand-name SEO miss this anchor entirely.

## The off-site substrate that matters is niche-conditional

If on-site density isn't the lift, what is? The audit data points to two off-site substrates that are present in the high-recall brands — but which one matters depends entirely on the niche.

**International-corpus niches** (single-malt whisky, niche perfume): encyclopedic anchor matters. Macallan and Ardbeg carry full English + Chinese Wikipedia plus Wikidata. Glenfiddich has English Wikipedia and Wikidata. Encyclopedic presence dominates here because the brand discourse genuinely lives in English-language reference works. *Do you need a Wikipedia page for AI to recommend your brand?* In international-corpus niches, yes — it's one of the strongest signals we measured. In Mainland-saturated niches, no — the brands winning at high recall here have no encyclopedic presence at all.

**Mainland-saturated niches** (mech keyboards, craft beer, indie camping, handmade leather): the substrate that lifts is community-discourse density — Zhihu long-form, Xiaohongshu KOL imprint, Bilibili review depth, Tmall + JD product-catalogue depth. The brands surfacing at high mention rates here routinely have *no encyclopedic presence at all*. Strategies built around Wikipedia-first miss the substrate entirely.

The structural pattern replicated across both panels of niches we tested. International-corpus niches returned 0–1 Mainland brands across all three AIs. Mainland-saturated niches returned 5–7. The Doubao Mainland-content tilt held: about +10pp on top of the category baseline.

Practical reading: **niche type captured more of the variance in Mainland-brand share than AI choice did.** A brand in an international-corpus niche didn't move into a Mainland-substrate result by switching AIs. A brand in a Mainland-saturated niche was visible across all three AIs as long as its community substrate was there.

## What this means for your China AI visibility strategy

Three takeaways follow.

**Schema markup and on-site SEO are necessary baseline work for Chinese AI visibility. Keep doing them.** The finding above is that schema density isn't *sufficient* — not that it's useless. A clean on-site footprint still does work for crawler legibility, retrieval, and absorption-readiness on Bing/Copilot. The question is what to ship alongside it.

**For Mainland-saturated niches, the off-site investment is in community substrate.** Sustained KOL programmes on Xiaohongshu and Bilibili, long-form Zhihu presence written by named experts, depth in Tmall + JD product-detail review surfaces. Wikipedia is a bonus, not a precondition. The brands winning here have no encyclopedic presence at all.

**For international-corpus niches, the priority order is the opposite.** Verified Wikipedia / Wikidata entity pages first; KOL programmes second; community-forum work has limited additional lift here, because the AI isn't reading that substrate for this niche.

A brand audit that scores a checklist green and ignores the substrate misses the half of the work the engine is actually reading.

## Where Eastbound comes in

We see brand teams discover this gap when their internal audits read green and a Mandarin tester returns competitor names instead of theirs. The fix isn't a single content sprint — it is a re-mapping of what the engine reads, what it surfaces, and where the brand sits in the answer set. That mapping is the kind of work Eastbound does. If your team is sitting on the green-audit / red-engine gap, [run the free China AI visibility audit](https://eastbound.ai/ai-visibility-audit/) on your domain or [book an intro call](https://eastbound.ai/book-consultation/).

## Methodology

- **Sample.** 750 brand-recommendation calls (25 prompts × 5 reps × 3 engines × 2 niche panels), across 10 Mainland-Chinese consumer niches: mechanical keyboard switches, mineral sunscreen, niche perfume, ceramic dinnerware, natural wine, craft beer, indie camping, vegan skincare, single-malt whisky, handmade leather.
- **Engines.** DeepSeek (`deepseek-chat`); Qwen (`qwen-plus`, DashScope international); Doubao (`seed-2-0-lite-260228`, BytePlus ModelArk international, Lite tier). All API-mode, default decoding. Chat-with-Search browsing surface is a separate study.
- **On-site audit.** 20 winning brands + 5 controls attempted in Panel 1; 13 winners + 2 controls fetched cleanly. Counter-examples are strong (Kailh, Gateron, Naturehike all hit 80%+ mention rates with near-empty schema), but the control sample is too small to claim schema is useless. We can claim it isn't sufficient.
- **What we measured.** Brand mention rate per (prompt × rep) cell; co-occurring product-phrase frequency; on-site schema / `sameAs` / fetchability; off-site Wikipedia / Wikidata presence. This is descriptive measurement of LLM behavior; it is not a causal claim about what publishing on a substrate would do to mention rate.
- **What we did not measure.** Sales, conversion, attributable revenue. ChatGPT / Claude / Gemini / Perplexity / ERNIE / Yuanbao — not in this panel. Chat-with-Search-ON browsing surfaces — separate study.
- **Reliability.** Structural pattern (international-corpus → encyclopedic; Mainland-saturated → community substrate) replicated across two non-overlapping niche panels. Doubao long-tail source-ranking is less stable than top-5 (top-5 κ = 1.00; top-15 κ = 0.46) — treat long-tail Doubao findings with that caveat.
- **Repo.** Per-record JSONs at `~/Documents/Claude/GEO/research/2026-04-29-deepseek-cn-niches/data/site_audits/` and adjacent panels. Replication script and prompt panels available on request.

---

[← back to Research](/blog/) · [China AI visibility for global brands](https://eastbound.ai/china-ai-visibility/) · [Run the free audit](https://eastbound.ai/ai-visibility-audit/)
