© 2026 Keke LE Design

2025

Alibaba Group

VersionAI Tmall - AI Friendly Design System for GenAI

GenAI Research

Tmall Design is a design system developed specifically for the Tmall shopping app, one of the largest B2C e-commerce platforms in China.

Unlike previous iterations, the version presented on this site is intended not only for human designers and developers, but more importantly for AI models. Its content structure, textual format, and communication logic have been translated to enable AI models to read and apply the specification. With AI assistance, users can reference this knowledge base to generate user interfaces that comply with Tmall's design specification (1), as well as conduct design evaluation and Q&A (2).

By enabling a more efficient and advanced mode of human-AI collaborative design, the system supports scenarios such as recomposing existing designs, making incremental adjustments, and creating new interfaces step by step.

tmall-design.com

Human-AI Co-design Product Interfaces as An Emerging Mode of Work

In the daily workflow of design teams, output targets generally fall into three categories: modules, single pages, and multi-page flows. Correspondingly, design approaches can be grouped into three modes: assembling existing components, modifying existing components, and creating new components. Among these, modifications to existing components can be further broken down into information adjustments, layout adjustments, and style adjustments.

When mapped to the five major demand scenarios—core user journeys, content discovery channels, interactive products, marketing events, and B-end experience—a year-round analysis reveals a consistent pattern: within core journeys, discovery channels, and B-end experience, design tasks involving assembling or modifying existing components at the module or single-page level account for nearly 48% of all requests.

This substantial volume of repetitive and structurally similar design work urgently calls for automated, AI-driven design-generation methods to significantly improve both efficiency and output quality. For enterprise design teams, the primary goal is to generate product interfaces that faithfully adhere to internal design standards, and this requires the same essential task: clarifying and constructing an AI-friendly design-knowledge system that can be connected to design tools or large models.

The VersionAI Tmall solution presentation for GenUI ↑

Introducing VersionAI Tmall

By constructing the Design System Specification Knowledge Base and the Interface Generation Multi-Agents Workflow, an integrated, AI-friendly enterprise design system is formed. The system can be flexibly applied in IDE-based environments and agent-mode design tools via MCP or context injection. This comprehensive set of theory and practice offers broad applicability and can be adopted by enterprise teams across industries. VersionAI Tmall has already been implemented in real-world projects, resulting in significant improvements in interface design efficiency.

Generative UI Agent Structure ↑

Building Design System Knowledge for AI Understanding and Utilization

Interface generation is essentially a domain-specific intelligent system composed of multiple task-specific agents, operating within an enterprise-aligned design workflow, and driven by contextual reasoning and retrieval over Design System Specification Knowledge. These agents are organized into four primary task agents: intent recognition, knowledge retrieval, design synthesis and code generation.

Design System Specification Knowledge acts as externally retrieved factual knowledge: through a Design-to-Code (D2C) transformation, design assets are converted into rule-based semantic facts, code-level symbolic facts, and the structured mappings between them, forming a structured Design Knowledge Base that the model can read, retrieve, and extract from.

Structural translation of design system specification knowledge ↑

Design System Specification Knowledge

Design System Specification Knowledge is composed of two categories of knowledge objects and three file types. The two categories of knowledge objects are Styles and Cases, where styles are further subdivided into four types: Elements, Layouts, Components, and Assets. Each category of knowledge object is associated with three file types: Semantic rules, Mapping schemas, and Code implementations.

Semantic rules consist of textual descriptions covering an object's definition, styles, variables, usage, and constraints. They represent the foundational intellectual property that gives rise to experiential differentiation and are authored in Markdown format. Code implementations translate objects into executable and deployable symbolic facts, using the appropriate programming language as the representation format.

Mapping schemas act as the central connective layer between semantic rules and code implementations. Rather than duplicating either, they provide object ID anchoring, constraint definitions, and precise code references. Within the workflow, they guide knowledge retrieval, solution construction, and code generation, enabling large language models to avoid traversing the entire knowledge corpus and reducing computational overhead.

Element interpertation example: Color ↑

  1. Element knowledge is divided into four irreducible units: color, typography, corner radius, and motion, which serve as the foundational dependencies for all design styles.
  2. Its semantic rules consist of element definitions, style composition, attribute variables (design tokens), mandatory rules, and functional usage.
  3. The mapping schema is composed of ID symbols, semantics (including name, definition, and description), rules_refs, implementations_refs, and constraints (such as usage scenarios, mandatory prohibitions, and other restrictions). This schema is automatically generated by AI in accordance with Schema-based prompting rules.
  4. Code implementations take the form of global styles (SCSS); these are automatically reconstructed from design artifacts and generated as code by IDE tooling through calls to the design software's MCP services.

Component interpertation example: Price ↑

  1. Component knowledge is composed of units such as avatars, badges, asset slots, buttons, content streams, countdowns, coupons, filters, beacons, onboarding guides, carousels, inputs, messages, floating windows, dialogs, prices, product cards, progress bars, red packets, selectors, overlays, floating bars, switches, tab bars, tabs, text streams, tooltips, bubbles, and top bars. These components serve as critical reusable units for interface generation.
  2. The semantic rules for components consist of component definitions, style composition, attribute variables, component states (interaction and visual), mandatory rules, and functional usage.
  3. Building upon the existing mapping schema structure, component mapping schemas additionally introduce a properties schema (props schema), which corresponds to the attributes required by each component in the page-level component structure schema generated during the solution design stage.
  4. Code implementations are realized as component structures (TSX) and component styles (SCSS) within a React-based technology stack.

Asset interpertation example: Icon ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Page interpertation example: Shopping Feed Page ↑

  1. Page semantic rules consist of page definition, target users, naming conventions, structural hierarchy, page states (interaction and visual), mandatory rules, and functional usage. Page knowledge does not require code implementations; instead, it is composed by the LLM through combining component, asset, and element code.
  2. The mapping schema consists of ID symbols, semantics (name, definition, description), rules, structure, and constraints (usage scenarios, mandatory prohibitions, and others).

Structural translation of design system specification knowledge ↑

Design System Specification Knowledge

Design System Specification Knowledge is composed of two categories of knowledge objects and three file types. The two categories of knowledge objects are Styles and Cases, where styles are further subdivided into four types: Elements, Layouts, Components, and Assets. Each category of knowledge object is associated with three file types: Semantic rules, Mapping schemas, and Code implementations.

Semantic rules consist of textual descriptions covering an object's definition, styles, variables, usage, and constraints. They represent the foundational intellectual property that gives rise to experiential differentiation and are authored in Markdown format. Code implementations translate objects into executable and deployable symbolic facts, using the appropriate programming language as the representation format.

Mapping schemas act as the central connective layer between semantic rules and code implementations. Rather than duplicating either, they provide object ID anchoring, constraint definitions, and precise code references. Within the workflow, they guide knowledge retrieval, solution construction, and code generation, enabling large language models to avoid traversing the entire knowledge corpus and reducing computational overhead.

Agent interpertation example: Intent Recognition ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Agent interpertation example: Knowledge Retrieval ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Screenshot of Single-Product Storytelling Mode ↑

Design System Specification Knowledge

Design System Specification Knowledge is composed of two categories of knowledge objects and three file types. The two categories of knowledge objects are Styles and Cases, where styles are further subdivided into four types: Elements, Layouts, Components, and Assets. Each category of knowledge object is associated with three file types: Semantic rules, Mapping schemas, and Code implementations.

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Agent interpertation example: Knowledge Retrieval ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Single-Product Space Mode ↑

Storytelling Mode

This mode is suitable for displaying and interacting with individual items. By highlighting each product feature sequentially, users can understand the product from start to finish in a linear experience. For each feature, appropriate technical setups and interaction methods are selected to present the highlights in the most intuitive and immersive way.

Single-Product Space Mode ↑

Storytelling Mode

This mode is suitable for displaying and interacting with individual items. By highlighting each product feature sequentially, users can understand the product from start to finish in a linear experience. For each feature, appropriate technical setups and interaction methods are selected to present the highlights in the most intuitive and immersive way.

Single-Product Space Mode ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Agent interpertation example: Knowledge Retrieval ↑

Storytelling Mode

This mode is suitable for displaying and interacting with individual items. By highlighting each product feature sequentially, users can understand the product from start to finish in a linear experience. For each feature, appropriate technical setups and interaction methods are selected to present the highlights in the most intuitive and immersive way.

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Agent interpertation example: Knowledge Retrieval ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Agent interpertation example: Knowledge Retrieval ↑

  1. Asset knowledge consists of units such as images, icons, logos, and tags, serving as the primary material sources for page content population.
  2. The semantic rules for assets include asset definitions, attribute variables, mandatory rules, and functional usage.
  3. The mapping schema for assets follows the same template as that used for elements. Depending on the specific asset type, asset knowledge may or may not require code implementations. For example, icons do require code implementations, whereas images typically do not.
  4. In such cases, the implementations take the form of source asset files (e.g., SVG or PNG) with naming conventions consistent with those defined in the semantic rules.

Design Lead

Keke LE

Design Team

MPID Multi-platform Innovation Design
Taobao Design Sub-unit