Language and region support for LUIS

LUIS has a variety of features within the service. Not all features are available in every language. Make sure the features you are interested in are supported in the language culture you are targeting. A LUIS app is culture-specific, and its culture cannot be changed once it is set.

Multi-language LUIS apps

If you need a multi-language LUIS client application such as a chatbot, you have a few options. If LUIS supports all the languages, you develop a LUIS app for each language. Each LUIS app has a unique app ID and its own endpoint log. If you need to provide language understanding for a language LUIS does not support, you can use the Microsoft Translator API to translate the utterance into a supported language, submit the translated utterance to the LUIS endpoint, and receive the resulting scores.
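The routing described above can be sketched roughly as follows. This is a minimal illustration, not a LUIS or Translator client: `route_utterance`, the app IDs, and the `translate` and `predict` callables are all hypothetical names introduced here. In a real client, those callables would wrap calls to the Translator API and the LUIS endpoint.

```python
from typing import Callable, Dict


def route_utterance(
    utterance: str,
    culture: str,
    apps_by_culture: Dict[str, str],
    fallback_culture: str,
    translate: Callable[[str, str, str], str],
    predict: Callable[[str, str], dict],
) -> dict:
    """Send an utterance to the LUIS app for its culture, translating first if needed.

    All parameters are hypothetical: `apps_by_culture` maps a lowercase culture
    code to the ID of the LUIS app built for that culture, `translate` stands in
    for a Translator API call, and `predict` stands in for a LUIS endpoint call.
    """
    culture = culture.lower()
    if culture in apps_by_culture:
        # A LUIS app exists for this culture: query it directly.
        return predict(apps_by_culture[culture], utterance)
    # No app for this culture: translate into a supported culture,
    # then query the app built for that culture.
    translated = translate(utterance, culture, fallback_culture)
    return predict(apps_by_culture[fallback_culture], translated)


# Stand-in callables so the sketch runs without any network access.
apps = {"en-us": "APP_ID_EN", "zh-cn": "APP_ID_ZH"}
fake_translate = lambda text, src, dst: f"[{src}->{dst}] {text}"
fake_predict = lambda app_id, text: {"app": app_id, "query": text}

direct = route_utterance("你好", "zh-CN", apps, "en-us", fake_translate, fake_predict)
via_translation = route_utterance("habari", "sw-ke", apps, "en-us", fake_translate, fake_predict)
```

Keeping one app ID per culture mirrors the one-app-per-language approach: each app keeps its own endpoint log, and only utterances in unsupported languages pass through translation.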

Languages supported

LUIS understands utterances in the following languages:

Language | Locale | Prebuilt domain | Prebuilt entity | Phrase list recommendations | Text analytics (sentiment and keywords)
[Chinese](#chinese-support-notes) zh-CN -

Language support varies for prebuilt entities and prebuilt domains.

Chinese support notes

  • In the zh-cn culture, LUIS expects the simplified Chinese character set instead of the traditional character set.
  • The names of intents, entities, features, and regular expressions may be in Chinese or Roman characters.
  • See the prebuilt domains reference for information on which prebuilt domains are supported in the zh-cn culture.

Rare or foreign words in an application

In the en-us culture, LUIS learns to distinguish most English words, including slang. In the zh-cn culture, LUIS learns to distinguish most Chinese characters. If you use a rare word in en-us or a rare character in zh-cn, and LUIS seems unable to distinguish that word or character, you can add it to a phrase-list feature. For example, words outside the culture of the application (that is, foreign words) should be added to a phrase-list feature. Mark this phrase list as non-interchangeable to indicate that the set of rare words forms a class that LUIS should learn to recognize, but that the words are not synonyms and cannot be substituted for one another.
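As a rough illustration of the paragraph above, the following sketch builds a request body for a non-interchangeable phrase list of rare words. The helper name `build_rare_word_phrase_list` is hypothetical. The field names `name`, `phrases`, and `isExchangeable` are assumptions about the authoring API's phrase list payload, so check them against the authoring API reference before use.

```python
import json
from typing import Iterable


def build_rare_word_phrase_list(name: str, words: Iterable[str]) -> dict:
    """Build a phrase list body marked as non-interchangeable.

    Field names are assumed, not confirmed by this article.
    """
    # Trim whitespace, skip empty entries, and drop duplicates while keeping order.
    unique = list(dict.fromkeys(w.strip() for w in words if w.strip()))
    return {
        "name": name,
        "phrases": ",".join(unique),
        # False: the words form a class to recognize but are not synonyms.
        "isExchangeable": False,
    }


payload = build_rare_word_phrase_list("RareWords", ["schadenfreude", " 饕餮 ", "schadenfreude", ""])
body = json.dumps(payload, ensure_ascii=False)
```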

Hybrid languages

Hybrid languages combine words from two cultures, such as English and Chinese. These languages are not supported in LUIS because an app is based on a single culture.

Tokenization

To perform machine learning, LUIS breaks an utterance into tokens based on culture.

Language | Every space or special character | Character level | Compound words | Tokenized entity returned
Chinese
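To make two of the column names above concrete, the sketch below contrasts splitting on spaces and special characters with splitting at the character level. It is an illustration only: LUIS's actual tokenizer is not described here, and both function names are hypothetical.

```python
import re
from typing import List


def tokenize_on_spaces_and_specials(utterance: str) -> List[str]:
    """Split on whitespace and emit each special character as its own token."""
    # \w+ matches a run of word characters; [^\w\s] matches one special character.
    return re.findall(r"\w+|[^\w\s]", utterance)


def tokenize_per_character(utterance: str) -> List[str]:
    """Treat every non-whitespace character as a separate token."""
    return [ch for ch in utterance if not ch.isspace()]


word_tokens = tokenize_on_spaces_and_specials("Book a flight, please!")
char_tokens = tokenize_per_character("预订 机票")
```

With the first strategy, a Chinese utterance written without spaces would come back as a single token, which suggests why a character-level split may suit a language that does not separate words with spaces.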