标准详细信息 去购物车结算

【国外标准】 IEEE Draft Standard - Adoption of Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) Technical Specification Multimodal Conversation (MMC) Version 2

本网站 发布时间: 2025-04-28
  • IEEE P3300
  • 定价: 103元 / 折扣价: 88
  • 在线阅读
开通会员免费在线看70000余条国内标准,赠送文本下载次数,单本最低仅合13.3元!还可享标准出版进度查询、定制跟踪推送、标准查新等超多特权!   查看详情>>
标准简介标准简介

适用范围:

Multimodal Conversation (MPAI-MMC) specifies: 1. Data Formats for analysis of text, speech, and other non-verbal components as used in human-machine and machine-machine conversation applications. 2. Use Cases implemented in the AI Framework using Data Formats from MPAI-MMC and other MPAI standards and providing recognized applications in the Multimodal Conversation domain. This Technical Specification includes the following Use Cases: 1. Conversation with Personal Status (CPS), enabling… read more conversation and question answering with a machine able to extract the inner state of the entity it is conversing with and showing itself as a speaking digital human able to express a Personal Status. By adding or removing minor components to this general Use Case, five Use Cases are spawned: 2. Conversation About a Scene (CAS) where a human converses with a machine pointing at the objects scattered in a room and displaying Personal Status in their speech, face, and gestures while the machine responds displaying its Personal Status in speech, face, and gesture. 3.Virtual Secretary for Video conference (VSV) where an avatar not representing a human in a virtual avatar-based video conference extracts Personal Status from Text, Speech, Face, and Gestures, displays a summary of what other avatars say, and receives and act on comments. 4.Human-Connected Autonomous Vehicle Interaction” (HCI) where humans converse with a machine displaying Personal Status after having been properly identified by the machine with their speech and face in outdoor and indoor conditions while the machine responds by displaying its Personal Status in speech, face, and gesture. 5.Conversation with Emotion (CWE), supporting audio-visual conversation with a machine impersonated by a synthetic voice and an animated face. 6.Multimodal Question Answering (MQA), supporting request for information about a displayed object. 7.Three Uses Cases supporting text and speech translation applications. In each Use Case, users can specify whether speech or text is used as input and, if it is speech, whether their speech features are preserved in the interpreted speech: 7.1.Unidirectional Speech Translation (UST). 7.2.Bidirectional Speech Translation (BST). 7.3.One-to-Many Speech Translation (MST). 8.The “Personal Status Extraction Composite AIMs that estimates the Personal Status Conveyed by Text, Speech, Face, and Gesture – of a real or digital human. read less

基本信息

  • 标准号:

    IEEE P3300

  • 标准名称:

    IEEE Draft Standard - Adoption of Moving Picture, Audio and Data Coding by Artificial Intelligence (MPAI) Technical Specification Multimodal Conversation (MMC) Version 2

  • 英文名称:

  • 标准状态:

  • 发布日期:

  • 实施日期:

  • 出版语种:

标准分类号

  • 标准ICS号:

  • 中标分类号:

关联标准

  • 替代以下标准:

  • 被以下标准替代:

  • 引用标准:

  • 采用标准:

出版信息

  • 页数:

  • 字数:

  • 开本:

其他信息

  • 起草人:

  • 起草单位:

  • 归口单位:

  • 提出部门:

  • 发布部门: