購物比價找書網找車網
FindBook  
 有 1 項符合

SPEECH PROCESSING FOR IP NETWORKS: MEDIA RESOURCE CONTROL PROTOCOL (MRCP)

的圖書
SPEECH PROCESSING FOR IP NETWORKS: MEDIA RESOURCE CONTROL PROTOCOL (MRCP) SPEECH PROCESSING FOR IP NETWORKS: MEDIA RESOURCE CONTROL PROTOCOL (MRCP)

作者:BURKE 
出版社:JOHN WILEY & SONS,LTD
出版日期:2007-01-01
圖書選購
型式價格供應商所屬目錄
 
$ 1380
三民網路書店 三民網路書店
電腦
圖書介紹 - 資料來源:三民網路書店   評分:
圖書名稱:SPEECH PROCESSING FOR IP NETWORKS: MEDIA RESOURCE CONTROL PROTOCOL (MRCP)
  • 圖書簡介

    Media Resource Control Protocol (MRCP) is a new IETF protocol, providing a key enabling technology that eases the integration of speech technologies into network equipment and accelerates their adoption resulting in exciting and compelling interactive services to be delivered over the telephone. MRCP leverages IP telephony and Web technologies such as SIP, HTTP, and XML (Extensible Markup Language) to deliver an open standard, vendor-independent, and versatile interface to speech engines.
    Speech Processing for IP Networks brings these technologies together into a single volume, giving the reader a solid technical understanding of the principles of MRCP, how it leverages other protocols and specifications for its operation, and how it is applied in modern IP-based telecommunication networks. Focusing on the MRCPv2 standard developed by the IETF SpeechSC Working Group, this book will also provide an overview of its precursor, MRCPv1.
    Speech Processing for IP Networks:
    Gives a complete background on the technologies required by MRCP to function, including SIP (Session Initiation Protocol), RTP (Real-time Transport Protocol), and HTTP (Hypertext Transfer Protocol).
    Covers relevant W3C data representation formats including Speech Synthesis Markup Language (SSML), Speech Recognition Grammar Specification (SRGS), Semantic Interpretation for Speech Recognition (SISR), and Pronunciation Lexicon Specification (PLS).
    Describes VoiceXML - the leading approach for programming cutting-edge speech applications and a key driver to the development of many of MRCP's features.
    Explains advanced topics such as VoiceXML and MRCP interworking.
    This text will be an invaluable resource for technical managers, product managers, software developers, and technical marketing professionals working for network equipment manufacturers, speech engine vendors, and network operators. Advanced students on computer science and engineering courses will also find this to be a useful guide.

  • 作者簡介

    David Burke is Chief Technology Officer and co-founder of Voxpilot Ltd, UK. David led Voxpilot to its current position as a leader in VoiceXML interactive services platform technology. His management duties at Voxpilot include executive management and counsel, product vision, direction and management, responsibility for all R&D activities including budgeting, engineering team selection and mentoring, and architecture and design.
    He is also member of the World Wide Web Consortium (W3C) Voice Browser Working Group and of the Internet Engineering Task Force (IETF) Speech SC Working Group.

  • 目次

    PART I. BACKGROUND.
    1. Introduction.
    1.1 Introduction to Speech Applications.
    1.2 The MRCP Value Proposition.
    1.3 History of MRCP Standardisation.
    1.3.1 Internet Engineering Task Force.
    1.3.2 World Wide Web Consortium.
    1.3.3 MRCP: From Humble Beginnings Toward IETF Standard.
    1.4 Summary.
    2. Basic Principles of Speech Processing.
    2.1 Human Speech Production.
    2.1.1 Speech Sounds: Phonemics and Phonetics.
    2.2 Speech Recognition.
    2.2.1 Endpoint Detection.
    2.2.2 Mel-Cepstrum.
    2.2.3 Hidden Markov Models.
    2.2.4 Language Modelling.
    2.3 Speaker Verification and Identification.
    2.3.1 Feature Extraction.
    2.3.2 Statistical Modelling.
    2.4 Speech Synthesis.
    2.4.1 Front-end Processing.
    2.4.2 Back-end Synthesis.
    2.5 Summary.
    3. Overview of MRCP.
    3.1 Architecture.
    3.2 Media Resource Types.
    3.3 Network Scenarios.
    3.3.1 VoiceXML IVR Service Node.
    3.3.2 IP PBX with Voicemail.
    3.3.3 Advanced Media Gateway.
    3.4 Protocol Operation.
    3.4.1 Establishing Communication Channels.
    3.4.2 Controlling a Media Resource.
    3.4.3 Walkthrough Examples.
    3.5 Security.
    3.6 Summary.
    PART II. MEDIA AND CONTROL SESSIONS.
    4. Session Initiation Protocol.
    4.1 Introduction.
    4.2 Walkthrough Example.
    4.3 SIP URIs.
    4.4 Transport.
    4.5 Media Negotiation.
    4.5.1 Session Description Protocol.
    4.5.2 Offer/Answer Model.
    4.6 SIP Servers.
    4.6.1 Registrars.
    4.6.2 Proxy Servers.
    4.6.3 Redirect Servers.
    4.7 SIP Extensions.
    4.7.1 Capability Discovery.
    4.8 Security.
    4.8.1 Transport and Network Layer Security.
    4.8.2 Authentication.
    4.8.3 S/MIME.
    4.9 Summary.
    5. Session Initiation in MRCP.
    5.1 Introduction.
    5.2 Initiating the Media Session.
    5.3 Initiating the Control Session.
    5.4 Session Initiation Examples.
    5.4.1 Single Media Resource.
    5.4.2 Adding and Removing Media Resources.
    5.4.3 Distributed Media Source/Sink.
    5.5 Locating Media Resource Servers.
    5.5.1 Requesting Server Capabilities.
    5.5.2 Media Resource Brokers.
    5.6 Security.
    5.7 Summary.
    6. The Media Session.
    6.1 Media Encoding.
    6.1.1 Pulse Code Modulation (PCM).
    6.1.2 Linear Predictive Coding (LPC).
    6.2 Media Transport.
    6.2.1 Real-Time Protocol (RTP).
    6.2.2 DTMF.
    6.3 Security.
    6.4 Summary.
    7. The Control Session.
    7.1 Message Structure.
    7.1.1 Request Message.
    7.1.2 Response Message.
    7.1.3 Event Message.
    7.1.4 Message Bodies.
    7.2 Generic Methods.
    7.3 Generic Headers.
    7.4 Security.
    7.5 Summary.
    PART III. DATA REPRESENTATION FORMATS.
    8. Speech Synthesis Markup Language (SSML).
    8.1 Introduction.
    8.2 Document Structure.
    8.3 Recorded Audio.
    8.4 Pronunciation.
    8.4.1 Phonemic/Phonetic Content.
    8.4.2 Substitution.
    8.4.3 Interpreting Text .
    8.5 Prosody.
    8.5.1 Prosodic Boundaries.
    8.5.2 Emphasis.
    8.5.3 Speaking Voice.
    8.5.4 Prosodic Control.
    8.6 Markers .
    8.7 Metadata.
    8.8 Summary.
    9. Speech Recognition Grammar Specification (SRGS).
    9.1 Introduction.
    9.2 Document Structure.
    9.3 Rules, Tokens, and Sequences.
    9.4 Alternatives.
    9.5 Rule References.
    9.5.1 Special Rules.
    9.6 Repeats.
    9.7 DTMF Grammars.
    9.8 Semantic Interpretation.
    9.8.1 Semantic Literals.
    9.8.2 Semantic Scripts.
    9.9 Summary.
    10. Natural Language Semantics Markup Language (NLSML).
    10.1 Introduction.
    10.2 Document Structure.
    10.3 Speech Recognition Results.
    10.3.1 Serialising Semantic Interpretation Results.
    10.4 Voice Enrollment Results.
    10.5 Speaker Verification Results.
    10.6 Summary.
    11. Pronunciation Lexicon Specification (PLS).
    11.1 Introduction.
    11.2 Document Structure.
    11.3 Lexical Entries.
    11.4 Abbreviations and Acronyms.
    11.5 Multiple Orthographies.
    11.6 Multiple Pronunciations.
    11.7 Summary.
    PART IV. MEDIA RESOURCES.
    12. Speech Synthesiser Resource.
    12.1 Overview.
    12.2 Methods.
    12.2.1 SPEAK.
    12.2.2 PAUSE.
    12.2.3 RESUME.
    12.2.4 STOP.
    12.2.5 BARGE-IN-OCCURRED.
    12.2.6 CONTROL.
    12.2.7 DEFINE-LEXICON.
    12.3 Events.
    12.3.1 SPEECH-MARKER.
    12.3.2 SPEAK-COMPLETE.
    12.4 Headers.
    12.5 Summary.
    13. Speech Recogniser Resource.
    13.1 Overview.
    13.2 Recognition Methods.
    13.2.1 RECOGNIZE.
    13.2.2 DEFINE-GRAMMAR.
    13.2.3 START-INPUT-TIMERS.
    13.2.4 GET-RESULT.
    13.2.5 STOP.
    13.2.6 INTERPRET.
    13.3 Enrollment Methods.
    13.3.1 START-PHRASE-ENROLLMENT.
    13.3.2 ENROLLMENT-ROLLBACK.
    13.3.3 END-PHRASE-ENROLLMENT.
    13.3.4 MODIFY-PHRASE.
    13.3.5 DELETE-PHRASE.
    13.4 Events.
    13.4.1 START-OF-INPUT.
    13.4.2 RECOGNITION-COMPLETE.
    13.4.3 INTERPRETATION-COMPLETE.
    13.5 Recognition Headers.
    13.6 Enrollment Headers.
    13.7 Summary.
    14. Recorder Resource.
    14.1 Overview.
    14.2 Methods.
    14.2.1 RECORD.
    14.2.2 START-INPUT-TIMERS.
    14.2.3 STOP.
    14.3 Events.
    14.3.1 START-OF-INPUT.
    14.3.2 RECORD-COMPLETE.
    14.4 Headers.
    14.5 Summary.
    15. Speaker Verification Resource.
    15.1 Overview.
    15.2 Methods.
    15.2.1 START-SESSION.
    15.2.2 END-SESSION.
    15.2.3 VERIFY.
    15.2.4 VERIFY-FROM-BUFFER.
    15.2.5 VERIFY-ROLLBACK.
    15.2.6 START-INPUT-TIMERS.
    15.2.7 GET-INTERMEDIATE-RESULT.
    15.2.8 STOP.
    15.2.9 CLEAR-BUFFER.
    15.2.10 QUERY-VOICEPRINT.
    15.2.11 DELETE-VOICEPRINT.
    15.3 Events.
    15.3.1 START-OF-INPUT.
    15.3.2 VERIFICATION-COMPLETE.
    15.4 Headers.
    15.5 Summary.
    PART V. PROGRAMMING SPEECH APPLICATIONS.
    16. Voice eXtensible Markup Language (VoiceXML).
    16.1 Introduction.
    16.2 Document Structure.
    16.2.1 Applications and Dialogs.
    16.3 Dialogs.
    16.3.1 Forms.
    16.3.2 Menus.
    16.3.3 Mixed Initiative Dialogs.
    16.4 Media Playback.
    16.5 Media Recording.
    16.6 Speech and DTMF Recognition.
    16.6.1 Specifying Grammars.
    16.6.2 Grammar Scope and Activation.
    16.6.3 Configuring Recognition Settings.
    16.6.4 Processing Recognition Results.
    16.7 Flow Control.
    16.7.1 Executable Content.
    16.7.2 Variables, Scopes, and Expressions.
    16.7.3 Document and Dialog Transitions .
    16.7.4 Event Handling.
    16.8 Resource Fetching.
    16.9 Call Transfer.
    16.10 Summary.
    17. VoiceXML and MRCP Interworking.
    17.1 Introduction.
    17.2 Interworking Fundamentals.
    17.2.1 Play Prompts.
    17.2.2 Play and Recognise.
    17.2.3 Record.
    17.3 Application Example.
    17.3.1 VoiceXML Scripts.
    17.3.2 MRCP Flows.
    17.4 Summary.
    Appendix A. MRCP Version 1.
    A.1 Overview.
    A.2 Session Management and Message Transport.
    A.3 General Protocol Details.
    A.4 Speech Synthesiser Resource.
    A.5 Speech Recogniser Resource.
    Appendix B. XML Primer.
    B.1 Background.
    B.2 Basic Concepts.
    B.3 Namespaces.
    B.4 Document Schemas.
    Appendix C. HTTP Primer.
    C.1 Background.
    C.2 Basic Concepts.
    C.2.1 GET Method.
    C.2.2 POST Method.
    C.3 Caching.
    C.4 Cookies.
    C.5 Security.
    References.
    Index.
    Acronyms.

贊助商廣告
 
金石堂 - 今日66折
數學真有趣,看圖就懂➀巴黎鐵塔是等腰三角形?:形狀與對稱的觀察
作者:費莉西亞.羅
出版社:小宇宙文化
出版日期:2022-12-28
66折: $ 231 
金石堂 - 今日66折
讀書變成「高報酬投資」的刻意自學:「組合式讀學術」翻轉無奈人生,40歲擁有千萬10桶金
作者:本山勝寬
出版社:格致文化
出版日期:2018-09-05
66折: $ 198 
金石堂 - 今日66折
常識選股法:丟掉線圖與財報,我才選到好股票
作者:愛德華.萊恩
出版社:樂金文化
出版日期:2022-10-05
66折: $ 211 
金石堂 - 今日66折
日本三代名醫の肩頸自療法:每天1分鐘!舒緩脊椎肌肉,身體重新調正,自癒力大增!(暢銷新訂版)
作者:竹谷內康修
出版社:方言文化出版事業有限公司
出版日期:2021-08-11
66折: $ 198 
 
金石堂 - 暢銷排行榜
一開始就從裡面… 完全版(全)
作者:みちのくアタミ
出版社:東立出版社
出版日期:2025-06-30
$ 200 
Taaze 讀冊生活 - 暢銷排行榜
你的人生,他們六個說了算!:決定你一生的六種物質
作者:大衛.JP.菲利浦斯
出版社:平安文化有限公司
出版日期:2024-12-30
$ 284 
金石堂 - 暢銷排行榜
我所看見的未來  完全版 (竜樹諒預言漫畫集)
作者:竜樹諒
出版社:大塊文化出版股份有公司
出版日期:2022-07-01
$ 300 
Taaze 讀冊生活 - 暢銷排行榜
既然你都這麼說了我就抱你吧(2)
作者:にやま
出版社:尖端出版
出版日期:2025-05-23
$ 136 
 
金石堂 - 新書排行榜
伊谷納多的新娘(01)特典版
作者:もりもより
出版社:青文出版社股份有限公司
出版日期:2025-05-28
$ 142 
Taaze 讀冊生活 - 新書排行榜
生成式金融危機:當AI接管交易,下一場全球經濟新威脅
作者:詹姆斯.瑞卡茲
出版社:感電出版
出版日期:2025-06-04
$ 322 
Taaze 讀冊生活 - 新書排行榜
魔女的原罪(日本最受矚目法庭推理鬼才,現役律師給社會的反思與救贖)
作者:五十嵐律人
出版社:奇幻基地
出版日期:2025-06-05
$ 338 
 

©2025 FindBook.com.tw -  購物比價  找書網  找車網  服務條款  隱私權政策