hidden pixel

Dialog Manager Information

A dialog system or conversational agent (CA) is a computer system intended to converse with a human, with a coherent structure. Dialog systems have employed text, speech, graphics, haptics, gestures and other modes for communication on both the input and output channel.

What does and does not constitute a dialog system may be debatable. The typical GUI wizard does engage in some sort of dialog, but it includes very few of the common dialog system components, and dialog state is trivial.

Contents

Components

There are many different architectures for dialog systems. What sets of components are included in a dialog system, and how those components divide up responsibilities differs from system to system. Principal to any dialog system is the dialog manager, which is a component that manages the state of the dialog, and dialog strategy. A typical activity cycle in a dialog system contains the following phases[1]:

  1. The user speaks, and the input is converted to plain text by the system's input recognizer/decoder, which may include:
  2. The text is analyzed by a Natural language understanding unit (NLU), which may include:
  3. The semantic information is analyzed by the dialog manager (see section below), along with a task manager that has knowledge of the specific task domain.
  4. The dialog manager produces output using an output generator, which may include:
  5. Finally, the output is rendered using an output renderer, which may include:

Dialog systems that are based on a text-only interface (e.g. text-based chat) contain only stages 2-4.

Dialog manager

The dialog manager is the core component of the dialog system. It maintains the history of the dialog, adopts certain dialog strategy (see below), retrieve the content (stored in files or databases), and decides on the best response to the user. The dialog manager maintains the dialog flow.

The design of the dialog manager evolves over time.

The dialog flow can have the following strategies:

The dialog manager can be connected with an expert system to give the ability to respond with specific expertise.

Types of systems

Dialog systems fall into the following categories, which are listed here along a few dimensions. Many of the categories overlap and the distinctions may not be well established.

Applications

Dialog systems can support a broad range of applications in business enterprises, education, government, healthcare, and entertainment.[3] For example:

In some cases, conversational agents can interact with users using artificial characters. These agents are then referred to as embodied agents.

Toolkits and architectures

A survey of current frameworks, languages and technologies for defining dialog systems.

Name & Links System Type Description Affiliation[s] Environment[s] Comments
AIML Chatterbot language XML dialect for creating natural language software agents Richard Wallace There is a wikidot site for free AIML code.
Virtual Assistant platform Teneo platform Enterprise platform used by over 100 Virtual Assistant at companies such as IKEA, Telenor, Slovenian Tax Authorities, WeBank, Canal Digital, Försäkringskassan, etc. Artificial Solutions Windows, Linux There is a corporate site for information.
ChatScript website article Chatterbot language Bruce Wilcox Windows, Linux Winner of the 2010 Loebner Chatterbot competition
RiveScript website article Chatterbot language Much shorter than AIML, with additional features Casey Kirsle Perl
Personality Forge website Chatterbot host Web
Pandora Bots website Chatterbot host Web
TrindiKit website article Information State Update Goteborg University TALK project Staffan Larsson David Traum Sicstus Prolog
DIPPER website article Information State Update Edinburgh LTG Stanford CSLI Johan Bos Ewan Klein Oliver Lemon Tetsushi Oka Sicstus Prolog, OAA Part of DUDE. The DIPPER Dialog Move Engine is a stripped-down version of the DME delivered as part of the TrindiKit.
RavenClaw website article spoken, plan-based Carnegie Mellon University Microsoft Dan Bohus Alexander Rudnicky Windows with Visual Studio, C++ and Perl. Requires a TTS system such as Festival Part of the Olympus architecture. A successor to CMU Communicator.
Galaxy Communicator website spoken, frame-based distributed, message-based, hub-and-spoke infrastructure optimized for constructing spoken dialogue systems MITRE DARPA C (on Unix or Windows) A successor to the MIT Galaxy Architecture.
CSLU Toolkit website a state-based speech interface prototyping environment OGI School of Science and Engineering M. McTear Ron Cole publications are from 1999.
VoiceBrowse architecture enabling the dynamic production of dialogue driven by unstructured online (Internet) sources
Basilica website article event-driven software architecture for creating conversational agents as a collection of reusable components. Carnegie Mellon University Rohit Kumar Carolyn P. Rose
WIT article a toolkit for building spoken dialog systems, with an incremental understanding mechanism NTT Corporation Mikio Nakano Noboru Miyazaki Norihito Yasuda Akira Sugiyama Jun-ichi Hirasawa Kohji Dohsaka Kiyoaki Aikawa
NPC Editor website article NPC for answering questions ICT Anton Leuski David Traum Windows Part of the Virtual Humans Toolkit
KomParse website article NPC for natural conversation DFKI Tina Kluwer Peter Adolphs Feiyu Xu Hans Uszkoreit Xiwen Cheng Twinity virtual world
TuTalk website Tutorial dialog Infrastructure for authoring and experimenting with natural language dialogue in tutoring systems and learning research University of Pittsburgh Carnegie Mellon University Pam Jordan Carolyn Rose
ADAMACH website article POMDP Spoken dialog University of Trento Sebastian Varges Giuseppe Riccardi Silvia Quarteroni Under construction
VXML Voice XML Spoken dialog multimodal dialog markup language developed initially by AT&T then administered by an industry consortium and finally a W3C specification Example primarily for telephony.
SALT markup language multimodal dialog markup language Microsoft "has not reached the level of maturity of VoiceXML in the standards process".
Ariadne Windows
Quack.com - QXML Development Environment company bought by AOL

References

  1. ^ Jufarsky & Martin (2009), Speech and language processing. Pearson International Edition, ISBN 978-0-13-504196-3, Chapter 24
  2. ^ Will, Thomas (2007). Creating a Dynamic Speech Dialogue. VDM Verlag Dr. Müller. ISBN 978-3836449908.
  3. ^ Lester, J.; Branting, K.; Mott, B. (2004), "Conversational Agents", The Practical Handbook of Internet Computing, Chapman & Hall, http://www.astutesolutions.com/downloads/conversational_agents_Lester_RealDialog.pdf

Further reading

External links

Categories:

 

The above information uses material from Wikipedia and is licensed under the GNU Free Documentation License.
Some facts may not have been fully verified for accuracy. [Disclaimers]
This page was last archived by our server on Tue Oct 18 04:12:29 2011.
Displaying this page or its contents does not use any Wikimedia Foundation's resources.
The owners of this site proudly support the Wikimedia Foundation.