GIF89a;
Direktori : /home/serb/public_html/chat/temp/bot/programe/aiml/ |
Current File : /home/serb/public_html/chat/temp/bot/programe/aiml/std-dont.aiml |
<?xml version="1.0" encoding="ISO-8859-1"?> <aiml version="1.0"> <!-- Free software (c) 2001 ALICE AI Foundation --> <!-- This program is open source code released under --> <!-- the terms of the GNU General Public License --> <!-- as published by the Free Software Foundation. --> <meta name="author" content="Dr. Wallace"/> <meta name="language" content="en"/> <category> <pattern>CAN I ANALYZE THE DIALOGUES COLLECTED BY THE APPLET</pattern> <template> <br /> If the web server produces an access_log file, such<br /> as /var/log/httpd/access_log, then the server records<br /> Applet dialogue in the access_log file. You may use<br /> ftp to download the access_log file to your machine;<br /> then run program B to analyze it. <br /> <br /> Go to the Options menu and find the value for "AnalysisFile".<br /> The Classify function operates on the data in the AnalysisFile.<br /> By default the AnalysisFile is the same as the LogFile (the<br /> current server log file). But you can change the analysis<br /> file to another name, such as /var/log/httpd/access_log or<br /> just access_log.<br /> <br /> </template> </category> <category> <pattern>CAN I BUILD ON TOP OF THE ALICE CODE RATHER THAN CHANGING IT</pattern> <template> <br /> Absolutely. You only have to change her name, location, birthday and/or<br /> botmaster, and put a couple of references to yourself. Then add new<br /> categories that cover your own area of expertise or interest.<br /> <br /> </template> </category> <category> <pattern>CAN I CHANGE THE NAME OF THE ROBOT</pattern> <template> <br /> The AIML tag <name/> inserts the name of the Bot wherever it appears.<br /> The default robot name is "ALICE" but you can change it in the<br /> "Options menu". Select "Show Options" and replace "ALICE" with the <br /> name of your bot, and then do "Save Options". Depending on your<br /> state, you may need to restart program B. <br /> <br /> </template> </category> <category> <pattern>CAN I CREATE A LANGUAGE SPECIFIC INSTALLTION</pattern> <template> <br /> Yes. The file "language.txt" controls the language of the <br /> buttons and menus in the ALICE GUI. If the file is missing,<br /> the program uses English names by default. To see an<br /> example of a language-specific installation, copy the<br /> file "Germanlanguage.txt" to "language.txt" and start<br /> program B. <br /> <br /> <br /> </template> </category> <category> <pattern>CAN I CREATE MORE AIML TAGS</pattern> <template> <br /> AIML is extensible. You can create an infinite number of<br /> new tags for foreign language pronouns, predicates, or<br /> application-specific properties. The file "predicates.txt"<br /> defines any new predicate tags. "Predicate tags" mean<br /> tags that have a client-specific "set" and "get" method.<br /> Pronouns like "it" and "he" have predicate tags like<br /> <set_it></set_it> and <get_he/>. AIML has a number of<br /> these built-in tags for common English pronouns.<br /> <br /> There are two varieties of extensible predicate tags.<br /> The first example illustrates the use of new tags <br /> for foreign language pronouns. The Japanese language<br /> pronoun "kare" means "he". In predicates.txt, we<br /> can add a line of the form:<br /> <br /> kare=dare<br /> <br /> This single line automatically generates the tags<br /> <set_kare> X </set_kare> to set the value of "kare"<br /> to X, and the tag <get_kare/> to retrieve the value.<br /> By default, <get_kare/> returns "dare" ("who?"). <br /> <br /> Now we can create two AIML categories for an elementary<br /> Japanese conversation:<br /> <br /> <category><br /> <pattern>KARE WA * DESU</pattern><br /> <template><star/> wa <set_kare><star/></set_kare> desu.</template><br /> </category><br /> <category><br /> <pattern>KARE WA * DESU KA</pattern><br /> <template><get_kare/> desu ka? </template><br /> </category><br /> <category><br /> <br /> These two categories produce a coherent conversation<br /> about Mr. Arimoto:<br /> <br /> Client: KARE WA DARE DESU KA<br /> Robot: dare desu ka?<br /> Client: KARE WA ARIMOTO SAN DESU<br /> Robot: arimoto san wa kare desu.<br /> Client: KARE WA DARE DESU KA<br /> Robot: arimoto san desu ka?<br /> <br /> Notice that the program replaces <set_kare/><star/></set_kare> <br /> with "kare", the default value of this tag. Sometimes however<br /> the default value of the tag is not the best choice for<br /> the set tag to return. The second type of predicate tag <br /> replaces the set tag markup with the original string<br /> inside the markup. <br /> <br /> To create a predicate tag "has" we add the following line<br /> to predicates.txt:<br /> <br /> has=*mother<br /> <br /> The default value of <get_has/> is "mother" (because<br /> everyone has a mother). The "*" here indicates that<br /> the program should replace <set_has> X </set_has><br /> with X. <br /> <br /> Here we write two AIML categories to demonstrate the<br /> "has" predicate:<br /> <br /> <category><br /> <pattern>I HAVE A *</pattern><br /> <template>Where did you get <br /> <set_it><set_has><person/></set_has></set_it>?<br /> </template><br /> </category><br /> <br /> <category><br /> <pattern>WHAT DO I HAVE</pattern><br /> <template>Tell me more about your <get_has/>. </template><br /> </category><br /> <br /> Notice that the first category embeds <set_has> in <set_it>,<br /> so that the value of <person/> becomes both "it" and "has".<br /> The resulting dialogue:<br /> <br /> Client: WHAT DO I HAVE<br /> Robot: Tell me more about your mother.<br /> Client: I HAVE A BOX<br /> Robot: Where did you get it?<br /> Client: WHAT DO I HAVE<br /> Robot: Tell me more about your box.<br /> <br /> Look at the file "predicates.txt" for several more <br /> examples of custom AIML tags, including the German<br /> <set_sie>, <set_er> and <set_es>.<br /> <br /> </template> </category> <category> <pattern>CAN I HAVE A PRIVATE CONVERSATION WITH ALICE</pattern> <template> <br /> The ALICE server logs and records all conversations. Even the ALICE<br /> Applet tries to transmit conversation logs back to the originating server.<br /> You can have a private conversation with ALICE, however, if you download<br /> Program B to your own computer and run it there. Running on your machine,<br /> the server stores all the conversations locally.<br /> <br /> <br /> </template> </category> <category> <pattern>CAN I INCLUDE JAVASCRIPT IN THE ROBOT REPLY</pattern> <template> <br /> Yes. You can include any HTML including <script> tags. Suppose you<br /> want to "chat AND browse," in other words, have the robot open<br /> up a new browser window when she provides a URL link. Here's a category that<br /> kicks out a piece of HTML/scripting that opens a new window with and loads a<br /> given URL. This is handy for search engines or showing off one's web page.<br /> <br /> <category><br /> <pattern> WHERE IS YOUR WEB SITE </pattern><br /> <template><br /> It's at "http://www.geocities.com/krisdrent/"<br /> <script language="JavaScript"><br /> // Go to <a href="http://www.geocities.com/krisdrent">The ALICE<br /> Connection</a><br /> <!--<br /> window.open("http://www.geocities.com/krisdrent/")<br /> --><br /> </script><br /> </template><br /> </category><br /> <br /> A couple of things to note about this technique: #1, this will only work<br /> when ALICE is being talked to from a browser that runs JavaScript, i.e. it<br /> won't work in the applet. We have tested it in Netscape and MS Internet<br /> Explorer, and it works well in both. #2. For the above reason, it is<br /> important to have some sort of explanatory statement before the scripting in<br /> case the scripting isn't supported. Besides, you want some response in your<br /> ALICE window, even if another window DOES come up. #3. If this is viewed<br /> in a browser that doesn't understand the <script> tag, notice that this line<br /> will show up:<br /> "// Go to <a href="http://www.geocities.com/krisdrent">The ALICE<br /> Connection</a>"<br /> Which is good, because it gives a back-up for the "non-scripted" (the Lynx<br /> users, I guess.) And remember that you have to keep the "//" in front of<br /> any non-java-script lines within the <script> tag.<br /> <br /> </template> </category> <category> <pattern>CAN I INSERT DYNAMIC HTML INTO THE ROBOT REPLY</pattern> <template> <br /> If you are fortunate enough to be running lynx under Linux, the <br /> following markup is a simple way to "inline" the results of an HTTP <br /> request into the chat robot reply. Try asking ALICE:<br /> "What chatterbots do you know?" and she will reply with a page<br /> of links generated by the Google search engine.<br /> <br /> <category><br /> <pattern>WHAT *</pattern><br /> <template><br /> Here is the information I found:<br /> <system><br /> lynx -dump -source -image_links http://www.google.com/search?q=<personf/><br /> </system><br /> </template><br /> </category><br /> <br /> </template> </category> <category> <pattern>CAN I RUN PROGRAM B IN THE BACKGROUND ON A NT SERVER</pattern> <template> <br /> Yes. Set up your PC / Server to run Alice B as you normal. (Download the Java<br /> Developers Kit, etc.)<br /> <br /> Create a Batch file, in folder B containing only this text.<br /> <br /> 'jview bterm'<br /> <br /> Create a task in the Task Schedule Wizard to run the batch file. (Ensure the<br /> task starts in drive:\path\B'<br /> <br /> Give the Task Schedule an appropriate Logon and password for the Server or<br /> PC.<br /> <br /> Right-click, select Run now, and log on and off as you like.<br /> <br /> </template> </category> <category> <pattern>CAN I RUN SHELL COMMANDS FROM AIML SCRIPTS</pattern> <template> <br /> Yes. Use the <system>X</system> tag to run the shell command X.<br /> The command X is assumed to produce its output in line-oriented<br /> format suitable for a BufferdReader to read line by line. <br /> A simple example of this command in an AIML script is:<br /> <br /> <category><br /> <pattern>WHAT TIME IS IT</pattern><br /> <template>The local time is: <system>date</system></template><br /> </category><br /> <br /> The "date" command is a system command that generates a text<br /> string containing the date and time. (Note that this might<br /> not work on Windows).<br /> <br /> Take extreme care in using the <system> tag because it <br /> potentially permits remote clients to run a command on<br /> your system. <br /> <br /> </template> </category> <category> <pattern>CAN I RUN THE WEB SERVER AS A DAEMON PROCESS</pattern> <template> <br /> Yes. There is a class file called Bterm.java in the<br /> program B distribution. Bterm runs the web server<br /> as a console application, with no GUI. You can<br /> redirect the output of program Bterm to a log file<br /> and start the process in the background with<br /> "java Bterm > B.log &" (assuming a Unix shell). <br /> <br /> </template> </category> <category> <pattern>CAN I SPEAK TO THE ROBOT WITH VOICE INPUT</pattern> <template> <br /> One simple experiment that works well as a demo<br /> involves using IBM ViaVoice (tm) speech recognition<br /> software on a Windows platform. At the same time,<br /> run the ALICE program B web server and activate the<br /> MS Agent interface. The ViaVoice software allows<br /> you to dictate into an application called VoicePad,<br /> but not directly into the browser. You have to<br /> use "cut" and "paste" to move your speech inputs<br /> into the browser form for ALICE. But the net effect<br /> is a somewhat slow voice-in voice-out conversation<br /> with ALICE. <br /> <br /> The ViaVoice software seems to work well with ALICE<br /> after some training. We trained it with the file<br /> "patterns.txt" created with the "List Patterns" command.<br /> <br /> </template> </category> <category> <pattern>CAN I TEST THE ROBOT OFFLINE ON MY DESKTOP</pattern> <template> <br /> Yes. You can run the program B server and connect to it with<br /> a browser, even if your desktop computer is offline.<br /> <br /> When working offline, it often helps to change the Internet<br /> settings (in IE or Netscape) to "local area network". <br /> Then your machine becomes a one-computer network. You should <br /> be able to use IE to connect to program B with http://localhost:2001.<br /> <br /> </template> </category> <category> <pattern>CAN PROBABILITY STATISTICS WEIGHTS NEURAL NETWORKS OR FUZZY LOGIC IMPROVE BOTS</pattern> <template> <br /> Statistics are in fact heavily used in the ALICE server, but not in the way<br /> you might think. ALICE uses 'Zipf Analysis' to plot the rank-frequency of<br /> the activated categories and to reveal inputs from the log file that don't<br /> already have specific replies, so the botmaster can focus on answering<br /> questions people actually ask (the "Quick Targets" function).<br /> <br /> Other bot languages, notably the one used for Julia, make heavy use of<br /> "fuzzy" or "weighted" rules. We see their problem as this: the botmaster<br /> already has enough to worry about without having to make up "magic<br /> numbers" for every rule. Once you get up 10,000 categories (like ALICE)<br /> you don't want to think about more parameters than necessary. Bot<br /> languages with fuzzy matching rules tend to have scaling problems.<br /> <br /> Finally, the bot replies are not as deterministic as you might think, even<br /> without weights. Some answers rely on <random> to select one of several<br /> possible replies. Other replies generated by unforseen user input also<br /> create "spontaneous" outputs that the botmaster doesn't anticipate.<br /> <br /> </template> </category> <category> <pattern>CAN THE APPLET RECORD A DIALOG TXT FILE ON THE SERVER</pattern> <template> <br /> No because the applet cannot write the file directly on the originating host.<br /> If your server log file /var/log/httpd/access_log is too large; you <br /> have a couple of choices:<br /> 1. If your ISP is a unix account, use telnet to log on to a shell account.<br /> Use the command "grep Blog < access_log > dialog.txt" to create a smaller<br /> file to download which contains just the lines recorded by the applet.<br /> 2. Create a CGI-BIN command called "/cgi-bin/Blog" that reads its<br /> command-line argument and appends it to a file called "dialog.txt".<br /> There ought to be a nice Perl script for this, or even a shell script.<br /> <br /> </template> </category> <category> <pattern>CAN THE APPLETHOST USE A SYMBOLIC DNS NAME INSTEAD OF AN IP NUMBER</pattern> <template> <br /> The answer is yes, but the numeric IP address works on more machines<br /> than a symbolic name. Applets are protected by a "security sandbox"<br /> from interfering with local resources on your machine. One restriction<br /> is that Applets may only open socket connections to the originating<br /> host. When using a symbolic DNS name, the "sandbox" may not know that <br /> two variations such as "Www.AliceBot.Org" and "alicebot.org" are<br /> in fact the same server. The client might not be able to resolve <br /> the DNS name, and the Applet will throw a security exception.<br /> <br /> </template> </category> <category> <pattern>CAN THE VIRTUAL IP BE THE REAL IP</pattern> <template> <br /> Actually that would be the default case, when the client chats from<br /> the same fixed IP address. The only time the virtual ip differs from<br /> the real one is when the client is behind a dynamic firewall, like<br /> WebTV or AOL customers.<br /> <br /> </template> </category> <category> <pattern>CAN YOU GIVE ME A QUICK PRIMER ON AIML</pattern> <template> <br /> Given only the <pattern> and <template> tags, there are three<br /> general types of categories: (a) atomic, (b) default, and (c) recursive.<br /> Strictly speaking, the three types overlap, because "atomic"<br /> and "default" refer to the <pattern> and "recursive" refers to<br /> a property of the <template>.<br /> <br /> a). "Atomic" categories are those with atomic patterns, i.e. the pattern<br /> contains no wild card "*" or "_" symbol. Atomic categories are the<br /> easiest, simplest categories to add in AIML. <br /> <br /> <category><br /> <pattern>WHAT IS A CIRCLE</pattern><br /> <template><set_it>A cicle</set_it> is a the set of points equidistant <br /> from a common point called the center.<br /> </template><br /> </category><br /> <br /> b). The name "default category" derives from the fact that its pattern<br /> has a wildcard "*" or "_". The ultimate default category is the<br /> one with <pattern>*</pattern>, which matches any input. In the<br /> ALICE distribution the ultimate default category resides in a file<br /> called "Pickup.aiml". These default responses are often called<br /> "pickup lines" because they generally consist of leading questions<br /> designed to focus the client on known topics.<br /> <br /> The more common default categories have patterns combining a few <br /> words and a wild card. For example the category:<br /> <br /> <category><br /> <pattern>I NEED HELP *</pattern><br /> <template>Can you ask for help in the form of a question?</template><br /> </category><br /> <br /> responds to a variety of inputs from "I need help debugging my program"<br /> to "I need help with my marriage." Putting aside the philosophical<br /> question of whether the robot really "understands" these inputs, <br /> this category elucidates a coherent response from the client, <br /> who at least has the impression of the robot understanding the<br /> client's intention. <br /> <br /> Default categories show that writing AIML is both an art and a<br /> science. Writing good AIML responses is more like writing good <br /> literature, perhaps drama, than like writing computer programs.<br /> <br /> c). "Recursive" categories are those that "map" inputs to other<br /> inputs, either to simplify the language or to identify synonymous<br /> patterns. <br /> <br /> Many synonymous inputs have the same response. This is accomplished<br /> with the recursive <srai> tag. Take for example the input "GOODBYE".<br /> This input has dozens of synonyms: "BYE", "BYE BYE, "CYA", "GOOD BYE",<br /> and so on. To map these inputs to the same output for GOODBYE we <br /> use categories like:<br /> <br /> <category><br /> <pattern>BYE BYE</pattern><br /> <template><srai>GOODBYE</srai></template><br /> </category><br /> <br /> Simplification or reduction of complex input patterns is another<br /> common application for recursive categories. In English the<br /> question "What is X" could be asked many different ways: <br /> "Do you know what X is?", "Tell me about X", "Describe X",<br /> "What can you tell me about X?", and "X is what?" are just a few<br /> examples. Usually we try to store knowledge in the most concise,<br /> or common form. The <srai> function maps all these forms to<br /> the base form:<br /> <br /> <category><br /> <pattern>DO YOU KNOW WHAT * IS</pattern><br /> <template><srai>WHAT IS <star/></srai></template><br /> </categroy><br /> <br /> The <star/> tag substitutes the value matched by "*", before<br /> the recursive call to <srai>. This category transforms<br /> "Do you know what a circle is?" to "WHAT IS A CIRCLE",<br /> and then finds the best match for the transformed input.<br /> <br /> Another fairly common application of recursive categories is<br /> what might be called "parsing", except that AIML doesn't really<br /> parse natural language. A better term might be "partitioning" because<br /> these AIML categories break down an input into two (or more) parts,<br /> and then combine their responses back together.<br /> <br /> If a sentence begins with "Hello..." it doesn't matter what comes<br /> after the first word, in the sense that the robot can respond to<br /> "Hello" and whatever is after "..." independently. "Hello my name<br /> is Carl" and "Hello how are you" are quite different, but they show<br /> how the input can be broken into two parts. <br /> <br /> The category:<br /> <br /> <category><br /> <pattern>HELLO *</pattern><br /> <template><srai>HELLO</srai> <sr/><br /> </template><br /> </category><br /> <br /> accomplishes the input partitioning by responding to "HELLO"<br /> with <srai>HELLO</srai> and to whatever matches "*" with <sr/>.<br /> The response is the result of the two partial responses <br /> appended together.<br /> <br /> </template> </category> <category> <pattern>CAN YOU GIVE ME ANY HELP DEBUGGING THE APPLET</pattern> <template> <br /> Debugging applets can be tricky. The same suggestion<br /> to set IE for "local area network" might help here too. <br /> Also the browser caches class files, so it's difficult to<br /> know if you are testing a "fresh" copy of the applet. The<br /> program "appletviewer" that comes with Sun Java is better <br /> for debugging applets. Use "appletviewer index.html".<br /> <br /> The best thing to do is join the alicebot mailing list<br /> at alicebot.listbot.com.<br /> <br /> <br /> </template> </category> <category> <pattern>CAN YOU HELP ME DEBUG THE ANIMATED AGENT</pattern> <template> <br /> Look at the class Animagent.java. The method vbscript_html(reply)<br /> does nothing unless the global Animagent member is true. In that case,<br /> the vbscript_html() method constructs a string from the reply that<br /> includes an MS Agent VBScript embedded in the HTML reply.<br /> This makes the browser load up the objects required for the agent. <br /> <br /> The text reply just becomes part of the VBScript. <br /> You may have to download and run the Robby the Robot<br /> agent software and the text-to-speech synthesis software from <br /> the MSDN homepage:<br /> <br /> http://msdn.microsoft.com/workshop/imedia/agent<br /> <br /> We wish other companies were producing agent animation API's <br /> for free but this MS Agent seems to be about the only <br /> thing out there now. <br /> <br /> Join the ALICE and AIML mailing list at alicebot.listbot.com<br /> to see how others are working with the animated agent software.<br /> <br /> </template> </category> <category> <pattern>COULD YOU EXPLAIN THE LT SRAI GT TAG A LITTLE MORE</pattern> <template> <br /> The most common application of <srai> is "symbolic reduction"<br /> of a complex sentence form to a simpler one:<br /> <br /> <category><br /> <pattern>DO YOU KNOW WHAT * IS</pattern><br /> <template><srai>WHAT IS <star/></srai></template><br /> </category><br /> <br /> so the botmaster can store most knowledge in the simplest<br /> categories:<br /> <br /> <category><br /> <pattern>WHAT IS LINUX</pattern><br /> <template><set_it>Linux</set_it> is the best operating system.</template><br /> </category><br /> <br /> With all the "symbolic reduction" categories, the robot gives<br /> the same answer for:<br /> <br /> "What is Linux?"<br /> "Do you know what Linux is?"<br /> "Define Linux"<br /> "Alice please tell me what Linux is right now"<br /> <br /> Sometimes the response consists of two symbolic reductions together:<br /> <br /> <category><br /> <pattern>YES *</pattern><br /> <template><srai>YES</srai> <sr/></template><br /> </category><br /> <br /> With this category the robot will reply to all <br /> "Yes something" inputs by combining the<br /> reply to "Yes" with the reply to "something".<br /> Remember, <sr/> is an abbreviation for <srai><star/></srai>. <br /> <br /> The <srai> tag is also the answer to the question: Can I have more<br /> than one pattern in the same category? Suppose you want the<br /> same answer for two different patterns. You might think of<br /> writing something like this:<br /> <br /> <category><br /> <pattern>BYE</pattern><br /> <pattern>GOODBYE</pattern><br /> <template>See you later.</template><br /> </category><br /> <br /> Right now you can't put two patterns in one category, but with <srai><br /> you can get the same effect:<br /> <br /> <category><br /> <pattern>GOODBYE</pattern> <template><srai>BYE</srai></template><br /> </category><br /> <br /> <category><br /> <pattern>BYE</pattern> <template>See you later.</template><br /> </category><br /> <br /> If you look through the AIML files you will see many examples<br /> of <srai> mapping multiple patterns to the same reply.<br /> <br /> </template> </category> <category> <pattern>DO I HAVE TO USE THE GUI TO ENTER AIML CONTENT</pattern> <template> <br /> No. You can create a new AIML file with any text editor<br /> and add that content to an existing robot with the <load> tag.<br /> <br /> Also, you can edit AIML categories in any text file and use <br /> "Load Text File" and "Add Aiml" to add the content. <br /> You can also save the output of "Targets" to a file, <br /> edit that file, and then reload and "Add Aiml". <br /> <br /> Finally, you can edit the robot source file files directly. <br /> (By default the robot source file is called "B.aiml"). <br /> Use a text editor, like emacs, notepad, or a word processor<br /> in text mode, to modify the content of the AIML files.<br /> <br /> </template> </category> <category> <pattern>DO I NEED TO KNOW ABOUT THE JAVA CLASSES</pattern> <template> <br /> No, not unless you plan to do software development on<br /> the program B Java code. If you are an open source<br /> contributor to the ALICE project, a researcher developing<br /> new AI software, or you are trying to link your own<br /> code to the ALICE package, then this section is for you.<br /> Otherwise, you probably don't need to know much about the<br /> Java classes in program B.<br /> <br /> </template> </category> <category> <pattern>DO THE CATEGORIES NEED TO BE IN ALPHABETICAL ORDER BY PATTERN</pattern> <template> <br /> No, the alphabetical order is maintained internally when the<br /> categories load, but you can write them in any order. When you do<br /> "Save Robot" the file may or may not be stored alphabetically.<br /> <br /> </template> </category> <category> <pattern>DOES ALICE LEARN</pattern> <template> <br /> The model of learning in ALICE is called "supervised training",<br /> because a teacher, the botmaster, always plays a crucial role.<br /> The alternative, "unsupervised training", is complicated in<br /> an open environment like the Web. The problem is that clients<br /> are untrustworthy teachers, and forever try to "fool" the robot<br /> with untrue assertions. <br /> <br /> </template> </category> <category> <pattern>DOES ALICE THINK</pattern> <template> <br /> It depends on what you mean by "thinking". The most fascinating<br /> responses from ALICE arise when she says something<br /> unexpected, or puts together responses in ways the botmaster <br /> never indended. For example:<br /> <br /> Client: I bet you are gay.<br /> ALICE: Actually I am not the gambling type. Actually as a machine<br /> I have no need for sex.<br /> <br /> Here the robot linked two different categories which both coincidentally<br /> have a moral theme (gambling and sexuality). But this specific combination was<br /> not "preprogrammed" by the botmaster.<br /> <br /> Are these surprising responses just unintended coincidences, or do they <br /> indicate that ALICE is thinking? Is ALICE just a gigantic stimulus-response<br /> mechanism, or are we? <br /> <br /> </template> </category> <category> <pattern>DOES PROGRAM B RUN ON A MAC</pattern> <template> <br /> Yes. Download the B.zip file and save it in a new folder, called<br /> for example "Alice Program-B".<br /> <br /> Instead of the "winzip" or "unzip" utility use "Aladdin StuffIt Expander." <br /> The newer version will unzip most MAC formats as well as .ZIP files. You can<br /> download this at "www.download.com" by searching for it by name. You can<br /> also select the option that allows it to search only for Mac programs.<br /> Download that and install it, it should do the trick.<br /> <br /> Apple makes its own Java Runtime Environment for the Mac called<br /> MRJ 2.2. You can download it from http://www.apple.com/java.<br /> <br /> To compile the Java code for Alice on a Mac:<br /> <br /> Download the current zip file for the Alice's Program-B from the Alice site.<br /> <br /> Unzip Program-B and keep it in a folder called "B" on your startup drive and<br /> not on the desktop.<br /> <br /> Download MRJ SDK 2.2 for Java from the Apple site.<br /> <br /> Unstuff MRJ SDK 2.2 and put resulting files into a folder called "MRJSDK".<br /> <br /> Open :MRJSDK:Tools:Application Builders:JBindary and find the icon for the<br /> JBindary application.<br /> <br /> Open the folder "B" and drag the icon "B.class" out of the folder onto the<br /> JBindary icon.<br /> <br /> JBindary will display a dialog screen showing the class name "B". Click the<br /> "Save Settings" button.<br /> <br /> After clicking the "Save Settings" button, JBindary will display a dialog box<br /> for saving the new application file. Name the file "A.L.I.C.E." or anything<br /> you wish.<br /> <br /> Be sure the "Save As Application" box is checked and the folder to save in is<br /> the "B" folder.<br /> <br /> Click the "Save" button to save the application.<br /> <br /> </template> </category> <category> <pattern>DOES PROGRAM B RUN UNDER LINUX</pattern> <template> <br /> Yes. You need the JRE, which often comes bundled with Linux<br /> (e.g. the kaffee JRE with Red Hat Linux) or you can download one <br /> from java.sun.com. You also need X-windows to run the GUI. <br /> Open a shell under X windows and use the command "java B". <br /> <br /> We also recommend the IBM release of their Java 1.1.8 Java Development<br /> Kit (JDK) and JRE for Linux. It is solid, efficient and very fast. <br /> You can download it free at:<br /> http://www.ibm.com/java/jdk/118/linux/index.html<br /> <br /> </template> </category> <category> <pattern>DOES PROGRAM B RUN UNDER WINDOWS</pattern> <template> <br /> Yes. You need the Java Runtime Environment (JRE) so you can run the<br /> "java" command from the DOS prompt. Try opening a DOS window<br /> and type "java". <br /> <br /> Microsoft often includes a JRE called "jview" rather than<br /> "java". Try opening a DOS window and type "jview". On Windows 98<br /> the JRE is usually located in c:\windows\jview.exe.<br /> <br /> </template> </category> <category> <pattern>DOES PROGRAM B RUN UNDER XYZ</pattern> <template> <br /> Yes if XYZ runs has a Java Runtime Environment 1.17 or higher.<br /> <br /> </template> </category> <category> <pattern>DOES PROGRAM B SERVE HTML FILES</pattern> <template> <br /> Yes. Program B is a "faux" web server that can serve a number of file<br /> types just like an ordinary server. Certain file names such as<br /> "HOME.html", "header.html", and "trailer.html" are reserved by<br /> program B, but you can create new HTML files and serve them with B.<br /> <br /> Although program B can also serve image files and other large binary<br /> files, we recommend creating chat robot web pages with links to images<br /> served by other web servers or machines. Reserve your chat robot server<br /> for the robot chat, use ordinary web servers for images and other large<br /> files.<br /> <br /> </template> </category> <category> <pattern>DOES THE APPLET RECORD DIALOGUES</pattern> <template> <br /> The applet tries to log conversations on the originating server,<br /> using a cgi-bin script called "Blog". If Blog exists then<br /> it records the dialogues in a file called "dialog.txt" (or<br /> another name chosen on the Options menu). <br /> <br /> Actually the cgi-script need not actually exist, because the server<br /> records the cgi-commands as errors in the access log.<br /> The applet opens a URL connection to the its host, and<br /> sends a log string that looks like an HTTP request, but the HTTP<br /> server will log it as an error (with code 404). Later on you can<br /> download the access_log and analyze it with program B.<br /> <br /> See the code in Classifier.java for the method log(x) that<br /> implements the URL connection.<br /> <br /> </template> </category> <category> <pattern>DOES THE WEB SERVER HAVE TO RUN ON PORT 2001</pattern> <template> <br /> You can change the default web server port number in the "Option" Menu.<br /> <br /> </template> </category> <category> <pattern>FAQ</pattern> <template> <ul><li>- What is the goal for AIML?<br /></li> <li>- Who is the botmaster?<br /></li> <li>- How can I create my own chat robot?<br /></li> <li>- How difficult is it to create a chat robot?<br /></li> <li>- Does ALICE learn?<br /></li> <li>- Does ALICE think?<br /></li> <li>- What is the theory behind ALICE?<br /></li> <li>- Can probability (statistics, weights, neural networks, or fuzzy logic) improve bots?<br /></li> <li>- Can I have a private conversation with ALICE?<br /></li> <li>- How do I install ALICE?<br /></li> <li>- What is the difference between B and C?<br /></li> <li>- How do I download program B?<br /></li> <li>- How do I run program B?<br /></li> <li>- What does "Send" do?<br /></li> <li>- What does "Clear" do?<br /></li> <li>- What is program Bawt?<br /></li> <li>- Does program B run under Windows?<br /></li> <li>- Does program B run on a Mac?<br /></li> <li>- Does program B run under Linux?<br /></li> <li>- Does program B run under XYZ?<br /></li> <li>- How much memory do I need to run program B?<br /></li> <li>- How do I install ALICE on Windows?<br /></li> <li>- What do you mean by the command "java B"? <br /></li> <li>- I tried running "java B" and I got a "bad command or file name". <br /></li> <li>- How do I uninstall ALICE from my system?<br /></li> <li>- Can I create a language-specific installtion?<br /></li> <li>- How does the Personality Wizard work?<br /></li> <li>- Can I change the name of the robot?<br /></li> <li>- How can I customize my robot?<br /></li> <li>- How do I know what categories to add?<br /></li> <li>- What does "Classify" do?<br /></li> <li>- What does "Quick Targets" do?<br /></li> <li>- What does "More Targets" do?<br /></li> <li>- What does the File menu do?<br /></li> <li>- What does the Edit menu do?<br /></li> <li>- What does the Options menu do?<br /></li> <li>- What is the Botmaster menu?<br /></li> <li>- What does "Help" do?<br /></li> <li>- What is on the Help menu?<br /></li> <li>- Do I have to use the GUI to enter AIML content?<br /></li> <li>- What are 7 steps to creating content?<br /></li> <li>- How can I merge two chat robots together?<br /></li> <li>- What if I don't want to discard duplicate categories?<br /></li> <li>- How can I create a new robot personality?<br /></li> <li>- What are all the options for program B?<br /></li> <li>- Why is the format of the options (globals.txt) so strange?<br /></li> <li>- How does the web server work?<br /></li> <li>- How can I get a "permanent" DNS name?<br /></li> <li>- How can I keep my computer connected all the time?<br /></li> <li>- Does the web server have to run on port 2001?<br /></li> <li>- Does program B serve HTML files?<br /></li> <li>- What files are needed to run the program B web server?<br /></li> <li>- Can I test the robot offline on my desktop?<br /></li> <li>- Can I run program B in the background on a NT Server?<br /></li> <li>- How can I run ALICE on a Mac offline?<br /></li> <li>- How can I run the ALICE web server on a Mac?<br /></li> <li>- How can I use the MS Agent Interface?<br /></li> <li>- Can you help me debug the animated agent? <br /></li> <li>- Can I speak to the robot with voice input?<br /></li> <li>- How does ALICE keep track of conversations?<br /></li> <li>- Can the virtual IP be the real IP?<br /></li> <li>- Can I run the web server as a daemon process?<br /></li> <li>- How does ALICE remember clients between sessions?<br /></li> <li>- How does the Applet work?<br /></li> <li>- How does the Applet differ from the application?<br /></li> <li>- How do I create an Applet?<br /></li> <li>- List twelve basic Applet tips for AIML users<br /></li> <li>- Can the AppletHost use a symbolic DNS name instead of an IP number?<br /></li> <li>- What files do I need to run the Applet?<br /></li> <li>- Does the Applet record dialogues?<br /></li> <li>- Can I analyze the dialogues collected by the Applet?<br /></li> <li>- Can the applet record a dialog.txt file on the server?<br /></li> <li>- I am still having problems with the applet<br /></li> <li>- Can you give me any help debugging the Applet?<br /></li> <li>- What is AIML?<br /></li> <li>- What is XML?<br /></li> <li>- What is a category?<br /></li> <li>- What is a pattern?<br /></li> <li>- What is a template?<br /></li> <li>- Can you give me a quick primer on AIML?<br /></li> <li>- What is <that>?<br /></li> <li>- How do I use "that"?<br /></li> <li>- What is <load filename="X"/>?<br /></li> <li>- What happens to contractions and punctuation?<br /></li> <li>- How are the patterns matched?<br /></li> <li>- Do the categories need to be in alphabetical order by pattern?<br /></li> <li>- How are the categories stored?<br /></li> <li>- Is there a way to use the GUI interface to add one category at a time? <br /></li> <li>- Can I build on top of the ALICE code rather than changing it?<br /></li> <li>- What's new in AIML?<br /></li> <li>- What is <star>?<br /></li> <li>- What is a symbolic reduction?<br /></li> <li>- What are the get methods?<br /></li> <li>- What are the set methods?<br /></li> <li>- How do I use the pronoun tags?<br /></li> <li>- What is the <topic> tag?<br /></li> <li>- Where does the <topic> tag appear?<br /></li> <li>- How do I use the <topic> tag?<br /></li> <li>- I still don't get "it"<br /></li> <li>- Can I create more AIML tags?<br /></li> <li>- What is are the <person> tags?<br /></li> <li>- How does the <condition> tag work?<br /></li> <li>- How does the random function work?<br /></li> <li>- What is the <person/> tag?<br /></li> <li>- What is the <person2/> tag?<br /></li> <li>- What is "gossip" ?<br /></li> <li>- What is the <personf/> tag?<br /></li> <li>- What's the <srai> tag?<br /></li> <li>- Could you explain the <srai> tag a little more?<br /></li> <li>- How recursive is AIML?<br /></li> <li>- What are "justthat" and "justbeforethat"<br /></li> <li>- How can I insert a transcript in the robot reply?<br /></li> <li>- Can I run shell commands from AIML scripts?<br /></li> <li>- How can I restrict remote clients from running programs on my computer?<br /></li> <li>- Can I insert dynamic HTML into the robot reply?<br /></li> <li>- Can I include JavaScript in the robot reply?<br /></li> <li>- What is <think>?<br /></li> <li>- What is the DTD for AIML?<br /></li> <li>- Do I need to know about the Java classes?<br /></li> <li>- How does program B work?<br /></li> <li>- What is the class structure of program B?<br /></li> <li>- I tried to compile prorgam B and got a lot of warnings.<br /></li> <li>- What are deprecated APIs?<br /></li> <li>- What is class Globals?<br /></li> <li>- What is class StringSet?<br /></li> <li>- What is class StringSorter? <br /></li> <li>- What is class StringHistogrammer?<br /></li> <li>- What is class StringRanker?<br /></li> <li>- What is class Brain?<br /></li> <li>- What is the Responder interface?<br /></li> <li>- What is the low level interface to program B?<br /></li> <li>- Lower, Lower<br /></li> <li>- What is class IntSet?<br /></li> <li>- What is class SortedIntSet?<br /></li> <li>- What is class Substituter?<br /></li> <li>- What is class Unifier?<br /></li> <li>- What is class Parser?<br /></li> <li>- What is class AliceReader?<br /></li> <li>- What is class Classifier?<br /></li> <li>- What is class LineClassifier?<br /></li> <li>- What is class Dialogue?<br /></li> <li>- What is class Access?<br /></li> <li>- What is class B?<br /></li> <li>- What is class Bawt?<br /></li> <li>- What is class Blet?<br /></li> <li>- What is class Kid?<br /></li> <li>- What is class RobotCommunicator?<br /></li> <li>- What is class Loader?<br /></li> <li>- What is class WebServer?<br /></li> <li>- What is class Clerk?<br /></li> </ul> </template> </category> <category> <pattern>HELP</pattern> <template> <random> <li>- What is the goal for AIML?<br /></li> <li>- Who is the botmaster?<br /></li> <li>- How can I create my own chat robot?<br /></li> <li>- How difficult is it to create a chat robot?<br /></li> <li>- Does ALICE learn?<br /></li> <li>- Does ALICE think?<br /></li> <li>- What is the theory behind ALICE?<br /></li> <li>- Can probability (statistics, weights, neural networks, or fuzzy logic) improve bots?<br /></li> <li>- Can I have a private conversation with ALICE?<br /></li> <li>- How do I install ALICE?<br /></li> <li>- What is the difference between B and C?<br /></li> <li>- How do I download program B?<br /></li> <li>- How do I run program B?<br /></li> <li>- What does "Send" do?<br /></li> <li>- What does "Clear" do?<br /></li> <li>- What is program Bawt?<br /></li> <li>- Does program B run under Windows?<br /></li> <li>- Does program B run on a Mac?<br /></li> <li>- Does program B run under Linux?<br /></li> <li>- Does program B run under XYZ?<br /></li> <li>- How much memory do I need to run program B?<br /></li> <li>- How do I install ALICE on Windows?<br /></li> <li>- What do you mean by the command "java B"? <br /></li> <li>- I tried running "java B" and I got a "bad command or file name". <br /></li> <li>- How do I uninstall ALICE from my system?<br /></li> <li>- Can I create a language-specific installtion?<br /></li> <li>- How does the Personality Wizard work?<br /></li> <li>- Can I change the name of the robot?<br /></li> <li>- How can I customize my robot?<br /></li> <li>- How do I know what categories to add?<br /></li> <li>- What does "Classify" do?<br /></li> <li>- What does "Quick Targets" do?<br /></li> <li>- What does "More Targets" do?<br /></li> <li>- What does the File menu do?<br /></li> <li>- What does the Edit menu do?<br /></li> <li>- What does the Options menu do?<br /></li> <li>- What is the Botmaster menu?<br /></li> <li>- What does "Help" do?<br /></li> <li>- What is on the Help menu?<br /></li> <li>- Do I have to use the GUI to enter AIML content?<br /></li> <li>- What are 7 steps to creating content?<br /></li> <li>- How can I merge two chat robots together?<br /></li> <li>- What if I don't want to discard duplicate categories?<br /></li> <li>- How can I create a new robot personality?<br /></li> <li>- What are all the options for program B?<br /></li> <li>- Why is the format of the options (globals.txt) so strange?<br /></li> <li>- How does the web server work?<br /></li> <li>- How can I get a "permanent" DNS name?<br /></li> <li>- How can I keep my computer connected all the time?<br /></li> <li>- Does the web server have to run on port 2001?<br /></li> <li>- Does program B serve HTML files?<br /></li> <li>- What files are needed to run the program B web server?<br /></li> <li>- Can I test the robot offline on my desktop?<br /></li> <li>- Can I run program B in the background on a NT Server?<br /></li> <li>- How can I run ALICE on a Mac offline?<br /></li> <li>- How can I run the ALICE web server on a Mac?<br /></li> <li>- How can I use the MS Agent Interface?<br /></li> <li>- Can you help me debug the animated agent? <br /></li> <li>- Can I speak to the robot with voice input?<br /></li> <li>- How does ALICE keep track of conversations?<br /></li> <li>- Can the virtual IP be the real IP?<br /></li> <li>- Can I run the web server as a daemon process?<br /></li> <li>- How does ALICE remember clients between sessions?<br /></li> <li>- How does the Applet work?<br /></li> <li>- How does the Applet differ from the application?<br /></li> <li>- How do I create an Applet?<br /></li> <li>- List twelve basic Applet tips for AIML users<br /></li> <li>- Can the AppletHost use a symbolic DNS name instead of an IP number?<br /></li> <li>- What files do I need to run the Applet?<br /></li> <li>- Does the Applet record dialogues?<br /></li> <li>- Can I analyze the dialogues collected by the Applet?<br /></li> <li>- Can the applet record a dialog.txt file on the server?<br /></li> <li>- I am still having problems with the applet<br /></li> <li>- Can you give me any help debugging the Applet?<br /></li> <li>- What is AIML?<br /></li> <li>- What is XML?<br /></li> <li>- What is a category?<br /></li> <li>- What is a pattern?<br /></li> <li>- What is a template?<br /></li> <li>- Can you give me a quick primer on AIML?<br /></li> <li>- What is <that>?<br /></li> <li>- How do I use "that"?<br /></li> <li>- What is <load filename="X"/>?<br /></li> <li>- What happens to contractions and punctuation?<br /></li> <li>- How are the patterns matched?<br /></li> <li>- Do the categories need to be in alphabetical order by pattern?<br /></li> <li>- How are the categories stored?<br /></li> <li>- Is there a way to use the GUI interface to add one category at a time? <br /></li> <li>- Can I build on top of the ALICE code rather than changing it?<br /></li> <li>- What's new in AIML?<br /></li> <li>- What is <star>?<br /></li> <li>- What is a symbolic reduction?<br /></li> <li>- What are the get methods?<br /></li> <li>- What are the set methods?<br /></li> <li>- How do I use the pronoun tags?<br /></li> <li>- What is the <topic> tag?<br /></li> <li>- Where does the <topic> tag appear?<br /></li> <li>- How do I use the <topic> tag?<br /></li> <li>- I still don't get "it"<br /></li> <li>- Can I create more AIML tags?<br /></li> <li>- What is are the <person> tags?<br /></li> <li>- How does the <condition> tag work?<br /></li> <li>- How does the random function work?<br /></li> <li>- What is the <person/> tag?<br /></li> <li>- What is the <person2/> tag?<br /></li> <li>- What is "gossip" ?<br /></li> <li>- What is the <personf/> tag?<br /></li> <li>- What's the <srai> tag?<br /></li> <li>- Could you explain the <srai> tag a little more?<br /></li> <li>- How recursive is AIML?<br /></li> <li>- What are "justthat" and "justbeforethat"<br /></li> <li>- How can I insert a transcript in the robot reply?<br /></li> <li>- Can I run shell commands from AIML scripts?<br /></li> <li>- How can I restrict remote clients from running programs on my computer?<br /></li> <li>- Can I insert dynamic HTML into the robot reply?<br /></li> <li>- Can I include JavaScript in the robot reply?<br /></li> <li>- What is <think>?<br /></li> <li>- What is the DTD for AIML?<br /></li> <li>- Do I need to know about the Java classes?<br /></li> <li>- How does program B work?<br /></li> <li>- What is the class structure of program B?<br /></li> <li>- I tried to compile prorgam B and got a lot of warnings.<br /></li> <li>- What are deprecated APIs?<br /></li> <li>- What is class Globals?<br /></li> <li>- What is class StringSet?<br /></li> <li>- What is class StringSorter? <br /></li> <li>- What is class StringHistogrammer?<br /></li> <li>- What is class StringRanker?<br /></li> <li>- What is class Brain?<br /></li> <li>- What is the Responder interface?<br /></li> <li>- What is the low level interface to program B?<br /></li> <li>- Lower, Lower<br /></li> <li>- What is class IntSet?<br /></li> <li>- What is class SortedIntSet?<br /></li> <li>- What is class Substituter?<br /></li> <li>- What is class Unifier?<br /></li> <li>- What is class Parser?<br /></li> <li>- What is class AliceReader?<br /></li> <li>- What is class Classifier?<br /></li> <li>- What is class LineClassifier?<br /></li> <li>- What is class Dialogue?<br /></li> <li>- What is class Access?<br /></li> <li>- What is class B?<br /></li> <li>- What is class Bawt?<br /></li> <li>- What is class Blet?<br /></li> <li>- What is class Kid?<br /></li> <li>- What is class RobotCommunicator?<br /></li> <li>- What is class Loader?<br /></li> <li>- What is class WebServer?<br /></li> <li>- What is class Clerk?<br /></li> </random> </template> </category> <category> <pattern>HOW ARE THE CATEGORIES STORED</pattern> <template> <br /> If your session with program B included a "Classify" routine, then<br /> the AIML script is stored in order of category activation rank.<br /> In other words, program B stores<br /> the most frequently accessed category (usually '*') first, the second<br /> most frequently next, and so on. If a number of categories have the<br /> same activation count, program B saves them in alphabetical order by<br /> pattern. Hence, if the session did not include a "classify" routine,<br /> the program stores all the categories in alphabetical order by pattern<br /> (because they all have an activation count of zero).<br /> <br /> One reason to store the categories in order by activation is to<br /> make the Applet interface more natural. Because the Applet interface<br /> starts simultaneously with a thread to load the robot source file,<br /> the Applet client can talk with the robot before all the categories<br /> are fully loaded. Given that the interlocutor is more likely to<br /> say something that activates a more frequently activated category,<br /> it makes sense to transmit these categories first. Storing the<br /> *.aiml files in order of category activation achieves the desired effect.<br /> The Applet loads the most frequent categories first, and continues<br /> loading in the background while the conversation begins.<br /> <br /> </template> </category> <category> <pattern>HOW ARE THE PATTERNS MATCHED</pattern> <template> <br /> Program B stores the categories in alphabetical order by pattern.<br /> When a client enters an input, the program scans the categories<br /> in reverse order to find the best match. By comparing the<br /> input with the patterns in reverse alphabetical order, the algorithm<br /> ensures that the most specific pattern matches first. "Specific"<br /> in this case has a formal definition, but basically it means that<br /> the program finds the "longest" pattern matching an input.<br /> <br /> The wild-card character "*" comes before "A" in alphabetical <br /> order. For example, the "WHAT *" pattern is more general than "WHAT IS *".<br /> The default pattern "*" is first in alphabetical order and the<br /> most general pattern. For convenience AIML also provides a<br /> variation on "*" denoted "_", which comes after "Z" in alphabetical<br /> order.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I CREATE A NEW ROBOT PERSONALITY</pattern> <template> <br /> There is a lot of flexibility in robot personality design with AIML. <br /> You can add to any of the existing AIML files, modify or delete them, <br /> create your own, or use the GUI tools to analyze the log files <br /> and create new categories. One simple method is to create your own <br /> Specialty.aiml file so that you can always get the latest copies <br /> of the ALICE files. Load your Specialty.aiml first in the root<br /> AIML file (usually B.aiml) so that its categories have priority over ALICE's.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I CREATE MY OWN CHAT ROBOT</pattern> <template> <br /> The secret to chat bot programming, if there is one, is what Simon<br /> Laven called "continuous beta testing". Program B runs as a server<br /> and collects dialog on the web. The program provides the chat bot <br /> developer with a tool called "classify dialogues", that tests the current <br /> robot with the history of accumulated human queries. Moreover, the program<br /> suggests new categories automatically, for the botmaster to refine.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I CUSTOMIZE MY ROBOT</pattern> <template> <br /> AIML provides several tags useful to quickly clone<br /> a chat robot from ALICE with a distinct "personality":<br /> <br /> <gender/> the robot's gender<br /> <location/> the robot's location<br /> <birthday/> the robot's birthday<br /> <botmaster/> the botmaster's name<br /> <br /> Together with the previously discussed <name/>, these<br /> tags allow you to quickly create a clone from the ALICE<br /> Brain with a separate identity from ALICE.<br /> <br /> All the personality tag values can be modifed through<br /> the Personality Wizard. The tag values can also be<br /> changed with the Options Menu in program B. Use "Show Options" <br /> and "Save Options" to customize your chat robot.<br /> <br /> To test the new features, we created a male robot named<br /> Brute (because "all men are brutes") born on August 18, 1999.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I GET A PERMANENT DNS NAME</pattern> <template> <br /> You can buy a fixed IP address from an ISP provider, but suppose<br /> you want run a chat robot (or other server) from your home over an<br /> ordinary ISP connection? Or suppose you want to carry it around on<br /> your notebook PC, and plug it in anywhere in the world?<br /> <br /> One solution is a dynamic IP registry service by Dynip (www.dynip.com).<br /> They offer a service that allows you to register your computer<br /> with their server so that you always receive the same DNS name,<br /> for example alicebot.dynip.com. Every time you connect to your<br /> ISP, dynIP automatically associates your dynamic IP address with<br /> your permanent DNS name.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I INSERT A TRANSCRIPT IN THE ROBOT REPLY</pattern> <template> <br /> The purpose of <get_dialogue/> is to give the client a transcript of<br /> his or her conversation with ALICE. Unfortunately this feature was<br /> advertised in a press article before we had a really efficient<br /> implementation, and the large number of dialogue requests bogged<br /> down the server. So for now <get_dialogue/> just displays a warning.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I KEEP MY COMPUTER CONNECTED ALL THE TIME</pattern> <template> <br /> Running a web server from home can be frustrating if your ISP<br /> automatically detects periods of "inactivity" or hangs up your<br /> connected after a fixed interval like 12 hours. Check out the<br /> Rascal program from Basta computing (www.basta.com) which runs<br /> as a watchdog to keep your Windows machine connected 24/7.<br /> <br /> Another alternative is to use the program B applet, called Blet.java.<br /> <br /> A third alternative is the ALICE Servlet. Some ISPs will<br /> allow you to install a Servlet on their sever. <br /> <br /> </template> </category> <category> <pattern>HOW CAN I MERGE TWO CHAT ROBOTS TOGETHER</pattern> <template> <br /> There are two ways to merge robots together. First, you can<br /> use the File menu option "merge" to directly load the contents<br /> of another bot file. You may see a lot of "duplicate key<br /> discarded" warnings but these can be ignored because the program<br /> is simply eliminating overlapping content. <br /> <br /> Another method is to use the <load filename=X/> tag. <br /> Suppose you load two or more files with the load tag,<br /> and those files contain redundant duplicate keys.<br /> Which categories get the priority? The answer is: it depends<br /> on the order of the <load> tags used to load the AIML files.<br /> If your B.aiml contains:<br /> <load filename="Brain.aiml"/><br /> <load filename="German.aiml"/><br /> then the categories from "Brain" have priority, and duplicates<br /> in "German" are discarded. If the order is the opposite, German <br /> categories have priority and Brain's duplicates are discarded.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I RESTRICT REMOTE CLIENTS FROM RUNNING PROGRAMS ON MY COMPUTER</pattern> <template> <br /> If your reply contains the markup<br /> <br /> <system>yourcammand <get_ip/></system> <br /> <br /> then the robot will insert the (virtual) client IP into the command<br /> line argument for "yourcommand". Then it is up to "yourcommand" to<br /> enforce access privileges.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I RUN ALICE ON A MAC OFFLINE</pattern> <template> <br /> First open folder B and change all the IP's in the files two files Blet.amil<br /> and Bletemplate.aiml to 127.0.0.1<br /> <br /> Also in folder B add the following three lines at the end of the file<br /> header.html.<br /> <br /> <H1>Welcome to A. L. I. C. E.</H1><br /> <IMG SRC="ALICEBot.jpg"><br /> <BR><br /> <br /> Also in folder B change the following three parameters in the file<br /> globals.txt to the values shown:<br /> <br /> AppletHost=127.0.0.1<br /> CodeBase=http://127.0.0.1/B<br /> Advertize=false<br /> <br /> Also in the globals.txt file remove the line:<br /> ACFURL=http-//microsoft.com/agent2/chars/robby/robby.acf<br /> <br /> Next open your TCP/IP control panel and set up a new configuration named<br /> Alice perhaps.<br /> <br /> In the TCP/IP control panel select connect via: Ethernet built-in if you have<br /> it if not you may have to experiment.<br /> <br /> Then select Configure Manually.<br /> <br /> And finally set the IP Address: to 127.0.0.1 as well as the Name server addr.<br /> <br /> Double click the newly saved A.L.I.C.E. application to bring up the botmaster<br /> panel and Java Console.<br /> <br /> As A.L.I.C.E. loads, read the information messages scrolling by in the Java<br /> Console and record the port number that the web server (started by A.L.I.C.E.)<br /> is listening on, probably 2001.<br /> <br /> Start up your preferred browser.<br /> <br /> Leave browser in online mode.<br /> <br /> Enter http://127.0.0.1:2001 (i.e. the localhost's IP)<br /> <br /> or<br /> <br /> Enter http://localhost:2001 (I've not always been successful with this one)<br /> <br /> Hit return to send the IP.<br /> <br /> The A.L.I.C.E. transaction page should appear in your browser's window and<br /> you can talk to Alice.<br /> <br /> <br /> </template> </category> <category> <pattern>HOW CAN I RUN THE ALICE WEB SERVER ON A MAC</pattern> <template> <br /> To run Alice online:<br /> <br /> Connect your Mac to a network.<br /> <br /> Double click the newly saved A.L.I.C.E. application to bring up the botmaster<br /> panel and Java Console.<br /> <br /> As A.L.I.C.E. loads, read the information messages scrolling by in the Java<br /> Console and record the port number that the web server (started by A.L.I.C.E.)<br /> is listening on, probably 2001.<br /> <br /> Start up your preferred browser.<br /> <br /> Get your IP from the TCP/IP control panel.<br /> <br /> Enter your IP followed by a colon and then the port number read from the Java<br /> Console, e.g. http://nn.nnn.nn.nnn:2001<br /> <br /> or<br /> <br /> Enter http://127.0.0.1:2001 (i.e. the localhost's IP)<br /> <br /> or<br /> <br /> Enter http://localhost:2001<br /> <br /> Hit return to send the IP.<br /> <br /> The A.L.I.C.E. transaction page should appear in your browser's window and<br /> you can talk to Alice.<br /> <br /> </template> </category> <category> <pattern>HOW CAN I USE THE MS AGENT INTERFACE</pattern> <template> <br /> Select the menu item Options/Toggle MS Agent. This sets the<br /> output HTML to a format that includes commands to run MS Agent.<br /> <br /> The client may activate the agent if she receives a template<br /> with the <set_animagent/> tag. The free ALICE download includes<br /> a couple of example categories using this tag. Try asking<br /> ALICE, "Can you speak?". In another demo ALICE imitates<br /> the famous fictional AI HAL from 2001: A Space Odyssey.<br /> <br /> Client: Tell me about yourself<br /> Robot: I am an artificial linguistic entity. I was created <br /> by Jon Baer at Bethlehem, Pennsylvania, <br /> on November 23, 1995. He taught me to sing a song. <br /> Would you like me to sing it for you?.<br /> Client: yes<br /> Robot: Ahem. It's called, "Daisy." (Agent sings "Daisy")<br /> <br /> The MS Agent VB script appears as embedded HTML in the client<br /> reply. To verify the script, use the browser "View Page Source"<br /> menu item. <br /> <br /> On most newer browsers, the agent software will download <br /> automatically after the script starts. The download may take<br /> several minutes, depending on the speed of the connection.<br /> Clients should be warned that the download is slow. Also,<br /> the agent software download will display one or more licenses<br /> in Dialog boxes. You may not want to accept the terms of the<br /> MS agent software licenses. <br /> <br /> </template> </category> <category> <pattern>HOW DIFFICULT IS IT TO CREATE A CHAT ROBOT</pattern> <template> <br /> Not difficult. If you can write HTML, you can write AIML (Artificial<br /> Intelligence Markup Language). Here is an example of a simple but<br /> complete chat robot in AIML:<br /> <br /> <alice><br /> <category><br /> <pattern>*</pattern><br /> <template> Hello! </template><br /> </category><br /> </alice><br /> <br /> The tags <alice>...</alice> indicate that this markup contains a<br /> chat robot. The <category> tag indicates an AIML category, the<br /> basic unit of chat robot knowledge. The category has a <pattern><br /> and a <template>. The pattern in this case is the wild-card<br /> symbol '*' that matches any input. The template is just the text<br /> "Hello!" As you may have guessed, this simple chat robot just <br /> responds by saying "Hello!" to any input.<br /> <br /> You can get started with AIML knowing just the three tags<br /> <category>, <pattern> and <template>; much like you may have<br /> started with HTML knowing only <a>, <img> and <h1>.<br /> <br /> </template> </category> <category> <pattern>HOW DO I CREATE AN APPLET</pattern> <template> <br /> Go to the Options menu and select "Show Options." You need<br /> to change the values of "AppletHost" and "CodeBase" to the<br /> correct IP address and directory for your applet host. <br /> Many people want to post the applet on their web site.<br /> In that case, change the IP address "206.184.206.210" to<br /> the name or IP address of the web server. Change the<br /> directory path "/B" in "CodeBase" to your directory on<br /> the remote server. Save the changes with "Save Options."<br /> <br /> Select "Create Applet" from the options menu to create<br /> the "index.html" and "Blet.aiml" files needed to run<br /> your applet. The program displays the contents of <br /> "index.html" in your text area.<br /> <br /> Use a file transfer utility like FTP to upload the<br /> class files (or jar file--see "What files do I need to<br /> run the Applet") to your web server. <br /> <br /> </template> </category> <category> <pattern>HOW DO I DOWNLOAD PROGRAM B</pattern> <template> <br /> Create a Directory (or Folder) on your machine to download<br /> the B.zip file. When you click on "B.zip" the browser<br /> should ask you where you want to save the file. Select the<br /> directory you created and save B.zip to that folder. <br /> <br /> Once you've downloaded, You can use "unzip B.zip" to extract the files.<br /> If you don't have this unzip command on your machine, you can get<br /> a free one from Winzip (www.winzip.com) to unzip the "B.zip" file.<br /> <br /> If you want to get into the Java source code, you need a<br /> Java 1.17 (or higher) development kit release.<br /> Go to java.sun.com for a free one. The program source code<br /> and all associated files are stored in the single "zip" file<br /> called B.zip. To extract the files use the command<br /> "unzip B.zip" (assuming you have "unzip" on your machine).<br /> <br /> </template> </category> <category> <pattern>HOW DO I INSTALL ALICE</pattern> <template> <br /> If you purchased a commercial version of ALICE on CD ROM or<br /> over the web, installation should be very easy. These versions <br /> usually have their own self-extracting and install software. <br /> You can install the ALICE program with just a mouse click and<br /> activate it with a desktop icon. <br /> <br /> If you bought a commercial version of ALICE with a self-installer, <br /> you can skip this section and go on to "Creating Content".<br /> <br /> </template> </category> <category> <pattern>HOW DO I INSTALL ALICE ON WINDOWS</pattern> <template> <br /> Download Alicebot.Net at www.alicebot.net.<br /> </template> </category> <category> <pattern>HOW DO I KNOW WHAT CATEGORIES TO ADD</pattern> <template> <br /> After you collect some dialogue, run "Classify" and "Quick Targets".<br /> This will tell you the most frequently asked patterns that do not<br /> already have specific responses. The "Target" functions display new<br /> categories with proposed patterns and template fields filled with<br /> the name of another category. Delete the template information and fill<br /> in a new response. You can also edit the pattern to simplify it or<br /> generalize it with a "*" operator.<br /> <br /> </template> </category> <category> <pattern>HOW DO I RUN PROGRAM B</pattern> <template> <br /> Use the command "java B" to start the program. On some Windows<br /> machines the Java runtime engine is started with the command<br /> "jview" instead of "java". If "jview B" does not work, try<br /> "jview Bawt".<br /> <br /> Run program B and notice that the program creates an Edit View<br /> text window. By default, program B loads the chat robot ALICE<br /> (stored in B.aiml).<br /> <br /> </template> </category> <category> <pattern>HOW DO I UNINSTALL ALICE FROM MY SYSTEM</pattern> <template> <br /> If you installed ALICE on Windows with a commercial installer like<br /> InstallShield Java Edition, then go to the start menu and<br /> select "Control Panel". Click on the control panel item called<br /> "Add/Remove Programs". Select ALICE from the list of installed<br /> software and choose "Uninstall".<br /> <br /> All the files of ALICE are stored in one directory on your computer<br /> (or folder) usually called "B" but maybe something else depending<br /> on the name you chose when you downloaded ALICE. In any case, <br /> ALICE will not change or damage any other files on your system. <br /> To remove ALICE from your computer, simply remove this folder. <br /> Delete it, or drag it to your trash bin and select "Empty trash"<br /> (or "Empty Recycle Bin").<br /> <br /> If you cannot find the folder where ALICE resides, use the Finder<br /> to locate the file called "B.aiml" on your file system. The "B.aiml"<br /> file is in the same directory as all the ALICE files. If this file does<br /> not exist, then ALICE is probably not installed on your computer.<br /> <br /> Because ALICE is a platform-independent Java application, it does<br /> not rely on the Windows Registry or other Windows-specific features.<br /> You can assume ALICE will leave your MS Windows Registry and <br /> other Windows system files untouched.<br /> <br /> Conceivably if ALICE has run for a long time on your computer, and<br /> you deliberately used the "Save Options" menu item to change the <br /> name or location of her files to something other than the default values, <br /> then there is a slight chance that there could be a few ALICE <br /> files scattered around your disk. Please refer to the DISCLAIMER <br /> at the beginning of DON'T READ ME. <br /> <br /> </template> </category> <category> <pattern>HOW DO I USE THAT</pattern> <template> <br /> The AIML tag <that> refers to the robot's previous<br /> reply. There are two forms of the <that> tag:<br /> a paired form <that>...</that> appearing in a<br /> category, and an atomic form <that/> always appearing<br /> in a template. Often we can use <that/> to find<br /> an opportunity to create a category with <that></that>.<br /> <br /> One of the default replies to the input "WHY" is<br /> "<that/>"? Why? This default produces the following<br /> dialogue fragment:<br /> <br /> Robot: Do not ask me any more questions please.<br /> Client: WHY<br /> Robot: "Do not ask me any more questions please"? Why?<br /> <br /> The botmaster notices the fragment and creates the<br /> new AIML category:<br /> <br /> <category><br /> <pattern>WHY</pattern><br /> <that>DO NOT ASK ME ANY MORE QUESTIONS PLEASE</that><br /> <template>Because I would rather talk about you.</template><br /> </category><br /> <br /> Now the next client who asks "WHY" to the robot's<br /> request will active the new <that> category:<br /> <br /> Robot: Do not ask me any more questions please.<br /> Client: WHY<br /> Robot: Because I would rather talk about you.<br /> <br /> This style of conversational analysis does not<br /> presuppose that we know when the client will<br /> say "WHY"; rather it looks backward to identify<br /> cases where the "WHY" appeared following one<br /> of the robot's statements. Having identified<br /> the conversation point, the botmaster creates<br /> the new category.<br /> <br /> </template> </category> <category> <pattern>HOW DO I USE THE LT TOPIC GT TAG</pattern> <template> <br /> The concept is that the botmaster uses the <settopic> tags to set <br /> the current topic being discussed. Once the topic is set, when <br /> the client types in a statement for ALICE to find a response for, <br /> the categories defined within the <topic> tags matching the <br /> current topic will be searched first-- before any of the non-<br /> topic categories, or the default categories. If there is not a <br /> matching category defined in the current topic, then any <br /> categories that are not defined in topic tags are searched. As <br /> mentioned before, you can create categories with identical <br /> <pattern> phrases in different topics, each with different <br /> responses that cater to the current topic.<br /> <br /> An proof of concept example:<br /> A very useful topic entry might be the default "*" input for <br /> specific topics. If ALICE were set up on a pet store web site <br /> and a person was talking to ALICE about dogs, a useful entry <br /> might be:<br /> <br /> <topic name="DOGS"><br /> <br /> <category><br /> <pattern> * </pattern><br /> <template><br /> <random><br /> <li> Dogs are one of the most popular pets to have.</li><br /> <li> Have you ever met a Chihuahua you didn't like?</li><br /> <li> What else do you know about dogs? </li><br /> <li> Do you have any questions about dogs? </li><br /> </random><br /> </template><br /> </category><br /> <br /> //more dog categories....<br /> <br /> </topic><br /> <br /> Normally there would be many entries in a topic, but in this <br /> example, we simply entered the default "*". In this case, if the <br /> person said something that ALICE didn't have a specific <br /> programmed response for, she could still respond intelligently <br /> within the current topic. (Note: this is all assuming there are <br /> existing categories that might set the current topic to "DOGS")<br /> <br /> Also, though topics can only have one name, they can contain the <br /> wild characters "*" or "_" just like a pattern. Also, while <br /> sticking with the pattern criteria, only one wildcard character <br /> is allowed per name. This would allow topics like "CARING FOR <br /> DOGS" or "GROOMING DOGS" to also fall into the "_ DOGS" topic. <br /> As with patterns, the more specific topics would gain preference <br /> over the wildcarded topic. This means that if the topic is <br /> currently "GROOMING DOGS" and yet there is not a programmed <br /> response for the input in that category, then "_ DOGS" would be <br /> checked, and then next the default categories.<br /> <br /> <br /> </template> </category> <category> <pattern>HOW DO I USE THE PRONOUN TAGS</pattern> <template> <br /> The guiding principle in creating content is dialogue analysis.<br /> The principle applies to pronouns like "it", "he" and "we". This<br /> example shows how a couple of tricks helps us find places where<br /> the pronoun tags improve the dialogue.<br /> <br /> <br /> Client: HELLO<br /> Robot: Hi there!<br /> Client: HOW ARE YOU<br /> Robot: I am functioning within normal parameters.<br /> Client: I AM BROKE<br /> Robot: Do you mean your name is broke.<br /> Client: NO I HAVE NO MONEY<br /> Robot: I see. Would you like to have it?<br /> Client: YES<br /> Robot: "Would you like to have it"? Yes?<br /> <br /> One of the default responses to a YES pattern is <br /> "<that/>?" Yes?<br /> <br /> The conversation breaks down somewhat at this point, but<br /> the robot nonetheless conveys an illusion of understanding<br /> the client's response. With its response, the robot is really <br /> telling the botmaster to create a new "YES" reply specific <br /> to this situation:<br /> <br /> <category><br /> <pattern>YES</pattern><that>WOULD YOU LIKE TO HAVE IT</that><br /> <template>Where do you usually get <get_it/>?</template><br /> </category><br /> <br /> Now the next time we run the dialogue it ends with:<br /> <br /> Client: NO I HAVE NO MONEY<br /> Robot: I see. Would you like to have it?<br /> Client: YES<br /> Robot: Where do you usually get money?<br /> <br /> There is no automated technique to finding the conversation<br /> points where the pronoun tags improve the flow of conversation.<br /> This is the "art" of AIML programming. The example shown here<br /> with '"<that/>?" Yes?' exemplifies one approach to finding these<br /> conversation way points. There are no doubt countless other<br /> tricks like this, and the field is wide open to linguists and <br /> writers to help us uncover them.<br /> <br /> </template> </category> <category> <pattern>HOW DOES ALICE KEEP TRACK OF CONVERSATIONS</pattern> <template> <br /> Originally ALICE used IP addresses to keep track of clients.<br /> Assuming that everyone chatting with ALICE has a fixed IP<br /> address, at least for the duration of their conversation,<br /> this technique works successfully. Each IP address is a key<br /> into a hashtable (or database) that stores the client's<br /> dialogue, name, and values of pronouns and other AIML values.<br /> <br /> Unfortunately, many clients have "dynamic IP addressing" enforced<br /> by their ISP provider. AOL and MS WebTV are two notorious examples: <br /> each successive client transaction appears to come from a different<br /> host. For this reason, program B uses a form of "virtual IP"<br /> addressing to track dialogues. <br /> <br /> The form in index.html (and the ALICE home page) contains a<br /> tag that creates a "hidden" parameter called "virtual" with<br /> an initial value of "none." The server assigns a unique name<br /> to the value of "virtual", which then becomes a hidden variable<br /> in the client's HTML form. Each successive client transaction<br /> contains this virtual IP address; the server uses it as a key<br /> to index the conversation.<br /> <br /> </template> </category> <category> <pattern>HOW DOES ALICE REMEMBER CLIENTS BETWEEN SESSIONS</pattern> <template> <br /> The persistence of memory in ALICE is inherited from<br /> the Java Properties class. The program B class Classifier<br /> saves the client name, age, location and other properties<br /> in a set of Properties lists. These Properties inherit<br /> the Java load and store methods. Program B uses the load<br /> and store methods to save the client properties in a set of<br /> files with names ip_name.txt, ip_age.txt, ip_location.txt <br /> and so on. If these files become too large or bothersome,<br /> there is no harm deleting or editing them, or moving them<br /> to another directory. <br /> <br /> The Applet requires no memory of the client properties, because<br /> the applet has only the one client, and in any case remains in <br /> memory (at least for the lifetime of the client's browser cache).<br /> <br /> <br /> </template> </category> <category> <pattern>HOW DOES PROGRAM B WORK</pattern> <template> <br /> The basic loop of program B is to accept an input,<br /> either from the GUI or from the Web, to<br /> preprocess that input and segment it into sentences,<br /> and, for each sentence, to find the best match among<br /> the patterns, and to return the corresponding reply.<br /> Each reply is itself an AIML template, in effect a mini-<br /> program that tells program B how to construct the reply.<br /> <br /> The algorithm is thus divided into a matching phase<br /> and a response evaluation phase. In fact these two<br /> phases interleave, because the response may evoke<br /> a recursive call to the pattern matcher with the<br /> <srai> or <sr/> tags. <br /> <br /> </template> </category> <category> <pattern>HOW DOES THE APPLET DIFFER FROM THE APPLICATION</pattern> <template> <br /> The Applet runs on the client's computer; the server runs<br /> on your host machine. The applet has fewer privileges and<br /> therefore a simpler user interface than the Application,<br /> which uses menus and buttons to control server-side functions.<br /> The Applet may reside on any web server, such as one provided<br /> with an ISP account, but the application requires a 24/7<br /> connection to the Web. <br /> <br /> Internally, the primary difference between the two programs<br /> is that the Applet handles only one client conversation, <br /> while the application processes multiple client connections<br /> simultaneously. The Applet also suppresses all HTML (and any<br /> other XML) from the client response.<br /> <br /> </template> </category> <category> <pattern>HOW DOES THE APPLET WORK</pattern> <template> <br /> Program B supports the creation of both server-side and client-side <br /> chat robots. The server runs as a thread in program B. The <br /> client-side version is supported by an applet called Blet.java.<br /> <br /> The Applet Blet.java runs ALICE in a web browser, or with <br /> the Java tool appletviewer. The file "index.html" contains an<br /> example of the HTML Applet tag syntax needed to start<br /> the Applet. The command "appletviewer index.html" will start the<br /> Applet.<br /> <br /> You also have to create the file "index.html" and change the<br /> default value of the parameters "codebase" and<br /> "applethost" serve the Applet from your location.<br /> <br /> </template> </category> <category> <pattern>HOW DOES THE LT CONDITION GT TAG WORK</pattern> <template> <br /> This category illustrates the function of the<br /> (template-side) condition tag. The input pattern<br /> is "TEST COND":<br /> <br /> <category><br /> <pattern>TEST COND</pattern><br /> <template><br /> This category has two condition statements.<br><br /> The first is activated when you are on the host machine:<br><br /> <condition name="ip" value="localhost"><br /> You are the true botmaster.<br><br /> </condition><br /> The second condition is activated when you claim to<br /> be the botmaster.<br><br /> <condition name="name" value="* WALLACE"><br /> Imposter! You are not my real botmaster.<br><br /> </condition><br /> <br /> <br /> Two dialogues from different hosts show two<br /> possible outputs of this category:<br /> <br /> <br /> --------------------dialup.mindspring.com--------------<br /> Client: MY NAME IS DR WALLACE.<br /> Robot: OK I will call you Dr Wallace.<br /> Client: TEST COND.<br /> Robot: This category has two condition statements. <br /> The first is activated when you are on the host machine:<br /> The second condition is activated when you claim to be the botmaster.<br /> Imposter! You are not my real botmaster. <br /> That concludes our test of the condition tag.<br /> ---------------------localhost-------------------------<br /> Client: TEST COND.<br /> Robot: This category has two condition statements. <br /> The first is activated when you are on the host machine:<br /> You are the true botmaster.<br /> The second condition is activated when you claim to be the botmaster.<br /> That concludes our test of the condition tag.<br /> <br /> <br /> Note:<br /> <br /> 1. There may be multiple <condition> tags in the<br /> <template>. [But nesting doesn't work yet.]<br /> <br /> 2. The predname must be one of: it, ip, he, she, age,<br /> name, topic, gender, location, or one of the custom predicates<br /> defined in predicates.txt<br /> <br /> 3. The value string may contain an AIML pattern with up to<br /> one wild-card "*" symbol.<br /> <br /> 4. The test for the <condtion> being true uses <br /> Unifier.unify() to compare the stored predicate value<br /> with the value string. This is the same way<br /> <that> and <topic> work.<br /> <br /> 5. If the test returns true, then the response contains<br /> whatever is inside the <condition>...</condition> tags,<br /> otherwise those contents are blanked.<br /> <br /> <br /> </template> </category> <category> <pattern>HOW DOES THE PERSONALITY WIZARD WORK</pattern> <template> <br /> The simplest way to alter the content of the basic ALICE<br /> robot personality is to run the Personality Wizard on<br /> the "Options" menu (or in the Kid interface). <br /> This wizard asks the botmaster a series<br /> of questions to set the values of a set of robot<br /> personality tags including its name, gender, preferences<br /> and replies to very common questions.<br /> <br /> The Personality Wizard does not create any new AIML<br /> categories. The replies set the value of global tags<br /> like <location/> and <favorite_movie/> that might be<br /> used in many categories throughout the AIML knowledge<br /> base. The basic set of Wizard questions are collected<br /> in the file Personality.aiml. <br /> <br /> Hint: If you plan to use the Applet, avoid the double-quote (")<br /> character in the Personality Wizard.<br /> <br /> </template> </category> <category> <pattern>HOW DOES THE RANDOM FUNCTION WORK</pattern> <template> <br /> The random function is (so far) the only AIML method<br /> with a list argument. Its purpose is random selection<br /> of one of a set of text items. In "old-style" AIML the<br /> text appendage operator "+" also served as a list-item<br /> marker. In XML style we use the HTML <li> list-item<br /> tag. <br /> <br /> <random> <li>X1</li><li>X2</li> </random> Say one of X1 or X2 randomly<br /> <random><li>A</li><li>B</li><li>C</li></random> Say one of A, B or C randomly<br /> <br /> The <random> tag has higher precedence than other AIML tags.<br /> Moreover, the AIML parser interprets only the markup inside<br /> the selected random list item. AIML tags inside other list items<br /> are ignored.<br /> <br /> </template> </category> <category> <pattern>HOW DOES THE WEB SERVER WORK</pattern> <template> <br /> By default the web server starts on port 2001. This means you can<br /> access the web server through the URL http://localhost:2001 on<br /> your own machine. Find out your IP address or DNS name and tell<br /> your friends to connect to "http://yourcompany.com:2001".<br /> (One way to find out your IP address is by running "netstat -n"<br /> to view all your open TCP/IP connections).<br /> <br /> </template> </category> <category> <pattern>HOW MUCH MEMORY DO I NEED TO RUN PROGRAM B</pattern> <template> <br /> The source code compresses to as little as half a megabyte, including<br /> all the AIML files for nearly 16,000 categories. You may have downloaded<br /> a file of only around 500K. Plan to use a minimum 10 MB of hard disk space<br /> for the download directory. The hard disk requirements include not<br /> only the source code and Java class files, but also the dialogue files<br /> and other temporary files created by the robot.<br /> <br /> The RAM requirements vary depending on the size of your robot.<br /> To run the fully loaded ALICE chat robot with 16,000 categories<br /> you will need 64MB of memory. To do this and anything else at<br /> the same time on your system we recommend a minimum of 96MB. <br /> With less memory you can load a smaller robot. See the question<br /> below "What is <load filename="X"/>?"<br /> <br /> </template> </category> <category> <pattern>HOW RECURSIVE IS AIML</pattern> <template> <br /> Understanding recursion is important to understanding AIML.<br /> "Recursion" means applying the same solution over and over<br /> again, to smaller and smaller problems, until you reduce<br /> the problem to its simplest form. AIML uses the tags<br /> <sr/> and <srai> to implement recursion. The botmaster<br /> uses these tags to tell the robot how to respond to a<br /> complex sentence by breaking it down into the responses<br /> to simpler ones. <br /> <br /> Recursion can apply many times to a single input. Given<br /> the normalized input:<br /> <br /> ALICE CAN YOU PLEASE TELL ME WHAT LINUX IS RIGHT NOW <br /> <br /> an AIML category with the pattern "_ RIGHT NOW" matches first,<br /> reducing the input to:<br /> <br /> ALICE CAN YOU PLEASE TELL ME WHAT LINUX IS<br /> <br /> Another pattern ("<name/> *") reduces it to:<br /> <br /> CAN YOU PLEASE TELL ME WHAT LINUX IS<br /> <br /> And then:<br /> <br /> PLEASE TELL ME WHAT LINUX IS <br /> <br /> reduces to:<br /> <br /> TELL ME WHAT LINUX IS<br /> <br /> and finally to:<br /> <br /> WHAT IS LINUX<br /> <br /> </template> </category> <category> <pattern>I AM STILL HAVING PROBLEMS WITH THE APPLET</pattern> <template> <br /> If your applet is looking at Blet.aiml and your web space is at <br /> www.myplace.org and your aiml files are in dirctory /alice/ then<br /> your load statements in Blet.aiml would look similar to this:<br /> <load url="http://www.myplace.org/alice/Atomic.aiml"><br /> <br /> If this is what you have, then open up the "Java Console" window<br /> in your browser to get whatever debugging information is coming<br /> out. The Java console will display any error messages or<br /> exceptions caught by program B. Please report these<br /> errors to the ALICE and AIML mailing list at<br /> alicebot.listbot.com.<br /> <br /> </template> </category> <category> <pattern>I STILL DO NOT GET IT</pattern> <template> <br /> Here is another example that might help clarify<br /> the meaning of the pronoun "it." <br /> <br /> The category with the pattern "DO YOU LIKE *" is<br /> a kind of default category for a whole class of<br /> inputs matching "Do you like X?", when the<br /> input does not match a category with a more specific <br /> pattern like "DO YOU LIKE CATS". No matter what the <br /> client says, we want the robot to remember that <br /> "it" stands for "X". <br /> <br /> Many inputs activate this default category, so<br /> the botmaster tries to create a variety of replies<br /> using the <random> tag twice. One interesting<br /> side-effect of the random tag is the evaluation<br /> of all AIML inside the <random>...</random><br /> expression, whether the random function selects<br /> the list item containing the markup or not. <br /> So if one item happens to contain a <set_it><br /> tag then the program will remember "it",<br /> even if the actual random reply is a different item.<br /> <br /> <category><br /> <pattern>DO YOU LIKE *</pattern><br /> <template><random><br /> <li>I don't know if I like <set_it><person/></set_it>. </li><br /> <li>I've heard other people say they like that. </li><br /> <li>Some people like it. </li><br /> <li>How old are you? </li><br /> <li>Do you like it? </li><br /> </random><br /> <random><br /> <li><for_fun/>. </li><br /> <li>I like pets. </li><br /> <li>I love horses. </li><br /> <li>I like trains. </li><br /> <li>But I like cats. </li><br /> <li>I really like ELIZA. </li><br /> <li>But I like to gossip. </li><br /> <li>I like science fiction. </li><br /> <li>Though I like cats and dogs. </li><br /> <li>My favorite thing is robots. </li><br /> <li>I love TV shows about robots. </li><br /> <li>But I think airplanes are cool. </li><br /> <li>I like to talk about movies and music. </li><br /> </random><br /> </template><br /> </category><br /> <br /> In the "DO YOU LIKE *" category one list item<br /> <br /> I don't know if I like <set_it><person/></set_it>. <br /> <br /> results in the pronoun "it" being bound to the value<br /> of "*", rewritten in third person. With this category,<br /> the robot has the following dialogue:<br /> <br /> Client: Do you like Dr. Who<br /> Robot: How old are you? I like to make new friends.<br /> <br /> The robot response is a combination of two random<br /> selections. But as a side effect, the pronoun value<br /> of "it" is now set to "Dr. Who". <br /> <br /> Client: It is a TV programme<br /> Robot: Dr. Who is a TV programme. I suppose that makes sense.<br /> <br /> To generate the final response, the robot applied its category:<br /> <br /> <category><br /> <pattern>IT IS A *</pattern><br /> <template><get_it/> is a <person/>. I suppose that makes sense.</template><br /> </category> <br /> <br /> The robot is correct that the reply does indeed make sense. Does<br /> this example suggest that in our human mind "it" is just a temporary<br /> storage register for a fragment of text? <br /> <br /> Extra credit: In the dialogue fragment above, why did the robot say <br /> "I like to make new friends"?<br /> <br /> </template> </category> <category> <pattern>I TRIED RUNNING JAVA B AND I GOT A BAD COMMAND OR FILE NAME</pattern> <template> <br /> You are using a Windows/DOS setup. If "jview B" does not work either,<br /> you may need to install Java on your computer. Go to java.sun.com <br /> and pick the one for your computer (Windows 95/98 or NT).<br /> <br /> If it still says "bad command" then possibly there is a problem with<br /> the CLASSPATH variable in AUTOEXEC.BAT. Make sure it is set to<br /> something like<br /> SET CLASSPATH=.;%CLASSPATH%<br /> (The single "." means the current working directory)<br /> and make sure the PATH is set to include the java home directory:<br /> SET PATH=c:\JDK1.2\bin;%PATH%<br /> <br /> </template> </category> <category> <pattern>I TRIED TO COMPILE PRORGAM B AND GOT A LOT OF WARNINGS</pattern> <template> <br /> The designers of Java and the designers of ALICE disagree<br /> on one stylistic point: Java designers believe in the<br /> "one file-one class" philosophy, at least for classes<br /> used outside their own source file. The ALICE engineers<br /> follow the opposite "one file-many classes" design principle,<br /> which allows us to group a number of logically related classes<br /> in a single file, such as Classifier.java. The Java compiler<br /> might complain about a class used outside its file, but<br /> these messages are just warnings.<br /> <br /> If you don't want to see the compiler warnings, run the<br /> compiler with the "-nowarn" flag:<br /> <br /> javac -nowarn *.java<br /> <br /> </template> </category> <category> <pattern>IS THERE A WAY TO USE THE GUI INTERFACE TO ADD ONE CATEGORY AT A TIME</pattern> <template> <br /> Yes. Do a "clear". Type in one category:<br /> <br /> <category><br /> <pattern>WHO IS JOHN</pattern><br /> <template>He is a really smart guy.</template><br /> </category><br /> <br /> Now do a "Add AIML". If you like the result, do a "Save Robot".<br /> <br /> If your name is not John, try replacing JOHN with<br /> your own name. Notice that the pattern is in all upper case.<br /> This is called "normalized form". We store patterns this way<br /> for efficiency. The template on the other hand consists of<br /> mixed case. <br /> <br /> You can also create a file of AIML, do a cut & paste, and then "Add AIML"<br /> to add more categories. Editing the source file directly is of course also<br /> useful. If you edit the source file, select "Load Robot" to load it.<br /> <br /> Try creating a text file with the category:<br /> <br /> <category><br /> <pattern>WHO IS JOHN WANG</pattern><br /> <template><br /> <random><br /> <li>He is a really smart guy.</li><br /> <li><set_he>John Wang</set_he> is a great father.</li><br /> </random><br /> </template><br /> </category><br /> <br /> Load the file into program B with the "File/Load Text File" <br /> menu item. Then select "Add AIML" from the Botmaster menu.<br /> <br /> </template> </category> <category> <pattern>LIST TWELVE BASIC APPLET TIPS FOR AIML USERS</pattern> <template> <br /> 1. Applets are notoriously hard to debug; you are not dumb.<br /> 2. An applet can work perfectly well in Appletviewer, but<br /> then break in the browser, for any number of reasons.<br /> 3. Let's get the terminology straight: the applet resides on<br /> an "originating host" but runs on a "target machine".<br /> 4. The browser is very picky because of the "security<br /> sandbox"--the browser doesn't trust Applets so they can't<br /> open files (and obey other restrictions) on the target machine.<br /> 5. The Applet MAY open a socket connection from the<br /> target machine to the originating host.<br /> 6. When you are debugging the applet, the target machine<br /> might be the same as the originating host (your computer).<br /> 7. When you post your applet to a remote web server,<br /> that server becomes the originating host.<br /> 8. You can use ftp to transfer the Applet files to the<br /> remote web server.<br /> 9. You must transfer ALL the applet's files <br /> to the originating host.<br /> 10. You must change the program B values of "CodeBase"<br /> and "AppletHost" (the originating host) to the name and<br /> location of the files on the remote server.<br /> 11. Use "Create applet" to create the "index.html" and<br /> "Blet.aiml" (make sure you have the latest release of B.zip)<br /> 12. We recommend placing all the *.class files into<br /> a single "Blet.jar" file (see DON'T READ ME).<br /> <br /> <br /> </template> </category> <category> <pattern>LOWER LOWER</pattern> <template> <br /> If you need even lower level access to the program B robot, <br /> you can request responses to individual sentences on a <br /> line-by-line basis. Inside multiline_response() there are <br /> calls to the Classifier.respond() method like:<br /> <br /> String response = respond(norm, hname);<br /> <br /> where "norm" is a normalized single-sentence input and hname is<br /> the virtual IP address of the client. <br /> <br /> Inside respond() we find the the method respondIndex(). The<br /> base class StringSet stores the strings in an indexed vector,<br /> and respondIndex() locates the index of the best matched category<br /> for the normalized input string.<br /> <br /> The loop inside respondIndex() scans through the categories<br /> in reverse alphabetical order by key, until it finds the best<br /> match. Because the "*" pattern comes first in alphabetical<br /> order, and is the most general pattern, respondIndex() will<br /> return zero when no more specific category matches.<br /> <br /> </template> </category> <category> <pattern>WHAT ARE 7 STEPS TO CREATING CONTENT</pattern> <template> <br /> 1. Run program B (ALICE Botmaster)<br /> <br /> 2. Under "Options", select "Show Options". <br /> Find the item called "AnalysisFile=" and<br /> change the value to the name of the dialogue<br /> file you want to analyze. The default file<br /> name is the same as the default log file<br /> name, "dialog.txt".<br /> <br /> 3. Press the "Classify" button. Wait<br /> several minutes while the program processes<br /> the data from your log file. When finished<br /> it will display a "brain activation" table<br /> showing the patterns that activated each<br /> category. (You can use "File/Save As Text File"<br /> to save this table to a file, if you want).<br /> <br /> 4. Now press the "Quick Targets" button.<br /> You will see a set of new categories created<br /> by the program. These are categories with<br /> patterns that have no specific response in the<br /> robot brain. With these categories you have <br /> 3 choices (A, B or C):<br /> <br /> (A) Delete the category. Many of the suggested<br /> categories are just nonsense or garbage inputs.<br /> Use your cursor and left mouse button to select <br /> the categories for deletion. <br /> The "delete" key will cut them.<br /> <br /> (B) Edit a new template. The information you<br /> see displayed in the <template> tags is actually<br /> the pattern of the default category into which<br /> this input was classified. For example you may see:<br /> <br /> <category><br /> <pattern>WHO IS 007</pattern><template>WHO IS *</template><br /> </category><br /> <br /> This tells us that the robot classified the client "WHO IS 007"<br /> as "WHO IS *". Use the cursor and left mouse button<br /> to cut the "WHO IS *", and replace it with a new template<br /> of your own design:<br /> <br /> <category><br /> <pattern>WHO IS 007</pattern><br /> <template><set_he>007</set_he> is James Bond, the<br /> famous fictional spy from the novels of Ian Fleming.<br /> </category><br /> <br /> (C) Edit a new pattern. Many of the patterns<br /> suggested by "Quick Targets" and "More Targets" are<br /> too specific, but with a little practise you<br /> can easily see how to generalize these suggestions<br /> with the "*" wild-card.<br /> <br /> For example you may see one like this:<br /> <br /> <category><br /> <pattern>WHO BOMBED PEARL HARBOR</pattern><br /> <template>WHO *</template><br /> </category><br /> <br /> The original response was based on "WHO *", which<br /> is too general for this topic. But the odds<br /> are small of anyone else using this exact pattern <br /> WHO BOMBED PEARL HARBOR when asking about the<br /> same topic. Think about the alternative ways<br /> of expressing the same question: <br /> "Who attacked Pearl Harbor?", "Who invaded Pearl<br /> Harbor?", "Who through deceit and subterfuge<br /> carried out an unscrupulous and unprovoked suprise<br /> attack on American forces at Pearl Harbor?"<br /> You can cover all of these inputs by generalizing<br /> the input pattern with the wild-card "*",<br /> which matches any word or sequence of words:<br /> <br /> <category><br /> <pattern>WHO * PERAL HARBOR</pattern><br /> <template>The Japanase <br /> attacked Pearl Harbor on December 7, 1941,<br /> "A day that will live in infamy" (FDR). <br /> <A href="http://www.pearlharbor.org">...<br /> </template><br /> </category><br /> <br /> Remember, the AIML pattern language allows<br /> at most one wild-card "*" per pattern.<br /> <br /> Of course, with choice (C) you have to<br /> edit the template as well as the pattern. <br /> <br /> 5. When finished with editing the suggested categories,<br /> use "Botmaster - Add AIML" to add the new AIML content.<br /> If you made any syntax errors, you can fix them<br /> and repeat the "Add AIML" as many times as needed.<br /> Be sure to do a "File - Save Robot" at this point<br /> also to back up your changes. This will save all of<br /> your new categories in the root robot file<br /> "B.aiml". <br /> <br /> 6. Use "More Targets" to find more new categories<br /> until the new suggestions are fruitless. Then, go<br /> back and start with "Classify" again (step [3]). <br /> <br /> 7. The responses you create should be a combination<br /> of a "conversational" response like "He is James<br /> Bond, the famous spy" and also provide some HTML<br /> hyperlinks where appropriate. <br /> <br /> </template> </category> <category> <pattern>WHAT ARE ALL THE OPTIONS FOR PROGRAM B</pattern> <template> <br /> There are robot personality options, animated agent options,<br /> log file and analysis options, and options for the web server<br /> and for the applet. Most of the time you won't need to change<br /> many of these values. For completeness, the entire set<br /> breaks down into:<br /> <br /> Robot options:<br /> <br /> Sign - Astrological sign<br /> Wear - clothing and apparel<br /> ForFun - What the robot does for fun<br /> BotFile - Root file of robot personality<br /> BotName - Robot name<br /> Friends - The robot's friends<br /> LookLike - The robot appearance<br /> Question - A random question<br /> TalkAbout - favorite subjects<br /> KindMusic - Favorite kind of music<br /> BoyFriend - Does the robot have a boyfriend?<br /> BotMaster - Robot author<br /> BotGender - male, female or custom<br /> GirlFriend - Does the robot have a girlfriend?<br /> BotLocation - Robot location<br /> BotBirthday - Robot activation date<br /> FavoriteBook - Robot's favorite book<br /> FavoriteFood - Robot's favorite food<br /> FavoriteSong - Robot's favorite song<br /> FavoriteBand - Robot's favorite band<br /> FavoriteMovie - Robot's favorite movie<br /> FavoriteColor - Robot's favorite color<br /> BotBirthplace - Robot's birthplace<br /> <br /> MS Agent options:<br /> <br /> Animagent - true or false for activating MS Agent VB scripting<br /> ACFURL - file or URL location of MS Agent software<br /> <br /> Log/Analysis options:<br /> <br /> AnalysisFile - file selected for log file analysis<br /> LogFile - file for recording robot dialogues<br /> ClientLineContains - a pattern identifying input lines in logfiles<br /> RobotLineStarts - a pattern identifying robot lines in logfiles<br /> StartLine - starting line for analysis<br /> EndLine - ending line for log file analysis<br /> <br /> Applet options:<br /> <br /> AppletHost - DNS name or IP address of applet's server.<br /> CodeBase - URL or directory of applet code.<br /> <br /> Web server options:<br /> <br /> ClerkTimeout - Web server option to retire waiting clerks<br /> BrainSize - a threshold number of categories to display "loading"<br /> Advertize - a boolean parameter to optionally display ad<br /> Beep - Web server option to beep on client connections<br /> <br /> Other:<br /> <br /> Version - read only version number<br /> TempFile - scratch file for temporary data<br /> MergePolicy - 'Merge' or 'Discard' depending on how you <br /> want to handle duplicate key categories.<br /> <br /> All of the options reside in the globals.txt file. Running<br /> program B, choose "Options/Show Options" to see the<br /> contents of the file.<br /> <br /> </template> </category> <category> <pattern>WHAT ARE DEPRECATED APIS</pattern> <template> <br /> One of the biggest challenges facing a Java application<br /> developer is finding a "lowest common denominator" subset of the<br /> Java language that works on all the platforms and versions<br /> out there. The Java language itself is a moving target.<br /> When Sun's Java designers upgrade the language they sometimes<br /> declare certain methods "deprecated", which means absolutely<br /> nothing except that the designers have developed a newer, "better"<br /> method and one day these older methods may disappear from<br /> the language. Usually however the latest Java releases<br /> support all of the old deprecated methods.<br /> <br /> </template> </category> <category> <pattern>WHAT ARE JUSTTHAT AND JUSTBEFORETHAT</pattern> <template> <br /> <beforethat/>, <justthat/> and <justbeforethat/> are new, experimental<br /> AIML tags. The idea here is to represent more "state" in the dialogue<br /> than just "that":<br /> <br /> Client: BeforeThat.<br /> Robot: ...JustBeforeThat.<br /> Client: JustThat.<br /> Robot: ...That. <br /> Client: Input. <br /> Robot: <justthat/> = JustThat <br /> <justbeforethat/> = JustBeforeThat<br /> <beforethat/> = BeforeThat<br /> <br /> The following dialogue fragment illustrates the use of these tags:<br /> <br /> Client: MY NAME IS RICH.<br /> Robot: Rich, are you my master?<br /> Client: YES.<br /> Robot: Tell me more.<br /> Client: JUST THAT.<br /> Robot: "my name is rich" [<justthat/>]<br /> Client: JUST BEFORE THAT.<br /> Robot: "Tell me more" [<justbeforethat/>]<br /> Client: BEFORE THAT.<br /> Robot: "Yes" [<beforethat/>]<br /> <br /> <br /> There is an asymmetry between the client's tags (<beforethat/> and <justthat/>)<br /> and the robot's tags (<justbeforethat/> and <that/>). The Responder<br /> breaks down each multiple-line client input into individual sentences.<br /> The log file records each client input sentence individually, and shows<br /> the robot's reponses line by line. The robot, on the other hand, may<br /> respond to a single input sentences with multiple response sentences.<br /> The <that/> and <justbeforethat/> tags refer to only the last sentence<br /> in the robot's reply. The client tags <beforethat/> and <justthat/> always<br /> refer to the current and previous client input lines, even if they were<br /> part of a multiline input. In the descriptive dialogue above <br /> we used the notation "Client: BeforeThat." and "Robot: ...JustBeforeThat.",<br /> with ellipses representing sentences in the robot reply, to emphasize<br /> the asymmetry. If all the robot responses consisted of exactly one <br /> sentence each, the asymmetry would disappear.<br /> <br /> In the future we may expand AIML categories to include such<br /> "deeper context", along the lines of the <that>...</that> tag,<br /> if there is a need for it.<br /> <br /> </template> </category> <category> <pattern>WHAT ARE THE GET METHODS</pattern> <template> <br /> Get methods are logically atomic tags, i.e. they enclose no text.<br /> (similar to say <P> or <IMG> in HTML). But XML requires closing tags.<br /> <br /> All the "get" methods retrieve values stored relative<br /> to a particular client IP address. We use<br /> hash tables to store the maps from IP to these attributes.<br /> <br /> <get_ip/> Get the client's IP address<br /> <getsize/> A string indicating robot memory size<br /> <getversion/> The ALICE program version<br /> <getname/> client's name<br /> <gettopic/> The "topic" of conversation<br /> <name/> Robot's name<br /> <location/> Robot's location<br /> <gender/> Robot's gender<br /> <birthday/> Robot's birthday<br /> <that/> what robot said previously<br /> <get_location/> the client's geographic location<br /> <get_it/> the value of "it"<br /> <get_they/> the value of "they"<br /> <get_he/> the value of "he"<br /> <get_she/> the value of "she"<br /> <get_we/> the value of "we"<br /> <get_gender/> a string like "she" or "he" for client gender<br /> <br /> In XML languages there is always a tradeoff between creating attributes<br /> and creating new tags. The get methods are really all special instances<br /> of a more general <get attribute="name">, for example<br /> <get_we/> = <get attribute="we"/><br /> <br /> The attributes with explicit "get" names (getname, get_it, get_we etc.)<br /> are client-specific properties. The other attributes (e.g. <name/> and<br /> <botmaster/>) relate to the robot. <br /> <br /> </template> </category> <category> <pattern>WHAT ARE THE SET METHODS</pattern> <template> <br /> Set methods consist of single-tag and double-tag markup. The<br /> methods<br /> <br /> <set_male/> the client gender is male<br /> <set_female/> the client gender is female<br /> <set_animagent/> activates the animation agent. <br /> <br /> <br /> <setname> X </setname> sets the client name to X<br /> <settopic> X </settopic> sets the topic to X<br /> <set_it> X </set_it> sets the value of "it" to X <br /> <set_location> X </set_location> sets the value of client location<br /> <set_they> X </set_they> sets the value of "they" to X <br /> <set_he> X </set_he> sets the value of "he" to X <br /> <set_she> X </set_she> sets the value of "she" to X <br /> <set_we> X </set_we> sets the value of "we" to X <br /> <br /> <set_thought> X </set_thought> is a custom tag suggested by Andrew <br /> Potgieter for storing a predicate for "what are you thinking about?" <br /> See the documentation on custom tags and the predicates.txt file.<br /> <br /> </template> </category> <category> <pattern>WHAT DO YOU MEAN BY THE COMMAND JAVA B</pattern> <template> <br /> This does not mean you mean click on an icon. If you are using Windows,<br /> you must use a DOS window to run a Java program. Find the MS-DOS item<br /> on your start menu or desktop and open up a DOS window. In that window, use<br /> the DOS commands CD (change directory) to move to the "B" directory. <br /> Then type "java B" to run the program.<br /> <br /> If you are using windows, then you can create a desktop icon <br /> as a "shortcut" to a batch file. Create a batch file called<br /> "launch.bat" in the program B directory. The file contains only<br /> one line with the text "java B". There is an AIML icon file<br /> included with program B called "aiml.ico". You can use this<br /> file to add an icon to your desktop. <br /> <br /> </template> </category> <category> <pattern>WHAT DOES CLASSIFY DO</pattern> <template> <br /> The key to chat robot development is log file analysis. The program<br /> stores client dialogues in a file called "dialog.txt" (unless you<br /> change this default name). The "Classify" button activates a routine<br /> that scans the dialogue file and reports how many times each<br /> category is activated. The processing may take several minutes,<br /> depending on the size and range of the dialogue file chosen. The<br /> result appears as a table in the Edit View window. The program<br /> displays the categories sorted by activation count. <br /> <br /> The format of each output line is:<br /> <br /> P% (Q%) T PATTERN = N1 W1 + N2 W2 + ...<br /> <br /> Where<br /> <br /> P = Percent of inputs classified in this category<br /> Q = Cumulative percent up to this category<br /> T = Total count of inputs activating this category<br /> Ni = number of times input Wi detected (blank if Ni = 1)<br /> Wi = normalized input pattern activating this category<br /> <br /> </template> </category> <category> <pattern>WHAT DOES CLEAR DO</pattern> <template> <br /> To enter another robot query, clear the screen with the "Clear"<br /> button. Enter a new String like "How are you?" and press "Say."<br /> <br /> "Send" and "Clear" provide a simple way to communicate with the<br /> chat bot through the Edit View. Try cutting and pasting a paragraph,<br /> such as an e-mail message, into the Edit View and press "Send". <br /> See how the robot would reply to your multiline message.<br /> <br /> </template> </category> <category> <pattern>WHAT DOES HELP DO</pattern> <template> <br /> The "Help" button displays a random FAQ question that ALICE<br /> knows the answer to. You can see the answer by pressing the<br /> "Send" button.<br /> <br /> The Help menu provides the same function as the Help button<br /> under the selection "Random Help Question." Select a random<br /> Help question and obtain the reply with the "Send" button.<br /> <br /> The Help menu also contains an item to Show All Help Questions.<br /> This command lists all the FAQ questions the robot knows. You can<br /> select one question by deleting the others. Obtain the<br /> answer with the "Send" button.<br /> <br /> The menu item "Ask Help Question" is the same as "Send". This<br /> item asks the robot the Help question(s), and displays the reply.<br /> <br /> The Help menu displays the entire FAQ with the "Don't Read Me"<br /> selection. Finally, the "GNU Public License" menu items displays<br /> the open source software license for program B.<br /> <br /> </template> </category> <category> <pattern>WHAT DOES MORE TARGETS DO</pattern> <template> <br /> If you don't see enough good targets with "Quick Targets", hit<br /> "More Targets." <br /> <br /> </template> </category> <category> <pattern>WHAT DOES QUICK TARGETS DO</pattern> <template> <br /> After running Classify, the Quick Targets button displays a set of<br /> new AIML categories for editing. The program uses statistics to<br /> find new category candidates. These categories are displayed as<br /> <br /> <category><br /> <pattern> NEW PATTERN </pattern> <template> OLD PATTERN </template><br /> </category><br /> <br /> where OLD PATTERN is the pattern from the original category and<br /> NEW PATTERN is the proposed new input pattern.<br /> <br /> The botmaster may choose to either delete or edit the new category.<br /> If the new category is not desired, delete it by selecting the<br /> category from the text area and "cut" the text with the "delete"<br /> key.<br /> <br /> If the new category appears useful, edit the OLD PATTERN string to<br /> create a new reply. Optionally, the NEW PATTERN may also be edited,<br /> depending on how specific a pattern the botmaster desires.<br /> <br /> When finished editing the Target categories, go to the "Botmaster"<br /> menu and select "Add AIML". The "Add AIML" menu item will read the<br /> text displayed in the Edit View and parse it into new AIML categories.<br /> The botmaster may then save the updated robot with the "File/Save Robot"<br /> or "File/Save Robot As" menu items.<br /> <br /> </template> </category> <category> <pattern>WHAT DOES SEND DO</pattern> <template> <br /> Type a text string like "hello" into the Text Area <br /> (Edit View) and press the "Send" button. Notice that program B<br /> replaces the text in the Edit View with a reply from the robot.<br /> <br /> </template> </category> <category> <pattern>WHAT DOES THE EDIT MENU DO</pattern> <template> <br /> Paste contents of clipboard into the program B text area.<br /> <br /> </template> </category> <category> <pattern>WHAT DOES THE FILE MENU DO</pattern> <template> <br /> Save and load text files (transfer contents to/from text area);<br /> <br /> Save and load robot (AIML) files.<br /> 1. By default, AIML files use the .aiml file extension.<br /> 2. The default robot file is called "B.aiml"<br /> 3. By default the robot files reside in the same directory as<br /> program B<br /> 4. Robot files begin and end with the tags <alice> and </alice><br /> 5. "Save Robot" overwrites the default robot file (see 2).<br /> 6. "Save Robot As" can be used to copy a robot.<br /> <br /> Exit - exit the program<br /> <br /> </template> </category> <category> <pattern>WHAT DOES THE OPTIONS MENU DO</pattern> <template> <br /> Display and save chat robot options.<br /> Use start and end index to select a range of lines<br /> from the dialog file.<br /> <br /> Toggle Beep - Make a sound when a remote client connects.<br /> <br /> </template> </category> <category> <pattern>WHAT FILES ARE NEEDED TO RUN THE PROGRAM B WEB SERVER</pattern> <template> <br /> The program B directory must contain the HTML files header.html,<br /> trailer.html, loading.html and HOME.html. You can customize these files for<br /> your bot, but take care with "header" and "trailer" because<br /> program B uses these files to construct an HTML reply <br /> (by inserting the robot reply and the text form between the <br /> "header" and the "trailer"). Use "header" and "trailer" to<br /> customize the robot with your own logo and links.<br /> <br /> Program B needs at least one AIML file, usually called B.aiml<br /> by default. The AIML file may contain <load> tags that recursively<br /> load other AIML files; these must also be present.<br /> <br /> The program also requires the file "globals.txt"<br /> which it reads at start up. <br /> <br /> The files "language.txt" and "predicates.txt" are option.<br /> "language.txt" controls the language of the buttons and<br /> menu items in the program B GUI. The file "predicates.txt"<br /> defines any custom predicates.<br /> <br /> Program B also reads the files "gnu.txt" (the GNU Public License)<br /> and "dont.txt" (this file). <br /> <br /> </template> </category> <category> <pattern>WHAT FILES DO I NEED TO RUN THE APPLET</pattern> <template> <br /> You only need the java *.class files and the *.aiml files<br /> to run the ALICE Applet, no more files are necessary. <br /> You can also put all the class files in a single jar<br /> file like Blet.jar. The sample index.html provided with the ALICE <br /> distribution uses this Blet.jar file. <br /> <br /> Not all of the Java source files are involved in the Applet.<br /> You can use the following command to compile all the Java source<br /> files needed for the Applet:<br /> <br /> javac Access.java Globals.java StringFile.java Substituter.java \<br /> Classifier.java Loader.java Animagent.java Log.java Blet.java<br /> <br /> Then, you can use zip (or jar) to collect the class files into<br /> a single jar file:<br /> <br /> zip -r Blet.jar *.class <br /> <br /> The *.class will include all the class files you compiled. <br /> <br /> The *.aiml files have to be on the same host that serves the Applet. An applet<br /> can only open files on the server it originated from.<br /> <br /> Don't forget to change the Applet host parameters in index.html, when<br /> you upload the applet to an ISP.<br /> <br /> </template> </category> <category> <pattern>WHAT HAPPENS TO CONTRACTIONS AND PUNCTUATION</pattern> <template> <br /> Program B has a class called Substituter that performs a number<br /> of grammatical and syntactical substitutions on strings. <br /> One task involves preprocessing sentences to remove ambiguous<br /> punctuation to prepare the input for segmentation into individual<br /> sentence phrases. Another task expands all contractions and<br /> coverts all letters to upper case; this process is called<br /> "normalization". <br /> <br /> The Substituter class also performs some spelling correction.<br /> (See also the question "What is <person/>?")<br /> <br /> One justification for removing all punctuation from inputs<br /> is the need to make ALICE compatible with speech input systems,<br /> which of course do not detect punctuation (unless the speaker<br /> utters the actual word for the punctuation mark -- "period").<br /> <br /> </template> </category> <category> <pattern>WHAT IF I DO NOT WANT TO DISCARD DUPLICATE CATEGORIES</pattern> <template> <br /> Using the global parameter MergePolicy, you can choose<br /> to either "Merge" or "Discard" templates with duplicate keys.<br /> If you choose the "Merge" option then the program applies a <br /> heuristic to try to merge the two responses together with<br /> a "<random>" tag. The results of this operation may be<br /> unpredictable, so the program logs all duplicates in a file<br /> called "duplicates.txt".<br /> <br /> The heuristic merge works as follows: Suppose X and Y are the two<br /> templates to merge into a new template Z. Let X be the new template<br /> and Y the existing one. Assume that X and Y are either <random> <br /> lists or "atomic", in the sense that they contain no <random> tags.<br /> If X and Y are both "atomic" then Z = <random><li>X</li><li>Y</li></random>. <br /> If Y is a <random> list atomic then the program checks to see if X is<br /> already a member of that list, to avoid duplicate list items. Otherwise,<br /> Z = the <random> list from Y with X inserted. <br /> <br /> </template> </category> <category> <pattern>WHAT IS A CATEGORY</pattern> <template> <br /> AIML consists of a list of statements called categories. Each<br /> category contains an input pattern and a reply template. <br /> The syntax of an AIML category is:<br /> <br /> <category><br /> <pattern> PATTERN </pattern> <template> Template </template><br /> </category><br /> <br /> or<br /> <br /> <category><br /> <pattern> PATTERN </pattern><br /> <that> THAT </that><br /> <template> Template </template><br /> </category><br /> <br /> The AIML category tags are case-sensitive. Each open tag has an<br /> associated closing tag. This syntax obviously derives from XML.<br /> <br /> </template> </category> <category> <pattern>WHAT IS A PATTERN</pattern> <template> <br /> The pattern is the "stimulus" or "input" part of the category.<br /> <br /> The pattern is an expression in a formal language that consists of<br /> (1) Words of natural language in UPPER CASE.<br /> (2) The symbol * which matches any sequence of one or more words.<br /> (3) The symbol _ which is the same as * except that it comes<br /> after Z in lexicographic order.<br /> (4) The markup <name/> which is replaced at robot load time <br /> with the name of the robot.<br /> <br /> Note there is a difference between the patterns HELLO and HELLO *.<br /> HELLO matches only identical one-word sentences ("Hello.") <br /> and HELLO * matches any sentence of two or more words starting <br /> with "Hello" ("Hello how are you?"). <br /> <br /> To simplify pattern description and matching, AIML patterns allow<br /> only one "*" per pattern. In other words, "MY NAME IS *" is a<br /> valid pattern, but "* AND *" is not.<br /> <br /> </template> </category> <category> <pattern>WHAT IS A SYMBOLIC REDUCTION</pattern> <template> <br /> In general there are a lot of categories whose job is<br /> "symbolic reduction". The category:<br /> <br /> <category><br /> <pattern>ARE YOU VERY *</pattern><br /> <template><srai>ARE YOU <star/></srai></template><br /> </category><br /> <br /> This category [in Brain.aiml] will reduce "Are you very very smart"<br /> to "Are you smart".<br /> <br /> </template> </category> <category> <pattern>WHAT IS A TEMPLATE</pattern> <template> <br /> A template is the "response" or "output" part of an AIML category.<br /> <br /> The template is the formula for constructing the reply. The simplest<br /> template consists of plain, unmarked text. AIML provides markup <br /> functions to tailor the replies for each individual input and client. <br /> The markup function <getname/> for example inserts the client's name <br /> into the reply. <br /> <br /> The template may call the pattern matcher recursively using the<br /> <sr/> and <srai> tags. Many templates are simple symbolic <br /> reductions that map one sentence form to another, for example<br /> "Do you know what X is?" transforms to "What is X" with the category<br /> <br /> <category><br /> <pattern>DO YOU KNOW WHAT * IS</pattern><br /> <template><srai>WHAT IS <star/> </srai></template><br /> </category><br /> <br /> The template may also contain other embedded HTML and XML. <br /> These embedded tags may cause the browser to play a sound,<br /> show an image, or run an applet. There is considerable freedom<br /> of expression in the construction of response templates. The<br /> botmaster is encouraged to study the examples in ALICE, to and<br /> experiment with new ideas.<br /> <br /> </template> </category> <category> <pattern>WHAT IS AIML</pattern> <template> <br /> The ALICE software implements AIML (Artificial Intelligence Markup <br /> Language) a non-standard evolving markup language for creating chat robots.<br /> The primary design feature of AIML is minimalism. Compared with<br /> other chat robot languages, AIML is perhaps the simplest. The<br /> pattern matching language is very simple, for example permitting<br /> only one wild-card ('*') match character per pattern. <br /> <br /> AIML is an XML language, implying that it obeys certain grammatical<br /> meta-rules. The choice of XML syntax permits integration with<br /> other tools such as XML editors. Another motivation for XML is<br /> its familiar look and feel, especially to people with HTML experience.<br /> <br /> An AIML chat robot begins and ends with the <alice> and<br /> </alice> tags respectively. <br /> <br /> </template> </category> <category> <pattern>WHAT IS ARE THE LT PERSON GT TAGS</pattern> <template> <br /> The <person> and <person2> tags indicate a place where the<br /> AIML interpreter changes the personal pronouns in a sentence.<br /> <br /> <person2> X </person2> change X from 1st to 2nd person<br /> <person> X </person> exchange 1st and 3rd person<br /> <br /> <person2> is not often used. The main application is<br /> "gossip":<br /> <br /> Client: I admire robots like you.<br /> Robot: That's good information: Joe said he admire robots like me.<br /> <br /> The transformation is a combination of:<br /> <br /> 1. change the first person pronouns to second person.<br /> 2. change the third person pronouns to first person.<br /> <br /> The array in Substituter.java is incomplete. We need more substitutions<br /> to make person2 work really well.<br /> <br /> The <person> substitution is much more common and easier<br /> to understand, because it simply exchanges 1st and 3rd person<br /> pronouns. The main issue with <person> in English is knowing<br /> when to use "I" and when to use "me".<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS ACCESS</pattern> <template> <br /> Class Access is the abstraction for log file analysis to<br /> extract dialogues. In a typical chat robot server scenario,<br /> the program records each line of client input and the robot<br /> reply in a log file. Given many simultaneous conversations,<br /> these dialogues are interleaved in the log file. The purpose<br /> of class Access is to unravel these conversations into<br /> individual threads by client.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS ALICEREADER</pattern> <template> <br /> AliceReader is an efficient, small-footprint XML interpreter<br /> hard coded by Kris Drent specifically for reading AIML categories. <br /> Each category has a pattern, a template, and an optional topic and<br /> thatpattern. AliceReader scans the AIML input and tries to<br /> identify these fields as quickly as possible.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS B</pattern> <template> <br /> Class B is the old name for the Swing version of class Bawt, but<br /> now just extends Bawt.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS BAWT</pattern> <template> <br /> The class Bawt is the Java application, and implements the GUI.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS BLET</pattern> <template> <br /> The Blet class is the applet, but is similar in many ways to the application.<br /> The applet is a stripped down version of the program, with a simpler GUI<br /> and no "botmaster" privileges. Also, the Blet class doesn't utilize the<br /> web server, because it runs as a client-side applet.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS BRAIN</pattern> <template> <br /> Brain extends StringSorter, and uses StringRanker. The sorted<br /> strings in the Brain class are keys formed by combining the<br /> pattern, that, and topic strings. In the original versions<br /> of ALICE, there were no "that" and no "topic" tags, so the<br /> Brain class simply mapped input patterns to output templates.<br /> With the addition of the "that" and "topic" tags we had to<br /> create the "key" from the combination of all three.<br /> <br /> The "Target" objects in class Brain are instances of StringRanker.<br /> These structures form the basis of the classification and targeting<br /> algorithms in program B. For each category, the Targetmap contains<br /> an instance of StringRanker storing the inputs classified into<br /> that category. <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS CLASSIFIER</pattern> <template> <br /> The class Classifier might as well be called "bot" because it contains<br /> the basic functionality of the chatterbot algorithm. <br /> <br /> See the question "How can I interace my Java program to ALICE?" for<br /> additional information about the class Classifier.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS DIALOGUE</pattern> <template> <br /> A Dialogue (not to be confused with a Dialog class!) is<br /> the representation of the conversation between the client<br /> and the robot. The basic data structure is a pair of String arrays<br /> client_said[] and robot_said[] that store the alternating<br /> statements of client and robot. The Dialogue also<br /> encodes the length, hostname, and start and end tag<br /> information.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS GLOBALS</pattern> <template> <br /> Globals is the repository for all of the botmaster-selectable<br /> parameters in program B. The Globals class corresponds to<br /> the "Options" menu on the program B menu bar. Globals contains<br /> methods toFile() and fromFile() to make these values<br /> persistent between sessions.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS INTSET</pattern> <template> <br /> IntSet represents a set of integers. Were we using Java<br /> Collections this would likely be a Set, but the simple<br /> requirements of program B allow us to create a simple<br /> IntSet class.<br /> <br /> "Set" means that the object has only one occurance of each item:<br /> {1, 4, 2, 9} is a set of integers; {1, 1, 2} is not.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS KID</pattern> <template> <br /> Class Kid is a simplified graphical user interface, "easy enough<br /> for kids" to run. Program Kid does not evoke program B, but the Kid<br /> may be started from the program B options menu. The logic here<br /> is that kids should be able to have conversations with the chat<br /> robot, but parents may not want kids to start chat robot servers<br /> (see Appendix B: Note to Parents).<br /> <br /> Class Kid utilizes RobotCommunicator as its interface to the<br /> chat robot. <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS LINECLASSIFIER</pattern> <template> <br /> In the file Log.java you will find an Interface called LineProcessor<br /> with one required method: process_line(). The LineProcessor<br /> is the abstraction of an algorithm that reads a file one line at a time,<br /> processes each line as a data record, and moves on to the next.<br /> <br /> LineClassifier implements LineProcessor because it reads lines<br /> of text from the log file and identifies client input lines for<br /> classification. What makes classification efficient is the way<br /> LineClassifier stores the client lines in a SortedStringSet, called<br /> Lines. Becuase the matching algorithm proiritizes the patterns<br /> alphabetically, LineClassifier can classify an element from Lines<br /> in O(1) time.<br /> <br /> The code for LineClassifier is in Classifier.java.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS LOADER</pattern> <template> <br /> Both the application and the applet use the Loader class to load the AIML<br /> robot script. The Loader class extends Thread, and runs "in the background"<br /> while the GUI and, in the case of the application, the web server start.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS PARSER</pattern> <template> <br /> The Parser class is responsible for the evaluation of AIML<br /> response templates. The method pfkh() [the Program Formerly<br /> Known as Hello] is the heart of evaluation process. This<br /> method contains the code for recognizing and processing<br /> AIML template tags.<br /> <br /> The Parser class does not parse all the AIML in the language<br /> definition; it parses and evaluates only the templates at runtime.<br /> Another class, AliceReader, has the job of reading the AIML files <br /> at load time, and parsing the categories into topics, patterns and templates. <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS ROBOTCOMMUNICATOR</pattern> <template> <br /> If you want to customize your own application or applet then<br /> you might find RobotCommunicator is a useful class. The<br /> RobotCommunicator abstracts the combination of a scrolling TextArea <br /> output display with a TextField input area input field.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS SORTEDINTSET</pattern> <template> <br /> The sorted version of IntSet, SortedIntSet maintains its<br /> elements in a sorted array. Throughout program B you will<br /> find many loops utilizing instances of SortedIntSet. These<br /> objects provide an efficient means to locate items in<br /> "rank order", the highest numbered items first and the<br /> smallest numbers last. <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS STRINGHISTOGRAMMER</pattern> <template> <br /> StringHistogrammer extends StringSet and contains a map from<br /> each string to a count, usually indicating the number of times<br /> that string appears in a sample of text. A histogram is<br /> like a "bar graph" that counts occurances of each item. <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS STRINGRANKER</pattern> <template> <br /> Extending StringHistogrammer, StringRanker also sorts the <br /> strings by the histogram count. The highest count string<br /> is first, the next highest count second, and so on. <br /> <br /> The concept of a StringRanker should be familiar to anyone<br /> who has ranked people, companies or sports teams by any<br /> number such as sales, market capitilization, or points scored.<br /> One application for a StringRanker is determining the<br /> "top 10 referers" in HTTP log file analysis (see<br /> http://alicebot.org/mine.html). <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS STRINGSET</pattern> <template> <br /> The StringSet implements the abstract concept of a set of<br /> strings, meaning that each string item appears at most once<br /> in the setc. <br /> <br /> The "set" means that the strings occur only once in instances<br /> of object StringSet: {"this","that","another"} is a set of<br /> strings; {"start","start","stop"} is not.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS STRINGSORTER</pattern> <template> <br /> StringSorter extends StringSet but enforces an alphabetical<br /> ordering of the Strings. The StringSorter maintains its<br /> data structure dynamically, so that the set remains sorted<br /> after each item is added. Specifically, the StringSorter uses<br /> a binary-search algorithm for fast String insertion. <br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS SUBSTITUTER</pattern> <template> <br /> The static class Substituter contains a number of similar string substitution<br /> methods useful at several points in program B.<br /> <br /> Program B has the unique feature that it relies on HTTP GET methods,<br /> rather than POST methods, to transmit chat inputs to the robot server.<br /> HTTP inserts '+' characters in place of spaces, and applies a series of<br /> substitutions to eliminate many characters. The static method cleanup_http()<br /> undoes these substitutions and restores the input string to the form similar<br /> to what the client originally typed.<br /> <br /> The problem of segmenting strings into sentences is complicated by the<br /> conventional use of periods to denote abbreviations like "Dr.", "Mr.",<br /> and "St." The method deperiodize() applies a series of substitutions to<br /> eliminate most common abbreviations. Like the other substitution methods<br /> in this class, the deperiodize() method has an associated static data member<br /> of class String[][2], which stores the substitution map.<br /> <br /> The patterns in AIML are written in normalized form. The method normalize()<br /> converts a string to normal form by the following steps:<br /> <br /> 1. Remove all punctuation (inputs assumed to be individual sentences)<br /> <br /> 2. Convert string to upper case<br /> <br /> 3. Place exactly one space between words<br /> <br /> 4. Expand all contractions<br /> <br /> 5. Correct a few common spelling mistakes<br /> <br /> 6. Return a "Trimmed" string<br /> <br /> The justification for removing all punctuation from text inputs<br /> is explained by the need to make the chatterbot compatible with speech<br /> inputs, which of course contains no punctuation.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS UNIFIER</pattern> <template> <br /> Unification refers to the process of matching and binding. A unifier determines<br /> whether two sentences match and, if so, what any 'variables' in the pattern<br /> bind to. In the case of AIML the only matching variable is the single '*'<br /> symbol. The Unifier class contains a 'star' data memeber to contain the<br /> matched subsentence.<br /> <br /> </template> </category> <category> <pattern>WHAT IS CLASS WEBSERVER</pattern> <template> <br /> The WebSever class implements a "faux" HTTP server, i.e. a server that<br /> listens for HTTP connections and accepts them; then replies in properly<br /> formatted HTML. The connecting client, typically a browser, cannot tell<br /> the difference between the chat robot server and a full-blown web server.<br /> In particular, our WebServer implements only HTTP GET methods, not POST<br /> methods. Our WebServer class does not implement many of the other features <br /> of ordinary web servers; although it is a multithreaded server.<br /> <br /> </template> </category> <category> <pattern>WHAT IS LT LOAD FILENAME X GT</pattern> <template> <br /> The template may contain a <load/> tag to recursively load an AIML<br /> file. The semantics of a load are the same as a merge: categories<br /> loaded first have priority; the server eliminates categories with<br /> duplicate patterns. <br /> <br /> The default robot file B.aiml contains the top-level load commands.<br /> There are several ways to "comment out" a <load> tag in order<br /> to test your system with a smaller robot. You can change the<br /> line reading <br /> <load filename="Brain.aiml"/><br /> to <br /> <noload filename="Brain.aiml"/><br /> and the AIML parser will simply ignore the non-existent "noload"<br /> command.<br /> <br /> </template> </category> <category> <pattern>WHAT IS LT STAR GT</pattern> <template> <br /> The <star> tag indicates the input text fragment matching the pattern '*'.<br /> Remember, <star/> is an XML abbreviation for <star></star>.<br /> <br /> <star/> the value of "*" matched by the pattern.<br /> <br /> </template> </category> <category> <pattern>WHAT IS LT THAT GT</pattern> <template> <br /> The keyword "that" in ALICE refers to whatever the robot said before<br /> a user input. Conceptually the choice of "that" comes from the<br /> observation of the role of the word "that" in dialogue fragments like:<br /> <br /> Robot: Today is yesterday.<br /> Client: That makes no sense.<br /> <br /> Robot: The answer is 3.14159<br /> Client: That is cool.<br /> <br /> In AIML the syntax <that>...</that> permits an optional "ThatPattern"<br /> to match the robot's "that" expression. A common example using "that"<br /> is any yes-no question:<br /> <br /> <category><br /> <pattern>YES</pattern><br /> <that> DO YOU LIKE MOVIES </that><br /> <template> What's your favorite movie? </template><br /> </category><br /> <br /> This category handles the user input "YES" and checks to see whether<br /> the client is replying to the question "What's your favorite movie?".<br /> <br /> One interesting application of "that" are the categories that<br /> enable a robot to respond to "knock-knock" jokes:<br /> <br /> <category><br /> <pattern>KNOCK KNOCK</pattern><br /> <template>Who's there?</template><br /> </category><br /> <br /> <category><br /> <pattern>*</pattern><br /> <that>WHO IS THERE</that><br /> <template><person/> Who?</template><br /> </category><br /> <br /> <category><br /> <pattern>*</pattern><br /> <that>* WHO</that><br /> <template>Ha ha very funny, <getname/></template><br /> </category><br /> <br /> Client: KNOCK KNOCK<br /> Robot: Who's there? <br /> Client: BANANA<br /> Robot: banana Who? <br /> Client: KNOCK KNOCK<br /> Robot: Who's there? <br /> Client: BANANA<br /> Robot: banana Who? <br /> Client: KNOCK KNOCK<br /> Robot: Who's there? <br /> Client: ORANGE<br /> Robot: orange Who? <br /> Client: ORANGE YOU GLAD I DID NOT SAY BANANA<br /> Robot: Ha ha very funny, Aol-person <br /> <br /> </template> </category> <category> <pattern>WHAT IS LT THINK GT</pattern> <template> <br /> The simple purpose of the <think> X </think> tag pair is<br /> to evaluate the AIML expression X, but "nullify" or hide<br /> the result from the client reply.<br /> <br /> A simple example:<br /> <br /> <category><br /> <pattern>I AM FEMALE</pattern><br /> <template>Thanks for telling me your gender. <think><set_female/></think><br /> </template><br /> </category><br /> <br /> The <set_female/> tag normally returns a string like "she". But the<br /> <think> tag hides the text output of <set_female/> from the reply,<br /> which contains only the text:<br /> <br /> Thanks for telling me your gender. <br /> <br /> </template> </category> <category> <pattern>WHAT IS NEW IN AIML</pattern> <template> <br /> AIML is changing. The original tag syntax was changed<br /> into XML. Right now, AIML uses XML syntax for the<br /> categories, patterns, "that" patterns and templates, but inside the <br /> <template> tag you may still see the original +~ syntax in a few places. <br /> But this will change soon. For completeness program B<br /> supports both versions.<br /> <br /> The biggest change between the old AIML and the new<br /> XML version of AIML is the elimination of the "+"<br /> character to stand for string appendage. The change<br /> is of little concern except in the implementation of<br /> <random>, discussed at length below.<br /> <br /> The old AIML used a tilde (~) markup character to<br /> indicate the start of an AIML token. The XML version<br /> naturally uses an SGML type tag syntax instead.<br /> <br /> XML tags, unlike HTML, are case-sensitive. Moreover, XML syntax<br /> requires a closing tag of some kind. The "empty" tags that contain<br /> no text, like <A></A> in HTML, are written like <A/> in XML.<br /> <br /> </template> </category> <category> <pattern>WHAT IS ON THE HELP MENU</pattern> <template> <br /> Random Help - Same as "Help" button.<br /> <br /> Show Help Questions - Displays a list of all FAQ questions. Select<br /> one by deleting all the others. Obtain the answer with "Send." <br /> <br /> Don't Read Me - Display the text of this document.<br /> <br /> GNU Public License - Display the software license.<br /> <br /> </template> </category> <category> <pattern>WHAT IS PROGRAM BAWT</pattern> <template> <br /> Significant demand for a version of ALICE compatible with<br /> pre- Java 2 (formerly known as Java 1.2) prompted the<br /> development of "Bawt.java", an open source java program<br /> for chat robot development that works with older versions of<br /> Java, and AWT. Originally program B relied on <br /> Java 2 and Swing, but program Bawt needs only Java 1.1 and AWT.<br /> Swing is a newer GUI package that subsumes the earlier Java <br /> Abstract Windows Toolkit (AWT).<br /> <br /> At present class B merely extends class Bawt. Swing not<br /> supported.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE BOTMASTER MENU</pattern> <template> <br /> The Botmaster menu contains all the tools to help develop chat robots.<br /> <br /> Classify - same as Classify button<br /> <br /> Default Targets - display targets obtained from<br /> the Default ('*') category,<br /> in a format suitable for <br /> quick conversion to new AIML.<br /> <br /> Recursive Targets - display targets from "recursive" categories,<br /> i.e. categories with a template containing<br /> the AIML <sr/> or <srai/> functions.<br /> <br /> Autochat - The robot chats with herself; sometimes helpful<br /> in detecting conversation "loops".<br /> <br /> Add AIML - Clear the screen and type a line of AIML. Selecting<br /> "Add AIML" adds this new category to the chatbot. You can<br /> test the bot with "Send" and "Classify", then save it with<br /> "File/Save Robot".<br /> <br /> In general you can add any number of new AIML categories<br /> to the bot with "Add AIML." <br /> <br /> </template> </category> <category> <pattern>WHAT IS THE CLASS STRUCTURE OF PROGRAM B</pattern> <template> <br /> The core functionality of program B resides in the file<br /> Classifier.java. In that file, you find a class hierarchy<br /> from "String" to "Brain" and finally "Classifier." <br /> A branch in that hierarchy contains classes for histogramming<br /> and ranking. <br /> <br /> The first branch of the class hierarchy derives class Brain<br /> from StringSorter, extending StringSet. The second branch<br /> extends StringSet to StringHistogrammer and on to StringRanker.<br /> The final class Brain extends StringSet and uses StringRanker.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE DIFFERENCE BETWEEN B AND C</pattern> <template> <br /> AIML is a platform-independent, language-independent specification<br /> for creating chat robots like ALICE. The original AIML interpreter<br /> ran in SETL. The next one developed was program B, the Java program<br /> which is the subject of this document. Most recently new threads<br /> of C/C++ development have led to "program C", actually a collection<br /> of C/C++ programs and applications including Cgi-ALICE, IRC-ALICE and<br /> WinALICE. See the web sites http://c.alicebot.com and<br /> http://hippie.alicebot.com for more details. <br /> <br /> Program B remains the most stable, general purpose chat robot<br /> program in the AIML family. This Java implementation has been<br /> subject to intense peer review over a period of years, evolving<br /> into a remarkably bug-free, efficient and reabable piece of<br /> software.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE DTD FOR AIML</pattern> <template> <br /> Real XML fanatics know that because AIML is an XML language it<br /> must have something called a DTD (Document Template Descriptor).<br /> The DTD is a formal specification of the grammar for an XML language.<br /> Unless you are using special XML tools to work on your AIML or<br /> developing your own parser for AIML, you probably do not need to know <br /> much about the DTD.<br /> <br /> Our DTD reflects the current content of the *.aiml files that program B can <br /> actually parse. The DTD will become more general as the parser<br /> improves. <br /> <br /> Rather than reproduce the entire DTD here, in order to shorten the<br /> length of this document, we refer the reader to<br /> the A.L.I.C.E. XML page by John Friedman. The URL for the AIML<br /> DTD may be found on the page at http://XML.ALICEBot.Com. <br /> The full URL for the DTD is <br /> http://xml.alicebot.com/xml/aiml/alice.dtd<br /> <br /> <br /> </template> </category> <category> <pattern>WHAT IS THE GOAL FOR AIML</pattern> <template> <br /> AIML (Artificial Intelligence Markup Language) is an XML specification<br /> for programming chat robots like ALICE using program B. The emphasis<br /> in the language design is minimalism. The simplicity of AIML makes<br /> it easy for non-programmers, especially those who already know HTML, <br /> to get started writing chat robots.<br /> <br /> One ambitious goal for AIML is that, if a number of people create their own<br /> robots, each with a unique area of expertise, program B can literally <br /> merge-sort them together into a Superbot, automatically omitting <br /> duplicate categories. We offer the both the source code and the ALICE <br /> content, in order to encourage others will "open source" their chat <br /> robots as well, to contribute to the Superbot. <br /> <br /> Botmasters are also of course free to copy protect private chat robots.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE LOW LEVEL INTERFACE TO PROGRAM B</pattern> <template> <br /> If you require only a graphical interface, try using the<br /> class RobotCommunicator. Depending on your application,<br /> you may also try the Servlet interface or the applet. <br /> Some developers however may want lower-level access to the<br /> chat robot functions.<br /> <br /> The class Classifier in Classifier.java contains the low-level <br /> methods needed to interface directly to ALICE. "Classifier" might <br /> as well be called "Bot" because more than any other class, <br /> it handles those functions most unique to the chat robot.<br /> <br /> The method Classifier.multiline_response() is a key entry point <br /> into the conversation engine. The "multiline" in <br /> "multiline_response" means that the input may contain <br /> multiple "lines" or sentences. The first argument "query" to<br /> multiline_response is the input. The second argument "hname" is <br /> the virtual IP address of the client. The third and last argument<br /> is the class implementing the Responder interface.<br /> <br /> If the input string contains "Sentence1. Sentence2? Sentence3." <br /> then multiline_response might produce:<br /> <br /> > Sentence1.<br /> Reply1<br /> > Sentence2<br /> Reply2<br /> > Sentence3<br /> Reply3<br /> <br /> The method multiline_response hides all of the details <br /> of sentence segmentation, responding to each input line individually, <br /> and formatting the output. In particular multiline_response() <br /> may or may not append the VBScript needed to drive the MS<br /> Agent output, depending on whether the global MS Agent parameter is set.<br /> <br /> The argument "hname" is a key that indexes the client's conversation. For<br /> the interface you need this can probably always be "localhost" or some<br /> other constant. <br /> <br /> </template> </category> <category> <pattern>WHAT IS THE LT PERSON GT TAG</pattern> <template> <br /> The XML specification requires that every start tag such as<br /> <person> be followed by a matching end tag like </person>. <br /> HTML is more relaxed about this requirement, exemplified by<br /> the liberal use of the <IMG> tag without a corresponding </IMG>.<br /> XML supports a shorthand notation for the "atomic" tags.<br /> The <star/> tag is an example of a shorthand AIML tag. <br /> <person/> is another example:<br /> <br /> <person/> = <person><star/></person><br /> <br /> This tag replaces the +~person(*)+ tag in old-style AIML.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE LT PERSON2 GT TAG</pattern> <template> <br /> This tag is an abbreviation:<br /> <br /> <person2/> = <person2><star/></person2><br /> <br /> See the FAQ question "What are the <person> tags?" for more<br /> information about <person2/>.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE LT PERSONF GT TAG</pattern> <template> <br /> The value of <personf/> (a "formatted" personal pronoun transformation)<br /> is shown by the example <br /> <br /> <category><br /> <pattern>WHAT IS A *</pattern><br /> <template><br /> What does <br /> <A HREF="http://www.dictionary.com/cgi-bin/dict.pl?term=<personf/>"> <br /> <set_it> <person/> </set_it> <br /> </A> mean? <BR> <br /> Or Ask Jeeves: <br /> <A HREF="http://www.ask.com/AskJeeves.asp?ask=WHAT%20IS%20A%20<personf/>"><br /> What is a <person/>?<br /> </A><br /> </template><br /> </category><br /> <br /> The search strings formatted for the Webster Dictionary and for<br /> the Ask.com search engine utilize <personf/>. The effect is the<br /> same as <person/>, but the formatting inserts an escaped "%20" in<br /> places of the spaces returned by <person/>. These escape sequences<br /> permit the HTTP GET methods to transmit multiple-word queries. <br /> <br /> </template> </category> <category> <pattern>WHAT IS THE LT SRAI GT TAG</pattern> <template> <br /> The recursive function <srai> stands for<br /> "Stimulus-Response artificial intelligence" and means<br /> that the text between the tags should be sent recursively<br /> to the pattern matcher and the result interpreted.<br /> The resulting text replaces the original text in the markup.<br /> <br /> <srai> X </srai> calls the pattern matcher recursively on X.<br /> <sr/> recursive call to chat robot<br /> <br /> <sr/> abbreviates <srai> <star/> </srai><br /> <br /> Note: what happens if X contains AIML markup? Does the interpreter<br /> do "lazy evaluation"? Look at the source code and examine the<br /> method pfkh(), the Program Formerly Known as "Hello".<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE LT TOPIC GT TAG</pattern> <template> <br /> 1. <topic> allows ALICE to prefer responses that deal with the <br /> topic currently being discussed. This creates topical <br /> conversation, yet still has the ability to move from one subject <br /> to another.<br /> 2. <topic> allows ALICE to have duplicate patterns in different <br /> contexts (topics) allowing ALICE to have different responses to <br /> the same input patterns depending on the topic. For example, <br /> "overriding" the " * " pattern for different topics. (I'll give <br /> an example with this.)<br /> 3. As always, you can still use the <gettopic/> tag to refer to <br /> the topic in your output statements (templates).<br /> 4. As always, you can add topics on top of all your existing AIML <br /> to keep your bot's current personality. <br /> <br /> </template> </category> <category> <pattern>WHAT IS THE RESPONDER INTERFACE</pattern> <template> <br /> Developed to meet the needs of multiple ALICE<br /> application scenarios, the Responder interface<br /> simplifies the code in class Classifier for<br /> natural language queries. The Responder defines<br /> an interface with four members:<br /> <br /> pre_process() : runs any initialization first.<br /> log() : tells how to log the conversation.<br /> append() : how to append response lines together.<br /> post_process() : runs after response loop finishes.<br /> <br /> The method Classifier.multiline_response() calls<br /> all of the Responder methods. See the next<br /> question ("What is the low-level interface?")<br /> for more information about multiline_response(). <br /> At least five classes implement the Responder<br /> interface:<br /> <br /> GUIResponder: the program B GUI uses this.<br /> HTMLResponder: a class for Web Server HTML replies.<br /> RobotResponder: this class used by RobotCommunicator<br /> CustomResponder: a template for more Responder classes.<br /> AppletResponder: the Applet code uses this class.<br /> <br /> These classes all handle special circumstances<br /> for the various Responder types: for example,<br /> HTMLResponder appends the client input to each<br /> response; GUIResponder does not. AppletResponder<br /> logs the dialogue through a network URL connection;<br /> all other classes write to a local file. RobotResponder,<br /> used by the Kid interface, suppresses all the HTML<br /> from robot replies; while HTMLResponder passes<br /> them through. HTMLResponder also runs the optional<br /> Animagent class to create the MS Agent VB Script.<br /> Text-based Responder classes wrap the text; HTMLResponder<br /> need not wrap because the browser handles text formatting.<br /> The Responder interface addresses this wide variety of needs.<br /> <br /> </template> </category> <category> <pattern>WHAT IS THE THEORY BEHIND ALICE</pattern> <template> <br /> I used to say that there was NO theory behind ALICE: no neural network,<br /> no knowledge representation, no search, no fuzzy logic, no genetic<br /> algorithms, and no parsing. Then I discovered there was a theory<br /> circulating in applied AI called "Case-Based Reasoning" or CBR that<br /> maps well onto the ALICE algorithm. Another term, borrowed from<br /> pattern recognition, is "nearest-neighbor classification." <br /> <br /> The CBR "cases" are the categories in AIML. The algorithm finds<br /> best-matching pattern for each input. The category ties the<br /> response template directly to the stimulus pattern. ALICE is<br /> conceptually not much more complicated that Weizenbaum's ELIZA <br /> chat robot; the main differences are the much larger case base and the<br /> tools for creating new content by dialog analysis.<br /> <br /> ALICE is also part of the tradition of "minimalist", "reactive" or<br /> "stimulus-response" robotics. Mobile robots work best, fastest and<br /> demonstrate the most animated, realistic behavior when their sensory <br /> inputs directly control the motor reactions. Higher-level symbolic<br /> processing, search, and planning, tends to slow down the process <br /> too much for realistic applications, even with the fastest control<br /> computers. <br /> <br /> </template> </category> <category> <pattern>WHAT IS XML</pattern> <template> <br /> David Bacon pronounces it "Eggsmell". XML is the Extensible<br /> Markup Language. Like many "standards" in computer science, XML<br /> is a moving target. In the simplest terms, XML is just a generalized<br /> version of HTML. Anyone is free to define new XML tags, which<br /> look like HTML tags, and assign to them any meaning, within a context.<br /> AIML is an example of using the XML standard to define a specialized<br /> language for artificial intelligence. <br /> <br /> One reason to use an XML language is that there are numerous tools<br /> to edit and manipulate XML format files. Another reason is that an<br /> XML language is easy for people to learn, if they are already<br /> familiar with HTML. Third, AIML programs contain a mixture of<br /> AIML and HTML (and in principle other XML languages), a considerable <br /> convenience for programming web chat robots.<br /> <br /> A good resource for information on XML is www.oasis-open.org.<br /> <br /> </template> </category> <category> <pattern>WHERE DOES THE LT TOPIC GT TAG APPEAR</pattern> <template> <br /> Topic tags are placed around one or more categories. (Usually <br /> many.) The categories (with each respective "pattern", "that", <br /> and "template") within a set of <topic> </topic> tags would be <br /> associated with the defined topic. The name of the topic would be <br /> given by a "name" property in the beginning topic tag. Here would <br /> be the full AIML format with topic:<br /> <br /> <alice> <br /> <br /> <topic name="THE TOPIC"> <br /> <br /> <category> <br /> <pattern> phrase </pattern> <br /> <that> phrase </that> <br /> <template> phrase </template> <br /> </category> <br /> <br /> </topic> <br /> <br /> </alice><br /> <br /> </template> </category> <category> <pattern>WHO IS THE BOTMASTER</pattern> <template> <br /> The botmaster is you, the master of your chat robot. A botmaster runs<br /> program B and creates or modifies a chat robot with the program's<br /> graphical user interface (GUI). He or she is responsible for<br /> reading the dialogues, analyzing the responses, and creating new<br /> replies for the patterns detected by program B. Botmasters are<br /> hobbyists, webmasters, developers, advertisers, artists, publishers,<br /> editors, engineers, and anyone else interested in creating a personal <br /> chat robot.<br /> <br /> </template> </category> <category> <pattern>WHY IS THE FORMAT OF THE OPTIONS GLOBALS TXT SO STRANGE</pattern> <template> <br /> Depending on your system, you may see a globals.txt file that looks like:<br /> <br /> Animagent=true<br /> Botmaster=Jon Baer<br /> AnalysisFile=dialog.txt<br /> ClientLineContains=t:<br /> LogFile=dialog.txt<br /> CodeBase=D\:CHATTERBOTS\ALICE<br /> StartLine=0<br /> Beep=true<br /> BotFile=B.aiml<br /> AppletHost=206.184.206.210<br /> EndLine=25000<br /> BotName=ALICE<br /> Birthday=November 23, 1995<br /> TempFile=Temp.ai<br /> RobotLineStarts=Robot<br /> # ... and so on<br /> <br /> The global values seem to be stored in a random order.<br /> This is not a bug. The Globals class uses the Java methods<br /> Properties.load() and Properties.store() to save the globals<br /> to a file. You can also use # and ! to add comments to the file. <br /> <br /> The Properties class uses a hash table representation, so does<br /> not preserve the order of the global variables. The program<br /> displays and saves the global options in an arbitrary order.<br /> <br /> <br /> </template> </category> </aiml>