The corpus element
The corpus element is always the root element in a .corpus file. Its DTD-based definition is:
<!ELEMENT corpus (question|probe)*>
<!ATTLIST corpus
    description CDATA #IMPLIED
    image       CDATA #IMPLIED
    position    CDATA #IMPLIED
    size        CDATA #IMPLIED
    rank        CDATA #IMPLIED
    evalall     CDATA #IMPLIED>
description allows you to attach arbitrary documentation to the corpus.
image is the background image shown when the corpus is active. Image files are located within the /static directory.
position is the position (e.g. "200px 300px") of the background image.
size is the size (e.g. "1000px 700px") of the background image.
rank determines the priority of the corpus relative to other corpora in the list of corpora available for selection (e.g. a corpus with rank "01" will appear before a corpus with rank "05").
evalall, if true, lets users execute all the questions in the corpus with a single button click. This defaults to false.
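As an illustration of how these attributes combine, here is a minimal sketch of a corpus that sets all six. Every value is hypothetical: the file name geo-background.png is assumed, and the text above does not say whether the image value should include the /static prefix, so check an existing configuration before relying on this form.

```xml
<?xml version="1.0"?>
<!-- Hypothetical values throughout; geo-background.png is an assumed file name. -->
<corpus description="Sample geography questions"
        image="geo-background.png"
        position="200px 300px"
        size="1000px 700px"
        rank="01"
        evalall="true">
  <question text="give me the largest state"/>
</corpus>
```

Since every attribute is declared #IMPLIED, any of them can be omitted; a bare corpus element with no attributes is also valid under the DTD.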
Within the corpus element, each sub-element is either a question or a probe element.
Here is the simple corpus within the geo configuration.
<?xml version="1.0"?>
<corpus>
    <question text="give me the cities in california"/>
    <question text="what is the population of new york"/>
    <question text="give me the largest state"/>
    <question text="give me the states that border utah"/>
    <question text="smallest city in the largest state"/>
    <question text="3 tallest Mountains in alaska"/>
    <question text="3 tallest mountains on west coast"/>
    <probe text="names of states"/>
    <probe text="names of cities"/>
    <probe text="names of rivers"/>
    <probe text="names of mountains"/>
    <probe text="names of lakes"/>
    <probe text="numbers of roads"/>
    <probe text="my cities" answer="Ann Arbor,East Lansing,New York,Los Angeles,Berkeley,Austin"/>
</corpus>