tool,19-2-N01-1003,bq methodology for automatically training <term> SPoT </term> on the basis of <term> feedback </term>
other,24-2-N01-1003,bq training <term> SPoT </term> on the basis of <term> feedback </term> provided by <term> human judges </term>
other,20-5-N01-1003,bq , and then selects the top-ranked <term> plan </term> . The <term> SPR </term> uses <term> ranking
tech,1-6-N01-1003,bq top-ranked <term> plan </term> . The <term> SPR </term> uses <term> ranking rules </term> automatically
tech,5-7-N01-1003,bq </term> . We show that the trained <term> SPR </term> learns to select a <term> sentence
measure(ment),13-7-N01-1003,bq a <term> sentence plan </term> whose <term> rating </term> on average is only 5 % worse than