lr,8-6-J05-1003,ak We apply the <term> boosting method </term> to parsing the <term> Wall Street Journal treebank </term> .
measure(ment),14-8-J05-1003,ak The new <term> model </term> achieved 89.75 % <term> F-measure </term> , a 13 % relative decrease in <term> F-measure error </term> over the <term> baseline model ’s </term> score of 88.2 % .
measure(ment),6-8-J05-1003,ak The new <term> model </term> achieved 89.75 % <term> F-measure </term> , a 13 % relative decrease in <term> F-measure error </term> over the <term> baseline model ’s </term> score of 88.2 % .
model,18-8-J05-1003,ak The new <term> model </term> achieved 89.75 % <term> F-measure </term> , a 13 % relative decrease in <term> F-measure error </term> over the <term> baseline model ’s </term> score of 88.2 % .
model,2-3-J05-1003,ak A second <term> model </term> then attempts to improve upon this initial <term> ranking </term> , using additional <term> features </term> of the <term> tree </term> as evidence .
model,2-8-J05-1003,ak The new <term> model </term> achieved 89.75 % <term> F-measure </term> , a 13 % relative decrease in <term> F-measure error </term> over the <term> baseline model ’s </term> score of 88.2 % .
model,25-11-J05-1003,ak We argue that the method is an appealing alternative — in terms of both simplicity and efficiency — to work on <term> feature selection methods </term> within <term> log-linear ( maximum-entropy ) models </term> .
model,34-7-J05-1003,ak The method combined the <term> log-likelihood under a baseline model </term> ( that of Collins [ 1999 ] ) with evidence from an additional 500,000 <term> features </term> over <term> parse trees </term> that were not included in the original <term> model </term> .
model,40-4-J05-1003,ak The strength of our approach is that it allows a <term> tree </term> to be represented as an arbitrary set of <term> features </term> , without concerns about how these <term> features </term> interact or overlap and without the need to define a <term> derivation </term> or a <term> generative model </term> which takes these <term> features </term> into account .
other,10-3-J05-1003,ak A second <term> model </term> then attempts to improve upon this initial <term> ranking </term> , using additional <term> features </term> of the <term> tree </term> as evidence .
other,10-4-J05-1003,ak The strength of our approach is that it allows a <term> tree </term> to be represented as an arbitrary set of <term> features </term> , without concerns about how these <term> features </term> interact or overlap and without the need to define a <term> derivation </term> or a <term> generative model </term> which takes these <term> features </term> into account .
other,11-2-J05-1003,ak The <term> base parser </term> produces a set of <term> candidate parses </term> for each <term> input sentence </term> , with associated <term> probabilities </term> that define an initial <term> ranking </term> of these <term> parses </term> .
other,14-3-J05-1003,ak A second <term> model </term> then attempts to improve upon this initial <term> ranking </term> , using additional <term> features </term> of the <term> tree </term> as evidence .
other,16-2-J05-1003,ak The <term> base parser </term> produces a set of <term> candidate parses </term> for each <term> input sentence </term> , with associated <term> probabilities </term> that define an initial <term> ranking </term> of these <term> parses </term> .
other,16-9-J05-1003,ak The article also introduces a new <term> algorithm </term> for the <term> boosting approach </term> which takes advantage of the <term> sparsity </term> of the <term> feature space </term> in the <term> parsing data </term> .
other,17-3-J05-1003,ak A second <term> model </term> then attempts to improve upon this initial <term> ranking </term> , using additional <term> features </term> of the <term> tree </term> as evidence .
other,19-4-J05-1003,ak The strength of our approach is that it allows a <term> tree </term> to be represented as an arbitrary set of <term> features </term> , without concerns about how these <term> features </term> interact or overlap and without the need to define a <term> derivation </term> or a <term> generative model </term> which takes these <term> features </term> into account .
other,19-9-J05-1003,ak The article also introduces a new <term> algorithm </term> for the <term> boosting approach </term> which takes advantage of the <term> sparsity </term> of the <term> feature space </term> in the <term> parsing data </term> .
other,21-2-J05-1003,ak The <term> base parser </term> produces a set of <term> candidate parses </term> for each <term> input sentence </term> , with associated <term> probabilities </term> that define an initial <term> ranking </term> of these <term> parses </term> .
other,23-7-J05-1003,ak The method combined the <term> log-likelihood under a baseline model </term> ( that of Collins [ 1999 ] ) with evidence from an additional 500,000 <term> features </term> over <term> parse trees </term> that were not included in the original <term> model </term> .
hide detail