<p>I may be in the minority here, but I do think (1) that the overall point of the article is a good one and (2) that the question is very poorly worded.</p>
<p>The article tries to point out that the results of such a comparative test may not be as helpful in terms of assessment of strength of curriculum, teaching etc. as you may think. In other words, just because country X may have higher results than country Y on this test, it does not force one to conclude that country X has better math instruction, curriculum etc. than country Y. They try to give several reaons why the difference may not be due to curriculum etc. but to other factors, such as the sample etc. </p>
<p>I do not agree with all their arguments, but I suspect that not all biasing influences are controlled for, as the authors suggest. Therefore, I would agree with the authors that such a comparative test may not be as helpful as we would like it to be as an assessment of math instruction in country X. </p>
<p>With regards to the question itself, note that we don’t know what the students were doing wrong. Were they not able to approach the problem at all? Could they not add? </p>
<p>I am curious as to how many students answered (B) 52–the answer without the additional 25 added for the bow. It does take some math knowledge and reasoning here to get to at least this answer. If most students got this answer, it would tell me more than simply that most students got the answer wrong. I would then want to know why they didn’t add the additional 25.</p>
<p>If you look at the question, when you get to “…and have 25 cm left to tie a bow” you could interpret that part as “so that there is 25 cm left to tie a bow.” At least, that it what my initial read was. In this case, you may think that the point is to find out how much is needed simply to cover the box so you have enough left over for the ribbon. Here, the thinking process may be (1) get the least amount to cover the box so (2) there is enough left over for the bow, which requires 25cm. You may then simply find enough to cover the box.</p>
<p>Of course, I then asked myself why did they state 25cm, since that number didn’t seem to be involved in the calculation. Perhaps this was distractor information? I did then reread the question. But I had the luxury of not being involved in a test etc., so I was able to reason it out in a different way. </p>
<p>The question could be made a bit clearer, as in 'How much ribbon is needed for both the box and the bow?" I am curious if it was worded this way, would scores improve?</p>
<p>Now here is a valid point that could be made here to explain the differences between US and other countries. Suppose that the type of interpretation with ‘and’ as noted above is not available in other languages, or that the question was worded in such a way in that language that the interpretation that I first got was simply not available. It may be that this question is more difficult for US students because it leads to more than one interpretation, with some students (like me) first reading it one way, then trying to figure out the point of the 25cm, then wondering if there was something they were missing etc. Students in other countries may not have been faced with such a distractor interpretation, leading to more accurate scores. </p>
<p>In fact, this very point is made in the article itself, though not with regards to this question. </p>
<p>You may find my reasoning here ‘a stretch’ or ‘hilarious’ etc. But I did interpret the question as I did, and I did run into trouble because of this initial interpretation. I wonder how many of the US students taking the test did run into this problem. I also wonder how many foreign students had this problem of interpretation. I guess this is the overall point of the article.</p>