3  The Dataset

The dataset we will be using for much of the following analysis is a set of responses to a 20-question multiple-choice test on sentence structure.

The test is designed to help teachers understand some of the most common mistakes seen in their pupils’ writing.

Here is a typical question.

Only one of the following sentences is correct. Select it.

  1. Was very sorry about the mistake.
  1. Gone away.
  1. Skipped down the road.
  1. We kicked the ball round the park.
  1. Jumped over the fence.
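
As a rough sketch of how a single item like this could be represented and scored for analysis, the snippet below models the question above. The class name `Question`, its fields, and the 0/1 scoring are assumptions made for illustration, not the format of the actual dataset. The fourth option is taken to be the correct one because it is the only one with a subject.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Question:
    """Hypothetical representation of one multiple-choice item."""
    prompt: str
    options: Tuple[str, ...]
    correct_index: int  # zero-based position of the correct option

    def score(self, chosen_index: int) -> int:
        """Return 1 if the chosen option is the correct one, else 0."""
        return int(chosen_index == self.correct_index)


example_question = Question(
    prompt="Only one of the following sentences is correct. Select it.",
    options=(
        "Was very sorry about the mistake.",
        "Gone away.",
        "Skipped down the road.",
        "We kicked the ball round the park.",
        "Jumped over the fence.",
    ),
    correct_index=3,  # assumed: the only option that is a complete sentence
)

# Scoring two illustrative choices (not taken from the real responses).
print(example_question.score(3))  # the complete sentence
print(example_question.score(0))  # a fragment with no subject
```

Under this assumed layout, a pupil's full set of responses would reduce to twenty 0/1 values, one per question.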

You can download the full test here.