Distribution of NYT Crossword Difficulty (according to Rex Parker)
We try to do every day’s NYT crossword at work (2+ month streak!) and we check to see what crossword enthusiast, Rex Parker, has to say about each. We often think they’re are harder than he rates them, so I wanted to see how even Rex’s “Relative Difficulty” was distributed.
Turns out it’s not bad. I scraped his blog for the past 1695 days and (brutishly) lumped each score into buckets. My methodology is below:
- If the word “Easy” was in the difficulty, then add “1″ to the “Easy” column for that day; else blank
-
If the word “Medium” was in the difficulty, then add “2″ to the “Medium” column for that day; else blank
-
If the word “Challenging” was in the difficulty, then add “3″ to the “Challenging” column for that day; else blank
- The average of non-blank columns is calculated in the “avg” column
This makes the assumption that Rex is regular in his descriptions of difficulty (he is most of the time, but not all), and that all posts are made by Rex (most are, but not all), and that something like “Easy-Challenging” means “Medium”.
Next, I want to see the distribution over time and the distribution per day of the week.