Computer Generated Crossword Puzzles Posted Wed, 31 Oct 2007

There are two free daily newspapers in Boston. The Boston Metro and the Boston Now. Both run crossword puzzles. The Now runs a puzzle edited by Stanley Newman. The Metro's puzzle is unattributed. When my friend Seth Schoen was in town for several days, he did several crossword puzzles in the Metro. He pointed out to me that a clue in the crossword was repeated on two consecutive days. The crosswords in the Metro, he concluded, were computer generated.

I picked up the Metro each day for several weeks and, sure enough, there was a large amount of overlap in answers. "ALSO" and NIL" were answers three times in two weeks. More suggestive, however, were the clues. In all three instances of each repeated answer, the clues were the same. The clue for "ALSO" was always, "Part of a.k.a.," while the clue for "NIL" was "Zilch." Capitalization and punctuation, even for the uncapitalized "a.k.a.", was consistent. Despite the fact that there was some variation in clues, I found some answers with different clues on different days. The high degree of consistency was undeniable.

Unassisted by a computer, no human editor would use the same clue for puzzles two days in a row. Frequent reuse of clues makes puzzles too easy for regular players and slight variation in clues is easy for a human puzzle editor to do. But even if the puzzles had been written in a different order than they were run in the paper, it is unlikely that a puzzle maker would repeatedly have come up with the same clues. The chance of capitalization, phrasing, and style resulting in identical clue text is even more improbable. Humans simply aren't that consistent. Computers are. Through the reuse of the clues, a computerized provenance is revealed.

Perhaps a little ignorant, I'd always assumed that crosswords were human generated. In fact, computer generated crosswords are widespread. There have been published papers on computer generation of crosswords since the 1970s and a New York Times article on the subject was published in 1996 when the practice was beginning to take off. Computers are able to generate puzzles quickly and in quantity and, as a result, are in common use in magazines and on websites.

There's resistance, however, from both human crossword editors and from solvers who find computer generated puzzles unsatisfying. Great crossword puzzles, they argue, showcase wit and creativity with language; answers are often tied together by themes and wordplay. Computers excel at taking a database of answers and creating grids that match up correctly; they are much faster and more accurate than humans. But as the error that revealed the computer to my friend Seth illustrates, computers are less adept at varying when or how they employ answers and clues in puzzles.

Quoted in an article in Tulsa World, Mark Lagasse, senior executive editor with Dell Magazines, justified his magazine's choice to fund the more laborious human methods of crossword production saying, "with themes and the better, larger puzzles, it's best to have a constructor working them out and filling in the diagrams. A lot of the words are a bit more dry and boring when done with computers." Ultimately, he concludes, computer-generated puzzles simply are not as entertaining as those made by humans.

I did the crossword puzzles in both the Now and the Metro for a couple weeks and I agree with Lagasse. The human generated puzzles are less repetitive, more interesting, and ultimately more satisfying. The computer generated puzzles almost never use word play and have no thematic connections between answers or clues. Of course, I did both Metro and Now puzzles in the past and I always preferred the Now puzzles and found them more fun. But I would have been hard-pressed to justify my feelings. It was not until Seth pointed out the repeated clues, an error, that I was able to understand why I felt the way I did.

Posted by mako | Permalink | errors, games | View/Add Comments: 4
Only Yesterday Posted Thu, 25 Oct 2007

I only recently stumbled across this old revealing error in the wonderful Doh, The Humanity! weblog:

It may seem like only yesterday (Wednesday, 26 July) when...

In the days of newspapers and broadcast media, it was only likely for someone to read a news article on the day it was published. If the publication were weekly or monthly, it would be reasonable to expect readers got to it within the week or month. While libraries and others might keep archived versions, it was always clear to readers of archived material that their material -- and any relative dates mentioned therein -- were out of date.

Even today, news is still written primarily be consumed immediately and the vast majority of readers of an article will read it while it is fresh. But, websites have made archived material live on for months and years. While this is generally a good thing, it creates all sorts of problems for people who use relative dates in articles. The point of reference -- today -- becomes unstable. As a result, if an entertainment reporter describes a show as happening, "next Tuesday," it might appear to refer to any number of incorrect Tuesdays depending on when someone has stumbled across the archived version.

News companies have responded by converting relative dates into absolute ones. No doubt, this was often done by editors but today is also done by computer programs. These programs parse each news story looking for relative dates. When they find one, they compute the corresponding absolute date from the relative one, and add it into the text of the article in a parenthetical aside.

Most people, including myself, never knew or even imagined that articles were being parsed like this until the system screwed up as it did in the screenshot above. No human editor would have thought to provide an absolute date for "yesterday" in the phrase, "it may seem like only yesterday." With this misstep, the script at work is revealed. With the mistakes, the program's previous work -- hopefully more accurate and less noticeable in old articles -- becomes visible as well. Since seeing this image, I've noticed these date absolutefiers at work everywhere.

Posted by mako | Permalink | errors, news | View/Add Comments: 2
Technology-Assisted Deception Posted Thu, 18 Oct 2007

When I told a friend about this project, she (let's call her Alice) told me a story.

Alice lives a major city with fairly good cell coverage, except for a long tunnel that goes underground. Alice often gets calls from her mom. Alice likes her mom, of course, no problems there, but, as we all do, finds long phone calls with her a bit trying. So often when she's on the phone with her mom, Alice will say "Oops, sorry mom, got to go into the tunnel now!" Even when she's not going into the tunnel.

This is a lie that new technology makes possible. But it only makes it possible because there are known failures in the technology. If we only had old landline phones, you'd never be able to claim such a thing -- landline phones don't randomly stop working. And if we had perfect cell coverage, it similarly wouldn't make sense. But since the technology only half-works, the unfinished space can be used for very human forms of deception.

Welcome to Revealing Errors Posted Tue, 02 Oct 2007

Welcome to the Revealing Errors weblog. Our goal is to reveal errors that reveal the technology around us to learn how technology affects our lives.

Introduction

This introduction is adapted from the introduction to the Revealing Errors article published as the featured article in the Error issue (10.5) of the peer-reviewed Media/Culture Journal from the University of Melbourne. Please read the article for more in depth treatment of the subject.

In The World Is Not a Desktop, Marc Weisner, the principal scientist and manager of the computer science laboratory at Xerox PARC, stated that, “a good tool is an invisible tool.” Weisner cited eyeglasses as an ideal technology because with spectacles, he argued, “you look at the world, not the eyeglasses.” Through repetition, and by design, technologies blend into our lives. While technologies, and communications technologies in particular, have a powerful mediating impact, many of the most pervasive effects are taken for granted by most users. When technology works smoothly, its nature and effects are invisible. But technologies do not always work smoothly. A tiny fracture or a smudge on a lens renders glasses quite visible to the wearer.

Blue Screen of Death

The Microsoft Windows “Blue Screen of Death” on subway in Seoul.

Anyone who has seen a famous “Blue Screen of Death”—the iconic signal of a Microsoft Windows crash—on a public screen or terminal knows how errors can thrust the technical details of previously invisible systems into view. Nobody knows that their ATM runs Windows until the system crashes. Of course, the operating system chosen for a sign or bank machine has important implications for its users. Windows, or an alternative operating system, creates affordances and imposes limitations. Faced with a crashed ATM, a consumer might ask herself if, with its rampant viruses and security holes, she should really trust an ATM running Windows?

Technologies make previously impossible actions possible and many actions easier. In the process, they frame and constrain possible actions. They mediate. Communication technologies allow users to communicate in new ways but constrain communication in the process. In a very fundamental way, communication technologies define what their users can say, to whom they say it, and how they can say it—and what, to whom, and how they cannot.

Technology activists, like those at the Free Software Foundation (FSF) and the Electronic Frontier Foundation (EFF), understand the power, importance, and limitations of technology and technological mediation. Largely constituted by technical members, both organisations, like humanist scholars studying technology, have struggled to communicate their messages to a less-technical public. Before one can argue for the importance of individual control over who owns technology, as both FSF and EFF do, an audience must first appreciate the power and effect that their technology and its designers have. To understand the power that technology has on its users, users must first see the technology in question. Most users do not.

Errors are under-appreciated and under-utilised in their ability to reveal technology around us. By painting a picture of how certain technologies facilitate certain mistakes, one can better show how technology mediates. By revealing errors, scholars and activists can reveal previously invisible technologies and their effects more generally. Errors can reveal technology—and its power and can do so in ways that users of technologies confront daily and understand intimately.

About Us

This weblog is maintained by Benjamin Mako Hill, a free/open source technology activist, developer and consultant. This project is done as part of a Fellowship at the MIT Center for Future Civic Media.

Contributions, both in terms of suggestions or pointers to revealing errors or in the form of full articles is graciously accepted. Please email mako@atdot.cc with any such suggestions.