Billy S A king of infinite space
By James Dacey
Imagine this: a much-celebrated author locks himself away to begin work on his masterpiece, a novel called The Meta Book that will comprise an infinite number of words all strung together in the writer’s unique literary style. While this may sound like the plotline to a short-story by one of the great magical realist authors of Latin America, it is actually the idea of a trio of physicists in Sweden.
Sebastian Bernhardsson and his colleagues at Umeå University are interested in the unique “literary fingerprint” left by famous authors. They conceptualize a writer’s use of language as a complex system in the same way that scientists model the climate, the economy or ant colonies.
By feeding an author’s entire oeuvre into their calculations, they find that each writer creates a unique curve on a graph representing the number of different words used as a function of the total number of words. What’s more, this signature curve can be detected in every single work of a particular author regardless of what they are writing about.
Publishing their findings in New Journal of Physics the authors create curves for the works of Thomas Hardy, DH Lawrence and Herman Melville. “It is like everything an author can think of writing is processed by a mental pipeline which imposes a unique fingerprint on an authors’ infinite meta-book,” says Bernhardsson. I think, what he means by this is that (statistically speaking) there is a common thread running through everything these authors wrote — as if they were plucking extracts from their infinite corpus.
Now, the literary purists out there may be reading this and seething at yet another example of uncouth physicists trying to impose rigid mathematical frameworks onto works of unquantifiable beauty, or of “unweaving the rainbow” as Keats famously accused Newton. If anything, however, the results of this research reveal the opposite. For 75 years, language analysts have assumed that all literature, regardless of author, follows the same statistical pattern when viewed as a whole. This was based on the law proposed by American linguist George Kingsley Zipf stating that the frequency of a word is inversely proportional to its occurrence.
In this new view of fiction, however, each author defines their own unique law based on non-trivial mathematics. “It shows that, even statistically speaking, our personality is not drowned by the general rules, and structure of the language itself,” says Bernhardsson.
The researchers intend to develop their work by testing their meta book concept for more authors and languages other than English. So who knows — maybe the magical literary worlds of Borges and Márquez will be next in line to have their curves exposed.