I first featured Justin B Rye at LH in 2005 (his Primer In SF Xenolinguistics); now, thanks to a comment by January First-of-May, I learn that he’s got a post called Fantastic Statistics in which he analyzes his extensive sf collection:
In the twenty‐first century I decided I didn’t want a paper collection anyway – what I want is a story collection. If I switched to ebooks then apart from a few sentimental‐value volumes of Teach Yourself Sumerian and the like the physical copies could go to the charity shop on the corner. This proved a fortunate idea given the number of times I’ve needed to move house recently, but electronic texts have other advantages too […]. For a start, I always convert my ebooks into a consistent HTML format so they’ll work in any browser (including my throwback of a mobile phone); but the part that got me writing this page is that once I’ve done that I can also carry out all sorts of basic text analysis from the command‐line. And thanks to all the old magazines that are out of copyright, it’s getting easier and easier to end up with a moderately comprehensive collection of the big‐name SF award‐winners of the twentieth century (and even quite a few of the ones I might actually want to re‐read). So here are some interesting facts, or at any rate facts, about my virtual bookshelf.
There are all sorts of tidbits, like Most‐Used Title (“the title that shows up most often is The End”), Famous Titles First Published Together (“the all‐time best value for money still has to be Dangerous Visions” — I still haven’t gotten over my copy, signed by many of the authors, getting lost in the mail years ago), Title Length (shortest is We, longest is the Connie Willis short story “‘The Soul Selects Her Own Society: Invasion and Repulsion: A Chronological Reinterpretation of Two of Emily Dickinson’s Poems: A Wellsian Perspective”), word frequencies, and the like. I will single out for special mention the section I was gladdest to see there, The Bechdel Test:
[Read more…]
Recent Comments