The future of dictionaries? Too soon to tell

Posted by on November 17, 2011

At the recent eLEX 2011 conference in Slovenia (for earlier posts, see here and here), the discussion focussed on the future of dictionaries – or, more broadly, on the various ways in which reference needs might be catered for in years to come. What often happens in this field is that people working in universities and research groups develop software tools or learning materials for their own (local) users, but their ideas and methods are then taken up by more mainstream providers. Serge Verlinde of Leuven University in Belgium gave one of the keynote speeches. Serge is a great example of someone who has done pioneering work over many years to develop online reference tools, in this case for learners of French. His ‘Base lexicale du français’ (BLF) not only supplies information about word meanings, grammar, and collocation, but also guides the user to make the right word choices when writing in French or translating from French to another language. The BLF also includes a ‘reading assistant’ (providing various kinds of help to enable you to understand a text), and a tool for helping users write a text is in development.

This theme of aids for writing (what are sometimes called ‘text remediation’ tools) appeared in several other talks, too. We heard from Magali Paquot about the Louvain English for Academic Purposes Dictionary (LEAD), a web-based resource designed to assist in the production of academic writing, and from another group about resources being developed in South Africa to aid ‘text production’ in a number of languages.

One of the features of resources like this is that they should be ‘dynamic’: that is, the system should learn from what users do, and what information they look for, and adapt itself as it goes along. Of course, much of this is still in the planning stage: we know what we want to do, but haven’t yet fully worked out how to do it. Part of the solution lies in having computational tools that can, for example, identify errors or unnatural features in a text supplied by a user. One research group from Barcelona showed how it had achieved a success rate of almost 90% in automatically detecting ‘bad’ collocations, and this kind of tool could form one component of a writing assistant that really worked. In the long term, devices like this could replace conventional dictionaries – at least for language production – because they would do one of the jobs dictionaries have traditionally done, but do it much better.

Another theme at the conference was ‘UGC’ (user-generated content), already such a big feature of the online world. News programmes, for example, routinely include information supplied by their viewers and listeners in the form of tweets, emails, or comments posted on their websites. In the world of reference, Wikipedia is the obvious example of a resource created entirely by its users, but the trend is spreading to dictionaries too. Wordnik has huge amounts of UGC, with numerous words added by members of the public, example sentences ‘harvested’ from the Twittersphere, and all sorts of lists created by the site’s users. A new translation tool being developed by the Russian company ABBYY will include a facility for users to contribute their own translations. And of course Macmillan has its Open Dictionary – an ever-expanding record of the most up-to-date uses of English around the world.

The conference provided a perfect snapshot of current activity and thinking in this exciting field. We may not be much closer to knowing how things will pan out over the next ten years (or even the next two). Developments in information technology, and in the skills, needs and expectations of its users, are all racing ahead at breakneck speed, so we can’t make predictions with any confidence. It brings to mind the famous remark made by Zhou Enlai, the Prime Minister of China till his death in 1976. When asked what he thought was the long-term significance of the French Revolution of 1789, he replied ‘It’s too soon to tell’.

Comments (9)
  • I consider we can work with both tools: dictionaries in books and with applications of dictionaries. For me both are important because if I don’t get a definition in one I could get in the other one, so that’s why I love this tools. Anyone is more or less.

    Posted by Santiago Henriquez on 18th November, 2011
  • Had fun going through the article and at the same time had a glance over the future dictionary. The dictionary would be much user friendly and is likely to co-op with us better. Research and analysis had always made things simpler with catchy features. And in this field too, we expect the same.Great job.Kudos!!!

    Posted by kalyan brata das on 21st November, 2011
  • [...] Letter to Client: On being ‘happy’ versus ‘satisfied’ Worst excuses to keep your rates low The future of dictionaries? Too soon to tell Working Words Well: Greatly Underappreciated Swearing to Make Your Point: A Tale of F**k and Sh*t 3 [...]

    Posted by Weekly favorites (Nov 21-27) | Adventures in Freelance Translation on 28th November, 2011
  • Hello Michael, this is a great article. I agree with Santiago that it is more comfortably to use both the dictionaries and the online applications of dictionaries. I think that in future we will have even more benefits from the online tools that we can imagine now. I just want to congratulate you for this article!!!

    Posted by Svetlin Simeonov on 28th November, 2011
  • [...] of Sentence Discovery. And while we are looking into the future, the folk at MacMillan reported on the future of dictionaries from the 2011 eLEX [...]

    Posted by Superlinguo – all things linguistic-y | Pineapple Donut on 29th November, 2011
  • Great article, thank you for such a concise round-up of themes at the conference. I think that UGC is also an interesting area, but right now it is very separate from traditional paper dictionaries and their online siblings, and it would be interesting to investigate the effect of UGC on official lexicography.

    I am planning to go to the next conference, so may see you there!

    Rachel Bryan (www.veritaslanguagesolutions.com)

    Posted by Rachel Bryan on 1st December, 2011
  • Thanks Rachel. May see you at the next ‘eLEX’ conference, then: it’ll probably be in Tallinn (Estonia) in 2013. I’d expect UGC to be further up the agenda by then. At the moment, its best-known exponent is the Urban Dictionary – which is a long way from any other kind of lexicography. But our own Open Dictionary is much closer, and there’s a lot of thinking going on about other ways of crowd-sourcing lexical data. May be a topic for the blog a bit later…

    Posted by Michael Rundell on 2nd December, 2011
  • [...] more broadly, on the various ways in which reference needs might be catered for in years to come. Via http://www.macmillandictionaryblog.com Discuss Blog · Curated · November 17, 2011 [...]

    Posted by The future of dictionaries? Too soon to tell | Content Rules, Inc. (formerly Oak Hill Corp) on 5th December, 2011
  • [...] The future of dictionaries? Too soon to tell | Macmillan. Like this:LikeBe the first to like this post. [...]

    Posted by The future of dictionaries? Too soon to tell | Macmillan « Your Green Bridge on 26th December, 2011
Leave a Comment
* Required Fields Notify me of follow-up comments via email