Jonathan Richards
Enter our Snapshots of Summer photography competition

A tool designed to deter fraudsters from registering fake e-mail accounts has been recruited to help digitise books and newspapers dating back hundreds of years.
Captchas are little boxes on web pages which show a squiggly set of letters and numbers that the user is required to transcribe correctly in order to register or enter the site.
They were devised eight years ago as a way of preventing computers from setting up e-mail accounts automatically which could then be used to send out spam, but a clever tweak means they are now being used to transcribe newspapers dating from the nineteenth century and earlier.
Instead of displaying a random collection of letters and numbers, the newly designed Captchas present the user with a word from an old manuscript that a computer, somewhere, is having trouble deciphering.
When three people type in the same word, the system deduces that this must be the one displayed on the manuscript, and relays this to the computer which has been stumped by the mystery word.
At present the system, which was devised by a 29-year-old assistant professor at Carnegie Mellon University, is partnered with the Internet Achive, a San Francisco-based not-for-profit organisation which is overseeing the digitisation of books at 70 public universities and libraries in the US - but it could conveivably be employed by any such project.
Luis von Ahn, who also created the original Captcha system, says that the new version - which is free to anyone who signs up and will be rolled out early next month - will be able to help digitise about 160 books a day.
(The original acronym stands for Completely Automated Public Turing Test test to tell Computer and Humans Apart, in honour of the pioneering British computer scientist Alan Turing.)
At present, mass digitisation projects such as the Internet Archive rely on humans to check over mistakes produced by a sophisticated scanning technlogy known as 'optical character recognition' (OCR).
OCR is used to translate the text on a scanned page into a more traditional digital format which can then be searched by users in a database, but the technology is not without its problems. For books published before 1900, for instance, the accuracy rate is estimated to be only 80 per cent
Mr von Ahn said he hoped the new system, called ReCaptcha, would enable his tool to be more productive.
"About 60 million Captchas are solved around the world every day - each taking roughly ten seconds," Mr Von Ahn said. "Individually that's not a lot of time, but in aggregate these puzzles consume more than 150,000 hours of work each day. What if we could make more use of this effort?"
About 45,000 sites - including Facebook and ITV - have begun employing ReCaptcha, Mr von Ahn said, but the number of participants is potentially limitless, and as more join, the speed at which books are being digitised will increase accordingly.
At present the number of sites involved means that the sections of a scanned book that are smudged, faded or otherwise unclear can be deciphered in an average of nine minutes. ReCaptcha also improved the accuracy of transcriptions to more than 99 per cent, Mr von Ahn said.
There are several current "mass digitisation" projects, including Google's, which is working with the Bodleian Library at Oxford, among others, and according to one estimate is digitising books at a rate of ten million per year.
A Google spokesman said the company used a mixture of its proprietary OCR software and humans as part of the digitisation process, but declined to say whether the project would partner with the ReCaptcha tool.
Win a luxury weekend to Newcastle and its neighbour Gateshead, find out more here
Risk, resilience and embracing new technology
Industry sectors news at a glance. Interactive heatmap, video and podcast
Discover the collective power of smart thinking. Submit a solution and be in with a chance to win a Flip MinoHD Camcorder
The inside track on current trends in the charity, not for profit and social enterprise sectors
Everything the Business Traveller needs to know to make a better trip
Make the most of the summer and enter our fabulous photographic competition, you could win a £5000 holiday
Corsica is an island of beauty and contrast, an ideal holiday destination
Enjoy further reading from Travel to Fashion, Business to Sport, discover more
Shortcuts to help you find sections and articles
The clever way to lease a new car is with Car leasing made simple™
2009
42,945
2008
71,450
Car Insurance
Not Specified
MI6
UK-based
£60,000
The Environment Agency
Bristol
Up to £90K
Boots
Midlands
OTE £85k
Credit Protection Association
Nationwide Opportunities
Completely London
Luxury Condo's in Manhattan with NYC views
The best new homes in Wimbledon?
Nationwide
Save up to £1,000 per couple with Elite Vacations at the five-star Constance Lemuria Resort
and do the British Isles this Summer.
Save up to 60% with Oxford Hotels and Inns
Try our inspiring luxury holidays to the Indian Subcontinent and South East Asia.
Great offers available
8 fabulous Canadian cities ...you won’t find cheaper
Contact our advertising team for advertising and sponsorship in Times Online, The Times and The Sunday Times, or place your advertisement.
Times Online Services: Dating | Jobs | Property Search | Used Cars | Holidays | Births, Marriages, Deaths | Subscriptions | E-paper
News International associated websites: Globrix Property Search | Property Finder | Milkround
Copyright 2009 Times Newspapers Ltd.
This service is provided on Times Newspapers' standard Terms and Conditions. Please read our Privacy Policy.To inquire about a licence to reproduce material from Times Online, The Times or The Sunday Times, click here.This website is published by a member of the News International Group. News International Limited, 1 Virginia St, London E98 1XY, is the holding company for the News International group and is registered in England No 81701. VAT number GB 243 8054 69.
The computer does know the captcha word. It distorts the word for presentation in the box. A human being can read it and type it back and the computer checks it against its memory.
Terence Mahoney., Debary,, FL, USA.
The captcha box shows two words, one is already known, the other is unknown. It only uses the known to verify that the user is human, but of course it doesn't say which is which. And, these captcha boxes have a built in audio backup for the benefit of the sight-challenged.
Al, East Lansing, USA
What I don't get is, if the computer doesn't know what the word is, how does it know if you typed in the captcha correctly?
bob, Dallas, USA
Great project. The only "downside" is for those who are sight impaired, thus we need a system that provides audio cues as well. We must ensure that websites are designed to include people of disability including those with sight impairments.
Mark Boyden, Austin, TX, US
Fascinating. I love it. This is what technology is for.
max843, Baltimore, US