Jonathan Richards
Download your 2 for 1 Pizza Express voucher

A tool designed to deter fraudsters from registering fake e-mail accounts has been recruited to help digitise books and newspapers dating back hundreds of years.
Captchas are little boxes on web pages which show a squiggly set of letters and numbers that the user is required to transcribe correctly in order to register or enter the site.
They were devised eight years ago as a way of preventing computers from setting up e-mail accounts automatically which could then be used to send out spam, but a clever tweak means they are now being used to transcribe newspapers dating from the nineteenth century and earlier.
Instead of displaying a random collection of letters and numbers, the newly designed Captchas present the user with a word from an old manuscript that a computer, somewhere, is having trouble deciphering.
When three people type in the same word, the system deduces that this must be the one displayed on the manuscript, and relays this to the computer which has been stumped by the mystery word.
At present the system, which was devised by a 29-year-old assistant professor at Carnegie Mellon University, is partnered with the Internet Achive, a San Francisco-based not-for-profit organisation which is overseeing the digitisation of books at 70 public universities and libraries in the US - but it could conveivably be employed by any such project.
Luis von Ahn, who also created the original Captcha system, says that the new version - which is free to anyone who signs up and will be rolled out early next month - will be able to help digitise about 160 books a day.
(The original acronym stands for Completely Automated Public Turing Test test to tell Computer and Humans Apart, in honour of the pioneering British computer scientist Alan Turing.)
At present, mass digitisation projects such as the Internet Archive rely on humans to check over mistakes produced by a sophisticated scanning technlogy known as 'optical character recognition' (OCR).
OCR is used to translate the text on a scanned page into a more traditional digital format which can then be searched by users in a database, but the technology is not without its problems. For books published before 1900, for instance, the accuracy rate is estimated to be only 80 per cent
Mr von Ahn said he hoped the new system, called ReCaptcha, would enable his tool to be more productive.
"About 60 million Captchas are solved around the world every day - each taking roughly ten seconds," Mr Von Ahn said. "Individually that's not a lot of time, but in aggregate these puzzles consume more than 150,000 hours of work each day. What if we could make more use of this effort?"
About 45,000 sites - including Facebook and ITV - have begun employing ReCaptcha, Mr von Ahn said, but the number of participants is potentially limitless, and as more join, the speed at which books are being digitised will increase accordingly.
At present the number of sites involved means that the sections of a scanned book that are smudged, faded or otherwise unclear can be deciphered in an average of nine minutes. ReCaptcha also improved the accuracy of transcriptions to more than 99 per cent, Mr von Ahn said.
There are several current "mass digitisation" projects, including Google's, which is working with the Bodleian Library at Oxford, among others, and according to one estimate is digitising books at a rate of ten million per year.
A Google spokesman said the company used a mixture of its proprietary OCR software and humans as part of the digitisation process, but declined to say whether the project would partner with the ReCaptcha tool.
Articles from our sister site:
Industry sectors news at a glance. Interactive heatmap, video and podcast
Get ready for the winter sports season, with our resort guides and snow reports
We are backing British business, what is the confidence of the nation and what businesses are succeeding?
Enjoy further reading from Travel to Fashion, Business to Sport, discover more
Shortcuts to help you find sections and articles
2006/06
£POA
Surrey
2009
£114,950
Derbyshire
The best policy at the
best price
Be Wiser Insurance
£POA
Surrey
Highly competitive six figure
Nationwide
Swindon
Competitive benefits package
Chartered Institute of Builders
Ascot
Competitive salary + benefits
NHS Direct
London
£125K
Meltwater News
Nationwide Positions
With Part Exchange Crest Nicholson could get you moving.
Award-winning riverside development, SW11.
Luxury apartments for sale from £350,000.
Find out more about our luxurious apartments and houses for sale in the heart of Sussex.
for sale in the French Alps
from E189,000.
We're offering extra savings on Voyager & Adventure of the seas Mediterranean Cruises fr £549.
Book by 28 Feb!
Includes 3* accommodation throughout, a 15 minute Apollo night helicopter flight down the Las Vegas strip and United Airlines flights from Heathrow.
Same break by air costs £189. Valid for weekend travel until 31 Aug 10.
Get covered on your travels with a superb range of policies at great prices
Visit InsureandGo.com
Family friendly villas with Quality Villas. Book with the specialists.
Contact our advertising team for advertising and sponsorship in Times Online, The Times and The Sunday Times, or place your advertisement.
Times Online Services: Dating | Jobs | Property Search | Used Cars | Holidays | Births, Marriages, Deaths | Subscriptions | E-paper
News International associated websites: Milkround
Copyright 2010 Times Newspapers Ltd.
This service is provided on Times Newspapers' standard Terms and Conditions. Please read our Privacy Policy.To inquire about a licence to reproduce material from Times Online, The Times or The Sunday Times, click here.This website is published by a member of the News International Group. News International Limited, 1 Virginia St, London E98 1XY, is the holding company for the News International group and is registered in England No 81701. VAT number GB 243 8054 69.