Australian Computer Society, Canberra Branch Forum
Scaling up: the technology behind the NLA's newspaper digitisation and the Trove search service
Each week, the NLA's Trove search service receives around 200K "real" unique visitors and delivers almost 3M page views. Although Trove describes a diverse set of 250M items, its most popular content is digitised Australian newspaper articles.
This talk will describe the technology behind Trove in general, then focus on the technical aspects of digitising and full-text-indexing over 60M newspaper articles: image formats and processing, OCR correction, the scalability of the Lucene text indexing library, search result ranking, and how commodity SSDs have made the unthinkable both easy and cheap.
The presentation will encourage audience interaction and questions.
Kent Fitch has worked as a programmer for over 30 years. Since 1982 he has been a principal of the Canberra software development company Project Computing Pty Ltd. He has developed commercial systems, communications packages, and custom software for many clients. In the past ten years, his work has focused on library-related systems including AustLit, NLA Newspapers Digitisation, and Trove.
About this Event
Date: Tuesday 6th December 2011
Time: 4:45pm registration for 5:15pm start
CPD Hours offered: 2 hours
Who should attend:
All ACS members and non-members.
A light meal of quality hot and cold finger food and refreshments is also provided at the event.
To ENSURE you gain your professional development (PD) hours, please register online AND attend the event.
Online registration is required.
Event Prices (Inc GST)
Non Members: $40.00
Regular Fee - Guest:
Non Members: $60.00
A cancellation refund will only be given to paying guests provided that notice is sent to Jenalle Wei no later than 2 working days prior to the event.
Branch Events & Office Administrator
Australian Computer Society - Canberra
Tel: (02) 6230 1588 Fax: (02) 6230 0290
Monday, November 21, 2011
National Library of Australia Newspaper Digitalization Project
Kent Fitch will talk about Scaling up: the technology behind the NLA's newspaper digitisation and the Trove search service, at the Australian Computer Society, Canberra Branch Forum, 6 December 2011.