World Edition

Search engines and search engine statistics worldwide

New directions in multilingual information access

  Jan 4, 2007

Via the SIGIR FORUM comes this 2006 SIGIR Report that examines current issues and the future of multilingual information access (MLIA) . The report does a critical review of The Quaero Project and similar initiatives. Six sessions and multilingual participants shared research free from unrealistic expectatives, wishful thinking or SEO hearsays.

New Directions in Multilingual Information Access

The keynowte speaker was David A. Evans, CEO of Clairvoyance Corporation (formerly professor of computer science and linguistics at Carnegie Mellon). Clairvoyance has substantial practical experience in multilingual applications for both Asian and European languages. The keynote position is summarized by this quote from its abstract:

“Despite the remarkable success of cross-language information retrieval (CLIR) and translingual information retrieval (TLIR) systems to perform on a par with monolingual IR systems in research and evaluation contexts, there has been relatively little commercial development (or success) of TLIR systems and applications. This is due, in part, to lack of demand in the marketplace, but also, in perhaps greater measure, to the special requirements that may be associated with TLIR applications - requirements that are not typically addressed (or assessed) in our research evaluations.”

In his presentation, David Evans presented a graph of current and projected worldwide demand for products for globalization, including multilingual search and software for machine translation. With a 5-year projected growth from 173 million US dollars (2004) to 263 million (2009), Dr Evans wryly observed

“These are not numbers which excite venture capitalists.”

Dr. Evans then described the failed attempts by his company and Japanese partners to introduce multilingual aspects into existing monolingual information management products.

His conclusions from these experiences were sobering:

1. In 2006, the market for multilingual globalization support is “not there yet”
2. Quality and scope of machine translation is a major gating factor
3. The demand for CLIR, per se, is low
4. To be successful today, CLIR Systems (already very complex) must be fashioned around “solutions” – integrated into systems that may need CLIR functionality only as a means to other ends
5. We must be (very) patient; or perhaps we should rethink our goals and refocus our applications”

In the discussion following the keynote, other participants from commercial firms offering similar products said that their organizations had reached the same conclusions.

MAIN THEMES EMERGING FROM THE WORKSHOP

From the workshop and the discussion by participants, a number of different themes emerged to both challenge researchers in this area and suggest new avenues of research and development:

1. Identification of the real world (commercial) use case? This was a main topic of the keynote paper and a running theme throughout the workshop.
2. The difficulty of technology transfer into existing applications (see the paper Braschler et al).
3. The need for new evaluation methodologies which evaluate the whole system including aspects relating to usage and not just system performance from the technical perspective
4. Following on from 3, the importance of replication of research results, implying a need for appropriate tools (see the paper on Data Curation)
5. Again related to 3, the need to study user behaviour - and to define reliable techniques for such studies (papers by He/Oard, and Clough et al)
6. The need to study the relationship between cross-language retrieval, machine translation and multilingual summarization (see presentations by Chin-Yew Lin & D.K.Evans))
7. The pressure to move studies from text to mixed media and “new” genres (see Ao Feng, Gey, Jones)
8. The need for digital content to be prepared with access in mind – and the need for markup. The potential of the semantic web for CLIR must be investigated (information processing and retrieval must move from words or features to concepts via markup)

Dr. E. Garcia

Visited 678 times, 3 so far today

Leave a Reply

Managing Editor

Regional Editors

Latest News

Partners

Useful Resources

Contributors

  • Multilingual Search is looking for new contributors! Contact us for more information.

Tag Cloud

.eu TLD .it 2007 AdSense Adwatch Aegis Alibaba Anti virus AUDIWEB Italy austria badoo Baidu banners italy barry Schwartz blogging blogosphere russia yandex Bloomburg Bulgaria China China mobile china web2.0 review Chinese internet cnnic comScore copyright law corporate banking credit cards Cyrillic+characters Dari Daum DGC Magazine doing business in asia Domain domain name statistics Domain name statistics europe domain names italy Domaining Domains e commerce e payments e shopping Ecommerce Ecommerce statistics Europe egovernment electronic payments email Email Marketing Italy Empas Europe events exposure Facebook Finam Finland youtube video+distribution Flickr France French+internet+statistics Minitel French+online+buyers FTSE 100 companies geographic location geolocation geotargeting Germany Getty Images global social networking global websites Google Google+adwords Google+Bulgaria Google+PR+disaster Google book project Google buys Russian search engine Google global presence google group Google Portugal Guge Hapoel Play65 Kiryat Shalom help home banking households HSPA HSPA+ ICANN Iceland IDN IDN Domains IM india Indian search engines Information Technology Statistics Italy international bloggong international hosting international search summit Internet internet access families europe internet access minors italy Internet Marketing internet statistics europe internet trademarks internet usage internet users Internet users Italy internet connections internet use internet users IPO Isobar Israel italia.it Italy japanese search engines Joop Dorresteijn Kaspersky Korea Latin America lawsuit link building major Italian portals marketing conference marketing driving microhoo microsoft+yahoo middle east mobile instant messaging mobile internet connections MoiKrug MTS multilingual+marketing multilingual blogging multilingual search multilingual seo. multilingual websites MySpace Nate national search engine Naver Netherlands neti nielsen Nielsen Online NetView number of internet users omgili Omniture online advertising online banking online expenditure Europe online marketing expenditure italy online travel statistics europe orli Payment Systems portugal PPC presentations protecting your trademarks online quality search engine results RBR RF RU Domain runet russia CIB CIB2008 conferences Russia Russian Russian Development Bank Sante J. Achille search Search+marketing+France search engine search engine marketing journal search engine rankings search engines italy search in brazil search marketing India search marketing publications search engine users search engines SEO SMX SMX search+marketing+Sweden soccer Social Networking sortfix South Korea Spain speakers sphinncon sponsored links italy stat statistics statistics internt access italy statistics mobile web browsing SU SU Domains Sweden techcrunch teleseker Terkmen TGI TLD TLDs Turkish search engines Turkmenistan Turkmentelekom UK ukraine UMTS United Kingdom us Uzbek video+sharing Vietnamese search engines Vkontakte Vladimir Dolgov walla Web 2.0 web marketing web marketing europe web marketing hotels web marketing italy web marketing papers web marketing travel web2sport Webmaster tools Webmaster. support WebMoney WebProNews World Statistics Worldwide Ya Online Yahoo Yahoo+go+Brazil Yahoo+Ireland Bebo+Ireland Yahoo+advertising+partners yahoo china Yandex Yandex Money Yandex Poste Youku youtube

Related News

Hot Topics Today

Hottest Topics

Subscribe to World Edition