Share this page
Fifth International Workshop On Cross Lingual Information Access: Computational Linguistics and the Information Need of Multilingual Societies.

IJCNLP 2011 Workshop

13 November 2011, Chiang Mai, Thailand

The fifth workshop on Cross Lingual Information Access aims to bring together researchers from a variety of fields and practitioners from government and industry to address the issue of information need of multi-lingual societies. This workshop will also highlight the contributions of NLP and computational linguistic aspects to CLIA, in addition to the previously better represented viewpoint from Information Retrieval.

List of Accepted Papers

Paper title Authors
Using Explicit Semantic Analysis for Cross-Lingual Link Discovery Petr Knoth, Lukas Zilka and Zdenek Zdrahal
Unsupervised Russian POS Tagging with Appropriate Context Li Yang, Erik Peterson, John Chen, Yana Petrova and Rohini Srihari
Integrate Multilingual Web Search Results using Cross-Lingual Topic Models Duo Ding
Soundex-based Translation Correction in Urdu--English Cross-Language Information Retrieval Manaal Faruqui, Prasenjit Majumder and Sebastian Pado
A Cross Language Part-of-Speech Tagger (and other tools) for Kannada Siva Reddy and Serge Sharoff
Extending a multilingual Lexical Resource by bootstrapping Named Entity Classification using Wikipedia's Category System Johannes Knopp

Workshop Program

Invited Talks


    • July 27, 2011 Papers due
    • August 12, 2011 Notification of acceptance
    • August 19, 2011 Camera-ready deadline
    • November 13, 2011 Workshop


Cross lingual information access (CLIA) is concerned with any technologies and applications that enable people to freely access information that is expressed in any languages. With the rapid development of globalization and digital online information in Internet, huge demand for cross lingual information access has emerged from ordinary netizens (polyglots or monoglots) who are surfing the Internet for special information (e.g. travelling, product description), and communicating in soaring social networks (e.g. Facebook, Youtube, Twitter, Myspace), to global companies which provide multilingual services to their multinational customers, and governments who aim to lower the barriers to international commerce and collaboration, and homeland security. This huge demand has triggered vigorous research and development in CLIA.

This workshop is a continuous effort to address the need of cross-lingual information access

on top of its previous four issues which were held during IJCAI 2007 in Hyderabad, IJCNLP 2008 in Hyderabad, NAACL 2009 in Colorado, and COLING 2010 in Beijing. It aims to bring together researchers from a variety of fields such as information retrieval, computational linguistics, machine translation, and digital library, and practitioners from government and industry to address the issues of information need of multilingual society. We also would like to promote and emphasize the potential contributions of NLP and computational linguistic aspects to CLIA, in addition to the previously better represented viewpoint from Information Retrieval. Specifically, the key interests are in:

  1. Novel methods to mine knowledge from multilingual corpora using CL and/or NLP techniques.
  2. Techniques to customize machine translation in order to satisfy the special requirements of CLIA.
  3. Methods to adapt the existing CL/NLP approaches in information extraction, question answering, summarization, categorization, sentiment analysis, opinion extraction from a monolingual context to a multilingual context.
  4. Techniques for leveraging multilingual resources to improve the performance of monolingual information access.
  5. We thus solicit submissions in, but not limited to, the following topics:

General CLIA:

  • Approaches to cross-lingual/ multilingual information access
  • Domain specific multilingual information access
  • Cross-lingual cross media search (speech, video, audio)
  • Machine Learning for multilingual information access
  • Scalability issues in multilingual information access/ system evaluation
  • Web-scale cross-lingual search
Machine translation in CLIA:

  • Interaction between cross-language information retrieval and machine translation
  • Query translation and document translation
  • Developing statistical machine translation on large-scale multilingual corpora
  • Domain adaptation in machine translation
  • Multilingual / Cross-lingual named entity recognition

  • Multilingual summarization
  • Multilingual information extraction
  • Multilingual question answering
  • Multilingual text categorization and clustering
  • Multilingual opinion study and sentiment analysis
  • Monolingual processing leveraging on multilingual resources
CLIA evaluation and deployment:

  • CLIA user studies and user-centric evaluation
  • CLIA system deployment on social networks (e.g., Facebook or Twitter) or handheld computers or smartphones (e.g., iPad or iPhone)

Multilingual knowledge acquisition:

  • Acquisition of multilingual parallel/comparable/non-comparable corpora
  • Multilingual document/sentence/word alignment
  • Multilingual lexicon/term extraction
  • Multilingual new words / named entity detection and translation


Authors are invited to submit papers on original, unpublished work on the topics of this workshop. Submissions should follow the IJCNLP 2011 length and formatting requirements for long papers of eight (8) pages of contents with two (2) additional pages of references, and for short papers of two (4) pages of contents with two (2) additional pages of references. A detailed abstract with two (2) pages to address your on-going work is also welcome.Please use the following URL to submit your papers




Program Committee Members (to be confirmed)

  • Eneko Agirre (University of the Basque Country)
  • Rafael Banchs
  • Sivaji Bandyopadhyay (Jadavpur University)
  • Pushpak Bhattacharya (IIT Bombay)
  • Nicola Cancedda (Xerox Research Centre (Europe))
  • Somnath Chandra (MIT, Govt. of India)
  • Wenliang Chen (Institute for Infocomm Research)
  • Patrick Saint Dizier(IRIT, Universite Paul Sabatier)
  • Xiangyu Duan (Institute for Infocomm Research)
  • Nicola Ferro (University of Padua)
  • Cyril Goutte (National Research Council of Canada)
  • Gareth Jones (Dublin City University)
  • Joemon Jose (University of Glasgow)
  • A Kumaran (Microsoft Research of India)
  • Jun Lang (Institute for Infocomm Research)
  • Swaran Lata (MIT, Govt. of India)
  • Gina-Anne Levow (National Centre for Text Mining (UK))
  • Qun Liu (Institute of Computing Technology, CAS)
  • Yang Liu (Institute of Computing Technology, CAS)
  • Mandar Mitra (ISI Kolkata)
  • Doug Ouard (University of Maryland, College Park)
  • Carol Peters (Istituto di Scienza e Tecnologie dell'Informazione and CLEF campaign)
  • Paolo Rosso (Technical University of Valencia)
  • Sudeshna Sarkar (IIT Kharagpur)
  • Hendra Setiawan (University of Maryland)
  • L Sobha(AU-KBC, Chennai)
  • Rohini Srihari (University at Buffalo, SUNY)
  • Ralf Steinberger (European Commission - Joint Research Centre (Italy))
  • Le Sun (Institute of Software, CAS)
  • Vasudeva Varma (IIIT Hyderabad)
  • Thuy Vu (Institute for Infocomm Research)
  • Haifeng Wang (Baidu)
  • Yunqing Xia
  • Min Zhang
  • Guodong Zhou (SooChow University)
  • Chengqing Zong (Institute of Automation, CAS)
  • Raghavendra Udupa (Microsoft Research)