IJCNLP 2011 Workshop
13 November 2011, Chiang Mai, Thailand
The fifth workshop on Cross Lingual Information Access aims to bring together researchers from a variety of fields and practitioners from government and industry to address the issue of information need of multi-lingual societies. This workshop will also highlight the contributions of NLP and computational linguistic aspects to CLIA, in addition to the previously better represented viewpoint from Information Retrieval.
List of Accepted Papers
| Paper title | Authors |
| Using Explicit Semantic Analysis for Cross-Lingual Link Discovery | Petr Knoth, Lukas Zilka and Zdenek Zdrahal |
| Unsupervised Russian POS Tagging with Appropriate Context | Li Yang, Erik Peterson, John Chen, Yana Petrova and Rohini Srihari |
| Integrate Multilingual Web Search Results using Cross-Lingual Topic Models | Duo Ding |
| Soundex-based Translation Correction in Urdu--English Cross-Language Information Retrieval | Manaal Faruqui, Prasenjit Majumder and Sebastian Pado |
| A Cross Language Part-of-Speech Tagger (and other tools) for Kannada | Siva Reddy and Serge Sharoff |
| Extending a multilingual Lexical Resource by bootstrapping Named Entity Classification using Wikipedia's Category System | Johannes Knopp |
Workshop Program
TBA
Invited Talks
TBA
IMPORTANT DATES
- July 27, 2011 Papers due
- August 12, 2011 Notification of acceptance
- August 19, 2011 Camera-ready deadline
- November 13, 2011 Workshop
CALL FOR PAPERS
Cross lingual
information access (CLIA) is concerned with any technologies and applications
that enable people to freely access information that is expressed in
any languages. With the rapid development of globalization and digital
online information in Internet, huge demand for cross lingual information
access has emerged from ordinary netizens (polyglots or monoglots) who
are surfing the Internet for special information (e.g. travelling, product
description), and communicating in soaring social networks (e.g. Facebook,
Youtube, Twitter, Myspace), to global companies which provide multilingual
services to their multinational customers, and governments who aim to
lower the barriers to international commerce and collaboration, and
homeland security. This huge demand has triggered vigorous research
and development in CLIA.
This workshop is a continuous effort to address the need of cross-lingual information access
on top of its
previous four issues which were held during IJCAI 2007 in Hyderabad,
IJCNLP 2008 in Hyderabad, NAACL 2009 in Colorado, and COLING 2010 in
Beijing. It aims to bring together researchers from a variety of fields
such as information retrieval, computational linguistics, machine translation,
and digital library, and practitioners from government and industry
to address the issues of information need of multilingual society. We
also would like to promote and emphasize the potential contributions
of NLP and computational linguistic aspects to CLIA, in addition to
the previously better represented viewpoint from Information Retrieval.
Specifically, the key interests are in:
- Novel methods to mine knowledge from multilingual corpora using CL and/or NLP techniques.
- Techniques to customize machine translation in order to satisfy the special requirements of CLIA.
- Methods to adapt the existing CL/NLP approaches in information extraction, question answering, summarization, categorization, sentiment analysis, opinion extraction from a monolingual context to a multilingual context.
- Techniques for leveraging multilingual resources to improve the performance of monolingual information access.
- We thus solicit submissions in, but not limited to, the following topics:
General CLIA:
- Approaches to cross-lingual/ multilingual information access
- Domain specific multilingual information access
- Cross-lingual cross media search (speech, video, audio)
- Machine Learning for multilingual information access
- Scalability issues in multilingual information access/ system evaluation
- Web-scale cross-lingual search
- Interaction between cross-language information retrieval and machine translation
- Query translation and document translation
- Developing statistical machine translation on large-scale multilingual corpora
- Domain adaptation in machine translation
- Multilingual / Cross-lingual named entity recognition
- Multilingual summarization
- Multilingual information extraction
- Multilingual question answering
- Multilingual text categorization and clustering
- Multilingual opinion study and sentiment analysis
- Monolingual processing leveraging on multilingual resources
- CLIA user studies and user-centric evaluation
- CLIA system deployment on social networks (e.g., Facebook or Twitter) or handheld computers or smartphones (e.g., iPad or iPhone)
- Acquisition
of multilingual parallel/comparable/non-
comparable corpora - Multilingual document/sentence/word alignment
- Multilingual lexicon/term extraction
- Multilingual new words / named entity detection and translation
PAPER SUBMISSION
Authors are
invited to submit papers on original, unpublished work on the topics
of this workshop. Submissions should follow the IJCNLP 2011 length and
formatting requirements for long papers of eight (8) pages of contents
with two (2) additional pages of references, and for short papers of
two (4) pages of contents with two (2) additional pages of references.
A detailed abstract with two (2) pages to address your on-going work
is also welcome.Please use the following URL to submit your papers https://www.softconf.com/ijcnlp2011/CLIA5/
ORGANIZING COMMITTEE
- Asif Ekbal, IIT Patna, India (Co-chair)
- Deyi Xiong, Institute for InfoComm Research, Singapore (Co-chair)
- Prasenjit Majumder, DAIICT, India
- Mitesh Khapra, IIT Bombay
Program Committee Members (to be confirmed)
- Eneko Agirre (University of the Basque Country)
- Rafael Banchs
- Sivaji Bandyopadhyay (Jadavpur University)
- Pushpak Bhattacharya (IIT Bombay)
- Nicola Cancedda (Xerox Research Centre (Europe))
- Somnath Chandra (MIT, Govt. of India)
- Wenliang Chen (Institute for Infocomm Research)
- Patrick Saint Dizier(IRIT, Universite Paul Sabatier)
- Xiangyu Duan (Institute for Infocomm Research)
- Nicola Ferro (University of Padua)
- Cyril Goutte (National Research Council of Canada)
- Gareth Jones (Dublin City University)
- Joemon Jose (University of Glasgow)
- A Kumaran (Microsoft Research of India)
- Jun Lang (Institute for Infocomm Research)
- Swaran Lata (MIT, Govt. of India)
- Gina-Anne Levow (National Centre for Text Mining (UK))
- Qun Liu (Institute of Computing Technology, CAS)
- Yang Liu (Institute of Computing Technology, CAS)
- Mandar Mitra (ISI Kolkata)
- Doug Ouard (University of Maryland, College Park)
- Carol Peters (Istituto di Scienza e Tecnologie dell'Informazione and CLEF campaign)
- Paolo Rosso (Technical University of Valencia)
- Sudeshna Sarkar (IIT Kharagpur)
- Hendra Setiawan (University of Maryland)
- L Sobha(AU-KBC, Chennai)
- Rohini Srihari (University at Buffalo, SUNY)
- Ralf Steinberger (European Commission - Joint Research Centre (Italy))
- Le Sun (Institute of Software, CAS)
- Vasudeva Varma (IIIT Hyderabad)
- Thuy Vu (Institute for Infocomm Research)
- Haifeng Wang (Baidu)
- Yunqing Xia
- Min Zhang
- Guodong Zhou (SooChow University)
- Chengqing Zong (Institute of Automation, CAS)
- Raghavendra Udupa (Microsoft Research)