COLING 2012: Some facts and figures - IIT Bombay · COLING 2012: Some facts and figures Pushpak...
Transcript of COLING 2012: Some facts and figures - IIT Bombay · COLING 2012: Some facts and figures Pushpak...
COLING 2012: Some facts and figures
Pushpak BhattacharyyaCSE Dept., IIT Bombay
Organized by
CFILT, CSE Department, IIT Bombay, and
TDIL, DIT, Ministry of Communication and Information Technology
Under the umbrella of ICCL (International Committee on Computational Linguistics)
Committees Program Chairs: Prof. Martin Kay, Stanford
University and Prof. Christian Boitet, University Joseph Fourier, France
Organizing Chairs: Prof. Pushpak Bhattacharyya, IIT Bombay and Prof. Rajeev Sangal, IIIT Hyderabad
Workshop Chair: Prof. Laurent Besacier, University Joseph Fourier
Tutorial Chair: Prof. Sadao Kurohashi, Kyoto University, Japan
Publication Chair: Prof. Roger Evans, University of Brighton, UK
Date, Venue etc. Main Conference: 10-14 Dec, 2012 Pre-conference workshops and tutorials: 8, 9
Dec, 2012 Post-conference workshops: 15 Dec, 2012 Venue: VMCC and convocation hall, IIT
Bombay 1000+ submissions Conf URL: http://www.coling2012-iitb.org/
Our sponsors Diamond
TCS LDC-IL, CIIL, Mysore
Gold Microsoft Research Baidu
Silver Yahoo! IBM ezDi enago
COLING2012 PARTICIPATION
Total Registration: 763Total attendance: 612
Main Conference 606Tutorial 99Workshop 336
Number of accepted papers in various categories (as in the proceedings
Oral (173) Poster (138) Demo (66) Reserve (22)
Best paper award “Accurate Unbounded Dependency
Recovery using Generalized CategorialGrammars”, Luan Nguyen1, Marten Van Schijndel2 and William Schuler2
1: University of Minnesota, Dept. of Computer Science and Engineering, Minneapolis, MN
2: The Ohio State University, Dept. of Linguistics, Columbus, OH [email protected], [email protected],
Workshops (1/3) WS1: “Advances in discourse analysis and its computational
aspects”: Eva Hajicova WS2: “Second Workshop on Advances in Text Input Methods
(WTIM 2)”: Kalika Bali, Monojit Choudhury, Yoh Okuno WS3: “Eye-tracking and Natural Language Processing”: Michael
Carl, Pushpak Bhattacharya, Kamal Kumar Choudhary WS4: “3rd Workshop on South and Southeast Asian Natural
Language Processing (SANLP)”: Virach Sornlertlamvanich, AbbasMalik
WS5: “Cognitive Aspects of the Lexicon (CogALex-III)”: Michael Zock, Reinhard Rapp
WS6: “10th Workshop on Asian Language Resources”: RuvanWeerasinghe, Rachel Edita O. Roxas, Virach Sornlertlamvanich, Sarmad Hussain
Workshops (2/3) WS7: “2nd Workshop on Sentiment Analysis where AI meets
Psychology (SAAIP 2012)”: Sivaji Bandyopadhyay, Manabu Okumura
WS8: “Sixth Workshop on Analytics for Noisy Unstructured Text Data”: Lipika Dey, Daniel Lopresti, Christoph Ringlstetter, Shourya Roy, L. Venkata Subramaniam
WS9: “Information Extraction & Entity Analytics on Social Media Data”: Sriram Raghavan, Ganesh Ramakrishnan
WS10: “Machine Translation and Parsing in Indian Languages (MTPIL-2012)”: Radhika Mamidi, Ranjani Parthasarathi, SobhaLalitha Devi, Dipti Misra Sharma, Joseph van Genabith
WS11: “Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT-12 WS and Shared Task)”: Josef van Genabith, Toni Badia, Christian Federmann
Workshops (3/3) WS12: “Speech and Language Processing Tools in Education”:
Radhika Mamidi, Kishore Prahallad WS13: “Reordering for Statistical Machine Translation”: Karthik
Visweswariah, Ananthakrishnan Ramanathan, Mitesh M. Khapra WS14: “Question Answering for Complex Domains”: Nanda
Kambhatla Sachindra Joshi, Ganesh Ramakrishnan, Kiran Kate, Priyanka Agrawal
WS15: “First International Workshop on Optimization Techniques for Human Language Technology”: Pushpak Bhattacharyya, Asif Ekbal, Sriparna Saha, Mark Johnson, Diego Molla-Aliod
Tutorials T1: “Temporal Information Extraction and Shallow Temporal
Reasoning”: Dan Roth, Heng Ji, Taylor Cassidy, Quang Do T2: “Exploiting Web Data Sources for Advanced NLP”: Gerard de
Melo T3: “Multimodal Corpora”: Patrizia Paggio, Dirk Heylen,
Costanza Navarretta T4: “The Hindi/Urdu Treebank: New Frontiers in Hindi and Urdu
Natural Language Processing”: Owen Rambow, Dipti MisraSharma, Ashwini Vaidya
T5: “Open-domain Conversations with Humanoid Robots”: Kristiina Jokinen, Graham Wilcock
T6: “Revisiting Dimensionality Reduction Techniques for NLP”: Jagadeesh Jagarlamudi, Raghavendra Udupa
Tutorial Participation
Temporal Information Extraction and Shallow Temporal Reasoning
25%
Exploiting Web Data Sources for Advanced
NLP29%
Multimodal Corpora
10%
The Hindi/Urdu Treebank: New Frontiers in Hindi and
Urdu NLP9%
Open-domain Conversations with Humanoid Robots
12%
Revisiting Dimensionality Reduction Techniques for
NLP15%
Workshop ParticipationAdvances in discourse
analysis and its computational aspects
8%Second Workshop on
Advances in Text Input Methods
6%
Eye-tracking and Natural Language Processing
5%
3rd Workshop on South and Southeast Asian Natural
Language Processing (SANLP)
8%
Cognitive Aspects of the Lexicon (CogALex-III)
7%
10th Workshop on Asian Language Resources
5%
2nd Workshop on Sentiment Analysis where
AI meets Psychology (SAAIP 2012)
8%
Sixth Workshop on Analytics for Noisy
Unstructured Text Data9%
Information Extraction & Entity Analytics on Social
Media Data10%
Machine Translation and Parsing in Indian
Languages (MTPIL-2012)8%
Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid
MT 4%
Speech and Language Processing Tools in
Education6%
Reordering for Statistical Machine Translation
4%
Question Answering for Complex Domains
6%
First International Workshop on Optimization
Techniques for Human Language Technology
6%
COLING 2012 Participation by Nationality
USA+North America+ South
America, 82
Asia + Africa +AUS+Nz, 454
Europe, 217
India 251United States 70Japan 58China 53Germany 49France 36United Kingdom 24Ireland 17Australia 14Czech Republic 15Sweden 12Taiwan 12Hong Kong 11Canada 10Russia 9Spain 9Italy 8Korea, South 8Poland 6Singapore 6Denmark 5Hungary 5Croatia 4Portugal 4Vietnam 4Egypt 3Finland 3Iran 3Pakistan 3Others 41
Asia+Africa+Aus+Nz192
Europe67
US+Can+Brazil41
Participation by Nationality
Asia+Africa+Aus+Nz
Europe
US+Can+Brazil
(in comparison) IJCNLP2011 Participation by Nationality
T=300
Japan 61China 51United States 35India 28France 17Germany 14Thailand 12Taiwan 9Singapore 8United Kingdom 8Ireland 6Australia 5Canada 5Italy 4Korea 4Phillipines 4Netherlands 3Spain 3Iran 2Pakistan 2Sweden 2Austria 1Belgium 1Brazil 1Finland 1Ghana 1Greece 1Hungary 1Latvia 1Malaysia 1Morocco 1New Zealand 1Norway 1Portugal 1Qatar 1Saudi Arabia 1Swaziland 1Vietnam 1
Invited Talks “The adaptive brain: acquiring a complex cognitive
skill in complex contexts ”: Prof. Barbara Moser-Mercer, University of Geneva (10th Dec, Day 1)
“Digital Book, Digital Library, and Natural Language Processing”: Prof. Makato Nagao, Japan (11th Dec, Day 2)
“Minimum Description Length as the basis of Panini's grammar”: Prof. Paul Kiparsky, Stanford University (13th Dec, Day 4)
“NLP from Paninian Perspective”: Prof. Dipti MishraSharma, IIIT Hyderabad (14th Dec, Day 5)
(Day 3 was excursion day)
Social Events Reception: 10th evening (performances
by delegates and organizers) Banquet: 11th evening: Renaissance Excursion: 12th whole day: Karla and
Bhaja caves Indian classical music: 13th evening:
Prasad Padhye (tabla) and NayanGhosh (Sitar)
Indian language
technology5%
Underresourced languages5%
Morphology & POS tagging
6%Grammar and formalisms
2%
Parsing4%
Semantics5%
Discourse and pragmatics
4%
Coreference resolution3%
Ontologies and terminology3%
Textual entailment1%
Resources and annotation
7%
Psychological and neurological modelling
2%
Empirical machine
translation6%Deployment of NLP-based
applications, software integration & quality
5%
Expert or hybrid machine translation
3%
Hybrid man+machine architectures
1%
Information & content extraction
5%
Information retrieval4%
Named entity recognition3%
Natural language generation3%
Question answering3%
Sentiment and text classification
7%
Speech recognition and synthesis
2%
Summarization5%
Word sense disambiguation4%
Area wise representation of accepted papers
Area and Countrywisestatistics
Area wise statistics
France5%
Hong KongSpecial
administrative region of China 5%
India80%
Singapore5%Spain
5%
Indian Language Technology
Australia4%
China9%
Croatia4%
Denmark4%
Ethiopia4%
Hong Kong Special
Administrative Region of
China5%
India14%
Japan9%
Lithuania5%
New Zealand5%
Russian Federation
5%
Spain9%
Sweden9%
United States14%
Under resourced languages
Bulgaria4%
China17%
Germany4%
India17%
Japan9%
Lebanon4%
Poland4%
Romania4%
Singapore4%
Taiwan4%
United Arab Emirates
8%
United Kingdom
4%
United States17%
Morphology & POS tagging
Canada20%
China10%
Egypt10%
Germany10%
Hong KongSpecial
administrative region of China
10%
Hungary10%
Poland10%
Singapore10%
United States10%
Grammar and Formalisms
Australia11%
China28%
Croatia5%France
5%
Germany17%
Japan11%
Norway6%
Singapore6%
United States11%
Parsing
Canada4% France
4%
Germany19%
India5%
Israel5%
Japan5%
Netherlands5%
Singapore5%
Spain5%
United Kingdom
19%
United States24%
Semantics
Australia5%
China17%
Czech Republic5%
Estonia5%
Finland5%
France11%
Germany6%
Hong Kong Special
Administrative Region of
China6%
Japan6%
United Kingdom
6%
United States28%
Discourse and pragmatics
Belgium9%
China8%
Germany17%
India17%Italy
8%
Japan8%
Poland8%
United States25%
Coreference resolution
Canada7%
China15%
Ethiopia7%
France7%
India7%Ireland
7%Japan7%
Poland7%
Spain7%
United Kingdom
7%
United States22%
Ontologies and terminologies
China25%
Japan25%
United States50%
Textual entailment
Brazil4% China
7%Czech Republic
3%
France10%
Germany14%
Hong Kong Special
Administrative Region of
China3%
India3%
Israel3%
Italy10%Japan
7%
Poland3%
Spain3%
Sweden3%
Taiwan7%
United States20%
Resources and annotations
Australia12%
China13%
Japan25%
United States50%
Psychological and Neurological Modelling
China31%
France9%
Germany13% Iran (Islamic
Republic of)5%
Ireland4%
Macau4%
Qatar4%
Republic of Korea4%
Spain4%
United Kingdom
9%
United States13%
Empirical Machine Translation
Canada5%
China5%
France10%
Germany5%
India5%
Japan15%
Norway5%
Qatar5%
Republic of Korea10%
Romania5%
Russian Federation
5%
Taiwan15%
United Kingdom
5%
United States5%
Deployment of NLP applications
Belgium7%
China22%
Denmark7%
France22%Germany
7%
Greece7%
Japan14%
Taiwan7%
United States7%
Expert or hybrid machine translation
Czech Republic
34%
France33%
Iran (Islamic Republic of)
33%
Hybrid man+machine architectures & human factors
Australia10%
China10%
Germany15%
India5%
Iran (Islamic Republic of)
10%Ireland
5%
Italy5%
Japan5%
Romania5%
Singapore5%
Taiwan5%
Turkey5%
United States15%
Information & content extraction
China28%
France5%
Germany11%
Hong Kong 5%
India17%
Ireland11%
Japan5%
Pakistan6%
Spain6%
United States6%
Information retrieval
Canada9%
China9%
France9%
Germany28%India
9%
Turkey9%
United Arab Emirates
9%
United Kingdom
9%
United States9%
Named entity recognition
Argentina9%
Australia9%
Canada9%
France28%
Germany18%
United Kingdom
27%
Natural Language Generation
Australia8%
China34%
India8%
Italy8%
Japan17%
United Kingdom
8%
United States17%
Question answering
Canada3%
China3%
Denmark3%
Germany18%
India11%
Ireland3%
Italy4%
Japan4%
Portugal4%
Russian Federation
7%
Singapore4%
United Kingdom
7%
United States29%
Sentiment and text classification
India14%
Japan43%
Republic of Korea14%
United States29%
Speech recognition and synthesis
Canada9%
China48%
Greece5%
India9%
Japan9%
Singapore5%
Sweden5%
United States10%
Summarization
China6%
Czech Republic6%
France13%
Germany27%
India7%
Japan7%
Taiwan7%
United States27%
Word sense disambiguation
Message for students
Clear goal Clear Technique Clear evaluation Clear error analysis and path to improve
Message (2/2)
Guide us in straight path: holy Qur’an Lead us not unto temptation: holy Bible Asato ma sat gamaya: Upanishad
Thank you
Material given in the registration kit Bag Conference handbook USB containing
Main conference proceedings Workshop proceedings Tutorial notes Help desk (internet, map, eating joints etc) Excursion pamphlet
Delegate badges with coupons Pen Pad
Observations High Quality papers, tutorials and
workshops India’s presence: commendable
219 out of 251 registered 42 papers from India (main conf) out of 193 (Oral, Poster and Demo)