Handling linguistic divergences in French-Japanese Machine Translation with Its-2

Speaker: Alexis Kauffmann

Date: Thursday, Dec 10, 2009

Time: 12:15pm – 1:00pm

Venue: Room 319, Battelle A

Abstract:

This presentation for graduate students at CUI will be dedicated to machine translation (MT) from French to Japanese and English to Japanese. As Japanese and French have very different syntactic structures, a challenge with MT between those two languages is to avoid generating “unnatural” literal translation, without making any grammar mistake. I will explain the way we try to handle this problem with our grammar-based multilingual translation system “Its-2″, focusing on the example of Japanese “adjectival clause” generation. I will show you some of our results, and compare them with the ones of other MT systems. In the end, I will give a little demonstration of the “Its-2″ MT system.

Evaluation of the ImageCLEF Wikipedia Benchmark 2008/2009

Postponed: New date to be announced

Speaker: Jana Kludas

Date: Thursday, Nov 26, 2009

Time: 12:15pm – 1:00pm

Venue: Room 319, Battelle A

Abstract:

Benchmarks and evaluation competitions have a long tradition in computer science. For 10 years now, ImageCLEF offers an evaluation platform for image retrieval systems in various domains such as medical retrieval, photo and robotics. The Wikipedia task started in 2008 and attracted a dozen participants each year. Its aim is to investigate ad hoc retrieval approaches in the context of a large and heterogeneous collection of images (similar to those encountered on the Web). Each image is associated with user-generated, unstructured text in English. The task encourages the use of multi modal approaches that investigate the combination of evidence from different modalities. The talk will present the Wikipedia task and collection for 2008/2009 as well as what we can learn from it.

TagCaptcha: Annotating images with CAPTCHAs

Speaker: Donn Morrison
Date: Thursday, Nov 12, 2009
Time: 12:15pm – 1:00pm
Venue: Room 319, Battelle A

Abstract:
For this graduate student seminar, I will give an overview of CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) technology and present a system of exploiting CAPTCHAs for annotating images. CAPTCHAs, well known for their utility in the fight against spam, have recently attained recognition as a unique way of channelling the collective, cognitive work of thousands of users onto difficult problems. I propose a novel CAPTCHA system, called TagCaptcha, that exploits the need for human verification on the Web in order to incrementally annotate an image database. I will also present the results from a small usability study and will give a live demonstration of the TagCaptcha system.
Demo: http://dolphin.unige.ch/tagcaptcha/

Presentation slides

ClusterRank: A Graph based Method for Meeting Summarization

Speaker: Nikhil Garg

Date: Thursday, Oct 29, 2009

Time: 12:15pm – 1:00pm

Venue: Room 319, Battelle A

Abstract:

In this talk, I will present an unsupervised, graph based approach for extractive summarization of meeting transcripts. Graph based methods such as TextRank have been used for sentence extraction in structured text like news articles. Text was modeled as a graph with sentences as nodes and edges based on word overlap. A sentence node was then ranked according to its similarity with the rest of the nodes. The spontaneous speech in meetings leads to incomplete, ill-formed sentences and high redundancy which calls for additional measures to extract relevant sentences. I will describe an extension of the TextRank algorithm that segments the meeting transcript into clusters and uses these clusters to construct the graph. The evaluation is on the AMI meeting corpus and the results show a significant improvement over TextRank and some other baseline methods.

Link to Nikhil’s presentation

Graduate Student Seminar Starts!

The student seminar finally starts this Thursday, Oct 29, 2009. On this page, you will be able to find details of the previous talks as well as announcements for the upcoming talks. As this is a blog, you will also be able to comment on individual posts.

Please get in touch with either Donn or me (Nikhil) if you have any questions or comments. As this is a student seminar, we are also looking for student volunteers who would like to give a talk.