Audio player

Comments

111 - Typologically diverse, multi-lingual, information-seeking questions, with Jon Clark

NLP Highlights

Science & Medicine

We invited Jon Clark from Google to talk about TyDi QA, a new question answering dataset, for this episode. The dataset contains information seeking questions in 11 languages that are typologically diverse, i.e., they differ from each other in terms of key structural and functional features. The questions in TyDiQA are information-seeking, like those in Natural Questions, which we discussed in the previous episode. In addition, TyDiQA also has questions collected in multiple languages using independent crowdsourcing pipelines, as opposed to some other multilingual QA datasets like XQuAD and MLQA where English data is translated into other languages. The dataset and the leaderboard can be accessed at https://ai.google.com/research/tydiqa.


More episodes  


Listen to 120 - Evaluation of Text Generation, with Asli Celikyilmaz

120 - Evaluation of Text Generation, with Asli Celikyilmaz

Oct 2, 2020
Listen to 119 - Social NLP, with Diyi Yang

119 - Social NLP, with Diyi Yang

Sep 3, 2020
Listen to 118 - Coreference Resolution, with Marta Recasens

118 - Coreference Resolution, with Marta Recasens

Aug 26, 2020
Listen to 117 - Interpreting NLP Model Predictions, with Sameer Singh

117 - Interpreting NLP Model Predictions, with Sameer Singh

Aug 12, 2020
Listen to 116 - Grounded Language Understanding, with Yonatan Bisk

116 - Grounded Language Understanding, with Yonatan Bisk

Jul 2, 2020