
Speakers:
Improving Text-to-Image Multilingual Search at Toloka and Jina
Date:
Thursday, November 16, 2023
Time:
11:35 am
Summary:
In this talk, Evgeniya discusses how multilingual CLIP-style models are reshaping multimodal AI, particularly Text-to-Image search applications. She shares her experience and techniques for fine-tuning multilingual and monolingual CLIP models on non-English data, specifically German data gathered through crowdsourcing. She also demonstrates how to gather data in over 40 languages for fine-tuning such models. If you are interested in improving Text-to-Image search performance across multiple languages, this talk is for you!