WIT by Google AI

A Wikipedia-Based Image Text Dataset For Multimodal Multilingual Machine Learning
Product Information
Release date: 22 June, 2023
Platform: Desktop

WIT by Google AI Features

The WIT (Wikipedia-based Image Text) dataset is a large multimodal, multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

Motivation

Multimodal visio-linguistic models rely on rich datasets to learn to model the relationship between images and texts. Large image-text datasets can significantly improve performance, as recent work has shown. Furthermore, the lack of language coverage in existing datasets, which are mostly English-only, impedes research in the multilingual multimodal space. Google AI considers this a lost opportunity, given the potential of images, as a language-agnostic medium, to help improve multilingual textual understanding.

To address these challenges and advance research on multilingual, multimodal learning, Google AI created the Wikipedia-based Image Text (WIT) dataset. WIT is built by extracting the multiple different texts associated with an image from Wikipedia articles and Wikimedia image links, followed by rigorous filtering to retain only high-quality image-text sets. The resulting dataset contains over 37.6 million image-text sets, making WIT the largest publicly available multimodal dataset at the time of writing, with unparalleled multilingual coverage: 12K+ examples in each of 108 languages, and 100K+ image-text pairs in 53 of those languages.
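As a rough illustration of how the released data can be inspected, the following is a minimal Python sketch assuming one of the compressed TSV shards from the google-research-datasets/wit GitHub release has been downloaded locally. The shard file name is a placeholder, and the column names used below (language, image_url, page_title, caption_reference_description) should be checked against the field list published in that repository.

# Minimal sketch: inspect a locally downloaded WIT TSV shard.
# The file name is a placeholder; column names are assumptions
# based on the published WIT field descriptions.
import pandas as pd

SHARD_PATH = "wit_shard.tsv.gz"  # placeholder for a downloaded shard

# Each row pairs an image URL with one or more Wikipedia-derived texts.
df = pd.read_csv(SHARD_PATH, sep="\t", compression="gzip")

print(df.shape)
print(df["language"].value_counts().head(10))

# Example: rows with a reference caption in a given language.
subset = df[(df["language"] == "en") & df["caption_reference_description"].notna()]
print(subset[["image_url", "page_title", "caption_reference_description"]].head())

Because each image can be associated with several text fields drawn from different parts of a Wikipedia page, filtering on the specific text field of interest, as in the subset above, is usually the first step before building image-text training pairs.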
