Wit

The WIT (Wikipedia-based Image Text) dataset is a large multimodal, multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Alternatives To Wit
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language
Polyglot | 2,212 | 6528 | 6 months ago | 9 | December 15, 2021 | 166 | other | Python
Multilingual text (NLP) processing toolkit
Hrconvert | 274 | 6 | 6 months ago | 8 | gpl-3.0 | PHP
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 86 file formats in 13 languages.
Imagescanocr | 16 | a year ago | 3 | mit | C#
Convert images and PDFs to text using Windows OCR
Mmid | 10 | 5 years ago | 1
Words and their images in 98 languages
Text Position Detector | 9 | 6 years ago | mit | Java
Detects rectangular regions containing multilingual text in an image.
Sulu Docker | 7 | 7 years ago | 1
Dockerized Sulu CMS (http://sulu.io/) (Multisite, multilingual CMS based on Symfony full stack and CMF (http://cmf.symfony.com/))
Docker Polyglot Base | 5 | 6 years ago | gpl-3.0
Alpine Linux-based image with Polyglot installed. Polyglot is a natural language pipeline that supports massive multilingual applications.

