Awesome Open Source
Search results for natural language processing multimodal
12 search results found
Unilm
⭐
16,971
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Modelscope
⭐
5,517
ModelScope: bring the notion of Model-as-a-Service to life.
Courses
⭐
4,018
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Marqo
⭐
3,893
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Awesome Aigc Tutorials
⭐
2,879
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Chinese Clip
⭐
2,816
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Torchscale
⭐
2,804
Foundation Architecture for (M)LLMs
Deepke
⭐
2,679
An Open Toolkit for Knowledge Graph Extraction and Construction, published at EMNLP 2022 System Demonstrations.
Data Juicer
⭐
994
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Wit
⭐
896
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Nsmusics
⭐
601
NSMusicS (Nine Songs · Music World; 九歌 · 音乐世界): multi-platform, multi-mode super music software (full-stack development, audio processing, artificial intelligence, natural language processing)
Fastrag
⭐
591
Efficient Retrieval Augmentation and Generation Framework
Papermage
⭐
494
Library supporting NLP and CV research on scientific papers
Awesome Foundation And Multimodal Models
⭐
223
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code]
Agi Papers
⭐
218
Papers and Book to look at when starting AGI 📚
Fashion Clip
⭐
189
FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.
Multimodalstory Demo
⭐
178
FairyTailor: Multimodal Generative Framework for Storytelling
Mmmu
⭐
167
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Visual Chinese Llama Alpaca
⭐
129
Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)
Kb Ner
⭐
119
Winning system (DAMO-NLP) of the SemEval 2022 MultiCoNER shared task, placing first in 10 out of 13 tracks.
Mkg_analogy
⭐
56
Code and datasets for the ICLR 2023 paper "Multimodal Analogical Reasoning over Knowledge Graphs."
Glami 1m
⭐
47
The largest multilingual image-text classification dataset. It contains fashion products.
Vle
⭐
33
VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model)
Trymore Paperreading
⭐
25
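The TryMore study group (揣摩研习社) follows cutting-edge natural language and information retrieval technology, interprets popular technical papers, shares practical research tools, and digs into what lies beneath the artificial intelligence iceberg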
Inverse Dall E For Optical Character Recognition
⭐
24
Inverse DALL-E for Optical Character Recognition
Modality Transferable Mer
⭐
23
Modality-Transferable-MER, multimodal emotion recognition model with zero-shot and few-shot abilities.
Multinerd
⭐
20
Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguation)" (NAACL 2022).
Slp
⭐
20
Utils and modules for speech, language, and multimodal processing using PyTorch and PyTorch Lightning
Concatbert
⭐
17
Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and image representation obtained from VGG16 pretrained model.
Vlog_action_reason
⭐
17
Identifying reasons for human actions in lifestyle vlogs.
Item Alignment
⭐
10
CCKS 2022 Task 9, Subtask 2: identical product identification
Characterize Anything
⭐
10
Characterize Anything: A Wondrous Chemical Reaction between vision models and AI Characters
Whereisai
⭐
10
AI company, product, and tool collection.
Ikea Dataset
⭐
10
A dataset for multimodal machine translation
Multimodal Datasets
⭐
9
Multimodal datasets.
Icdar Emoreccom
⭐
6
Vse Probing
⭐
6
Code for the COLING 2020 paper "Probing Multimodal Embeddings for Linguistic Properties."
Spatial Reasoning
⭐
6
Grounding Language Models for Compositional and Spatial Reasoning
Taggpt
⭐
6
TagGPT: A simple ChatGPT based multimodal dialog generation engine that can "see/draw" and "hear/speak"
Feelingblue
⭐
6
FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023
Cbvs Uniclip
⭐
5
A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios
Copyright 2018-2024 Awesome Open Source. All rights reserved.