Awesome Open Source
Search results for foundation models
73 search results found
Colossalai
⭐
37,814
Making large AI models cheaper, faster and more accessible
Unilm
⭐
16,971
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Llava
⭐
12,514
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Otter
⭐
3,322
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Next Gpt
⭐
2,602
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Ask Anything
⭐
2,404
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Superclue
⭐
2,286
SuperCLUE: A comprehensive benchmark for general-purpose Chinese large models | A Benchmark for Foundation Models in Chinese
Eva
⭐
1,430
EVA Series: Visual Representation Fantasies from BAAI
Autodistill
⭐
1,286
Images to inference with no labeling (use foundation models to train supervised models)
Emu
⭐
1,162
Emu Series: Generative Multimodal Models from BAAI
Alpaca_eval
⭐
899
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Meerkat
⭐
778
Creative interactive views of any dataset.
Torchxrayvision
⭐
760
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
Internvideo
⭐
736
InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)
One Peace
⭐
714
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Awesome Llm Powered Agent
⭐
643
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Awesome Japanese Llm
⭐
585
Overview of Japanese LLMs
Awesome Timeseries Spatiotemporal Lm Llm
⭐
541
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Fastervit
⭐
539
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Groundinglmm
⭐
434
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Knowledgeeditingpapers
⭐
423
Must-read Papers on Knowledge Editing for Large Language Models.
Hyena Dna
⭐
379
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
Awesome Foundation Models
⭐
366
A curated list of foundation models for vision and language tasks
Tokenize Anything
⭐
325
Tokenize Anything via Prompting
Gen Cv
⭐
315
Vision AI Solution Accelerator
Mindvideo
⭐
314
Official code base for MinD-Video
Fondant
⭐
293
Production-ready data processing made easy and shareable
Pointllm
⭐
276
[arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds
Awesome Segment Anything Extensions
⭐
255
Segment-anything related awesome extensions/projects/repos.
Uni3d
⭐
238
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
Clipa
⭐
231
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
Ponderv2
⭐
229
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
Medfm
⭐
209
Official Repository of NeurIPS 2023 - MedFM Challenge
Hls Foundation Os
⭐
190
This repository contains examples of fine-tuning the Harmonized Landsat and Sentinel-2 (HLS) Prithvi foundation model.
Stu Net
⭐
167
The largest pre-trained medical image segmentation model (1.4B parameters) based on the largest public dataset (>100k annotations), up until April 2023.
Mmmu
⭐
167
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Awesome Prompting On Vision Language Model
⭐
162
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.
Lrv Instruction
⭐
160
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Grid Playground
⭐
125
Platform for General Robot Intelligence Development
Emernerf
⭐
120
PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision
Intelligent App Workshop
⭐
114
Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamline backend workflows. Leverages Microsoft's Copilot stack, Semantic Kernel and Azure primitives to offer an engaging and comprehensive introduction to AI-infused app development and deployment
Voxposer
⭐
103
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Rs5m
⭐
103
RS5M: a large-scale vision language dataset for remote sensing
Pyxu
⭐
98
Modular and scalable computational imaging in Python with GPU/out-of-core computing.
Vip Llava
⭐
81
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Attackvlm
⭐
79
Code of the paper: On Evaluating Adversarial Robustness of Large Vision-Language Models
Blackvip
⭐
65
Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"
Microlens
⭐
63
A large recommendation dataset with raw text, audio, images, and videos provided (talk invited at DeepMind).
Generative Ai Sagemaker Cdk Demo
⭐
56
Deploy Generative AI models from Amazon SageMaker JumpStart using AWS CDK
Language Planner
⭐
53
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Robustness Foundation Models
⭐
45
This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.
Ssl Wearables
⭐
45
Self-supervised learning for wearables using the UK-Biobank (>700,000 person-days)
Flair
⭐
42
FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.
Femr
⭐
42
FEMR (Framework for Electronic Medical Records) provides tooling for large-scale, self-supervised learning using electronic health records
Awesome Multimodal Llm Autonomous Driving
⭐
35
Multimodal Large Language Models for Autonomous Driving [WACV 2024 Survey Paper]
Cs6101
⭐
32
The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
Llava Docker
⭐
32
Docker image for LLaVA: Large Language and Vision Assistant
Dpsda
⭐
31
[ICLR 2024] Generating DP Synthetic Data without Training
Guidance For Natural Language Queries Of Relational Databases On Aws
⭐
22
Demonstration of Natural Language Query (NLQ) of an Amazon RDS for PostgreSQL database, using SageMaker JumpStart Foundation Models, LangChain, Streamlit, and Chroma.
Foundation Models Reading Group
⭐
21
Information and materials for the Turing's Foundation Models reading group.
Promptgen
⭐
18
CLI for managing and generating Foundation Model prompts
Clouds
⭐
13
Official Implementation of Collaborating Foundation models for Domain Generalized Semantic Segmentation
Awesome Foundation Models For Weather And Climate
⭐
12
A comprehensive survey of foundation models for weather and climate data understanding.
Autovp
⭐
12
[ICLR24] AutoVP: An Automated Visual Prompting Framework and Benchmark
Awesome Foundation Models In Medical Imaging
⭐
9
A curated list of foundation models for vision and language tasks in medical imaging
Amazon Bedrock With Builder And Command Patterns
⭐
8
A simple yet powerful implementation in Java that lets developers write straightforward code to create the API requests for the different foundation models supported by Amazon Bedrock.
Mlc Assistant
⭐
7
Chat with your documents and improve your writing using large-language models within your browser.
Kernel Infonce
⭐
7
Official implementation of ICLR 2024 paper "Contrastive Learning Is Spectral Clustering On Similarity Graph" (https://arxiv.org/abs/2303.15103)
M Mae
⭐
7
Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)
Matrix Ssl
⭐
6
Official implementation of paper "Matrix Information Theory for Self-supervised Learning" (https://arxiv.org/abs/2305.17326)
Eden
⭐
6
Official PyTorch Implementation for Towards Foundation Models Learned from Anatomy in Medical Imaging via Self-Supervision
Surgicaldino
⭐
6
[IPCAI'2024] Surgical-DINO: Adapter Learning of Foundation Model for Depth Estimation in Endoscopic Surgery
Multi Temporal Crop Classification Baseline
⭐
5
Baseline model for crop type segmentation as part of the HLS FM downstream task evaluations
Copyright 2018-2024 Awesome Open Source. All rights reserved.