Awesome Open Source
Search results for clips
239 search results found
Editly
⭐
4,303
Slick, declarative command line video editing & API
Pushdeer
⭐
4,285
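Open-source push notification service that needs no app: on iOS 14+, just scan a QR code to use it. Also supports Quick Apps, iOS and Mac clients, and Android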
Marqo
⭐
3,893
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Mmpretrain
⭐
3,177
OpenMMLab Pre-training Toolbox and Benchmark
Chinese Clip
⭐
2,816
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Zero_nlp
⭐
2,248
Chinese NLP solutions (large models, data, models, training, inference)
Clip Interrogator
⭐
2,181
Image to prompt with BLIP and CLIP
Clip Retrieval
⭐
1,949
Easily compute clip embeddings and build a clip retrieval system with them
Rwidgethelper
⭐
1,604
Rapid Android UI development that tames the quirks of native widgets
Awesome Openai Vision Api Experiments
⭐
1,483
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Bilix
⭐
1,433
⚡️Lightning-fast async download tool for bilibili and more
Vlm_survey
⭐
1,405
Vision-Language Models for Vision Tasks: A Survey
Hcaptcha Challenger
⭐
1,247
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
Natural Language Youtube Search
⭐
783
Search inside YouTube videos using natural language
Awesome Clip
⭐
782
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Aphantasia
⭐
757
CLIP + FFT/DWT/RGB = text to image/video
Natural Language Image Search
⭐
741
Search photos on Unsplash using natural language
Uform
⭐
729
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Stable Diffusion Ncnn
⭐
701
Stable Diffusion in NCNN with C++, supporting txt2img and img2img
Clip4clip
⭐
663
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Text2live
⭐
642
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
Declip
⭐
603
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Video Chatgpt
⭐
590
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Now
⭐
588
🧞 No-code tool for creating a neural search solution in minutes
React Truncate
⭐
549
React component for truncating multi-line spans and adding an ellipsis.
Keras_cv_attention_models
⭐
523
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,effi alias kecam
Transformer Mm Explainability
⭐
490
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
Zmjimageeditor
⭐
449
ZMJImageEditor is a WeChat-style image editing component. It is powerful and easy to integrate, supporting drawing, text, rotation, cropping, stickers, and other functions.
Openscene
⭐
443
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
Video_features
⭐
397
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
Skypaint Ai Diffusion
⭐
356
An optimized text-to-image model based on Stable Diffusion. It accepts both Chinese and English text prompts and can generate high-quality images in several modern art styles.
Clip.cpp
⭐
335
CLIP inference in plain C/C++ with no extra dependencies
Easyreveal
⭐
329
Android Easy Reveal Library
Cliport
⭐
297
CLIPort: What and Where Pathways for Robotic Manipulation
Autocut Client
⭐
274
AutoCut Client
Clipstyler
⭐
252
Official Pytorch implementation of "CLIPstyler: Image Style Transfer with a Single Text Condition" (CVPR 2022)
Passl
⭐
234
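PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as foundational vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PV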
Awesome Foundation And Multimodal Models
⭐
223
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code]
Disco_diffusion_local
⭐
213
Getting the latest versions of Disco Diffusion to work locally instead of on Colab, including how I run this on Windows despite some Linux-only dependencies ;)
Targetclip
⭐
208
[ECCV 2022] Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.
Ksymediaeditorkit_android
⭐
197
Android version of the short video editing SDK from Kingsoft Cloud (KSYUN), with fast compositing and effect filters such as shake, shockwave, and out-of-body. It makes it easy to capture, create, view, edit and share your clips and play them back anywhere
Fashion Clip
⭐
189
FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.
Clip Iqa
⭐
183
[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images
Gensim
⭐
177
GenSim: Generating Robotic Simulation Tasks via Large Language Models
Paddlemix
⭐
172
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Flutter Shapeofview
⭐
163
Give a custom shape to any flutter widget, Material Design 2 ready
Capdec
⭐
155
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
Clipspy
⭐
152
Python CFFI bindings for the 'C' Language Integrated Production System CLIPS
Vlmevalkit
⭐
137
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4v, Gemini, QwenVLPlus, 30+ HF models, and 15+ benchmarks
Clip Italian
⭐
133
CLIP (Contrastive Language–Image Pre-training) for Italian
Stylegan3 Clip Notebooks
⭐
132
A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for text-guided image generation.
Mimix
⭐
131
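Chinese text generation: classical poetry, verse, and couplet generation, song lyric generation, modern poetry generation, question augmentation, automatic summarization, classical Chinese translation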
Clip Image Sorter
⭐
128
Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP model and the web's new File System Access API)
Vodrecovery
⭐
126
The purpose of this script is to obtain videos or clips that are either marked as "sub-only" or have been deleted on Twitch.
Magic
⭐
124
Language Models Can See: Plugging Visual Controls in Text Generation
Ru Clip
⭐
122
CLIP implementation for Russian language
Clip Onnx
⭐
122
It is a simple library to speed up CLIP inference up to 3x (K80 GPU)
Segment Anything Clip
⭐
119
Connecting segment-anything's output masks with the CLIP model; Awesome-Segment-Anything-Works
Vand April Gan
⭐
113
[CVPR 2023 Workshop] VAND Challenge: 1st Place on Zero-shot AD and 4th Place on Few-shot AD
Scaling Laws Openclip
⭐
112
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
Picquery
⭐
108
🔍 Search local images with natural language on Android, powered by OpenAI's CLIP model.
Clip Caption Reward
⭐
104
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
Liqe
⭐
102
[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
Vision Language Models Are Bows
⭐
95
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Instruct2act
⭐
94
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Clip4cir
⭐
92
[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
Music Id
⭐
90
🎧 Automatically detect music running in Twitch streams. Effortlessly identifies music in real-time, making it easy for both streamers and viewers to discover new music while watching Twitch streams.
Vqgan Clip App
⭐
90
Local image generation using VQGAN-CLIP or CLIP guided diffusion
Experta
⭐
88
Expert Systems for Python
Motis
⭐
87
Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP). Accepted at NAACL 2022.
Prompt Align
⭐
87
[ICCV23] Prompt-aligned Gradient for Prompt Tuning
Poda
⭐
86
[ICCV 2023] Official implementation of "PØDA: Prompt-driven Zero-shot Domain Adaptation"
Must
⭐
85
PyTorch code for MUST
Yasd Discord Bot
⭐
83
Yet Another Stable Diffusion Discord Bot
Vip Llava
⭐
81
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Moleculestm
⭐
81
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-
Twitch Downloader
⭐
80
Download Twitch VODs and Clips
Speechclip
⭐
80
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
Videoclip
⭐
79
Easily create videoclips with mpv.
Promptdet
⭐
77
PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022
Diffusion Explainer
⭐
77
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion
Searle
⭐
76
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
Nadeshiko
⭐
76
A Linux tool to cut short videos with ffmpeg.
Transformers
⭐
75
Everything you need to know about Transformers! 🤖
Natural Language Joint Query Search
⭐
70
Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
Clipn
⭐
68
ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
Plip
⭐
67
Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI. PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text description. The model is a fine-tuned version of the original CLIP model.
Clip Imagesearch Ncnn
⭐
67
CLIP⚡NCNN⚡Natural-language image search⚡Search images by text⚡x86⚡Android
Cropbitmap
⭐
66
Image cropping
Clipfa
⭐
65
CLIPfa: Connecting Farsi Text and Images
Obs Alternative To Shadowplay
⭐
65
An OBS Studio Guide to Replace NVIDIA Shadowplay
Youtube Clips Automator
⭐
64
MARCELO: an AI powered bot to automate the editing and thumbnail creation for your Youtube clips channel
Gui Youtube Dl
⭐
64
A cross-platform GUI for youtube-dl written entirely in Python using the WX library.
Stale
⭐
63
[ECCV 2022] Official Pytorch Implementation of the paper: "Zero-Shot Temporal Action Detection via Vision-Language Prompting"
Dispict
⭐
62
Design a growing artistic exhibit of your own making, with semantic search powered by OpenAI CLIP
Beyond Inet
⭐
62
Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"
Fast_text2stylegan
⭐
62
Official repo of Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Clip Container
⭐
58
A containerized REST API around OpenAI's CLIP model.
Clip_surgery
⭐
55
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
React Image Process
⭐
54
🎨 An image processing component for React
1-100 of 239 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.